Skip to main content
Frontiers in Pharmacology logoLink to Frontiers in Pharmacology
. 2024 Apr 2;15:1375522. doi: 10.3389/fphar.2024.1375522

A comprehensive review of the recent advances on predicting drug-target affinity based on deep learning

Xin Zeng 1, Shu-Juan Li 2, Shuang-Qing Lv 3, Meng-Liang Wen 4, Yi Li 1,*
PMCID: PMC11019008  PMID: 38628639

Abstract

Accurate calculation of drug-target affinity (DTA) is crucial for various applications in the pharmaceutical industry, including drug screening, design, and repurposing. However, traditional machine learning methods for calculating DTA often lack accuracy, posing a significant challenge in accurately predicting DTA. Fortunately, deep learning has emerged as a promising approach in computational biology, leading to the development of various deep learning-based methods for DTA prediction. To support researchers in developing novel and highly precision methods, we have provided a comprehensive review of recent advances in predicting DTA using deep learning. We firstly conducted a statistical analysis of commonly used public datasets, providing essential information and introducing the used fields of these datasets. We further explored the common representations of sequences and structures of drugs and targets. These analyses served as the foundation for constructing DTA prediction methods based on deep learning. Next, we focused on explaining how deep learning models, such as Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), Transformer, and Graph Neural Networks (GNNs), were effectively employed in specific DTA prediction methods. We highlighted the unique advantages and applications of these models in the context of DTA prediction. Finally, we conducted a performance analysis of multiple state-of-the-art methods for predicting DTA based on deep learning. The comprehensive review aimed to help researchers understand the shortcomings and advantages of existing methods, and further develop high-precision DTA prediction tool to promote the development of drug discovery.

Keywords: deep learning, drug-target affinity, dataset, representation, methods

1 Introduction

Drug-target affinity (DTA) is a critical metric and the core of drug discovery. While the wet experiments have been used to calculate DTA with high accuracy, the time-consuming and laborious nature of these experiments can no longer meet the demands of modern drug screening, especially with the massive drug-target pairs. Fortunately, the emergence of computational methods for predicting DTA has accelerated the drug screening process, helping to shorten the drug development cycle and reduce the costs (Kairys et al., 2019; Abbasi et al., 2021; Xu et al., 2021; Zhang et al., 2023a).

At present, while there are non-machine learning methods available for computing DTA, such as FEP (Free-Energy Perturbation) (Jorgensen and Thomas, 2008) and MM/GBSA (or MM/PBSA) (Çınaroğlu and Timuçin, 2020), which can effectively estimate the binding free energy or affinity of drug-target, these methods not only demand a significant amount of computing resources, but also exhibit slow processing speeds when dealing with a large number of drug-target pairs. In contrast, data-driven machine learning methods offer fast processing speeds and high computational accuracy. The computational methods based on machine learning for predicting DTA can be classified into two categories: traditional machine learning methods and deep learning methods. Traditional machine learning methods employ linear regression, random forest regression, nearest neighbor regression, and support vector machine regression (Ballester and Mitchell, 2010; Li et al., 2015; Shar et al., 2016) to predict DTA. Although these methods perform well performance, they cannot automatically extract high-level hidden features from drugs and targets. With the emergence of deep learning models, DTA prediction methods based on deep learning (Öztürk et al., 2018; Wang et al., 2021a; Rube et al., 2022) can automatically extract high-level hidden features from the sequences and structures of drugs and targets, resulting in the improved performance compared to traditional machine learning methods. Except for a few methods that utilize deep learning to extract high-level features from target-target and drug-drug interaction networks (Dehghan et al., 2023; Rafiei et al., 2023), based on the different combinations of multiple modal features of drugs and targets, such as fingerprints, SMILES, two-dimensional molecular topology graphs, three-dimensional spatial structures, physicochemical properties, sequences, and contact maps, deep learning-based DTA prediction methods can be broadly divided into three categories: sequence-based, hybrid-based, and structure-based methods.

Sequence-based methods (Öztürk et al., 2018; Karimi et al., 2019; Wang et al., 2021a; Li et al., 2022b; Ghimire et al., 2022; Zhao et al., 2022; Gim et al., 2023; Jin et al., 2023; Kalemati et al., 2023; Ru et al., 2023; Zhou et al., 2024) aim to extract implicit sequence features from drug SMILES (Simplified Molecular Input Line Entry System) (Weininger, 1988) and target sequences using deep learning models. These methods leverage various sequence deep learning models such as Convolutional Neural Networks (CNNs) (LeCun et al., 2015), Recurrent Neural Networks (RNNs) (Zaremba et al., 2015), and Transformers (Vaswani et al., 2017). In the current sequence-based methods, 1D-CNN, RNNs, and Transformers are commonly used to extract high-level sequence features. On the other hand, 2D-CNN is employed to extract sequence features from a two-dimensional matrix composed of drugs or targets. For instance, DeepDTA (Öztürk et al., 2018) utilized a CNN module with three consecutive 1D convolutional layers to extract sequence features from drug SMILES and target sequences, respectively. SimCNN-DTA (Shim et al., 2021), on the other hand, employed 2D-CNN to predict DTA by utilizing the outer product between column vectors of two similar matrices representing drugs and targets. While CNNs effectively capture the local features from drug SMILES and target sequences, they may overlook long-range dependencies between atoms or amino acids. To address this issue, RNNs with memory functions can be utilized to extract long-range dependent features, as demonstrated in DeepAffinity (Karimi et al., 2019) and DeepCDA (Abbasi et al., 2020). However, CNNs and RNNs may not focus on the key features influencing drug-target interaction or provide interpretability for the model’s effectiveness. Some attention mechanisms (Vaswani et al., 2017) are employed to capture the key features (Zeng et al., 2021; Chen et al., 2022; Ghimire et al., 2022; Monteiro et al., 2022; Zhang et al., 2022; Zhao et al., 2022). For example, AttentionDTA (Zhao et al., 2022) utilized attention mechanisms to focus on subsequences within drug SMILES and target sequences that played a crucial role in affinity prediction. MRBDTA (Zhang et al., 2022) incorporated multi-head attention mechanisms, effectively capturing drug-target interaction sites and providing interpretational analysis for its effectiveness. CAPLS (Jin et al., 2023) employed the cross-attention mechanism to capture the mutual effect of protein-binding pocket and ligand. MT-DTA (Zhu et al., 2023b) built a variational autoencoders system with a cascade structure of attention model and CNNs to extract the implied high-level interactive features from target sequences and drug SMILES. Sequence-based methods have the advantage of easily obtaining the target sequences and drug SMILES data. These methods excel in processing sequence data swiftly, without demanding substantial computing resources, and exhibit a fine performance in predicting DTA based on the extracted high-level sequence features. Nevertheless, these methods overlook additional multimodal information related to targets and drugs, like topology graphs and 3D structures. It is important to note that structural information harbors crucial features that significantly influence DTA prediction. Disregarding this essential structural data may limit the accuracy, depth, and interpretability of understanding in predicting DTA. However, utilizing the structures of targets to enhance DTA faced limitations in the early states, as only a small portion of target sequences had known structures. Consequently, the exploration of hybrid-based methods emerged by incorporating the structural features of drugs into sequence-based approaches.

Hybrid-based methods (Karimi et al., 2021; Wang et al., 2021b; Zhang et al., 2021; Cheng et al., 2022; Li et al., 2022a; Lin et al., 2022a; Tian et al., 2022; Yang et al., 2022; Jiang et al., 2023; Pan et al., 2023; Wang et al., 2023a; Wang and Li, 2023; Xia et al., 2023; Yang et al., 2023; Zeng et al., 2023; Zhang et al., 2023b; Zhang et al., 2023a; Zhu et al., 2023a; Zhu et al., 2023c; Nguyen et al., 2022.) leverage deep learning models to extract sequence features from drug SMILES and target sequences, as well as the structural features from two-dimensional molecular topology graphs and three-dimensional structures of drug small molecules. These methods focus on integrating the structural features of drugs into sequence-based approaches. For the structures of drugs, tools like RDKit (Landrum, 2013) are commonly used to convert drug SMILES into the molecular graphs. GNN models (Xu et al., 2019) are employed for capturing the structural features of drugs. For instance, GraphDTA (Nguyen et al., 2021) utilized GCN and CNN to extract the structural features from drug molecular graphs and sequence features from target sequences, respectively. These extracted features were then combined as inputs and passed through fully connected layers to predict DTA. SAG-DTA (Zhang et al., 2021) incorporated a GCN with multiple self-attention graph pooling layers to extract the hidden features from drug molecular graphs. CNN was directly applied to the target sequences for learning high-level features. TDGraphDTA (Zhu et al., 2023c) introduced the transformer and diffusion to predict drug-target interactions using multi-scale information interaction and graph optimization. Hybrid-based methods combine the structural features of drugs with sequence-based approaches, enriching the features of drugs. Typically, GNN are employed to extract the drug structural features from molecular graphs converted from drug SMILES. These molecular graphs are relatively small and have minimal impact on the computational speed of the model. However, three-dimensional structural features of drugs are underutilized in hybrid-based methods. Furthermore, these methods completely overlook the structural features of target and make it difficult to provide explanatory analysis for the effectiveness of the model, leaving ample opportunity for performance enhancement. However, with the advent of AlphaFold (Jumper et al., 2021) and ColabFold (Kim et al., 2023), two target structural prediction tools, obtaining target structures has become less challenging. Consequently, there is a growing interest in methods that utilize the structures of drugs and targets for predicting DTA. Structure-based methods are gaining increased attention from researchers in this context.

Structure-based methods (Gomes et al., 2017; Stepniewska-Dziubinska et al., 2018; Zhang et al., 2019; Jiang et al., 2020; Seo et al., 2021; Shen et al., 2021; Lin et al., 2022b; Ma et al., 2022; Lu et al., 2023; Wu et al., 2024) employ deep learning models like GNN and 3D Convolutional Neural Network (3D-CNN) to extract implicit structural features from the molecular graphs of drugs and targets or the 3D structures of drug-target complexes. Using GNNs (Li et al., 2021; Yuan et al., 2021; Chu et al., 2022; Jiang et al., 2022; Liao et al., 2022; Pandey et al., 2022; Bi et al., 2023; Wang et al., 2023b; Zhang et al., 2023d; Zhang et al., 2023c; Ma et al., 2023; Mekni et al., 2023; Tsui et al., 2023; Tian et al., 2024), the molecular graphs of drugs and targets are fed into GNN to obtain the structural features. For example, PSG-BAR (Pandey et al., 2022) served as an example where a contact map was generated based on the 3D structure of target. Target graph was then constructed using the contact map, and the structural features were extracted using RGAT. For drug, the graph was generated based on its structural file, and RGAT was also employed to extract high-level features. AttentionMGT-DTA (Wu et al., 2024) represented drugs and targets by a molecular graph and binding pocket graph, respectively. Graph transformer module was utilized to extract the structural features of drugs and binding pockets. WGNN-DTA (Jiang et al., 2022) constructed protein and molecular graphs through sequence and SMILES that can effectively reflect their structures. Weighted graph neural networks were used to extract the structural features of molecules and proteins for predicting DTA. On the other hand, 3D-CNN-based methods (Zheng et al., 2019; Kwon et al., 2020; Liu et al., 2021; Wang et al., 2022) directly take the 3D structure of drug-target complex as input and use the extracted spatial features of complex as input for the FC network to predict DTA. For example, AK-Score (Kwon et al., 2020) employed the ensemble of multiple independently trained networks that consisted of multiple channels of 3D-CNN layers to predict the binding affinity of a complex. Sfcnn (Wang et al., 2022) converted drug-target complex into 3D grids for CNN training to extract the structural features. Structure-based methods offer effective utilization of the structural features of drugs and targets, yielding impressive performance. They are especially valuable for providing explanatory analyses that shed light on the model’s effectiveness, thereby facilitating research into DTA prediction methods and promoting wider application of these models. However, it is important to acknowledge some limitations. One such limitation is the reliance on tools like AlphaFold to obtain the target structures. While AlphaFold has shown higher accuracy in predicting the structures of monomeric proteins, its performance in predicting the structures of other proteins still requires optimization. Additionally, structure-based methods extract structural features from protein structure graphs, which can be computationally demanding and result in slower processing speeds.

In this review, we aimed to highlight the crucial significance of precise DTA prediction, followed by a comprehensive overview of the universal datasets and widely used representation methods for the sequences, structures, and complexes of drugs and targets. We then focused on the widespread application of popular deep learning techniques in DTA prediction. Our goal was to provide a comprehensive overview of datasets, representation, methods, and deep learning techniques for predicting DTA. By doing so, we intend to empower researchers to effectively utilize these resources in developing innovative DTA prediction methods, thereby providing essential support for drug discovery, design, and repurposing endeavors. The main contributions of this review can be summarized as follows:

  • (1) A comprehensive statistical analysis has been carried out on datasets, representations, model architectures, and performance evaluation of state-of-the-art methods based on deep learning for predicting DTA.

  • (2) Elaboration on the extraction process of crucial implicit features from diverse modalities of drugs and targets using cutting-edge deep learning technologies like CNN, RNN, GNN, and Transformer.

  • (3) An in-depth analysis of the strengths and limitations of advanced deep learning methods for predicting DTA is conducted from three perspectives: sequence, hybrid, and structure. This analysis serves as a foundation for researchers to develop novel and more accurate tools for DTA prediction.

2 Dataset

A high-quality dataset of drug-target binding affinity serves as the fundamental basis for the development of computational methods that leverage deep learning for predicting DTA. Currently, the most widely used datasets for DTA prediction include PDBbind (multiple versions) (Wang et al., 2005), Davis (Davis et al., 2011), KIBA (Tang et al., 2014), BindingDB (Liu et al., 2007), and Metz (Metz et al., 2011). To supplement these universal affinity datasets, UniRef (Suzek et al., 2015), UniProt (The et al., 2021), Protein Data Bank (PDB) (Berman, 2000), STITCH (Kuhn et al., 2007), and ZINC (Irwin and Shoichet, 2006) can provide additional sequences and structures for drugs and targets that may be missing.

2.1 Statistical analysis of commonly used datasets for DTA prediction

We performed a comprehensive statistical analysis on the datasets utilized in nearly 80 references on drug-target affinity cited in this review to assess their usage. The results of our analysis, as depicted in Figure 1, revealed that PDBbind, Davis, KIBA, BindingDB, and Metz were the five most frequently employed datasets. Among these, PDBbind and BindingDB were primarily utilized for deep learning methods based on hybrid or structure. These two datasets offer comprehensive sequence and structural data for drugs and targets. On the other hand, Davis, KIBA, and Metz were predominantly employed for sequence-based deep learning methods, although some hybrid or structure-based deep learning methods also utilized them. It is worth mentioning that the structures of targets in Davis, KIBA, and Metz were sourced from the PDB database.

FIGURE 1.

FIGURE 1

Statistics on the usage of the datasets for predicting DTA based on deep learning.

2.2 Introduction to commonly used datasets for DTA prediction

In this section, we provide a detailed introduction to the five most frequently used datasets: PDBbind, Davis, KIBA, BindingDB, and Metz. In addition, we will also introduce the ToxCast dataset (Feng et al., 2019), which is commonly used in multi-task prediction methods.

2.2.1 PDBbind

PDBbind dataset comprises four commonly used versions: 2013, 2016, 2018, and 2020. Each version of the dataset consists of two distinct parts: the general set and the refined set (Table 1). To illustrate, let’s consider the PDBbind dataset (version 2016), which includes a total of 13,283 samples: 9,226 samples for the general set and 4,057 samples for the refined set. The refined set is obtained based on the quality protocols, including measured resolution and experimental precision. This process ensures the exclusion of ligands, ternary complexes, or steric hindrance complexes with resolutions above 2.5 Å, R factors exceeding 0.25, instances of covalent bonding, and complexes lacking reported binding affinities in terms of Kd (dissociation constant) or Ki (inhibitor constant), or falling outside the necessary range (Kd < 1 pM). Each sample represents a drug-target pair labeled with affinity value known as the dissociation constant (Kd). Notably, each sample provides drug SMILES and target sequence, as well as the 3D structure of target and pocket information related to drug-target binding. Thanks to the extensive sequence and structural information available for drugs, targets, and pockets in PDBbind dataset, it has become widely recognized as a universal dataset for predicting DTA in sequence, hybrid, and structure-based deep learning methods. Furthermore, it is worth mentioning that the CASF series datasets used to test the performance of models in certain studies (Stepniewska-Dziubinska et al., 2018; Wang et al., 2022), such as CASF-2013 (Li et al., 2014), CASF-2016 (Su et al., 2019), are the core sets derived from the corresponding refined sets of PDBbind datasets.

TABLE 1.

Statistic of commonly used PDBbind dataset with different versions.

Version Total number of samples General set Refined set
2013 11,184 8,225 2,959
2016 13,283 9,226 4,057
2018 16,126 11,663 4,463
2020 19,443 14,127 5,316

2.2.2 Davis and KIBA

Davis dataset (Table 2) comprises 68 compounds and 442 proteins, generating a total of 30,056 compound-protein affinity samples, each labeled with the dissociation constant (Kd). It is worth noting that all drug-target pairs that can not be experimentally measured for bioactivity are assigned a bioactivity value of 10 μM (corresponding to a pK d of 5) in Davis dataset. But the number of data points within this range is very large. Consequently, some methods have chosen to remove the data points with a bioactivity value of 10 μM from the Davis dataset, thereby creating what is known as the Filtered Davis dataset (Rifaioglu et al., 2021). KIBA dataset (Table 2) includes 246,088 interaction pairs of samples derived from 467 proteins and 52,498 compounds. Notably, KIBA contains three types of labels: inhibition concentration 50 (IC50), dissociation constant (Kd), and inhibition constant (Ki). Due to their focus on only providing the sequences of drugs and targets, Davis and KIBA are predominantly utilized in sequence-based deep learning methods. Nonetheless, a few hybrid or structure-based approaches have also been successfully employed using these datasets. It is important to note that while Davis and KIBA do not include the 3D structures of targets and drugs, they can be accessed by downloading them from the PDB and ZINC databases, respectively.

TABLE 2.

Detailed information on datasets Davis, Filtered Davis, and KIBA.

Dataset Compounds Proteins Total number of samples
Davis 68 442 30,056
Filtered Davis 68 379 9,125
KIBA 52,498 467 246,088

2.2.3 BindingDB

BindingDB dataset is primarily composed of drug-target pair samples sourced from some scientific literatures, encompassing four different types of affinity labels: IC50, Kd, Ki, and EC50 (median effect concentration). Table 3 displays the number of drugs, targets, and drug-target interaction pairs in each label category. Notably, BindingDB provides drug SMILES and target sequences, while their structures can be obtained from PDB and ZINC databases, respectively. This comprehensive information enables BindingDB to be widely utilized in sequence, hybrid, and structure-based methods, typically to evaluate the generalization performance of DTA prediction methods.

TABLE 3.

Details of BindingDB dataset.

Dataset Label Drugs Targets Total number of samples
BindingDB IC50 265,627 2,793 376,751
Kd 5,895 812 12,589
Ki 93,437 1,619 144,525
EC50 31,970 513 37,896

2.2.4 Metz

Metz dataset comprises 1,423 drugs and 170 targets, resulting in a total of 35,259 drug-target pairs. Each pair is labeled with an affinity value represented by Ki (in the form of pK i value). Furthermore, the relationship between drugs and targets can be accessed from the STITCH database, which consolidates diverse chemical and protein networks.

2.2.5 ToxCast

Toxcast is a toxicology research dataset derived from high-throughput in vitro screening of chemicals, primarily measuring AC50, which represents the concentration at half of the maximum activity. This dataset has a large scale, covering different types of proteins, and contains qualitative results from more than 600 experiments involving over 8,000 compounds. With around 530,000 observations of drug-target pairs and over 600 labels, it is well-suited for multi-task prediction. Its subsets are frequently utilized for case studies or generalization performance testing of DTA methods.

2.3 Introduction to supplementary used databases for DTA prediction

2.3.1 Uniprot

Uniprot database (The et al., 2021) is a sequence database designed specifically for proteins that contains approximately 220 million protein sequences and related annotation information on the biological functions of proteins. It has the ability to add new protein entries, as well as supplement and update publicly available annotation information, and is widely regarded as the protein database with the most extensive collection and comprehensive annotation information.

2.3.2 PDB

PDB database (Berman, 2000) is the premier collection of 3D structures for biological macromolecules, such as proteins, nucleic acids, etc., which contains the 3D structures of all resolved proteins. In addition to annotating the 3D structural information of proteins, PDB also provides various file types for downloading and visualizing the 3D structures of proteins.

2.3.3 STITCH

STITCH database (Kuhn et al., 2007) is a valuable resource that includes information on interactions between 43,000 compounds and 9,643,763 proteins from 2,031 species. It shares protein interaction data with the STRING database (https://cn.string-db.org/), making it an important database for studying compound sequences. Each interaction in STITCH database is assigned a score value, which represents the affinity or probability of the interaction between a compound and a protein. STITCH also provides information on compounds that are similar to the drug of target, along with their similarity scores.

2.3.4 ZINC

ZINC (Irwin and Shoichet, 2006) is a free commercial database used for virtual screening of compounds, which provides access to 3D structures of over 230 million molecules. It offers multiple docking program interfaces, user-defined molecular operations, and web-based database search and browsing capabilities.

3 Representation

3.1 Sequence representation

Drug SMILES and target sequences are composed of different characters. Therefore, they are commonly encoded using one-hot encoding or label encoding in sequence and hybrid-based methods. Their sequence features are extracted using CNN, RNN, or Transformer. In structure-based methods, the extracted features from sequences are utilized as node features in the graphs of drugs and targets. In addition, traditional sequence features such as molecular fingerprint, position-specific score matrix (PSSM) (Altschul, 1997), and Hidden Markov Matrix (HMM) (Remmert et al., 2012) are also widely employed in DTA prediction.

3.2 Structure representation

For drug, the structure representation often involves graph. One common type of the drug graph is based on the drug SMILES, which can be converted using RDKit tool. Another type of the drug graph is based on the 3D structural file, where atoms serve as vertices and bonds act as edges. Node features in the drug graph can be derived from the physical-chemical properties of atoms or extracted from drug SMILES using deep learning techniques.

For target, the secondary structural information can be obtained directly from the relevant file of target and is widely employed in traditional machine learning and deep learning methods. The tertiary structural graph of target can be roughly categorized into two types: contact map and spatial topology graph. Contact map is created based on the sequence or tertiary structure of target, generating a map of interaction between amino acids. Structural features of target can be extracted directly from the contact map using CNN models. Alternatively, the contact map can be converted into a target graph, allowing the use of GNNs to extract structural features. Spatial topology graph of target is constructed based on the 3D structural file. Nodes in the graph represent amino acids, typically carbon α atoms, and edges are formed based on distance thresholds, such as Euclidean distance between carbon α atoms.

3.3 Interaction network graph representation of drug-target complex

In DTA prediction, the interaction between a drug and its target is often represented as a graph. This involves extracting interaction features using GNN. The construction of the interaction network graph is based on the 3D structure of drug-target complex. To create the graph, the atoms of drug and the carbon atom of amino acid in target (typically the carbon α atom) are selected as the vertices of graph. The Euclidean distance between each atom and the carbon atom is then calculated. If the distance is less than or equal to a specified threshold (usually set to 8 Å or 10 Å), an edge is created to connect the corresponding atom to the amino acid. Any atoms and amino acids that did not participate in the construction of the interaction graph are excluded. This process results in an interaction network graph that represents the drug-target complex, which can be used for analysis and prediction of DTA.

3.4 3D structural spatial grid representation of drug-target complex

While the interaction network graph of drug-target complex can provide valuable information about the structural features, some atoms and amino acids are ignored. As a result, deep learning methods that utilize a complete 3D structural spatial grid representation of drug-target complex are widely used. The 3D structural spatial grid representation of the complex is composed of the spatial coordinates of all atoms, and 3D-CNN is used to extract the spatial structural features from the complex’s 3D structure.

4 Drug-target affinity prediction methods based on deep learning

Currently, computational methods for predicting DTA using deep learning can be broadly categorized into three groups based on the progression from sequence to structure: sequence-based, hybrid-based, and structure-based methods. In the following chapters, we will provide a comprehensive overview of the feature extraction process for each category.

4.1 Sequence-based deep learning methods

Sequence-based deep learning methods (Figure 2) utilize drug SMILES and target sequences as input. These methods employ various deep learning techniques, including CNN, RNN, Transformer, and attention mechanisms, to extract essential features from the input sequences. In the following sections, we will provide an overview of some classic sequence-based methods.

FIGURE 2.

FIGURE 2

The overview architecture of sequence-based deep learning methods for predicting DTA.

4.1.1 DeepDTA

In DeepDTA (Öztürk et al., 2018), drug SMILES and target sequences were encoded as label encodings and used as inputs. The sequence feature extraction was conducted by two independent CNN blocks, each comprising three 1D convolutional layers. Drug SMILES and target sequences, were separately processed through the embedded layers and passed into their respective CNN blocks. This allowed for the extraction of high-level sequence features from drugs and targets. Subsequently, the extracted sequence features were concatenated and fed into a three-layer FC network to predict DTA. DeepDTA not only showed superior performance compared to traditional machine learning methods, but also enabled automatic extraction of sequence features and end-to-end DTA prediction. This contributed to the transition from traditional machine learning methods to deep learning methods in the field of DTA prediction.

4.1.2 DeepCDA

In the architecture of DeepCDA (Abbasi et al., 2020), drug SMILES and target sequences were used as inputs. Initially, both drug SMILES and target sequences underwent encoding via coding layers. The encoded representations were then separately fed into identical feature extraction networks. Each feature extraction network consisted of two components: a CNN block and an LSTM block. CNN block comprised three convolutional layers, responsible for extracting short-distance features from the sequences. These short-distance features were subsequently inputted into a multi-layer LSTM block to capture long-distance dependent features. By combining CNN and LSTM, DeepCDA effectively considered local and long-range dependent features of the sequence. To further extract crucial information influencing drug-target interaction, a bidirectional attention mechanism was employed to fuse the extracted sequence features. This fusion process enabled comprehensive feature mining that accounts for the interaction between drugs and targets. Finally, the fused features were fed into a FC layer to predict DTA.

4.1.3 AttentionDTA

AttentionDTA (Zhao et al., 2022) took drug SMILES and target sequences as input, which were encoded using label encoding. A character embedding layer was inserted between the label encoding layer and the feature extraction block to convert drug SMILES and target sequences into embedding matrices. These matrices were then passed through a CNN block consisting of multiple 1D-CNN layers to extract implicit sequence features. To capture the non-covalent interactions between the atoms of drug and the amino acids of target, AttentionDTA incorporated a bilateral multi-head attention mechanism. This mechanism took the features extracted by the CNN block as input, allowing it to capture the interaction information that affected drug-target interaction. The resulting key interaction information was subsequently fed into a multi-layer perceptron (MLP) for DTA prediction.

4.2 Hybrid-based deep learning methods

Hybrid-based deep learning methods (Figure 3) have been at the forefront of utilizing the structural features of drugs. The process begins by obtaining the graph representation of drug directly from its SMILES using RDKit tool. Subsequently, GNN is employed to learn implicit high-level structural features from the graph. Finally, these extracted structural features are combined with the sequence features of target to predict DTA. These methods effectively integrate the sequence and structural information to enhance the performance.

FIGURE 3.

FIGURE 3

The overview architecture of hybrid-based deep learning methods for predicting DTA.

4.2.1 GraphDTA

GraphDTA (Nguyen et al., 2021) was a representative hybrid-based deep learning method for predicting DTA. It leveraged the structural features of drugs and the sequence features of targets. Initially, drug SMILES was converted into a molecular graph using the RDKit tool. Subsequently, a three-layer GNN was employed to extract the structural features. As for target, the sequence underwent label encoding and embedding layers before being inputted into a convolutional block comprising three 1D-CNN layers to learn sequence features. Finally, the extracted structural features of drugs and the sequence features of targets were combined and fed into a FC network with multiple layers to estimate DTA.

4.2.2 MGraphDTA

MGraphDTA (Yang et al., 2022) took a similar approach by leveraging the structures of drugs and target sequences. However, it enhanced the global structural features extraction by employing a deeper multi-scale GNN (MGNN). This allowed for a comprehensive understanding of the global relationships between atoms in drug and captured various features within the structure of drug. Simultaneously, multi-scale CNN (MCNN) was applied to extract multi-scale features from target sequences. Following this, the multi-scale features from the structures of drugs and target sequences were separately fused, and the resulting fused features were concatenated to form a combined representation of drug-target pair. Finally, the combined representation was fed into MLP to predict DTA.

4.2.3 ColdDTA

Deep learning methods have exhibited promising performance on randomly split public datasets, but their performance tends to significantly decrease when applied to practical scenarios. To address this issue, ColdDTA (Fang et al., 2023) utilized the structural knowledge of drugs and target sequence information to enhance the model’s generalization performance by data augmentation and attention-based feature fusion techniques. The construction process of ColdDTA was as follows: firstly, a new drug-target pair was generated by removing a subgraph from the original graph of drug. Next, the structural features of drug and the sequence features of target were extracted using GNN and CNN, respectively. These extracted features were then fused via an attention-based fusion block to better capture the interaction mechanism between drug and target. Finally, the fused features were inputted into MLP to predict DTA.

4.3 Structure-based deep learning methods

Currently, structure-based deep learning methods for predicting DTA can be broadly categorized into two types. The first type involves extracting structural features from the molecular graphs of drugs and targets using GNN, followed by fusing the extracted features to predict DTA using a FC network (Figure 4A). The second type is based on 3D structures of drug-target complexes, where high-level structural features are extracted using 3D-CNN to predict DTA (Figure 4B). With the emergence of AlphaFold and ColabFold, obtaining the structures of targets has become more feasible. Furthermore, the rapid development of GNN and 3D-CNN has provided critical support for extracting structural features. As a result, structure-based methods have garnered increasing attention from researchers.

FIGURE 4.

FIGURE 4

The overview architecture of structure-based deep learning methods. (A) The extraction of structural features from molecular graphs of drugs and targets using Graph Neural Networks (GNN), and (B) the extraction of structural features of drug-target complexes from their 3D structures using 3D Convolutional Neural Networks (3D-CNN).

4.3.1 GSAML-DTA

GSAML-DTA (Liao et al., 2022) employed a hybrid network model combining GNN and GAT to extract structural features from drugs and targets. The process began by converting drug SMILES and target sequences into drug molecular graphs and contact maps, respectively, using different tools. Subsequently, drug molecular graphs and contact maps were separately inputted into the hybrid network model of GNN-GAT with an attention mechanism to extract structural features of drugs and targets. Following this, the extracted features were concatenated and further optimized through an interactive information module. Finally, the optimized features were fed into a FC network to predict DTA. By leveraging the GNN-GAT hybrid network model with attention mechanism and the interactive module, GSAML-DTA aimed to enhance the accuracy of DTA prediction.

4.3.2 HGRL-DTA

HGRL-DTA (Chu et al., 2022) utilized a hierarchical graph representation learning model for predicting DTA. This model established a hierarchical graph framework where the drug-target binding affinity data was represented as an affinity graph, with drugs and targets serving as vertices within the graph. Simultaneously, drugs and targets were represented as molecular graphs, respectively. To begin, GNN was employed to learn global-level affinity relationship within the affinity graph. Additionally, GNN was also used to separately capture the local chemical structural features of drugs and targets. Through a message propagation mechanism, the learned hierarchical graph information was integrated, and the structural features of drugs and targets were refined using GCN. Finally, these extracted structural features of drugs and targets were combined and inputted into a FC network to predict DTA. By leveraging the hierarchical graph setup, GNN-based representation learning, and message propagation mechanism, HGRL-DTA aimed to improve the accuracy of DTA prediction.

4.3.3 MSGNN-DTA

MSGNN-DTA (Wang et al., 2023b) employed a multi-scale graph construction approach to capture the structural features of drugs and targets from multiple perspectives. For drugs, two types of graphs were constructed. Firstly, an atomic level graph was generated using RDKit tool based on drug SMILES. In this graph, atoms were represented as vertices, chemical bonds between atoms were represented as edges, and the topology was represented by a two-dimensional matrix. Secondly, a motif level graph was created by considering certain motifs (e.g., benzene rings) as vertices, with edges indicating the presence of chemical bond connections between motifs. Regarding target, target sequence was converted into a contact map using ESM-1b (Rives et al., 2021). Additionally, a weight map was constructed based on WGNN-DTA (Jiang et al., 2022). In weight map, residues served as vertices, interactions between residues served as edges, and weights of edges were probability values. To obtain multi-scale topological feature representations, GNN was utilized to extracted high-level structural features from the atomic level graphs and motif level graphs of drugs, as well as the weight graphs of targets. Subsequently, an attention mechanism was employed to fuse the multi-scale structural features and generate a join feature representation. The joint feature representation was then inputted into a multi-layer FC network for DTA prediction.

4.3.4 Sfcnn

Aside from utilizing GNN to extract the structural features from molecular graphs of drugs and targets, there are some methods that use 3D-CNN to extract the structural features from drug-target complexes. One such method was Sfcnn, which employed 3D-CNN to generate a score function for DTA prediction. To begin, the drug-target complex was transformed into a 3D grid representation. This grid served as input to 3D-CNN, which learned high-level structural features. Finally, multiple density layers were applied to the extracted features for DTA prediction.

5 Performance analysis of multiple state-of-the-art methods based on deep learning

5.1 Common performance evaluation metrics

In this review, predicting drug-target affinity is a regression task, and commonly used performance evaluation metrics of the model include Mean Absolute Error (MAE), Mean Square Error (MSE), Root Mean Square Error (RMSE), Pearson Correlation Coefficient (PCC), Spearman ( ρ ), Concordance Index (CI), and R 2.

MAE Eq. 1 is used to measure the mean absolute error between prediction value and actual value. It reflects the size of actual prediction error.

MAE=1ni=1nyiyi^,0,+) (1)

MSE Eq. 2 and RMSE Eq. 3 are often used to measure the deviation between prediction value and actual value. It is a measure of accuracy used to compare the prediction errors of different models for specific dataset and measure the error rate of the regression model. For MAE, MSE, and RMSE, the smaller their values are, the better effect of the model is.

MSE=1ni=1nyiyi^2 (2)
RMSE=1ni=1nyiyi^2,0,+) (3)

R2 Eq. 4 is mainly used to measure how well the prediction value fits the actual value. When our model does not have any deficiencies, R2 will get the maximum value of 1. If R2 is 0, our model is equal to the baseline model. When R2 is less than 0, it means that our model is not as good as the baseline model.

R2=1i=1nyiyi^2i=1nyiy¯2,0,1 (4)

In formula Eqs 14, n is the number of samples, yi is the vector of actual value, yi^ is the precdition vector, and y¯ is the average value of all actual values yi1in .

PCC Eq. 5 is used to measure the mutual relationship (linear correlation) between two variables X and Y, and its range is [−1, 1]. PCC is widely used in academic research to measure the strength of the linear correlation between two variables. Cov (X, Y) represents the covariance of two variables X and Y. σX is the standard deviation of X. If ρXY>0 , it means that X and Y are positively correlated; ρXY<0 , X and Y are negatively correlated; ρXY=0 , X and Y are not correlated.

ρXY=covX,YσXσY,1,1 (5)

Spearman Eq. 6 is a nonparametric measure of the dependence of two variables. n is the number of samples. The difference between prediction value and actual value of each group is di1in . The closer value of correlation coefficient ρ is to +1 or −1, the stronger correlation between two variables.

ρ=16di2nn2 (6)

CI Eq. 7 is used to evaluate the prediction accuracy of the model. Where bi is the prediction value for the larger affinity δi , bj is the prediction value for the smaller affinity δj , Z is a normalization constant. For function φx , it is 1 if the value of x is greater than 0, 0.5 if the value of x is equal to 0, and 0 if the value of x is less than 0.

CI=1Zδi>δjδbibj1i,jn (7)

5.2 Performance analysis of multiple state-of-the-art methods based on PDBbind, KIBA, and Davis datasets

Figure 1 highlights PDBbind, KIBA, and Davis datasets as commonly used datasets for predicting DTA using deep learning. We summarized the performance evaluation metrics values of several state-of-the-art methods on PDBbind, KIBA, and Davis datasets, as reported in recently published literatures (Wang et al., 2023a; Zhu et al., 2023a; Bi et al., 2023; Xia et al., 2023; Tian et al., 2024; Wu et al., 2024; Zhou et al., 2024), without considering the specific partitioning of the corresponding datasets by these methods. Although the statistical results (Tables 4, 5; Figures 57) showed that these methods have achieved good prediction performance for DTA on commonly used benchmark datasets, the further improvement in DTA prediction still faces challenges. Researchers are actively working on extracting high-level implicit features from the sequences, structures, or complexes of drugs and targets, with the aim of developing novel methods with even better performance for predicting DTA.

TABLE 4.

Performance comparison of multiple state-of-the-art methods based on PDBbind dataset.

Methods MSE RMSE CI
Pafnucy 1.418 1.129 0.789
DeepDTA 1.443 1.148 0.771
DeepDTAF 1.355 1.073 0.799
FusionDTA 1.504 1.2 0.766
DataDTA 1.274 1.012 0.806
GraphDTA 1.579 1.193 0.66
WideDTA 1.633 1.295 0.638
DeepGS 1.385 1.096 0.784
DeepFusionDTA 1.235 1.203 0.774
AffinityVAE 1.398 1.102 0.792

TABLE 5.

Performance comparison of multiple state-of-the-art methods based on KIBA and Davis datasets.

Dataset Methods MSE CI Methods MSE CI
KIBA GanDTI 0.469 0.878 DGraphDTA 0.127 0.902
GraphDTA 0.441 0.881 HiSIF-DTA 0.12 0.904
WGNNDTA 0.43 0.886 GTAMP-DTA 0.123 0.917
MGraphDTA 0.427 0.889 TransVAE-DTA 0.2536 0.8221
UCMCDTA 0.421 0.891 AttentionMGT-DTA 0.14 0.893
Davis GanDTI 0.236 0.885 DGraphDTA 0.202 0.905
GraphDTA 0.225 0.895 HiSIF-DTA 0.191 0.907
WGNNDTA 0.211 0.898 GTAMP-DTA 0.177 0.923
MGraphDTA 0.205 0.899 TransVAE-DTA 0.3229 0.8596
UCMCDTA 0.203 0.9 AttentionMGT-DTA 0.193 0.891

FIGURE 5.

FIGURE 5

Performance analysis of multiple state-of-the-art methods based on PDBbind dataset. The general set and refined set are used as the training dataset, while the core set serves as the test dataset. The evaluation metric values of these methods in the figure are sourced from References (Wang et al., 2023a; Zhu et al., 2023a).

FIGURE 7.

FIGURE 7

Performance analysis of multiple state-of-the-art methods based on Davis dataset. The evaluation metric values of these methods in the figure are sourced from References (Bi et al., 2023; Xia et al., 2023; Tian et al., 2024; Wu et al., 2024; Zhou et al., 2024).

FIGURE 6.

FIGURE 6

Performance analysis of multiple state-of-the-art methods based on KIBA dataset. The evaluation metric values of these methods in the figure are sourced from References (Bi et al., 2023; Xia et al., 2023; Tian et al., 2024; Wu et al., 2024; Zhou et al., 2024).

6 Conclusion

Deep learning-based computational methods for DTA prediction have become a crucial component of drug discovery in the pharmaceutical industry. Despite the significant progress achieved by these methods, there is still a gap between their current prediction accuracy and the expectations of researchers. Therefore, to further facilitate the development of novel and high-precision computational methods for DTA prediction, this review provides detailed statistics, summaries, and elaboration on commonly used datasets, the sequence and structural representations of drugs and targets, as well as representative deep learning methods.

From the comprehensive overview of advanced methods for predicting DTA based on deep learning, three key points stand out:

  • 1. It is essential to thoughtfully combine deep learning models like CNN, RNN, and GNN to extract crucial implicit features influencing DTA prediction from the sequences, structures, and other data related to drugs and targets.

  • 2. Deep learning models are employed to extract numerous features from diverse modalities of drugs and targets. Further refinement and effective fusion of these features are vital to obtain comprehensive deep features.

  • 3. Most DTA prediction methods using deep learning lack explanations of their effectiveness. This absence hinders researchers from enhancing current methods.

In the future, it is imperative to delve into DTA prediction methods based on deep learning from three key perspectives:

  • 1. Alongside commonly used deep learning models like CNN, RNN, and GNN, it is essential to incorporate unsupervised learning models such as contrastive learning to comprehensively capture the pivotal features influencing DTA prediction.

  • 2. The emergence of tools such as AlphaFold has made it no longer difficult to obtain the structures of targets, with these structures playing a crucial role in determining molecular function. Hence, delving deeper into the three-dimensional spatial structural features of drugs and targets will help enhance the performance of DTA prediction.

  • 3. While some deep learning-based methods for DTA prediction have shown promising results on standard datasets, their generalization performance is not satisfactory. Therefore, focusing on selecting specific datasets within particular fields and constructing deep learning models for DTA prediction that directly cater to practical application requirements will emerge as a prominent area of research interest.

Funding Statement

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by the National Natural Sciences Foundation of China (No. 62366002), Yunnan Fundamental Research Projects (No. 202101BA070001-227), Yunnan Young and Middle-aged Academic and Technical Leaders Reserve Talent Project in China (No. 202405AC350023), and a grant (No. 2023KF005) from State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University.

Author contributions

XZ: Writing–original draft. S-JL: Investigation, Supervision, Writing–review and editing. S-QL: Visualization, Writing–review and editing. M-LW: Project administration, Writing–review and editing. YL: Writing–review and editing.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  1. Abbasi K., Razzaghi P., Poso A., Amanlou M., Ghasemi J. B., Masoudi-Nejad A. (2020). DeepCDA: deep cross-domain compound–protein affinity prediction through LSTM and convolutional neural networks. Bioinformatics 36, 4633–4642. 10.1093/bioinformatics/btaa544 [DOI] [PubMed] [Google Scholar]
  2. Abbasi K., Razzaghi P., Poso A., Ghanbari-Ara S., Masoudi-Nejad A. (2021). Deep learning in drug target interaction prediction: current and future perspectives. Curr. Med. Chem. 28, 2100–2113. 10.2174/0929867327666200907141016 [DOI] [PubMed] [Google Scholar]
  3. Altschul S., Madden T. L., Schäffer A. A., Zhang J., Zhang Z., Miller W., et al. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402. 10.1093/nar/25.17.3389 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Ballester P. J., Mitchell J. B. O. (2010). A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking. Bioinformatics 26, 1169–1175. 10.1093/bioinformatics/btq112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Berman H. M., Westbrook J., Feng Z., Gilliland G., Bhat T. N., Weissig H., et al. (2000). The protein Data Bank. Nucleic Acids Res. 28, 235–242. 10.1093/nar/28.1.235 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bi X., Zhang S., Ma W., Jiang H., Wei Z. (2023). HiSIF-DTA: a hierarchical semantic information fusion framework for drug-target affinity prediction. IEEE J. Biomed. Health Inf., 1–12. 10.1109/JBHI.2023.3334239 [DOI] [PubMed] [Google Scholar]
  7. Chen H., Li D., Liao J., Wei L., Wei L. (2022). MultiscaleDTA: a multiscale-based method with a self-attention mechanism for drug-target binding affinity prediction. Methods 207, 103–109. 10.1016/j.ymeth.2022.09.006 [DOI] [PubMed] [Google Scholar]
  8. Cheng Z., Yan C., Wu F.-X., Wang J. (2022). Drug-target interaction prediction using multi-head self-attention and graph attention network. IEEE/ACM Trans. Comput. Biol. Bioinform. 19, 2208–2218. 10.1109/TCBB.2021.3077905 [DOI] [PubMed] [Google Scholar]
  9. Chu Z., Huang F., Fu H., Quan Y., Zhou X., Liu S., et al. (2022). Hierarchical graph representation learning for the prediction of drug-target binding affinity. Inf. Sci. 613, 507–523. 10.1016/j.ins.2022.09.043 [DOI] [Google Scholar]
  10. Çınaroğlu S. S., Timuçin E. (2020). Comprehensive evaluation of the MM-GBSA method on bromodomain-inhibitor sets. Brief. Bioinform. 21, 2112–2125. 10.1093/bib/bbz143 [DOI] [PubMed] [Google Scholar]
  11. Davis M. I., Hunt J. P., Herrgard S., Ciceri P., Wodicka L. M., Pallares G., et al. (2011). Comprehensive analysis of kinase inhibitor selectivity. Nat. Biotechnol. 29, 1046–1051. 10.1038/nbt.1990 [DOI] [PubMed] [Google Scholar]
  12. Dehghan A., Razzaghi P., Abbasi K., Gharaghani S. (2023). TripletMultiDTI: multimodal representation learning in drug-target interaction prediction with triplet loss function. Expert Syst. Appl. 232, 120754. 10.1016/j.eswa.2023.120754 [DOI] [Google Scholar]
  13. Fang K., Zhang Y., Du S., He J. (2023). ColdDTA: utilizing data augmentation and attention-based feature fusion for drug-target binding affinity prediction. Comput. Biol. Med. 164, 107372. 10.1016/j.compbiomed.2023.107372 [DOI] [PubMed] [Google Scholar]
  14. Feng Q., Dueva E., Cherkasov A., Ester M. (2019). PADME: a deep learning-based framework for drug-target interaction prediction. Available at: http://arxiv.org/abs/1807.09741 (Accessed March 16, 2024).
  15. Ghimire A., Tayara H., Xuan Z., Chong K. T. (2022). CSatDTA: prediction of drug–target binding affinity using convolution model with self-attention. Int. J. Mol. Sci. 23, 8453. 10.3390/ijms23158453 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Gim M., Choe J., Baek S., Park J., Lee C., Ju M., et al. (2023). ArkDTA: attention regularization guided by non-covalent interactions for explainable drug–target binding affinity prediction. Bioinformatics 39, i448–i457. 10.1093/bioinformatics/btad207 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Gomes J., Ramsundar B., Feinberg E. N., Pande V. S. (2017). Atomic convolutional networks for predicting protein-ligand binding affinity. Available at: http://arxiv.org/abs/1703.10603 (Accessed January 4, 2023).
  18. Irwin J. J., Shoichet B. K. (2006). Zinc – a free database of commercially available compounds for virtual screening. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Jiang M., Li Z., Zhang S., Wang S., Wang X., Yuan Q., et al. (2020). Drug–target affinity prediction using graph neural network and contact maps. RSC Adv. 10, 20701–20712. 10.1039/D0RA02297G [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Jiang M., Shao Y., Zhang Y., Zhou W., Pang S. (2023). A deep learning method for drug-target affinity prediction based on sequence interaction information mining. PeerJ 11, e16625. 10.7717/peerj.16625 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Jiang M., Wang S., Zhang S., Zhou W., Zhang Y., Li Z. (2022). Sequence-based drug-target affinity prediction using weighted graph neural networks. BMC Genomics 23, 449. 10.1186/s12864-022-08648-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Jin Z., Wu T., Chen T., Pan D., Wang X., Xie J., et al. (2023). CAPLA: improved prediction of protein–ligand binding affinity by a deep learning approach based on a cross-attention mechanism. Bioinformatics 39, btad049. 10.1093/bioinformatics/btad049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Jorgensen W. L., Thomas L. L. (2008). Perspective on free-energy perturbation calculations for chemical equilibria. J. Chem. Theory Comput. 4, 869–876. 10.1021/ct800011m [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589. 10.1038/s41586-021-03819-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kairys V., Baranauskiene L., Kazlauskiene M., Matulis D., Kazlauskas E. (2019). Binding affinity in drug design: experimental and computational techniques. Expert Opin. Drug Discov. 14, 755–768. 10.1080/17460441.2019.1623202 [DOI] [PubMed] [Google Scholar]
  26. Kalemati M., Zamani Emani M., Koohi S. (2023). BiComp-DTA: drug-target binding affinity prediction through complementary biological-related and compression-based featurization approach. PLOS Comput. Biol. 19, e1011036. 10.1371/journal.pcbi.1011036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Karimi M., Wu D., Wang Z., Shen Y. (2019). DeepAffinity: interpretable deep learning of compound–protein affinity through unified recurrent and convolutional neural networks. Bioinformatics 35, 3329–3338. 10.1093/bioinformatics/btz111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Karimi M., Wu D., Wang Z., Shen Y. (2021). Explainable deep relational networks for predicting compound–protein affinities and contacts. J. Chem. Inf. Model. 61, 46–66. 10.1021/acs.jcim.0c00866 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kim G., Lee S., Karin E. L., Kim H., Moriwaki Y., Ovchinnikov S., et al. (2023). Easy and accurate protein structure prediction using ColabFold. Protoc. Exch. 10.21203/rs.3.pex-2490/v1 [DOI] [PubMed] [Google Scholar]
  30. Kuhn M., von Mering C., Campillos M., Jensen L. J., Bork P. (2007). STITCH: interaction networks of chemicals and proteins. Nucleic Acids Res. 36, D684–D688. 10.1093/nar/gkm795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kwon Y., Shin W.-H., Ko J., Lee J. (2020). AK-score: accurate protein-ligand binding affinity prediction using an ensemble of 3D-convolutional neural networks. Int. J. Mol. Sci. 21, 8424. 10.3390/ijms21228424 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Landrum G. (2013). RDKit: a software suite for cheminformatics, computational chemistry, and predictive modeling. Available at: https://www.rdkit.org/RDKit_Overview.pdf.
  33. LeCun Y., Bengio Y., Hinton G. (2015). Deep learning. Nature 521, 436–444. 10.1038/nature14539 [DOI] [PubMed] [Google Scholar]
  34. Li H., Leung K.-S., Wong M.-H., Ballester P. (2015). Low-quality structural and interaction data improves binding affinity prediction via random forest. Molecules 20, 10947–10962. 10.3390/molecules200610947 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Li Q., Zhang X., Wu L., Bo X., He S., Wang S. (2022a). PLA-MoRe: a protein–ligand binding affinity prediction model via comprehensive molecular representations. J. Chem. Inf. Model. 62, 4380–4390. 10.1021/acs.jcim.2c00960 [DOI] [PubMed] [Google Scholar]
  36. Li S., Zhou J., Xu T., Huang L., Wang F., Xiong H., et al. (2021). “Structure-aware interactive graph neural networks for the prediction of protein-ligand binding affinity,” in Proceedings of the 27th ACM SIGKDD conference on knowledge discovery and data mining (USA: Virtual Event Singapore: ACM; ), 975–985. 10.1145/3447548.3467311 [DOI] [Google Scholar]
  37. Li T., Zhao X.-M., Li L. (2022b). Co-VAE: drug-target binding affinity prediction by Co-regularized variational autoencoders. IEEE Trans. Pattern Anal. Mach. Intell. 44, 8861–8873. 10.1109/TPAMI.2021.3120428 [DOI] [PubMed] [Google Scholar]
  38. Li Y., Han L., Liu Z., Wang R. (2014). Comparative assessment of scoring functions on an updated benchmark: 2. Evaluation methods and general results. J. Chem. Inf. Model. 54, 1717–1736. 10.1021/ci500081m [DOI] [PubMed] [Google Scholar]
  39. Liao J., Chen H., Wei L., Wei L. (2022). GSAML-DTA: an interpretable drug-target binding affinity prediction model based on graph neural networks with self-attention mechanism and mutual information. Comput. Biol. Med. 150, 106145. 10.1016/j.compbiomed.2022.106145 [DOI] [PubMed] [Google Scholar]
  40. Lin S., Shi C., Chen J. (2022a). GeneralizedDTA: combining pre-training and multi-task learning to predict drug-target binding affinity for unknown drug discovery. BMC Bioinforma. 23, 367. 10.1186/s12859-022-04905-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Lin X., Xu S., Liu X., Zhang X., Hu J. (2022b). Detecting drug–target interactions with feature similarity fusion and molecular graphs. Biology 11, 967. 10.3390/biology11070967 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Liu Q., Wang P.-S., Zhu C., Gaines B. B., Zhu T., Bi J., et al. (2021). OctSurf: efficient hierarchical voxel-based molecular surface representation for protein-ligand affinity prediction. J. Mol. Graph. Model. 105, 107865. 10.1016/j.jmgm.2021.107865 [DOI] [PubMed] [Google Scholar]
  43. Liu T., Lin Y., Wen X., Jorissen R. N., Gilson M. K. (2007). BindingDB: a web-accessible database of experimentally determined protein-ligand binding affinities. Nucleic Acids Res. 35, D198–D201. 10.1093/nar/gkl999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Lu R., Wang J., Li P., Li Y., Tan S., Pan Y., et al. (2023). Improving drug-target affinity prediction via feature fusion and knowledge distillation. Brief. Bioinform. 24, bbad145. 10.1093/bib/bbad145 [DOI] [PubMed] [Google Scholar]
  45. Ma D., Li S., Chen Z. (2022). Drug-target binding affinity prediction method based on a deep graph neural network. Math. Biosci. Eng. 20, 269–282. 10.3934/mbe.2023012 [DOI] [PubMed] [Google Scholar]
  46. Ma W., Zhang S., Li Z., Jiang M., Wang S., Guo N., et al. (2023). Predicting drug-target affinity by learning protein knowledge from biological networks. IEEE J. Biomed. Health Inf. 27, 2128–2137. 10.1109/JBHI.2023.3240305 [DOI] [PubMed] [Google Scholar]
  47. Mekni N., Fooladi H., Perricone U., Langer T. (2023). Encoding protein-ligand interactions: binding affinity prediction with multigraph-based modeling and graph convolutional network. Chemistry. 10.26434/chemrxiv-2023-bvps7-v2 [DOI] [Google Scholar]
  48. Metz J. T., Johnson E. F., Soni N. B., Merta P. J., Kifle L., Hajduk P. J. (2011). Navigating the kinome. Nat. Chem. Biol. 7, 200–202. 10.1038/nchembio.530 [DOI] [PubMed] [Google Scholar]
  49. Monteiro N. R. C., Oliveira J. L., Arrais J. P. (2022). DTITR: end-to-end drug–target binding affinity prediction with transformers. Comput. Biol. Med. 147, 105772. 10.1016/j.compbiomed.2022.105772 [DOI] [PubMed] [Google Scholar]
  50. Nguyen T., Le H., Quinn T. P., Le T., Venkatesh S. (2022). GraphDTA: predicting drug–target binding affinity with graph neural networks. PLoS One 17, e0278387. 10.1371/journal.pone.0278387 [DOI] [PubMed] [Google Scholar]
  51. Nguyen T., Le H., Quinn T. P., Nguyen T., Le T. D., Venkatesh S. (2021). GraphDTA: predicting drug-target binding affinity with graph neural networks. Bioinforma. Oxf. Engl. 37, 1140–1147. 10.1093/bioinformatics/btaa921 [DOI] [PubMed] [Google Scholar]
  52. Öztürk H., Özgür A., Ozkirimli E. (2018). DeepDTA: deep drug–target binding affinity prediction. Bioinformatics 34, i821–i829. 10.1093/bioinformatics/bty593 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Pan S., Xia L., Xu L., Li Z. (2023). SubMDTA: drug target affinity prediction based on substructure extraction and multi-scale features. BMC Bioinforma. 24, 334. 10.1186/s12859-023-05460-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Pandey M., Radaeva M., Mslati H., Garland O., Fernandez M., Ester M., et al. (2022). Ligand binding prediction using protein structure graphs and residual graph attention networks. Molecules 27, 5114. 10.3390/molecules27165114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Rafiei F., Zeraati H., Abbasi K., Ghasemi J. B., Parsaeian M., Masoudi-Nejad A. (2023). DeepTraSynergy: drug combinations using multimodal deep learning with transformers. Bioinformatics 39, btad438. 10.1093/bioinformatics/btad438 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Remmert M., Biegert A., Hauser A., Söding J. (2012). HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat. Methods 9, 173–175. 10.1038/nmeth.1818 [DOI] [PubMed] [Google Scholar]
  57. Rifaioglu A. S., Cetin Atalay R., Cansen Kahraman D., Doğan T., Martin M., Atalay V. (2021). MDeePred: novel multi-channel protein featurization for deep learning-based binding affinity prediction in drug discovery. Bioinformatics 37, 693–704. 10.1093/bioinformatics/btaa858 [DOI] [PubMed] [Google Scholar]
  58. Rives A., Meier J., Sercu T., Goyal S., Lin Z., Liu J., et al. (2021). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proc. Natl. Acad. Sci. 118, e2016239118. 10.1073/pnas.2016239118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Ru X., Zou Q., Lin C. (2023). Optimization of drug–target affinity prediction methods through feature processing schemes. Bioinformatics 39, btad615. 10.1093/bioinformatics/btad615 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Rube H. T., Rastogi C., Feng S., Kribelbauer J. F., Li A., Becerra B., et al. (2022). Prediction of protein–ligand binding affinity from sequencing data with interpretable machine learning. Nat. Biotechnol. 40, 1520–1527. 10.1038/s41587-022-01307-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Seo S., Choi J., Park S., Ahn J. (2021). Binding affinity prediction for protein–ligand complex using deep attention mechanism based on intermolecular interactions. BMC Bioinforma. 22, 542. 10.1186/s12859-021-04466-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Shar P. A., Tao W., Gao S., Huang C., Li B., Zhang W., et al. (2016). Pred-binding: large-scale protein–ligand binding affinity prediction. J. Enzyme Inhib. Med. Chem. 31, 1443–1450. 10.3109/14756366.2016.1144594 [DOI] [PubMed] [Google Scholar]
  63. Shen H., Zhang Y., Zheng C., Wang B., Chen P. (2021). A cascade graph convolutional network for predicting protein–ligand binding affinity. Int. J. Mol. Sci. 22, 4023. 10.3390/ijms22084023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Shim J., Hong Z.-Y., Sohn I., Hwang C. (2021). Prediction of drug–target binding affinity using similarity-based convolutional neural network. Sci. Rep. 11, 4416. 10.1038/s41598-021-83679-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Stepniewska-Dziubinska M. M., Zielenkiewicz P., Siedlecki P. (2018). Development and evaluation of a deep learning model for protein–ligand binding affinity prediction. Bioinformatics 34, 3666–3674. 10.1093/bioinformatics/bty374 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Su M., Yang Q., Du Y., Feng G., Liu Z., Li Y., et al. (2019). Comparative assessment of scoring functions: the CASF-2016 update. J. Chem. Inf. Model. 59, 895–913. 10.1021/acs.jcim.8b00545 [DOI] [PubMed] [Google Scholar]
  67. Suzek B. E., Wang Y., Huang H., McGarvey P. B., Wu C. H. the UniProt Consortium (2015). UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches. Bioinformatics 31, 926–932. 10.1093/bioinformatics/btu739 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Tang J., Szwajda A., Shakyawar S., Xu T., Hintsanen P., Wennerberg K., et al. (2014). Making sense of large-scale kinase inhibitor bioactivity data sets: a comparative and integrative analysis. J. Chem. Inf. Model. 54, 735–743. 10.1021/ci400709d [DOI] [PubMed] [Google Scholar]
  69. The U. P. C., Bateman A., Martin M.-J., Orchard S., Magrane M., Agivetova R., et al. (2021). UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 49, D480–D489. 10.1093/nar/gkaa1100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Tian C., Wang L., Cui Z., Wu H. (2024). GTAMP-DTA: graph transformer combined with attention mechanism for drug-target binding affinity prediction. Comput. Biol. Chem. 108, 107982. 10.1016/j.compbiolchem.2023.107982 [DOI] [PubMed] [Google Scholar]
  71. Tian Q., Ding M., Yang H., Yue C., Zhong Y., Du Z., et al. (2022). Predicting drug-target affinity based on recurrent neural networks and graph convolutional neural networks. Comb. Chem. High. Throughput Screen. 25, 634–641. 10.2174/1386207324666210215101825 [DOI] [PubMed] [Google Scholar]
  72. Tsui L.-I., Hsu T.-C., Lin C. (2023). “NG-DTA: drug-target affinity prediction with n-gram molecular graphs,” in 2023 45th annual international conference of the IEEE engineering in medicine and biology society (EMBC) (Sydney, Australia: IEEE; ), 1–4. 10.1109/EMBC40787.2023.10339968 [DOI] [PubMed] [Google Scholar]
  73. Vaswani A., Shazeer N., Parmar N., Uszkoreit J., Jones L., Gomez A. N., et al. (2017). Attention is all you need. Available at: http://arxiv.org/abs/1706.03762 (Accessed May 18, 2023).
  74. Wang K., Li M. (2023). Fusion-based deep learning architecture for detecting drug-target binding affinity using target and drug sequence and structure. IEEE J. Biomed. Health Inf. 27, 6112–6120. 10.1109/JBHI.2023.3315073 [DOI] [PubMed] [Google Scholar]
  75. Wang K., Zhou R., Li Y., Li M. (2021a). DeepDTAF: a deep learning method to predict protein–ligand binding affinity. Brief. Bioinform. 22, bbab072. 10.1093/bib/bbab072 [DOI] [PubMed] [Google Scholar]
  76. Wang M., Li W., Yu X., Luo Y., Han K., Wang C., et al. (2023a). AffinityVAE: a multi-objective model for protein-ligand affinity prediction and drug design. Comput. Biol. Chem. 107, 107971. 10.1016/j.compbiolchem.2023.107971 [DOI] [PubMed] [Google Scholar]
  77. Wang R., Fang X., Lu Y., Yang C.-Y., Wang S. (2005). The PDBbind database: methodologies and updates. J. Med. Chem. 48, 4111–4119. 10.1021/jm048957q [DOI] [PubMed] [Google Scholar]
  78. Wang S., Shan P., Zhao Y., Zuo L. (2021b). GanDTI: a multi-task neural network for drug-target interaction prediction. Comput. Biol. Chem. 92, 107476. 10.1016/j.compbiolchem.2021.107476 [DOI] [PubMed] [Google Scholar]
  79. Wang S., Song X., Zhang Y., Zhang K., Liu Y., Ren C., et al. (2023b). MSGNN-DTA: multi-scale topological feature fusion based on graph neural networks for drug–target binding affinity prediction. Int. J. Mol. Sci. 24, 8326. 10.3390/ijms24098326 [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Wang Y., Wei Z., Xi L. (2022). Sfcnn: a novel scoring function based on 3D convolutional neural network for accurate and stable protein–ligand affinity prediction. BMC Bioinforma. 23, 222. 10.1186/s12859-022-04762-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Weininger D. (1988). SMILES, a chemical language and information system. 1. Introduction to methodology and encoding rules. J. Chem. Inf. Model. 28, 31–36. 10.1021/ci00057a005 [DOI] [Google Scholar]
  82. Wu H., Liu J., Jiang T., Zou Q., Qi S., Cui Z., et al. (2024). AttentionMGT-DTA: a multi-modal drug-target affinity prediction using graph transformer and attention mechanism. Neural Netw. 169, 623–636. 10.1016/j.neunet.2023.11.018 [DOI] [PubMed] [Google Scholar]
  83. Xia L., Xu L., Pan S., Niu D., Zhang B., Li Z. (2023). Drug-target binding affinity prediction using message passing neural network and self supervised learning. BMC Genomics 24, 557. 10.1186/s12864-023-09664-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Xu K., Hu W., Leskovec J., Jegelka S. (2019). HOW POWERFUL ARE GRAPH NEURAL NETWORKS? Int. Conf. Learn. Represent. 10.48550/arXiv.1810.00826 [DOI] [Google Scholar]
  85. Xu L., Ru X., Song R. (2021). Application of machine learning for drug–target interaction prediction. Front. Genet. 12, 680117. 10.3389/fgene.2021.680117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Yang X., Yang G., Chu J. (2023). GraphCL-DTA: a graph contrastive learning with molecular semantics for drug-target binding affinity prediction. Available at: http://arxiv.org/abs/2307.08989 (Accessed January 11, 2024). [DOI] [PubMed]
  87. Yang Z., Zhong W., Zhao L., Yu-Chian Chen C. (2022). MGraphDTA: deep multiscale graph neural network for explainable drug–target binding affinity prediction. Chem. Sci. 13, 816–833. 10.1039/D1SC05180F [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Yuan H., Huang J., Li J. (2021). Protein-ligand binding affinity prediction model based on graph attention network. Math. Biosci. Eng. 18, 9148–9162. 10.3934/mbe.2021451 [DOI] [PubMed] [Google Scholar]
  89. Zaremba W., Sutskever I., Vinyals O. (2015). Recurrent neural network regularization. Available at: http://arxiv.org/abs/1409.2329 (Accessed May 18, 2023).
  90. Zeng X., Zhong K.-Y., Jiang B., Li Y. (2023). Fusing sequence and structural knowledge by heterogeneous models to accurately and interpretively predict drug–target affinity. Molecules 28, 8005. 10.3390/molecules28248005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Zeng Y., Chen X., Luo Y., Li X., Peng D. (2021). Deep drug-target binding affinity prediction with multiple attention blocks. Brief. Bioinform. 22, bbab117. 10.1093/bib/bbab117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Zhang H., Liao L., Saravanan K. M., Yin P., Wei Y. (2019). DeepBindRG: a deep learning based method for estimating effective protein–ligand affinity. PeerJ 7, e7362. 10.7717/peerj.7362 [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Zhang L., Ouyang C., Liu Y., Liao Y., Gao Z. (2023a). Multimodal contrastive representation learning for drug-target binding affinity prediction. Methods 220, 126–133. 10.1016/j.ymeth.2023.11.005 [DOI] [PubMed] [Google Scholar]
  94. Zhang L., Wang C.-C., Chen X. (2022). Predicting drug–target binding affinity through molecule representation block based on multi-head attention and skip connection. Brief. Bioinform. 23, bbac468. 10.1093/bib/bbac468 [DOI] [PubMed] [Google Scholar]
  95. Zhang L., Wang C.-C., Zhang Y., Chen X. (2023b). GPCNDTA: prediction of drug-target binding affinity through cross-attention networks augmented with graph features and pharmacophores. Comput. Biol. Med. 166, 107512. 10.1016/j.compbiomed.2023.107512 [DOI] [PubMed] [Google Scholar]
  96. Zhang S., Jiang M., Wang S., Wang X., Wei Z., Li Z. (2021). SAG-DTA: prediction of drug–target affinity using self-attention graph network. Int. J. Mol. Sci. 22, 8993. 10.3390/ijms22168993 [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Zhang S., Jin Y., Liu T., Wang Q., Zhang Z., Zhao S., et al. (2023c). SS-GNN: a simple-structured graph neural network for affinity prediction. ACS Omega 8, 22496–22507. 10.1021/acsomega.3c00085 [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Zhang X., Gao H., Wang H., Chen Z., Zhang Z., Chen X., et al. (2023d). PLANET: a multi-objective graph neural network model for protein–ligand binding affinity prediction. J. Chem. Inf. Model., Acs.jcim., 3c00253. 10.1021/acs.jcim.3c00253 [DOI] [PubMed] [Google Scholar]
  99. Zhang Y., Hu Y., Han N., Yang A., Liu X., Cai H. (2023e). A survey of drug-target interaction and affinity prediction methods via graph neural networks. Comput. Biol. Med. 163, 107136. 10.1016/j.compbiomed.2023.107136 [DOI] [PubMed] [Google Scholar]
  100. Zhao Q., Duan G., Yang M., Cheng Z., Li Y., Wang J. (2022). AttentionDTA: drug-target binding affinity prediction by sequence-based deep learning with attention mechanism. IEEE/ACM Trans. Comput. Biol. Bioinform. 20, 852–863. 10.1109/TCBB.2022.3170365 [DOI] [PubMed] [Google Scholar]
  101. Zheng L., Fan J., Mu Y. (2019). OnionNet: a multiple-layer intermolecular-contact-based convolutional neural network for protein–ligand binding affinity prediction. ACS Omega 4, 15956–15965. 10.1021/acsomega.9b01997 [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Zhou C., Li Z., Song J., Xiang W. (2024). TransVAE-DTA: transformer and variational autoencoder network for drug-target binding affinity prediction. Comput. Methods Programs Biomed. 244, 108003. 10.1016/j.cmpb.2023.108003 [DOI] [PubMed] [Google Scholar]
  103. Zhu Y., Zhao L., Wen N., Wang J., Wang C. (2023a). DataDTA: a multi-feature and dual-interaction aggregation framework for drug–target binding affinity prediction. Bioinformatics 39, btad560. 10.1093/bioinformatics/btad560 [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Zhu Z., Yao Z., Qi G., Mazur N., Yang P., Cong B. (2023b). Associative learning mechanism for drug‐target interaction prediction. CAAI Trans. Intell. Technol. 8, 1558–1577. 10.1049/cit2.12194 [DOI] [Google Scholar]
  105. Zhu Z., Yao Z., Zheng X., Qi G., Li Y., Mazur N., et al. (2023c). Drug–target affinity prediction method based on multi-scale information interaction and graph optimization. Comput. Biol. Med. 167, 107621. 10.1016/j.compbiomed.2023.107621 [DOI] [PubMed] [Google Scholar]

Articles from Frontiers in Pharmacology are provided here courtesy of Frontiers Media SA

RESOURCES