Dynamic category-sensitive hypergraph inferring and homo-heterogeneous neighbor feature learning for drug-related microbe prediction

Ping Xuan; Zelong Xu; Hui Cui; Jing Gu; Cheng Liu; Tiangang Zhang; Peiliang Wu

doi:10.1093/bioinformatics/btae562

. 2024 Sep 18;40(9):btae562. doi: 10.1093/bioinformatics/btae562

Dynamic category-sensitive hypergraph inferring and homo-heterogeneous neighbor feature learning for drug-related microbe prediction

Ping Xuan ^1,², Zelong Xu ³, Hui Cui ^4,⁵, Jing Gu ⁶, Cheng Liu ⁷, Tiangang Zhang ^8,^✉, Peiliang Wu ^9,^✉

Editor: Jonathan Wren

PMCID: PMC11441325 PMID: 39292557

Abstract

Motivation

The microbes in human body play a crucial role in influencing the functions of drugs, as they can regulate the activities and toxicities of drugs. Most recent methods for predicting drug–microbe associations are based on graph learning. However, the relationships among multiple drugs and microbes are complex, diverse, and heterogeneous. Existing methods often fail to fully model the relationships. In addition, the attributes of drug–microbe pairs exhibit long-distance spatial correlations, which previous methods have not integrated effectively.

Results

We propose a new prediction method named DHDMP which is designed to encode the relationships among multiple drugs and microbes and integrate the attributes of various neighbor nodes along with the pairwise long-distance correlations. First, we construct a hypergraph with dynamic topology, where each hyperedge represents a specific relationship among multiple drug nodes and microbe nodes. Considering the heterogeneity of node attributes across different categories, we developed a node category-sensitive hypergraph convolution network to encode these diverse relationships. Second, we construct homogeneous graphs for drugs and microbes respectively, as well as drug–microbe heterogeneous graph, facilitating the integration of features from both homogeneous and heterogeneous neighbors of each target node. Third, we introduce a graph convolutional network with cross-graph feature propagation ability to transfer node features from homogeneous to heterogeneous graphs for enhanced neighbor feature representation learning. The propagation strategy aids in the deep fusion of features from both types of neighbors. Finally, we design spatial cross-attention to encode the attributes of drug–microbe pairs, revealing long-distance correlations among multiple pairwise attribute patches. The comprehensive comparison experiments showed our method outperformed state-of-the-art methods for drug–microbe association prediction. The ablation studies demonstrated the effectiveness of node category-sensitive hypergraph convolution network, graph convolutional network with cross-graph feature propagation, and spatial cross-attention. Case studies on three drugs further showed DHDMP’s potential application in discovering the reliable candidate microbes for the interested drugs.

Availability and implementation

Source codes and supplementary materials are available at https://github.com/pingxuan-hlju/DHDMP.

1 Introduction

The human body hosts a diverse community of microbes, including bacteria, viruses, protists, archaea, and fungi (Gill et al. 2006, De Lorenzi-Tognon et al. 2023). These microbial communities are closely linked to human health (Aggarwal et al. 2022, VanEvery et al. 2023), with studies suggesting that they can promote human metabolism and enhance the immune system (Ventura et al. 2009, Sommer and Bäckhed 2013). However, disruptions in these communities can lead to diseases such as hypertension and diabetes (Wen et al. 2008, Xiong et al. 2021).

Microbes can also chemically modify drugs to enhance their efficacy (Zimmermann et al. 2019, Kumbhare et al. 2023). For example, Vich Vila et al. (2020) have shown that the human gut microbiota is involved in drug metabolism processes, and compounds present in these drugs can promote the growth of gut bacteria. Conversely, microbes can develop resistance to drugs, negatively impacting human health (Uddin et al. 2021, Li et al. 2023). Therefore, predicting drug–microbe associations is beneficial for understanding drug mechanisms of action and drug–microbe interaction mechanisms.

Recently, the computational methods were proposed for predicting the miRNA–disease associations (Li et al. 2021), drug-target interactions (Li et al. 2017, Song et al. 2024). Computational prediction methods also have proven to be effective in predicting candidate microbes associated with drugs and screening reliable candidates for subsequent biological experiments. Researchers have proposed several approaches for predicting drug–microbe associations. Methods such as GCNMDA (Long et al. 2020a), EGATMDA (Long et al. 2020b), and SCSMDA (Tian et al. 2023) use simple graph convolutional networks (Kipf and Welling 2017) to aggregate neighbor information without differentiating between neighbor types. GSAMDA (Tan et al. 2022) and MDASAE (Fan et al. 2023) use sparse autoencoders, while Graph2MDA (Deng et al. 2022) employs a variational graph autoencoder (Kipf and Welling 2016) to learn node representation in the drug–microbe heterogeneous graph. However, these methods do not capture the complex relationships among multiple drugs and microbes. GACNNMDA (Ma et al. 2023) uses a convolutional neural network to learn features of drug–microbe node pairs but ignores the spatial dependencies between attributes of node pairs.

We present DHDMP, a method driven by node category-sensitive hypergraph convolutional networks, graph convolutional network with cross-graph feature propagation (GCNFP), and spatial cross-attention (SCA) for predicting drug-related microbes. Our method fuses the characteristics of multiple drugs and microbes that are related, and integrates the heterogeneity of multiple types of nodes and the long-distance spatial correlations of drug–microbe node pairs. The contributions of this article are summarized as follows:

A microbe might affect the functions of multiple drugs, and multiple microbes are usually involved into the process that a drug exert its function. Thus, there are complex relationships among multiple drugs and multiple microbes. The hyperedges are designed to denote these relationships from multiple perspectives, and their embeddings are learnable. The learnable strategy is helpful for forming the hypergraph topology with dynamic changes.
Most previous methods based on hypergraph learning only integrated the hypergraph topology and the node attributes and they ignored the heterogeneity of feature distributions of multiple categories of nodes. We designed their category features for each category of nodes to discriminate the heterogeneity of drug and microbe nodes. A node category-sensitive hypergraph convolutional networks (NHCN) is presented to encode the dynamic hypergraph topology and the node heterogeneity.
A homogeneous graph composed of drug nodes, a homogeneous graph composed of microbe node, and a heterogeneous graph composed of both drug and microbe nodes, are established to reflect the similarities among the nodes from the same category and the associations among the nodes from different categories. In these graphs, a target node has its homogeneous neighbors and heterogeneous neighbors which reveal its characteristics from diverse perspectives. A feature propagation strategy is also proposed to supplement the node features from homogeneous graph to the heterogeneous graph learning. A GCNFP was designed to fuse the attributes of these neighbors and their topologies.
The attribute embedding of a pair of drug and microbe node was constructed and there are long-distance spatial correlations among some attributes. We proposed a SCA to separate the pairwise attributes into multiple patches and integrate the long-distance correlations among these patches. The experimental results showed that our method achieved higher prediction performance than the compared advanced methods.

2 Materials and methods

We propose a new prediction model, DHDMP (as shown in Fig. 1), designed to predict the microbes associated with specific drugs. DHDMP consists of three components that respectively learn the complex relationships among multiple drugs and microbes, information from homogeneous and heterogeneous neighbors, and long-distance spatial correlations between pairs of drug and microbe nodes. We construct a hypergraph to represent the associations among multiple drugs and microbes and design a NHCN to capture the complex relationships among drugs and microbes. To integrate features from both homogeneous and heterogeneous neighbors of the target drug or microbe node, we propose a GCNFP. We introduce a SCA mechanism to capture long-distance spatial correlations between the features of drug and microbe node pairs. These components work together to enhance the predictive power of DHDMP in identifying drug–microbe associations.

Figure 1. — Overall framework of DHDMP model. (a) Construct the drug and microbe homogeneous graphs and drug–microbe heterogeneous graph. (b) Learn complex relationships among multiple drugs and microbes by NHCN. (c) Learn homogeneous and heterogeneous neighbor representations based on GCNFP. (d) Encode long-distance spatial correlations by SCA. (e) Integrate multiple representations and estimate the association score

2.1 Dataset

The associations between drugs and microbes, drug chemical structure similarity, Gaussian kernel similarity, and microbe sequence similarity were obtained from previous research (Deng et al. 2022). The dataset contains associations on 2470 pairs of drugs and microbes, covering 1373 drugs and 173 microbes. The data on drug–microbe associations were originally obtained from the Microbe–Drug Associations Database (MDAD) (Sun et al. 2018). Furthermore, 365 984 drug interactions were obtained from the DrugBank database (Wishart et al. 2018), while the raw microbe gene sequence data were retrieved from the National Center for Biotechnology Information (NCBI) database.

2.2 Construction of homogeneous and heterogeneous graph for drugs and microbes

We construct a drug homogeneous graph $G^{drg} = (V^{drg}, E^{drg})$ and a microbe homogeneous graph $G^{mic} = (V^{mic}, E^{mic})$ . Here, $V^{drg}$ and $V^{mic}$ represent the sets of drug and microbe nodes, respectively. An edge $e_{i, j}^{drg} \in E^{drg}$ connects drug nodes $v_{i}^{drg}, v_{j}^{drg} \in V^{drg}$ , while microbe nodes $v_{i}^{mic}, v_{j}^{mic} \in V^{mic}$ are connected by $e_{i, j}^{mic} \in E^{mic}$ . Additionally, we construct a drug–microbe heterogeneous graph $G^{drg-mic} = (V^{drg} \cup V^{mic}, E^{drg-mic})$ , where an edge $e_{i, j}^{drg-mic} \in E^{drg-mic}$ connects a drug node $v_{i}^{drg}$ and a microbe node $v_{j}^{mic}$ .

The drug similarity matrix $D^{sim}$ and microbial similarity matrix $M^{sim}$ are defined as follows,

D^{sim} = (\frac{d_{i, j}^{struct} + d_{i, j}^{gauss}}{2}) \in R^{N_{d} \times N_{d}}, if v_{i}, v_{j} \in V^{drg},

(1)

M^{sim} = (m_{i, j}^{sim}) \in R^{N_{m} \times N_{m}}, if v_{i}, v_{j} \in V^{mic},

(2)

where $N_{d}$ and $N_{m}$ represent the number of drugs and microbes, respectively. For a pair of drug nodes $v_{i}^{drg}$ and $v_{j}^{drg}$ , their structural similarity $d_{i, j}^{struct} \in [0, 1]$ is calculated by the SIMCOMP2 (Hattori et al. 2010) measurement based on their chemical structure graphs. The Gaussian kernel similarity between the i-th drug and the j-th one $d_{i, j}^{gauss} \in [0, 1]$ is computed based on the two groups of microbes interacted with them,

d_{i, j}^{gauss} = exp (- μ | | A (v_{i}^{drg}) - A (v_{j}^{drg}) | |^{2}),

(3)

where $A (v_{i}^{drg})$ is the ith row of the drug–microbe association matrix. $μ$ represents the normalized kernel bandwidth,

μ = μ^{'} / (\frac{1}{N_{d}} \sum_{i = 1}^{N_{d}} | | A (v_{i}^{drg}) | |^{2}),

(4)

where $μ^{'}$ is the original bandwidth and it was set to 1. The final drug similarity is obtained by averaging the structural similarity and the Gaussian kernel similarity. Principal component analysis (PCA) (Weber et al. 2016) is used to calculate the similarity $m_{i, j}^{sim} \in [0, 1]$ between microbe nodes $v_{i}^{mic}$ and $v_{j}^{mic}$ based on their gene sequences. These calculation methods were proposed by Long et al. (2020a). Since each row of the drug similarity matrix and the microbe similarity matrix contains the associations of the respective drug or microbe with all drugs and microbes, we can regard these matrices as the adjacency matrices of the drug homogeneous graph and the microbe homogeneous graph, respectively, and rename them as $R^{drg}$ and $R^{mic}$ .

The adjacency matrix of the drug–microbe heterogeneous graph is defined as,

\begin{matrix} \begin{matrix} R^{drg-mic} = (r_{i, j}^{drg-mic}) \in R^{(N_{d} + N_{m}) \times (N_{d} + N_{m})} \\ if v_{i} \in V^{drg}, v_{j} \in V^{mic} \end{matrix} \end{matrix},

(5)

where $r_{i, j}^{drg-mic} = 1$ denotes the known association between the drug node $v_{i}^{drg}$ and the microbe node $v_{j}^{mic}$ , and $r_{i, j}^{drg-mic} = 0$ otherwise. The node attribute embedding matrix $Z \in R^{(N_{d} + N_{m}) \times (N_{d} + N_{m})}$ includes the drug similarity matrix $D^{sim}$ , the microbe similarity matrix $M^{sim}$ , and the drug–microbe association matrix $R^{drg-mic}$ ,

\begin{matrix} Z = [\begin{matrix} D^{sim} & R^{drg-mic} \\ R^{{drg-mic}^{T}} & M^{sim} \end{matrix}] \end{matrix},

(6)

where $R^{{drg-mic}^{T}}$ represents the transpose matrix of $R^{drg-mic}$ . If a pair of microbes and drugs are associated or similar with more common drugs and microbes, they are more likely to be associated. The $i$ th row of matrix $Z$ , denoted as $Z_{i}$ , contains the similarities and associations between node $v_{i} \in V^{drg} \cup V^{mic}$ and all drugs and microbes. Therefore, $Z_{i}$ can be considered the attribute of the node.

2.3 Multiple drugs and microbes association learning based on NHCN

A microbe often participates in multiple drug function processes, and multiple microbes may simultaneously participate in the same drug function process, leading to complex relationships between multiple drugs and microbes. To capture such relationships, we establish multiple hyperedges and form a drug–microbe hypergraph. We construct a hyperedge embedding matrix $E \in R^{(N_{d} + N_{m}) \times N_{e}}$ , where $E$ is randomly initialized and trainable during the training process, and $N_{e}$ is the number of hyperedges. $e_{i, j}^{h} \in E$ indicates the tendency of the $i$ -th node to belong to the $j$ -th hyperedge. Combining the node attribute embedding matrix $Z$ , we form a hypergraph with dynamically changing topology, where its adjacency matrix is denoted as $H$ ,

H = Z \cdot E .

(7)

The hypergraph topology is continuously updated during the training process, which is helpful for better reflecting the complex relationships among multiple drugs and microbes.

In the hypergraph, there are two types of nodes: drugs and microbes, and their feature distribution is heterogeneous. We design category features to represent node types. The category feature matrix for all drug nodes is denoted as $Y_{d} \in R^{N_{d} \times dim}$ , and for microbe nodes, it is denoted as $Y_{m} \in R^{N_{m} \times dim}$ , where $dim$ represents the dimension of the node category features. $Y_{d}, Y_{m}$ are randomly initialized, and the category features of nodes with the same type have the same initial vector. $Y_{d}, Y_{m}$ are stacked vertically to form the category feature matrix $Y \in R^{(N_{d} + N_{m}) \times dim}$ for all drug and microbe nodes. The node attribute matrix embedded with category features is referred to as $X \in R^{(N_{d} + N_{m}) \times (N_{d} + N_{m} + dim)}$ . To encode the complex relationships among multiple drugs and microbes, we design NHCN (node category-sensitive hypergraph convolution network). After $l$ layer encoding, we obtain the feature map ${\tilde{X}}^{l}$ ,

\begin{matrix} \begin{matrix} {\tilde{X}}^{l} = LeakyRe L U (D_{B}^{- \frac{1}{2}} \cdot H \cdot D_{E}^{- 1} \cdot H^{T} \cdot D_{B}^{- \frac{1}{2}} \cdot {\tilde{X}}^{l - 1}) \\ l = 1, 2, \dots, L_{hnn} \end{matrix} \end{matrix},

(8)

where $LeakyRe L U (\cdot)$ is the activation function, $L_{hnn}$ is the number of convolution layers of the hypergraph. $D_{B}$ and $D_{E}$ are the degree matrices of nodes and hyperedges, corresponding respectively to the sum of rows and columns in the hypergraph adjacency matrix. ${\tilde{X}}^{0} = X$ is an attribute matrix embedded with category features. We perform hypergraph convolution on all nodes and then update the hyperedge embedding matrix. The updated hyperedge embedding matrix is multiplied with the node attributes to form a new hypergraph topology. We perform hypergraph convolution on the new hypergraph topology and repeat the iteration process (as shown in Fig. 2). After $L_{hnn}$ rounds of hypergraph convolution, we obtain multi-node association representations ${\tilde{X}}_{drg, i}^{L_{hnn}}$ and ${\tilde{X}}_{mic, j}^{L_{hnn}}$ of $v_{i}^{drg}$ and $v_{j}^{mic}$ , respectively. ${\tilde{X}}_{drg, i}^{L_{hnn}}$ and ${\tilde{X}}_{mic, j}^{L_{hnn}}$ are stacked to form the multi-node association representation for drug and microbe node pairs,

\begin{matrix} X_{i, j}^{mna} = [\begin{matrix} {\tilde{X}}_{drg, i}^{L_{hnn}} \\ {\tilde{X}}_{mic, j}^{L_{hnn}} \end{matrix}] \end{matrix} .

(9)

Figure 2. — Illustration of forming and updating the dynamic hypergraph topology

2.4 Homogeneous and heterogeneous neighbor representation learning based on GCNFP

Neighboring nodes of a target node can reflect the characteristics of that node, we aggregate the features of neighboring nodes to form new features for the target node. A target node has both neighbors of the same type (homogeneous neighbors) and neighbors of different type (heterogeneous neighbors), each offering unique perspectives on the characteristics of the target node. The homogeneous neighbors of the target drug (or microbe) are nodes that have similarities, which from drug (or microbe) homogeneous graph. The heterogeneous neighbors associated with the target drug (or microbe) come from the drug–microbe heterogeneous graph, which reflects relationships between nodes of different types. Thus, the homogeneous neighbors reflect the characteristics of the target node from the perspective of similarity, while the heterogeneous neighbors reveal the characteristics of the target node from the perspective of association. To capture the distinct features reflected by homogeneous and heterogeneous neighbors of the target node, we design the GCNFP. First, we normalize the adjacency matrix $R^{drg-mic}$ of the drug–microbe heterogeneous graph to obtain ${\tilde{R}}^{drg-mic}$ (Kipf and Welling 2017),

{\tilde{R}}^{drg-mic} = D_{drg-mic}^{- \frac{1}{2}} \cdot R^{drg-mic} \cdot D_{drg-mic}^{- \frac{1}{2}},

(10)

here, $D_{drg-mic}^{- \frac{1}{2}} \cdot R^{drg-mic} \cdot D_{drg-mic}^{- \frac{1}{2}}$ represents the Laplacian normalization of $R^{drg-mic}$ , where $D_{drg-mic}$ is the diagonal matrix and ${(D_{drg-mic})}_{i, i}$ represents the number of non-zero entries in the row vector of the adjacency matrix $R^{drg-mic}$ . Similarly, we compute normalized adjacency matrices ${\tilde{R}}^{mic}$ and ${\tilde{R}}^{drg}$ for the microbe and drug homogeneous graphs, respectively. We define the drug–microbe homogeneous adjacency matrix $A_{homo} \in R^{(N_{d} + N_{m}) \times (N_{d} + N_{m})}$ and the drug–microbe heterogeneous adjacency matrix $A_{hete} \in R^{(N_{d} + N_{m}) \times (N_{d} + N_{m})}$ as follows,

\begin{matrix} A_{homo} = [\begin{matrix} {\tilde{R}}^{drg} & O \\ O & {\tilde{R}}^{mic} \end{matrix}] A_{hete} = [\begin{matrix} O & {\tilde{R}}^{drg-mic} \\ {\tilde{R}}^{{drg-mic}^{T}} & O \end{matrix}] \end{matrix},

(11)

where $O$ is a matrix with all elements being 0. After the $l$ -th graph convolutional encoding layer, we can obtain the $l$ -th order homogeneous neighbor representations $B_{homo}^{l}$ for all drugs and microbes,

B_{homo}^{l} = A_{homo} \cdot B_{homo}^{l - 1} \cdot W_{homo}^{l}, l = 1, 2, \dots, L_{gcn},

(12)

where $B_{homo}^{0} = Z$ represents the initial node attribute matrix, $W_{homo}$ denotes the parameter matrix, and $L_{gcn}$ is the total number of layers in the GCNFP. Following $L_{gcn}$ graph convolutional iterations, we can derive first to $L_{gcn}$ -th order homogeneous neighbor representations for both drugs and microbes. Given that these representations from homogeneous neighbors contain similarity information of drugs and microbes, while those from heterogeneous neighbors only encode association information between drugs and microbes, we devised a node feature propagation strategy (NFPS). This strategy leverages the information from homogeneous neighbors as auxiliary data to enhance the learning process of features from heterogeneous neighbors. Consequently, the $l$ -th layer of heterogeneous neighbor representations $B_{hete}^{l}$ is expressed as,

\begin{matrix} B_{hete}^{l} = A_{hete} \cdot (B_{hete}^{l - 1} | | α_{G} \cdot B_{homo}^{l}) \cdot W_{hete}^{l}, \\ l = 1, 2, \dots, L_{gcn}, \end{matrix}

(13)

where $| |$ denotes the concatenation operation, and $W_{hete}$ signifies the parameter matrix of the GCNFP. As each drug (microbe) node has its own unique information, we set a learnable attention coefficient for each node to capture the unique information. The attention coefficient vector of all the nodes is $α_{g} \in R^{(N_{d} + N_{m}) \times 1}$ and it is randomly initialized. The operation $diag ()$ may convert the vector $α_{g}$ into a diagonal matrix $α_{G} = diag (α_{g})$ . $B_{hete}^{0} = Z$ denotes the node attribute matrix of $0$ th layer, containing detailed information about each node. To construct a comprehensive homogeneous neighbor representation for $v_{j}^{drg}$ and $v_{j}^{mic}$ , we concatenate the homogeneous neighbor representations from the $0$ -th to the $L_{gcn}$ -th order. This yields multi-order homogeneous neighbor representations denoted as $B_{homo, i}^{drg} (B_{homo, j}^{mic})$ ,

\begin{matrix} \begin{matrix} B_{homo, i}^{drg} = [B_{homo, i}^{0}, \dots, B_{homo, i}^{L_{gcn}}] \\ B_{homo, j}^{mic} = [B_{homo, j}^{0}, \dots, B_{homo, j}^{L_{gcn}}] . \end{matrix} \end{matrix}

(14)

We utilize a fully connected layer to transform the high-dimensional homogeneous neighbor representations $B_{homo, i}^{drg} (B_{homo, j}^{mic})$ , which contain multi-order neighbor information, into a low-dimensional feature space. This transformation yields ${\tilde{B}}_{homo, i}^{drg} ({\tilde{B}}_{homo, j}^{mic})$ as follows,

\begin{matrix} \begin{matrix} {\tilde{B}}_{homo, i}^{drg} = Tan h (W_{d} \cdot B_{homo, i}^{drg} + b_{d}) \\ {\tilde{B}}_{homo, j}^{mic} = Tan h (W_{m} \cdot B_{homo, j}^{mic} + b_{m}), \end{matrix} \end{matrix}

(15)

where $W_{d}$ and $W_{m}$ represent learnable parameter matrices, while $b_{d}$ and $b_{m}$ are bias vectors. Similar to formulas (14) and (15), we derive heterogeneous neighbor representations ${\tilde{B}}_{hete, i}^{drg}$ and ${\tilde{B}}_{hete, j}^{mic}$ for nodes $v_{i}^{drg}$ and $v_{j}^{mic}$ . We combine these representations with the homogeneous neighbor representations ${\tilde{B}}_{homo, i}^{drg}$ and ${\tilde{B}}_{homo, j}^{mic}$ to obtain the homogeneous and heterogeneous representations of nodes $v_{i}^{drg}$ and $v_{j}^{mic}$ , denoted as $B_{i, j}^{hhn}$ ,

\begin{matrix} B_{i, j}^{hhn} = [\begin{matrix} {\tilde{B}}_{homo, i}^{drg} & {\tilde{B}}_{hete, i}^{drg} \\ {\tilde{B}}_{homo, j}^{mic} & {\tilde{B}}_{hete, j}^{mic} \end{matrix}] \end{matrix} .

(16)

2.5 Long-distance spatial representation learning based on SCA

If $v_{i}^{drg}$ and $v_{j}^{mic}$ are similar or associated with more the same drugs and microbes, their attribute vectors exhibit more similarity in feature distribution. We reshape the attribute vectors $Z_{drg, i} (Z_{mic, j}) \in R^{1 \times (N_{d} + N_{m})}$ of $v_{i}^{drg} (v_{j}^{mic})$ into $Z_{drg, i}^{Pat} (Z_{mic, j}^{Pat}) \in R^{N \times P}$ , where $P$ represents patch size and $N = (N_{d} + N_{m}) / P$ . If $N_{d} + N_{m}$ is not divisible by $P$ , we pad the end of $Z_{drg, i}$ and $Z_{mic, j}$ with zeros until $N_{d} + N_{m}$ is divisible by $P$ . The reshaped matrices $Z_{drg, i}^{Pat}$ and $Z_{mic, j}^{Pat}$ are divided into $N$ patches, where patches with similar features might be widely dispersed in space. The partitioning of attribute patches is helpful for designing an SCA mechanism to capture the spatial correlations among the patches (as shown in Fig. 3).

Figure 3. — Encoding the long-distance correlations among multiple attribute patches of a pair of drug and microbe nodes

We introduce a multi-head attention mechanism (Dosovitskiy et al. 2020) to calculate attention from different perspectives to reduce bias in the learning process. Taking the $m$ -th head also the last head attention as an example, we employ distinct linear transformations to produce drug query matrices $Q_{drg, i}^{m} \in R^{N \times N_{dim}}$ , key matrices $K_{drg, i}^{m} \in R^{N \times N_{dim}}$ , and value matrices $V_{drg, i}^{m} \in R^{N \times N_{dim}}$ for each patch. These transformations are defined as follows,

\begin{matrix} \begin{matrix} Q_{drg, i}^{m} = Z_{drg, i}^{Pat} \cdot W_{Q}^{m} \\ K_{drg, i}^{m} = Z_{drg, i}^{Pat} \cdot W_{K}^{m} \\ V_{drg, i}^{m} = Z_{drg, i}^{Pat} \cdot W_{V}^{m}, \end{matrix} \end{matrix}

(17)

where, $W_{Q}^{m}, W_{K}^{m}, W_{V}^{m} \in R^{P \times N_{dim}}$ are weight matrices. For all the patches of a pair of drug and microbe nodes, $W_{Q}^{m}$ , $W_{K}^{m}$ , $W_{V}^{m}$ are the shared weight matrices. It is beneficial for jointly capturing the spatial correlations among the features of the drug–microbe pair. For the $N$ patches of $v_{i}^{drg}$ and $v_{j}^{mic}$ , we calculate the spatial attention score matrix $Q_{drg, i}^{m} K_{drg, i}^{m^{T}}$ , $Q_{mic, j}^{m} K_{mic, j}^{m^{T}}$ , $Q_{drg, i}^{m} K_{mic, j}^{m^{T}}$ , $Q_{mic, j}^{m} K_{drg, i}^{m^{T}}$ both within and between them. Then, we normalize $Q_{drg, i}^{m} K_{drg, i}^{m^{T}} (Q_{drg, i}^{m} K_{mic, j}^{m^{T}})$ and multiply it with $V_{drg, i}^{m} (V_{drg, j}^{m})$ to obtain the spatial correlations representation $F_{drg, i}^{self, m} (F_{drg, i}^{cros, m})$ for drugs from the perspective of self-attention (cross-attention),

\begin{matrix} \begin{matrix} F_{drg, i}^{self, m} = softmax (\frac{Q_{drg, i}^{m} K_{drg, i}^{m^{T}}}{\sqrt{N_{dim}}}) V_{drg, i}^{m} \\ F_{drg, i}^{cros, m} = softmax (\frac{Q_{drg, i}^{m} K_{mic, j}^{m^{T}}}{\sqrt{N_{dim}}}) V_{mic, j}^{m} . \end{matrix} \end{matrix}

(18)

Similarly, we compute the spatial correlations representations $F_{mic, j}^{self, m}$ and $F_{mic, j}^{cros, m}$ for microbes under self-attention and cross-attention. The self-attention focuses on capturing the spatial correlations among the attributes of a drug (microbe) itself. The cross-attention concentrates on reveal the spatial correlations among the attributes of drug node and those of the microbe node. We combine the spatial correlations representations from these multiple perspectives to form $F_{drg, i}^{m}$ and $F_{mic, j}^{m}$ as follows,

\begin{matrix} \begin{matrix} F_{drg, i}^{m} = F_{drg, i}^{self, m} \oplus F_{drg, i}^{cros, m} \\ F_{mic, j}^{m} = F_{mic, j}^{self, m} \oplus F_{mic, j}^{cros, m}, \end{matrix} \end{matrix}

(19)

where $\oplus$ represents element-wise addition. The final output is obtained by concatenating the results of all attention heads,

\begin{matrix} \begin{matrix} F_{drg, i} = [F_{drg, i}^{1}, \dots, F_{drg, i}^{m}] \\ F_{mic, j} = [F_{mic, j}^{1}, \dots, F_{mic, j}^{m}] . \end{matrix} \end{matrix}

(20)

Subsequently, $F_{drg, i}$ and $F_{mic, j}$ are flattened as vectors, stacked vertically, and concatenated to represent the long-distance spatial correlations of node pairs $F_{i, j}^{lds}$ ,

\begin{matrix} F_{i, j}^{lds} = [\begin{matrix} F_{drg, i} \\ F_{mic, j} \end{matrix}] \end{matrix} .

(21)

2.6 Multiple representations fusion and optimization

Multiple representations, including the multi-node association representation $X_{i, j}^{mna}$ , homogeneous and heterogeneous neighbor representation $B_{i, j}^{hhn}$ , and long-distance spatial correlations representation $F_{i, j}^{lds}$ of $v_{i}^{drg}$ and $v_{j}^{mic}$ are concatenated to obtain the final node pair representation $Z_{i, j}^{fin}$ ,

Z_{i, j}^{fin} = [\begin{matrix} X_{i, j}^{mna} & B_{i, j}^{hhn} & F_{i, j}^{lds} \end{matrix}] .

(22)

We employ a convolution-pooling layer (as shown in Fig. 1e) to further integrate $Z_{i, j}^{fin}$ . After the l-th convolution-pooling layer, the node pair feature is denoted as $Z_{i, j}^{fin, l}$ ,

Z_{i, j}^{fin, l} = max (σ (W^{l} * Z_{i, j}^{fin, l - 1} + b^{l})), l = 1, 2,

(23)

where $Z_{i, j}^{fin, 0} = Z_{i, j}^{fin}$ and $σ (\cdot)$ is activation function. $max (\cdot)$ denotes the max-pooling operation and $*$ represents the convolution operation. $W^{l}$ and $b^{l}$ represent the weight matrix and bias vector of the l-th convolution-pooling layer, respectively. We name the feature obtained from the last convolution-pooling layer as ${\tilde{Z}}_{i j}^{fin}$ , which enables the prediction of scores p associated with $v_{i}^{drg}$ and $v_{j}^{mic}$ through the fully connected layer,

p = softmax (W_{agg} \cdot {\tilde{Z}}_{i, j}^{fin} + b_{agg}),

(24)

where $W_{agg}$ and $b_{agg}$ are the weight matrix and the bias vector, respectively. $p [1] (p [0])$ represents the probability that drug node $v_{i}^{drg}$ has (does not have) an association with microbe node $v_{j}^{mic}$ . The loss of our model is calculated based on the cross-entropy loss function,

loss = - \sum_{i = 1}^{N_{bat}} y [i] log (p [0]) + (1 - y [i]) log (1 - p [0]),

(25)

where $N_{bat}$ is the number of nodes per batch of training, and y represents the true association of a drug and a microbe.

3 Experimental evaluations and discussions

3.1 Parameter settings

We trained DHDMP using the Nvidia GeForce RTX 4060, utilizing the PyTorch framework and optimizing it with the Adam algorithm. The training parameters consisted of 80 epochs, a batch size of 32, and a learning rate of 0.0005. The numbers of the encoding layers in NHCN and GCNFP are selected from ${1, 2, 3}$ . We conducted the experiments on all the combinations of layer number of NHCN and that of GCNFP (Supplementary Table S1). When NHCN and GCNFP contain two encoding layers, the model achieves the best prediction performance. Additionally, we established 32 hyperedges to capture complex relationships (Supplementary Table S2). Within the SCA mechanism, the patch size and feature dimension ( $N_{dim}$ ) were set to $1 \times 50$ , with eight attention heads utilized. The fusion of multiple representations involved two convolution layers with kernel sizes of $2 \times 10$ and $1 \times 10$ , respectively, while the pooling layers utilized a window size of $1 \times 10$ . We also studied the impact of datasets with different positive to negative ratio on DHDMP (Supplementary Table S3) and time complexity analysis are provided in Supplementary File S1.

3.2 Evaluation metrics

Five-fold cross-validation is used to evaluate the prediction performance of DHDMP and other comparative methods. All known drug–microbe associations are randomly divided into five equal parts, called positive samples. The number of unknown drug–microbe associations is much larger than the positive examples and these unknown associations are considered negative samples. During the training and testing process of each fold, four parts of positive examples and an equal number of randomly selected negative examples are used as the training set. The remaining positive and negative samples make up the test set.

Evaluation of the models is based on the area under the receiver operating characteristic curve (AUC) (Hajian-Tilaki 2013) and the area under the precision–recall curve (AUPR) (Saito and Rehmsmeier 2015). AUC and AUPR are computed for each fold, and the average of these 5-fold results yields the final AUC and AUPR scores. In addition, the top k recall rate of a microbe-related drug is also calculated, given the importance of identifying microbes associated with drugs for experimental purposes.

3.3 Ablation experiments

In our ablation experiments, we systematically remove components of our model—NHCN, GCNFP, and SCA—to assess their individual contributions to prediction performance (Table 1). When NHCN is excluded, we observe a decline in prediction performance by 2.7% and 13.2% in terms of AUC and AUPR compared to our final model. This highlights the crucial role of capturing the intricate relationships among multiple drugs and microbes in enhancing the accuracy of drug–microbe association prediction. Similarly, without the learning of homogeneous and heterogeneous neighbor representations by GCNFP leads to a decrease in AUC and AUPR by 1.5% and 13.0%, respectively, compared to the complete model. This confirms the significance of incorporating information from both homogeneous and heterogeneous neighbors, which contributes significantly to predicting drug–microbe associations. Furthermore, when SCA is not utilized, DHDMP experiences a reduction of 1.4% in AUC and 6.4% in AUPR. This demonstrates the effectiveness of SCA in capturing the long-distance spatial relationships between node pairs. After the category features were removed from the prediction model, its AUC and AUPR decreased by 1.7% and 5.5%, respectively. It indicated that the category features may reveal the heterogeneity of multiple kinds of nodes. After the node feature propagation was eliminated from the model, its AUC and AUPR decreased by 1.4% and 4.2%. It demonstrates that propagating node information from homogeneous graphs to heterogeneous graph facilitates heterogeneous neighbor feature learning. The results of our ablation experiments indicate that NHCN makes the most contribution to predicting drug–microbe associations. This is primarily attributed to the ability of the hypergraph to effectively represent the complex relationships among multiple drugs and microbes.

Table 1.

Results of ablation studies of DHDMP.

NHCN	GCNFP	SCA	Category feature	NFPS	Average AUC	Average AUPR
✗	✓	✓	✗	✓	0.932	0.691
✓	✗	✓	✓	✗	0.944	0.693
✓	✓	✗	✓	✓	0.945	0.759
✓	✓	✓	✗	✓	0.942	0.768
✓	✓	✓	✓	✗	0.945	0.781
✓	✓	✓	✓	✓	0.959	0.823

Open in a new tab

The bold values are the highest ones.

3.4 Comparison with other methods

We assess our approach against six leading methods in the field: GCNMDA (Long et al. 2020a), EGATMDA (Long et al. 2020b), GSAMDA (Tan et al. 2022), GACNNMDA (Ma et al. 2023), SCSMDA (Tian et al. 2023), and NGMDA (Xuan et al. 2024). Each of these methods is trained using the optimal parameters mentioned in their original papers. To maintain consistency, we train our model and the comparative methods on the same dataset and use the same partitioning for training and testing. Below, you will find brief descriptions of these six comparative methods.

GCNMDA: This method constructs a network comprising drugs and microbes, leveraging their similarities and associations. By employing GCN and conditional random fields, the model predicts drugs relevant to microbes.

EGATMDA: This approach forms a heterogeneous network covering microbes, diseases, and drugs. It utilizes GCN with node-level attention and graph neural networks with graph-level attention to infer associations between microbes and drugs.

GSAMDA: In this model, drug and microbe similarities are computed using Gaussian interaction profiles (GIP) and Hamming interaction profiles (HIP). The features of drug and microbe nodes are learned through graph attention networks (GAT) and sparse autoencoders.

GACNNMDA: Similarity between drugs and microbes is determined based on GIP and HIP, and association scores between them are computed using convolutional neural networks.

SCSMDA: This method establishes an integrated network of drug and microbe similarities, constructing multiple meta-path-induced networks. It employs a framework based on graph convolution and graph contrastive learning to learn features of drug and microbe nodes.

NGMDA: It builds a drug–microbe heterogeneous graph and enhances node features through fully connected autoencoders. Additionally, a graph transformer is designed to infer associations between drugs and microbes.

Based on the results presented in Table 2, DHDMP achieved the highest average AUC at 95.9%, surpassing GCNMDA by 5.6%, EGATMDA by 1.9%, GACNNMDA by 11.6%, GSAMDA by 5.7%, SCSMDA by 4.3%, and NGMDA by 1.5%. Furthermore, DHDMP’s average AUPR stands at 82.3%, significantly outperforming GCNMDA, EGATMDA, GSAMDA, GACNNMDA, SCSMDA, and NGMDA by 50.8%, 51.6%, 57.6%, 62.7%, 48.3%, and 9.5%, respectively. Our method achieved the highest AUC and AUPR, with NGMDA demonstrating the second-best predictive performance. The superior performance of our method and NGMDA’s AUPR can be attributed largely to the incorporation and utilization of node-pair-level attributes. However, NGMDA falls short compared to our method due to its oversight of the intricate relationships between multiple drugs and microbes. GCNMDA, EGATMDA, and SCSMDA exhibit better performance compared to GSAMDA and GACNNMDA, possible reason is GSAMDA and GACNNMDA only learn node representations based on GATs or GCNs without considering the types of nodes and edges. GCNMDA, EGATMDA, and SCSMDA effectively learn the semantics of connections between drug and microbe meta-path nodes, thus yielding good performance.

Table 2.

Average AUCs and AUPRs of our method and the compared methods over all the 1373 drugs.

Methods	AUC	AUPR
GCNMDA	0.903	0.315
EGATMDA	0.940	0.307
GSAMDA	0.902	0.247
GACNNMDA	0.843	0.196
SCSMDA	0.916	0.340
NGMDA	0.944	0.728
DHDMP	0.959	0.823

Open in a new tab

The bold values are the highest ones.

To determine whether our method’s AUC and AUPR are better than those of other methods, we conducted statistical tests. Using 5-fold cross-validation, we calculated the average AUCs and AUPRs for 1373 drugs for each method. Then, we performed Paired Wilcoxon tests to compare the AUCs and AUPRs of DHDMP with those of the other methods. Table 3 shows the statistical results, indicating a significant improvement (P-value <0.05).

Table 3.

Results of the paired Wilcoxon test by comparing DHDMP and any of the other methods.

P-value	GCNMDA	EGATMDA	GACNNMDA	GSAMDA	SCSMDA	NGMDA
AUC	8.76e-159	5.82e-94	1.54e-165	1.28e-151	6.39e-164	3.47e-20
AUPR	5.25e-188	1.33e-162	1.78e-196	3.31e-190	1.18e-212	8.12e-33

Open in a new tab

For each drug, we calculated the recall rates of candidate microbes at various top k values (Fig. 4). Our method consistently performs better than other methods for different k values. For example, when k = 3, DHDMP achieves a recall rate of 88.1%, surpassing GCNMDA by 43.4%, EGATMDA by 38.9%, GACNNMDA by 60.9%, GSAMDA by 61.4%, SCSMDA by 43.9%, and NGMDA by 11.5%. In the range of $k \in [6, 9, 12]$ , DHDMP maintains the highest recall rate, while NGMDA closely follows with recall rates of 81.3%, 83.4%, and 85.7%, respectively. EGATMDA emerges as the third-best performing method with recall rates of 67.8%, 74.9%, and 80.1%. GCNMDA’s performance lags slightly behind SCSMDA, achieving recall rates of 61.5%, 66.8%, and 70.8%. While GSAMDA’s recall rates are slightly lower at 55.7%, 63.7%, and 68.4%, they still outperform GACNNMDA by 13.2%, 13.8%, and 12.1%, respectively.

Figure 4. — The average recall rates of drugs at multiple top k cutoffs

3.5 Case studies

We conducted case studies on moxifloxacin, ciprofloxacin, and curcumin to demonstrate DHDMP’s ability to predict microbes that are relevant to these drugs. Moxifloxacin is commonly prescribed for upper and lower respiratory tract infections such as acute sinusitis, pneumonia, and skin and soft tissue infections. Ciprofloxacin is used to treat respiratory tract infections and sepsis, and as a preventive measure against infections in immunocompromised patients. Curcumin, a natural phenolic antioxidant, is known for its potential anti-cancer properties and its possible role in preventing Alzheimer’s disease. Our model was trained using all known associations and an equal number of randomly selected unknown microbe–drug associations. For each drug, the candidate microbes were ranked in descending order according to their association scores. The top 20 candidate microbes for each drug are regarded as its potential candidate microbes. We then analyzed the top 20 candidate microbes for moxifloxacin, ciprofloxacin, and curcumin. The detailed results can be found in Tables 4–6, respectively. The MDAD database provides clinically or experimentally supported associations between microbes and drugs, while the aBiofilm database compiles information on anti-biofilm agents relevant to over 140 microbes (Rajput et al. 2018). These two databases, along with relevant bioinformatics literature, were used to validate the predictions of drug–microbe associations. In the case of moxifloxacin, as listed in Table 4, MDAD contains four potential microbes, three of which are documented in aBiofilm, and evidence supporting 12 candidates can be found in the literature. Notably, studies (Dalhoff et al. 2020, Ontong et al. 2021) indicate that certain microbes such as Escherichia coli, Streptococcus pneumoniae, Staphylococcus aureus, and Klebsiella pneumoniae are inhibited by moxifloxacin, while Listeria monocytogenes and Pseudomonas aeruginosa exhibit resistance to it (Ince et al. 2003, Tahoun et al. 2017, Budzinskaya et al. 2019). Regarding ciprofloxacin, Table 5 reveals that nine of the potential microbes have recently been confirmed to have associations in the literature, 10 are recorded in MDAD, and seven can be found in aBiofilm. Similarly, the results presented in Table 6 for curcumin show that evidence from the aBiofilm and MDAD databases supports 18 of the potential microbes, with one confirmed in the literature.

Table 4.

The top 20 candidate microbes related to moxifloxacin.

Rank	Microbe name	Evidence	Rank	Microbe name	Evidence
1	Candida albicans	aBiofilm, MDAD	11	Salmonella enterica	PMID: 22151215
2	Haemophilus influenzae	MDAD	12	Vibrio campbellii	Unconfirmed
3	Stenotrophomonas maltophilia	aBiofilm, MDAD	13	Plasmodium falciparum	PMID: 17214980
4	Mycobacterium avium	aBiofilm, MDAD	14	Influenza B virus	Unconfirmed
5	Staphylococcus aureus	PMID: 12654680	15	Klebsiella pneumoniae	PMID: 33406110
6	Streptococcus mutans	PMID: 29160117	16	Dihydropteroate synthase type-1	Unconfirmed
7	Pseudomonas aeruginosa	PMID: 31691651	17	Streptococcus pneumoniae	PMID: 31542319
8	Escherichia coli	PMID: 31542319	18	Candida tropicalis	Unconfirmed
9	Listeria monocytogenes	PMID: 28739228	19	Human immunodeficiency virus 1	PMID: 18441333
10	Burkholderia pseudomallei	PMID: 24502667	20	Streptococcus sanguinis	PMID: 10629010

Open in a new tab

Table 5.

The top 20 candidate microbes related to ciprofloxacin.

Rank	Microbe name	Evidence	Rank	Microbe name	Evidence
1	Bacillus subtilis	MDAD	11	Streptococcus mutans	PMID: 30468214
2	Staphylococcus aureus	aBiofilm, MDAD	12	Candida albicans	PMID: 31471074
3	Escherichia coli	aBiofilm, MDAD	13	Staphylococcus epidermidis	PMID: 28481197
4	Mycobacterium tuberculosis	MDAD	14	Listeria monocytogenes	PMID: 28355096
5	Proteus vulgaris	aBiofilm, MDAD	15	Enterococcus faecalis	PMID: 27790716
6	Haemophilus influenzae	MDAD	16	Klebsiella pneumoniae	PMID: 27257956
7	Providencia stuartii	aBiofilm, MDAD	17	Vibrio campbellii	Unconfirmed
8	Morganella morganii	aBiofilm, MDAD	18	Salmonella enterica	PMID: 26933017
9	Pseudomonas aeruginosa	aBiofilm, MDAD	19	Streptococcus pneumoniae	PMID: 26100702
10	Stenotrophomonas maltophilia	aBiofilm, MDAD	20	Vibrio harveyi	PMID: 27247095

Open in a new tab

Table 6.

The top 20 candidate microbes related to curcumin.

Rank	Microbe name	Evidence	Rank	Microbe name	Evidence
1	Aeromonas hydrophila	aBiofilm, MDAD	11	Vibrio vulnificus	aBiofilm, MDAD
2	Citrobacter freundii	aBiofilm, MDAD	12	Raoultella ornithinolytica	aBiofilm, MDAD
3	Burkholderia cenocepacia	aBiofilm, MDAD	13	Staphylococcus aureus	PMID: 32708619
4	Candida albicans	aBiofilm, MDAD	14	Vibrio parahaemolyticus	aBiofilm, MDAD
5	Burkholderia multivorans	aBiofilm, MDAD	15	Pseudomonas japonica	aBiofilm, MDAD
6	Vibrio harveyi	aBiofilm, MDAD	16	Klebsiella variicola	aBiofilm, MDAD
7	Proteus mirabilis	aBiofilm, MDAD	17	Escherichia coli	aBiofilm, MDAD
8	Serratia marcescens	aBiofilm, MDAD	18	Streptococcus mutans	aBiofilm, MDAD
9	Enterobacter cancerogenus	aBiofilm, MDAD	19	Pseudomonas aeruginosa	aBiofilm, MDAD
10	Enterobacter ludwigi	aBiofilm, MDAD	20	Staphylococcus epidermidis	Unconfirmed

Open in a new tab

3.6 Prediction of novel microbe–drug associations

We trained our model using all known drug–microbe associations and an equal number of randomly selected unknown associations. Subsequently, we applied our model to predict candidate microbes for 1373 drugs. The top 20 potential candidate microbes for each drug, as predicted by our model, are listed in Supplementary File S2.

4 Conclusion

We proposed a novel method to integrate the attributes of neighbor nodes in the homogeneous and heterogeneous graphs, and encode the long-distance dependencies within the pairwise attributes for predicting the drug-related microbes. The established hyperedges are beneficial for integrating the complex relationships among multiple drugs and microbes from various perspectives. The hypergraph topology was able to be formed dynamically according to the learnable hyperedge embeddings. The designed NHCNs deeply integrated the complex relationships and the heterogeneity of multi-category nodes. A graph convolutional network with node feature propagation enhanced the feature learning for the drug and microbe nodes by fusing the features from homogeneous and heterogeneous neighbors and their topologies. The presented SCA captured the long-distance spatial correlations among multiple attribute patches of the drug–microbe node pair. The cross-validation experimental results showed DHDMP’s higher AUC and AUPR than the compared six methods. The recall rates for the top-ranked candidates indicated that DHDMP was able to retrieve more actual drug–microbe associations. The case studies on three drugs showed DHDMP’s ability in screening the potential candidate microbes associated with an interested drug.

Supplementary Material

btae562_Supplementary_Data

btae562_supplementary_data.zip^{(1.1MB, zip)}

Contributor Information

Ping Xuan, School of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, China; Department of Computer Science and Technology, Shantou University, Shantou 515063, China.

Zelong Xu, School of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, China.

Hui Cui, Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC 3083, Australia; Australian Centre for AI in Medical Innovation, La Trobe University, Melbourne 3083, Australia.

Jing Gu, School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China.

Cheng Liu, Department of Computer Science and Technology, Shantou University, Shantou 515063, China.

Tiangang Zhang, School of Mathematical Science, Heilongjiang University, Harbin 150080, China.

Peiliang Wu, School of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, China.

Supplementary data

Supplementary data are available at Bioinformatics online.

Conflict of interest

The authors declare that there is no conflict of insterest.

Funding

This work was supported by the Natural Science Foundation of China [62172143, 62372282, 62276028, U20A20167]; Natural Science Foundation of Guangdong Province [2024A1515010176]; Natural Science Foundation of Hebei Province [F202103079]; Natural Science Foundation of Heilongjiang Province [LH2023F044]; Innovation Capability Improvement Plan Project of Hebei Province [22567626H]; and STU Scientific Research Initiation Grant [NTF22032].

References

Aggarwal N, Kitano S, Puah GRY. et al. Microbiome and human health: current understanding, engineering, and enabling technologies. Chem Rev 2022;123:31–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
Budzinskaya M, Strakhovskaya M, Andreeva I. et al. Conjunctival microflora and its antibiotic sensitivity after serial intravitreal injections. Vestn Oftalmol 2019;135:135–40. [DOI] [PubMed] [Google Scholar]
Dalhoff A, Bowker K, MacGowan A.. Comparative evaluation of eight in vitro pharmacodynamic models of infection: activity of moxifloxacin against Escherichia coli and Streptococcus pneumoniae as an exemplary example. Int J Antimicrob Agents 2020;55:105809. [DOI] [PubMed] [Google Scholar]
De Lorenzi-Tognon M, Genton PL, Schrenzel PJ. et al. Synthèse du 8e symposium «feeding the microbiota»: prébiotiques et probiotiques. Rev Med Suisse 2023;19:1149–53. [DOI] [PubMed] [Google Scholar]
Deng L, Huang Y, Liu X. et al. Graph2MDA: a multi-modal variational graph embedding model for predicting microbe–drug associations. Bioinformatics 2022;38:1118–25. [DOI] [PubMed] [Google Scholar]
Dosovitskiy A, Beyer L, Kolesnikov A. et al. An image is worth 16x16 words: transformers for image recognition at scale. In: International Conference on Learning Representations, 2021. OpenReview.net.
Fan L, Wang L, Zhu X.. A novel microbe-drug association prediction model based on stacked autoencoder with multi-head attention mechanism. Sci Rep 2023;13:7396. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gill SR, Pop M, DeBoy RT. et al. Metagenomic analysis of the human distal gut microbiome. Science 2006;312:1355–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hajian-Tilaki K. Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Caspian J Intern Med 2013;4:627–35. [PMC free article] [PubMed] [Google Scholar]
Hattori M, Tanaka N, Kanehisa M. et al. SIMCOMP/SUBCOMP: chemical structure search servers for network analyses. Nucleic Acids Res 2010;38(Suppl 2):W652–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ince D, Zhang X, Hooper DC.. Activity of and resistance to moxifloxacin in Staphylococcus aureus. Antimicrob Agents Chemother 2003;47:1410–5. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kipf TN, Welling M. Semi-supervised classification with graph convolutional networks. In: International Conference on Learning Representations, 2017. OpenReview.net.
Kipf TN, Welling M. Variational graph auto-encoders. stat, 1050:21, 2016.
Kumbhare SV, Pedroso I, Ugalde JA. et al. Drug and gut microbe relationships: moving beyond antibiotics. Drug Discov Today 2023;28:103797. [DOI] [PubMed] [Google Scholar]
Li T, Wang Z, Guo J. et al. Bacterial resistance to antibacterial agents: mechanisms, control strategies, and implications for global health. Sci Total Environ 2023;860:160461. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li Z, Han P, You Z-H. et al. In silico prediction of drug-target interaction networks based on drug chemical structure and protein sequences. Sci Rep 2017;7:11174. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li Z, Li J, Nie R. et al. A graph auto-encoder model for miRNA-disease associations prediction. Brief Bioinform 2021;22:bbaa240. [DOI] [PubMed] [Google Scholar]
Long Y, Wu M, Kwoh CK. et al. Predicting human microbe–drug associations via graph convolutional network with conditional random field. Bioinformatics 2020a;36:4918–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
Long Y, Wu M, Liu Y. et al. Ensembling graph attention networks for human microbe–drug association prediction. Bioinformatics 2020b;36(Supplement_2):i779–i786. [DOI] [PubMed] [Google Scholar]
Ma Q, Tan Y, Wang L.. GACNNMDA: a computational model for predicting potential human microbe-drug associations based on graph attention network and CNN-based classifier. BMC Bioinformatics 2023;24:35. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ontong JC, Ozioma NF, Voravuthikunchai SP. et al. Synergistic antibacterial effects of colistin in combination with aminoglycoside, carbapenems, cephalosporins, fluoroquinolones, tetracyclines, fosfomycin, and piperacillin on multidrug resistant Klebsiella pneumoniae isolates. PLoS One 2021;16:e0244673. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rajput A, Thakur A, Sharma S. et al. aBiofilm: a resource of anti-biofilm agents and their potential implications in targeting antibiotic drug resistance. Nucleic Acids Res 2018;46:D894–D900. [DOI] [PMC free article] [PubMed] [Google Scholar]
Saito T, Rehmsmeier M.. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS One 2015;10:e0118432. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sommer F, Bäckhed F.. The gut microbiota—masters of host development and physiology. Nat Rev Microbiol 2013;11:227–38. [DOI] [PubMed] [Google Scholar]
Song W, Xu L, Han C. et al. Drug–target interaction predictions with multi-view similarity network fusion strategy and deep interactive attention mechanism. Bioinformatics 2024;40:btae346. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sun Y-Z, Zhang D-H, Cai S-B. et al. MDAD: a special resource for microbe-drug associations. Front Cell Infect Microbiol 2018;8:424. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tahoun AB, Abou Elez RM, Abdelfatah EN. et al. Listeria monocytogenes in raw milk, milking equipment and dairy workers: molecular characterization and antimicrobial resistance patterns. J Glob Antimicrob Resist 2017;10:264–70. [DOI] [PubMed] [Google Scholar]
Tan Y, Zou J, Kuang L. et al. GSAMDA: a computational model for predicting potential microbe–drug associations based on graph attention network and sparse autoencoder. BMC Bioinformatics 2022;23:492. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tian Z, Yu Y, Fang H. et al. Predicting microbe–drug associations with structure-enhanced contrastive learning and self-paced negative sampling strategy. Brief Bioinform 2023;24:bbac634. [DOI] [PubMed] [Google Scholar]
Uddin TM, Chakraborty AJ, Khusro A. et al. Antibiotic resistance in microbes: history, mechanisms, therapeutic strategies and future prospects. J Infect Public Health 2021;14:1750–66. [DOI] [PubMed] [Google Scholar]
VanEvery H, Franzosa EA, Nguyen LH. et al. Microbiome epidemiology and association studies in human health. Nat Rev Genet 2023;24:109–24. [DOI] [PubMed] [Google Scholar]
Ventura M, O'Flaherty S, Claesson MJ.. Genome-scale analyses of health-promoting bacteria: probiogenomics. Nat Rev Microbiol 2009;7:61–71. [DOI] [PubMed] [Google Scholar]
Vich Vila A, Collij V, Sanna S. et al. Impact of commonly used drugs on the composition and metabolic function of the gut microbiota. Nat Commun 2020;11:362. [DOI] [PMC free article] [PubMed] [Google Scholar]
Weber KS, Knebel B, Strassburger K. et al. Associations between explorative dietary patterns and serum lipid levels and their interactions with ApoA5 and ApoE haplotype in patients with recently diagnosed type 2 diabetes. Cardiovasc Diabetol 2016;15:138. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wen L, Ley RE, Volchkov PY. et al. Innate immunity and intestinal microbiota in the development of type 1 diabetes. Nature 2008;455:1109–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wishart DS, Feunang YD, Guo AC. et al. Drugbank 5.0: a major update to the drugbank database for 2018. Nucleic Acids Res 2018;46:D1074–D1082. [DOI] [PMC free article] [PubMed] [Google Scholar]
Xiong Y, Xiong Y, Zhu P. et al. The role of gut microbiota in hypertension pathogenesis and the efficacy of antihypertensive drugs. Curr Hypertens Rep 2021;23:1–14. [DOI] [PubMed] [Google Scholar]
Xuan P, Gu J, Cui H. et al. Multi-scale topology and position feature learning and relationship-aware graph reasoning for prediction of drug-related microbes. Bioinformatics 2024;40:btae025. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zimmermann M, Zimmermann-Kogadeeva M, Wegmann R. et al. Mapping human microbiome drug metabolism by gut bacteria and their genes. Nature 2019;570:462–7. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

btae562_Supplementary_Data

btae562_supplementary_data.zip^{(1.1MB, zip)}

[btae562-B1] Aggarwal N, Kitano S, Puah GRY. et al. Microbiome and human health: current understanding, engineering, and enabling technologies. Chem Rev 2022;123:31–72. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B2] Budzinskaya M, Strakhovskaya M, Andreeva I. et al. Conjunctival microflora and its antibiotic sensitivity after serial intravitreal injections. Vestn Oftalmol 2019;135:135–40. [DOI] [PubMed] [Google Scholar]

[btae562-B3] Dalhoff A, Bowker K, MacGowan A.. Comparative evaluation of eight in vitro pharmacodynamic models of infection: activity of moxifloxacin against Escherichia coli and Streptococcus pneumoniae as an exemplary example. Int J Antimicrob Agents 2020;55:105809. [DOI] [PubMed] [Google Scholar]

[btae562-B4] De Lorenzi-Tognon M, Genton PL, Schrenzel PJ. et al. Synthèse du 8e symposium «feeding the microbiota»: prébiotiques et probiotiques. Rev Med Suisse 2023;19:1149–53. [DOI] [PubMed] [Google Scholar]

[btae562-B5] Deng L, Huang Y, Liu X. et al. Graph2MDA: a multi-modal variational graph embedding model for predicting microbe–drug associations. Bioinformatics 2022;38:1118–25. [DOI] [PubMed] [Google Scholar]

[btae562-B6] Dosovitskiy A, Beyer L, Kolesnikov A. et al. An image is worth 16x16 words: transformers for image recognition at scale. In: International Conference on Learning Representations, 2021. OpenReview.net.

[btae562-B7] Fan L, Wang L, Zhu X.. A novel microbe-drug association prediction model based on stacked autoencoder with multi-head attention mechanism. Sci Rep 2023;13:7396. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B8] Gill SR, Pop M, DeBoy RT. et al. Metagenomic analysis of the human distal gut microbiome. Science 2006;312:1355–9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B9] Hajian-Tilaki K. Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Caspian J Intern Med 2013;4:627–35. [PMC free article] [PubMed] [Google Scholar]

[btae562-B10] Hattori M, Tanaka N, Kanehisa M. et al. SIMCOMP/SUBCOMP: chemical structure search servers for network analyses. Nucleic Acids Res 2010;38(Suppl 2):W652–6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B11] Ince D, Zhang X, Hooper DC.. Activity of and resistance to moxifloxacin in Staphylococcus aureus. Antimicrob Agents Chemother 2003;47:1410–5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B12] Kipf TN, Welling M. Semi-supervised classification with graph convolutional networks. In: International Conference on Learning Representations, 2017. OpenReview.net.

[btae562-B13] Kipf TN, Welling M. Variational graph auto-encoders. stat, 1050:21, 2016.

[btae562-B14] Kumbhare SV, Pedroso I, Ugalde JA. et al. Drug and gut microbe relationships: moving beyond antibiotics. Drug Discov Today 2023;28:103797. [DOI] [PubMed] [Google Scholar]

[btae562-B15] Li T, Wang Z, Guo J. et al. Bacterial resistance to antibacterial agents: mechanisms, control strategies, and implications for global health. Sci Total Environ 2023;860:160461. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B16] Li Z, Han P, You Z-H. et al. In silico prediction of drug-target interaction networks based on drug chemical structure and protein sequences. Sci Rep 2017;7:11174. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B17] Li Z, Li J, Nie R. et al. A graph auto-encoder model for miRNA-disease associations prediction. Brief Bioinform 2021;22:bbaa240. [DOI] [PubMed] [Google Scholar]

[btae562-B18] Long Y, Wu M, Kwoh CK. et al. Predicting human microbe–drug associations via graph convolutional network with conditional random field. Bioinformatics 2020a;36:4918–27. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B19] Long Y, Wu M, Liu Y. et al. Ensembling graph attention networks for human microbe–drug association prediction. Bioinformatics 2020b;36(Supplement_2):i779–i786. [DOI] [PubMed] [Google Scholar]

[btae562-B20] Ma Q, Tan Y, Wang L.. GACNNMDA: a computational model for predicting potential human microbe-drug associations based on graph attention network and CNN-based classifier. BMC Bioinformatics 2023;24:35. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B21] Ontong JC, Ozioma NF, Voravuthikunchai SP. et al. Synergistic antibacterial effects of colistin in combination with aminoglycoside, carbapenems, cephalosporins, fluoroquinolones, tetracyclines, fosfomycin, and piperacillin on multidrug resistant Klebsiella pneumoniae isolates. PLoS One 2021;16:e0244673. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B22] Rajput A, Thakur A, Sharma S. et al. aBiofilm: a resource of anti-biofilm agents and their potential implications in targeting antibiotic drug resistance. Nucleic Acids Res 2018;46:D894–D900. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B23] Saito T, Rehmsmeier M.. The precision-recall plot is more informative than the ROC plot when evaluating binary classifiers on imbalanced datasets. PLoS One 2015;10:e0118432. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B24] Sommer F, Bäckhed F.. The gut microbiota—masters of host development and physiology. Nat Rev Microbiol 2013;11:227–38. [DOI] [PubMed] [Google Scholar]

[btae562-B25] Song W, Xu L, Han C. et al. Drug–target interaction predictions with multi-view similarity network fusion strategy and deep interactive attention mechanism. Bioinformatics 2024;40:btae346. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B26] Sun Y-Z, Zhang D-H, Cai S-B. et al. MDAD: a special resource for microbe-drug associations. Front Cell Infect Microbiol 2018;8:424. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B27] Tahoun AB, Abou Elez RM, Abdelfatah EN. et al. Listeria monocytogenes in raw milk, milking equipment and dairy workers: molecular characterization and antimicrobial resistance patterns. J Glob Antimicrob Resist 2017;10:264–70. [DOI] [PubMed] [Google Scholar]

[btae562-B28] Tan Y, Zou J, Kuang L. et al. GSAMDA: a computational model for predicting potential microbe–drug associations based on graph attention network and sparse autoencoder. BMC Bioinformatics 2022;23:492. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B29] Tian Z, Yu Y, Fang H. et al. Predicting microbe–drug associations with structure-enhanced contrastive learning and self-paced negative sampling strategy. Brief Bioinform 2023;24:bbac634. [DOI] [PubMed] [Google Scholar]

[btae562-B30] Uddin TM, Chakraborty AJ, Khusro A. et al. Antibiotic resistance in microbes: history, mechanisms, therapeutic strategies and future prospects. J Infect Public Health 2021;14:1750–66. [DOI] [PubMed] [Google Scholar]

[btae562-B31] VanEvery H, Franzosa EA, Nguyen LH. et al. Microbiome epidemiology and association studies in human health. Nat Rev Genet 2023;24:109–24. [DOI] [PubMed] [Google Scholar]

[btae562-B32] Ventura M, O'Flaherty S, Claesson MJ.. Genome-scale analyses of health-promoting bacteria: probiogenomics. Nat Rev Microbiol 2009;7:61–71. [DOI] [PubMed] [Google Scholar]

[btae562-B33] Vich Vila A, Collij V, Sanna S. et al. Impact of commonly used drugs on the composition and metabolic function of the gut microbiota. Nat Commun 2020;11:362. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B34] Weber KS, Knebel B, Strassburger K. et al. Associations between explorative dietary patterns and serum lipid levels and their interactions with ApoA5 and ApoE haplotype in patients with recently diagnosed type 2 diabetes. Cardiovasc Diabetol 2016;15:138. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B35] Wen L, Ley RE, Volchkov PY. et al. Innate immunity and intestinal microbiota in the development of type 1 diabetes. Nature 2008;455:1109–13. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B36] Wishart DS, Feunang YD, Guo AC. et al. Drugbank 5.0: a major update to the drugbank database for 2018. Nucleic Acids Res 2018;46:D1074–D1082. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B37] Xiong Y, Xiong Y, Zhu P. et al. The role of gut microbiota in hypertension pathogenesis and the efficacy of antihypertensive drugs. Curr Hypertens Rep 2021;23:1–14. [DOI] [PubMed] [Google Scholar]

[btae562-B38] Xuan P, Gu J, Cui H. et al. Multi-scale topology and position feature learning and relationship-aware graph reasoning for prediction of drug-related microbes. Bioinformatics 2024;40:btae025. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae562-B39] Zimmermann M, Zimmermann-Kogadeeva M, Wegmann R. et al. Mapping human microbiome drug metabolism by gut bacteria and their genes. Nature 2019;570:462–7. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Dynamic category-sensitive hypergraph inferring and homo-heterogeneous neighbor feature learning for drug-related microbe prediction

Ping Xuan

Zelong Xu

Hui Cui

Jing Gu

Cheng Liu

Tiangang Zhang

Peiliang Wu

Roles

Abstract

Motivation

Results

Availability and implementation

1 Introduction

2 Materials and methods

Figure 1.

2.1 Dataset

2.2 Construction of homogeneous and heterogeneous graph for drugs and microbes

2.3 Multiple drugs and microbes association learning based on NHCN

Figure 2.

2.4 Homogeneous and heterogeneous neighbor representation learning based on GCNFP

2.5 Long-distance spatial representation learning based on SCA

Figure 3.

2.6 Multiple representations fusion and optimization

3 Experimental evaluations and discussions

3.1 Parameter settings

3.2 Evaluation metrics

3.3 Ablation experiments

Table 1.

3.4 Comparison with other methods

Table 2.

Table 3.

Figure 4.

3.5 Case studies

Table 4.

Table 5.

Table 6.

3.6 Prediction of novel microbe–drug associations

4 Conclusion

Supplementary Material

Contributor Information

Supplementary data

Conflict of interest

Funding

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases