Multi-scale topology and position feature learning and relationship-aware graph reasoning for prediction of drug-related microbes

Ping Xuan; Jing Gu; Hui Cui; Shuai Wang; Nakaguchi Toshiya; Cheng Liu; Tiangang Zhang

doi:10.1093/bioinformatics/btae025

. 2024 Jan 25;40(2):btae025. doi: 10.1093/bioinformatics/btae025

Multi-scale topology and position feature learning and relationship-aware graph reasoning for prediction of drug-related microbes

Ping Xuan ^1,², Jing Gu ³, Hui Cui ⁴, Shuai Wang ⁵, Nakaguchi Toshiya ⁶, Cheng Liu ⁷, Tiangang Zhang ^8,^9,^✉

Editor: Jonathan Wren

PMCID: PMC10868329 PMID: 38269610

Abstract

Motivation

The human microbiome may impact the effectiveness of drugs by modulating their activities and toxicities. Predicting candidate microbes for drugs can facilitate the exploration of the therapeutic effects of drugs. Most recent methods concentrate on constructing of the prediction models based on graph reasoning. They fail to sufficiently exploit the topology and position information, the heterogeneity of multiple types of nodes and connections, and the long-distance correlations among nodes in microbe–drug heterogeneous graph.

Results

We propose a new microbe–drug association prediction model, NGMDA, to encode the position and topological features of microbe (drug) nodes, and fuse the different types of features from neighbors and the whole heterogeneous graph. First, we formulate the position and topology features of microbe (drug) nodes by t-step random walks, and the features reveal the topological neighborhoods at multiple scales and the position of each node. Second, as the features of nodes are high-dimensional and sparse, we designed an embedding enhancement strategy based on supervised fully connected autoencoders to form the embeddings with representative features and the more discriminative node distributions. Third, we propose an adaptive neighbor feature fusion module, which fuses features of neighbors by the constructed position- and topology-sensitive heterogeneous graph neural networks. A novel self-attention mechanism is developed to estimate the importance of the position and topology of each neighbor to a target node. Finally, a heterogeneous graph feature fusion module is constructed to learn the long-distance correlations among the nodes in the whole heterogeneous graph by a relationship-aware graph transformer. Relationship-aware graph transformer contains the strategy for encoding the connection relationship types among the nodes, which is helpful for integrating the diverse semantics of these connections. The extensive comparison experimental results demonstrate NGMDA’s superior performance over five state-of-the-art prediction methods. The ablation experiment shows the contributions of the multi-scale topology and position feature learning, the embedding enhancement strategy, the neighbor feature fusion, and the heterogeneous graph feature fusion. Case studies over three drugs further indicate that NGMDA has ability in discovering the potential drug-related microbes.

Availability and implementation

Source codes and Supplementary Material are available at https://github.com/pingxuan-hlju/NGMDA.

1 Introduction

The human microbiome is a collection of all microbiota that reside in or on human organs, including bacteria, viruses, protists, fungi, and archaea. Previous human microbiome studies demonstrated that interactions between the human microbes and corresponding hosts regulate human health, such as controlling immune function, providing resistance to pathogens, and even influencing brain physiology and behavior (Duvallet et al. 2017, Zhu et al. 2020). An imbalance of human microbiota and some diseases are closely related, including chronic inflammation, neurological disorders, and breast cancer (Wang et al. 2019, Rackaityte and Lynch 2020).

Microbes can change the toxicity and inhibitory activity of drugs (Nejman et al. 2020, Algavi and Borenstein 2023) and impact the effectiveness of disease treatments by biologically altering a drug’s chemical structure (Yin et al. 2022). Hacioglu et al. (2019) suggested that cooperation between Staphylococcus aureus and Candida albicans leads to drug resistance by strengthening biofilm formation. Also, the gut microbiome produces large quantities of bacterial enzymes that affect therapeutic efficacy (Zimmermann et al. 2019). Therefore, discovering new microbe–drug associations is essential in drug functional studies and precision medicine.

Recently, the computational methods were proposed for predicting the drug–target interactions (Li et al. 2022), incRNA–miRNA interactions (Wang et al. 2022), miRNA–disease associations (Peng et al. 2022a), metabolite–disease associations (Gao et al. 2023), and incRNA–disease associations (Wang et al. 2023a). Computational methods have also shown the ability to determine potential microbe–drug associations and identify reliable drug-related candidates for wet experiments. Microbe–drug association probabilities can be inferred by prediction models using Conditional Random Field (CRF) and Graph Convolutional Network (GCN) (Long et al. 2020a). Long et al. (2020b) proposed EGTMDA to learn node features for microbes and drugs using meta-paths and hierarchical attention mechanism. SCSMDA enhanced representations of drugs and microbes using graph contrastive learning and elaborate meta-paths (Tian et al. 2023). However, shortcomings exist in these methods. GCNMDA used vanilla homogeneous models to learn representations of drugs and microbes without considering abundantly available heterogeneous information. In addition, these methods based on meta-paths focus on neighbors originating from meta-paths while ignoring other non-neighboring nodes across the entire heterogeneous graph.

We proposed NGMDA to predict candidate microbes for drugs by learning the features of drugs and microbes from neighbors and the whole heterogeneous graph. Our contributions are summarized as follows:

The multi-scale topology information of nodes reflects neighbor regions of different ranges, which is important for microbe–drug association prediction. Therefore, topology features of microbe (drug) nodes are designed based on t-step random walks to obtain multi-scale topological neighborhoods of nodes. We also extracted node position features to form the position of each node in the entire heterogeneous graph.
An embedding enhancement strategy (ES) based on fully connected autoencoders with node class labels is proposed to extract important low-dimensional features of the microbe or drug nodes. This strategy also enhances the differences of feature distributions among different types of nodes by determining the node class.
In the microbe–drug heterogeneous graph, different neighbor nodes often have special topological neighborhoods and positional features that affect the importance of neighbors with a target node. A new position-sensitive and topology-sensitive self-attention mechanism (PTA) adaptively distinguishes the contributions of different neighbor nodes. Also, neighbor feature fusion (NFF) models heterogeneity of the graph and aggregates the representations of nodes based on heterogeneous graph neural networks (HGNN) with PTA.
A microbe or drug node may be closely related to distant nodes due to the heterogeneity of the microbe–drug graph. We have designed GFF based on a relationship-aware graph transformer (RAGT) to reveal the diverse connections between the target node and all other nodes in the heterogeneous graph. Comprehensive experiments suggest the superiority of NGMDA by comparing it with advanced methods.

2 Materials and methods

We propose a microbe–drug association prediction model called NGMDA (Fig. 1) that consists of an embedding ES, NFF module, and heterogeneous graph feature fusion (GFF) module. A heterogeneous graph is constructed to describe the diverse connectivity relationships between drugs and microbes (Fig. 1a). The node features of these drugs and microbes are projected into a low-dimensional feature space, and their differences are enhanced to obtain a fine node embedding (Fig. 1a). NFF learns similarity, position, and topology representations between nodes based on position- and topology-sensitive HGNN (Fig. 1b). We use GFF to learn multi-modal representations of various nodes across the heterogeneous graph by a RAGT (Fig. 1c). These four representations are combined into fully connected layers to predict microbe–drug association probabilities.

Figure 1. — Framework of the proposed NGMDA model. (a) Construct microbe–drug heterogeneous graph and enhance similarity and multi-modal embeddings. (b) Fuse features of neighbor nodes by position- and topology-sensitive HGNN. (c) Learn long-distance connection from the entire heterogeneous graph based on RAGT.

2.1 Dataset

Associations between drugs and microbes, similarities between drugs, and the attribute features of the microbes $X^{micr}$ are collected from previously published microbe–drug association prediction work (Long et al. 2020b). We extracted 2470 microbe–drug association data from the Microbe–Drug Associations Database (MDAD) (Sun et al. 2018), which contains 173 microbes and 1373 drugs. Drugbank (Knox et al. 2024) provides the interactions among the drugs. On the basis of the biological hypothesis that the drugs with similar treatment functions are more likely interact with the similar microbes, EGTMDA calculated the Gaussian kernel similarities of drugs based on their interactions. The structural similarity of two drugs was measured based on the common subgraphs within their chemical structures (Hattori et al. 2010). The final drug similarities were obtained by the weighted sum of the drug Gaussian kernel similarities and the drug structure similarities. The sequences of microbes were extracted from NCBI database, and then principal component analysis was utilized to obtain their important features.

2.2 Calculation of microbe similarity

As two microbes with similar gene sequences are typically similar, we calculate the cosine similarity on the attribute characteristics for each microbe. The similarity between microbe $m_{i}$ and $m_{j}$ is $K_{i j}^{micr} \in [0, 1]$ ,

K_{i j}^{micr} = \frac{1}{2} (\frac{X_{i}^{micr} {(X_{j}^{micr})}^{T}}{‖ X_{i}^{micr} ‖ ‖ X_{j}^{micr} ‖} + 1),

(1)

where $X_{i}^{micr}$ is the $i$ -th row of $X^{micr}$ , which contains the main gene sequence characteristics of $m_{i}$ , and ${(X)}^{T}$ is a transposition of $X$ . The microbe similarities were listed in the Supplementary File SF1.

2.3 Microbe–drug heterogeneous graph

We constructed a microbe–drug heterogeneous graph $G = (V, E)$ as shown in Fig. 1a. The node set V consists the drug node subset $V^{drug}$ and microbe node subset $V^{micr}$ and $< i, j > \in E$ represents an edge from node $v_{j}$ to $v_{i}$ . The drug similarity matrix and drug–microbe association matrix are expressed as $K^{drug}$ and $B^{bipa} \in R^{N_{d} \times N_{m}}$ , respectively, where $N_{d}$ (or $N_{m}$ ) denotes the number of drugs (or microbes). If there is a known association exists between drugs $d_{i}$ and $m_{j}$ , then $B_{i, j}^{bipa} = 1$ . Further, $B_{i, j}^{bipa} = 0$ indicates that no connection has yet been found. There are many low similarity data in the similarity matrix, which might be noise in microbe–drug association prediction. When constructing microbe–microbe (or drug–drug) adjacent matrix, connecting edges are added between the microbe (or drug) nodes with a similarity not less than a threshold $β$ . The adjacency matrix of the heterogeneous graph G is represented as $B^{hete} \in R^{(N_{d} + N_{m}) \times (N_{d} + N_{m})}$ , such that

B^{hete} = [\begin{matrix} {\tilde{K}}^{drug} & B^{bipa} \\ {(B^{bipa})}^{T} & {\tilde{K}}^{micr} \end{matrix}],

(2)

where ${\tilde{K}}^{drug}$ (or ${\tilde{K}}^{micr}$ ) is the drug (or microbe) similarity matrix after thresholding.

2.4 Heterogeneous graph node feature construction and enhancement

2.4.1 Heterogeneous graph node feature construction

The heterogeneous graph node features are constructed by a drug–drug similarity matrix, microbe–microbe similarity matrix, and drug–microbe association matrix. The similarity feature matrix is formed by combining the drug and microbe similarities defined above as

H_{simi} = [\begin{matrix} K^{drug} \\ K^{micr} \end{matrix}],

(3)

where $K_{i}^{drug}$ (or $K_{j}^{micr}$ ) contains the similarities between $d_{i}$ (or $m_{j}$ ) and other drugs (or microbes). The multi-modal feature matrix $H_{moda} \in R^{(N_{d} + N_{m}) \times (N_{d} + N_{m})}$ can be represented as

H_{moda} = [\begin{matrix} K^{drug} & B^{bipa} \\ {(B^{bipa})}^{T} & K^{micr} \end{matrix}],

(4)

where the i-th row in $H_{moda}$ records the similarities between $d_{i}$ and all other drugs and the associations between $d_{i}$ and all other microbes. The association with drugs and similarities between microbes are contained in the ( $N_{d} + j$ )-th row. Because similarity and multi-modal features are common node attributes for microbe–drug association prediction (Peng et al. 2017, 2021, Meng et al. 2023, Wang et al. 2023), we designate these as the original features of the nodes. Existing GNN models fail to fully consider the position and topology information of nodes, so we construct position and topology features of the microbe and drug nodes. The position of $v_{i}$ within the heterogeneous graph is determined by the connection between $v_{i}$ and other nodes. The position feature matrix is defined as $H_{posi} = B^{hete}$ , where the position feature of $v_{i}$ is $H_{posi, i}$ . A random walk of t-steps contains a t-hop topological neighborhood of nodes within a heterogeneous graph (Dwivedi et al. 2022) and is defined as

R W^{t} = {(B^{hete} {(D^{hete})}^{- 1})}^{t},

(5)

where t is the number of walking steps and $D^{hete}$ is degree matrix of $B^{hete}$ . $R W_{i, j}^{t}$ represents the probability of visiting $v_{i}$ to $v_{j}$ in the t-th step random walk and contains the topological neighborhood information of the t-th step of $v_{i}$ . The topology feature $H_{topo, i} \in R^{t}$ of $v_{i}$ is defined as

H_{topo, i} = [R W_{i, i}, R W_{i, i}^{2}, \dots, R W_{i, i}^{t}],

(6)

which contains the multi-scale topological neighborhood information of $v_{i}$ .

2.4.2 Enhancing node embedding

The original features specified above are high-dimensional sparse and contain some noise. A projection operation maps drug and microbe node features into the same embedding space, which drops information about the differences in the embedding distributions of different types of nodes. Figure 2 outlines our node embedding ES to learn representative embeddings and enhance the embedding distribution differences of the microbe and drug nodes. As autoencoders could effectively reduce the noise component in these embeddings, we learn important low-dimensional node embeddings based on fully connected autoencoders. The projection and reconstruction process of multi-modal and similarity features are similar, and we use similarity features as an example to describe the process here. The similarity feature of $v_{i}$ , $H_{simi, i}$ , is projected into $N_{p}$ dimensional space to form

H_{simi, i}^{enco, 1} = σ (Linear_enc o_{simi, ϕ (v_{i})}^{(1)} (H_{simi, i})),

(7)

where $Linear_enco$ is a linear layer, $σ$ represents the non-linear activation function ReLU, and $ϕ (v_{i})$ indicates the type of $v_{i}$ . The similarity embedding of $v_{i}$ is learned from the l-th fully connected encoding layer as

\begin{matrix} H_{simi, i}^{enco, l} = σ (Linear_enc o_{simi, ϕ (v_{i})}^{(l)} (H_{simi, i}^{enco, l - 1})), \\ l = 1, 2, \dots, L_{enco}, \end{matrix}

(8)

where $L_{enco}$ is the total number of encoding layers. $H_{simi, i}^{enco, L_{enco}}$ is used as the input of the decoder, and the output of the l-th fully connected decoding layer is

\begin{matrix} H_{simi, i}^{deco, l} = σ (Linear_dec o_{simi, ϕ (v_{i})}^{(l)} (H_{simi, i}^{deco, l - 1})), \\ l = 1, 2, \dots, L_{deco}, \end{matrix}

(9)

where $L_{deco}$ is total number of decoding layers, $Linear_deco$ denotes the linear layer and $H_{simi, i}^{deco, 0} = H_{simi, i}^{enco, L_{enco}}$ . After projection, the multi-modal embedding $H_{moda}^{enco, L_{enco}} \in R^{(N_{d} + N_{m}) \times (N_{p})}$ can be learned. The mean square error estimates the reconstruction loss of the node similarity features as

γ_{reco, simi} = \frac{1}{| T |} \sum_{i \in T} {‖ H_{simi, i} - H_{simi, i}^{L_{deco}} ‖}^{2},

(10)

where T is the batch of nodes in the training set. Similarly, the reconstruction loss of the multi-modal feature is $γ_{reco, moda}$ .

Figure 2. — Enhancing node embeddings of microbes and drugs by supervised autoencoders.

We classify the projected node embedding to enhance the differences between the drug and microbe embedding distributions. Considering a multi-modal embedding, as an example, is the input of the classify and $labe l_{i}$ is the corresponding classification labels. The classification loss of the multi-modal embedding in the training samples is estimated by the cross-entropy loss function

γ_{clas, moda} = \frac{1}{| T |} \sum_{i \in T} \sum_{k = 1}^{2} - l abe l_{i, k} \cdot \log (Linear_cla s^{moda} {(H_{moda, i}^{enco, L_{enco}})}_{k}),

(11)

where $Linear_cla s^{moda} \in R^{N_{p} \times 2}$ . The classification loss of the similarity embedding is represented as $γ_{inty, simi}$ . The total loss of the embedding classification of the drug and microbe nodes is

γ_{proc} = γ_{reco, simi} + γ_{reco, moda} + γ_{inty, simi} + γ_{inty, moda} .

(12)

2.5 Neighbor feature fusion

The topological neighborhood and position information of the neighboring nodes impact their importance with a target node. We propose a NFF module based on HGNN with a PTA to learn representative similarity and the position and topology representations of each microbe and drug nodes. The relationship types between the nodes are critical auxiliary features, so we also calculate the importance of the integrated features, as shown in Fig. 3.

Figure 3. — Calculation of position-sensitive and topology-sensitive self-attention of $v_{j}$ to $v_{i}$ .

The type of relationship between nodes contains the similarity relationship between drugs (or microbes) and the association relationship between drugs (or microbes) and microbes (or drugs). The relationship type of $v_{j}$ to $v_{i}$ is represented as

ψ (< i, j >) = {\begin{array}{l} 0, & i f v_{i}, v_{j} \in V^{drug} \\ 1, & i f v_{i} \in V^{micr}, v_{j} \in V^{drug} \\ 2, & i f v_{i} \in V^{drug}, v_{j} \in V^{micr} \\ 3, & i f v_{i}, v_{j} \in V^{micr} \end{array} .

(13)

Then, the importance of the relationship type of $v_{j}$ to $v_{i}$ is $r_{ψ (< i, j >)}^{l}$ , which is learned during the training process at the l-th layer, and $r_{ψ (< i, j >)}^{0} = 1$ .

Multi-head attention can reasonably stabilize the learning process of self-attention by allocating the attention value of each head (Veličković et al. 2018). After obtaining the similarity representations $H_{simi, j}^{l - 1}$ and $H_{simi, i}^{l - 1}$ of $v_{i}$ and $v_{j}$ , respectively, at $l - 1$ -th layer, we compute the importance of the similarity representation of $v_{j}$ to $v_{i}$ in the next layer by

\begin{matrix} s_{i, j, k}^{l} = {(softmax (W_{p (k)}^{l} W_{ϕ (v_{i}), k}^{l} H_{simi, i}^{l - 1}))}^{T} n orm (W_{ϕ (v_{j}), k}^{l} H_{simi, j}^{l - 1}), \\ l = 1, 2, \dots, L_{nff} and k = 1, 2, \dots, K_{nff}, \end{matrix}

(14)

where $L_{nff}$ is the total layer number, $K_{nff}$ is the head number, $H_{simi, i}^{0} = H_{simi, i}^{enco, L_{enco}}$ , $W_{ϕ (v_{i}), k}^{l}$ and $W_{p (k)}^{l} \in R^{N_{p} \times N_{p}}$ are weight matrices, and norm represents $L_{2}$ normalization. The $H_{simi, i}^{l - 1}$ and $H_{simi, j}^{l - 1}$ terms are transformed into latent representations $W_{ϕ (v_{i}), k}^{l} H_{simi, i}^{l - 1}$ and $W_{ϕ (v_{j}), k}^{l} H_{simi, j}^{l - 1}$ . Then, $W_{p (k)}^{l} W_{ϕ (v_{i}), k}^{l} H_{simi, i}^{l - 1}$ is the distribution of the similarity representation importance, which is converted to a probability distribution using the softmax function. As the magnitude of the latent representation affects the importance score of $v_{j}$ to $v_{i}$ , we standardize $W_{ϕ (v_{j}), k}^{l} H_{simi, j}^{l - 1}$ with $L_{2}$ normalization. The importance score of $v_{j}$ to $v_{i}$ is calculated by the inner product of the importance distribution of $v_{i}$ and the representation of $v_{j}$ . The multiple neighbors of node $v_{i}$ have their various topological neighborhoods and positions, so these neighbors have different importance for $v_{i}$ ’s feature learning. Therefore, the importance of each neighbor node for $v_{i}$ was calculated before the $v_{i}$ ’s features were updated. We calculate the importance $c_{i, j}^{l} \in [0, 1]$ of the position and topology of $v_{j}$ to $v_{i}$ by

c_{i, j}^{l} = [τ \cdot cosine (H_{posi, i}^{l - 1}, H_{posi, j}^{l - 1}) + (1 - τ) \cdot cosine (H_{topo, i}^{l - 1}, H_{topo, j}^{l - 1}) + 1] / 2,

(15)

where $H_{posi, i}^{l - 1}$ and $H_{topo, i}^{l - 1}$ are the position and topology representations of $v_{i}$ , respectively, $H_{posi, i}^{0} = H_{posi, i}$ and $H_{topo, i}^{0} = H_{topo, i}$ . The parameter $τ \in [0, 1]$ balances the contributions between the position and topology representations. The importance score of $v_{j}$ to $v_{i}$ is then

a_{i, j, k}^{l} = softma x_{j \in N (i)} (r_{ψ (< i, j >)}^{l} \cdot s_{i, j, k}^{l} \cdot c_{i, j}^{l}) .

(16)

Here, $a_{i, j, k}^{l}$ is position and topology sensitive by integrating the importance of the neighbor position and topology.

As residuals can alleviate over-smoothing and vanishing gradients (Lv et al. 2021), we add a node residual for every attention head. The similarity representation of $v_{i}$ is then updated as

H_{simi, i, k}^{l} = σ (\sum_{j \in N (i)} a_{i, j, k}^{l} \cdot W_{ϕ (v_{i}), k}^{l} \cdot H_{simi, j}^{l - 1} + W_{res (k)}^{l} \cdot H_{simi, i}^{l - 1}),

(17)

where $W_{res (k)}^{l} \in R^{N_{p} \times N_{p}}$ is the weight matrix. The similarity representations of the different heads are aggregated at the l-th layer to obtain

H_{simi, i}^{l} = \frac{1}{K_{nff}} \sum_{k = 1}^{K_{nff}} H_{simi, i, k}^{l},

(18)

where $K_{nff}$ is the head number of the NFF. As the similarity, position, and topology representations have the same update procedure, the position and topology representations are updated as $H_{posi, i}^{l}$ and $H_{topo, i}^{l}$ in the l-th layer, respectively.

2.6 Heterogeneous GFF

2.6.1 Relationship type encoding

Relationship types of similarity and association can reflect diverse semantic connections between drug and microbe nodes. The relationship type $ψ (< i, j >)$ of $v_{j}$ to $v_{i}$ is represented as a one-hot vector that is linearly transformed to obtain the embedding of the relationship type $e_{ψ (< i, j >)} \in R^{N_{p}}$ .

2.6.2 Relationship-aware graph transformer

To capture the connection between the target node and distant nodes, we designed a heterogeneous GFF module presented in Fig. 1c. A RAGT is proposed within the GFF and inspired by these methods (Diao and Loynd 2022, Peng et al.2022b, c). To embed the relationship type $ψ (< i, j >)$ of $v_{j}$ to $v_{i}$ into the query, key, and value vectors, we concatenate a multi-modal feature of $v_{i}$ (or $v_{j}$ ) and the relationship type embedding $e_{ψ (< i, j >)}$ . We obtain the query, key, and value vectors of the h-th head in the l-th layer by linear transformations such that

\begin{matrix} q_{i, j, h}^{l} = [H_{moda, i}^{enco, l - 1}, e_{ψ (< i, j >)}] W_{ϕ (v_{i}), Q, h}^{l}, \\ k_{i, j, h}^{l} = [H_{moda, j}^{enco, l - 1}, e_{ψ (< i, j >)}] W_{ϕ (v_{j}), K, h}^{l}, \\ v_{i, j, h}^{l} = [H_{moda, j}^{enco, l - 1}, e_{ψ (< i, j >)}] W_{ϕ (v_{j}), V, h}^{l}, \\ l = 1, 2, \dots, L_{gff} and h = 1, 2, \dots, K_{gff}, \end{matrix}

(19)

where $W_{ϕ (v_{i}), Q, h}^{l}, W_{ϕ (v_{i}), K, h}^{l}, W_{ϕ (v_{i}), V, h}^{l} \in R^{2 \times N_{p}, \frac{N_{p}}{K_{gff}}}$ are weight matrices, $H_{enc, i}^{G, 0} = H_{enc, i}^{G}$ , $L_{gff}$ is the total layer number, and $K_{gff}$ indicates the head number of GFF. The importance of $v_{j}$ to $v_{i}$ is

f_{i, j, h}^{l} = softma x_{j \in V} (\frac{{(q_{i, j, h}^{l})}^{T} k_{i, j, h}^{l}}{\sqrt{N_{p} / K_{gff}}}) .

(20)

After aggregating the neighbor representations of $v_{i}$ in the h-th head, we concatenate the representations from each head to form

m_{i}^{l} = ∥_{h = 1}^{K_{gff}} (\sum_{j = 1}^{| V |} f_{i, j, h}^{l} \cdot v_{i, j, h}^{l}),

(21)

where $∥$ is the concatenation operation. The application of layer normalization (LayerNorm) is crucial for this training process and for expressing the capacity of attention (Brody et al. 2023). The multi-modal representation $H_{moda, i}^{l}$ of $v_{i}$ is updated based on LayerNorm, such that

\begin{matrix} n_{i}^{l} = LayerNorm (W_{1}^{l} m_{i}^{l} + H_{moda, i}^{l - 1}), \\ H_{moda, i}^{l} = LayerNorm (σ (W_{2}^{l} n_{i}^{l}) W_{3}^{l} + n_{i}^{l}), \end{matrix}

(22)

where $W_{1}^{l}, W_{2}^{l}, W_{3}^{l} \in R^{N_{p} \times N_{p}}$ represent weight matrices.

2.7 Representation integration and optimization

The original features and representations learned from shallower layers retain the detailed information of the nodes, along with more abstract information are learned from deeper layers. By concatenating the original features and representations from each layer of NFF and GFF, the final representation of the drug $d_{i}$ is formed as

{\tilde{H}}_{i} = [H_{simi, i}, ∥_{l = 0}^{L_{nff}} [H_{posi, i}^{l}, H_{topo, i}^{l}, H_{simi, i}^{l}], H_{moda, i}, ∥_{l = 0}^{L_{gff}} H_{moda, i}^{l}] .

(23)

Likewise, we obtain the final representation ${\tilde{H}}_{j + N_{d}}$ of microbe $m_{j}$ . Following the stack of linear layers and the non-linear activation function, ${\tilde{H}}_{i}$ and ${\tilde{H}}_{j + N_{d}}$ are combined to compute the association prediction score $pre d_{i, j} \in R^{2}$ , such that

\begin{matrix} u_{i, j} = σ (W_{attr} [{\tilde{H}}_{i}, {\tilde{H}}_{j + N_{d}}] + b_{attr}), \\ pre d_{i, j} = softmax (W_{pred} u_{i, j} + b_{pred}), \end{matrix}

(24)

where $W_{attr}$ (or $W_{pred}$ ) is the weight matrix and $b_{attr}$ (or $b_{pred}$ ) is a bias vector. Then, $pre d_{i, j} = [{(pre d_{i, j})}_{0}, {(pre d_{i, j})}_{1}]$ , where ${(pre d_{i, j})}_{0}$ represents the unrelated probability between $d_{i}$ and $m_{j}$ , and the associated probability is ${(pre d_{i, j})}_{1}$ . The loss of the association prediction is represented by

γ_{pred} = \frac{1}{| B |} \sum_{(i, j) \in B} \sum_{k = 1}^{2} - (label {(i, j)}_{k} \cdot \log ({(pre d_{i, j})}_{k}),

(25)

where B is the training example set and $label (i, j)$ is the association label of $d_{i}$ and $m_{j}$ . The final loss of NGMDA $γ$ is the weighted sum of the ES loss $γ_{proc}$ and the association prediction loss $γ_{pred}$ , such that

γ = ε \cdot γ_{proc} + (1 - ε) \cdot γ_{pred},

(26)

where the balance factor $ε \in [0, 1]$ is a hyper-parameter.

3 Experimental evaluation and discussion

3.1 Evaluation metrics

The performance of NGMDA and other comparison methods is evaluated with 5-fold cross-validation. All known associations between drugs and microbes are classified as positive samples, with associations equally divided into five parts. All unobserved microbe–drug associations are taken as negative samples to form a set of negative samples. Four positive examples and an equal number of negative examples are randomly selected from the negative sample to be utilized for training, and the remainder are test examples.

The area under the receiver operating characteristic curve (AUC) (Huang and Ling 2005), the area under the precision–recall curve (AUPR) (Saito and Rehmsmeier 2015), and the recall rate of the top-k candidate microbes associated with drugs are selected as our evaluation indicators. If the association score between $d_{i}$ and $m_{j}$ is less than a threshold $θ$ , then it is considered a negative sample. Otherwise, it is identified as a positive sample. The TPRs, FPRs, precisions, and recalls of each drug were calculated at different threshold $θ$ , we calculated the average AUCs and average AUPRs of 1373 drugs for each fold. The 5-fold AUCs (or AUPRs) were averaged as the final AUC (or AUPR). Considering that high-ranking candidates may be chosen by biologists for humidity experiments, more positive samples are expected to appear as top-rank candidates. Hence, we compute a recall rate of the top-k candidate microbes of drug $d_{i}$ .

3.2 Parameter settings

NGMDA runs on a 2080ti server based on the PyTorch framework and is optimized with the Adam algorithm. The proposed model has some hyper-parameters including the steps of random walking, the layer numbers of NFF and that of GFF, and the balance factor of loss $ϵ$ . We firstly establish the variation range for each hyper-parameter, and then select the value which obtains the best performance for the model as the final value of the hyper-parameter. To assess the effect of random walk step size on the prediction performance, the step size was selected from {1, 2, 4, 8, 16, 32}. The model achieves the highest AUC (AUC = 0.944) and AUPR (AUPR = 0.728) when step size is 2 (Supplementary Table ST1). The random walk steps are set to two for the topological embedding formation. For NFF and GFF, we fine-tuned the layer number within a range, {1, 2, 3}, and performed all the combinations of the layer number of NFF and GFF. As shown in the Supplementary Table ST2, the model gets the best performance when their layer numbers are two. The balance factor $ε$ regulates the importance of the loss of embedding ES and that of the association prediction loss, and it was chosen from the range of {0, 0.1, $\dots$ , 0.5}. Supplementary Table ST3 demonstrates the corresponding results and $ε$ was set to 0.2 finally. The drug (microbe) similarity threshold, $β$ , was selected from {0.5, 0.6, $\dots$ , 0.9}, and it was set to 0.9 in our experiment (Supplementary Table ST4). Parameter $τ$ is utilized to balance the importance of the topology and position features, and $τ$ varies from 0 to 1 with a step size of 0.2. Supplementary Table ST5 indicates $τ$ value of 0.4 is more favorable for the prediction performance of the model.

3.3 Ablation experiments

We perform ablation experiments to evaluate the contributions of position and topology feature learning (PTL), ES, NFF, and GFF as listed in Table 1. For NGMDA without GFF, the AUC and AUPR metrics drop by 1.1% and 6.0%, respectively. The AUC and AUPR of NGMDA without NFF decrease by 1.0% and 5.3%, respectively, compared to the whole model. The AUC and AUPR decrease by 0.6% and 4.6%, respectively, if NGMDA has no ES. The AUC and AUPR of our model achieve 0.9% and 1.5%, respectively, higher than NGMDA without PTL. We built the prediction model without multi-scale topological feature learning and the one without position feature learning, respectively. Their AUCs decreased by 0.8% and 0.4%, and their AUPR decreased by 1% and 0.5%, respectively. After the relationship type integration was eliminated from the prediction model, its AUC and AUPR decreased by 0.6% and 2.7%.

Table 1.

Results of the ablation studies.

Networks	Average AUC	Average AUPR
NGMDA	0.944	0.728
NGMDA w/o PTL	0.935	0.713
NGMDA w/o ES	0.938	0.682
NGMDA w/o NFF	0.934	0.675
NGMDA w/o GFF	0.933	0.668
NGMDA w/o Topo	0.936	0.718
NGMDA w/o Posi	0.94	0.723
NGMDA w/o Rel	0.938	0.701

Open in a new tab

The ablation experiments indicate that merging node features of the heterogeneous graph contributes the most to model performance (Table 1). A possible reason is that some non-neighboring nodes exist across the entire heterogeneous graph that are also closely related to the target node. NFF achieves the second most significant contribution, suggesting that the neighboring node information of the target node is also important. The embedding ES boosts the prediction performance, which suggests its value in reducing noise in the node embeddings and enhancing differences in the node distributions. Multi-scale topology and position features indicated the neighbors with multiple ranges and the location information of each node were important for the improved prediction performance. The experimental results (Table 1) also demonstrated the relationship type integration is helpful for improving the prediction performance.

3.4 Comparison with other methods

NGMDA is compared with five state-of-the-art microbe–drug association prediction methods, including GCNMDA (Long et al. 2020a), EGATMDA (Long et al. 2020b), GSAMDA (Tan et al. 2022), GACNNMDA (Ma et al. 2023), and SCSMDA (Tian et al. 2023). NGMDA and five compared methods were trained and tested by using the same data separation during 5-fold cross-validation. The hyper-parameters of these methods are set according to their corresponding literature. We briefly describe these comparison methods in the following.

GCNMDA (Long et al. 2020a): It established a microbe–drug heterogeneous network and integrated multiple kinds of similarities. These similarities were measured based on the chemical structures of drugs, the Gaussian interaction profiles of drugs (microbes), and the microbe sequences. The prediction model was constructed based on GCN and CRF.
EGATMDA (Long et al. 2020b): It constructed a microbe–disease–drug network and then inferred the microbe–drug associations by a hierarchical attention mechanism.
GSAMDA (Tan et al. 2022): The model calculated the drug (microbe) similarities based on the Gaussian interaction profiles and Hamming interaction profiles of drugs (or microbes), and learned the node features by the graph attention networks and sparse auto-encoder.
GACNNMDA (Ma et al. 2023): The multiple microbe–drug heterogeneous networks were constructed based on the Gaussian interaction and Hamming interaction profiles of drugs (microbes). The potential microbe–drug associations were identified by the convolutional neural networks.
SCSMDA (Tian et al. 2023): The model constructed the microbe–drug networks based on the microbe gene sequence information, the Gaussian kernel interaction profiles of drugs (or microbes), and the chemical structures of drugs. It learned the features of the microbe and drug nodes by graph contrastive learning.

We first compute the AUC and AUPR and then calculate the average AUC and AUPR over 1373 drugs. As shown in Table 2, NGMDA achieves the best average AUC of 0.944, which is 0.4% higher than the second-best EGATMDA model, 4.1% better than GCNMDA, 10.1% over GACNNMDA, 4.2% superior to GSAMDA, and 2.8% greater than SCSMDA. NGMDA also produces the best average AUPR of 72.8%, which is 38.8%, 41.3%, 42.1%, 53.2%, and 48.1% better than SCSMDA, GCNMDA, EGATMDA, GACNNMDA, and GASMDA, respectively. We compute the average AUCs (AUPRs) of 1373 drugs per fold for NGMDA and each of the compared methods. To observe whether NGMDA’s prediction performance is significantly higher than each compared method, the statistical test was conducted. NGMDA has 1373 AUCs (AUPRs) for the 1373 drugs, and the compared methods also have 1373 AUCs (AUPRs) for these drugs. The paired Wilcoxon test was executed on NGMDA’s AUCs (AUPRs) and the AUCs (AUPRs) of the compared methods (Table 3). The results indicated NGMDA obtained the significantly higher prediction performance than all the compared methods.

Table 2.

AUCs and AUPRs of different methods in comparison all the 1373 drugs.

Networks	AUC (%)	AUPR (%)
NGMDA	94.4	72.8
SCSMDA	91.6	34.0
GSAMDA	90.2	24.7
GACNNMDA	84.3	19.6
EGATMDA	94.0	30.7
GCNMDA	90.3	31.5

Open in a new tab

Table 3.

The paired Wilcoxon test result on AUCs and AUPRs of 1373 drugs comparing NGMDA with other compared methods.

	GCNMDA	EGATMDA	GACNNMDA	GSAMDA	SCSMDA
P-value of AUCs	4.21e-155	1.70e-59	7.26e-159	4.38e-150	1.40e-151
P-value of AUPRs	1.47e-186	2.26e-156	6.63e-196	1.30e-186	1.82e-215

Open in a new tab

The performances of GCNMDA, GACNNMDA, and GSAMDA are not as good as NGMDA, EGATMDA, and SCSMDA. This outcome is likely because these learn node representations using simple models (e.g. GCN and GAT) without considering node or edge types in the microbe–drug heterogeneous graph. EGATMDA and SCSMDA learn the features of drugs and microbes from semantic information based on meta-paths. These models only focus on learning features of the neighbor nodes derived from meta-paths and do not consider the remaining nodes across the entire heterogeneous graph.

The average recalls under different top-k candidate microbes for all drugs are presented in Fig. 4. NGMDA outperforms all other methods at different top cutoffs due to its enhanced embedding of the nodes and fusing the features of neighbor nodes and the whole heterogeneous graph. When k = 3, our model achieves the highest recall rate of 76.6%, where the second-best 48.7% is attained by EGATMDA. SCSMDA achieves the fourth-best result with a recall rate of 44.7%, which is 0.5% below GCNMDA. When k is 6, 9, and 12, NGMDA maintains the best recall values of 81.3%, 83.4%, and 85.7%, respectively. The second performer is EGATMDA with recall rates of 67.8%, 74.9%, and 80.1%, respectively. SCSMDA surpasses GCNMDA with recall rates of 63.6%, 67.6%, and 71.4%, respectively, while the recall rates of GCNMDA are lower at 61.5%, 66.8%, and 70.8%, respectively. GSAMDA does not perform well with recall rates of 55.7%, 63.7%, and 68.4%, respectively, while still being consistently higher than GACNNMDA, which obtained the lowest recall rates of 42.5%, 49.9%, and 56.2%, respectively.

Figure 4. — The average recalls of drugs at different top k settings.

3.5 Case studies on three drugs

To confirm NGMDA’s discovery potential of drug-related microbial candidates, case studies with Ciprofloxacin, Moxifloxacin, and Vancomycin are performed. Ciprofloxacin treats skin infections, typhoid fever, pneumonia, endocarditis, and other bacterial infections. Moxifloxacin treats pneumonia, tuberculosis, sinusitis, and chronic bronchitis. Vancomycin is an antibiotic that treats bloodstream infections, endocarditis, and orthopedic infections. All the known microbe–drug associations and the randomly selected equal number of unobserved microbe–drug associations were utilized to train the model for case studies. Candidate microbes are obtained for each of these drugs, and we collected the top 20 candidates, as listed in Tables 4–6.

Table 4.

The top-20 candidate microbes of Ciprofloxacin.

Rank	Microbe name	Evidence	Rank	Microbe name	Evidence
1	Candida albicans	PMID: 31471074	11	Bacillus subtilis	MDAD
2	Pseudomonas aeruginosa	aBiofilm, MDAD	12	Actinomyces oris	Unconfirmed
3	Staphylococcus aureus	aBiofilm, MDAD	13	Human immunodeficiency virus 1	PMID: 9566552
4	Escherichia coli	aBiofilm, MDAD	14	Streptococcus sanguis	PMID: 11347679
5	Streptococcus mutans	PMID: 30468214	15	Stenotrophomonas maltophilia	aBiofilm, MDAD
6	Staphylococcus epidermis	PMID: 10632381	16	Haemophilus influenzae	MDAD
7	Staphylococcus epidermidis	PMID: 28481197	17	Listeria monocytogenes	PMID: 28355096
8	Salmonella enterica	PMID: 26933017	18	Burkholderia cenocepacia	PMID: 27799222
9	Vibrio harveyi	PMID: 27247095	19	Streptococcus pneumoniae	PMID: 26100702
10	Enterococcus faecalis	PMID: 27790716	20	Serratia marcescens	PMID: 23751969

Open in a new tab

The MDAD (Sun et al. 2018) provides microbe–drug associations that were verified by experimental or clinical studies. The aBiofilm database (Rajput et al. 2018) organizes data on anti-biofilm agents disrupting biofilms, covering 1720 drugs and 140 microbes. We use MDAD, aBiofilm database, and literature to verify the microbe–drug association prediction results of NGMDA. Among the top 20 candidate microbes related to Ciprofloxacin, six are recorded by MDAD, and four are contained in the aBiofilm database, which suggests that these microbes are indeed associated with the drug Ciprofloxacin, and these 13 candidates are further confirmed by the literature. For example, several microbes, including Candida albicans, Human immunodeficiency virus 1, Streptococcus mutans, and Streptococcus pneumoniae, are inhibited (or killed) by Ciprofloxacin (Gollapudi et al. 1998, Dridi et al. 2015, Hacioglu et al. 2019, Zhang et al. 2019). The two microbes, Staphylococcus epidermidis and Salmonella enterica, were validated to be highly susceptible to Ciprofloxacin (Eibach et al. 2016, Szczuka et al. 2017). In addition, Vibrio harveyi, Enterococcus faecalis, and Listeria monocytogenes are identified as Ciprofloxacin-resistant microbes (Stalin and Srinivasan 2016, Escolar et al. 2017, Kim and Woo 2017). For the microbe candidates related to Moxifloxacin in Table 5, three candidates are included in MDAD, two in the aBiofilm database and 14 candidates are supported by the literature. Considering the candidate microbes of Vancomycin in Table 6, Staphylococcus is confirmed by the MDAD and aBiofilm databases, and 14 candidates are supported by literature. Among all 60 microbe candidates, nine are unconfirmed, which indicates that no relevant evidence is found to support their association. The above analysis demonstrates that NGMDA can discover potential candidate microbes for target drugs under study.

Table 5.

The top-20 candidate microbes of Moxifloxacin.

Rank	Microbe name	Evidence	Rank	Microbe name	Evidence
1	Pseudomonas aeruginosa	PMID: 31691651	11	Staphylococcus epidermidis	PMID: 11249827
2	Staphylococcus aureus	PMID: 31689174	12	Candida albicans	aBiofilm, MDAD
3	Escherichia coli	PMID: 31542319	13	Streptococcus pneumoniae	PMID: 22407042
4	Vibrio harveyi	Unconfirmed	14	Serratia marcescens	Unconfirmed
5	Bacillus subtilis	PMID: 30036828	15	Acinetobacter baumannii	PMID: 12951327
6	Listeria monocytogenes	PMID: 28739228	16	Actinomyces oris	PMID: 26538502
7	Salmonella enterica	PMID: 22151215	17	Clostridium perfringens	PMID: 29486533
8	Stenotrophomonas maltophilia	aBiofilm, MDAD	18	Klebsiella pneumoniae	PMID: 27257956
9	Burkholderia cenocepacia	PMID: 28355096	19	Burkholderia pseudomallei	PMID: 15731198
10	Burkholderia multivorans	Unconfirmed	20	Haemophilus influenzae	MDAD

Open in a new tab

Table 6.

The top-20 candidate microbes of Vancomycin.

Rank	Microbe name	Evidence	Rank	Microbe name	Evidence
1	Staphylococcus aureus	MDAD; aBiofilm	11	Streptococcus mutans	PMID: 464571
2	Pseudomonas aeruginosa	PMID: 26980934	12	Stenotrophomonas maltophilia	Unconfirmed
3	Escherichia coli	PMID: 33468474	13	Streptococcus pneumoniae	PMID: 10376600
4	Staphylococcus epidermidis	PMID: 20685088	14	Acinetobacter baumannii	PMID: 23422916
5	Bacillus subtilis	PMID: 14165485	15	Actinomyces oris	PMID: 26538502
6	Enterococcus faecalis	PMID: 15528891	16	Salmonella enterica	Unconfirmed
7	Vibrio harveyi	PMID: 25066453	17	Klebsiella pneumoniae	Unconfirmed
8	Listeria monocytogenes	PMID: 10588323	18	Clostridium perfringens	PMID: 16870765
9	Burkholderia cenocepacia	Unconfirmed	19	Serratia marcescens	Literature (Ali 2018)
10	Burkholderia multivorans	Unconfirmed	20	Streptococcus sanguis	PMID: 7287904

Open in a new tab

3.6 Prediction of novel microbe–drug associations

NGMDA is implemented to predict the potential candidate microbes for all drugs. The top-ranked 20 microbe candidates are listed in the Supplementary File SF2, which can be leveraged by biologists to screen reliable candidate microbes.

4 Conclusion

We proposed a novel microbe–drug association prediction model to encode node neighborhood topologies across multiple scales and perform graph inference by propagating different types of connections and information about the nodes. The multi-scale topology feature is formed by estimating the probability that a random walker accesses itself in different steps. The established node embedding strategy enhances the representations of microbe and drug nodes that form the specific distribution of the corresponding node. The NFF combines the features of different types of neighbors and target nodes by adaptively evaluating the weights of the position features, topology features, and original features of the neighbor nodes. The long-distance connections and encoding of the relationship types between the nodes through GFF enable the knowledge propagation of the entire graph and the capture of diverse relationships. Cross-validation experimental results on public datasets suggest the superiority and effectiveness of NGMDA. The average recall rate of drugs and case analyses of experimental results further demonstrate that NGMDA provides reliable microbe candidates for related drugs under investigation.

Supplementary Material

btae025_Supplementary_Data

Click here for additional data file.^{(1.3MB, zip)}

Contributor Information

Ping Xuan, School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China; Department of Computer Science, Shantou University, Shantou 515063, China.

Jing Gu, School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China.

Hui Cui, Department of Computer Science and Information Technology, La Trobe University, Melbourne, VIC 3083, Australia.

Shuai Wang, School of Information Science and Engineering, Yanshan University, Qinhuangdao 066004, China.

Nakaguchi Toshiya, Center for Frontier Medical Engineering, Chiba University, Chiba 2638522, Japan.

Cheng Liu, Department of Computer Science, Shantou University, Shantou 515063, China.

Tiangang Zhang, School of Computer Science and Technology, Heilongjiang University, Harbin 150080, China; School of Mathematical Science, Heilongjiang University, Harbin 150080, China.

Supplementary data

Supplementary data are available at Bioinformatics online.

Conflict of interest

The Authors declare that there is no conflict of interest.

Funding

This work was supported by the Natural Science Foundation of China [62372282, 61972135, 62172143]; STU Scientific Research Initiation Grant [NTF22032]; and the Natural Science Foundation of Heilongjiang Province [LH2023F044].

Data availability

The dataset is available at https://github.com/pingxuan-hlju/NGMDA.

References

Algavi YM, Borenstein E.. A data-driven approach for predicting the impact of drugs on the human microbiome. Nat Commun 2023;14:3614. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ali A. Bacterial sensitivity of Serratia marcescens against antibiotics. Int J Sci Eng Res 2018;9:361–3. [Google Scholar]
Brody S, Alon U, Yahav E. On the expressivity role of LayerNorm in transformers’ attention. In: Findings of the Association for Computational Linguistics: ACL 2023. 14211–21, Toronto, Canada: Association for Computational Linguistics. 2023. [Google Scholar]
Diao C, Loynd R. Relational attention: generalizing transformers for graph-structured tasks. In: International Conference on Learning Representations, Kigali, Rwanda 2023. OpenReview.net.
Dridi B, Lupien A, Bergeron MG. et al. Differences in antibiotic-induced oxidative stress responses between laboratory and clinical isolates of Streptococcus pneumoniae. Antimicrob Agents Chemother 2015;59:5420–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Duvallet C, Gibbons SM, Gurry T. et al. Meta-analysis of gut microbiome studies identifies disease-specific and shared responses. Nat Commun 2017;8:1784. [DOI] [PMC free article] [PubMed] [Google Scholar]
Dwivedi VP, Luu AT, Laurent T. et al. Graph neural networks with learnable structural and positional representations. In: International Conference on Learning Representations, (Virtual) 2022 . OpenReview.net.
Eibach D, Al-Emran HM, Dekker DM. et al. The emergence of reduced ciprofloxacin susceptibility in Salmonella enterica causing bloodstream infections in rural Ghana. Clin Infect Dis 2016;62 (Suppl. 1):S32–6. [DOI] [PubMed] [Google Scholar]
Escolar C, Gómez D, Del Carmen Rota García M. et al. Antimicrobial resistance profiles of listeria monocytogenes and Listeria innocua isolated from ready-to-eat products of animal origin in Spain. Foodborne Pathog Dis 2017;14:357–63. [DOI] [PubMed] [Google Scholar]
Gao H, Sun J, Wang Y. et al. Predicting metabolite–disease associations based on auto-encoder and non-negative matrix factorization. Brief Bioinform 2023;24:bbad259. [DOI] [PubMed] [Google Scholar]
Gollapudi S, Kim CH, Roshanravan B. et al. Ciprofloxacin inhibits activation of latent human immunodeficiency virus type 1 in chronically infected promonocytic U1 cells. AIDS Res Hum Retroviruses 1998;14:499–504. [DOI] [PubMed] [Google Scholar]
Hacioglu M, Haciosmanoglu E, Birteksoz-Tan AS. et al. Effects of ceragenins and conventional antimicrobials on Candida albicans and Staphylococcus aureus Mono and multispecies biofilms. Diagn Microbiol Infect Dis 2019;95:114863. [DOI] [PubMed] [Google Scholar]
Hattori M, Tanaka N, Kanehisa M. et al. SIMCOMP/SUBCOMP: chemical structure search servers for network analyses. Nucleic Acids Res 2010;38:W652–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Huang J, Ling CX.. Using AUC and accuracy in evaluating learning algorithms. IEEE Trans Knowl Data Eng 2005;17:299–310. [Google Scholar]
Kim M-C, Woo G-J.. Characterization of antimicrobial resistance and quinolone resistance factors in high-level ciprofloxacin-resistant Enterococcus faecalis and Enterococcus faecium isolates obtained from fresh produce and fecal samples of patients. J Sci Food Agric 2017;97:2858–64. [DOI] [PubMed] [Google Scholar]
Knox C, Wilson M, Klinger CM. et al. DrugBank 6.0: the DrugBank Knowledgebase for 2024. Nucleic Acids Res 2024;52:D1265–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li F, Zhang Z, Guan J. et al. Effective drug–target interaction prediction with mutual interaction neural network. Bioinformatics 2022;38:3582–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Long Y, Wu M, Kwoh CK. et al. Predicting human microbe–drug associations via graph convolutional network with conditional random field. Bioinformatics 2020a;36:4918–27. [DOI] [PMC free article] [PubMed] [Google Scholar]
Long Y, Wu M, Liu Y. et al. Ensembling graph attention networks for human microbe–drug association prediction. Bioinformatics 2020b;36:i779–86. [DOI] [PubMed] [Google Scholar]
Lv Q, Ding M, Liu Q. et al. Are we really making much progress? Revisiting, benchmarking and refining heterogeneous graph neural networks. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. Singapore: Association for Computing Machinery 2021. 1150–60.
Ma Q, Tan Y, Wang L.. GACNNMDA: a computational model for predicting potential human microbe-drug associations based on graph attention network and CNN-based classifier. BMC Bioinformatics 2023;24:35. [DOI] [PMC free article] [PubMed] [Google Scholar]
Meng R, Yin S, Sun J. et al. scAAGA: single cell data analysis framework using asymmetric autoencoder with gene attention. Comput Biol Med 2023;165:107414. [DOI] [PubMed] [Google Scholar]
Nejman D, Livyatan I, Fuks G. et al. The human tumor microbiome is composed of tumor type–specific intracellular bacteria. Science 2020;368:973–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
Peng W, Lan W, Zhong J. et al. A novel method of predicting microRNA-disease associations based on microRNA, disease, gene and environment factor networks. Methods 2017;124:69–77. [DOI] [PubMed] [Google Scholar]
Peng W, Chen T, Dai W.. Predicting drug response based on multi-omics fusion and graph convolution. IEEE J Biomed Health Inform 2021;26:1384–93. [DOI] [PubMed] [Google Scholar]
Peng W, Che Z, Dai W. et al. Predicting miRNA-disease associations from miRNA-gene-disease heterogeneous network with multi-relational graph convolutional network model. IEEE/ACM Trans Comput Biol Bioinform 2022a;20:3363–75. [DOI] [PubMed] [Google Scholar]
Peng W, Liu H, Dai W. et al. Predicting cancer drug response using parallel heterogeneous graph convolutional networks with neighborhood interactions. Bioinformatics 2022b;38:4546–53. [DOI] [PubMed] [Google Scholar]
Peng W, Tang Q, Dai W. et al. Improving cancer driver gene identification using multi-task learning on graph convolutional network. Brief Bioinform 2022c;23:bbab432. [DOI] [PubMed] [Google Scholar]
Rackaityte E, Lynch SV.. The human microbiome in the 21st century. Nat Commun 2020;11:5256. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rajput A, Thakur A, Sharma S. et al. aBiofilm: a resource of anti-biofilm agents and their potential implications in targeting antibiotic drug resistance. Nucleic Acids Res 2018;46:D894–900. [DOI] [PMC free article] [PubMed] [Google Scholar]
Saito T, Rehmsmeier M.. The precision-recall plot is more informative than the roc plot when evaluating binary classifiers on imbalanced datasets. PLoS One 2015;10:e0118432. [DOI] [PMC free article] [PubMed] [Google Scholar]
Stalin N, Srinivasan P.. Molecular characterization of antibiotic resistant Vibrio harveyi isolated from shrimp aquaculture environment in the South East Coast of India. Microb Pathog 2016;97:110–8. [DOI] [PubMed] [Google Scholar]
Sun Y-Z, Zhang D-H, Cai S-B. et al. MDAD: a special resource for microbe-drug associations. Front Cell Infect Microbiol 2018;8:424. [DOI] [PMC free article] [PubMed] [Google Scholar]
Szczuka E, Jabłońska L, Kaznowski A.. Effect of subinhibitory concentrations of tigecycline and ciprofloxacin on the expression of biofilm-associated genes and biofilm structure of Staphylococcus epidermidis. Microbiology (Reading) 2017;163:712–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tan Y, Zou J, Kuang L. et al. GSAMDA: a computational model for predicting potential microbe–drug associations based on graph attention network and sparse autoencoder. BMC Bioinformatics 2022;23:492. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tian Z, Yu Y, Fang H. et al. Predicting microbe–drug associations with structure-enhanced contrastive learning and self-paced negative sampling strategy. Brief Bioinform 2023;24:bbac634. [DOI] [PubMed] [Google Scholar]
Veličković P, Cucurull G, Casanova A. et al. Graph attention networks. In: International Conference on Learning Representations, Vancouver, Canada2018. OpenReview.net.
Wang D, Guo Q, Yuan Y. et al. The antibiotic resistance of helicobacter pylori to five antibiotics and influencing factors in an area of China with a high risk of gastric cancer. BMC Microbiol 2019;19:152. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang S, Hui C, Zhang T. et al. Graph reasoning method based on affinity identification and representation decoupling for predicting lncRNA-disease associations. J Chem Inf Model 2023a;63:6947–58. [DOI] [PubMed] [Google Scholar]
Wang T, Sun J, Zhao Q.. Investigating cardiotoxicity related with hERG channel blockers using molecular fingerprints and graph attention mechanism. Comput Biol Med 2023b;153:106464. [DOI] [PubMed] [Google Scholar]
Wang W, Zhang L, Sun J. et al. Predicting the potential human lncRNA–miRNA interactions based on graph convolution network with conditional random field. Brief Bioinform 2022;23:bbac463. [DOI] [PubMed] [Google Scholar]
Yin B, Wang X, Yuan F. et al. Research progress on the effect of gut and tumor microbiota on antitumor efficacy and adverse effects of chemotherapy drugs. Front Microbiol 2022;13:899111. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhang R, Jones MM, Moussa H. et al. Polymer–antibiotic conjugates as antibacterial additives in dental resins. Biomater Sci 2019;7:287–95. [DOI] [PubMed] [Google Scholar]
Zhu S, Jiang Y, Xu K. et al. The progress of gut microbiome research related to brain disorders. J Neuroinflammation 2020;17:25. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zimmermann M, Zimmermann-Kogadeeva M, Wegmann R. et al. Mapping human microbiome drug metabolism by gut bacteria and their genes. Nature 2019;570:462–7. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

btae025_Supplementary_Data

Click here for additional data file.^{(1.3MB, zip)}

Data Availability Statement

The dataset is available at https://github.com/pingxuan-hlju/NGMDA.

[btae025-B1] Algavi YM, Borenstein E.. A data-driven approach for predicting the impact of drugs on the human microbiome. Nat Commun 2023;14:3614. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B2] Ali A. Bacterial sensitivity of Serratia marcescens against antibiotics. Int J Sci Eng Res 2018;9:361–3. [Google Scholar]

[btae025-B3] Brody S, Alon U, Yahav E. On the expressivity role of LayerNorm in transformers’ attention. In: Findings of the Association for Computational Linguistics: ACL 2023. 14211–21, Toronto, Canada: Association for Computational Linguistics. 2023. [Google Scholar]

[btae025-B4] Diao C, Loynd R. Relational attention: generalizing transformers for graph-structured tasks. In: International Conference on Learning Representations, Kigali, Rwanda 2023. OpenReview.net.

[btae025-B5] Dridi B, Lupien A, Bergeron MG. et al. Differences in antibiotic-induced oxidative stress responses between laboratory and clinical isolates of Streptococcus pneumoniae. Antimicrob Agents Chemother 2015;59:5420–6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B6] Duvallet C, Gibbons SM, Gurry T. et al. Meta-analysis of gut microbiome studies identifies disease-specific and shared responses. Nat Commun 2017;8:1784. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B7] Dwivedi VP, Luu AT, Laurent T. et al. Graph neural networks with learnable structural and positional representations. In: International Conference on Learning Representations, (Virtual) 2022 . OpenReview.net.

[btae025-B8] Eibach D, Al-Emran HM, Dekker DM. et al. The emergence of reduced ciprofloxacin susceptibility in Salmonella enterica causing bloodstream infections in rural Ghana. Clin Infect Dis 2016;62 (Suppl. 1):S32–6. [DOI] [PubMed] [Google Scholar]

[btae025-B9] Escolar C, Gómez D, Del Carmen Rota García M. et al. Antimicrobial resistance profiles of listeria monocytogenes and Listeria innocua isolated from ready-to-eat products of animal origin in Spain. Foodborne Pathog Dis 2017;14:357–63. [DOI] [PubMed] [Google Scholar]

[btae025-B10] Gao H, Sun J, Wang Y. et al. Predicting metabolite–disease associations based on auto-encoder and non-negative matrix factorization. Brief Bioinform 2023;24:bbad259. [DOI] [PubMed] [Google Scholar]

[btae025-B11] Gollapudi S, Kim CH, Roshanravan B. et al. Ciprofloxacin inhibits activation of latent human immunodeficiency virus type 1 in chronically infected promonocytic U1 cells. AIDS Res Hum Retroviruses 1998;14:499–504. [DOI] [PubMed] [Google Scholar]

[btae025-B12] Hacioglu M, Haciosmanoglu E, Birteksoz-Tan AS. et al. Effects of ceragenins and conventional antimicrobials on Candida albicans and Staphylococcus aureus Mono and multispecies biofilms. Diagn Microbiol Infect Dis 2019;95:114863. [DOI] [PubMed] [Google Scholar]

[btae025-B13] Hattori M, Tanaka N, Kanehisa M. et al. SIMCOMP/SUBCOMP: chemical structure search servers for network analyses. Nucleic Acids Res 2010;38:W652–6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B14] Huang J, Ling CX.. Using AUC and accuracy in evaluating learning algorithms. IEEE Trans Knowl Data Eng 2005;17:299–310. [Google Scholar]

[btae025-B15] Kim M-C, Woo G-J.. Characterization of antimicrobial resistance and quinolone resistance factors in high-level ciprofloxacin-resistant Enterococcus faecalis and Enterococcus faecium isolates obtained from fresh produce and fecal samples of patients. J Sci Food Agric 2017;97:2858–64. [DOI] [PubMed] [Google Scholar]

[btae025-B16] Knox C, Wilson M, Klinger CM. et al. DrugBank 6.0: the DrugBank Knowledgebase for 2024. Nucleic Acids Res 2024;52:D1265–75. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B17] Li F, Zhang Z, Guan J. et al. Effective drug–target interaction prediction with mutual interaction neural network. Bioinformatics 2022;38:3582–9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B18] Long Y, Wu M, Kwoh CK. et al. Predicting human microbe–drug associations via graph convolutional network with conditional random field. Bioinformatics 2020a;36:4918–27. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B19] Long Y, Wu M, Liu Y. et al. Ensembling graph attention networks for human microbe–drug association prediction. Bioinformatics 2020b;36:i779–86. [DOI] [PubMed] [Google Scholar]

[btae025-B20] Lv Q, Ding M, Liu Q. et al. Are we really making much progress? Revisiting, benchmarking and refining heterogeneous graph neural networks. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. Singapore: Association for Computing Machinery 2021. 1150–60.

[btae025-B21] Ma Q, Tan Y, Wang L.. GACNNMDA: a computational model for predicting potential human microbe-drug associations based on graph attention network and CNN-based classifier. BMC Bioinformatics 2023;24:35. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B22] Meng R, Yin S, Sun J. et al. scAAGA: single cell data analysis framework using asymmetric autoencoder with gene attention. Comput Biol Med 2023;165:107414. [DOI] [PubMed] [Google Scholar]

[btae025-B23] Nejman D, Livyatan I, Fuks G. et al. The human tumor microbiome is composed of tumor type–specific intracellular bacteria. Science 2020;368:973–80. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B24] Peng W, Lan W, Zhong J. et al. A novel method of predicting microRNA-disease associations based on microRNA, disease, gene and environment factor networks. Methods 2017;124:69–77. [DOI] [PubMed] [Google Scholar]

[btae025-B25] Peng W, Chen T, Dai W.. Predicting drug response based on multi-omics fusion and graph convolution. IEEE J Biomed Health Inform 2021;26:1384–93. [DOI] [PubMed] [Google Scholar]

[btae025-B26] Peng W, Che Z, Dai W. et al. Predicting miRNA-disease associations from miRNA-gene-disease heterogeneous network with multi-relational graph convolutional network model. IEEE/ACM Trans Comput Biol Bioinform 2022a;20:3363–75. [DOI] [PubMed] [Google Scholar]

[btae025-B27] Peng W, Liu H, Dai W. et al. Predicting cancer drug response using parallel heterogeneous graph convolutional networks with neighborhood interactions. Bioinformatics 2022b;38:4546–53. [DOI] [PubMed] [Google Scholar]

[btae025-B28] Peng W, Tang Q, Dai W. et al. Improving cancer driver gene identification using multi-task learning on graph convolutional network. Brief Bioinform 2022c;23:bbab432. [DOI] [PubMed] [Google Scholar]

[btae025-B29] Rackaityte E, Lynch SV.. The human microbiome in the 21st century. Nat Commun 2020;11:5256. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B30] Rajput A, Thakur A, Sharma S. et al. aBiofilm: a resource of anti-biofilm agents and their potential implications in targeting antibiotic drug resistance. Nucleic Acids Res 2018;46:D894–900. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B31] Saito T, Rehmsmeier M.. The precision-recall plot is more informative than the roc plot when evaluating binary classifiers on imbalanced datasets. PLoS One 2015;10:e0118432. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B32] Stalin N, Srinivasan P.. Molecular characterization of antibiotic resistant Vibrio harveyi isolated from shrimp aquaculture environment in the South East Coast of India. Microb Pathog 2016;97:110–8. [DOI] [PubMed] [Google Scholar]

[btae025-B33] Sun Y-Z, Zhang D-H, Cai S-B. et al. MDAD: a special resource for microbe-drug associations. Front Cell Infect Microbiol 2018;8:424. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B34] Szczuka E, Jabłońska L, Kaznowski A.. Effect of subinhibitory concentrations of tigecycline and ciprofloxacin on the expression of biofilm-associated genes and biofilm structure of Staphylococcus epidermidis. Microbiology (Reading) 2017;163:712–8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B35] Tan Y, Zou J, Kuang L. et al. GSAMDA: a computational model for predicting potential microbe–drug associations based on graph attention network and sparse autoencoder. BMC Bioinformatics 2022;23:492. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B36] Tian Z, Yu Y, Fang H. et al. Predicting microbe–drug associations with structure-enhanced contrastive learning and self-paced negative sampling strategy. Brief Bioinform 2023;24:bbac634. [DOI] [PubMed] [Google Scholar]

[btae025-B37] Veličković P, Cucurull G, Casanova A. et al. Graph attention networks. In: International Conference on Learning Representations, Vancouver, Canada2018. OpenReview.net.

[btae025-B38] Wang D, Guo Q, Yuan Y. et al. The antibiotic resistance of helicobacter pylori to five antibiotics and influencing factors in an area of China with a high risk of gastric cancer. BMC Microbiol 2019;19:152. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B39] Wang S, Hui C, Zhang T. et al. Graph reasoning method based on affinity identification and representation decoupling for predicting lncRNA-disease associations. J Chem Inf Model 2023a;63:6947–58. [DOI] [PubMed] [Google Scholar]

[btae025-B40] Wang T, Sun J, Zhao Q.. Investigating cardiotoxicity related with hERG channel blockers using molecular fingerprints and graph attention mechanism. Comput Biol Med 2023b;153:106464. [DOI] [PubMed] [Google Scholar]

[btae025-B41] Wang W, Zhang L, Sun J. et al. Predicting the potential human lncRNA–miRNA interactions based on graph convolution network with conditional random field. Brief Bioinform 2022;23:bbac463. [DOI] [PubMed] [Google Scholar]

[btae025-B42] Yin B, Wang X, Yuan F. et al. Research progress on the effect of gut and tumor microbiota on antitumor efficacy and adverse effects of chemotherapy drugs. Front Microbiol 2022;13:899111. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B43] Zhang R, Jones MM, Moussa H. et al. Polymer–antibiotic conjugates as antibacterial additives in dental resins. Biomater Sci 2019;7:287–95. [DOI] [PubMed] [Google Scholar]

[btae025-B44] Zhu S, Jiang Y, Xu K. et al. The progress of gut microbiome research related to brain disorders. J Neuroinflammation 2020;17:25. [DOI] [PMC free article] [PubMed] [Google Scholar]

[btae025-B45] Zimmermann M, Zimmermann-Kogadeeva M, Wegmann R. et al. Mapping human microbiome drug metabolism by gut bacteria and their genes. Nature 2019;570:462–7. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Multi-scale topology and position feature learning and relationship-aware graph reasoning for prediction of drug-related microbes

Ping Xuan

Jing Gu

Hui Cui

Shuai Wang

Nakaguchi Toshiya

Cheng Liu

Tiangang Zhang

Roles

Abstract

Motivation

Results

Availability and implementation

1 Introduction

2 Materials and methods

Figure 1.

2.1 Dataset

2.2 Calculation of microbe similarity

2.3 Microbe–drug heterogeneous graph

2.4 Heterogeneous graph node feature construction and enhancement

2.4.1 Heterogeneous graph node feature construction

2.4.2 Enhancing node embedding

Figure 2.

2.5 Neighbor feature fusion

Figure 3.

2.6 Heterogeneous GFF

2.6.1 Relationship type encoding

2.6.2 Relationship-aware graph transformer

2.7 Representation integration and optimization

3 Experimental evaluation and discussion

3.1 Evaluation metrics

3.2 Parameter settings

3.3 Ablation experiments

Table 1.

3.4 Comparison with other methods

Table 2.

Table 3.

Figure 4.

3.5 Case studies on three drugs

Table 4.

Table 5.

Table 6.

3.6 Prediction of novel microbe–drug associations

4 Conclusion

Supplementary Material

Contributor Information

Supplementary data

Conflict of interest

Funding

Data availability

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases