Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2022 May 4;18(5):e1009531. doi: 10.1371/journal.pcbi.1009531

Controlling astrocyte-mediated synaptic pruning signals for schizophrenia drug repurposing with deep graph networks

Alessio Gravina 1,*, Jennifer L Wilson 2, Davide Bacciu 1, Kevin J Grimes 2, Corrado Priami 1
Editor: Carl Herrmann3
PMCID: PMC9109907  PMID: 35507580

Abstract

Schizophrenia is a debilitating psychiatric disorder, leading to both physical and social morbidity. Worldwide 1% of the population is struggling with the disease, with 100,000 new cases annually only in the United States. Despite its importance, the goal of finding effective treatments for schizophrenia remains a challenging task, and previous work conducted expensive large-scale phenotypic screens. This work investigates the benefits of Machine Learning for graphs to optimize drug phenotypic screens and predict compounds that mitigate abnormal brain reduction induced by excessive glial phagocytic activity in schizophrenia subjects. Given a compound and its concentration as input, we propose a method that predicts a score associated with three possible compound effects, i.e., reduce, increase, or not influence phagocytosis. We leverage a high-throughput screening to prove experimentally that our method achieves good generalization capabilities. The screening involves 2218 compounds at five different concentrations. Then, we analyze the usability of our approach in a practical setting, i.e., prioritizing the selection of compounds in the SWEETLEAD library. We provide a list of 64 compounds from the library that have the most potential clinical utility for glial phagocytosis mitigation. Lastly, we propose a novel approach to computationally validate their utility as possible therapies for schizophrenia.

Author summary

Phagocytosis is a fundamental biological process to protect biological organisms from exogenous infectious particles as well as to preserve equilibrium and efficiency of the host by removing its unwanted cells. A dysregulation of the phagocytic activity can lead to severe consequences for the host. In this study, we focus on a recent theory that relates an excessive phagocytic activity in brain cells, and a consequent abnormal reduction in brain volume, to the development of schizophrenia. Our working hypothesis is that pharmaceutical compounds that can reduce excessive of phagocytic activity might prove effective as a schizophrenia treatment. Rather than attempting to develop ex-novo such a chemical compound, we rely on a more cost-effective and efficient approach that seeks candidate therapies in a set of approved chemical compounds. To achieve this, we train a machine learning model capable of predicting, with good accuracy, the ability of a molecular compound to increase or decrease phagocytosis in the target brain cells. Our approach leverages learning models capable of directly processing the molecular graph of the compound, leading to the identification of 64 candidate drugs of potential clinical utility.


This is a PLOS Computational Biology Methods paper.

Introduction

Schizophrenia is a chronic and severe mental disorder that affects how a person thinks, feels, and behaves. It is expressed as a combination of symptoms, such as recurrent psychosis, social withdrawal, anhedonia, and cognitive dysfunctions. Worldwide about 1% of the population is diagnosed with schizophrenia, with 100,000 new cases annually only in the United States [1].

A recent study [2] states that brain volumes, measured on Magnetic Resonance Imaging (MRI) scans, are abnormal in patients with schizophrenia compared to unaffected individuals, with a reduction in both grey and white matter. In particular, the decreased density of dendritic spines in schizophrenia subjects has been supposed by MacDonald et al. [3] as the result of an excessive pruning activity against synapses. This action is assumed to be performed by glial cells, which are non-neuronal cells with multiple functions in the central nervous system to support and remove neurons. This assumption is supported by the evidence that glial phagocytic activity may be directly associated with the prevalence of various neurodegenerative diseases due to hyperactivation of phagocytic pathways [4, 5]. In addition, the novel PET tracer binds to synaptic vesicle glycoprotein 2A (SV2A) and shows diminished uptake in the frontal and anterior cingulate cortex in individuals with schizophrenia [6].

Towards the goal of discovering novel treatments for schizophrenia, previous work conducted a large-scale phenotypic screen to discover compounds with the ability to alter glial cell phagocytosis. However, understanding structure activity relationships is a challenge in these screens. Further, generating accurate models is difficult because it is not known what chemical information is most associated with predicting chemical function and how to best represent this information for predictive models. Additionally, further experiments remain the gold standard for validating model predictions. Yet, it is not always possible to conduct additional high-throughput screens and we require alternative methods for testing the utility of model predictions.

The compound property/activity prediction problem is a task faced by pharmaceutical companies and academia to improve the comprehension of diseases, discover new drugs, or identify new indications of existing drugs. It is standard practice to scan large libraries of compounds to test their biological activity. However, this operation can be costly and time-consuming, and Machine Learning (ML) methods can be helpful to reduce the effort needed to run experiments. For that reason, in the last decades, several computational approaches have been proposed to determine compound properties, or as filter to select the most promising compounds for clinical and biological experiments [7, 8]. A pioneering work is that by Bianucci et al. [9], where the authors employed Cascade Correlation Networks for structures to predict the boiling point of Alkenes and to predict the affinity towards the Benzodiazepine/GABAA receptor by a group of Benzodiazepines. More recently, Banerjee et al. [10] have developed a ML model to discriminate between sweet and bitter taste of molecules. Specifically, the model leverages a static fingerprint of the molecule to predict the property through a Random Forest. Similarly, Lind et al. [11] feed a static fingerprint and oncogene mutation status to a Random Forest to predict the activity versus inactivity of drugs against cancer cell lines. These results demonstrate that it is possible to associate chemical information to biological outcomes. Yet, static fingerprints are not sufficient in all applications.

Explicitly for schizophrenia, Zhao et al. [12] explored five different ML approaches to repurpose drugs for schizophrenia, depression, and anxiety disorders. In particular, they considered Deep Neural Networks, Support Vector Machines, Elastic Net regression, Random Forest, and Gradient Boosted Trees. Models were trained to predict whether a drug is a known treatment for the disease or not, using drug expression profiles as inputs. Those profiles capture transcriptomic changes when HL60, PC3, and MCF7 cell lines were treated with a chemical. Xu et al. [13] proposed PhenoPredict, a ranking algorithm for schizophrenia drug repurposing. PhenoPredict infers drug treatments from diseases that are phenotypically related to schizophrenia. These models demonstrate the ability to connect chemical information to biological information, yet they are limited to predicting molecular changes (such as gene expression) and are not suited to predicting phagocytic activity from phenotypic screens.

This work studies the use of ML techniques to predict the effects of compounds on glial phagocytic activity that cause abnormal brain reduction in schizophrenia subjects. This work has been done in collaboration with SPARK at Stanford University [14, 15], the main node of a partnership network between university and industry experts in chemistry, biology, and medicine to advance academic biomedical research discoveries into promising new treatments for patients. The objective of this work is to propose a ML method apt to optimize drug phenotypic screens. Specifically, our method identify compounds that reduce glial phagocytic activity for the treatment of schizophrenia, which, to the best of our knowledge, has not been proposed before. Our contribution can be summarized as follows. First, we introduce a ML method based on Deep Graph Networks to predict if a compound can influence the glial phagocytic activity in the brain tissue. Then, we evaluate our method on a real high-throughput screening experiment provided by SPARK. The proposed model achieves a macro Area Under the ROC curve (AUROC) of 0.68 when predicting if a compound inhibits, intensifies, or does not affect the phagocytic activity. Afterwards, we perform an analysis to understand the potential benefits of our approach in a practical scenario. Specifically, we leverage our method to prioritize the selection of a new set of compounds in the SWEETLEAD library, leading to the identification of 64 potential candidates. Lastly, we propose a novel approach to understand the relevance of compounds to biological use case. That approach allows us to compare our results with the more than 287,000 references in the literature. With this analysis we highlight the effectiveness of the model in identify compounds that are already studied in relation to brain-related diseases.

Results and discussion

Compound dissimilarity analysis

Before constructing the ML model, we measured the dissimilarity between compounds in the dataset to better understand how much molecular structures differ from each other in our data. A library of highly similar compounds would prevent the model from sufficiently discriminating drug effects. For this analysis, we computed the Extended-Connectivity Fingerprint (ECFP) for each compound, using standard parameters (i.e., radius equal to 3 and length equal to 1024); then, we measured the dissimilarity for each pair of fingerprints. Note that the ECFP fingerprint is a non-adaptive vectorial representation that encodes the structure of a molecule (please refer to the Model section for a detailed description). For this analysis we leveraged two different metrics: Cosine distance, and Jaccard dissimilarity. Given A and B two vectors, the cosine distance is defined as

Cosine(A,B)=1-ABA2B2

while the Jaccard dissimilarity is

Jaccard(A,B)=1-|AB||AB|

as usual. Let’s consider dRn×n the squared upper triangular matrix where the element di,j represents the dissimilarity between fingerprints i and j, measured with one of the two metrics defined before. The final dissimilarity score is computed as

dissimilarity=1mi=0nj>indi,j

where m = n2/2.

Table 1 reports the final scores computed with the two metrics. Both scores are close to 1, highlighting that the compounds have distinct fingerprints, and therefore can be considered strongly dissimilar.

Table 1. Cosine distance and Jaccard dissimilarity between dataset’s compounds.

Name Avg ± Std 25 Perc. 50 Perc. 75 Perc. Max Min
Cosine 0.8189 ± 0.0281 0.7976 0.8162 0.8374 0.9476 0.7464
Jaccard 0.9026 ± 0.0173 0.8897 0.9017 0.9135 0.9877 0.8571

Afterward, we measured the scaffold diversity in our data with the same strategy proposed for the compound dissimilarity analysis. It is clear from Table 2 that there is a rich diversity even between scaffolds. Fig 1 emphasizes such dissimilarity by showing that only a few molecules in our data share the same scaffold. Indeed, there are 863 scaffolds associated with one molecule in the data, while only 26 are shared among 8 and 232 molecules. Such results highlight that the scaffold distribution is long-tailed. Lastly, Fig 2 shows through the Principal Component Analysis (PCA) that there is no evidence of any natural cluster neither in molecule or scaffold fingerprints. Thus, it is reasonable to assume that the compounds’ uniqueness in the dataset represents an index of the strong complexity of the task.

Table 2. Cosine distance and Jaccard dissimilarity between dataset’s compound scaffolds.

Name Avg ± Std 25 Perc. 50 Perc. 75 Perc. Max Min
Cosine 0.8234 ± 0.0750 0.7805 0.8092 0.8635 0.9999 0.7050
Jaccard 0.8990 ± 0.0341 0.8838 0.9033 0.9182 0.9772 0.8243

Fig 1. The distribution of the number of shared scaffolds within molecules in the dataset.

Fig 1

Fig 2.

Fig 2

(a) The PCA plot of the molecule ECFPs. (b) The PCA plot of the scaffold ECFPs.

Model selection and risk assessment

The best model used in our experiments has been selected by an empirical analysis on the SPARK’s high-throughput screening results (please refer to the Dataset section for details). We tested multiple models to understand which best predicted effects from our phenotypic screen. Each model used different information about the chemical structures in making predictions. These models include: MoRF, MoNN, LinNN, SAGENN, GaNN, ENN, and NeFPNN. The first two exploit static compound fingerprints by leveraging the ECFP technique. LinNN leverages adapting fingerprints computed by a MLP using only atom information. The latter employ several deep graph network based fingerprints, which exploit atom and topological information, with ENN and NeFPNN including also bond features. Please refer to the Model section for details.

Table 3 reports the predictive performance achieved in the 3-fold cross-validation on the development set, using the standard stratification schema that divides the compounds maintain the distribution of the target variable. All configurations outperform the baseline LinNN, whose performance is very close to a random guesser. However, ENN and NeFPNN resulted in a validation score that is substantially inferior with respect to the best performing models. These results suggest that bond information is not helpful in solving this task. The top four models have overlapping validation scores, but highly different training scores. This situation hints at the use of alternative stratification strategies.

Table 3. Macro-AUROC scores achieved with simple cross-validation.

Model Train Valid
MoNN 0.9018±0.0192 0.6759±0.0020
SAGENN 0.8999±0.0134 0.6688±0.0118
MoRF 0.9361±0.0031 0.6674±0.0082
GaNN 0.8418±0.0052 0.6548±0.0148
NeFPNN 0.7102±0.0058 0.6334±0.0086
ENN 0.6840±0.0121 0.6021±0.0093
LinNN 0.5007±0.0008 0.5016±0.0011

Given the neat performance discrepancy between models, we ran a more articulated stratification only for the top four configurations, i.e., MoNN, MoRF, SAGENN, GaNN. The complex strategy separates compounds into groups by maintaining high compound diversity in each split of the data (details discussed in the Experimental setting section). Table 4 shows that nearly all models have better (or overlapping) validation performances with respect to the simple cross-validation strategy discussed above. Training scores are 1 to 5 points smaller than in Table 3 and the validation scores have overlapping value ranges, but the training scores have less optimistic values. MoRF and GaNN are the configurations with the highest gain in validation with the complex stratification setting, while MoNN shows a reduction of approximately 2 points.

Table 4. Macro-AUROC scores achieved with complex cross-validation.

Model Train Valid
MoRF 0.9062±0.0051 0.6764±0.0046
GaNN 0.8301±0.0068 0.6680±0.0039
SAGENN 0.8751±0.0111 0.6547±0.0031
MoNN 0.8580±0.0182 0.6520±0.0038

After the model selection phase, where we selected the best hyper-parameters configuration for each model, we performed the risk assessment step by evaluating the models on the hold-out test set. Also in this case we considered only MoNN, MoRF, GaNN, and SAGENN. Table 5 shows that the configurations selected with the simple cross-validation often obtained a better performance than those selected with the complex stratification strategy. MoRF is the only model achieving an higher score in the latter case. With both methods, each model reached a performance in line with that obtained during model selection.

Table 5. Comparison of macro-AUROC scores computed during risk assessment.

Model Simple Complex
Train Test Train Test
GaNN 0.8397 0.6877 0.8397 0.6632
SAGENN 0.8466 0.6767 0.8466 0.6679
MoNN 0.8991 0.6705 0.8991 0.6386
MoRF 0.8786 0.6676 0.8833 0.6701

By analyzing the confusion matrices in Fig 3, we observe that a higher performance corresponds to an increased ability to correctly predict both increase phagocytosis and decrease phagocytosis classes. This result suggests that DGN-based models are more capable than the Morgan-based counterpart to recognize significant patterns to tackle this central task in our study.

Fig 3. Normalized confusion matrices of GaNN, SAGENN, MoNN, and MoRF on test set.

Fig 3

Upon investigation of the confusion matrices, models have relatively high prediction accuracy for predicting drugs with no effect on phagocytosis (center square on all matrices). All models performed well in predicting drugs that would decrease phagocytosis (bottom right of each matrix) with the GaNN having the highest accuracy for this compound effect. All models had relatively low prediction accuracy for compounds that increase phagocytosis (upper left square of each matrix) and erred on the side of predicting no effect for these drugs (upper middle square of each matrix). For our dataset, we were prioritizing compounds that decreased phagocytosis, and so relatively high prediction for these compounds relative to the increase phagocytosis group was suitable. Ultimately, we selected the GaNN as the best model for both its performance on the hold-out test set and its capacity to distinguish between the three classes.

Problem relaxation

To prove the complexity of the task, we relaxed the problem to predict whether a compound has a positive or negative impact on phagocytosis. We considered no change and increase phagocytosis classes as the negative label. Each model takes as input a compound and its concentration to predict the impact on phagocytosis. Thus, we shifted from multi-label to a binary classification problem.

We followed the same experimental procedure as in the Model selection and risk assessment section. We first performed a model selection phase by running a 3-fold cross-validation on the development set using the standard stratification schema. Given the strong discrepancy between validation and training scores obtained in this phase, we selected the top four model configurations and we run the more articulated stratification. Table 6 reports the results of both experiments. As it shows, relaxing the problem to binary classification helps improve the final performances. Indeed, the validation scores with the standard stratification are more than 10 points higher on average if compared to multi-label classification. When the more sophisticated stratification schema is employed, the difference increases to over 11 points. Particularly interesting is the improvement of LinNN, which performance is on par with NeFPNN. This result remarks the fact that bond information is not strictly helpful in solving this task.

Table 6. Comparison of macro-AUROC scores achieved with simple and complex cross-validation when only 2 classes are considered.

Model Simple Complex
Train Valid Train Valid
SAGENN 0.9313±0.0015 0.7898±0.0106 0.9090±0.0004 0.7825±0.0044
MoNN 0.9364±0.0260 0.7791±0.0150 0.9405±0.0013 0.7895±0.0013
GaNN 0.9094±0.0076 0.7770±0.0034 0.8995±0.0137 0.7776±0.0104
MoRF 0.9522±0.0380 0.7702±0.0170 0.9588±0.0016 0.7731±0.0049
ENN 0.9065±0.0283 0.7662±0.0035
LinNN 0.9212±0.0088 0.7251±0.0085
NeFPNN 0.8350±0.0822 0.7205±0.0191

After the model selection, we proceeded with the risk assessment phase only for the most performing configurations. Table 7 shows the results obtained during this step. The scores exhibit similar behavior to the validation scores with respect to the original problem. However, they are below the confidence interval discovered in the model selection phase. We believe that this is the consequence of some labeling noise introduced during class aggregation. Indeed, the class no change is a borderline class that may contain noise itself. For such reasons, we believe that the three-class splitting leads to better performances that are less prone to errors.

Table 7. Comparison of macro-AUROC scores computed during risk assessment when only 2 classes are considered.

Model Simple Complex
Train Test Train Test
SAGENN 0.8687 0.7694 0.8684 0.7533
MoNN 0.9348 0.7676 0.9345 0.7634
MoRF 0.9262 0.7445 0.9372 0.7500
GaNN 0.8457 0.7368 0.8852 0.7376

These findings show that by relaxing the problem it is possible to achieve higher performances. Therefore, it is reasonable to assume these results as proof of the complexity of the original problem. Indeed, in this case, the computational task of structure-function prediction is really hard because phagocytosis is a multi-protein biological process. This suggests that multiple molecules could bind proteins in this pathway and that there might not be one ideal structure for influencing this process. Moreover, this situation, followed by the high dissimilarity between molecule structures, indicates that our method learns good compound representations and is not overfitting molecular sub-structures for making predictions.

SWEETLEAD library repurposing

The main goal of our analysis is understanding the usability of our approach in a practical setting, i.e., prioritizing the selection of compounds to be tested in a biomedical experiment. We focus on the GaNN model which has emerged as the best configuration according to our model selection and risk assessment analysis. We leveraged the model to predict the impact, at different dosages, of a new set of compounds on astrocyte-mediated synaptic pruning in schizophrenia. We considered the compounds in the SWEETLEAD library [16], an in silico database of approved drugs, regulated chemicals, and herbals designed for drug discovery. The library contains 4314 compounds with 1391 of them marked as FDA approved.

We simulated with the GaNN model the impact of each compound in SWEETLEAD on the phagocytic activity at five different concentrations, i.e., 1.39, 2.78, 5.56, 11.11, 22.22 μM, to parallel the dose ranges used in the initial phenotypic screen and to mirror those typically used in literature.

Table 8 shows how many compounds in SWEETLEAD are predicted to belong to each of the three phagocytosis classes by our GaNN model. The GaNN generates different outcomes for different doses only in 29.49% of the compounds, so the model can be considered highly confident on the other predictions.

Table 8. The number of predicted compounds in SWEETLEAD library for each class.

FDA approved Complete library
no change 585 1647
decrease phagocytosis 309 964
increase phagocytosis 143 431
mixed* 354 1272

* model’s outcome change with different dosage.

We analyzed predicted compound effects by Anatomical Therapeutic Chemical (ATC) codes to understand the therapeutic use classification of these drugs. Additionally, ATC codes related to the neurological system would provide further evidence that a predicted drug could modify phagocytosis in the brain. ATC codes are part of a classification system, controlled by World Health Organization, that classifies drugs according to the organ or system on which they act and their therapeutic, pharmacological and chemical properties. ATC codes are hierarchically organized in five levels, where the first one is the more general and refers to the anatomical main group, while the latter are the more specific and indicates the chemical substance. Note that a drug can be associated with more than one ATC code.

We focused on FDA-approved compounds that have a match in DrugBank [17]. Fig 4 shows the first level of ATC codes with respect to each class. The decrease phagocytosis class is mostly characterized by the category N (nervous systems), confirming that the model can identify compounds that are already used to treat nervous system diseases. Intriguinly, the next highest category for decrease phagocitosys class is C (cardiovascular system). There is some literature evidence supporting the repurposing of cardiovascular system drugs for neurological conditions. Specifically, beta-blockers can reduce severity of migraines, and statins can reduce contrast-induced neuropathy [18]. This preliminary evidence suggests that the model may be predicting viable applications of these drugs to affect phagocytosis. The Cardiovascular system class drugs also contained the highest number of predicted increase phagocytosis drugs suggesting that we cannot predict new drug effects from ATC codes alone, and that even within-class drugs can have distinct effects.

Fig 4. First level ATC codes matched with each predicted class.

Fig 4

The multiple vertical bars chart the first level ATC codes with respect to the predicted class for each analysed compound. The letters on the x-axis refer to the ATC codes; A = alimentary tract and metabolism, B = blood and blood forming organs, C = cardiovascular system, D = dermatologicals, G = genito-urinary system and sex hormones, H = systemic hormonal preparations, excluding sex hormones and insulins, J = antiinfectives for systemic use, L = antineoplastic and immunomodulating agents, M = musculo-skeletal system, N = nervous system, P = antiparasitic products, insecticides and repellents, R = respiratory system, S = sensory organs, and V = various. We used the label mixed* to indicate those cases where the model prediction changes with different doses. We recall also that each compound may be associated with multiple ATC codes.

We selected 64 compounds that have most potential clinical utility. The selection was among compounds marked as FDA approved and that have been predicted to decrease phagocytosis with high confidence. The complete list is reported in Table 9.

Table 9. The list of selected compounds ranked by model confidence.

DrugBank ID Name Cluster DrugBank ID Name Cluster
1 DB00822 Disulfiram B 33 DB00485 Dicloxacillin A
2 DB01163 Amdinocillin A 34 DB01061 Azlocillin A
3 DB00739 Hetacillin A 35 DB00377 Palonosetron M
4 DB01000 Cyclacillin A 36 DB00283 Clemastine A
5 DB00937 Diethylpropion M 37 DB00481 Raloxifene M
6 DB00363 Clozapine B 38 DB00713 Oxacillin A
7 DB00408 Loxapine B 39 DB00883 Isosorbide dinitrate M
8 DB06209 Prasugrel M 40 DB00319 Piperacillin A
9 DB00334 Olanzapine B 41 DB01078 Deslanoside B
10 DB00462 Methscopolamine bromide B 42 DB00208 Ticlopidine M
11 DB05271 Rotigotine B 43 DB00572 Atropine B
12 DB00354 Buclizine B 44 DB01147 Cloxacillin A
13 DB01224 Quetiapine B 45 DB01104 Sertraline B
14 DB13996 Magnesium acetate B 46 DB01221 Ketamine B
15 DB00514 Dextromethorphan M 47 DB00219 Oxyphenonium B
16 DB00857 Terbinafine A 48 DB06787 Hexocyclium M
17 DB00245 Benzatropine B 49 DB00938 Salmeterol M
18 DB06718 Stanozolol M 50 DB00539 Toremifene M
19 DB00882 Clomifene M 51 DB00831 Trifluoperazine B
20 DB00758 Clopidogrel M 52 DB01160 Dinoprost tromethamine M
21 DB00561 Doxapram M 53 DB00341 Cetirizine M
22 DB00496 Darifenacin B 54 DB00390 Digoxin M
23 DB06230 Nalmefene M 55 DB00419 Miglustat M
24 DB00392 Profenamine B 56 DB01328 Cefonicid A
25 DB00925 Phenoxybenzamine B 57 DB00474 Methohexital B
26 DB00679 Thioridazine B 58 DB09346 Metrizoic acid B
27 DB00920 Ketotifen A 59 DB09363 Rauwolfia serpentina root M
28 DB01186 Pergolide B 60 DB06335 Saxagliptin M
29 DB00543 Amoxapine B 61 DB00355 Aztreonam A
30 DB01238 Aripiprazole B 62 DB00088 Alglucerase M
31 DB00865 Benzphetamine B 63 DB01180 Rescinnamine B
32 DB04844 Tetrabenazine B 64 DB06282 Levocetirizine M

We sought to computationally validate the utility of the selected compounds as possible therapies for schizophrenia. We exploited SciFinder [19], which is a database for chemical literature, to extract the 15 topics with highest frequencies per compound. We hypothesized that biological terms co-mentioned with drug names in the literature could represent a reasonable measure of the goodness of candidates. We recall that a topic is a term used by the Chemical Abstracts database to identify the general topic of a reference. We conducted the analysis over more than 287,000 references. Chemical compounds are associated with an average of 914 terms, and frequency of chemical-term co-mentions are in the tens to hundreds of mentions. We retain the top 15 topics with highest frequencies for further analysis. For our 64-chemicals, the frequencies for the 15 topics ranged from 6992 to 1. This level of co-mention was sufficient for us to analyze whether these compounds had utility for our repurposing goal.

We clustered the compounds into three main groups based on topics:

  • Brain (B), which contains all compounds with at least a brain-related topic in the top-15. We considered as brain-related topics the terms: Antidepressants, Antipsychotics, Bipolar disorder, Brain, Cognitive disorders, Depression, Epilepsy, Mental and behavioral disorders, Mood disorder, Multiple sclerosis, Obsessive-compulsive disorder, Parkinson disease, Psychosis, Schizophrenia;

  • Antibiotics (A), which contains all compounds with the topic Antibiotic in the top-15;

  • Miscellaneous (M), which contains all compounds with no brain or antibiotic related topics in the top-15.

We selected brain-related topics because this could suggest applications to neurological diseases. We also selected antibiotics because these compounds are generally well-tolerated and have known safety profiles. Moreover, some literature evidence suggest that antibiotics can directly affect schizophrenia [20] and there is mounting evidence that the gut microbiome affects brain health [21, 22]. Lastly, we kept a miscellaneous category to capture drugs without a clear alignment to either of these hypotheses.

We report the associations between compounds and clusters in Table 9. It appears that compounds with brain-related topics have a higher ranked probability predicted by the GaNN model with respect to others. Indeed, 20 of the first 30 compounds belong to the B cluster. These additional results reinforce the idea that the model is effective in identifying compounds that are already studied in relation to brain-related diseases. Three of the top five compounds ranked by the GaNN model are antibiotics suggesting that repurposing for schizophrenia could leverage the human gut microbiome-brain connection.

Lastly, we found evidence of the utility of some of our 64 candidates (i.e., Loxapine, Dextromethorphan, Thioridazine, Trifluoperazine, and Cetirizine) in the work of So et al. [23]. In their analysis, they leveraged GWAS data, and gene imputation to identify drug repurposing candidates for multiple psychiatric diseases. Their work presents a complimentary view that our predicted candidates influence biology relevant to psychiatric disease. We observe that a high recall score is not possible in this case because the authors report only the top-15 compounds.

Materials and methods

Dataset

We performed our experiments on top of SPARK’s high-throughput screening results, which aim was to screen for compounds that inhibit or activate MEGF10 [24] to correct aberrant astrocyte-mediated synaptic pruning in schizophrenia. With that purpose, the screen was a phagocytosis assay using astrocytes isolated from fetal human brain samples and synaptosomes prepared from mouse brain samples. They measured phagocytosis with a pH-sensitive fluorescent dye conjugated to the synaptosomes that is only activated when engulfed and localized to the low pH found in intracellular lysosomes. The screen was conducted in plates containing both positive and negative controls for data normalization (article in preparation).

The screening assessed 2218 different compounds at different concentrations, i.e., 1.39, 2.78, 5.56, 11.11, 22.22 μM. All the analyzed compounds derive from the Library of Pharmacologically Active Compounds (LOPAC) [25] and NIH Clinical Collection (NIHCC) [26], so they include inhibitors, receptor ligands, pharma-developed tools, and approved drugs. Due to the overlap of the two libraries, we removed duplicate results, leading to 10914 unique compound-dose combinations. For this analysis, we used median-normalized fluorescent signal as a proxy for relative phagocytosis.

We classified screening results by assigning a class to each instance. In particular, there are three class types:

  • increase phagocytosis—if the combination of compound and dose intensifies the phagocytic activity;

  • decrease phagocytosis—if the combination of compound and dose inhibits the phagocytic activity;

  • no change—if the combination of compound and dose does not affect the phagocytic activity.

We assigned the classes by applying a threshold on the number of cells with phagocytosis signal. More specifically,

{increasephagocytosisifβ130%ofthecontrolsnochangeif70%<β<130%ofthecontrolsdecreasephagocytosisifβ70%ofthecontrols

with β as the number of cells with phagocytosis signal with respect to the median of the plate.

Each compound is initially represented through its Simplified Molecular Input Line Entry System (SMILE) string. The SMILE string is a character string that captures the compounds’ elements and the bonds between them. For each compound we also include additional information regarding atoms and bonds, as shown in Table 10.

Table 10. The list of features regarding atoms and bonds.

Name Type Number of Values
Valence one-hot 7
Number of Hs one-hot 6
Hybridization one-hot 5
Symbol one-hot 11
Degree one-hot 7
Aromatic boolean 2
(a) Atom features

Finally, given D={1.39,2.78,5.56,11.11,22.22} the set of tested concentrations and Y={increasephagocytosis,nochange,decreasephagocytosis} the set of target values, the final dataset can be described as D={ci,di,yi}i=110914, where ci is the i-th tested instance (note that a single compound is tested under different concentrations), diD is the tested concentration for the instance, and yiY is the corresponding outcome.

For the purposes of our work, we represent each compound (originally expressed as a SMILE string) as a molecular graph, which is a consolidated representation of atoms and the bonds between them. We consider a molecular graph as an undirected graph defined as the tuple g=(V,E,X,E). The set V contains interacting entities, which in this case corresponds to atoms, and E is the set that contains the links among those entities, i.e., chemical bonds. On the other hand, XR|V|×|F| is the atom features matrix, where |F| is the number of available features for an atom. Analogously, ER|E|×|E| is the bond features matrix, where |E| is the number of available chemical bond features. We refer to xi and eij as the feature vector of atom i and feature vector of bond connecting atoms i and j, respectively. Also, we denote the neighborhood of a node iV as the set, N(i)={jV|{i,j}E}, that contains every nodes with a link with node i.

Model

We tackle the task of predicting glial phagocytic activity in brain tissue by an approach based on deep learning for graphs [27], given its power in encoding graph-based structures, e.g., molecules. Our model comprises the following two components:

  • a compound embedding module fcomp;

  • an output module fout.

From a high level point of view, given D the set of tested concentrations, a compound, c, is processed as follows. First the compound is processed by the compound embedding module to compute a representation vector from its molecular graph,

hc=fcomp(c)

that later is concatenated with the dose, dD, and passed to the output module. Hence, the final prediction is computed as

o=fout(hc,d).

This process is summarized visually in Fig 5.

Fig 5. A high level overview of the proposed model.

Fig 5

Given the molecular graph of a compound c and a dose dD, the model computes a vectorial representation hc using the compound embedding module fcomp and later computes the final result using the output module fout by leveraging the concatenation of hc and d.

We considered three different compound embedding modules and two output modules and we empirically confronted their effectiveness. We implemented the output module either by a Multi-Layer Perceptron (MLP) or by a Random Forest (RF) [28]. For the compound embedding module, we first leveraged the Extended-Connectivity Fingerprint (ECFP) [29] based on the Morgan algorithm [30]. ECFPs are static molecular fingerprints that exploit atom neighborhoods to represent molecules through a non-adaptive approach that does not take into consideration the predictive task at hand. A visual representation of this process is shown in Fig 6.

Fig 6. Fluorine’s influence in a circular fingerprint computation (radius equal to 3) of 1-Chloro-4-fluorobenzene.

Fig 6

Differently from the ECFP method, which is static, the other two approaches considered for the compound embedding module are adaptive and generate compound embeddings that are specialized for the specific predictive problem. To this end, we consider deep learning solutions that can process the compound represented as a molecular graph.

The first adaptive approach that we consider is a linear embedding model, implemented through a MLP that computes a vectorial representation for each atom in the compound, which then are aggregated by an element-wise sum or mean without taking into consideration any spatial information. The second approach leverages Deep Graph Networks (DGNs) [27], which learn a mapping function that compresses the complex relational information captured by a graph into an information-rich feature vector that reflects both the topological and the label information in the original graph. At a high level, given a molecular graph as input, a DGN computes a representation for each node through transformations that combine the previous representation of the node with its neighbor representations. Those transformations are often referred to as Graph Convolutional Layers (GCLs). Ultimately, node representations are aggregated to obtain a single embedding for the whole graph. An overview of this method is shown in Fig 7.

Fig 7. An example of DGN applied on 1-Chloro-4-fluorobenzene molecular graph.

Fig 7

Given the molecular graph of a compound, each GCL computes the representation of each node in the graph as a transformation of their neighbor representations. In the end, node representations are aggregated to obtain a vectorial representation that reflect the original molecular graph.

We investigated different DGN implementations. More specifically, the DGNs that we consider are based on GraphSAGE [31], Graph Attention Network (GAT) [32], Edge-Conditioned Convolution Network (ECC) [33, 34], and Neural Graph Fingerprint [35]. An overview of such methods is reported in S1 Appendix.

For our purposes, we evaluated the performance of seven configurations of compound embedding module and output module:

  • Linear Embedding and Neural Network (LinNN)—used as a baseline for this task;

  • ECFP fingerprint based on Morgan algorithm and Random Forest (MoRF)—commonly used in literature for biomedical-based problems [10, 11, 36, 37];

  • ECFP fingerprint based on Morgan algorithm and Neural Network (MoNN);

  • GraphSAGE and Neural Network (SAGENN);

  • GAT Network and Neural Network (GaNN);

  • ECC Network and Neural Network (ENN);

  • Neural Graph Fingerprint and Neural Network (NeFPNN).

Experimental setting

We split the data into a development set (80%) for model selection and a test set (20%) for risk assessment. We consider the obtained test set as a hold-out, in other words, as a set of examples only used to estimate the generalization performance of the model, and never used during the training phase. Internally to the development set, we used a 3-fold cross-validation for model selection. We generated each split in a stratified fashion. In other words, each split maintains the distribution with respect to a target variable. Specifically, we implemented stratification according to two strategies. The former splits the data by maintaining the distribution with respect to the target y; while the latter with respect to the target y, the cardinality of atoms in the compounds, and the concentrations. The rationale behind the latter strategy is to generate more homogeneous data splits, avoiding unbalanced distributions of molecule size and concentrations in the training and validation splits. We will refer to the first strategy as simple cross-validation, while the second as complex cross-validation.

The three classes are not balanced, with the no change category representing roughly 81% of samples. To preserve the minority classes we undersampled the majority class. At each epoch we randomly sampled from the no change instances to generate a subset with the same size of other classes. The rationale behind this approach is to keep the classes always balanced, and also to leverage all the data available. Indeed different sub-samples are extracted at each epoch. We recall that Random Forests are more resistant to data imbalance, therefore, we did not implement undersampling for RF-based models.

Our experiments can be summarized as follows. At first, we conduct the model selection and the risk assessment phases. The rational behind these steps is, first, to select the best hyper-parameters for the models among a set of candidates, and then to evaluate their generalization capability on a different set of data. Lastly, we use the best model, selected from the previous stage, in a real world drug repurposing scenario with the aim of understanding the potential in prioritizing the selection of compounds to be tested in a specific biomedical experiment.

We performed hyper-parameter tuning via grid search, optimizing the AUROC with macro-average, which is a good estimate of the classification performances since the dataset is balanced. We recall that the macro-AUROC is defined as

macro-AUROC=1ci=1cAUROCi

where c is the number of classes (in this work c = 3) and AUROCi is the metric computed for class i. The grids used in our experiments are reported in S2 Appendix. We trained models with fout = MLP to minimize the Cross-Entropy loss accumulated across all the instances in the dataset.

The experiments were carried on a Dell server with 4 Nvidia GPUs Tesla P100. We release openly the code implementing our methodology and reproducing our empirical analysis at: https://github.com/gravins/DGNs-for-schizophrenia.

Conclusion

This work investigates the benefits of ML for graphs to predict compounds that mitigate abnormal brain reduction induced by excessive glial phagocytic activity in people affected by schizophrenia. In this context, we designed a model that is able to recognize whether a compound can reduce, increase, or not influence phagocytosis. More precisely, the model takes as input a compound and a concentration to predict a score associated with the three possible compound effects. This allows us to anticipate compounds with potentially desirable clinical effects for patients with schizophrenia. Internally, the model leverages a static fingerprint (i.e., Morgan-based ECFP) or an adapting fingerprint (i.e., DGNs) to represent compounds. We have shown experimentally that our approach is effective and has good generalization capabilities. Indeed, we have found that the model can generalize its predictions when employed on an unseen library, identifying as potential beneficial compounds those already used to treat brain-related diseases. Lastly, we have presented a list of compounds that we believe have the most potential clinical utility against glial-mediated brain reduction in schizophrenia patients.

We tested multiple chemical representations and discovered that an adapting approach was sufficient for describing phenotypic screen effects. A static fingerprint was insufficient, yet including full bond information decreased model performance. This suggests that, in some cases, structure-function information requires knowledge of the atoms and their arrangements, but not the full detail of their connections. It may be advantageous for other rapid drug development programs to leverage this information to more efficiently predict compounds with desired effects. In this scenario, we were eager to test our model on new drug information and validate the utility of the model, however experimental validation was infeasible for this work. Instead, we tested a novel validation approach by using SciFinder to understand the relevance of compounds to biological use cases. Indeed, SciFinder recovered drug-phenotype associations that mapped to neurological terms, suggesting that emerging literature evidence supported the potential utility of model-predicted compounds. We anticipate that this can be used to further prioritize predicted compounds and minimize the needs for very large, follow-up validation screens and that other structure-functional studies using machine learning will benefit from our in silico validation approach.

Supporting information

S1 Appendix. Overview of the employed dynamic compound embedding modules based on DGNs.

(PDF)

S2 Appendix. Hyper-parameters tables.

(PDF)

Acknowledgments

This work has been partially supported by SPARK at Stanford University. The authors would like to thank SPARK members and Francesco Landolfi, University of Pisa, for the insightful discussions throughout the development of this work.

Data Availability

Data and Code are released on GitHub https://github.com/gravins/DGNs-for-schizophrenia.

Funding Statement

The authors received no specific funding for this work.

References

  • 1.Schizophrenia Symptoms, Patterns and Statistics and Patterns;. Available from: https://www.mentalhelp.net/schizophrenia/statistics/.
  • 2. Haijma S, Haren NEM, Cahn W, Koolschijn PC, Pol H, Kahn R. Brain Volumes in Schizophrenia: A Meta-Analysis in Over 18 000 Subjects. Schizophrenia bulletin. 2012;39. doi: 10.1093/schbul/sbs118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. MacDonald M, Alhassan J, Newman J, Richard M, Gu H, Kelly R, et al. Selective Loss of Smaller Spines in Schizophrenia. The American journal of psychiatry. 2017;174:appiajp201716070814. doi: 10.1176/appi.ajp.2017.16070814 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Sellgren C, Gracias J, Watmuff B, Biag J, Thanos J, Whittredge P, et al. Increased synapse elimination by microglia in schizophrenia patient-derived models of synaptic pruning. Nature Neuroscience. 2019. doi: 10.1038/s41593-018-0334-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Lee E, Chung WS. Glial Control of Synapse Number in Healthy and Diseased Brain. Frontiers in Cellular Neuroscience. 2019;13:42. doi: 10.3389/fncel.2019.00042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Onwordi EC, Halff EF, Whitehurst T, Mansur A, Cotel MC, Wells L, et al. Synaptic density marker SV2A is reduced in schizophrenia patients and unaffected by antipsychotics in rats. Nature Communications. 2020;11(1). doi: 10.1038/s41467-019-14122-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Abramenko N, Kustov L, Metelytsia L, Kovalishyn V, Tetko I, Peijnenburg W. A review of recent advances towards the development of QSAR models for toxicity assessment of ionic liquids. Journal of Hazardous Materials. 2020;384:121429. doi: 10.1016/j.jhazmat.2019.121429 [DOI] [PubMed] [Google Scholar]
  • 8. Bianucci AM, Micheli A, Sperduti A, Starita A. In: Cartwright HM, Sztandera LM, editors. A Novel Approach to QSPR/QSAR Based on Neural Networks for Structures. Berlin, Heidelberg: Springer Berlin Heidelberg; 2003. p. 265–296. Available from: 10.1007/978-3-540-36213-5_10. [DOI] [Google Scholar]
  • 9. Bianucci AM, Micheli A, Sperduti A, Starita A. Application of Cascade Correlation Networks for Structures to Chemistry. Applied Intelligence. 2000;12(1):117–147. doi: 10.1023/A:1008368105614 [DOI] [Google Scholar]
  • 10. Banerjee P, Preissner R. BitterSweetForest: A Random Forest Based Binary Classifier to Predict Bitterness and Sweetness of Chemical Compounds. Frontiers in Chemistry. 2018;6. doi: 10.3389/fchem.2018.00093 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Lind AP, Anderson PC. Predicting drug activity against cancer cells by random forest models based on minimal genomic information and chemical properties. PLOS ONE. 2019;14(7):1–20. doi: 10.1371/journal.pone.0219774 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Zhao K, So HC. Drug Repositioning for Schizophrenia and Depression/Anxiety Disorders: A Machine Learning Approach Leveraging Expression Data. IEEE Journal of Biomedical and Health Informatics. 2019;23(3):1304–1315. doi: 10.1109/JBHI.2018.2856535 [DOI] [PubMed] [Google Scholar]
  • 13. Xu R, Wang Q. PhenoPredict: A disease phenome-wide drug repositioning approach towards schizophrenia drug discovery. Journal of Biomedical Informatics. 2015;56:348–355. doi: 10.1016/j.jbi.2015.06.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.SPARK at Stanford;. https://sparkmed.stanford.edu/.
  • 15. Kim E, Omura P, Lo A. Accelerating biomedical innovation: A case study of the SPARK program at Stanford University, School of Medicine. Drug Discovery Today. 2017;. doi: 10.1016/j.drudis.2017.03.015 [DOI] [PubMed] [Google Scholar]
  • 16. Novick PA, Ortiz OF, Poelman J, Abdulhay AY, Pande VS. SWEETLEAD: an In Silico Database of Approved Drugs, Regulated Chemicals, and Herbal Isolates for Computer-Aided Drug Discovery. PLOS ONE. 2013;8(11). doi: 10.1371/journal.pone.0079568 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Wishart DS, Knox C, Guo AC, Cheng D, Shrivastava S, Tzur D, et al. DrugBank: a knowledgebase for drugs, drug actions and drug targets. Nucleic Acids Research. 2007;36(suppl_1):D901–D906. doi: 10.1093/nar/gkm958 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Ishida J, Konishi M, Ebner N, Springer J. Repurposing of approved cardiovascular drugs. Journal of Translational Medicine. 2016;14(1):269. doi: 10.1186/s12967-016-1031-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.CAS SciFinder;. Available from: https://scifinder.cas.org/scifinder.
  • 20. Liu F, Guo X, Wu R, Ou J, Zheng Y, Zhang B, et al. Minocycline supplementation for treatment of negative symptoms in early-phase schizophrenia: A double blind, randomized, controlled trial. Schizophrenia Research. 2014;153(1-3):169–176. doi: 10.1016/j.schres.2014.01.011 [DOI] [PubMed] [Google Scholar]
  • 21. Galland L. The gut microbiome and the brain. Journal of medicinal food. 2014;17(12):1261–1272. doi: 10.1089/jmf.2014.7000 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Hill-Burns EM, Debelius JW, Morton JT, Wissemann WT, Lewis MR, Wallen ZD, et al. Parkinson’s disease and Parkinson’s disease medications have distinct signatures of the gut microbiome. Movement Disorders. 2017;32(5):739–749. doi: 10.1002/mds.26942 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. So HC, Chau CKL, Chiu WT, Ho KS, Lo CP, Yim SHY, et al. Analysis of genome-wide association data highlights candidates for drug repositioning in psychiatry. Nature Neuroscience. 2017;20(10):1342–1349. doi: 10.1038/nn.4618 [DOI] [PubMed] [Google Scholar]
  • 24. Chung WS, Clarke L, Wang G, Stafford B, Sher A, Chakraborty C, et al. Astrocytes mediate synapse elimination through MEGF10 and MERTK pathways. Nature. 2013;504. doi: 10.1038/nature12776 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.LOPAC1280—The Library of Pharmacologically Active Compounds;. Available from: https://www.sigmaaldrich.com/life-science/cell-biology/bioactive-small-molecules/lopac1280-navigator.html.
  • 26.NIH Clinical Collection;. Available from: https://pubchem.ncbi.nlm.nih.gov/source/NIH%20Clinical%20Collection.
  • 27. Bacciu D, Errica F, Micheli A, Podda M. A gentle introduction to deep learning for graphs. Neural Networks. 2020;129:203–221. doi: 10.1016/j.neunet.2020.06.006 [DOI] [PubMed] [Google Scholar]
  • 28. Breiman L. Random Forest. Machine Learing. 2001; p. 5–32. doi: 10.1023/A:1010933404324 [DOI] [Google Scholar]
  • 29. Rogers D, Hahn M. Extended-Connectivity Fingerprints. Journal of chemical information and modeling. 2010;50:742–54. doi: 10.1021/ci100050t [DOI] [PubMed] [Google Scholar]
  • 30. Morgan HL. The Generation of a Unique Machine Description for Chemical Structures-A Technique Developed at Chemical Abstracts Service. Journal of Chemical Documentation. 1965;5(2):107–113. doi: 10.1021/c160017a018 [DOI] [Google Scholar]
  • 31.Hamilton WL, Ying R, Leskovec J. Inductive Representation Learning on Large Graphs. In: NIPS; 2017.
  • 32.Veličković P, Cucurull G, Casanova A, Romero A, Liò P, Bengio Y. Graph Attention Networks. International Conference on Learning Representations. 2018;.
  • 33.Simonovsky M, Komodakis N. Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs; 2017. p. 29–38.
  • 34.Gilmer J, Schoenholz SS, Riley PF, Vinyals O, Dahl GE. Neural Message Passing for Quantum Chemistry. In: Precup D, Teh YW, editors. Proceedings of the 34th International Conference on Machine Learning. vol. 70 of Proceedings of Machine Learning Research. PMLR; 2017. p. 1263–1272. Available from: http://proceedings.mlr.press/v70/gilmer17a.html.
  • 35. Duvenaud DK, Maclaurin D, Iparraguirre J, Bombarell R, Hirzel T, Aspuru-Guzik A, et al. Convolutional Networks on Graphs for Learning Molecular Fingerprints. In: Cortes C, Lawrence ND, Lee DD, Sugiyama M, Garnett R, editors. Advances in Neural Information Processing Systems 28. Curran Associates, Inc.; 2015. p. 2224–2232. [Google Scholar]
  • 36. Liu S, Alnammi M, Ericksen SS, Voter AF, Ananiev GE, Keck JL, et al. Practical Model Selection for Prospective Virtual Screening. Journal of Chemical Information and Modeling. 2019;59(1):282–293. doi: 10.1021/acs.jcim.8b00363 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Kapsiani S, Howlin BJ. Random forest classification for predicting lifespan-extending chemical compounds. Scientific Reports. 2021;11(1):13812. doi: 10.1038/s41598-021-93070-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009531.r001

Decision Letter 0

Feilim Mac Gabhann, Carl Herrmann

6 Jan 2022

Dear Mr. Gravina,

Thank you very much for submitting your manuscript "Controlling astrocyte-mediated synaptic pruning signals for schizophrenia drug repurposing with Deep Graph Networks" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

As you will see from the reviews, both reviewers criticize the fact that the code has not been made available in the GitHub repository, which did not allow an in depth assessment of the method. I therefore urge you to make the code available upon resubmission. You will also see that Reviewer 2 has concerns regarding the performance of the method, as well as on the strength on the validation results. From the methods point of view, the CV procedure should be also adapted according to the reviewer's comments. Reviewer 1 suggests a more concise description of the methods (e.g. in the "Compound dissimilarity analysis" section). Finally, the novelty with respect to already published LBVS methods should be stressed in your revised version.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Carl Herrmann, Ph.D.

Associate Editor

PLOS Computational Biology

Feilim Mac Gabhann

Editor-in-Chief

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: This work presents a machine learning-based model that predicts the effects of the given compound and its concentration. Then, the authors screen a great number of compounds by using the proposed computational methods, and validate selected compounds as possible therapies for schizophrenia. I have the following concerns.

(1) The authors developed a computational method for schizophrenia drug repurposing with Deep Graph Networks. The authors directly employed Deep Graph Networks to their problems but didn’t propose novel methods.

(2) Do all compounds have the five concentrations, i.e., 1.39, 2.78, 5.56, 11.11, 22.22. If so, how can we make prediction if the compound has concentrations different from these?

(3) The authors introduce too much basic knowledge about the Deep Graph Networks and molecular fingerprint ECFP, and it is better to make them concise. The writing and organization could be improved for better readability.

(4)The source codes and datasets are not publicly available, and the web link provided by the author is empty. It is difficult to objectively evaluate the quality of the data and explore details about the model implementation.

Reviewer #2: “Controlling astrocyte-mediated synaptic pruning signals for schizophrenia drug repurposing with Deep Graph Networks.”

The authors present results of various ligand-based virtual screening models for predicting a phenotypic endpoint potentially related to schizophrenia. The models are trained and validated using data from SPARK HTS involving a cell-based assay of astrocytes with readout based on phagocytotic activity. Here 2218 diverse compounds from LOPAC and NIHCC were screened at 5 doses (10914 unique compound-dose combinations with activity classes). The best performing model model based on macro-AUROC on held-out subset, a GNN, was then applied on the SWEETLEAD compound in a retrospective test to examine model performance. Model performance here was examined by the extent to it prioritized known CNS active molecules as determined based on ATC labels, which associate the molecules with their likely tissue/organ target sites of action. The authors prioritize 64 molecules from SWEETLEAD based on predicted reduction in synaptic phagocytotic activity and check the likely targets of these compounds based on analysis of SciFinder chemical abstracts terms categorizing the molecules.

The development and testing of deep graph networks for ligand-based virtual screening on phenotypic endpoints is currently of high interest to the computational drug discovery field and PLOS Comp Biol readership. The integration of dose with the compound structure input representation is, to my knowledge, novel and potentially an advance.

The manuscript is clearly written and concise.

The codes/data used in the work are not yet available on the github repository indicated in the text.

Major Issues

The AUROC of the top models (after sweeping hyperparameters) in validation on the held-out SPARK HTS data seems low (0.65-0.68), regardless of cross-validation approach and also poor in the risk-assessment test (Table 4—AUROC 0.64-0.69), compared to published performances of other LBVS models. The complex CV procedure, with diversification of compounds within the folds, seems like a sound approach but did not meaningfully enhance model performances in validation, risk-assessment phases. Although the SPARK molecule set was examined in terms of structural diversity (based on Jaccard and Cosine distances of chemical fingerprints), it is possible that molecules of similar scaffold/chemotype were in spread across folds. This might be mitigated by clustering the training molecules and confining cluster IDs to specific folds. This ensures that predictions are made on chemotypes outside of the training set. Also, rather than showing the quartiles in Table 1, a plot of the distributions (as histograms) would be more informative—as well as some examples of molecule pairs at the different dissimilarity values. It is not obvious what the dissimilarity scores reflect—perhaps some statistics on scaffold diversity (Bemis-Murcko) would be useful to reader.

The instances from the SPARK dataset were defined as unique compound-dose pairs. The inclusion of dose as part of the input representation could be a significant advance—especially when predicting complex phenotypic endpointsg. However, given the poor model performances, and the typical variability of single concentration points in HTS assays, the reviewer recommends a using just the compound representations as inputs and assigning activity labels based on the full dose-response curves (potency or binary/tertiary classes). This will reduce the number of instances from ~10000 to ~2000 but this would likely improve reliability of the labels and simplify the relationship between inputs and outputs. It’s understood that some molecules showed both pro- and anti-phagocytotic activities depending on dose. Such complex responses exhibited by the compound instances could be a result of artifacts. Assay responses at the higher concentrations are probably less relevant and might merely reflect toxicity. Perhaps using the observed response (pro- or anti-phagocytotic) at the lowest concentration point could resolve this issue. Also, since decreased phagocytotic activity is the therapeutically relevant endpoint, perhaps a simple binary classification could be used based on achieving some threshold decrease in phagocytotic response.

The use of ATC codes and Chemical Abstracts topics terms to associate molecule target sites for assessing model predictive performance is interesting but not sufficient for demonstrating the utility of the model for VS on phenotypic endpoints. That the molecules prioritized by the model are frequently associated with CNS therapeutics does not demonstrate that the model is useful for predicting the anti-phagocytotic response. The work would be much stronger with follow-up testing on some or all of the 64 prioritized molecules from SWEETLEAD in the original SPARK assay. Are there other molecules (outside of SWEETLEAD) that have been tested in this assay (or similar assay) that could be evaluated by the model retrospectively? Are there other molecules shown to be active in published assay data (ChEMBL or PubChem BioAssay) on cell-based or biochemical targets related to synaptic phagocytotic activity? Perhaps these data could be used in retrospective test.

Minor Issues

Table 1 – the degree units on the quartile values should be removed. Median (50%) is usually reported when reporting quantiles.

Line 113 – Table 3 does not show that nearly all models have better validation performance with complex validation procedure.

SI Table 9 -- # trees range probably should go much higher – I see much larger forests used with ECFP input representations (n_estimators) in Liu et al., 2019. https://pubs.acs.org/doi/10.1021/acs.jcim.8b00363).

Materials and Methods does not indicate code, libraries, or packages used for model implementations. This is necessary in cases where default hyperparameters are not indicted in text (e.g, in Random Forest what were the max features per tree, etc).

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No: link https://github.com/gravins/DGNs-for-schizophrenia is empty.

Reviewer #2: No: link to GitHub repo in manuscript but code/data not yet uploaded.

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009531.r003

Decision Letter 1

Feilim Mac Gabhann, Carl Herrmann

29 Mar 2022

Dear Mr. Gravina,

We are pleased to inform you that your manuscript 'Controlling astrocyte-mediated synaptic pruning signals for schizophrenia drug repurposing with Deep Graph Networks' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Carl Herrmann, Ph.D.

Associate Editor

PLOS Computational Biology

Feilim Mac Gabhann

Editor-in-Chief

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have addressed my concerns.

Reviewer #2: Authors have effectively addressed my concerns in previous review. Nice work--was a pleasure to read!

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009531.r004

Acceptance letter

Feilim Mac Gabhann, Carl Herrmann

29 Apr 2022

PCOMPBIOL-D-21-01801R1

Controlling astrocyte-mediated synaptic pruning signals for schizophrenia drug repurposing with Deep Graph Networks

Dear Dr Gravina,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Andrea Szabo

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Appendix. Overview of the employed dynamic compound embedding modules based on DGNs.

    (PDF)

    S2 Appendix. Hyper-parameters tables.

    (PDF)

    Attachment

    Submitted filename: Response to reviewers.pdf

    Data Availability Statement

    Data and Code are released on GitHub https://github.com/gravins/DGNs-for-schizophrenia.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES