Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2022 Oct 7;12:16846. doi: 10.1038/s41598-022-20404-3

Alzheimer’s disease: insights from a network medicine perspective

Federica Conte 1, Paola Paci 1,2,
PMCID: PMC9546925  PMID: 36207441

Abstract

Alzheimer’s disease (AD) is the most common neurodegenerative disease that currently lacks available effective therapy. Thus, identifying novel molecular biomarkers for diagnosis and treatment of AD is urgently demanded. In this study, we exploited tools and concepts of the emerging research area of Network Medicine to unveil a novel putative disease gene signature associated with AD. We proposed a new pipeline, which combines the strengths of two consolidated algorithms of the Network Medicine: DIseAse MOdule Detection (DIAMOnD), designed to predict new disease-associated genes within the human interactome network; and SWItch Miner (SWIM), designed to predict important (switch) genes within the co-expression network. Our integrated computational analysis allowed us to enlarge the set of the known disease genes associated to AD with additional 14 genes that may be proposed as new potential diagnostic biomarkers and therapeutic targets for AD phenotype.

Subject terms: Computational biology and bioinformatics, Biomarkers, Alzheimer's disease

Introduction

Alzheimer’s disease (AD) is a progressive neurodegenerative disorder estimated to affect nearly 30 million people worldwide1. AD is the most common cause of dementia in mid- and late-life, but its clinical impact is modified by other neurodegenerative and cerebrovascular conditions (e.g., impairment in memory, judgment, decision making, distorted language, and physical orientation)2,3. Over the last years, significant advances were made in the understanding of AD by investigating the molecular mechanisms underlying amyloid-β peptides and protein tau, both hallmarks of this disease. Despite this progress, no disease-modifying treatments currently exist in AD. Indeed, the amyloid-β and tau biomarkers are very useful in tracking AD progression but, unfortunately, they have not yet translated into treatments for AD. Thus, the identification of novel and effective molecular biomarkers for the early diagnosis and treatment of AD is urgently demanded.

In this perspective, fundamental insights may come from the new emerging research area of Network Medicine, which exploits concepts garnered from network theory to elucidate the relation between perturbations in the human interactome (i.e., the network of all physical interactions within a cell, from protein–protein to regulatory protein–DNA and metabolic interactions) and phenotypic disease manifestations4. In particular, an extensive literature on this topic demonstrated that the molecular determinants associated with a given disease (named disease genes) are not randomly scattered, but form locally dense and topologically/functionally well-defined regions in the human interactome (named disease modules)510. More precisely, the fundamental hypotheses of Network Medicine state that the molecular determinants associated with a specific disease: (i) have an increased tendency to interact with each other (local hypothesis); (ii) show a tendency to cluster in the same network neighbourhood (disease module hypothesis); (iii) show a tendency to be involved in closely disease-related cellular functions or causal molecular pathways (functional coherence hypothesis)4,7. These observations have fueled the development of numerous algorithms to interrogate disease etiology, model molecular and genetic interactions, identify potential biomarkers, and design therapeutic interventions9. Such algorithms were highly customized according to the study design, the phenotype under investigation, the biological question of interest, the molecular entities measured, and the type and size of the available datasets. Several of these algorithms1012 make use of the human interactome, also denoted as protein–protein interaction (PPI) network, and of the a priori knowledge of disease genes defined as genes with mutations known to have a phenotypic impact13,14. Among them, one of the most well-established algorithm is DIAMOnD (DIseAse MOdule Detection)12, which was designed to predict the full disease module around a set of known disease associated proteins (called seed proteins) in the human interactome by performing a systematic analysis which prioritizes the proteins having a significant fraction of their interactions with the seed proteins. Other algorithms make use of gene expression networks (GENs) that are built by calculating correlations between the expression profiles of each gene pair. Among them, one of the most promising algorithm is SWIM (SWItch Miner)15, which was designed to predict important (switch) genes within GENs that are associated with intriguing patterns of molecular co-abundance and may play a key role in phenotypic transitions in various biological settings2,1518. In particular, switch genes are defined as a special set of genes with peculiar topological features within GENs, i.e.: (i) they show coherent patterns of correlation, suggesting they may be co-regulated or functionally related; (ii) they form localized connected subnetworks/modules; (iii) they are not local hub within their module, but they act as connectors able to convey information among modules of the correlation network.

Summarizing, while PPI networks leverage experimentally confirmed interactomics data pointing out physical and functional interactions occurring among proteins, GENs leverage phenotype-specific gene expression data resulting in a context-specific gene interaction network.

Here, to combine the strengths of both methodologies, we integrated the DIAMOnD and SWIM results by mapping the list of the AD-specific switch genes in the subnetwork of the human interactome constituted by the nearest neighbours of the AD-associated seed proteins.

Regarding AD gene expression data, there exist several publically available datasets. For example, Liang et al. collected transcriptomic data coming from post-mortem brain tissue and identified region-specific gene expression alterations in AD brains19. In this study, we decided to pursue a blood-based gene expression analysis since the identification of a blood-based diagnostic signature could be extremely useful for pre-screening ahead of invasive and costly follow-up analysis2022. Indeed, blood-based biomarkers represent an ideal option as the first-step of the diagnostic process beginning in primary care settings, and may help to determine which individuals should receive a referral to assessment by specialists, including CSF analysis, MRI or amyloid PET diagnostics. Another advantage is that blood testing is already a well-established part of clinical routines, requiring no further training for health care professionals. In addition, the use of blood-based biomarkers may offer the possibility to test a wide range of other candidate pathophysiological biomarkers, reflecting the full spectrum of disease and driving molecular mechanisms underlying AD, beyond the standard amyloid- and tau-based tests21.

Our network medicine approach led to an in silico recognition of a novel disease gene signature for AD composed of proteins (14 switch genes plus the already known 99 AD-associated seed proteins) having unique, quantifiable characteristics that distinguish them from the other nodes in the human interactome: they are physically and functionally related and AD phenotype-specific in the human interactome.

We believe that our findings could provide advancements in the ongoing effort to identify effective diagnostic biomarkers and therapeutic targets for AD so to improve the clinical management of this dreadful disease.

Materials and methods

Alzheimer’s disease gene expression data

The AD dataset is available through the GEO public repository at accession numbers GSE63060 (batch 1) and GSE63061 (batch 2) published on August 05, 2015 and updated on May 03, 201920. As already described in7, data include expression profiling by array related to Alzheimer’s Disease (AD) and control samples (CTL) originating from the EU funded AddNeuroMed Cohort23, which is a large cross-European AD biomarker study relying on human blood as the source of RNA. In particular, GSE63060 comes from array A-MEXP-1171—Illumina HumanHT-12 v3.0 Expression BeadChipm and includes a total of 249 samples (145 AD, 104 CTL); whereas GSE63061 comes from array A-GEOD-10558—Illumina HumanHT-12 V4.0 expression beadchip and includes a total of 273 samples (139 AD and 134 CTL). The probe-sets were mapped to official gene symbols using the relative platform (GPL6947-13512 for GSE63060 and GPL10558-50081 for GSE63061) available from the GEO repository. Multiple probe measurements of a given gene were collapsed into a single gene measurement by considering the mean. By matching genes based on gene symbols, we created a single merged dataset with both batches and we used Combat function from R/Bioconductor package SVA to correct for batch-specific effects. Finally, we obtained a data matrix of 19,460 gene symbols (rows) and 522 samples (columns) including 284 AD and 238 CTL. Table 1 reports all available information for AD case–control samples of the gene expression dataset under study.

Table 1.

AddNeuroMed dataset.

Total samples Gender Age (mean ± SD)
control 238 143 F (60%); 95 M (40%) 72 ± 19.8 yr
AD case 284 184 F (65%); 100 M (35%) 70 ± 17 yr

AD, Alzheimer’s disease; F, Female; M, Male; sd, standard deviation; yr, years.

Human interactome and Alzheimer’s disease associated genes

The human interactome was downloaded from Ghiassian and coauthors12 that assembled direct physical protein interactions with reported experimental evidence (e.g., binary interactions, literature curated interactions, metabolic enzyme-coupled interactions) from several consolidated data sources such as IntAct24 MINT25, and BioGRID26. The union of all these interactions yields a network of 13,460 proteins that are interconnected by 141,296 physical interactions.

For the disease-gene associations, we considered the 29 Alzheimer’s disease associated genes collected by Ghiassian and coauthors12 and derived from OMIM (Online Mendelian Inheritance in Man) database27 and GWAS (Genome-Wide Association Studies) database13. In addition, we also included the findings of the very recent and fundamental study by Bellenguez and colleages28, where the authors performed a genome-wide association study totaling 111,326 clinically diagnosed/‘proxy’ AD cases and 677,663 controls. This study led to the identification of 75 risk loci corresponding to a list of 77 GWAS genes , which compasses 7 GWAS genes (i.e., CR1, BIN1, TREM2, CD2AP, PHA1 CLU, APP) collected by Ghiassian et al.12. The union of the two lists collected by Ghiassian et al.12 and identified by Bellenguez et al.28 ended up with a unique pool of 99 AD-associated GWAS genes (Supplementary Table 1).

DIAMOnD tool

In order to predict novel disease genes for AD, we exploited the network-based DIAMOnD (Disease Module Detection) tool12. DIAMOnD implements a series of well-defined steps described in details in12. Specially, DIAMOnD starts from the seed s0 and uses the hypergeometric distribution to calculate the probability of having drawn ks seed proteins (out of k total draws) from a population of N proteins (Fig. 1). In the network perspective, the N proteins are the nodes of the human interactome and k are the nearest neighbours of a certain protein in the network (Fig. 1). Through an iterative procedure, DIAMOnD computes, for all proteins of the interactome with at least one link with the seed, the so-called connectivity significance (CS). The CS represents the probability (p value) that a protein with a total of k links has ks or more connections to seed proteins than expected (Fig. 1) and it was used to prioritize the proteins. In particular, the protein with the highest rank (i.e. lowest p value or CS) becomes a “candidate protein” and it is added to the set of seed nodes, increasing their number from s0 to s1=s0+1. This procedure is iterated for a fixed number of iterations (i.e., 500) by expanding the disease module by one node at each iteration step. Thus, at the end of the 500 fixed iterations, the algorithm provides a ranked list of proteins (as many as the iterations performed), each with its own associated connectivity significance (p value). The order in which the DIAMOnD predictions are pulled into the disease module reflects their topological relevance to the disease.

Figure 1.

Figure 1

DIAMOnD methodology. The network corresponds to the human interactome where the orange circular nodes are the seed proteins, the green square node is the protein to test with k connections (red and grey thick links) including ks links to seed proteins (red thick links), the grey circular nodes refer to other proteins in the interactome. The ensembles considered in the hypergeometric test for the computation of connectivity significance are reported.

Biological evaluation of DIAMOnD predictions

In order to select the most promising disease protein candidates predicted by DIAMOnD, the so-called biological criterion was exploited12. This criterion is based on the hypothesis that proteins with biological annotations similar to the ones of the seed proteins of a given disease are more likely to be associated to that disease. The biological evaluation of DIAMOnD predictions exploits a sliding window approach. In particular, a sliding window, whose size is equal to the number of seed proteins, is defined and a functional enrichment analysis is performed to identify the biological annotations (e.g. GO terms or KEGG pathways) enriched in the list of the initial seed proteins. Candidate proteins within the sliding window that are annotated with any of the enriched biological annotations are considered true positives. The sliding window span across the list of DIAMOnD predicted proteins and at each iteration step, a hypergeometric test is performed to evaluate if the number of true positives is greater than what expected by chance. Thus, the application of this hypergeometric test provides a p value (called enrichment p value) associated with each iteration of the DIAMOnD algorithm. The maximal number of DIAMOnD predicted proteins that should be considered corresponds to the number of iterations such that the number of DIAMOnD proteins with direct statistically significant biological evidence reaches a plateau (enrichment p value ≤ 0.01, where 0.01 is the threshold of the enrichment p value chosen in the present study). The functional enrichment analysis of the initial seed proteins was conducted by considering the KEGG pathways annotations provided by Enrichr web tool and updated to 202129. KEGG pathways were considered statistically significant with an adjusted p value ≤ 0.05.

SWIMmeR tool

In order to identify AD-specific switch genes, we exploited the network-based SWIMmeR tool30, an R open-source version of its predecessor SWIM (SWitchMiner) written in MATLAB®15. SWIMmeR (and SWIM) implements a series of well-defined steps described in details in15,30. Specially, SWIMmeR starts from the computation of differentially expressed genes (DEGs) between two condition of interest (e.g. physiological versus pathological condition) and uses DEGs to build a correlation network where two nodes are connected if the absolute value of the Pearson correlation coefficient of their expression profiles is greater than a given threshold. This threshold should be selected in order to have a manageable network as well preserve the integrity of the network. Next, SWIMmeR identifies communities in the correlation network by means of the k-means clustering algorithm. Then, it searches for specific topological properties of the correlation network using the date/party/fight-club hub classification15, based on the Average Pearson Correlation Coefficients (APCCs) between the expression profiles of each hub (i.e., node with degree greater than 5) and its nearest neighbors. Finally, SWIMmeR assigns a role to each node based on its ability to convey information within and between clusters in the network through the computation of two parameters: the within-module degree, which quantifies how much a node is a hub in its own cluster, and thus, it is a measure of local connectivity; and the clusterphobic coefficient, which quantifies the ratio of internal to external connections of a node, and thus, it represents a measure of global connectivity. Switch genes are identified as a special subclass of fight-club hubs (i.e., hubs with negative APCC values) interacting mainly outside their own cluster. All the parameters used in this study for SWIMmeR run are reported in Table 2.

Table 2.

Summary of SWIMmeR parameters.

SWIMmeR run for AD dataset
Number of control samples 238
Number of AD case samples 284
FC threshold 1
FDR threshold 0.001
Number of DEGs 2327
PC threshold 0.46 (83th prc)
Numeber of network nodes 2249
Number of clusters 3
Number of switch genes 598

AD, Alzheimer’s disease; FC, fold-change; FDR, False Discovery Rate; DEGs, Differentially Expressed Genes; PC, Pearson Correlation.

Identification of disease modules

In order to test whether a given list of genes forms a statistically significant disease module, we exploited the procedure firstly proposed in7. Specifically, the genes were mapped onto the human interactome, the corresponding subnetwork was extracted, and the following three metrics were computed: (1) the size of the largest connected component (LCC); (2) the number of interactions in the LCC; and (3) the total number of interactions (edges). In order to complement these metrics with a measure of statistical significance, we evaluated the probability that the given list of genes was localized within a certain network neighborhood greater than expected by chance7. To do so, we randomly selected groups of proteins in the human interactome with the same size and degree distribution as the original list of genes and we computed the three above-mentioned metrics. This procedure was repeated 1,000 times, and we derived three distributions for all three metrics corresponding to the subgraph induced by the random gene set. The three metrics calculated for the original list of genes were z-score-normalized with respect to the corresponding reference random distribution. Subsequently, the p value for the given z statistic was calculated, expecting a p value ≤ 0.05 for genes forming a statistically significant disease module7.

Functional enrichment analysis

The functional enrichment analysis was performed by querying KEGG31, GO Gene Ontology32 and DisGeNET2 though Enrichr web tool29. p values were adjusted with the Benjamini–Hochberg method and a threshold equal to 0.05 was set to identify functional annotations and disease-gene associations significantly enriched amongst the selected genes.

Results

Study design

In the present study, we aim to propose a novel disease gene signature associated with Alzheimer’s disease by integrating two well-established Network Medicine methodologies, i.e. DIAMOnD and SWIM (in particular its R implementation called SWIMmeR). A schematic of our study design is depicted in Fig. 2.

Figure 2.

Figure 2

Study design. The figure depicts the schematic of the analysis applied in this study.

Identification of DIAMOnD genes

The DIAMOnD methodology was applied to enlarge the set of 99 known AD-associated genes (referred as seed) in the human interactome, offering a ranked list of 500 new candidate disease genes (refereed as DIAMOnD genes) having a significant fraction (connectivity significance ranging from 9.62E-22 to 3.52E-05) of their interactions with the seed (Supplementary Table 1).

This approach, however, requires an additional criterion to define the boundary of the AD disease module, i.e. a threshold for the total number of novel candidates to be considered. The connectivity significance (p value) cannot be used directly to define this threshold since the disease module grows at each iteration step and then the number of seed proteins on which the p value is based increases (see Materials and Methods). Since larger sets can produce smaller p- values, the absolute significance values obtained at different iteration steps cannot be compared to each other. Thus, in order to select the most promising DIAMOnD predictions, we exploited the so-called biological criterion proposed by12 and based on the assumption that candidates with biological characteristics similar to the ones of the initial seed are more likely to be disease associated as well (see Materials and Methods). We found that the DIAMOnD genes with direct biological evidence (enrichment p value ≤ 0.01) are the first 238, suggesting this as the maximal number to be considered (Fig. 3). This criterion allowed to identify 238 DIAMOnD predicted genes for AD (Supplementary Table 1).

Figure 3.

Figure 3

Biological evaluation of DIAMOnD predictions. The significance (enrichment p value ≤ 0.01) of the similarity between DIAMOnD predicted disease genes and seed genes suggests a cutoff corresponding to the iteration 238.

Identification of switch genes

The SWIM methodology was applied on the blood-based gene expression profiles of 284 AD samples and 238 control samples (Table 2) freely available from GEO database (see Materials and Methods) to extract the AD-specific switch genes.

Firstly, SWIMmeR identified 2327 genes showing a statistically significant differential expression (FDR < 0.01) between the AD and control condition (Fig. 4a and Supplementary Table 2). The differentially expressed genes (DEGs) were then used to build the AD correlation network. In this study, we selected a correlation threshold corresponding to the 83th percentile (Table 2) of the entire correlations’ (Fig. 4b). Next, SWIMmeR identified three clusters in the AD correlation network (Supplementary Figure 1a) and computed APCC distribution, revealing the three peaks associated with the date/party/fight-club hub classification15 (Fig. 4c). Finally, SWIMmeR assigned a role to each network node based on the values of the within-module degree and the clusterphobic coefficient in order to unveil the special set of switch genes (Fig. 4d).

Figure 4.

Figure 4

SWIMmeR results on AD gene expression data. (a) Differentially expressed genes (DEGs). Differentially expressed profiles are clustered according to genes (rows) and samples (columns) by using Pearson correlation distance as metrics. Heat map colours represent different expression levels increasing from blue to yellow. AD samples (case) are indicated in red, whereas control samples are indicated in green. (b) Connectivity of correlation network. The x-axis represents the Pearson correlation threshold varying in the chosen range, while the y-axis represents the fraction of nodes populating the largest component. The dashed red line corresponds to the selected threshold (i.e., 0.46 or 83th percentile). Note that y = 1 means that all nodes fall in the largest component and thus the network is fully connected; otherwise more components exist. (c) APCC distribution. Probability distribution of APCC for hubs (i.e., node with degree greater than 5) identified in the AD correlation network. Note the presence of the three peaks associated to the date/party/fight-club hub classification. (d) Heat cartography map. Dots correspond to nodes in the AD correlation network. Each node is coloured according to the value of the APCC between its expression profile and that of its nearest neighbours in the network.

The SWIM-based analysis of the AD gene expression dataset led to the identification of 598 switch genes, including 584 (98%) up-regulated and 14 (2%) down-regulated genes (Supplementary Table 2). Among them, we focused on those switch genes showing a coherent gene expression pattern and falling in the same network cluster, i.e. the 448 switch genes up-regulated in the AD condition and included in cluster 1 of the correlation network built by SWIMmeR (Supplementary Figure 1b and Supplementary Table 2). Notably, five of these switch genes (i.e. GBA, BCKDK, GRN, SORL1, RBCK1) are known disease genes of AD. Moreover, by performing a functional enrichment analysis, we found that these switch genes are significantly involved in several inflammatory and innate immune mechanisms (Supplementary Table 3) able to promote Alzheimer's disease according with a large body of evidence3336.

Recognition of AD gene signature

Starting from the known AD-associated genes (seed proteins), DIAMOnD algorithm allowed to predict 238 novel putative disease genes. By intersecting these 238 DIAMOnD genes with the 448 AD-specific switch genes, we found a set of 14 genes to be proposed to enlarge the known AD disease module (Fig. 5). The complete list of these 14 genes and their statistics are reported in Table 3. Interestingly, all of them are up-regulated in AD condition (Table 3).

Figure 5.

Figure 5

AD gene signature. By integrating the DIAMOnD and SWIMmeR results, the set of the known disease genes associated to AD (seed) was enlarged with additional 14 genes that are switch genes and that physically interact with each other and with the seed.

Table 3.

AD gene signature.

Gene symbol Gene name Gene ID p value FDR logFC Direction
AKT1 AKT serine/threonine kinase 1 207 1.05E−05 1.03E−04 0.0093 Up
CBL Cbl proto-oncogene 867 2.60E−06 3.06E−05 0.0162 Up
CSF3R Colony stimulating factor 3 receptor 1441 1.23E−08 2.73E−07 0.0228 Up
DOK3 Docking protein 3 79,930 1.50E−05 1.41E−04 0.0113 Up
HCK HCK proto-oncogene, Src family tyrosine kinase 3055 1.54E−06 1.93E−05 0.0165 Up
INPPL1 Inositol polyphosphate phosphatase like 1 3636 7.24E−07 9.83E−06 0.0182 Up
PRKCD Protein kinase C delta 5580 6.36E−11 2.45E−09 0.0139 Up
PTPN6 Protein tyrosine phosphatase non-receptor type 6 5777 2.31E−06 2.78E−05 0.0079 Up
STAT3 Signal transducer and activator of transcription 3 6774 1.23 E−04 8.89E−04 0.0153 Up
STAT5A Signal transducer and activator of transcription 5A 6776 6.02E−05 4.83E−04 0.0125 Up
STAT5B Signal transducer and activator of transcription 5B 6777 1.26E−07 2.13E−06 0.0174 Up
TNK2 Tyrosine kinase non receptor 2 10,188 2.34E−08 4.84E−07 0.0101 Up
TYK2 Tyrosine kinase 2 7297 4.93E−18 1.01E−15 0.0297 Up
WAS WASP actin nucleation promoting factor 7454 4.95E−07 7.11E−06 0.0157 Up

AD, Alzheimer’s disease; CTL, control; FC, fold-change AD/CTL; FDR, False Discovery Rate.

Then, following the fundamental principle of Network Medicine stating that the molecular determinants associated with a given disease should be agglomerate in specific regions of the human interactome (disease modules)4, we verified that this novel 14-gene signature, together with the seed proteins of AD, actually form a statistically significant disease module (Fig. 5 and Supplementary Figure 2).

Discussion

In the last few years, the new paradigm of Network Medicine overcame the conventional medicine paradigm ‘one gene, one drug, one disease’ mainly focused on treating the symptoms rather than discovering the causes of diseases4,5. According to the Network Medicine paradigm, diseases are rarely caused by a single gene mutation, but more typically by the deregulation of a set of genes interconnected with each other within the human interactome. Moreover, it is becoming increasingly evident that the molecular determinants associated with a given disease (named disease genes) have a high propensity to agglomerate in specific regions of the interactome, suggesting the existence of specific disease network modules for each disease510. Thus, identifying fully these disease modules and understanding the effects of their perturbations on disease onset and progression could lead to unveil new diagnostic biomarkers as well as therapeutic targets.

Following this innovative vision of medicine, in the present paper, we carried out a computational analysis by integrating consolidated network-medicine tools and concepts in order to unveil novel putative disease genes associated with one of the most common and awful neurodegenerative diseases, i.e., the Alzheimer’s disease. In particular, we exploited the DIAMOnD methodology12 combined with the SWIM methodology15,30 to enlarge the set of known AD-associated genes (seed) with additional 14 genes having peculiar and crucial characteristics, i.e.:

  • they have a significant fraction of their interactions with the seed of AD in the human interactome (Fig. 1);

  • they have biological characteristics similar to the seed of AD (Fig. 3);

  • they are switch genes that are similarly modulated (all up-regulated in AD) and are important inter-module connectors in the gene co-expression network (Table 3 and Fig. 4);

  • they form a statistically significant disease module in the human interactome (Fig. 5 and Supplementary Figure 2).

We finally investigated the current biological knowledge of the 14-gene signature in the AD framework.

The AKT serine/threonine kinase 1 (AKT1) is a key player in the signaling of insulin and other growth factors and its alterations were widely associated with AD pathology and lower cognitive function37,38. Indeed, insulin and the PI3K-AKT signaling pathway have a significant role in neuronal health as well as synapse formation and maintenance39. Interestingly, enhancement of PI3K-AKT signaling in the central nervous system by intranasal insulin treatments was shown to improve memory in vivo in mice and in human trials38. Also, the inositol polyphosphate phosphatase like 1 (INPPL1), which encodes the protein SHIP2, play a key role in the regulation of insulin function and growth factor receptors turnover. The study by Mostafavi et al.40 showed that the upregulation of INPPL1 gene significantly correlates with cognitive decline in human AD patients. Other independent studies reported SHIP2 functions as a mediator of amyloid toxicity via tau hyperphosphorylation41 and actin-cytoskeleton reorganization42.

The Cbl proto-oncogene (CBL) encodes a RING finger E3 ubiquitin ligase, which is an enzyme required for targeting substrates for degradation by the proteasome. Several studies have indicated that CBL functions as a negative regulator of many signal transduction pathways43,44, and in particular, it could suppress the Src-family tyrosine kinase Fyn previously identified as potential target for AD45. Another member of Src family of tyrosine kinases is the HCK proto-oncogene, Src family tyrosine kinase (HCK). In particular, HCK is a hematopoietic cell kinase whose dysregulation may affect microglia (i.e., resident immune cells in the central nervous system playing critical roles in brain immunity, development, and homeostasis) and accelerate early stage Alzheimer's disease-like neuropathology46. Recently, also the colony stimulating factor 3 receptor (CSF3R) and the docking protein 3 (DOK3) was found to be involved in the in microglial cell activation47,48.

The protein kinase C delta (PRKCD) encodes one of the protein kinase C family members that are involved in physiological processes related to learning and memory, and thus have been classified as cognitive kinases. Besides their implication in memory and cognition, PKC family members were found to regulate several pathways relevant for AD pathophysiology and thus proposed as potential therapeutic strategy against AD49. To our knowledge, there currently exist no studies associating the tyrosine kinase non receptor 2 (TNK2) as well as the WASP actin nucleation promoting factor (WAS) with the etiology and pathogenesis of AD.

Finally, the signal transducer and activator of transcription 3 (STAT3), the signal transducer and activator of transcription 5A (STAT5A) and 5B (STAT5A), and the tyrosine kinase 2 (TYK2) are key components of the JAK-STAT signaling pathway which seems to promote neuroinflammation in neurodegenerative diseases50.

Limitations of the study

One limitation of the DIAMOnD-based analysis is that it assumes that the new candidate disease genes must be topologically, functionally, and biologically close to the starting seed proteins. In fact, the algorithm firstly explores the nearest neighborhood of the seed proteins in the human interactome (topological vicinity) and prioritizes those proteins having a significant fraction of their physical and functional interactions with them (functional vicinity). Then, by using the biological criterion, it selects among them those proteins that have biological characteristics like the ones of the initial seed proteins (biological vicinity). However, the incompleteness of the human interactome provides a partial knowledge of the physical and functional interactions among all the human proteins. Thus, starting from the seed proteins used in12, no genes reported by Bellenguez et al.28 will be predicted because direct interactions between them are not yet known, and the two lists are now topologically and functionally distant in the human interactome. By increasing the number of DIAMOnD interactions up to 1000, just one protein out of the ones identified by Bellenguez et al.28 was found.

Conclusion

The present study showed how a systematic analysis exploiting tools and concepts of the new paradigm of Network Medicine can improve the prediction of a novel AD gene signature that may contribute to the pathological phenotype. Indeed, our network-based pipeline allowed to complement the current knowledge of AD-associated genes, mainly stemmed from GWAS analysis, with additional putative AD biomarkers, mainly stemmed from measuring transcript abundance and gene expression patterns. It is worth stressing that our study has a purely computational nature and experimental validations would be necessary to investigate the actual role of the identified genes as AD biomarkers. However, we believe that our findings could deepen the understanding of mechanisms underlying AD pathogenesis, maybe suggesting new potential diagnostic biomarkers and/or therapeutic targets for this awful neurodegenerative disease.

Supplementary Information

Supplementary Table 1. (47.1KB, xlsx)
Supplementary Table 2. (1,003.5KB, xlsx)
Supplementary Table 3. (372.1KB, xlsx)

Author contributions

Conceptualization, P.P.; supervision, P.P; funding acquisition, P.P; methodology, P.P., F.C.; formal analysis, F.C.; investigation, P.P., F.C.; data curation and visualization, F.C.; writing—original draft preparation, all authors; writing—review and editing, all authors. All authors have read and agreed to the published version of the manuscript.

Funding

This work has been partially funded by the BiBiNet project (Grant Number: H35F21000430002) within the POR-Lazio FESR 2014-2020, by PRIN 2017—Settore ERC LS2—Codice Progetto 20178L3P38, and by Sapienza University of Rome grant, Progetto di ricerca di Ateneo 2021 (grant number: RM12117A34663A2C). The funding body played no roles in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Data availability

Data that support the findings of this study are openly available in GEO public repository and in Supporting Information files of12. SWIMmeR code is freely available at https://github.com/sportingCode/SWIMmeR. DIAMOnD code is freely available from the Supporting Information files of12. All the other relevant data are within this manuscript and its Supporting Information files.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-022-20404-3.

References

  • 1.Haque RU, Levey AI. Alzheimer’s disease: A clinical perspective and future nonhuman primate research opportunities. Proc. Natl. Acad. Sci. 2019;116:26224–26229. doi: 10.1073/pnas.1912954116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Hansen TB, et al. Natural RNA circles function as efficient microRNA sponges. Nature. 2013;495:384–388. doi: 10.1038/nature11993. [DOI] [PubMed] [Google Scholar]
  • 3.Knopman DS, et al. Alzheimer disease. Nat. Rev. Dis. Primer. 2021;7:1–21. doi: 10.1038/s41572-021-00269-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Barabási A-L, Gulbahce N, Loscalzo J. Network medicine: A network-based approach to human disease. Nat. Rev. Genet. 2011;12:56–68. doi: 10.1038/nrg2918. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Caldera M, Buphamalai P, Müller F, Menche J. Interactome-based approaches to human disease. Curr. Opin. Syst. Biol. 2017;3:88–94. doi: 10.1016/j.coisb.2017.04.015. [DOI] [Google Scholar]
  • 6.Barabási A-L. Network medicine—From obesity to the “diseasome”. N. Engl. J. Med. 2007;357:404–407. doi: 10.1056/NEJMe078114. [DOI] [PubMed] [Google Scholar]
  • 7.Paci P, et al. Gene co-expression in the interactome: moving from correlation toward causation via an integrated approach to disease module discovery. NPJ Syst. Biol. Appl. 2021;7:1–11. doi: 10.1038/s41540-020-00168-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Silverman EK, et al. Molecular networks in Network Medicine: Development and applications. WIREs Syst. Biol. Med. 2020;12:e1489. doi: 10.1002/wsbm.1489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Sonawane AR, Weiss ST, Glass K, Sharma A. Network medicine in the age of biomedical big data. Front. Genet. 2019;10:294. doi: 10.3389/fgene.2019.00294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Goh K-I, et al. The human disease network. Proc. Natl. Acad. Sci. 2007;104:8685–8690. doi: 10.1073/pnas.0701361104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Menche J, et al. Uncovering disease-disease relationships through the incomplete interactome. Science. 2015;347:1257601. doi: 10.1126/science.1257601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Ghiassian SD, Menche J, Barabási A-L. A DIseAse MOdule Detection (DIAMOnD) Algorithm Derived from a Systematic Analysis of Connectivity Patterns of Disease Proteins in the Human Interactome. PLOS Comput. Biol. 2015;11:e1004120. doi: 10.1371/journal.pcbi.1004120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Buniello A, et al. The NHGRI-EBI GWAS Catalog of published genome-wide association studies, targeted arrays and summary statistics 2019. Nucleic Acids Res. 2019;47:D1005–D1012. doi: 10.1093/nar/gky1120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Piñero J, et al. DisGeNET: A comprehensive platform integrating information on human disease-associated genes and variants. Nucleic Acids Res. 2017;45:D833–D839. doi: 10.1093/nar/gkw943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Paci P, et al. SWIM: A computational tool to unveiling crucial nodes in complex biological networks. Sci. Rep. 2017;7:srep44797. doi: 10.1038/srep44797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Fiscon G, Conte F, Paci P. SWIM tool application to expression data of glioblastoma stem-like cell lines, corresponding primary tumors and conventional glioma cell lines. BMC Bioinform. 2018;19:436. doi: 10.1186/s12859-018-2421-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Fiscon G, Conte F, Licursi V, Nasi S, Paci P. Computational identification of specific genes for glioblastoma stem-like cells identity. Sci. Rep. 2018;8:7769. doi: 10.1038/s41598-018-26081-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Falcone R, et al. BRAFV600E-mutant cancers display a variety of networks by SWIM analysis: prediction of vemurafenib clinical response. Endocrine. 2019;64:406–413. doi: 10.1007/s12020-019-01890-4. [DOI] [PubMed] [Google Scholar]
  • 19.Liang WS, et al. Altered neuronal gene expression in brain regions differentially affected by Alzheimer’s disease: a reference data set. Physiol. Genomics. 2008;33:240–256. doi: 10.1152/physiolgenomics.00242.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Sood S, et al. A novel multi-tissue RNA diagnostic of healthy ageing relates to cognitive health status. Genome Biol. 2015;16:185. doi: 10.1186/s13059-015-0750-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hampel H, et al. Blood-based biomarkers for Alzheimer’s disease: mapping the road to the clinic. Nat. Rev. Neurol. 2018;14:639–652. doi: 10.1038/s41582-018-0079-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Zetterberg H, Schott JM. Blood biomarkers for Alzheimer’s disease and related disorders. Acta Neurol. Scand. 2022;146:51–55. doi: 10.1111/ane.13628. [DOI] [PubMed] [Google Scholar]
  • 23.Lovestone S, et al. AddNeuroMed–the European collaboration for the discovery of novel biomarkers for Alzheimer’s disease. Ann. N. Y. Acad. Sci. 2009;1180:36–46. doi: 10.1111/j.1749-6632.2009.05064.x. [DOI] [PubMed] [Google Scholar]
  • 24.Aranda B, et al. The IntAct molecular interaction database in 2010. Nucleic Acids Res. 2010;38:D525–531. doi: 10.1093/nar/gkp878. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ceol A, et al. MINT, the molecular interaction database: 2009 update. Nucleic Acids Res. 2010;38:D532–D539. doi: 10.1093/nar/gkp983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Stark C, et al. The BioGRID interaction database: 2011 update. Nucleic Acids Res. 2011;39:D698–704. doi: 10.1093/nar/gkq1116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Amberger JS, Bocchini CA, Scott AF, Hamosh A. OMIM.org: Leveraging knowledge across phenotype–gene relationships. Nucleic Acids Res. 2019;47:D1038–D1043. doi: 10.1093/nar/gky1151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Bellenguez C, et al. New insights into the genetic etiology of Alzheimer’s disease and related dementias. Nat. Genet. 2022;54:412–436. doi: 10.1038/s41588-022-01024-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Kuleshov MV, et al. Enrichr: A comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 2016;44:W90–97. doi: 10.1093/nar/gkw377. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Paci P, Fiscon G. SWIMmeR: an R-based software to unveiling crucial nodes in complex biological networks. Bioinformatics. 2022;38:586–588. doi: 10.1093/bioinformatics/btab657. [DOI] [PubMed] [Google Scholar]
  • 31.Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Res. 2016;44:D457–D462. doi: 10.1093/nar/gkv1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Ashburner M, et al. Gene ontology: Tool for the unification of biology. The Gene Ontology Consortium. Nat. Genet. 2000;25:25–29. doi: 10.1038/75556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Zenaro E, et al. Neutrophils promote Alzheimer’s disease–like pathology and cognitive decline via LFA-1 integrin. Nat. Med. 2015;21:880–886. doi: 10.1038/nm.3913. [DOI] [PubMed] [Google Scholar]
  • 34.Wyss-Coray T. Inflammation in Alzheimer disease: driving force, bystander or beneficial response? Nat. Med. 2006;12:1005–1015. doi: 10.1038/nm1484. [DOI] [PubMed] [Google Scholar]
  • 35.Czirr E, Wyss-Coray T. The immunology of neurodegeneration. J. Clin. Invest. 2012;122:1156–1163. doi: 10.1172/JCI58656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Krstic D, Knuesel I. Deciphering the mechanism underlying late-onset Alzheimer disease. Nat. Rev. Neurol. 2013;9:25–34. doi: 10.1038/nrneurol.2012.236. [DOI] [PubMed] [Google Scholar]
  • 37.Arvanitakis Z, et al. Brain insulin signaling, Alzheimer disease pathology, and cognitive function. Ann. Neurol. 2020;88:513–525. doi: 10.1002/ana.25826. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Gabbouj S, et al. Altered insulin signaling in Alzheimer’s disease brain—Special emphasis on PI3K-Akt pathway. Front. Neurosci. 2019;13:629. doi: 10.3389/fnins.2019.00629. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Chiu S-L, Chen C-M, Cline HT. Insulin receptor signaling regulates synapse number, dendritic plasticity, and circuit function in vivo. Neuron. 2008;58:708–719. doi: 10.1016/j.neuron.2008.04.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Mostafavi S, et al. A molecular network of the aging human brain provides insights into the pathology and cognitive decline of Alzheimer’s disease. Nat. Neurosci. 2018;21:811–819. doi: 10.1038/s41593-018-0154-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Kam T-I, et al. FcγRIIb-SHIP2 axis links Aβ to tau pathology by disrupting phosphoinositide metabolism in Alzheimer’s disease model. Elife. 2016;5:e18691. doi: 10.7554/eLife.18691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Lee HN, et al. Aβ modulates actin cytoskeleton via SHIP2-mediated phosphoinositide metabolism. Sci. Rep. 2019;9:15557. doi: 10.1038/s41598-019-51914-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Andoniou CE, et al. The Cbl proto-oncogene product negatively regulates the Src-family tyrosine kinase Fyn by enhancing its degradation. Mol. Cell. Biol. 2000;20:851–867. doi: 10.1128/MCB.20.3.851-867.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Rao N, et al. An essential role of ubiquitination in Cbl-mediated negative regulation of the Src-family kinase Fyn. Signal Transduct. 2002;2:29–39. doi: 10.1002/1615-4061(200205)2:1/2&#x0003c;29::AID-SITA29&#x0003e;3.0.CO;2-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Yang K, et al. Fyn, a potential target for Alzheimer’s disease. J. Alzheimers Dis. JAD. 2011;27:243–252. doi: 10.3233/JAD-2011-110353. [DOI] [PubMed] [Google Scholar]
  • 46.Lim SL, et al. Inhibition of hematopoietic cell kinase dysregulates microglial function and accelerates early stage Alzheimer’s disease-like neuropathology. Glia. 2018;66:2700–2718. doi: 10.1002/glia.23522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Wang H. Microglia heterogeneity in Alzheimer’s disease: Insights from single-cell technologies. Front. Synaptic Neurosci. 2021;13:773590. doi: 10.3389/fnsyn.2021.773590. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Gao W-S, et al. DOK3 is involved in microglial cell activation in neuropathic pain by interacting with GPR84. Aging. 2020;13:389–410. doi: 10.18632/aging.202144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Talman V, Pascale A, Jäntti M, Amadio M, Tuominen RK. Protein kinase C activation as a potential therapeutic strategy in Alzheimer’s disease: Is there a role for embryonic lethal abnormal vision-like proteins? Basic Clin. Pharmacol. Toxicol. 2016;119:149–160. doi: 10.1111/bcpt.12581. [DOI] [PubMed] [Google Scholar]
  • 50.Jain M, et al. Role of JAK/STAT in the neuroinflammation and its association with neurological disorders. Ann. Neurosci. 2021;28:191–200. doi: 10.1177/09727531211070532. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Table 1. (47.1KB, xlsx)
Supplementary Table 2. (1,003.5KB, xlsx)
Supplementary Table 3. (372.1KB, xlsx)

Data Availability Statement

Data that support the findings of this study are openly available in GEO public repository and in Supporting Information files of12. SWIMmeR code is freely available at https://github.com/sportingCode/SWIMmeR. DIAMOnD code is freely available from the Supporting Information files of12. All the other relevant data are within this manuscript and its Supporting Information files.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES