Skip to main content
Briefings in Bioinformatics logoLink to Briefings in Bioinformatics
. 2025 Jan 21;26(1):bbaf007. doi: 10.1093/bib/bbaf007

Multimodal multiobjective optimization with structural network control principles to optimize personalized drug targets for drug discovery of individual patients

Jing Liang 1,2, Zhuo Hu 3, Ying Bi 4, Han Cheng 5, Wei-Feng Guo 6,7,
PMCID: PMC11747759  PMID: 39835535

Abstract

Structural network control principles provided novel and efficient clues for the optimization of personalized drug targets (PDTs) related to state transitions of individual patients. However, most existing methods focus on one subnetwork or module as drug targets through the identification of the minimal set of driver nodes and ignore the state transition capabilities of other modules with different configurations of drug targets [i.e. multimodal drug targets (MDTs)] embedding the knowledge of previous drug targets (i.e. multiobjective optimization). Therefore, a novel multimodal multiobjective evolutionary optimization framework (called MMONCP) is proposed to optimize PDTs with network control principles. The key points of MMONCP are that a constrained multimodal multiobjective optimization problem is formed with discrete constraints on the decision space and multimodality characteristics, and a novel evolutionary algorithm denoted as CMMOEA-GLS-WSCD is designed by combining a global and local search strategy and a weighting-based special crowding distance strategy to balance the diversity of both objective and decision space. The experimental results on three cancer genomics data from The Cancer Genome Atlas indicate that MMONCP achieves a higher performance including algorithm convergence and diversity, the fraction of identified MDTs, and the area under the curve score than advanced algorithms. Additionally, MMONCP can detect the early state from the difference between the target activity and toxicity of MDTs and provide early treatment options for cancer treatment in precision medicine.

Keywords: personalized drug targets, network control principles, multimodal multiobjective optimization, precision medicine

Introduction

Accurate prediction of drug targets is essential for drug discovery. In precision medicine, it is the basis of finding efficient drugs of individual patients but a big challenge for discovering suitable personalized drug targets (PDTs). The personalized genomic profile of individual patients makes it possible to optimize PDTs [1, 2]. Current large samples-based static models face challenges in performance and false negatives [3, 4], since the identification of effective drug targets for individual patients using personalized omics data is limited by the scarcity of personalized sample information. Advances in network science have promoted the development of methods using sample-specific gene interaction networks. Methods such as CPGD [1] and SCS [5] have been developed to optimize PDTs on personalized gene interaction networks (PGINs). The above tools usually contain two steps: (i) constructing a PGIN, and (ii) optimizing PDTs by selecting the targeted driver genes. Commonly used methods for constructing PGIN include Paired-SSN [6], LIONESS [7], and SSN [8]. To optimize PDTs, structural network control principles provide a theoretically precise depiction of the manner in PGIN where state transitions can be accomplished by appropriately selecting driver genes [9].

Nevertheless, these traditional methods based on network control, such as the Maximum Matching Set (MMS)-based control methods (MMS) [10], Minimum Dominating Set-based control method (MDS) [11], Nonlinear Control of undirected network Algorithm (NCUA) [6], and Directed Feedback Vertex-based control method (DFVS) [12], are subject to two primary limitations: (i) these methods pose computational challenges in optimizing the set of driver nodes within large-scale networks featuring nonlinear dynamics, but it is possible to approximate efficient solutions [9]. (ii) These methods primarily concentrate on system control through the identification of the minimal set of driver nodes. They are constrained single-objective optimization problems but neglect the knowledge of previous drug targets during the optimization process. To solve these problems, multiobjective optimization-based structural network control principles (MONCP) [13] were developed by simultaneously minimizing the number of driver nodes and maximizing the information of prior-known drug targets in PGIN. Based on this, a set of Pareto-optimal solutions or a trade-off solution curve could be obtained, allowing decision-makers to make a selection and provide additional potential pathways for personalized therapy in the field of precision medicine. Subsequently, in our another previous work to improve the performance of MONCP, we proposed a knowledge-embedded multitasking constrained multiobjective Evolutionary Algorithm (KMCEA) by analyzing the characteristics of the problems and mining relevant knowledge to design specific strategies [14]. Furthermore, in our recent work, we also applied MONCP on temporal network observability model to identify the disease predictive biomarkers [15].

Though the above methods have been gradually optimized for optimizing PDTs, they all ignore a fact that multiple sets of PDTs are equivalent in the information available about prior-known drug targets and the number of driver nodes, but they have differences in their configurations. These different sets of PDTs may provide varying biological functions [i.e. multimodal drug targets (MDTs)], as shown in Fig. S1. Therefore, this study proposed a novel multimodal multiobjective optimization framework with network control principles (called MMONCP) to optimize PDTs. The demonstration of MMONCP is illustrated in Fig. 1. MMONCP firstly considers the optimization of PDTs in individual patients as a constrained multimodal multiobjective optimization problem (CMMOP).

Figure 1.

Figure 1

Demonstration of MMONCP. The MMONCP aims to discover multiple MDTs on the PGIN. Several sets of drug targets are called MDTs, if they are equivalent in MMONCP in the information available about prior-known drug targets (red color) and the number of driver nodes, but they have differences in their configurations. The PGIN was constructed with Paired-SSN by integrating their gene expression data of paired samples of an individual patient and gene mutation data. Then, a constrained multimodal multiobjective evolutionary algorithm with structural network control principles is used to optimize MDTs according to the structure of the network.

Then, a new CMMOEA called CMMOEA-GLS-WSCD is developed by integrating a combined global and local search (GLS) strategy and a weighting-based special crowding distance (WSCD), which considers the diversity of both the decision space and the objective space. In detail, CMMOEA-GLS-WSCD employs a multitask framework, where the main task is used for global search by solving CMMOP, and the two auxiliary tasks are used for local search by solving derivative constrained multimodal single-objective optimization problems (CMSOP). Meanwhile, a combined GLS strategy is designed to achieve a balance of GLS, where the angle-based niching parent selection method [16] is adopted for local search and the enhanced binary tournament selection is utilized for global search [17]. Additionally, to balance the diversity of the objective space and decision space, WSCD is proposed in environmental selection. Different from traditional methods, MMONCP not only considers simultaneously minimum driver nodes (the first objective) and maximum prior-known drug–target information (the second objective) embedding the knowledge of previous drug targets, but also identifies Pareto-optimal set (PS) or a trade-off curve (i.e. distribution of PDTs in two objective space) that can be equivalent in terms of the two objectives, but they may differ in terms of their gene or protein configurations. Finally, the effectiveness of MMONCP is validated on datasets obtained from TCGA, including breast invasive carcinoma (BRCA), lung adenocarcinoma (LUAD), and lung squamous cell carcinoma (LUSC). The experimental results demonstrate that MMONCP outperforms current CMMOEAs including convergence, diversity, and the fraction of MDTs in PGIN of patients. In addition, we found that our MMONCP can detect the early state in stage ia from the difference in drug activity and toxicity of different sets of MDTs in the BRCA cancer dataset. Additionally, the functional differences of MDTs are analyzed at this stage (i.e. stage ia for BRCA cancer), and it is verified that our MMONCP can provide different functional targets for precise treatment of cancer.

Material and methods

Structural network control theory

Comprehending the underlying dynamics of molecular networks from the perspective of control theory is the key to understanding cancer progression. Specifically, the progression of cancer from a normal state to a disease state is formulated as a structural network control problem [9]. The goal of structural network control principles in PGINs is to find a set of personalized driver genes (denoted as PDGs) with the minimum size, based on a proper understanding of the molecular network structure. The state of these genes can be changed by input signals, thereby changing the state of the entire network. It is noteworthy that PDTs can be viewed as PDGs that are targeted by drug activation in PGINs. This type of problem can be summarized as follows:

graphic file with name DmEquation1.gif (1)

where Inline graphic represents the state of the gene at a specific time point Inline graphic in the molecular network (PGIN in this work), and Inline graphic is the adjacency matrix of the molecular network. Inline graphic denotes the driving that has Inline graphic controllers with Inline graphic genes. Common controllers include the genetic or environmental factors that can provide input signals.

Intrinsic structure and dynamic propagation are crucial in controlling the process of the molecular network. However, how to represent the particular functional patterns of the molecular network is still unknown. By considering the network structure, structural network control principles can be exploited to analyze the controllability of complex networks by identifying a set with the minimum number of driver nodes.

Traditional principles include MMS [10], MDS [11], NCUA [6], and DFVS [12], where MMS and DFVS are directed networks and the other two are undirected networks. According to the way of control, there are three types of MMS-based control methods, including MMS-based full controllability [10], MMS-based target controllability [18], and MMS-based constrained target controllability [19]. These MMS-based control methods focus on the controllability of complex networks with linear or local nonlinear characteristics. However, nonlinear network dynamics are prevalent, particularly concerning individual patient heterogeneity of PGINs. MMS-based control methods may have difficulty in presenting a complete view. MDS assumes that each edge is dual-directional and each driver node has the autonomous capability to control the edges it is associated with. Moreover, non-driver nodes become controllable when they are in close proximity to a minimum of one driver node. In short, every node within MDS has autonomous control over each of its outbound edges. In the past few years, the research emphasis in network control has transitioned from linear dynamics to nonlinear dynamics [20]. DFVS is applied to direct networks with nonlinear dynamics and known network structures, where the functional form of the governing equations cannot be random but must have certain properties. To achieve effective feedback control, it is necessary to manipulate the FVS (Feedback Variable Set), which associates with the feedback loop within the network. However, DFVS cannot be directly applied to undirected networks Inline graphic. Therefore, NCUA is developed for the network control problem of undirect networks with nonlinear dynamics based on FVS. NCUA is the same as MDS in that the edges are bidirectional. It is noteworthy that the network in NCUA can be transformed into a bipartite graph Inline graphic. If Inline graphic is one of the nodes connected to Inline graphic, the edge between Inline graphic and Inline graphic will be put into Inline graphic. The set of nodes, which is not dominate by other nodes in Inline graphic, should be chosen from Inline graphic. By adding other objectives (i.e. maximize the information of prior drug targets), MONCP can be modified according to traditional principles. Note that MONCP under the framework of MDS, NCUA, and DFVS have different constraints but the same objectives.

Constrained multimodal multiobjective optimization theory

CMMOPs are widespread in real life, such as the site selection problem [21]. Generally, a CMMOP can be described as follows:

graphic file with name DmEquation2.gif (2)
graphic file with name DmEquation3.gif (3)

where Inline graphic is an objective vector and Inline graphic represents the number of objectives. Decision vector, denoted as Inline graphic, has Inline graphic decision variables. The Inline graphicth inequality constraint can be represented as Inline graphic, while the Inline graphicth equality constraint can be represented as Inline graphic. Inline graphic and Inline graphic represent the number of inequality constraints and equality constraints. Generally, the extent to which a solution satisfies a constraint can be represented by its score of constraint violation (CV), which can be calculated as follows:

graphic file with name DmEquation4.gif (4)

where Inline graphic is a tolerance value used to relax the equality constraints. A solution with a CV value of 0 is called a feasible solution, while a solution with a nonzero CV value is called an infeasible solution.

The infeasible solution with a smaller CV value is superior to the infeasible solution with a larger CV value. Among a feasible solution Inline graphic and an infeasible solution Inline graphic, Inline graphic dominates Inline graphic. The concept of dominance between two feasible solutions Inline graphic and Inline graphic is expressed as follows: if Inline graphic for every Inline graphic and Inline graphic for at least one Inline graphic, then Inline graphic dominates Inline graphic. A solution that is not dominated by any other solution and satisfies all constraints is called a Pareto optimal solution. The collection of all Pareto optimal solutions forms a set known as the constrained Pareto optimal solution set (CPS), and the mapping of this set in the objective space is called the constrained Pareto front (CPF).

Different from multimodal multiobjective optimization problems, the solutions must be confined to the feasible region in CMMOPs. That is to say, the search space for constrained multimodal multiobjective optimization is not the entire search space, but rather within a feasible region. The illustration of CMMOPs is given in Fig. 2, where the shaded area represents the feasible region. In short, a CMMOP can be defined as a scenario where various solutions in the CPS map to a single point in the CPF.

Figure 2.

Figure 2

Illustration of constrained multimodal multiobjective problem. The constrained Pareto solution sets (CPSs) in the decision space (the left) are called CMMOPs if in the feasible domain (gray and yellow areas), they are correspond to the same constrained Pareto front (CPF) in the objective space (the right).

Recently, several tools have been developed to tackle constrained multimodal single-objective optimization problems (CMSOP). Few researches have been conducted in CMMOP due to its high complexity. Liang et al. [21] developed a differential evolutionary algorithm called CMMODE to solve CMMOPs by employing a specification mechanism to obtain multiple equivalent solutions in CPF, and they developed an enhanced environment selection strategy. Ming et al. [22] presented a coevolutionary method CMMOCEA to solve CMMOPs, which contains two populations: Inline graphic considers both constraints and multimodality, Inline graphic disregards the constraints, and both of them consider the diversity of the objective space and decision space. However, these above methods focus on test suits that are continuous and have fewer decision variables. When solving MMONCP, it may be challenging to achieve better performance.

CMMOPs can be regarded as one type of constrained multiobjective optimization problems (CMOPs) considering multimodality or multimodal multiobjective optimization problems (MMOPs) with constraints. Therefore, it is necessary to design appropriate strategies to handle constraints, develop a multimodal mechanism, and learn how to balance them. In this section, related content of CMOPs and MMOPs will be introduced in the following.

Constrained multiobjective optimization

As shown in Eqs. (23), CMOPs have the same formulas as CMMOPs. Recently, many CMOEAs have been developed, which can be classified into three types based on the form of the algorithm, including multipopulation, multistage, and other types. CMOEAs employ two or more populations, and an effective result can be obtained by cooperation between several populations. For these CMOEAs, multiple populations have different tasks, and cooperative evolution can be called “multitask.” More precisely, it is called multiform optimization in the latest evolutionary transfer optimization survey paper [23]. Particularly, multiform optimization exploits the alternate form of an optimization problem with single target. The search experiences derived from various formulations of a single optimization problem may lead to improved problem-solving outcomes for the target problem, as different formulations can capture diverse properties or landscapes of the problem encountered. Instead of conducting an evolutionary search on the complex search space of the original problem formulation, the search could be carried out on alternative formulations with simpler search spaces. Transferring the valuable knowledge acquired in the simpler search space back to the original space can guide the search toward high-quality solution areas, leading to improved optimization performance.

For instance, Tian et al. [24] developed a coevolutionary CMOEA (i.e. CMMO), where the main population is based on CMOP and the auxiliary population focuses on an unconstrained problem. Also, a weak cooperation of two populations is adopted, and this is a popular method. Qiao et al. [25] presented a dual-population CMOEA, where the main and auxiliary population employ different mechanisms to handle constraints. Meanwhile, a scheme for dynamic resource allocation was developed to assign varied resources to populations. CTAEA [26] and MTCMO [27] are popular CMOEAs with multipopulation. CMOEAs with multistage are used to balance objectives and constraints at different stages. For example, Fan et al. [28] developed a CMOEA PPS including two stages, where constraints are not considered at the first stage to evolve the population toward the UPF, and all constraints are considered at the second stage for convergence to CPF. Liu et al. [29] introduced a two-stage CMOEA (ToP) that employs the search for single-objectives in the first stage and the search for multiple objectives in the second stage. Additionally, there are some other types of CMOEAs, such as NSGAII-CDP [30], MOEA/D-CDP [31], etc.

However these CMOEAs still have limitations in solving our MMONCP due to discrete feasible areas. Furthermore, MMONCP is a CMMOP that is supposed to consider the diversity of the decision space. To sum up, it is challenging for these CMOEAs to achieve good performance when solving MMONCP.

Multimodal multiobjective optimization

An MMOP can be defined as a problem that requires optimizing multiple (two or more) objectives, and it has multiple solutions in the decision space that correspond to the same point in the objective space (i.e. multimodal solutions). Establishing a species protection mechanism is crucial for mining and preserving multimodal solutions, thereby enhancing the diversity of solutions in the decision space. This is typically accomplished by setting up a niche through clustering [32], employing a ring topology [33], and enhancing the diversity of the decision space [34]. Multimodal multiobjective evolutionary algorithms (MMEAs) using clustering include MMOEA/DC [35] using clustering in both decision space and objective space, MMODE_CSCD [36] using a special crowed distance based on clustering, etc. MO_Ring_PSO_SCD [33] is a representative algorithm using ring topology to generate niche. Additionally, many MMEAs exploit the diversity of decision space to solve MMOPs, such as HREA [34], DN-NSGAII [32], etc.

Despite the excellent performance demonstrated by the aforementioned multimodal optimization algorithms in simulation tests, they cannot be directly utilized for biomedical problems due to their lack of tailored strategies for biological data. Particularly, the optimization algorithm is faced with a significant challenge due to the intricate nature of biomarker recognition in dynamic networks. In addition, there are many constraints in the decision space, and these MMEAs fail to handle constraints.

M‌MONCP framework

MMONCP not only provides different treatment programs but may also be tailored to specific biological functions not addressed by other drug targets. Therefore, this paper extends MDT to network control principles and formulates the problem of MMONCP in individual patients. MMONCP is a discrete, multimodality problem with constraints on the decision space, so it may be difficult to consider the diversity of the decision space while finding multiple discrete feasible regions. To address this issue, this paper develops a novel CMMOEA (denoted as CMMOEA-GLS-WSCD), which the specific process is shown in Fig. 3, by a combined GLS strategy and a WSCD strategy. In our CMMOEA-GLS-WSCD, a combined GLS strategy is used by combining the angle-based niching parent selection method and the enhanced binary tournament selection method to balance local and global search. Meanwhile, CMMOEA-GLS-WSCD simultaneously considers the diversity of the decision space and objective space. When the crowding distance of the decision space is always greater than that of the objective space in basic special crowding distance, the objective space will have poor diversity. To achieve this, WSCD strategy is proposed in this paper in the process of environmental selection.We have put the source code and data at GitHub repository for its web availability.(https://github.com/WilfongGuo/MMONCP).

Figure 3.

Figure 3

The method explanation of CMMOEA-GLS-WSCD. (A) The overall workflow of CMMOEA-GLS-WSCD. (B) The main framework of the combined global and local search strategy. (C) The main framework of the weighting-based special crowding distance.

Objective functions of MMONCP

In fact, MMONCP is the MONCP by considering multimodality. Therefore, MMONCP has the same objective functions as MONCP:

graphic file with name DmEquation5.gif (5)
graphic file with name DmEquation6.gif (6)

subject to

graphic file with name DmEquation7.gif (7)

where Inline graphic represents the number of decision variables (i.e. the number of genes in the PGIN), and Inline graphic denotes the Inline graphicth gene. For binary decision variable Inline graphic, if the Inline graphicth gene is identified, Inline graphic=1; otherwise, Inline graphic=0. Inline graphic represents the number of constraints. Inline graphic represents the label of the Inline graphicth PDT, and it is set to 1 if the Inline graphicth PDT is prior-known; otherwise, it is set to 0. Inline graphic denotes the first objective of minimizing the number of genes in the PGIN, and Inline graphic represents the second objective of minimizing the maximum prior-known drug–target information. For Eq. (9), which is a summary form, MDS, NCUA, and DFVS have different constraints (network controllability), as shown in Eqs. (1012), respectively.

The form of MDS is given below:

graphic file with name DmEquation8.gif (8)

where Inline graphic represents the set of adjacent nodes of Inline graphic. This equation indicates that at least one of two adjacent nodes must be identified.

For NCUA, the constraints in the following form are used to ensure the network is controllable:

graphic file with name DmEquation9.gif (9)

for an undirected network Inline graphic, its bipartite graph is represented as Inline graphic, which is described in Section II. Inline graphic is set to 1 if the Inline graphicth node belongs to the cover set in Inline graphic to cover the nodes in Inline graphic.

For DFVS, the concept of FVS allows for determining the driver nodes in a directed network based on its cycle structure and source nodes, which can be described as an integer linear programming (ILP) problem:

graphic file with name DmEquation10.gif (10)

where Inline graphic is set to 1 when node Inline graphic is included in FVS.

Designing constrained multimodal multiobjective evolutionary algorithm

Algorithm 1 provides the detailed process of CMMOEA-GLS-WSCD. The constraint equations are formed according to the PGIN using Eq. (10), Eq. (11), or Eq. (12), in which different equations represent different MONCPs (line 1). The main population (Inline graphic) and two auxiliary populations (Inline graphic and Inline graphic) are randomly initialized with corresponding sizes (lines 2 and 3). The following steps are conducted in each iteration until the termination condition is satisfied. First, the non-dominance level is obtained by non-dominated sorting, and the WSCD of Inline graphic is calculated by Eqs. (14–15) (line 5). The diversity (Inline graphic) of Inline graphic and Inline graphic are evaluated based on the Hamming distance to the nearest individual in the decision space (line 6). Then, a combined GLS strategy is used in Inline graphic to generate Inline graphic offspring Inline graphic by combining the angle-based niching parent selection method and the enhanced binary tournament selection method (line 7). The specific details of this are presented in Algorithm 2. According to Inline graphic, the mating pool is selected, and Inline graphic offspring Inline graphic of Inline graphic and Inline graphic of Inline graphic are generated using a binary GA (lines 8-9). Finally, for the main population, Inline graphic solutions are selected by using the environmental selection method in [21], from Inline graphic  Inline graphic  Inline graphic  Inline graphic  Inline graphic  Inline graphic  Inline graphic. Unlike the original version, WSCD is employed to balance the diversity of decision space and objective space (line 11). For two auxiliary populations, Inline graphic solutions are selected by comparing the fitness of Inline graphic  Inline graphic  Inline graphic  Inline graphic  Inline graphic  Inline graphic  Inline graphic and Inline graphic  Inline graphic  Inline graphic  Inline graphic  Inline graphic  Inline graphic  Inline graphic (lines 12-13). Inline graphic denotes the final population and output.

graphic file with name bbaf007f1001.jpg

Weighting-based special crowding distance

Special crowding distance (SCD), proposed by Yue et al. [33], considers the diversity of both decision space and objective space. However, Inline graphic of SCD cannot reflect the distribution and convergence of the decision space in MMONCP, which is a discrete large-scale problem. Hamming distance is usually employed to measure the diversity in discrete problems [37–39], and it is also used in our work. When there are many individuals of the population in the later stages of evolution that are far away in the decision space but very close to the objective space, the SCD is set to Inline graphic. This implies that SCD only considers the crowding distance in the decision space. In this case, these solutions will be retained, leading to poor diversity of the objective space. To achieve a balance of diversity in both the decision space and objective space, WSCD is proposed and shown as follows:

graphic file with name DmEquation12.gif (11)
graphic file with name DmEquation13.gif (12)

where Inline graphic and Inline graphic represent the crowding distance and the average crowding distance in the decision space, respectively. Similarly, Inline graphic and Inline graphic represent the crowding distance and the average crowding distance in the objective space, respectively. In this work, Hamming distance is utilized to reflect the crowding distance of the decision space, and the calculation method for Inline graphic is given in [21]. Inline graphic denotes the number of individuals in the population. Inline graphic is set to 1 if Inline graphic is less than or equal to Inline graphic, and otherwise, it is set to 0. Inline graphic is set in the same way as above. Inline graphic represents an average score by calculating the ratio of the average crowding distance of the decision space to the average crowding distance of the objective space. Inline graphic represents an individual score of the Inline graphic-th individual by calculating the ratio of the crowding distance of the decision space to the crowding distance of the objective space. If Inline graphic is larger than Inline graphic, it implies that the crowding distance of the decision space of the Inline graphic-th individual is much greater than that of the objective space. When there are many individuals of this type, the diversity of the objective space will be poor, indicating that it is necessary to improve the diversity of the objective space. Therefore, the proportion of the individuals with Inline graphic being larger than Inline graphic is set to Inline graphic, and the proportion of the individuals with Inline graphic being less than or equal to Inline graphic is set to Inline graphic.

This paper considers the number of individuals whose crowding distance in the decision space is less than that in the objective space, to adjust the crowding distance weight of the objective space and decision space. To achieve this, the ratio of crowding distance in the decision space and the objective space is calculated for each individual, and the relative ratio of the crowding distance of all individuals is compared with the threshold value Inline graphic. If the number of individuals whose crowding distance in the decision space is smaller than that in the objective space decreases, the crowding distance weight in the decision space increases adaptively, and otherwise, the crowding distance weight in the decision space decreases adaptively.

graphic file with name bbaf007f1002.jpg

Combined global and local search strategy

To achieve a balance of global search and local search, a combined search strategy is proposed in this paper, and its details are given in Algorithm 2. This method includes two steps: local search and global search. Local search is used to help the population find multiple discrete feasible regions and maintain the diversity of the objective space, and global search is used to obtain multimodal solutions and maintain diversity.

For local search, it starts by using the angle-based niching parent selection method. The objective values Inline graphic of individuals in Inline graphic are calculated and transformed into a new coordinate axis (lines 1-2). The angle value Inline graphic of every two individuals is obtained based on Inline graphic (line 3). Then, two parents are chosen from the Inline graphic neighbors with the smallest Inline graphic values (line 5), and the GA operator is employed to generate offspring based on these parents (line 6).

For global search, an enhanced binary tournament selection method [17] using WSCD and decision space-based niching are employed to select better individuals for evolutionary calculation. Mating pool is selected according to the following guidelines (lines 8–22): (1) two solutions are selected from all individuals in the current population, (2) two solutions with higher Pareto levels will preferentially enter the mating pool, and (3) if the Pareto levels of the two solutions are the same, then those with larger SCD values will preferentially enter the mating pool. Furthermore, the GA operator is employed to generate offspring.

Computational complexity

The main factors that contribute to the time complexity of MMONCP (or CMMOEA-GLS-WSCD) at each generation including initializing the population, evaluating individuals, generating offspring, and updating the population for all three population. These factors have complexities of Inline graphic, Inline graphic, Inline graphic, and Inline graphic, respectively, in the main population. Thus, the computational complexity in the main population can be estimated as Inline graphic. Similarly, the computational complexity in the auxiliary population can be approximated as Inline graphic. Inline graphic and Inline graphic denote the sizes of the main and auxiliary populations, respectively. Inline graphic represents the number of iterations, and Inline graphic represents the number of genes in the PGIN. In conclusion, the overall computational complexity of MMONCP is Inline graphic in the PGIN for an individual patient.

Experimental

Data

In the experiment, three gene expression datasets of cancer patients from TCGA are used, namely BRCA, LUSC, and LUAD. The paired samples of individual patients, including normal samples and tumor samples, are obtained from the same patients to construct the PGIN. The specific process can be found in the Section S-I in supplementary materials. Meanwhile, single nucleotide variations are employed to form the reliable edges of the PGIN. BRCA has 112 patients, and their numbers of genes range from 1581 to 1780; LUAD has 57 patients, and their numbers of genes range from 1987 to 2342; and LUSC has 47 patients, and their numbers of genes range from 2100 to 2344. Note that the gene number of the used datasets is less than the actual number of cancer genes in individual patients. This is because not all genes in the process of cancer transformation change, and only genes with changed functions are used. Also, the prior known reliable gene interactions are collected from existing biological datasets with incomplete information.

To describe the definition of “well-estimated drug targets”, we firstly introduce the source of data. In the experiment, the prior known reliable gene interactions as “well-estimated drug targets” are collected from existing DrugCombDB (http://drugcombdb.denglab.org). DrugCombDB is a comprehensive database dedicated to integrating drug combinations from various data sources. In addition, 21,000 FDA approved or literature-supported interactions of drugs and drug targets collected from external databases and text-mining followed by manual curations can be downloaded.

To show the performance of MMONCP in discovering clinical combinatorial drugs, combinatorial drug and gene interaction networks are collected in this work. Then, a network between gene and drug is constructed according to the interactions between combinatorial drugs and genes [40]. This interaction network includes 342 combinatorial drugs, where 122 combinations of drugs have been found to be effective in treating cancer. Meanwhile, drug toxicity data (https://toxric.bioinforai.tech/home) and drug activity data (https://www.ebi.ac.uk/chembl/) are collected to explore the biological information of PDTs.

Parameter settings of MMONCP

Since there are numerous genes in a PGIN (1581 to 1780 for BRCA, 1987 to 2342 for LUSC, and 2100 to 2344 for LUAD), the main population size is designated as 300 (Inline graphic), while the auxiliary population size is specified as 90 (Inline graphic). This setup is the same as that of the recent work [13]. The maximal number of function evaluations is set to 20,0000 [41]. To enhance result reliability, MMONCP is executed independently 30 times for each patient.

Algorithms for comparison

Since there are few CMMOEAs so far, MMONCP is compared with two CMMOEAs, two MMOEAs, and one CMOEA for solving MMONCP to show its effectiveness. Specifically, the two CMMOEAs are CMMODE [21] and CMMOCEA [22], which are described in Section II. The two MMOEAs are DN-NSGA-II [32] and MP-MMEA [42], where DN-NSGA-II employ the niche based on decision space to solve MMOPs and MP-MMEA is used to solve large-scale MMOPs by using multiple sub-populations. To ensure the effectiveness of MMOEA on MMONCP, the CDP method [30] is used in these two MMOEAs, and they are denoted as CDN-NSGA-II and CMP-MMEA, respectively. The one CMOEA is LSCV-MCEA, which is used to optimize PDTs in recent work [13]. To ensure fairness, the crowding distance of the decision space is considered in LSCV_MCEA. itionalitionally, some methods based on cancer-specific driver genes and traditional structural network control methods, including CPGD [1], ActiveDriver [43], OncoDriveFM [44], and DriverML [45], are taken for comparison. Our method has been compared with traditional methods, including MDS [11], NCUA [6], and DFVS [12], for MDT optimization in the PGIN. It is noteworthy that all algorithms adopt the particular parameter settings of the original article and the same shared parameters with our method. The statistical analysis is conducted using the Wilcoxon rank-sum test [46] with a significance level of 0.05.

Performance metrics

To verify the effectiveness of all algorithms on the three cancer datasets, four performance metrics are used in this work.

  • Hypervolume (HV) [47]: It is obtained by computing the volume bounded by the discovered solutions and the reference points. A higher HV value signifies superior performance in both diversity and convergence aspects. In this paper, the reference points are set to [1.1, 1.1].

  • Inverted Generational Distance (IGD) [48]: It mainly reflects the diversity and convergence of the algorithms by evaluating the gap between the obtained solutions and the Pareto optimal solutions. However, the PF and PS are unknown for our MMONCP. This study considers non-dominated solutions of the union set, and the Pareto optimal solutions of MMONCP are obtained by combining solutions of all algorithms [13]. Different from HV, a smaller IGD indicates better performance.

  • Area Under Curve (AUC): It is used to assess the effectiveness of utilizing PDTs in personalized drug discovery. A larger AUC indicates better performance in discovering clinical combinatorial drugs. It is calculated by the predicted probability and the true label in the CAC, and the specific calculation details are shown in Section S-II in the supplementary materials:

  • multiSet Value (MSV) [49]: It is applied in this paper to compare the multimodality of all algorithms on the three cancer datasets. The calculation is performed in the following manner:
    graphic file with name DmEquation15.gif (13)
    where Inline graphic represents the number of genes in each PGIN, and Inline graphic denotes the number of multimodal solutions in the true PF. The true PF is unknown, and its calculation is the same as that of IGD. Inline graphic denotes the number of multimodal solutions obtained by algorithms that must be in the true PF. A larger MSV indicates higher multimodality.

Results and discussions

Performance of discovering clinical combinatorial drugs

In order to demonstrate the superiority of MMONCP in identifying clinical combinatorial drugs, we apply AUC to evaluate the performance of MMONCP relative to other algorithms across the three cancer datasets. The highest AUC value for each individual in each algorithm is recorded, and the average results of 30 runs in MDS, NCUA, and DFVS are shown in Fig. 4. It is noteworthy that since CMMODE cannot find a feasible solution, it is not shown in Fig. 4. The results suggest that only LSCV_MCEA has a similar performance to MMONCP on the three cancer datasets. In simpler terms, MMONCP demonstrates superior performance compared with the other methods. In particular, MMONCP performs better than LSCV_MCEA in MDS across the three cancer datasets and in NCUA in the LUSC dataset. Therefore, the results presented here confirm the effectiveness of our proposed MMONCP algorithm in identifying clinically effective combinatorial drugs.

Figure 4.

Figure 4

The boxplot of AUC for MMONCP and other evolutionary algorithms.

Additionally, MMONCP is compared with some traditional methods, including methods based on cancer-specific driver genes and traditional methods based network in discovering clinical combinatorial drugs. The specific results are shown in Fig. 5, where MMONCP_MDS, MMONCP_MDS, and MMONCP_DFVS represent MMONCP in MDS, NCUA, and DFVS, respectively. In addition, we also conducted rank sum tests on the results of various methods at a significance level of 0.05 to perform statistical analysis and provide corresponding p-value of Fig. 5 in section S-V of supplementary material. It can be found that MMONCP outperforms other methods in terms of AUC value across the three cancer datasets, indicating MMONCP performs better in discovering clinical combinatorial drugs.

Figure 5.

Figure 5

The comparison results of AUC indicator on three cancer datasets for MMONCP and other traditional methods.

Detection of functional drug targets in early stage

The development of cancer is generally categorized into four stages, from I to IV. Stage IV, often referred to as advanced cancer, is characterized by the proliferation of cancer cells and their metastasis to other tissues. Consequently, effectively detecting early warning signs during the progression of cancer is crucial for both diagnosis and prognosis. According to data from the TCGA database, patients with BRCA, LUSC, and LUAD exhibit different cancer stages. To ensure statistical significance, we merged the stages with fewer than three patients into their nearest former stages. The final staging for BRCA patients includes I, IA, IIA, IIB, IIIA, IIIB, IIIC, and IV. For LUSC and LUAD, the stages are IA, IB, IIA, IIB, III, and IV. This consolidation allows for a more robust analysis of cancer progression across these patient groups.

In this work, the biological significance of MMONCP is verified from the perspective of toxicity and activity on the BRCA dataset. Drug and gene interaction networks are also collected from DrugCombDB. In this work, the total activity or toxicity of the gene attached to drugs is considered as the two biological activity and toxicity function indexes which reflects the regulation function of genes in the drug-gene interaction network [50]. Firstly, toxicity values and activity values are calculated for MDTs. That is, MDT could be defined as multiple sets of PDTs which are equivalent in the information available about prior-known drug targets and the number of driver nodes, but they have differences in their configurations. Then for each MDT along the PS, the difference between the maximum toxicity and the minimum toxicity or maximum activity and minimum activity is recorded and we choose the maximum difference among all MDTs along the PS as the toxicity or activity value of an individual patient. Next, this work obtains the clinical stage information of all patients on the BRCA cancer dataset from TCGA and calculates the average difference of each stage. Finally, the line chart of toxicity and activity is shown in Fig. 6. From Fig. 6, it can be seen that the average difference of activity (toxicity) curve has a maximum (minimum) value in stage ia. These results demonstrate that the MDTs of our MMONCP can detect the early state from the difference of activity (toxicity) curve for MDTs and provide early cancer treatment options in precision medicine.

Figure 6.

Figure 6

The average difference of nine stages using activity or toxicity on BRCA dataset.

To further investigate the biological significance of these drug targets, the patients of stage ia are taken for validation. Firstly, the MDT with the largest difference in toxicity or activity on PS in each patient is recorded. Then, within this MDT, the gene sets of the patients in stage ia with maximal activity, minimal activity, maximal toxicity, and minimal toxicity are obtained and merged respectively. Finally, the merged gene sets of all the patients in stage ia are used for KEGG pathway enrichment analysis (https://david.ncifcrf.gov/), as shown in Fig. S2. From Fig. S2, it can be seen that there are some differences in gene sets with maximal activity and minimal activity Fig. S2(A-B), such as focal adhesion and Human papillomavirus infection. Within the gene sets with maximal toxicity and minimal toxicity Fig. S2(C-D), similar observations can be obtained (e.g. human papillomavirus infection is prominent in PDTs with minimal toxicity). These results suggest that MMONCP can provide different functional targets for the precise treatment of cancer.

It is worth noting that LUSC and LUAD have different clinical stage information.The specific results are shown in Fig S3. We can find that the average difference of activity curve of LUSC and LUAD has a maximum (minimum) value in stage ib and iia, respectively. Meanwhile the average difference of toxicity curve of LUSC and LUAD has a minimum value in stage ib and ia, respectively. These results demonstrate that average difference of activity curve varies in different cancer datasets and the similar prediction results are observed in BRCA and LUSC datasets.

Analysis of algorithm performance

Comparison of constrained multimodal multiobjective evolutionary algorithms

To demonstrate the advantages of MMONCP for optimizing PDTs compared with several other methods, HV and IGD are calculated. The average HV and IGD values on each dataset are presented in Fig. 7. The result of CMMODE is 0 in Fig. 7(A–C) because this algorithm fails to find feasible solutions. Therefore, CMMODE is removed in Fig. 7(D–F). From the results of Fig. 7, it can be seen that MMONCP has higher HV and lower IGD values under the framework of MDS and NCUA than other algorithms. Under the framework of DFVS, MMONCP is more efficient than most algorithms, except LSCV_MCEA. This is mainly because there are intermediate variables (Inline graphic and Inline graphic in Eq. (12)) and DFVS focuses on directed networks, leading to a more complicated problem under the framework of DFVS which needs to achieve a more proper balance between the diversity of the decision space and objective space.

Figure 7.

Figure 7

The box plot of HV and IGD for MMONCP and other evolutionary algorithms.

To include specific numerical improvements, we conducted rank sum tests on the results of various methods HV and IGD at a significance level of 0.05 to perform statistical analysis, shown in the Tables SI–SVI. We can find that MMONCP almost has higher HV or lower IGD. In 112 BRCA patients, 49 LUSC patients, and 57 LUAD patients than other CMMOEAs under the framework of MDS and NCUA. For example, under the frame of MDS, the mean HV of CDN-NSGA-II, CMP-MMEA, CMMOCE, CMMODE, LSCV_MCEA and MMONCP are 0.343, 0.230, 0.391, 0.000, 0.540 and 0.545 on BRCA. On LUSC data, the mean HV of these algorithms are 0.347, 0.231, 0.389, 0.000, 0.542 and 0.551. On LUAD data, the mean HV of these algorithms are 0.346, 0.231, 0.385, 0.000, 0.541 and 0.551. There are no feasible solutions in CMMODE so the HV is set to 0.To further investigate the multimodality of MMONCP, MSV is used in this work by calculating the proportion of PDTs in all individual patients. The results are illustrated in Fig. 8. Figure 8 demonstrates that MMONCP can provide more MDTs than other methods, especially in MDS and NCUA. However, LSCV_MCEA performs better than MMONCP under the framework of DFVS because there are more solutions in the true PF (Inline graphic has a larger value than that in MMONCP). The results demonstrate that MMONCP generally provides notable benefits compared with other methods in effectively optimizing PDTs, which helps to explore the multimodal characteristics of PDTs.We have compared calculation time of different algorithms on three cancer data. All the experiments used a specialized computer with Intel(R) Xeno(R) Gold 6230 CPU and 640 GB RAM. The results of the comparison are presented in Figure S3, which leads to the following conclusions: The results of the comparison are presented in Fig. S4, which leads to the following conclusions: As shown in Fig. S4, under the framework of MDS, NCUA, and DFVS, although our MMONCP have slight worse performance compared with CMMOCEA, CMPMMEA, and CMMODE, the computation time of these algorithms remains at the same order of magnitude.

Figure 8.

Figure 8

The boxplot of MSV for MMONCP and other evolutionary algorithms.

Ablation experiments

MMONCP is designed to optimize PDTs in the PGIN by using specific strategies. There are two main strategies in MMONCP, including a combined GLS strategy and WSCD. In order to validate the efficacy of the two parts, two variants of MMONCP are generated: one is MMONCP using traditional special crowding distance instead of WSCD, denoted as Inline graphic, and the other is MMONCP with common decision space based on the niching method instead of using combined GLS strategy, denoted as Inline graphic. The convergence and diversity of the two variants of MMONCP are compared by calculating HV and IGD. As shown in Fig. 9, MMONCP has a higher HV and lower IGD values than Inline graphic and Inline graphic, especially in MDS and DFVS, indicating that combined GLS strategy and WSCD are effective in solving MMONCP.

Figure 9.

Figure 9

Ablation experiments of MMONCP with other two variants.

In addition, we verify whether these two strategies are effective for MMONCP to find PDTs, shown in Fig. 10. We can find that MMONCP obtains more multimodal solutions in almost all cases, except under the framework of DFVS on the LUAD dataset. The results demonstrate that MMONCP can optimize PDTs effectively. For example, compared with MMONCP under the framework of MDS, the mean HV of MMONCP without the combined global and local search strategy and the WSCD dropped by about 0.008 and 0.013 on BRCA, respectively.

Figure 10.

Figure 10

The comparison results of MSV indicator.

The statistical insights into the characteristics of the network

The characteristics of the network is considered to help the reader better understand the underlying network structure and its influence on the outcomes. Firstly, we considered the frequency of each node identified in Pareto Sets as the controllability of MDS, NCUA, and DFVS. Then the Pearson correlation coefficient (PCC) between the controllability of different methods and Closeness Centrality (CC), Betweenness Centrality (BC), Degree Centrality (DC), Local Clustering Coefficient(LCC) was calculated and shown in Fig. S5. From Fig. S5, we can obtain the following results and conclusions, (i) for the controllability of MDS, Closeness Centrality (CC) has the larger positive correlation coefficient than other network structure parameters. Meanwhile for the controllability of DFVS and NCUA, the Degree Centrality (DC) has the larger positive correlation coefficient. These results show that Closeness Centrality (CC) has relative larger influence on the controllability of MDS while Degree Centrality (DC) has relative larger influence on the controllability of DFVS and NCUA. (ii) The influence of network structure on the controllability varies for different cancer individual patients. For example, for MDS controllability, Degree Centrality (DC) has positive correlation coefficient in BRCA cancer datasets while Degree Centrality (DC) has negative correlation coefficient in LUNG cancer datasets.

Conclusion

This study proposes a multimodal multiobjective optimization based network control framework MMONCP to optimize PDTs by considering multiple equivalent drug targets with the same information of prior-known drug targets and number of driver nodes but differing in gene or protein configurations. Our study is the first time to introduce the evolutionary multimodal muti-objective optimization into the field of network control principles. To solve MMONCP, a novel evolutionary algorithm, called CMMOEA-GLS-WSCD, is designed to optimize PDTs that can support drug targets in clinical applications. Meanwhile, a multitask framework is applied to CMMOEA-GLS-WSCD, which includes two local auxiliary tasks with a single-objective. Besides, a combined GLS strategy is developed to improve the search convergence in objective and decision space. Additionally, the WSCD is proposed to balance the diversity of objective space and decision space where the traditional special crowding distance is improved by dynamically weighting the distance in the decision space and the distance in the objective space.

The advantages of MMONCP over other algorithms are verified on BRCA, LUSC, and LUAD. The results indicate that MMONCP achieves better performance in convergence and diversity than other algorithms. Then, the advantages of MMONCP in finding PDTs are verified, especially in MDS and NCUA. In comparison with certain advanced algorithms, MMONCP have a better performance in effectively optimizing the PDTs for predicting clinical combinatorial drugs. Furthermore, our MMONCP can detect the early state from the difference in target activity and toxicity of MDTs in the BRCA cancer dataset. To sum up, this study provides decision-makers with additional therapeutic options for the early treatment of cancer patients. In future work, we will design more effective strategies to optimize PDTs under the framework of MMONCP. Also,

Accurate prediction of drug–target interactions is is a very promising research direction for drug development, yet current deep learning models face challenges in performance and false negatives [3]. Several models like EADTN [51] and MRBDTA [52] have been proposed, incorporating techniques such as ensemble modeling, feature adaptation, and transformer-based architectures. In the future, multimodal multiobjective optimization models could provide clues for enhancing prediction accuracy, interpretability, and reliability in in predicting interactions and aiding drug repurposing, with validation in both experiments and case studies [4]. In addition, wet-lab experiments (maybe animal studies) in real-world drug application scenarios are persuasive, compared with other methods.

Key Points

  • The problem of MMONCP in individual patients is formulated. MMONCP aims to find multiple equivalent sets of PDTs with the same information of prior-known drug targets and the number of driver nodes but different configurations.

  • To solve MMONCP, a novel CMMOEA called CMMOEA-GLS-WSCD is developed, where a combined GLS method is proposed to achieve a balance of GLS, and WSCD is developed to balance the diversity of the objective space and decision space.

  • The experimental results on datasets obtained from TCGA show that MMONCP outperforms current advanced methods including algorithm convergence, diversity, and the area under the curve score of enriching clinical combinatoral drugs. Furthermore our MMONCP can detect the efficient functional drug targets of early state in stage ia.

Supplementary Material

SupplementaryMaterials1205_bbaf007

Acknowledgments

We thank all members of Professor Jing Liang’s lab from School of Electrical Engineering, Zhengzhou University for giving us useful suggestions during the preparation of our manuscript.

Contributor Information

Jing Liang, School of Electrical and Information Engineering, Zhengzhou University, No. 100, Science Avenue, Hightech District, Zhengzhou City 450001, Henan Province, China; State Key Laboratory of Intelligent Agricultural Power Equipment, No. 39, Xiyuan Road, Jianxi District, Luoyang City 471039, Henan Province, China.

Zhuo Hu, School of Electrical and Information Engineering, Zhengzhou University, No. 100, Science Avenue, Hightech District, Zhengzhou City 450001, Henan Province, China.

Ying Bi, School of Electrical and Information Engineering, Zhengzhou University, No. 100, Science Avenue, Hightech District, Zhengzhou City 450001, Henan Province, China.

Han Cheng, School of Life Sciences, Zhengzhou University, No. 100, Science Avenue, High-tech District, Zhengzhou City 450001, Henan Province, China.

Wei-Feng Guo, School of Electrical and Information Engineering, Zhengzhou University, No. 100, Science Avenue, Hightech District, Zhengzhou City 450001, Henan Province, China; State Key Laboratory of Intelligent Agricultural Power Equipment, No. 39, Xiyuan Road, Jianxi District, Luoyang City 471039, Henan Province, China.

 

Conflict of interest: The authors have declared no competing interests.

Funding

This paper was supported by the National Natural Science Foundation of China (62476253,62002329), and Henan Province Natural Science Foundation (242300421401), and China postdoctoral science foundation (2021M692915), and Open Research Fund of State Key Laboratory of Digital Medical Engineering (2024-K07), and Frontier Exploration Projects of Longmen Laboratory (LMQYTSKT031), and Henan Province Scientific Research Joint Fund Industrial Category Major Program (245101610001).

References

  • 1. Guo  W-F, Zhang  S-W, Feng  Y-H. et al.  Network controllability-based algorithm to target personalized driver genes for discovering combinatorial drugs of individual patients. Nucleic Acids Res  2021;49:e37–7. 10.1093/nar/gkaa1272. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Guo  W-F, Yu  X, Shi  Q-Q. et al.  Performance assessment of sample-specific network control methods for bulk and single-cell biological data analysis. PLoS Comput Biol  2021;17:e1008962. 10.1371/journal.pcbi.1008962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Chen  X, Yan  CC, Xiaotian Zhang  X. et al.  Drug–target interaction prediction: Databases, web servers and computational models. Brief Bioinform  2016;17:696–712. 10.1093/bib/bbv066. [DOI] [PubMed] [Google Scholar]
  • 4. Zhou  Z, Liao  Q, Wei  J. et al.  Revisiting drug–protein interaction prediction: A novel global–local perspective. Bioinformatics  2024;40:btae271. 10.1093/bioinformatics/btae271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Guo  W-F, Zhang  S-W, Liu  L-L. et al.  Discovering personalized driver mutation profiles of single samples in cancer by network control strategy. Bioinformatics  2018;34:1893–903. 10.1093/bioinformatics/bty006. [DOI] [PubMed] [Google Scholar]
  • 6. Guo  W-F, Zhang  S-W, Zeng  T. et al.  A novel network control model for identifying personalized driver genes in cancer. PLoS Comput Biol  2019;15:e1007520. 10.1371/journal.pcbi.1007520. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Kuijjer  ML, Tung  MG, Yuan  GC. et al.  Estimating sample-specific regulatory networks. Iscience  2019;14:226–40. 10.1016/j.isci.2019.03.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Liu  X, Wang  Y, Ji  H. et al.  Personalized characterization of diseases using sample-specific networks. Nucleic Acids Res  2016;44:e164–4. 10.1093/nar/gkw772. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Guo  W-F, Zhang  S-W, Zeng  T. et al.  Network control principles for identifying personalized driver genes in cancer. Brief Bioinform  2020;21:1641–62. 10.1093/bib/bbz089. [DOI] [PubMed] [Google Scholar]
  • 10. Liu  Y-Y, Slotine  J-J, Barabási  A-L. Controllability of complex networks. Nature  2011;473:167–73. 10.1038/nature10011. [DOI] [PubMed] [Google Scholar]
  • 11. Nacher  JC, Akutsu  T. Dominating scale-free networks with variable scaling exponent: Heterogeneous networks are not difficult to control. New J Phys  2012;14:073005. 10.1088/1367-2630/14/7/073005. [DOI] [Google Scholar]
  • 12. Zañudo  JGT, Yang  G, Albert  R. Structure-based control of complex networks with nonlinear dynamics. Proc Natl Acad Sci  2017;114:7234–9. 10.1073/pnas.1617387114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Liang  J, Zhuo  H, Li  Z-W. et al.  Multiobjective optimization-based network control principles for identifying personalized drug targets with cancer. IEEE Trans Evol Comput  2024;28:1322–35. 10.1109/TEVC.2023.3303958. [DOI] [Google Scholar]
  • 14. Qiao  K, Liang  J, Guo  W-F. et al.  Knowledge-embedded constrained multiobjective evolutionary algorithm based on structural network control principles for personalized drug targets recognition in cancer. Inform Sci  2024;679:121033. 10.1016/j.ins.2024.121033. [DOI] [Google Scholar]
  • 15. Qiao  K, Liang  J, Guo  W-F. et al.  Constrained multi-objective optimization-based temporal network observability for biomarker identification of individual patients. IEEE J Biomed Health Inform  2024;PP:1–12. 10.1109/JBHI.2024.3435418. [DOI] [PubMed] [Google Scholar]
  • 16. Yuan  J, Liu  H-L, Ong  Y-S. et al.  Indicator-based evolutionary algorithm for solving constrained multiobjective optimization problems. IEEE Trans Evol Comput  2021;26:379–91. 10.1109/TEVC.2021.3089155. [DOI] [Google Scholar]
  • 17. Deng  W, Zhang  X, Zhou  Y. et al.  An enhanced fast non-dominated solution sorting genetic algorithm for multi-objective problems. Inform Sci  2022;585:441–53. 10.1016/j.ins.2021.11.052. [DOI] [Google Scholar]
  • 18. Lin  W, Shen  Y, Li  M. et al.  Network output controllability-based method for drug target identification. IEEE Trans Nanobiosci  2015;14:184–91. 10.1109/TNB.2015.2391175. [DOI] [PubMed] [Google Scholar]
  • 19. Guo  W-F, Zhang  S-W, Wei  Z-G. et al.  Constrained target controllability of complex networks. J Stat Mech: Theory Exp  2017;2017:063402. 10.1088/1742-5468/aa6de6. [DOI] [Google Scholar]
  • 20. Li  M, Gao  H, Wang  J. et al.  Control principles for complex biological networks. Brief Bioinform  2019;20:2253–66. 10.1093/bib/bby088. [DOI] [PubMed] [Google Scholar]
  • 21. Liang  J, Lin  H, Yue  C. et al.  Multiobjective differential evolution with speciation for constrained multimodal multiobjective optimization. IEEE Trans Evol Comput  2023;27:1115–29. 10.1109/TEVC.2022.3194253. [DOI] [Google Scholar]
  • 22. Ming  F, Gong  W, Yang  Y. et al.  Constrained multimodal multi-objective optimization: Test problem construction and algorithm design. Swarm Evol Comput  2023;76:101209. 10.1016/j.swevo.2022.101209. [DOI] [Google Scholar]
  • 23. Tan  KC, Feng  L, Jiang  M. Evolutionary transfer optimization-a new frontier in evolutionary computation research. IEEE Comput Intell Mag  2021;16:22–33. 10.1109/MCI.2020.3039066. [DOI] [Google Scholar]
  • 24. Tian  Y, Zhang  T, Xiao  J. et al.  A coevolutionary framework for constrained multiobjective optimization problems. IEEE Trans Evol Comput  2020;25:102–16. 10.1109/TEVC.2020.3004012. [DOI] [Google Scholar]
  • 25. Qiao  K, Chen  Z, Boyang  Q. et al.  A dual-population evolutionary algorithm based on dynamic constraint processing and resources allocation for constrained multi-objective optimization problems. Expert Syst Appl  2024;238:121707. 10.1016/j.eswa.2023.121707. [DOI] [Google Scholar]
  • 26. Li  K, Chen  R, Guangtao  F. et al.  Two-archive evolutionary algorithm for constrained multiobjective optimization. IEEE Trans Evol Comput  2018;23:303–15. 10.1109/TEVC.2018.2855411. [DOI] [Google Scholar]
  • 27. Qiao  K, Kunjie  Y, Boyang  Q. et al.  Dynamic auxiliary task-based evolutionary multitasking for constrained multiobjective optimization. IEEE Trans Evol Comput  2023;27:642–56. 10.1109/TEVC.2022.3175065. [DOI] [Google Scholar]
  • 28. Fan  Z, Li  W, Cai  X. et al.  Push and pull search for solving constrained multi-objective optimization problems. Swarm Evol Comput  2019;44:665–79. 10.1016/j.swevo.2018.08.017. [DOI] [Google Scholar]
  • 29. Liu  Z-Z, Wang  Y. Handling constrained multiobjective optimization problems with constraints in both the decision and objective spaces. IEEE Trans Evol Comput  2019;23:870–84. 10.1109/TEVC.2019.2894743. [DOI] [Google Scholar]
  • 30. Deb  K, Pratap  A, Agarwal  S. et al.  A fast and elitist multiobjective genetic algorithm: Nsga-ii. IEEE Trans Evol Comput  2002;6:182–97. 10.1109/4235.996017. [DOI] [Google Scholar]
  • 31. Zhang  Q, Li  H. Moea/d: A multiobjective evolutionary algorithm based on decomposition. IEEE Trans Evol Comput  2007;11:712–31. 10.1109/TEVC.2007.892759. [DOI] [Google Scholar]
  • 32. Liang  JJ, Yue  CT, Bo-Yang  Q. Multimodal multi-objective optimization: A preliminary study. In: 2016 IEEE Congress on Evolutionary Computation (CEC), pp. 2454–61. IEEE, 2016. [Google Scholar]
  • 33. Yue  C, Boyang  Q, Liang  J. A multiobjective particle swarm optimizer using ring topology for solving multimodal multiobjective problems. IEEE Trans Evol Comput  2017;22:805–17. 10.1109/TEVC.2017.2754271. [DOI] [Google Scholar]
  • 34. Li  W, Yao  X, Zhang  T. et al.  Hierarchy ranking method for multimodal multiobjective optimization with local pareto fronts. IEEE Trans Evol Comput  2022;27:98–110. 10.1109/TEVC.2022.3155757. [DOI] [Google Scholar]
  • 35. Qiuzhen Lin  W, Lin  ZZ, Gong  M. et al.  Multimodal multiobjective evolutionary optimization with dual clustering in decision and objective spaces. IEEE Trans Evol Comput  2020;25:130–44. 10.1109/TEVC.2020.3008822. [DOI] [Google Scholar]
  • 36. Liang  J, Qiao  K, Yue  C. et al.  A clustering-based differential evolution algorithm for solving multimodal multi-objective optimization problems. Swarm Evol Comput  2021;60:100788. 10.1016/j.swevo.2020.100788. [DOI] [Google Scholar]
  • 37. Liang J, Zhang Y-Y, Qu B-Y. et al.  A multiform optimization framework for multi-objective feature selection in classification. IEEE Transactions on Evolutionary Computation 2024;28:1024–38. [Google Scholar]
  • 38. Liang J, Li Z-W, Yue C-T. et al.  Multi-modal optimization to identify personalized biomarkers for disease prediction of individual patients with cancer. Brief Bioinform 2022;23:bbac254. 10.1093/bib/bbac254. [DOI] [PubMed] [Google Scholar]
  • 39. Liang J, Li Z-W, Sun Z-N. et al.  Latent space search based multimodal optimization with personalized edge-network biomarker for multi-purpose early disease prediction. Brief Bioinform 2023;24:bbad364. 10.1093/bib/bbad364. [DOI] [PubMed] [Google Scholar]
  • 40. Quan  Y, Liu  M-Y, Liu  Y-M. et al.  Facilitating anti-cancer combinatorial drug discovery by targeting epistatic disease genes. Molecules  2018;23:736. 10.3390/molecules23040736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Qiao  K, Liang  J, Kunjie  Y. et al.  Evolutionary constrained multiobjective optimization: Scalable high-dimensional constraint benchmarks and algorithm. IEEE Trans Evol Comput  2024;28:965–79. 10.1109/TEVC.2023.3281666. [DOI] [Google Scholar]
  • 42. Tian  Y, Liu  R, Zhang  X. et al.  A multipopulation evolutionary algorithm for solving large-scale multimodal multiobjective optimization problems. IEEE Trans Evol Comput  2020;25:405–18. 10.1109/TEVC.2020.3044711. [DOI] [Google Scholar]
  • 43. Reimand J, Bader GD. Systematic analysis of somatic mutations in phosphorylation signaling predicts novel cancer drivers. Mol Syst Biol 2013;9:637. 10.1038/msb.2012.68. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Gonzalez-Perez  A, Lopez-Bigas  N. Functional impact bias reveals cancer drivers. Nucleic Acids Res  2012;40:e169–9. 10.1093/nar/gks743. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Han  Y, Yang  J, Qian  X. et al.  Driverml: A machine learning algorithm for identifying driver genes in cancer sequencing studies. Nucleic Acids Res  2019;47:e45–5. 10.1093/nar/gkz096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Derrac  J, García  S, Molina  D. et al.  A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms. Swarm Evol Comput  2011;1:3–18. 10.1016/j.swevo.2011.02.002. [DOI] [Google Scholar]
  • 47. Zitzler  E, Thiele  L. Multiobjective evolutionary algorithms: A comparative case study and the strength pareto approach. IEEE Trans Evol Comput  1999;3:257–71. 10.1109/4235.797969. [DOI] [Google Scholar]
  • 48. Bosman  PAN, Thierens  D. The balance between proximity and diversity in multiobjective evolutionary algorithms. IEEE Trans Evol Comput  2003;7:174–88. 10.1109/TEVC.2003.810761. [DOI] [Google Scholar]
  • 49. Agrawal  S, Tiwari  A, Yaduvanshi  B. et al.  Feature subset selection using multimodal multiobjective differential evolution. Knowledge-Based Syst  2023;265:110361. 10.1016/j.knosys.2023.110361. [DOI] [Google Scholar]
  • 50. Garcia-Alonso  L, Holland  CH, Ibrahim  MM. et al.  Benchmark and integration of resources for the estimation of human transcription factor activities. Genome Res  2019;29:1363–75. 10.1101/gr.240663.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Wei  J, Zhu  Y, Zhuo  L. et al.  Efficient deep model ensemble framework for drug-target interaction prediction. J Phys Chem Lett  2024;15:7681–93. 10.1021/acs.jpclett.4c01509. [DOI] [PubMed] [Google Scholar]
  • 52. Zhang L, Wang C-C, Chen X. Predicting drug–target binding affinity through molecule representation block based on multi-head attention and skip connection. Brief Bioinform 2022;23:bbac468. 10.1093/bib/bbac468. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

SupplementaryMaterials1205_bbaf007

Articles from Briefings in Bioinformatics are provided here courtesy of Oxford University Press

RESOURCES