Combining greedy and evolutionary algorithms to maximize influence in networks under deterministic linear threshold model

Alexander Andreev; Stepan Kochemazov; Alexander Semenov

doi:10.1371/journal.pone.0331109

. 2025 Sep 8;20(9):e0331109. doi: 10.1371/journal.pone.0331109

Combining greedy and evolutionary algorithms to maximize influence in networks under deterministic linear threshold model

Alexander Andreev ¹, Stepan Kochemazov ², Alexander Semenov ^1,^*

Editor: Yangming Zhou³

PMCID: PMC12416658 PMID: 40920803

Abstract

In the paper we consider the well-known Influence Maximization (IM) and Target Set Selection (TSS) problems for Boolean networks under Deterministic Linear Threshold Model (DLTM). The main novelty of our paper is that we state these problems in the context of pseudo-Boolean optimization and solve them using evolutionary algorithms in combination with the known greedy heuristic. We also propose a new variant of (1 + 1)-Evolutionary Algorithm, which is designed to optimize a fitness function on the subset of the Boolean hypercube comprised of vectors of a fixed Hamming weight. The properties of this algorithm suit well for solving IM. The proposed algorithm is combined with the greedy heuristic for solving IM and TSS: the latter is used to construct initial solutions. We show that the described hybrid algorithms demonstrate significantly better performance compared to the computational scheme combining the greedy heuristic with the classic variant of (1 + 1)-EA. In the experiments, the proposed algorithms are applied to both real-world networks and the random networks constructed with respect to well-known models of random graphs. The results show that the new algorithms outperform the competition and are applicable to TSS and IM under DLTM for networks with tens of thousands of vertices.

Introduction

Network science is a very fruitful and intensively developed area which features a lot of high quality publications. They can be broadly categorized into studies that analyze the properties related to network topology and papers that consider the properties of functions defined by networks. The former typically consider the so-called critical phenomena in networks and in this context they study various qualitative properties such as clustering, various consequences of the connectivity, such as a small world effect, degrees distributions, etc., as well as the models of random graphs that can be used to generate networks with desired properties. Since these questions do not constitute the main object of our study, let us cite in this context the reviews [1,2] and the book [3].

The scope of the present paper belongs to the second class of network science studies: the one that considers an existing network and analyzes various dynamic processes which occur inside said network. Thus the problems considered in the paper can be viewed as simple examples of agent-based modeling. Namely, we consider a network which models some collective: the network vertices interpret the members of collective (agents) and the edges interpret the connections between agents. With each network agent there is associated a special function that defines the degree of activity of this agent at each discrete time moment. The process in question models information dissemination in such a network, which can often be viewed as an activation process.

Among the central problems in the formulated context are the Influence Maximization problem (IM) and the related Target Set Selection problem (TSS). In just a few recent years hundreds of papers on these problems have been published, see e.g. [4–25]. In Sect 5 we will briefly review the main results achieved in these works. We would like to specifically note that the seminal paper [4] remains one of the main works on IM for more than 20 years. In that paper they performed the analysis of a number of key properties of activation processes in networks, and proved the combinatorial nature of both IM and TSS (e.g. showed that both are NP-hard). Also, in [4] it was demonstrated that under some models of information dissemination there exists an approximation algorithm for solving IM with constant multiplicative approximation factor of 1–e⁻¹. This is the case for the so-called Linear Threshold Model and Independent Cascade Model. Both models are not deterministic in the base case, as the value of the objective function (influence in terms of [4]) is defined using some random variables. What is interesting, however, is that it is the non-deterministic nature of these models that is the reason for the existence of a 1–e⁻¹-approximation algorithm for IM. In addition to that, [4] showed that under the deterministic variant of Linear Threshold Model (to which we further refer as to DLTM) IM already has tight arguments for inapproximability: specifically, if there exists an approximation algorithm with multiplicative constant factor for IM under DLTM, then P = NP.

Recall, that under DLTM the thresholds of vertices are fixed and therefore the influence spread follows a completely deterministic scenario. It means that its value is determined as the number of active vertices in the network at the moment when the activation process stops, or more precisely, when the activation algorithm does not make any inactive vertices active. DLTM looks to be quite reasonable since in applied scenarios the vertices’ thresholds usually have entirely practical interpretation and thus can be measured, similar to how it is done in the known threshold models of collective behavior [26].

Taking into account the high structural complexity of IM and TSS, the design of computational algorithms to solve these problems is a relevant direction of research. In the present paper, we propose several novel algorithms for finding an approximate solution for IM and TSS. In particular, we use as the basis of the proposed approaches the evolutionary algorithms which are applicable to solving optimization problems from a wide class, including the search for effective combinations of hyperparameters in Artificial Intelligence [27–29], etc. To justify the good accuracy of the proposed methods we find the exact solutions of network activation problems for networks of small size by reducing them to combinatorial problems, in particular to the Boolean satisfiability problem (SAT). The latter is solved using state-of-the-art SAT solvers which are actively used in combinatorial optimization and in the recent years find increasing use in the area of Explainable AI, see [30,31], etc.

The main novelty of our approach to IM and TSS solving is the combination of greedy algorithms with evolutionary algorithms and some additional heuristics. Specifically, the main contributions of our approach are the following:

We consider IM and TSS under DLTM in the context of the general pseudo-Boolean optimization problem which makes it possible to apply a wide variety of algorithms for their solving, including evolutionary algorithms;
We describe a new variant of (1 + 1)-Evolutionary algorithm ((1 + 1)-EA), which is designed to optimize a pseudo-Boolean fitness function on a subset of the Boolean hypercube of dimension n formed by the vectors of fixed size $k : 0 < k < n$ and apply this algorithm to IM and TSS;
Using the networks of small size and reduction of TSS to the Boolean Satisfiability problem (SAT) we experimentally show that the metaheuristic algorithms for IM and TSS often make it possible to find solutions that are close to exact solutions;
We propose the hybrid strategy that combines the well-known greedy heuristic with evolutionary algorithms and show that it is applicable to solving IM and TSS under DLTM for real-world networks with tens of thousands of vertices.

Let us give a brief outline of our paper. The following Sect 1 is the preliminaries which contains the main notions, some auxiliary results, and also describes basic principles according to which evolutionary algorithms can be employed to optimize pseudo-Boolean fitness functions. Next Sect 2 contains the description of the (1 + 1)-Weighted Evolutionary Algorithm designed to optimize arbitrary pseudo-Boolean fitness functions over the subset of the Boolean hypercube formed by vectors of fixed Hamming weight. Further this algorithm together with several other evolutionary algorithms is used to solve IM and TSS. In Sect 3 we state IM and TSS under DLTM in form of pseudo-Boolean optimization problems, and describe a number of auxiliary heuristics used together with evolutionary algorithms when solving IM and TSS. The following Sect 4 presents the results of two series of computational experiments. In the first one we use networks of small size and reduction of TSS to SAT to construct exact solutions and show that the proposed metaheuristic algorithms often find either exact solutions or get very close to them. In the second series, we demonstrate the practical applicability of proposed algorithms to solving IM and TSS under DLTM for real-world networks from the SNAP repository [32] with thousands and tens of thousands of vertices. The paper is concluded with Sect 5 containing a brief review of related works and Sect 6 with conclusions and discussion of future work.

1 Preliminaries

It is convenient to consider DLTM in the context of the so-called Synchronous Boolean Networks (SBN), introduced by Stuart Kauffman in his seminal paper [33].

1.1 Synchronous Boolean networks and DLTM

Let us consider an arbitrary network as a directed labeled graph $G = (V, A, L)$ , where V is the set of vertices, A is the set of directed edges called arcs, and L is the set of labels associated with vertices and arcs. Assume that G does not have loops or multiple arcs. For an arbitrary vertex $v \in V$ we say that the vertex $u \in V$ is a neighbor of v if $(u, v) \in A$ , i.e. if there is an arc in G that goes from u to v. The set of all neighbors of the vertex v is called the neighborhood of this vertex, which we denote as $U_{v}$ .

Hereinafter, we suppose that each vertex from G is associated with a symbol from the set of size 2, which we denote by ${0, 1}$ . If a vertex v is associated with the number 1 then we say that v is active, otherwise v is assigned with the number 0 and is inactive. Apparently, it was S. Kauffman who proposed for the first time in his work [33] to associate with G the parameter t which takes the values in the set ${0, 1, 2, \dots}$ and represents discrete time. The set of numbers from ${0, 1}$ connected to each network vertex at some time t forms a Boolean vector of length n, $n = | V |$ . In this case, as it was shown by S. Kauffman, such Boolean vectors associated with the considered network at moments $t \in {0, 1, 2, \dots}$ can be viewed as the states of a Discrete Dynamical System (DDS), in which the transitions between the states are specified by the deterministic rules, that do not depend on the specific time moment. S. Kauffman viewed the value from ${0, 1}$ associated with vertex v at moment t + 1, t ≥ 0 as the value of some Boolean function of arity $| U_{v} |$ which takes as an input the values assigned to vertices from $U_{v}$ at time moment t. Therefore, all coordinates of the Boolean vector of length n representing the network state at time moment $t + 1$ are recomputed synchronously as the functions of the coordinates of the vector corresponding to the network state at time t. The described model is known as Synchronous Boolean network or Kauffman network. We will refer to the Boolean functions associated with network vertices the way outlined above as weight functions.

Since the rules that define the transitions between the states of Boolean network are deterministic, then the DDS defined this way, to which we further refer as DDS G, has a finite number of states. It means that, given the countable number of time states, such a DDS will traverse the same states multiple times. The situations of such a sort are defined as attractors or cycles. More strictly, let us represent the state of DDS G at an arbitrary time moment t by a Boolean vector of length n denoted as $α (t)$ . Consider the set of states $α (t), α (t + 1), \dots, α (t + j)$ such that j is the minimal possible number for which it holds that $α (t) = α (t + j)$ . In this case, we say that this set of states forms the cycle (attractor) of length j. If the states $α (t), α (t + 1)$ form a cycle of length 1, i.e. $α (t) = α (t + 1)$ , then the state $α (t)$ is called a fixed point of a considered DDS.

Hereinafter, by {0,1}ⁿ denote the set comprised of all binary vectors of length n. This set essentially is a Boolean hypercube of dimension n. Due to the deterministic nature of the weight functions of the vertices, we can view all 2ⁿ different Boolean vectors from {0,1}ⁿ as the states of DDS G. These states can be represented by the vertices of a directed graph called State Transition Graph (STG) and denoted by $Γ_{G}$ . An arc (a,b) in $Γ_{G}$ interprets the transition from the state $a = α (t)$ of DDS G to the state $b = α (t + 1)$ . For two arbitrary states $α (t), α (t + j)$ , $t \geq 0$ , $j \geq 1$ , such that there exists a path from $α (t)$ to $α (t + j)$ in $Γ_{G}$ we will say that $α (t + j)$ is reachable from $α (t)$ . All these notions are illustrated on Fig 1. For example, state (0110) is a fixed point. Coincidentally, it is reachable from all other states. On the other hand, state (0000) is not reachable from any other state.

It is possible to specify the weight functions of the vertices using different ways: by truth tables (as in [33]), by Boolean formulas (as in Fig 1) or by some predicates that express the relations between the corresponding elements (typically, in numeric form).

Further, we will mostly deal with DDS G, in which each arc (u,v) is assigned a non-negative rational number $B_{(u, v)}$ . For correctness, assume that for each v at least one of the numbers $B_{(u, v)}$ is positive. Let us define the number $b_{(u, v)}$ as follows: $b_{(u, v)} = \frac{B_{(u, v)}}{\sum_{u \in U_{v}} B_{(u, v)}}$ . Let us connect with an arbitrary vertex v some fixed number θ_v ∈ [0, 1], called threshold of v. Now, we define the weight function of vertex $v \in V$ :

f_{v} (t + 1) = {\begin{matrix} 1, & \sum_{u \in U_{v}} b_{(u, v)} \cdot f_{u} (t) \geq θ_{v} \\ 0, & otherwise \end{matrix}

(1)

The state of DDS G at the moment t = 0 is called initial and is specified separately. One can use any vector from {0,1}ⁿ as an initial. A vertex v is called active at moment t, if $f_{v} (t) = 1$ , otherwise it is called inactive at this moment.

DDS G defined this way corresponds to the Deterministic Linear Threshold Model (DLTM) in the sense of [4].

1.2 IM and TSS w.r.t. DLTM

The following theorem is widely known and it follows from the fact that at time moment $t + 1$ the number of active vertices in the network is greater or equal than the number of active vertices at moment t. The latter is, in turn, ensured by the properties of weight functions (1).

Theorem 1. For an arbitrary DDS G defined above with weight functions of the kind (1) it holds that for any initial state $α (0)$ there exists such $t \in {0, 1, 2, \dots, n}$ that $α (t)$ , reachable from $α (0)$ is a fixed point of DDS G.

Hereinafter, for an arbitrary $α (0) \in {0, 1}^{n}$ we will be interested in the smallest t such that $α (t)$ is a fixed point reachable from $α (0)$ . Let us denote the corresponding fixed point as $\overset{―}{α}$ . We will refer to the transition of DDS G from state $α (0)$ to a fixed point $\overset{―}{α}$ as to activation process.

Definition 1. Consider an arbitrary DDS G w.r.t. DLTM defined above. For any vector $α \in {0, 1}^{n}$ define the influence $σ (α)$ as the number of active vertices in G at fixed point $\overset{―}{α}$ reachable from the initial state specified by α.

For an arbitrary vector $α \in {0, 1}^{n}$ denote by $w t_{H} (α)$ the Hamming weight [34] of the considered vector. For fixed $k \in {1, \dots, n}$ denote by ${0, 1}_{k}^{n}$ the set of all $α \in {0, 1}^{n}$ s.t. $w t_{H} (α) = k$ .

Definition 2. Influence Maximization problem (IM) w.r.t. DLTM for specific value of parameter k consists in finding the following vector:

α^{*} = a r g m a x_{α \in {0, 1}_{k}^{n}} σ (α)

(2)

In other words, with each vector α from ${0, 1}_{k}^{n}$ one associates the value $σ (α)$ equal to the number of active vertices the network has in a fixed point reachable from α. We denote by $α^{*}$ such a vector that for any $α \in {0, 1}_{k}^{n}$ it holds that $σ (α^{*}) \geq σ (α)$ .

In the context of IM the set of vertices which are active in initial state α is often referred to as a target set. Thus, any vector from {0,1}ⁿ specifies some target set and vice versa. Taking this into account, we will use the term “target set" for an arbitrary $α \in {0, 1}^{n}$ considered as an initial state of DDS G.

The Target Set Selection (TSS) problem [4,35] is closely related to IM.

Definition 3. The TSS w.r.t. DLTM for the predefined number R, $1 \leq R \leq n$ consists in finding a target set α of minimal Hamming weight such that $σ (α) \geq R$ .

Note, that while in IM one searches for a vector of particular weight k which maximizes the value of σ over ${0, 1}_{k}^{n}$ , in TSS one is looking for a vector α of minimal weight in the whole hypercube {0,1}ⁿ, such that $σ (α) \geq R$ .

1.3 Greedy heuristics for IM and TSS w.r.t. DLTM

As said above, there exist effective approximate algorithms for solving IM w.r.t. non-deterministic variant of linear threshold model (to which we further refer as LTM), such as the one considered in [4].

The weight functions in the framework of LTM are the same as functions (1) but the thresholds of vertices are randomly chosen. In more detail, we consider a probabilistic experiment in which the sample space is [0,1]ⁿ (n-th Cartesian degree of the segment [0,1]). Each replica of [0,1] is connected to a separate vertex of network G and each [0,1] has a uniform distribution, according to which the threshold $θ_{v}$ for the corresponding vertex $v \in V$ is selected. For an arbitrary $α \in {0, 1}^{n}$ on [0,1]ⁿ a random variable $ξ (α)$ is defined, the value of which for a random set of thresholds from [0,1]ⁿ is equal to the number of active vertices in $\overset{―}{α}$ reached by DDS G from the initial state α. Influence $\tilde{σ} (α)$ of specific $α \in {0, 1}^{n}$ is defined as the expected value of $ξ (α)$ :

\tilde{σ} (α) = E [ξ (α)]

(3)

For any $k \in {1, \dots, n}$ IM w.r.t. the considered model consists in finding

α^{*} = a r g m a x_{α \in {0, 1}_{k}^{n}} \tilde{σ} (α)

As it was shown in [4], the function (3) is submodular and hence the simple greedy algorithm from [36] can be used to solve this variant of IM, and it approximates the desired optimal solution with constant multiplicative factor equal to 1–e⁻¹ in the following sense: if $α^{'}$ is the solution found by the greedy algorithm then the following inequality holds: $\tilde{σ} (α^{'}) \geq (1 - e^{- 1}) \cdot \tilde{σ} (α^{*})$ (It should be noted that this is valid only under the assumption that the exact values of function (3) are given by some oracle).

On the other hand, according to [4], the function $σ (\cdot)$ is in general not submodular and cannot be approximated with any constant multiplicative factor under the assumption that $P \neq N P$ . Therefore, it is reasonable to apply various heuristics and metaheuristics for solving IM under DLTM.

Apparently, the first algorithm for solving IM and TSS specifically under DLTM is the greedy heuristic proposed in [37]. Let us give its brief description. This algorithm is quite similar to the greedy algorithm from [4]: that is, it starts with an empty set and at each subsequent step one vertex is added to the current target set for which the influence measure of this vertex is maximal. To define this measure, the concept of residual threshold of an arbitrary vertex is used, which we give below.

Consider an arbitrary state of DDS G given by a vector $α \in {0, 1}^{n}$ . Let us associate the following value with α and an arbitrary vertex $v \in V$ :

Θ_{v} (α) = θ_{v} - \sum_{u \in U_{v}} b_{(u, v)} \cdot g_{u} (α)

(4)

The function $g_{u} (α)$ in (4) is defined as follows:

g_{u} (α) = {\begin{matrix} 1, & i f u is active in α \\ 0, & otherwise \end{matrix}

Definition 4. We will call the value given by the formula (4) a residual threshold of vertex v in state α.

Obviously, if $Θ_{v} (α) \leq 0$ , then v is active in state α, otherwise it is inactive in this state. Let us now describe the greedy heuristic which is a small modification of the algorithm proposed in [37]. Given the above, we will view an arbitrary target set as both a concrete set of vertices in the network G and as a Boolean vector of length n, where n is the number of vertices in G: the ones in such a vector mark vertices in target set under consideration.

Let us construct target sets of Hamming weight k, $k \in {0, 1, \dots, n}$ , using a strategy similar to the one employed in [4]. The values of k will correspond to the steps of the algorithm. At each step, the Hamming weight of the current target set is increased by 1 compared to the previous step. Denote the target set at step k by T_k. For k = 0, we have $T_{k} = \emptyset$ . To transition from T_k to $T_{k + 1}$ , we need to select some inactive vertex $v \in V ⧵ T_{k}$ and construct $T_{k + 1} = T_{k} \cup {v}$ . To choose such v, we compute the influence measure of vertices from $V ⧵ T_{k}$ .

In more detail, for each vertex $v \in V ⧵ T_{k}$ , we do the following:

Construct a probing target set ${\tilde{T}}_{k + 1} = T_{k} \cup {v}$
Denote by $\tilde{α}$ the state of DDS G corresponding to ${\tilde{T}}_{k + 1}$ ; considering $\tilde{α}$ as the initial state of DDS G, run activation process and obtain the fixed point $\overset{―}{α}$ ;
Select the set of $A_{v}$ vertices that were inactive in state $\tilde{α}$ but became active in state $\overset{―}{α}$ , and the set of $N_{v}$ vertices that remained inactive in state $\overset{―}{α}$ ;
For each vertex $u \in N_{v}$ let us calculate the following value:
$Δ Θ_{u} = Θ_{u} (\tilde{α}) - Θ_{u} (\overset{―}{α})$ (5)

Note that (5) is the value by which the residual threshold of vertex u decreases as we move from $\tilde{α}$ to $\overset{―}{α}$ . Consider the following value:
$I_{v} = | A_{v} | + \sum_{u \in N_{v}} \frac{Δ Θ_{u}}{θ_{u}}$ (6)

It is clear that (6) can be viewed as a measure of the v’s influence on the activation process: intuitively the larger the value of (6) is, the more v is involved in the process.
Let us go from T_k to $T_{k + 1} = T_{k} \cup {v^{*}}$ , where $v^{*} \in V ⧵ T_{k}$ is a vertex with the maximum value (6). For each $k \in {0, 1, \dots, n}$ , we will consider the Boolean vector that corresponds to the TS T_k constructed in the way described above as an approximation of $α^{*}$ defined according to (2).

The pseudocode of the algorithm is presented at Algorithm 1.

Algorithm 1. Greedy algorithm for solving IM/TSS.

graphic file with name pone.0331109.e511.jpg

Note that the algorithm from [37] differs from the one described above only in that in [37] the activation process is run for some fixed number of time moments each time (specified by one of the algorithm parameters), without necessarily reaching a fixed point.

1.4 Finding exact solution to TSS using SAT solvers

It is easy to see that for relatively small networks one can find exact solutions of IM and TSS using exact combinatorial algorithms. In the present paper, for this purpose we employ the algorithms for solving the Boolean Satisfiability problem (SAT) [38], to which we refer as SAT solvers. Note that DDS G under DLTM defines a discrete function of the kind

Φ_{G} : {0, 1}^{n} \to {0, 1}^{n}

(7)

where $Φ_{G}$ receives as an input an arbitrary $α \in {0, 1}^{n}$ , considered as the initial state of DDS G, and outputs a fixed point $\overset{―}{α}$ reachable from α. Obviously, if the binary representation of numbers of the kind $θ_{v}$ and $b_{(u, v)}$ do not depend on n or are bounded polynomially from n, then the function (7) is computable in polynomial time in n. In this case, due to the Cook-Levin theorem [39,40] there exists a procedure with the runtime polynomial in n that reduces IM and TSS under DLTM to SAT. Recall that SAT [38] is a problem in which one has to determine the satisfiability of an arbitrary Boolean formula $ℱ$ , that is, to answer the question whether there exists a set of values of the variables in $ℱ$ whose substitution into $ℱ$ converts this formula to 1 (True). SAT is a classical NP-complete and NP-hard problem [41], therefore, under the assumption that $P \neq N P$ this problem cannot be solved in the general case in polynomial time. However, it is often the case that the SAT instances of large dimension can be solved in reasonable time with the help of modern SAT solvers [38]. This fact has stimulated the application of SAT solvers to an extremely wide class of combinatorial problems, which we will briefly discuss in Sect 5.

In the paper [42] the SAT solvers were applied to the study of the phenomena of conforming behavior in Kauffman networks. In essence, that paper solved TSS for DDS G under DLTM for functions of the kind (1), where the values $B_{(u, v)}$ were chosen from ${0, 1}$ . Unfortunately, this approach is applicable only to networks with at most a few hundred vertices and, therefore, can not be used to analyze real-world networks of practical interest. However, one feature of this approach is that it makes it possible to construct exact solutions of TSS for networks of small size, and therefore gives us an opportunity to understand how far the solutions obtained by metaheuristic algorithms for these networks are from the exact ones. This will give us additional arguments in favor of the accuracy of the metaheuristics we use.

2 Evolutionary optimization

The main novelty of this paper is the use of metaheuristic optimization algorithms, specifically evolutionary algorithms, to solve IM and TSS. We will use such algorithms to optimize pseudo-Boolean functions which are naturally connected with IM and TSS.

Recall that a pseudo-Boolean function [43] is any function of the following form:

F : {0, 1}^{n} \to ℝ

(8)

In many real-world problems, there is no information about the analytic properties of functions of the form (8), and in such cases it is reasonable to apply metaheuristic algorithms to find the optimum of F over the hypercube {0,1}ⁿ. In the role of such algorithms, one can use, for example, various local search strategies [44], but in our paper we employ the algorithms of evolutionary optimization [45] because they often show good results in application to similar problems from related areas.

Let us briefly present the main notions and constructions used below. The simplest evolutionary algorithm is (1 + 1)-EA [46]. It is based on the notion of random mutation: with an arbitrary Boolean vector $α \in {0, 1}^{n}$ : $α = (α_{1}, \dots, α_{n})$ associate a sequence of independent Bernoulli trials [47] of length n: $β_{1}, \dots, β_{n}$ with a fixed probability of success p (called mutation rate) in each trial. For each $i \in {1, \dots, n}$ : if $β_{i} = 0$ , then the $α_{i}$ coordinate of vector α remains unchanged; if $β_{i} = 1$ , then the $α_{i}$ coordinate is flipped, that is, $0 \to 1$ or $1 \to 0$ . This process yields the vector $α^{'}$ ; $α^{'}$ is said to be the result of a random mutation of α with mutation rate p. If α is the current vector and $α^{'}$ is the result of its random mutation such that $F (α^{'}) > F (α)$ (assuming that we consider the maximization problem for (8)) then $α^{'}$ becomes the new current vector; otherwise α remains as the current vector. Random mutations are applied to the current vector until some termination criterion is met (e.g., a limit on the number of mutations is exceeded).

Despite its simple nature, (1 + 1)-EA has many important theoretical properties. For example, it is easy to see that when $p = \frac{1}{n}$ , the expected value of random variable $d_{H} (α, α^{'})$ (Hamming distance between α and $α^{'}$ ) is 1. This property is very important because it means that, on average, (1 + 1)-EA behaves like a simple Hill Climbing (HC) algorithm [48] and thus can adjust to the landscape of function F, efficiently climbing to local extremums. On the other hand, unlike HC, (1 + 1)-EA is able to move with non-zero probability from an arbitrary point α to a point $α^{*}$ where the function (8) has a global extremum over {0,1}ⁿ.

A number of important theoretical properties of (1 + 1)-EA were presented in the seminal paper [49]. In particular, that paper introduced the notion of complexity for this algorithm and showed that when p = 1/n in the general case, the algorithm is extremely inefficient, since its complexity interpreted as the expected value of the number of random mutations required to reach $α^{*}$ for a number of special functions is $n^{ε n}$ for some constant $ε \in [0, 1]$ . Thus, in the sense of this measure, (1 + 1)-EA has worse complexity even compared to the simple random walk in which we randomly choose α w.r.t. uniform distribution on {0,1}ⁿ. However, this result is a typical example of the worst case scenario whereas in application to many combinatorial problems (1 + 1)-EA is surprisingly effective and often performs better than more sophisticated algorithms. One of the reasons for this is the small expected number of flipped bits in one random mutation, which makes (1 + 1)-EA on average similar to HC [50]. Another advantage of (1 + 1)-EA is that it is easy implement and has a high speed of operation.

In several papers there have been proposed alternative variants of (1 + 1)-EA, the goal of which was to reduce the upper bound on its complexity. The (1 + 1)-Fast Evolutionary Algorithm proposed in [51] is among the best approaches of such a kind. It employs a parameter β and is usually denoted as (1 + 1)-FEA_β. The main idea of that algorithm is to adjust the mutation rate during its work. In more detail, before mutating each $α \in {0, 1}^{n}$ one observes the value of a special random variable δ with the spectrum $S_{δ} = {1, 2, \dots, \frac{n}{2}}$ . The distribution of δ (also called “power-law distribution $D_{n / 2}^{β}$ ” [51]) is defined in the following manner:

Pr {δ = k, k \in S_{δ}} = {(C_{n / 2}^{β})}^{- 1} \cdot \frac{1}{k^{β}}

(9)

In the Eq 9 β, $β > 1$ is the algorithm’s parameter and $C_{n / 2}^{β}$ is the so-called normalizing constant used to normalize probabilities: $C_{n / 2}^{β} = \sum_{i = 1}^{n / 2} i^{- β}$ . Then, a single mutation in (1 + 1)-FEA_β is performed as follows:

Generate a value of $δ \in {1, \dots, n / 2}$ according to the distribution $D_{n / 2}^{β}$
Apply the standard mutation operator with $p = δ / n$ to $α \in {0, 1}^{n}$ .

The upper bound on the complexity of (1 + 1)- FEA_β is $O (C_{n / 2}^{β} \cdot n^{β} \cdot 2^{n})$ which is asymptotically significantly better than that for (1 + 1)-EA. Meanwhile, the expected value of flipped bits of (1 + 1)- FEA_β is a constant independent of n only when $β > 2$ . For example in the case $β = 3$ we have $E [d_{H} (α, α^{'})] = 1.3685 \dots$ (see [51]).

Genetic algorithms (GA) [45] are often used to solve pseudo-Boolean optimization problems, and they can also be applied to IM and TSS. In this paper, we employ the following variant of GA.

Suppose that there is a set of Boolean vectors from {0,1}ⁿ, called population. In particular, we will talk about the current population and use the notation $P_{c u r r} = {α^{1}, \dots, α^{s}}$ . The vectors $α^{j}$ , $j \in {1, \dots, s}$ are referred to as individuals. Let us set the probability distribution $D_{c u r r} = {p_{1}, \dots, p_{s}}$ on P_curr in accordance with the following rules (assuming that we aim to maximize (8) over {0,1}ⁿ):

p_{i} = \frac{F (α^{i})}{\sum_{j = 1}^{s} F (α^{j})}, i \in {1, \dots, s}

Next, we select l individuals with the best values of function (8) from P_curr and add these individuals into new population P_new. Then, in accordance with the distribution D_curr, we select g random individuals from P_curr and apply to each of them random mutation with mutation rate p = 1/n or the mutation variant from [51]. The obtained individuals are added to the new population P_new. Finally, we construct h pairs of randomly chosen vectors from P_curr w.r.t. distribution D_curr and apply to each pair some variant of the crossover operator [45] (in the experiments we used standard two-point crossover) and add the resulting vectors to P_new. The values of l,g,h are selected so that the relation $l + g + 2 h = s$ is satisfied. After this the constructed population P_new becomes P_curr.

2.1 A new variant of (1 + 1)-EA

In this section we describe the algorithm that seeks an extremum of an arbitrary function of the kind (8) over ${0, 1}_{k}^{n}$ . We will later apply this algorithm to solving IM and TSS.

Consider arbitrary $k, n \in N$ : 0<k<n and the set ${0, 1}_{k}^{n}$ formed by all vectors from the hypercube {0,1}ⁿ with Hamming weight k. Assume that we have an optimization problem for some pseudo-Boolean function of the kind (8) over ${0, 1}_{k}^{n}$ . Our closest goal is to construct an algorithm for solving this problem which inherits the basic properties of (1 + 1)-EA. The main idea of this algorithm is to apply the standard (1 + 1) mutation to some part of $α \in {0, 1}_{k}^{n}$ , and then use the so-called correcting transformation on the other part. Let us present the details below.

By $| α |$ let us denote the length of vector α. For an arbitrary $α \in {0, 1}_{k}^{n}$ we split it into two parts $γ_{1}, γ_{0} : α = (γ_{1}, γ_{0})$ . Vectors $γ_{1}$ and $γ_{0}$ are formed by the coordinates of α which are equal to 1 and 0, respectively. We assume that $| γ_{1} | = k$ and $| γ_{0} | = n - k$ . Next, we define the transformation of α as the transformation of vectors $γ_{1}, γ_{0}$ .

We will use the following notation: $γ^{'} \in {γ_{1}, γ_{0}} : | γ^{'} | = min {| γ_{1} |, | γ_{0} |}$ and $γ^{″} = {γ_{1}, γ_{0}} ⧵ {γ^{'}}$ . Apply the standard random mutation operator with rate $p = 1 / | γ^{'} |$ to $γ^{'}$ . If no bits of $γ^{'}$ were flipped during the mutation then $α^{'} = α$ . Now, suppose that r bits in $γ^{'}$ were flipped, r>0. Let us fix the coordinates of flipped bits in $γ^{'}$ . We need to apply to $γ^{″}$ such a transformation that the Hamming weight of the result of mutation of α will be equal to that of α. For this purpose, we choose r random coordinates in $γ^{″}$ using the standard sampling without replacement [47], and flip the bits with these coordinates in α. Denote the resulting vector by $\tilde{α}$ . It is clear that $\tilde{α} \in {0, 1}_{k}^{n}$ . We denote the operator that maps ${0, 1}_{k}^{n}$ onto itself as $μ_{k}^{n} : μ_{k}^{n} (α) = \tilde{α}$ . For convenience, we will also denote the images of vectors $γ^{'}$ and $γ^{″}$ in $\tilde{α}$ w.r.t. $μ_{k}^{n}$ as ${\tilde{γ}}^{'}$ and ${\tilde{γ}}^{″}$ . Let us refer to the $μ_{k}^{n}$ operator as k-weight random mutation.

Note that we can employ the operator $μ_{k}^{n}$ to perform transitions between vectors from ${0, 1}_{k}^{n}$ similar to how the standard random mutation operator in (1 + 1)-EA is employed: if α is the current vector from ${0, 1}_{k}^{n}$ , and $α^{'} = μ_{k}^{n} (α)$ , then $α^{'}$ becomes current only if $F (α^{'}) > F (α)$ (assuming that we aim to maximize F). The pseudocode of this algorithm is presented in Algorithm 2. Further we will refer to it as (1 + 1)-WEA (from ‘Weighted Evolutionary Algorithm’). Let us establish the main properties of the algorithm.

Algorithm 2. The pseudocode for (1 + 1)-WEA.

graphic file with name pone.0331109.e512.jpg

Proposition 1. For any two vectors $α, \tilde{α} \in {0, 1}_{k}^{n}$ there exists a transition of the kind $μ_{k}^{n} (α) = \tilde{α}$ .

Proof: Consider two arbitrary vectors $α, \tilde{α} \in {0, 1}_{k}^{n}$ and outline the part $γ^{'}$ in α. Without loss of generality, assume that vector $γ^{'}$ is formed by ones and $| γ^{'} | = k$ . Assume that the coordinates of $γ^{'}$ have the numbers $i_{1}, \dots, i_{k}$ , and the coordinates of $γ^{″}$ have the numbers $j_{1}, \dots, j_{n - k}$ with respect to the order (of coordinates) common for both α and $\tilde{α}$ . Let ${\tilde{λ}}^{'}, {\tilde{λ}}^{″}$ be the vectors formed by the coordinates with numbers $i_{1}, \dots, i_{k}$ in vector $\tilde{α}$ and numbers $j_{1}, \dots, j_{n - k}$ in vector $\tilde{α}$ respectively. Note that if $d_{H} (γ^{'}, {\tilde{λ}}^{'}) = 0$ , then $d_{H} (γ^{″}, {\tilde{λ}}^{″}) = 0$ , thus $α = \tilde{α}$ . Therefore, in this case the transition $α \to \tilde{α}$ is realized by means of the event in which during the (1 + 1)-random mutation of $γ^{'}$ there are no flipped bits. This event has non-zero probability and, therefore, can be realized. Suppose that $d_{H} (γ^{'}, {\tilde{λ}}^{'}) = t$ , $1 \leq t \leq k$ . It means that vector ${\tilde{λ}}^{'}$ is formed by t zeroes and k–t ones. But in this case, the vector ${\tilde{λ}}^{″}$ must contain t ones and, accordingly, n–k–t zeros. Then it is easy to see that the transition $α \to \tilde{α}$ is also possible (and consequently has non-zero probability) thanks to the use of the $μ_{k}^{n}$ operator: ${\tilde{γ}}^{'} = {\tilde{λ}}^{'}$ ; ${\tilde{γ}}^{″} = {\tilde{λ}}^{″}$ . Due to the fact that we can arbitrarily choose vectors $α, \tilde{α}$ from ${0, 1}_{k}^{n}$ the proposition is proved. □

As we already noted above, it is important that the vector which is the random mutation of α is as close to α as possible Hamming distance-wise, since it enables the algorithm to ‘sense’ the landscape of a plot of a considered function. Note that for an arbitrary pair of vectors $α, \tilde{α} \in {0, 1}_{k}^{n}$ : $α \neq \tilde{α}$ we have $d_{H} (α, \tilde{α}) \geq 2$ . Now, let us establish the following property of the described algorithm.

Proposition 2. For $α, \tilde{α} \in {0, 1}_{k}^{n}$ : $\tilde{α} = μ_{k}^{n} (α)$ it holds that $E [d_{H} (α, \tilde{α})] = 2$ .

Proof: Assume that the application of $μ_{k}^{k}$ to a part $γ^{'}$ of vector α resulted in the flipping of r bits. But in this case in $γ^{″}$ exactly r bits will also be flipped (chosen randomly). Then it holds that $E [d_{H} (α, \tilde{α})] = E [2 \cdot d_{H} (γ^{'}, {\tilde{γ}}^{'})] = 2 \cdot E [d_{H} (γ^{'}, {\tilde{γ}}^{'})]$ . Since the expected value of the number of flipped bits in $γ^{'}$ w.r.t. mutation rate $p = 1 / | γ^{'} |$ is 1 (see above), we have that $E [d_{H} (α, \tilde{α})] = 2$ . Thus the proposition is proved. □

Now let us construct an upper bound on the complexity of the described algorithm. We will use the reasoning similar to that from [49]. Remind that in [49] they proposed the idea to estimate the complexity of (1 + 1)-EA via the expected value of a random variable which is the number of random mutations performed by (1 + 1)-EA until it reaches a global extremum of the considered function. We will denote this expectation for specific $α \in {0, 1}^{n}$ as $E [α \to α^{*}]$ . Let P be the probability of success of the event ${α \to α^{*}}$ . Then due to the properties of the geometric distribution [47] we have that the considered expectation is $E [α \to α^{*}] = 1 / P$ .

In [49] an apparatus was presented which makes it possible to construct upper bounds for values of the kind $E [α \to α^{*}]$ keeping in mind an arbitrary vector α. As it was showed in [49] the value P reaches its minimum for α such that $d_{H} (α, α^{*}) = n$ and in this case we have P = pⁿ where p is a mutation rate. If, for example, $p = \frac{1}{n}$ , then we have P = n⁻ⁿ and, consequently, the following upper bound on the complexity of (1 + 1)-EA holds: $E [α \to α^{*}] \leq n^{n}$ . As mentioned above, for some specific functions (see [49]) it was demonstrated that bounds of the kind $n^{ε \cdot n}$ can be achieved for some constant $ε \in [0, 1]$ . However, in our case, we know nothing about the nature of the considered functions and, consequently, we need to work under the assumption that this function is a black box and construct for (1 + 1)-WEA only the upper bounds similar to ones for the classical variant of (1 + 1)-EA. The corresponding results are presented by the following theorem.

Theorem 2. Suppose, that (1 + 1)-WEA is applied to maximization of an arbitrary pseudo-Boolean function over ${0, 1}_{k}^{n}$ , 0<k<n, and let $l = m i n {k, n - k}$ . Then for some $n_{0} \in ℕ$ and for all $n \geq n_{0}$ the following upper bound holds:

E [α \to α^{*}] \leq 2 \cdot {(\frac{l \cdot (n - l)}{e^{1 + \frac{1}{n - l}}})}^{l}

Proof: Fix an order on the set of coordinates of all vectors from {0,1}ⁿ. Consider the operator $μ_{k}^{n} : {0, 1}_{k}^{n} \to {0, 1}_{k}^{n}$ . Let us construct vectors $γ^{'}$ and $γ^{″}$ and assume, that $| γ^{'} | = l$ , $| γ^{″} | = n - l$ . From the definition of $μ_{k}^{n}$ it follows that $l \leq n / 2$ . The vector $γ^{'}$ undergoes the standard (1 + 1)-EA mutation with the rate p = 1/l. Let $α^{*} \in {0, 1}_{k}^{n}$ be the vector on which the considered (arbitrary) fitness function achieves a global extremum in ${0, 1}_{k}^{n}$ . Denote by $γ^{' *}$ the vector formed by the coordinates in $α^{*}$ with the same numbers (w.r.t. the order fixed above) as in $γ^{'}$ . We want to estimate the probability that after the application of $μ_{k}^{n}$ to α the following transition happens: $γ^{'} \to γ^{' *}$ . By using the reasoning similar to that in [49], we have that:

Pr {γ^{'} \to γ^{' *}} = {(\frac{1}{l})}^{d_{H} (γ^{'}, γ^{' *})} \cdot {(1 - \frac{1}{l})}^{l - d_{H} (γ^{'}, γ^{' *})}

(10)

Suppose, that during the transition $μ_{k}^{n} (α) = α^{*}$ there were r ( $r \leq l$ ) mutated bits in $γ^{'}$ , and the result of transition was $γ^{'} \to γ^{' *}$ . Thus, $r = d_{H} (γ^{'}, γ^{' *})$ . Now, consider the corresponding correcting transformation of $γ^{″}$ . Under the assumption that the transition is $γ^{″} \to γ^{*^{″}}$ we can conclude, that $γ^{″}$ and $γ^{*^{″}}$ must differ in r coordinates, and these are the exact coordinates that must be flipped during the application of $μ_{k}^{n}$ to $γ^{″}$ . Thus, we need to correctly choose r coordinates out of n–l, using the sampling without replacement. It is a well-known fact (see e.g. [47]) that if the order of chosen coordinates matters, then the corresponding number of possible alternative choices is expressed by the following falling factorial: $(n - l)_{r} = (n - l) \cdot \dots \cdot (n - l - r + 1)$ . But in our case, the order of chosen coordinates does not matter because n–l considered coordinates are either all zeroes or all ones. Thus, the number of possible alternatives of the choice is the following binomial coefficient: $(\binom{n - l}{r})$ . Then, the probability of the transition $γ^{″} \to γ^{*^{″}}$ looks as follows:

Pr {γ^{″} \to γ^{*^{″}}} = \frac{r!}{(n - l)_{r}}

(11)

Note, that when r = 0 the vector α will not change and the probability of this event is ${(1 - \frac{1}{l})}^{l} \approx e^{- 1}$ . Hereinafter, we assume that $r \geq 1$ . Taking into account (10) and (11), for a fixed $r : 1 \leq r \leq l \leq n / 2$ we have, that:

Pr {α \to α^{*}} = \frac{r!}{l^{r} \cdot (n - l)_{r}} \cdot {(1 - \frac{1}{l})}^{l - r}

Let us make use of the fact, that ${(1 - \frac{1}{l})}^{l - r} = {(1 - \frac{1}{l})}^{l \cdot (1 - \frac{r}{l})} \approx e^{- (1 - \frac{r}{l})}$ . Then, when $1 \leq r \leq l$ we have $e^{- 1} < e^{- (1 - \frac{r}{l})} \leq 1$ . Therefore,

Pr {α \to α^{*}} \geq \frac{r!}{e \cdot l^{r} \cdot (n - l)_{r}}

(12)

From (12), using the reasoning analogous to [49], we have an expression for the expected value of the number of independent mutations that need to happen for the transition $α \to α^{*}$ to occur:

E [α \to α^{*}] \leq \frac{e}{r!} \cdot l^{r} \cdot (n - l)_{r}

(13)

To construct a more explicit estimation for $E [α \to α^{*}]$ let us introduce the notation $Ξ (n, l, r) = \frac{e}{r!} \cdot l^{r} \cdot (n - l)_{r}$ and consider the value: $\frac{1}{{(n - l)}^{r}} Ξ (n, l, r)$ . Then, directly from (13) the following holds:

\frac{1}{(n - l)^{r}} Ξ (n, l, r) = \frac{e}{r!} \cdot l^{r} \cdot 1 \cdot (1 - \frac{1}{n - l}) \cdot \dots \cdot (1 - \frac{r - 1}{n - l})

Taking into account, that $r \leq l \leq n / 2$ , for all $r \geq 2$ we have:

\frac{1}{(n - l)^{r}} Ξ (n, l, r) \leq \frac{e}{r!} \cdot l^{r} \cdot {(1 - \frac{1}{n - l})}^{r - 1} = \frac{e {(1 - \frac{1}{n - l})}^{- 1}}{r!} \cdot l^{r} \cdot {(1 - \frac{1}{n - l})}^{\frac{r (n - l)}{n - l}}

(14)

Note, that ${(1 - \frac{1}{n - l})}^{n - l}$ tends to e⁻¹ with the increase of n. Then, keeping in mind that for all r > 6 it holds that $r! > e^{r + 1}$ we can rewrite (14) as follows:

\begin{matrix} \frac{1}{(n - l)^{r}} Ξ (n, l, r) & \leq {(1 - \frac{1}{n - l})}^{- 1} \cdot \frac{1}{e^{r}} \cdot l^{r} \cdot {(\frac{1}{e^{\frac{1}{n - l}}})}^{r} = \\ = {(1 - \frac{1}{n - l})}^{- 1} \cdot {(\frac{l}{e^{1 + \frac{1}{n - l}}})}^{r} \end{matrix}

(15)

Since $l \leq \frac{n}{2}$ , then $1 < {(1 - \frac{1}{n - l})}^{- 1} \leq 2$ for all $n : n \geq 4$ . Also, taking into account (13) we can derive from (15) that:

E [α \to α^{*}] \leq 2 \cdot {(\frac{l \cdot (n - l)}{e^{1 + \frac{1}{n - l}}})}^{r}

(16)

Let us note, that the value in the right part of (16) is increasing with the increase of r. However, since $r \leq l$ it is clear that the validity of the theorem follows from (16). □

3 IM and TSS as pseudo-Boolean optimization problems

In this section we will give formal definitions of IM and TSS in form of maximization problems for pseudo-Boolean functions of the kind (8). To solve them we will employ the concept in which the solving itself is split in two stages:

The stage at which we construct an initial solution using a Greedy Algorithm (GrA);
The stage at which we improve the solution found at stage 1 using evolutionary optimization.

In the context of IM we consider DDS G, $| V | = n$ with weight functions of the kind (1) and fixed thresholds (DLTM). For each $k \in {1, \dots, n}$ let us focus on the following function:

F_{k} : {0, 1}_{k}^{n} \to {1, \dots, n}

(17)

For an arbitrary $α \in {0, 1}_{k}^{n}$ we will suppose that $F_{k} (α) = σ (α)$ , where $σ (\cdot)$ is the influence computed w.r.t. Definition 2.

Using GrA let us find some $α^{0} \in {0, 1}_{k}^{n}$ which will serve as an initial approximation of $α^{*}$ , where $α^{*}$ is defined by (2). Next, we use $α^{0}$ as a starting current point and apply k-weight mutation operator $μ_{k}^{n}$ to $α^{0}$ and the following current vectors, aiming to maximize (17) until some termination criterion is met. The resulting vector ${\tilde{α}}^{*}$ is the approximate solution of IM for network G over ${0, 1}_{k}^{n}$ . Let us additionally note, that to find $α^{0} \in {0, 1}_{k}^{n}$ , $k \geq 1$ the greedy algorithm uses the approximate solutions of IM over the sets of the kind ${0, 1}_{i}^{n}$ , $i \in {0, \dots, k - 1}$ .

Next, consider the TSS problem. The general scheme for solving TSS employed further is as follows. On the first stage, similar to IM we construct an approximate solution $α^{0}$ using GrA. Thus, the vector $α^{0}$ is an approximate solution for IM for the smallest k such that the number of active vertices in the state $\overset{―}{α}$ reachable from $α^{0}$ is at least R.

Once we found $α^{0}$ in ${0, 1}_{k}^{n}$ we make use of the two following strategies for solving TSS. The first strategy implies launching (1 + 1)-EA, (1 + 1)-FEA [51] or GA. Each algorithm is tasked with finding a vector of minimal Hamming weight such that starting from this vector, the DDS G transitions to a fixed point $\overset{―}{α}$ in which the number of active vertices is at least R.

The second strategy consists in the following steps:

Start from the current dimension k such that $α^{0} \in {0, 1}_{k}^{n}$ and use only operator $μ_{k}^{n}$ on ${0, 1}_{k}^{n}$ to find some approximate solution of IM denoted by ${\tilde{α}}_{k}^{*} \in {0, 1}_{k}^{n}$ ;
Transition from ${\tilde{α}}_{k}^{*}$ to some vector $α_{k - 1}^{0} \in {0, 1}_{k - 1}^{n}$ using the heuristics described below;
Starting from $α_{k - 1}^{0}$ apply operator $μ_{k - 1}^{n}$ and try to find such ${\tilde{α}}_{k - 1}^{*} \in {0, 1}_{k - 1}^{n}$ that the number of active vertices in fixed point $\overset{―}{α}$ reached from ${\tilde{α}}_{k - 1}^{*}$ is at least R. In the case of success, decrease the current dimension by 1 and go to step 2. Otherwise, output the current best known vector ${\tilde{α}}^{*}$ as an approximate solution of TSS.

To transit from ${\tilde{α}}_{k}^{*}$ to $α_{k - 1}^{0}$ on step 2 we use several heuristics. Since for each value of k we will perform the same actions, let us without the loss of generality use the notation ${\tilde{α}}^{*}$ instead of ${\tilde{α}}_{k}^{*}$ . An arbitrary TS defined by vector $α \in {0, 1}^{n}$ we will denote as $T (α)$ .

The first and the simplest heuristic, to which we further refer as to (1 + 1)-WEA_v1 consists in removing from the set $T ({\tilde{α}}^{*})$ the vertex with the smallest number of outgoing arcs.

In the second heuristic denoted as (1 + 1)-WEA_v2 for each vertex $v \in T ({\tilde{α}}^{*})$ we compute the so-called activation potential:

π (v) = \sum_{u : (v, u) \in A} \frac{b_{(v, u)}}{θ_{u}}

(18)

Then we remove from $T ({\tilde{α}}^{*})$ the vertex with minimal value of $π (v)$ .

We also employed a heuristic which is close in spirit to the greedy algorithm described in Sect 1.3. Consider ${\tilde{α}}^{*}$ and let $\overset{―}{α}$ be the fixed point reached by DDS G when it starts from the initial state ${\tilde{α}}^{*}$ . For each vertex $v \in T ({\tilde{α}}^{*})$ denote by ${\tilde{α}}_{v}^{*}$ the vector which specifies the target set $T ({\tilde{α}}^{*}) \ {v}$ and let ${\overset{―}{α}}_{v}$ be a fixed point reached by DDS G from state ${\tilde{α}}_{v}^{*}$ . Next we find the following vertex

v^{'} = a r g m i n_{v \in T ({\tilde{α}}^{*})} ({w t}_{H} (\overset{―}{α}) - w t_{H} ({\overset{―}{α}}_{v}))

(19)

In the role of $α_{k - 1}^{0}$ we pick the vector that specifies target set $T ({\tilde{α}}^{*}) \ {v^{'}}$ . However in practice this step is quite computationally intensive due to the necessity to check all vertices in $T ({\tilde{α}}^{*})$ . That is why we sort all vertices from $T ({\tilde{α}}^{*})$ in an ascending order of value (18), calculate the difference $w t_{H} (\overset{―}{α}) - w t_{H} ({\overset{―}{α}}_{v})$ only for the first Q vertices from this ordered set, and choose a vertex with minimal value of this difference. We denote the described heuristic as (1 + 1)-WEA_v3. In the next section, we present the results that demonstrate the behavior of the described heuristic for different values of parameter Q. The pseudocode of the proposed strategy for solving TSS by iteratively decreasing the value of k using the heuristic variant v3 is presented at Algorithm 3.

Algorithm 3. The hybrid algorithm with v3 heuristic.

graphic file with name pone.0331109.e513.jpg

4 Computational experiments

We performed several series of experiments to assess the algorithms proposed in the previous sections and evaluate their performance compared to competition. All data related to experiments is available online at https://github.com/Alex-Andrv/evo_network_public.

4.1 Benchmarks

In the experiments we considered two classes of networks. The first one is formed by random networks of small size generated using the well-known models: the Watts-Strogatz model [52] (WS-networks) and Barabasi-Albert model [53] (BA-networks). We used these networks to construct exact solutions for TSS and evaluate the quality of solutions found by metaheuristic algorithms by comparing them against the exact one. The second class of benchmarks was formed by the fragments of real-world networks taken from the SNAP [32] repository. We used them to compare the considered evolutionary and hybrid algorithms against each other when solving IM and TSS.

4.1.1 Network graphs.

In more detail, for the experiment with small random networks to generate the latter we employed the well-known NetworkX package [54]. For the Watts-Strogatz model we generated networks with 40 vertices, where each node is joined with k = 8 neighbors in a ring and each edge has the probability of $β = 0.5$ to be rewired. In the case of Barabasi-Albert model we generated graphs with 50 vertices, and used the value of parameter m (the number of edges to attach from a new node) equal to 4, with no initial graph specified (in which case NetworkX starts from a star graph on $m + 1$ nodes).

As for the benchmarks from the SNAP repository, we used a number of networks with several thousand vertices on average. Specifically, we picked facebook_combined (which is a whole network from the ego-Facebook SNAP entry), p2p-Gnutella06_combined, p2p-Gnutella08_combined, wiki-vote and ca-GrQc networks.

The majority of network graphs from SNAP as well as the generated small graphs are undirected, thus we replaced each edge by a pair of arks going in opposite directions.

4.1.2 Arc weights and thresholds.

To generate arc weights $b_{(u, v)}$ we chose the numbers $B_{(u, v)}$ (see Sect 1.1) uniformly from a set ${1, \dots, K}$ , where K was chosen depending on the network size. For the small networks the values of $B_{(u, v)}$ were picked from the range of ${1, 2}$ for the Watts-Strogatz networks and from ${1, \dots, 5}$ for the Barabasi-Albert networks. In the case of SNAP networks to reflect the possibility that the influence of some agents may be significantly larger than that of other we picked $B_{(u, v)}$ from ${1, \dots, 1000}$ .

The vertex thresholds were generated with respect to one of the two different thresholds distributions. In the first case we used equal thresholds for all vertices $θ = 0.8$ , while in the second case, the value of θ for each vertex was chosen randomly from $[\frac{3}{4}, 1]$ .

Typically, we encapsulate parameter values in a benchmark name. For example, in benchmark BA_50_4_uni_1-5_const_0.8, BA refers to the Barabasi-Albert model, 50_4 to the network size and the parameter for the graph generator (in this case, the number of edges to attach from a new node to existing nodes). The following block refers to the way the arc weights are chosen: uni_1-5 means that the arc weights were picked in accordance with the uniform distribution from the range ${1, \dots, 5}$ The final block const_0.8 refers to the scheme used to choose thresholds and in the example it employs constant thresholds equal to 0.8 for each vertex.

4.2 Comparing to exact solution

The objective of the first experiment was to demonstrate that evolutionary algorithms and the combination of the greedy and evolutionary algorithms given enough resources can produce solutions that are close in size to the exact solution. To show it we used small random networks described in Sect 4.1, so that we were able to find the exact solutions using combinatorial algorithms, in the role of which we employed modern SAT solvers.

Note that in this context it is natural to consider TSS in the following formulation: to find the TS of smallest size that activates at least $\geq R$ network vertices in M time steps. To reduce this problem to SAT we used the methods similar to those described in [42] and the PySAT toolkit [55]. As the value of M we used the number of vertices in networks, thus in this part of the experiments M = n. If in the process of solving the algorithm finds a set of size $k^{'}$ then the formula is constructed that encodes the existence of a TS of size $k^{'} - 1$ which activates at least R vertices in M time steps. If this formula is unsatisfiable for TS of size $k^{'} - 1$ then the TS of size $k^{'}$ is considered to be the exact solution.

We applied to the same networks the combination of the greedy and the evolutionary algorithm, in particular, (1 + 1)-EA. The evolutionary algorithm started either from a set corresponding to all network vertices or from an initial point corresponding to the solution found by the greedy algorithm.

The results of the experiment are presented in Table 1. It is clear from the table that in most cases the hybrid strategy manages to find the solution which is close to optimal, and often does find the optimal solution.

Table 1. Results on the comparison of evolutionary and greedy algorithms with exact solution.

Benchmark	SAT (exact)	GrA	(1 + 1)-EA	GrA&(1 + 1)-EA
BA_50_4_uni_1-5_uni_0.75-1	15	18	16.35	15.6
BA_50_4_uni_1-5_const_0.8	16	19	17.5	17.1
WS_40_8_0.5_uni_1-2_uni_0.75-1	15	20	17.65	17.1
WS_40_8_0.5_uni_1-2_const_0.8	16	21	16.9	16.25

Open in a new tab

In all cases the value represents the TS size. Evolutionary algorithms were allowed to perform up to 10000 iterations. The results of nondeterministic algorithms are averaged over 20 launches and are presented in cursive.

4.3 Solving IM with (1 + 1)-WEA

The goal of the next series of experiments was to see whether the proposed mutation operator in (1 + 1)-WEA really manages to improve the quality of solution in the context of IM, e.g. when we fix the size of the target set and aim to maximize the number of activated networks.

For this purpose, we used the GrA algorithm to find TSS solutions for $M = h \times n$ , where $h \in {0.5, 0.6, 0.7, 0.8}$ and n is the number of vertices in a network. Recall that similar to above we denote by k the size of the constructed TS. Next, we applied (1 + 1)-WEA (with the heuristics to decrease TS size disabled) to see how it performs in solving IM for specific values of k. For this purpose, we used the benchmarks based on the SNAP networks described in Sect 4.1.

In the experiments, we observed that in all cases (1 + 1)-WEA was able to improve the quality of the solution found by the greedy algorithm. The Fig 2 presents the convergence plot for one of the considered benchmarks in which the improvements were better than average. It is clear from the figure that (1 + 1)-WEA is sometimes capable of explosive improvements (see orange line on the plot) that result in significant increases in the quality of the found TS. In many observed cases, however, the behavior is more akin to that presented on red and blue lines: clear improvements, but without sudden large increases in influence.

To illustrate the improvements that can be obtained using (1 + 1)-WEA, we generated a small network with 50 vertices, and used greedy algorithm to find a target set of size k = 19. The set found by the greedy algorithm is presented in the left part of Fig 3, while that found by the hybrid approach – in the right part of the figure. It is clear that the quality of the set constructed by the proposed approach is significantly higher.

Fig 3 — Vertices from the target set are marked red, active vertices—by blue and inactive vertices are gray. It is clear that the set found using (1 + 1)-WEA manages to activate 11 more vertices out of 50 compared to the set found by the greedy algorithm.

4.4 Evaluation of (1 + 1)-WEA for solving TSS

In the final series of experiments we considered SNAP networks and solved TSS using the following algorithms to evaluate their performance: Greedy Algorithm (GrA), (1 + 1)-EA, (1 + 1)-FEA with $β = 3$ , Genetic algorithm (GA), (1 + 1)-WEA augmented with heuristics for moving from ${0, 1}_{k}^{n}$ to ${0, 1}_{k - 1}^{n}$ , which were described in Sect 3: (1 + 1)-WEA_v1, (1 + 1)-WEA_v2, (1 + 1)-WEA_v3. All evolutionary algorithms started from the solution found by the greedy algorithm, which stopped once it found a TS that activates at least $R \geq 0.75 \times n$ vertices. Evolutionary algorithms were allowed to perform 10000 iterations: in the case of evolutionary algorithms it was 10000 mutations and in the case of Genetic Algorithm it was 10000 populations of size 10.

From Table 2 one can see that the Genetic Algorithm while having the most resources yields the worst results. We believe that the reason for this behavior is that the crossover and mutation operators in GA need to be specifically tuned to work well with IM/TSS in mind. The analysis shows that the standard crossover operator often produces TSs the Hamming weight of which is too different from that of current best known TS. Therefore, such individuals are typically useless, but they nevertheless consume a lot of allocated resources. From our point of view, adapting crossover for IM and TSS represents a promising direction of future research.

Table 2. Results on solving TSS using different evolutionary and greedy algorithms.

	GrA	EA	FEA	GA	WEA_v1	WEA_v2	WEA_v3
Wiki-Vote
uni_1-1000_const_0.8	3450	3224	3227	3381	3193	3180	3180
uni_1-1000_uni_0.75-1	3380	3170	3174	3305	3142	3136	3135
p2p-Gnutella06
uni_1-1000_const_0.8	582	490	489	536	400	393	397
uni_1-1000_uni_0.75-1	437	376	376	406	319	309	300
p2p-Gnutella08
uni_1-1000_const_0.8	332	268	271	301	233	216	219
uni_1-1000_uni_0.75-1	281	227	228	252	185	176	176
facebook_combined
uni_1-1000_const_0.8	1526	1428	1426	1486	1371	1411	1411
uni_1-1000_uni_0.75-1	1371	1254	1255	1318	1211	1243	1243
ca-GrQc
uni_1-1000_const_0.8	1592	1536	1535	1574	1496	1500	1499
uni_1-1000_uni_0.75-1	1475	1413	1412	1451	1366	1376	1374

Open in a new tab

In all cases the value represents the TS size. Evolutionary algorithms were allowed to perform up to 10000 iterations and started from the solution found by the greedy algorithm in all cases, thus we omit this information in column names. We also omit the (1 + 1) part for brevity. The results of nondeterministic algorithms are averaged over 20 launches and rounded. The best result for each benchmark is outlined in bold. The values are rounded to the closest integer.

We computed a Wilcoxon Signed Rank Test to study the (1 + 1)-WEA performance improvement over other methods. It revealed a statistically significant difference in TS size found by different methods, in particular, the results of the proposed WEA algorithm (in any of three variations) significantly differ from others (p<0.05), with a large effect size (r = 0.886) [56].

Fig 4 presents convergence plots for a single run of the algorithms for four out of five networks. From both Fig 4 and Table 2 it is clear, that (1 + 1)-EA and (1 + 1)-FEA show comparable results, which are substantially better compared to that of GA. In almost all cases they make it possible to improve the solution found by the Greedy algorithm.

Finally, three versions of (1 + 1)-WEA show the best results overall. An interesting fact is that in the considered test series typically different versions of (1 + 1)-WEA perform better than the others, and this behavior perseveres for different arcs’ weights and vertices’ thresholds. It is natural to assume that they tend to align well with graph structure. Thus, another interesting venue for future research consists in learning to pick the best heuristic based on the effectively computable network properties, such as centrality, etc.

4.5 Sensitivity analysis

In this subsection we present the results on the sensitivity analysis of the behavior of the (1 + 1)-WEA_v3 algorithm described in Sect 3 under different values of parameter Q. In this series of experiments we used the same networks as in the previous subsection.

Remind, that the v3 heuristic described in Sect 3 attempts to remove from the current target set of size k a single vertex, and then launches (1 + 1)-WEA over {0,1}^k−1 to maximize the influence. If it manages to find ${\tilde{α}}^{*} \in {0, 1}^{k - 1} : σ ({\tilde{α}}^{*}) \geq R$ then this ${\tilde{α}}^{*}$ becomes the current best known target set.

When attempting to transition from TS of size k to TS of size k–1 using (19), we want to compute the value $w t_{H} (\overset{―}{α}) - w t_{H} (\overset{―}{α_{v}})$ for as few TS candidates as possible, thus we pic the first Q of them when sorted by the value of (18).

We tested the values $Q \in {25, 50, 75, 100, 125}$ , the corresponding results are presented in Table 3. It is clear from them that the value of Q has quite little impact on the quality of the obtained solutions in most cases.

Table 3. Results on solving TSS using (1 + 1)-WEA_v3 with different values of Q.

	$Q = 25$	$Q = 50$	$Q = 75$	$Q = 100$	$Q = 125$
Wiki-Vote
uni_1-1000_const_0.8	3175	3180	3181	3181	3179
uni_1-1000_uni_0.75-1	3138	3135	3131	3138	3139
p2p-Gnutella06
uni_1-1000_const_0.8	399	397	396	396	400
uni_1-1000_uni_0.75-1	309	300	315	303	310
p2p-Gnutella08
uni_1-1000_const_0.8	223	219	218	217	221
uni_1-1000_uni_0.75-1	179	176	175	177	178
facebook_combined
uni_1-1000_const_0.8	1407	1411	1411	1407	1415
uni_1-1000_uni_0.75-1	1235	1243	1235	1246	1242
ca-GrQc
uni_1-1000_const_0.8	1497	1499	1500	1498	1497
uni_1-1000_uni_0.75-1	1375	1374	1376	1377	1375

Open in a new tab

In all cases the value represents the TS size. The results are averaged over 20 launches and rounded. The best result for each benchmark and value of Q is outlined in bold.

We used Wilcoxon Signed Rank test to see whether any of the values of Q shows statistically significant performance improvement over the others, but on the considered benchmarks the amount of data was not sufficient enough to reject a null hypothesis about the equality of medians between every pair of values.

On Fig 5 we show how the (1 + 1)-WEA_v3 algorithm improves the target set during several iterations. Here, we used the same network as in Fig 3.

4.6 Analysis of practical efficiency

Let us say a few words about the practical complexity of the employed algorithms. It is clear that a single mutation is performed by (1 + 1)-EA and (1 + 1)-WEA in time O(n), where n is the size of the mutated vector. Theoretical upper bounds on the number of mutations until the considered algorithms reach a global extrema can be found in Sect 2. In practice, the typical approach is to limit the number of mutations by some constant. In our experiments, we used the limits from 1000 to 10000 mutations. Usually, within the allocated mutation budget, the algorithms manage to substantially improve the initial value of the target function found by the greedy algorithm.

As for the memory, the described algorithms do not have any specific requirements: it is clear that the total amount of memory used by the (1 + 1)-EA and (1 + 1)-WEA algorithms from start to the moment they find a global extrema does not exceed O(n). Indeed, they basically store in the memory the best point, corresponding to the Best Known Value of a considered function in that point, and the current point. Also note, that the functions we optimized in the present paper are relatively cheap, compared to e.g. functions from [57,58]. Therefore, for these functions we can see that the (1 + 1)-evolutionary strategy works significantly better compared to more complex genetic algorithms operating with populations of size greater than one. Thus, for the considered functions the more effective strategy is the one that processes more points per the unit of time.

5 Related work

As it was mentioned above, to the best of our knowledge, the paper [33] by Stewart Kauffman was the first work in which networks were used to define dynamical systems with discrete time and finite number of different states. Also, in that work it was proposed to associate Boolean functions with network vertices, defining functions via the truth tables. The information diffusion model considered in [33] is exactly the same as the DLTM in the variant described in our paper. It was used there to describe possible interactions in biological systems known as Gene Regulatory Networks.

Later, the term “Kauffman Networks” was used in a number of works, e.g. [59,60], that studied different critical phenomena in these networks, such as distributions of the length of cycles produced by these networks, statistical characteristics for some kinds of clusters (called modules in [60]), as well as the analysis of similarities with the percolation theory (in this point the authors of [59,60] refer to [61]). Papers [62,63] studied the cyclic structure of state transition graphs for DDS induced by Kauffman Networks. In particular, in [62] to find cycles of small length they use Binary Decision Diagramns (BDD) – the well-known structute actively used in symbolic verification [64].

The idea to define the weight function of network vertices using thresholds was apparently first proposed by Mark Granovetter in [26]. In that paper, a number of examples that demonstrate the correspondence between the threshold model of activation dynamics and the so-called conforming behavior which is often observed in real life collectives, was provided. We would like to mention several further works which developed the ideas from [26], in particular [65–67], etc.

The paper [4] for more than 20 years remains the central work in the context of maximizing influence in networks and continues to stimulate novel results in this area. It introduced all the main notions related to influence maximization. At the same time, the authors of [4] note that they were largely inspired by the problems considered in [68]. The main models under which IM is usually considered, in particular, Linear Threshold Model and Independent Cascade Model, were also proposed in [4]. The paper also showed that for the nondeterministic variants of these models there exists a polynomial algorithm that approximates the solution of IM with approximation factor 1–e⁻¹. However, for the deterministic variants of both models it gave strong arguments on their inapproximability.

IM is one of the most actively researched problems in network science in the last 20 years: see [4–21]-[25,68],etc. Let us briefly comment only on the results of the most cited of the mentioned works. Thus, [6] is a direct development of [4]: in particular, they studied in detail the so-called decreasing cascade model, the basic properties of which were formulated in [4] in form of hypotheses. The authors of the paper [69] state that the approach they proposed based on the meticulous analysis of the submodularity property outperforms the basic greedy algorithm from [4] in hundreds of times on some networks. In [7] there have been studied the questions related to influence diffusion in social networks by aggregating the groups of users by the topics of their interests, the corresponding approach is called Topical Affinity Propagation. In [8] they proposed the strategy of Two-phase Influence Maximization which makes it possible for the non-deterministic Independent Cascade model to significantly speed up the standard greedy algorithm without falsifying the approximability guarantees of the obtained solutions. The paper [9] studies the problem of activating the network by means of top-K influential vertices. As the authors mention in text, their algorithm is aimed at solving the problem of activating mobile social networks with a large number of participants. Several works, such as [11,12], proposed improvements to the greedy algorithm and some additional centrality-based heuristics, which make it possible to increase the dimension of effectively activated networks to millions of vertices. In [15] the authors studied nontrivial questions related to the structural complexity of IM, in particular, they showed the $# P$ -hardness of exact influence computation in the context of the Linear Threshold model. Some peculiarities of the structure of real-world information networks that pass through themselves colossal volumes of information (millions of blogs and news articles) were studied in [17]. A time-constrained variant of IM was apparently first considered in [19]. The Influence Maximization for a network which changes with time was analyzed in [20] In [21] it was showed that taking into account the information about location may play an important role when maximizing the influence in real-world marketing strategies. The robustness of IM solutions under uncertainty of some network parameters (specifically, probabilities assigned to edges) was studied in [24].

Surprisingly, the papers dealing with IM under DLTM are significantly fewer in number compared to that for LTM, despite the fact that the ideas how one can solve IM under DLTM using heuristic and metaheuristic algorithms look natural. The greedy algorithm for DLTM which we use to construct an initial solution of IM and TSS was described in [37]. Several subsequent works used some generalizations of this algorithm: [70,71], etc. The paper [70] considered the targeted and budgeted variant of IM, in which the profits and costs of activating vertices were added, and proposed the modification of the greedy algorithm. In [71] there was considered a time-constrained variant of IM under DLTM. The intention to demonstrate that it is possible to solve IM and TSS under DLTM by combining greedy heuristics and evolutionary optimization became our main motivation for writing the present paper.

The TSS problem was first implicitly formulated in [4]. The formulation that we use in our paper was employed in [72] and [35]. The structural complexity of IM and TSS was first evaluated in [4]. As we already mentioned, there are approximation algorithms with constant factor for IM under nondeterministic LTM (when influence is defined as an expectation (3)), meanwhile, under DLTM the solution of IM cannot be efficiently approximated if $P \neq N P$ . It was noted in [35], that the intractability of approximation of TSS under DLTM follows from this fact. A number of further results related to the complexity of TSS was also presented in [35]. In particular, it was shown that some non-monotonic variants of TSS are $# P$ -hard.

As we mentioned above, both IM and TSS under DLTM can be solved exactly using modern combinatorial algorithms. However, the dimension of the problems with which such algorithms are able to successfully deal with is quite small: typically in the dozens or at most hundreds of vertices. In [42,73] the algorithms for solving the Boolean satisfiability problem (SAT) were used to solve TSS. Note, that in [42,74–76] the algorithms based on SAT or on Binary Decision Diagrams (BDD) were also used to find attractors (cycles) of small length in State Transition Graphs of Kauffman networks and networks with more elaborate weight functions modeling dynamic processes in gene networks. It is also worth mentioning, that the standard network activation model considered in the majority of related works (including the present paper) implies the monotonic scenario, in which DDS has only attractors of length 1 (i.e. fixed points). This scenario corresponds to the so-called conforming behavior in terms of [26]. In paper [42] it was showed that replacing functions $f_{v} (t + 1)$ of the kind (1) by functions of the kind ${\tilde{f}}_{v} (t + 1) = 1 - f_{v} (t + 1)$ , corresponding to anti-conforming behavior, results in DDS with STG the structure of which is significantly different from the structure of STG for the conforming scenario.

We would like to specifically note the series of works in which the Discrete Dynamic Systems of the same kind as the ones considered in our paper were studied from the point of view of cellular automata. In particular, in [77] they showed that the general reachability problem for models close to DLTM is PSPACE complete. The works [78,79] studied a number of problems related to reachability and specifically showed the NP-completeness of the general Predecessor Existence problem in the context of DDS similar to Synchronous Boolean Networks.

The networks we use in the paper can be broadly divided into two main classes. First, these are network constructed using the random graph models, such as the Watts-Strogatz model [52] and Barabasi-Albert model [53]. Second, there are fragments of real-world networks available in the SNAP repository [32].

The evolutionary and genetic algorithms are among the most popular tools for metaheuristic optimization [45]. The (1 + 1)-EA algorithm can be viewed as the simplest implementation of the idea of evolutionary computations. Nevertheless, this algorithm has a lot of nontrivial mathematical properties studied in e.g. [49,51,80–83], etc. In particular, as we mentioned above, in [49] there was proposed a natural approach to the study of the behavior of (1 + 1)-EA on a wide class of pseudo-Boolean functions. For a number of artificially constructed functions in [49] it was shown that (1 + 1)-EA can be both either very effective (e.g. for ONEMAX function) or vice versa. For an arbitrary pseudo-Boolean function without any analytical specification (1 + 1)-EA is highly ineffective in the context of the bounds from [49]. Note, that similar results were presented in paper [84]. In [51] an approach was proposed in the context of which (1 + 1)-EA changes its mutation rate according to some probabilistic distribution. A similar idea was used earlier in [80]. The (1 + 1)-EA variant from [51] has a significantly better upper bound on its complexity compared to the original (1 + 1)-EA. In papers [83] and [85] variants of (1 + 1)-EA were described with the complexity upper bound significantly smaller than that of the classic algorithm.

The variant of (1 + 1)-EA which is based on the $μ_{k}^{n}$ operator was first described in the conference paper [86]. In that paper this algorithm was used to find the so-called ρ-backdoors described in [87] which in essence represent a probabilistic generalization of backdoors for SAT introduced in [88]. In the present paper we performed the extended analysis of the properties of the algorithm from [86] and significantly improved the upper bound of its complexity for pseudo-Boolean functions without any analytical specification. One of the contributions of the present paper lies in adapting (1 + 1)-WEA for solving IM and TSS.

To the best of our knowledge, the conference paper [89] has the first examples of the application of evolutionary algorithms to solving IM under DLTM. Our two conference papers [90,91] present the preliminary results on solving TSS using different evolutionary algorithms, including that from [51,86] and [85].

6 Discussion, conclusion and future work

In the present paper we considered two well-known combinatorial problems for Boolean networks: Influence Maximization and Target Set selection. We studied these problems in the context of the Deterministic Linear Threshold Model, while the majority of papers on the topic consider nondeterministic models, first formally presented in [4]. This choice is motivated by a number of factors. First, DLTM agrees well with many practical examples, the nature of which was described by Mark Granovetter in [26]. Second, our goal was to employ for solving IM and TSS the metaheuristic algorithms used in pseudo-Boolean optimization and specifically evolutionary algorithms. It should be noted that the described computational schemes may be applied to non-deteministic LTM from [4], but in this case it makes sense to implement them in parallel environments, since computing the fitness function under LTM is much more expensive compared to DLTM. Indeed, for large networks the value (3) can only be estimated using the Monte Carlo method [92], and the accuracy of this estimation improves with the increase of the size of the random sample. From our point of view, the study of how the network size influences the accuracy of computing influence in the sense of (3) is an important problem which we plan to consider in the nearest future. In particular, we believe that the results presented in this paper look promising in that the corresponding algorithms for solving IM and TSS under LTM can be built upon the parallel versions of the algorithms proposed in this paper.

The use of neural networks for solving IM and TSS looks promising. This direction fits into a general area where neural networks are used to tackle NP-hard problems. It grew in popularity in recent years thanks to the ability of neural networks to perform the role of universal approximators. The examples of such an approach can be found in papers [93–97], etc. Some of the works, in particular [98–100], and some others, show the possible applications of neural networks for solving IM and TSS. However, the cited papers typically either consider an Independent Cascade Model or the Linear Threshold Model in their standard non-deterministic variants from [4]. Our attempts to apply the tools presented in these articles to DLTM did not yield good results, since we observed the effects mentioned in [94], e.g., various “bottlenecks”, in particular, the lack of robustness of the neural network models when fed with data only implicitly related to the problems they were designed to solve. As a result, to construct the target sets similar in their effectiveness to that found by our hybrid approach, one would need to employ a very resource-intensive training stage. Nevertheless, we believe that an approach to solving IM via Neural Networks is very promising and we plan to address it in our future works. In particular, we believe that Binarized Neural Networks [101] suit well to solving IM and TSS in the context of DLTM and plan to conduct corresponding research in the near future. In particular, we plan to study the possibilities in combining the proposed evolutionary algorithms with binarized neural networks, specifically to use the latter to improve the initial solutions to IM and TSS found by greedy heuristics.

Acknowledgments

We are grateful to anonymous reviewers for their constructive feedback that made it possible to significantly improve the presentation of our results. We thank Konstantin Chukharev and Mikhail Smirnov for fruitful discussions that helped a lot at the earlier stages of the paper preparation.

Data Availability

The data related to experiments is available in a public repository https://github.com/Alex-Andrv/evo_network_public.

Funding Statement

This research is financially supported by the Ministry of Economic Development of the Russian Federation (IGK 000000C313925P4C0002), agreement No 139-15-2025-010. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1.Newman MEJ. The Structure and Function of Complex Networks. SIAM Review. 2003;45(2):167–256. [Google Scholar]
2.Dorogovtsev SN, Goltsev AV, Mendes JFF. Critical phenomena in complex networks. Rev Mod Phys. 2008;80:1275–335. [Google Scholar]
3.Newman MEJ. Networks: An Introduction. Oxford University Press; 2010.
4.Kempe D, Kleinberg JM, Tardos É. Maximizing the spread of influence through a social network. In: KDD. ACM; 2003. p. 137–46.
5.Kempe D, Kleinberg JM, Tardos É. Influential Nodes in a Diffusion Model for Social Networks. In: ICALP. vol. 3580 of Lecture Notes in Computer Science. Springer; 2005. p. 1127–38.
6.Bharathi S, Kempe D, Salek M. Competitive Influence Maximization in Social Networks. In: WINE. vol. 4858 of Lecture Notes in Computer Science. Springer; 2007. p. 306–11.
7.Tang J, Sun J, Wang C, Yang Z. Social influence analysis in large-scale networks. In: KDD. ACM; 2009. p. 807–16.
8.Tang Y, Xiao X, Shi Y. Influence maximization: near-optimal time complexity meets practical efficiency. In: SIGMOD Conference. ACM; 2014. p. 75–86.
9.Wang Y, Cong G, Song G, Xie K. Community-based greedy algorithm for mining top-K influential nodes in mobile social networks. In: KDD. ACM; 2010. p. 1039–48.
10.Borodin A, Filmus Y, Oren J. Threshold Models for Competitive Influence in Social Networks. In: WINE. vol. 6484 of Lecture Notes in Computer Science. Springer; 2010. p. 539–50.
11.Chen W, Wang Y, Yang S. Efficient influence maximization in social networks. In: KDD. ACM; 2009. p. 199–208.
12.Chen W, Wang C, Wang Y. Scalable influence maximization for prevalent viral marketing in large-scale social networks. In: KDD. ACM; 2010. p. 1029–38.
13.Das A, Kempe D. Submodular meets Spectral: Greedy Algorithms for Subset Selection, Sparse Approximation and Dictionary Selection. In: ICML. Omnipress; 2011. p. 1057–64.
14.Jiang Q, Song G, Cong G, Wang Y, Si W, Xie K. Simulated Annealing Based Influence Maximization in Social Networks. In: AAAI. AAAI Press; 2011, p. 127–32.
15.Chen W, Yuan Y, Zhang L. Scalable Influence Maximization in Social Networks under the Linear Threshold Model. In: ICDM. IEEE Computer Society; 2010. p. 88–97.
16.Sun J, Tang J. A Survey of Models and Algorithms for Social Influence Analysis. In: Social Network Data Analytics. Springer; 2011. p. 177–214.
17.Gomez-Rodriguez M, Leskovec J, Krause A. Inferring Networks of Diffusion and Influence. ACM Trans Knowl Discov Data. 2012;5(4):1–37. doi: 10.1145/2086737.2086741 [DOI] [Google Scholar]
18.He X, Song G, Chen W, Jiang Q. Influence Blocking Maximization in Social Networks under the Competitive Linear Threshold Model. In: SDM. SIAM/ Omnipress; 2012. p. 463–74.
19.Liu B, Cong G, Xu D, Zeng Y. Time Constrained Influence Maximization in Social Networks. In: ICDM. IEEE Computer Society; 2012. p. 439–48.
20.Zhuang H, Sun Y, Tang J, Zhang J, Sun X. Influence Maximization in Dynamic Social Networks. In: ICDM. IEEE Computer Society; 2013. p. 1313–8.
21.Li G, Chen S, Feng J, Tan K, Li W. Efficient location-aware influence maximization. In: SIGMOD Conference. ACM; 2014. p. 87–98.
22.Ohsaka N, Akiba T, Yoshida Y, Kawarabayashi K. Fast and Accurate Influence Maximization on Large Networks with Pruned Monte-Carlo Simulations. In: AAAI. AAAI Press; 2014. p. 138–44.
23.Huiyuan Zhang MTT, Subhankar Mishra. Recent Advances in Information Diffusion and Influence Maximization of Complex Social Networks. In: Opportunistic Mobile Social Networks. CRC Press; 2014. p. 37–68.
24.Chen W, Lin T, Tan Z, Zhao M, Zhou X. Robust Influence Maximization. In: KDD. ACM; 2016. p. 795–804.
25.Li Y, Fan J, Wang Y, Tan K-L. Influence Maximization on Social Graphs: A Survey. IEEE Trans Knowl Data Eng. 2018;30(10):1852–72. doi: 10.1109/tkde.2018.2807843 [DOI] [Google Scholar]
26.Granovetter M. Threshold Models of Collective Behavior. Am J Soc. 1978;83(6):1420–43. doi: 10.1086/226707 [DOI] [Google Scholar]
27.Akiba T, Sano S, Yanase T, Ohta T, Koyama M. Optuna: A Next-generation Hyperparameter Optimization Framework. In: KDD. ACM; 2019. p. 2623–31.
28.Real E, Liang C, So DR, Le QV. AutoML-Zero: Evolving Machine Learning Algorithms From Scratch. In: ICML. vol. 119 of Proceedings of Machine Learning Research. PMLR; 2020. p. 8007–19.
29.He X, Zhao K, Chu X. AutoML: A survey of the state-of-the-art. Knowledge-Based Systems. 2021;212:106622. doi: 10.1016/j.knosys.2020.106622 [DOI] [Google Scholar]
30.Narodytska N, Kasiviswanathan SP, Ryzhyk L, Sagiv M, Walsh T. Verifying Properties of Binarized Deep Neural Networks. In: AAAI. AAAI Press; 2018. p. 6615–24.
31.Ignatiev A, Narodytska N, Marques-Silva J. Abduction-Based Explanations for Machine Learning Models. In: AAAI. AAAI Press; 2019. p. 1511–9.
32.Leskovec J, Krevl A. SNAP Datasets: Stanford Large Network Dataset Collection. 2014. http://snap.stanford.edu/data
33.Kauffman SA. Metabolic stability and epigenesis in randomly constructed genetic nets. J Theor Biol. 1969;22(3):437–67. doi: 10.1016/0022-5193(69)90015-0 [DOI] [PubMed] [Google Scholar]
34.MacWilliams FJ, Sloane NJA. The theory of error-correcting codes. 2nd edition. North-Holland Publishing Company; 1978.
35.Ben-Zwi O, Hermelin D, Lokshtanov D, Newman I. An exact almost optimal algorithm for target set selection in social networks. In: EC. ACM; 2009. p. 355–62.
36.Nemhauser GL, Wolsey LA, Fisher ML. An analysis of approximations for maximizing submodular set functions–I. Math Program. 1978;14(1):265–94. [Google Scholar]
37.Swaminathan A. An Algorithm for Influence Maximization and Target Set Selection for the Deterministic Linear Threshold Model [Master’s thesis]. USA: Virginia Polytechnic Institute and State University; 2014.
38.Biere A, Heule M, van Maaren H, Walsh T, editors. Handbook of Satisfiability - Second Edition. vol. 336 of Frontiers in Artificial Intelligence and Applications. IOS Press; 2021.
39.Cook SA. The Complexity of Theorem-Proving Procedures. In: STOC. ACM; 1971. p. 151–8.
40.Levin LA. Universal Sequential Search Problems. Problems Inform Transmission. 1973;9:265–6. [Google Scholar]
41.Garey MR, Johnson DS. Computers and Intractability: A Guide to the Theory of NP-Completeness. New York (NY): W. H. Freeman & Co.; 1979.
42.Kochemazov S, Semenov A. Using synchronous Boolean networks to model several phenomena of collective behavior. PLoS One. 2014;9(12):e115156. doi: 10.1371/journal.pone.0115156 [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Boros E, Hammer PL. Pseudo-Boolean optimization. Discrete Applied Mathematics. 2002;123(1–3):155–225. doi: 10.1016/s0166-218x(01)00341-9 [DOI] [Google Scholar]
44.Burke EK, Kendall G, editor. Search Methodologies. 2nd edition. Springer. 2016.
45.Luke S. Essentials of metaheuristics. 2nd edition. Lulu. 2013.
46.Mühlenbein H. How Genetic Algorithms Really Work: Mutation and Hillclimbing. In: PPSN; 1992. p. 15–26.
47.Feller W. An introduction to probability theory and its applications. 3rd edition. John Wiley & Sons, Inc.; 1968.
48.Russell SJ, Norvig P. Artificial Intelligence – A Modern Approach. 3rd edition. Pearson Education; 2010.
49.Droste S, Jansen T, Wegener I. On the Analysis of the (1 + 1) Evolutionary Algorithm. Theor Comput Sci. 2002;276(1–2):51–81. [DOI] [PubMed] [Google Scholar]
50.Wegener I. Theoretical Aspects of Evolutionary Algorithms. In: ICALP; 2001. p. 64–78.
51.Doerr B, Le HP, Makhmara R, Nguyen TD. Fast Genetic Algorithms. In: GECCO; 2017. p. 777–84.
52.Watts DJ, Strogatz SH. Collective dynamics of “small-world” networks. Nature. 1998;393(6684):440–2. doi: 10.1038/30918 [DOI] [PubMed] [Google Scholar]
53.Barabasi A, Albert R. Emergence of scaling in random networks. Science. 1999;286(5439):509–12. doi: 10.1126/science.286.5439.509 [DOI] [PubMed] [Google Scholar]
54.Hagberg AA, Schult DA, Swart PJ. Exploring Network Structure, Dynamics, and Function using NetworkX. In: Varoquaux G, Vaught T, Millman J, editors. Proceedings of the 7th Python in Science Conference. Pasadena (CA); 2008, p. 11–15.
55.Ignatiev A, Morgado A, Marques-Silva J. PySAT: A Python Toolkit for Prototyping with SAT Oracles. In: SAT. LNCS; 2018. p. 428–37.
56.Wilcoxon F. Individual Comparisons by Ranking Methods. Biometrics Bulletin. 1945;1(6):80. doi: 10.2307/3001968 [DOI] [Google Scholar]
57.Semenov A, Chivilikhin D, Pavlenko A, Otpuschennikov I, Ulyantsev V, Ignatiev A. Evaluating the Hardness of SAT Instances Using Evolutionary Optimization Algorithms. In: CP; 2021. p. 47:1–18.
58.Semenov A, Zaikin O, Kochemazov S. Finding Effective SAT Partitionings Via Black-Box Optimization. In: Black Box Optimization, Machine Learning, and No-Free Lunch Theorems. vol. 170 of Springer Optimization and Its Applications. Springer; 2021. p. 319–55.
59.Bastolla U, Parisi G. A numerical study of the critical line of Kauffman networks. J Theor Biol. 1997;187(1):117–33. doi: 10.1006/jtbi.1997.0423 [DOI] [PubMed] [Google Scholar]
60.Bastolla U, Parisi G. The modular structure of Kauffman networks. Physica D: Nonlinear Phenomena. 1998;115(3–4):219–33. doi: 10.1016/s0167-2789(97)00242-x [DOI] [Google Scholar]
61.Derrida B, Stauffer D. Phase Transitions in Two-Dimensional Kauffman Cellular Automata. Europhys Lett. 1986;2(10):739–45. doi: 10.1209/0295-5075/2/10/001 [DOI] [Google Scholar]
62.Dubrova E, Teslenko M, Martinelli A. Kauffman networks: analysis and applications. In: ICCAD. IEEE Computer Society; 2005. p. 479–84.
63.Hawick KA, James HA, Scogings C. Structural Circuits and Attractors in Kauffman Networks. In: ACAL. vol. 4828 of Lecture Notes in Computer Science. Springer; 2007. p. 189–200.
64.Bryant. Graph-Based Algorithms for Boolean Function Manipulation. IEEE Trans Comput. 1986;C–35(8):677–91. doi: 10.1109/tc.1986.1676819 [DOI] [Google Scholar]
65.Granovetter M, Soong R. Threshold models of diffusion and collective behavior. J Math Soc. 1983;9(3):165–79. doi: 10.1080/0022250x.1983.9989941 [DOI] [Google Scholar]
66.Granovetter M, Soong R. Threshold models of interpersonal effects in consumer demand. J Eco Beh Organ. 1986;7(1):83–99. doi: 10.1016/0167-2681(86)90023-5 [DOI] [Google Scholar]
67.Chwe MS. Structure and Strategy in Collective Action. Am J Soc. 1999;105(1):128–56. doi: 10.1086/210269 [DOI] [Google Scholar]
68.Domingos PM, Richardson M. Mining the network value of customers. In: KDD. ACM; 2001. p. 57–66.
69.Leskovec J, Krause A, Guestrin C, Faloutsos C, VanBriesen JM, Glance NS. Cost-effective outbreak detection in networks. In: KDD. ACM; 2007. p. 420–9.
70.Gursoy F, Gunnec D. Influence maximization in social networks under Deterministic Linear Threshold Model. Knowledge-Based Systems. 2018;161:111–23. doi: 10.1016/j.knosys.2018.07.040 [DOI] [Google Scholar]
71.Kochemazov S, Semenov AA. Computational Study of Time Constrained Influence Maximization Problem under Deterministic Linear Threshold Model for Networks with Nonuniform Thresholds. In: MIPRO. IEEE; 2019. p. 1121–5.
72.Chen N. On the approximability of influence in social networks. In: SODA. SIAM; 2008. p. 1029–37.
73.Kochemazov S, Zaikin O, Semenov AA. Improving the effectiveness of SAT approach in application to analysis of several discrete models of collective behavior. In: MIPRO. IEEE; 2017. p. 1172–7.
74.Dubrova E, Teslenko M. A SAT-based algorithm for finding attractors in synchronous Boolean networks. IEEE/ACM Trans Comput Biol Bioinform. 2011;8(5):1393–9. doi: 10.1109/TCBB.2010.20 [DOI] [PubMed] [Google Scholar]
75.Evdokimov A, Kochemazov S, Semenov A. Application of symbolic computations to the study of discrete models of some gene networks. VichislTechnol. 2011;16(1):30–47. [Google Scholar]
76.Evdokimov AA, Kochemazov SE, Otpushchennikov IV, Semenov AA. Study of discrete automaton models of gene networks of nonregular structure using symbolic calculations. J Appl Ind Math. 2014;8(3):307–16. doi: 10.1134/s1990478914030028 [DOI] [Google Scholar]
77.Barrett CL, Hunt HBI, Marathe MV, Ravi SS, Rosenkrantz DJ, Stearns RE. Complexity of reachability problems for finite discrete dynamical systems. J Comput Syst Sci. 2006;72(8):1317–45. [Google Scholar]
78.Barrett CL, Hunt HBI, Marathe MV, Ravi SS, Rosenkrantz DJ, Stearns RE. Predecessor existence problems for finite discrete dynamical systems. Theor Comput Sci. 2007;386(1–2):3–37. [Google Scholar]
79.Marathe MV, Ravi SS, Rosenkrantz DJ, Stearns RE. Computational Aspects of Fault Location and Resilience Problems for Interdependent Infrastructure Networks. In: COMPLEX NETWORKS (1). vol. 812 of Studies in Computational Intelligence. Springer; 2018. p. 879–90.
80.Witt C. Runtime analysis of the (mu + 1) EA on simple Pseudo-Boolean functions. Evol Comput. 2006;14(1):65–86. doi: 10.1162/evco.2006.14.1.65 [DOI] [PubMed] [Google Scholar]
81.Doerr B, Johannsen D, Winzen C. Multiplicative drift analysis. In: GECCO. ACM; 2010. p. 1449–56.
82.Doerr B. Analyzing randomized search heuristics via stochastic domination. Theor Comput Sci. 2019;773:115–37. [Google Scholar]
83.Semenov AA. Merging Variables: One Technique of Search in Pseudo-Boolean Optimization. In: MOTOR (2). vol. 1090 of Communications in Computer and Information Science. Springer; 2019. p. 86–102.
84.He J, Yao X. Drift analysis and average time complexity of evolutionary algorithms. Artif Intell. 2001;127(1):57–85. [Google Scholar]
85.Semenov AA, Otpuschennikov IV, Antonov K. On Some Variants of the Merging Variables Based (1 + 1)-Evolutionary Algorithm with Application to MaxSAT Problem. In: MOTOR. vol. 12755 of LNCS; 2021. p. 111–24.
86.Pavlenko A, Semenov AA. Using Island Model in Asynchronous Evolutionary Strategy to Search for Backdoors for SAT. In: CEC. IEEE; 2024. p. 1–7.
87.Semenov AA, Pavlenko A, Chivilikhin D, Kochemazov S. On Probabilistic Generalization of Backdoors in Boolean Satisfiability. In: AAAI. AAAI Press; 2022. p. 10353–61.
88.Williams R, Gomes CP, Selman B. Backdoors To Typical Case Complexity. In: IJCAI. Morgan Kaufmann; 2003. p. 1173–8.
89.Kochemazov S. Comparative Study of Combinatorial Algorithms for Solving the Influence Maximization Problem in Networks under a Deterministic Linear Threshold Model. Procedia Computer Science. 2018;136:190–9. doi: 10.1016/j.procs.2018.08.252 [DOI] [Google Scholar]
90.Smirnov M, Kochemazov S, Semenov AA. The Study of the Target Set Selection Problem under Deterministic Linear Threshold Model Using Evolutionary Algorithms. In: MIPRO. IEEE; 2023. p. 1039–44.
91.Andreev A, Chukharev K, Kochemazov S, Semenov AA. Solving Influence Maximization Problem Under Deterministic Linear Threshold Model Using Metaheuristic optimization. In: MIPRO. IEEE; 2024. p. 19–24.
92.Metropolis N, Ulam S. The Monte Carlo Method. J Amer statistical assoc. 1949;44(247):335–41. [DOI] [PubMed] [Google Scholar]
93.Ding J, Zhang C, Shen L, Li S, Wang B, Xu Y, et al. Accelerating Primal Solution Findings for Mixed Integer Programs Based on Solution Prediction. In: AAAI. AAAI Press; 2020. p. 1452–9.
94.Veličković P, Blundell C. Neural algorithmic reasoning. Patterns (N Y). 2021;2(7):100273. doi: 10.1016/j.patter.2021.100273 [DOI] [PMC free article] [PubMed] [Google Scholar]
95.Ferber AM, Song J, Dilkina B, Yue Y. Learning Pseudo-Backdoors for Mixed Integer Programs. In: CPAIOR. vol. 1329 2 of LNCS. Springer; 2022. p. 91–102.
96.Cappart Q, Chételat D, Khalil EB, Lodi A, Morris C, Velickovic P. Combinatorial Optimization and Reasoning with Graph Neural Networks. J Mach Learn Res. 2023;24:130:1–61. [Google Scholar]
97.Cai J, Huang T, Dilkina B. Learning Backdoors for Mixed Integer Linear Programs with Contrastive Learning. In: ECAI. vol. 392 of Frontiers in Artificial Intelligence and Applications. IOS Press; 2024. p. 2418–25.
98.Manchanda S, Mittal A, Dhawan A, Medya S, Ranu S, Singh AK. GCOMB: Learning Budget-constrained Combinatorial Algorithms over Billion-sized Graphs. In: NeurIPS; 2020.
99.Meirom EA, Maron H, Mannor S, Chechik G. Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks. In: ICML. vol. 139 of Proceedings of Machine Learning Research. PMLR; 2021. p. 7565–77.
100.Ling C, Jiang J, Wang J, Thai MT, Xue R, Song J, et al. Deep Graph Representation Learning and Optimization for Influence Maximization. In: ICML. vol. 202 of Proceedings of Machine Learning Research. PMLR; 2023. p. 21350–61.
101.Hubara I, Courbariaux M, Soudry D, El-Yaniv R, Bengio Y. Binarized Neural Networks. In: NIPS; 2016. p. 4107–15.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data related to experiments is available in a public repository https://github.com/Alex-Andrv/evo_network_public.

[pone.0331109.ref001] 1.Newman MEJ. The Structure and Function of Complex Networks. SIAM Review. 2003;45(2):167–256. [Google Scholar]

[pone.0331109.ref002] 2.Dorogovtsev SN, Goltsev AV, Mendes JFF. Critical phenomena in complex networks. Rev Mod Phys. 2008;80:1275–335. [Google Scholar]

[pone.0331109.ref003] 3.Newman MEJ. Networks: An Introduction. Oxford University Press; 2010.

[pone.0331109.ref004] 4.Kempe D, Kleinberg JM, Tardos É. Maximizing the spread of influence through a social network. In: KDD. ACM; 2003. p. 137–46.

[pone.0331109.ref005] 5.Kempe D, Kleinberg JM, Tardos É. Influential Nodes in a Diffusion Model for Social Networks. In: ICALP. vol. 3580 of Lecture Notes in Computer Science. Springer; 2005. p. 1127–38.

[pone.0331109.ref006] 6.Bharathi S, Kempe D, Salek M. Competitive Influence Maximization in Social Networks. In: WINE. vol. 4858 of Lecture Notes in Computer Science. Springer; 2007. p. 306–11.

[pone.0331109.ref007] 7.Tang J, Sun J, Wang C, Yang Z. Social influence analysis in large-scale networks. In: KDD. ACM; 2009. p. 807–16.

[pone.0331109.ref008] 8.Tang Y, Xiao X, Shi Y. Influence maximization: near-optimal time complexity meets practical efficiency. In: SIGMOD Conference. ACM; 2014. p. 75–86.

[pone.0331109.ref009] 9.Wang Y, Cong G, Song G, Xie K. Community-based greedy algorithm for mining top-K influential nodes in mobile social networks. In: KDD. ACM; 2010. p. 1039–48.

[pone.0331109.ref010] 10.Borodin A, Filmus Y, Oren J. Threshold Models for Competitive Influence in Social Networks. In: WINE. vol. 6484 of Lecture Notes in Computer Science. Springer; 2010. p. 539–50.

[pone.0331109.ref011] 11.Chen W, Wang Y, Yang S. Efficient influence maximization in social networks. In: KDD. ACM; 2009. p. 199–208.

[pone.0331109.ref012] 12.Chen W, Wang C, Wang Y. Scalable influence maximization for prevalent viral marketing in large-scale social networks. In: KDD. ACM; 2010. p. 1029–38.

[pone.0331109.ref013] 13.Das A, Kempe D. Submodular meets Spectral: Greedy Algorithms for Subset Selection, Sparse Approximation and Dictionary Selection. In: ICML. Omnipress; 2011. p. 1057–64.

[pone.0331109.ref014] 14.Jiang Q, Song G, Cong G, Wang Y, Si W, Xie K. Simulated Annealing Based Influence Maximization in Social Networks. In: AAAI. AAAI Press; 2011, p. 127–32.

[pone.0331109.ref015] 15.Chen W, Yuan Y, Zhang L. Scalable Influence Maximization in Social Networks under the Linear Threshold Model. In: ICDM. IEEE Computer Society; 2010. p. 88–97.

[pone.0331109.ref016] 16.Sun J, Tang J. A Survey of Models and Algorithms for Social Influence Analysis. In: Social Network Data Analytics. Springer; 2011. p. 177–214.

[pone.0331109.ref017] 17.Gomez-Rodriguez M, Leskovec J, Krause A. Inferring Networks of Diffusion and Influence. ACM Trans Knowl Discov Data. 2012;5(4):1–37. doi: 10.1145/2086737.2086741 [DOI] [Google Scholar]

[pone.0331109.ref018] 18.He X, Song G, Chen W, Jiang Q. Influence Blocking Maximization in Social Networks under the Competitive Linear Threshold Model. In: SDM. SIAM/ Omnipress; 2012. p. 463–74.

[pone.0331109.ref019] 19.Liu B, Cong G, Xu D, Zeng Y. Time Constrained Influence Maximization in Social Networks. In: ICDM. IEEE Computer Society; 2012. p. 439–48.

[pone.0331109.ref020] 20.Zhuang H, Sun Y, Tang J, Zhang J, Sun X. Influence Maximization in Dynamic Social Networks. In: ICDM. IEEE Computer Society; 2013. p. 1313–8.

[pone.0331109.ref021] 21.Li G, Chen S, Feng J, Tan K, Li W. Efficient location-aware influence maximization. In: SIGMOD Conference. ACM; 2014. p. 87–98.

[pone.0331109.ref022] 22.Ohsaka N, Akiba T, Yoshida Y, Kawarabayashi K. Fast and Accurate Influence Maximization on Large Networks with Pruned Monte-Carlo Simulations. In: AAAI. AAAI Press; 2014. p. 138–44.

[pone.0331109.ref023] 23.Huiyuan Zhang MTT, Subhankar Mishra. Recent Advances in Information Diffusion and Influence Maximization of Complex Social Networks. In: Opportunistic Mobile Social Networks. CRC Press; 2014. p. 37–68.

[pone.0331109.ref024] 24.Chen W, Lin T, Tan Z, Zhao M, Zhou X. Robust Influence Maximization. In: KDD. ACM; 2016. p. 795–804.

[pone.0331109.ref025] 25.Li Y, Fan J, Wang Y, Tan K-L. Influence Maximization on Social Graphs: A Survey. IEEE Trans Knowl Data Eng. 2018;30(10):1852–72. doi: 10.1109/tkde.2018.2807843 [DOI] [Google Scholar]

[pone.0331109.ref026] 26.Granovetter M. Threshold Models of Collective Behavior. Am J Soc. 1978;83(6):1420–43. doi: 10.1086/226707 [DOI] [Google Scholar]

[pone.0331109.ref027] 27.Akiba T, Sano S, Yanase T, Ohta T, Koyama M. Optuna: A Next-generation Hyperparameter Optimization Framework. In: KDD. ACM; 2019. p. 2623–31.

[pone.0331109.ref028] 28.Real E, Liang C, So DR, Le QV. AutoML-Zero: Evolving Machine Learning Algorithms From Scratch. In: ICML. vol. 119 of Proceedings of Machine Learning Research. PMLR; 2020. p. 8007–19.

[pone.0331109.ref029] 29.He X, Zhao K, Chu X. AutoML: A survey of the state-of-the-art. Knowledge-Based Systems. 2021;212:106622. doi: 10.1016/j.knosys.2020.106622 [DOI] [Google Scholar]

[pone.0331109.ref030] 30.Narodytska N, Kasiviswanathan SP, Ryzhyk L, Sagiv M, Walsh T. Verifying Properties of Binarized Deep Neural Networks. In: AAAI. AAAI Press; 2018. p. 6615–24.

[pone.0331109.ref031] 31.Ignatiev A, Narodytska N, Marques-Silva J. Abduction-Based Explanations for Machine Learning Models. In: AAAI. AAAI Press; 2019. p. 1511–9.

[pone.0331109.ref032] 32.Leskovec J, Krevl A. SNAP Datasets: Stanford Large Network Dataset Collection. 2014. http://snap.stanford.edu/data

[pone.0331109.ref033] 33.Kauffman SA. Metabolic stability and epigenesis in randomly constructed genetic nets. J Theor Biol. 1969;22(3):437–67. doi: 10.1016/0022-5193(69)90015-0 [DOI] [PubMed] [Google Scholar]

[pone.0331109.ref034] 34.MacWilliams FJ, Sloane NJA. The theory of error-correcting codes. 2nd edition. North-Holland Publishing Company; 1978.

[pone.0331109.ref035] 35.Ben-Zwi O, Hermelin D, Lokshtanov D, Newman I. An exact almost optimal algorithm for target set selection in social networks. In: EC. ACM; 2009. p. 355–62.

[pone.0331109.ref036] 36.Nemhauser GL, Wolsey LA, Fisher ML. An analysis of approximations for maximizing submodular set functions–I. Math Program. 1978;14(1):265–94. [Google Scholar]

[pone.0331109.ref037] 37.Swaminathan A. An Algorithm for Influence Maximization and Target Set Selection for the Deterministic Linear Threshold Model [Master’s thesis]. USA: Virginia Polytechnic Institute and State University; 2014.

[pone.0331109.ref038] 38.Biere A, Heule M, van Maaren H, Walsh T, editors. Handbook of Satisfiability - Second Edition. vol. 336 of Frontiers in Artificial Intelligence and Applications. IOS Press; 2021.

[pone.0331109.ref039] 39.Cook SA. The Complexity of Theorem-Proving Procedures. In: STOC. ACM; 1971. p. 151–8.

[pone.0331109.ref040] 40.Levin LA. Universal Sequential Search Problems. Problems Inform Transmission. 1973;9:265–6. [Google Scholar]

[pone.0331109.ref041] 41.Garey MR, Johnson DS. Computers and Intractability: A Guide to the Theory of NP-Completeness. New York (NY): W. H. Freeman & Co.; 1979.

[pone.0331109.ref042] 42.Kochemazov S, Semenov A. Using synchronous Boolean networks to model several phenomena of collective behavior. PLoS One. 2014;9(12):e115156. doi: 10.1371/journal.pone.0115156 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0331109.ref043] 43.Boros E, Hammer PL. Pseudo-Boolean optimization. Discrete Applied Mathematics. 2002;123(1–3):155–225. doi: 10.1016/s0166-218x(01)00341-9 [DOI] [Google Scholar]

[pone.0331109.ref044] 44.Burke EK, Kendall G, editor. Search Methodologies. 2nd edition. Springer. 2016.

[pone.0331109.ref045] 45.Luke S. Essentials of metaheuristics. 2nd edition. Lulu. 2013.

[pone.0331109.ref046] 46.Mühlenbein H. How Genetic Algorithms Really Work: Mutation and Hillclimbing. In: PPSN; 1992. p. 15–26.

[pone.0331109.ref047] 47.Feller W. An introduction to probability theory and its applications. 3rd edition. John Wiley & Sons, Inc.; 1968.

[pone.0331109.ref048] 48.Russell SJ, Norvig P. Artificial Intelligence – A Modern Approach. 3rd edition. Pearson Education; 2010.

[pone.0331109.ref049] 49.Droste S, Jansen T, Wegener I. On the Analysis of the (1 + 1) Evolutionary Algorithm. Theor Comput Sci. 2002;276(1–2):51–81. [DOI] [PubMed] [Google Scholar]

[pone.0331109.ref050] 50.Wegener I. Theoretical Aspects of Evolutionary Algorithms. In: ICALP; 2001. p. 64–78.

[pone.0331109.ref051] 51.Doerr B, Le HP, Makhmara R, Nguyen TD. Fast Genetic Algorithms. In: GECCO; 2017. p. 777–84.

[pone.0331109.ref052] 52.Watts DJ, Strogatz SH. Collective dynamics of “small-world” networks. Nature. 1998;393(6684):440–2. doi: 10.1038/30918 [DOI] [PubMed] [Google Scholar]

[pone.0331109.ref053] 53.Barabasi A, Albert R. Emergence of scaling in random networks. Science. 1999;286(5439):509–12. doi: 10.1126/science.286.5439.509 [DOI] [PubMed] [Google Scholar]

[pone.0331109.ref054] 54.Hagberg AA, Schult DA, Swart PJ. Exploring Network Structure, Dynamics, and Function using NetworkX. In: Varoquaux G, Vaught T, Millman J, editors. Proceedings of the 7th Python in Science Conference. Pasadena (CA); 2008, p. 11–15.

[pone.0331109.ref055] 55.Ignatiev A, Morgado A, Marques-Silva J. PySAT: A Python Toolkit for Prototyping with SAT Oracles. In: SAT. LNCS; 2018. p. 428–37.

[pone.0331109.ref056] 56.Wilcoxon F. Individual Comparisons by Ranking Methods. Biometrics Bulletin. 1945;1(6):80. doi: 10.2307/3001968 [DOI] [Google Scholar]

[pone.0331109.ref057] 57.Semenov A, Chivilikhin D, Pavlenko A, Otpuschennikov I, Ulyantsev V, Ignatiev A. Evaluating the Hardness of SAT Instances Using Evolutionary Optimization Algorithms. In: CP; 2021. p. 47:1–18.

[pone.0331109.ref058] 58.Semenov A, Zaikin O, Kochemazov S. Finding Effective SAT Partitionings Via Black-Box Optimization. In: Black Box Optimization, Machine Learning, and No-Free Lunch Theorems. vol. 170 of Springer Optimization and Its Applications. Springer; 2021. p. 319–55.

[pone.0331109.ref059] 59.Bastolla U, Parisi G. A numerical study of the critical line of Kauffman networks. J Theor Biol. 1997;187(1):117–33. doi: 10.1006/jtbi.1997.0423 [DOI] [PubMed] [Google Scholar]

[pone.0331109.ref060] 60.Bastolla U, Parisi G. The modular structure of Kauffman networks. Physica D: Nonlinear Phenomena. 1998;115(3–4):219–33. doi: 10.1016/s0167-2789(97)00242-x [DOI] [Google Scholar]

[pone.0331109.ref061] 61.Derrida B, Stauffer D. Phase Transitions in Two-Dimensional Kauffman Cellular Automata. Europhys Lett. 1986;2(10):739–45. doi: 10.1209/0295-5075/2/10/001 [DOI] [Google Scholar]

[pone.0331109.ref062] 62.Dubrova E, Teslenko M, Martinelli A. Kauffman networks: analysis and applications. In: ICCAD. IEEE Computer Society; 2005. p. 479–84.

[pone.0331109.ref063] 63.Hawick KA, James HA, Scogings C. Structural Circuits and Attractors in Kauffman Networks. In: ACAL. vol. 4828 of Lecture Notes in Computer Science. Springer; 2007. p. 189–200.

[pone.0331109.ref064] 64.Bryant. Graph-Based Algorithms for Boolean Function Manipulation. IEEE Trans Comput. 1986;C–35(8):677–91. doi: 10.1109/tc.1986.1676819 [DOI] [Google Scholar]

[pone.0331109.ref065] 65.Granovetter M, Soong R. Threshold models of diffusion and collective behavior. J Math Soc. 1983;9(3):165–79. doi: 10.1080/0022250x.1983.9989941 [DOI] [Google Scholar]

[pone.0331109.ref066] 66.Granovetter M, Soong R. Threshold models of interpersonal effects in consumer demand. J Eco Beh Organ. 1986;7(1):83–99. doi: 10.1016/0167-2681(86)90023-5 [DOI] [Google Scholar]

[pone.0331109.ref067] 67.Chwe MS. Structure and Strategy in Collective Action. Am J Soc. 1999;105(1):128–56. doi: 10.1086/210269 [DOI] [Google Scholar]

[pone.0331109.ref068] 68.Domingos PM, Richardson M. Mining the network value of customers. In: KDD. ACM; 2001. p. 57–66.

[pone.0331109.ref069] 69.Leskovec J, Krause A, Guestrin C, Faloutsos C, VanBriesen JM, Glance NS. Cost-effective outbreak detection in networks. In: KDD. ACM; 2007. p. 420–9.

[pone.0331109.ref070] 70.Gursoy F, Gunnec D. Influence maximization in social networks under Deterministic Linear Threshold Model. Knowledge-Based Systems. 2018;161:111–23. doi: 10.1016/j.knosys.2018.07.040 [DOI] [Google Scholar]

[pone.0331109.ref071] 71.Kochemazov S, Semenov AA. Computational Study of Time Constrained Influence Maximization Problem under Deterministic Linear Threshold Model for Networks with Nonuniform Thresholds. In: MIPRO. IEEE; 2019. p. 1121–5.

[pone.0331109.ref072] 72.Chen N. On the approximability of influence in social networks. In: SODA. SIAM; 2008. p. 1029–37.

[pone.0331109.ref073] 73.Kochemazov S, Zaikin O, Semenov AA. Improving the effectiveness of SAT approach in application to analysis of several discrete models of collective behavior. In: MIPRO. IEEE; 2017. p. 1172–7.

[pone.0331109.ref074] 74.Dubrova E, Teslenko M. A SAT-based algorithm for finding attractors in synchronous Boolean networks. IEEE/ACM Trans Comput Biol Bioinform. 2011;8(5):1393–9. doi: 10.1109/TCBB.2010.20 [DOI] [PubMed] [Google Scholar]

[pone.0331109.ref075] 75.Evdokimov A, Kochemazov S, Semenov A. Application of symbolic computations to the study of discrete models of some gene networks. VichislTechnol. 2011;16(1):30–47. [Google Scholar]

[pone.0331109.ref076] 76.Evdokimov AA, Kochemazov SE, Otpushchennikov IV, Semenov AA. Study of discrete automaton models of gene networks of nonregular structure using symbolic calculations. J Appl Ind Math. 2014;8(3):307–16. doi: 10.1134/s1990478914030028 [DOI] [Google Scholar]

[pone.0331109.ref077] 77.Barrett CL, Hunt HBI, Marathe MV, Ravi SS, Rosenkrantz DJ, Stearns RE. Complexity of reachability problems for finite discrete dynamical systems. J Comput Syst Sci. 2006;72(8):1317–45. [Google Scholar]

[pone.0331109.ref078] 78.Barrett CL, Hunt HBI, Marathe MV, Ravi SS, Rosenkrantz DJ, Stearns RE. Predecessor existence problems for finite discrete dynamical systems. Theor Comput Sci. 2007;386(1–2):3–37. [Google Scholar]

[pone.0331109.ref079] 79.Marathe MV, Ravi SS, Rosenkrantz DJ, Stearns RE. Computational Aspects of Fault Location and Resilience Problems for Interdependent Infrastructure Networks. In: COMPLEX NETWORKS (1). vol. 812 of Studies in Computational Intelligence. Springer; 2018. p. 879–90.

[pone.0331109.ref080] 80.Witt C. Runtime analysis of the (mu + 1) EA on simple Pseudo-Boolean functions. Evol Comput. 2006;14(1):65–86. doi: 10.1162/evco.2006.14.1.65 [DOI] [PubMed] [Google Scholar]

[pone.0331109.ref081] 81.Doerr B, Johannsen D, Winzen C. Multiplicative drift analysis. In: GECCO. ACM; 2010. p. 1449–56.

[pone.0331109.ref082] 82.Doerr B. Analyzing randomized search heuristics via stochastic domination. Theor Comput Sci. 2019;773:115–37. [Google Scholar]

[pone.0331109.ref083] 83.Semenov AA. Merging Variables: One Technique of Search in Pseudo-Boolean Optimization. In: MOTOR (2). vol. 1090 of Communications in Computer and Information Science. Springer; 2019. p. 86–102.

[pone.0331109.ref084] 84.He J, Yao X. Drift analysis and average time complexity of evolutionary algorithms. Artif Intell. 2001;127(1):57–85. [Google Scholar]

[pone.0331109.ref085] 85.Semenov AA, Otpuschennikov IV, Antonov K. On Some Variants of the Merging Variables Based (1 + 1)-Evolutionary Algorithm with Application to MaxSAT Problem. In: MOTOR. vol. 12755 of LNCS; 2021. p. 111–24.

[pone.0331109.ref086] 86.Pavlenko A, Semenov AA. Using Island Model in Asynchronous Evolutionary Strategy to Search for Backdoors for SAT. In: CEC. IEEE; 2024. p. 1–7.

[pone.0331109.ref087] 87.Semenov AA, Pavlenko A, Chivilikhin D, Kochemazov S. On Probabilistic Generalization of Backdoors in Boolean Satisfiability. In: AAAI. AAAI Press; 2022. p. 10353–61.

[pone.0331109.ref088] 88.Williams R, Gomes CP, Selman B. Backdoors To Typical Case Complexity. In: IJCAI. Morgan Kaufmann; 2003. p. 1173–8.

[pone.0331109.ref089] 89.Kochemazov S. Comparative Study of Combinatorial Algorithms for Solving the Influence Maximization Problem in Networks under a Deterministic Linear Threshold Model. Procedia Computer Science. 2018;136:190–9. doi: 10.1016/j.procs.2018.08.252 [DOI] [Google Scholar]

[pone.0331109.ref090] 90.Smirnov M, Kochemazov S, Semenov AA. The Study of the Target Set Selection Problem under Deterministic Linear Threshold Model Using Evolutionary Algorithms. In: MIPRO. IEEE; 2023. p. 1039–44.

[pone.0331109.ref091] 91.Andreev A, Chukharev K, Kochemazov S, Semenov AA. Solving Influence Maximization Problem Under Deterministic Linear Threshold Model Using Metaheuristic optimization. In: MIPRO. IEEE; 2024. p. 19–24.

[pone.0331109.ref092] 92.Metropolis N, Ulam S. The Monte Carlo Method. J Amer statistical assoc. 1949;44(247):335–41. [DOI] [PubMed] [Google Scholar]

[pone.0331109.ref093] 93.Ding J, Zhang C, Shen L, Li S, Wang B, Xu Y, et al. Accelerating Primal Solution Findings for Mixed Integer Programs Based on Solution Prediction. In: AAAI. AAAI Press; 2020. p. 1452–9.

[pone.0331109.ref094] 94.Veličković P, Blundell C. Neural algorithmic reasoning. Patterns (N Y). 2021;2(7):100273. doi: 10.1016/j.patter.2021.100273 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0331109.ref095] 95.Ferber AM, Song J, Dilkina B, Yue Y. Learning Pseudo-Backdoors for Mixed Integer Programs. In: CPAIOR. vol. 1329 2 of LNCS. Springer; 2022. p. 91–102.

[pone.0331109.ref096] 96.Cappart Q, Chételat D, Khalil EB, Lodi A, Morris C, Velickovic P. Combinatorial Optimization and Reasoning with Graph Neural Networks. J Mach Learn Res. 2023;24:130:1–61. [Google Scholar]

[pone.0331109.ref097] 97.Cai J, Huang T, Dilkina B. Learning Backdoors for Mixed Integer Linear Programs with Contrastive Learning. In: ECAI. vol. 392 of Frontiers in Artificial Intelligence and Applications. IOS Press; 2024. p. 2418–25.

[pone.0331109.ref098] 98.Manchanda S, Mittal A, Dhawan A, Medya S, Ranu S, Singh AK. GCOMB: Learning Budget-constrained Combinatorial Algorithms over Billion-sized Graphs. In: NeurIPS; 2020.

[pone.0331109.ref099] 99.Meirom EA, Maron H, Mannor S, Chechik G. Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks. In: ICML. vol. 139 of Proceedings of Machine Learning Research. PMLR; 2021. p. 7565–77.

[pone.0331109.ref100] 100.Ling C, Jiang J, Wang J, Thai MT, Xue R, Song J, et al. Deep Graph Representation Learning and Optimization for Influence Maximization. In: ICML. vol. 202 of Proceedings of Machine Learning Research. PMLR; 2023. p. 21350–61.

[pone.0331109.ref101] 101.Hubara I, Courbariaux M, Soudry D, El-Yaniv R, Bengio Y. Binarized Neural Networks. In: NIPS; 2016. p. 4107–15.

PERMALINK

Combining greedy and evolutionary algorithms to maximize influence in networks under deterministic linear threshold model

Alexander Andreev

Stepan Kochemazov

Alexander Semenov

Roles

Abstract

Introduction

1 Preliminaries

1.1 Synchronous Boolean networks and DLTM

Fig 1. Example of a Synchronous Boolean Network with 4 vertices and its State Transition Graph.

1.2 IM and TSS w.r.t. DLTM

1.3 Greedy heuristics for IM and TSS w.r.t. DLTM

1.4 Finding exact solution to TSS using SAT solvers

2 Evolutionary optimization

2.1 A new variant of (1 + 1)-EA

3 IM and TSS as pseudo-Boolean optimization problems

4 Computational experiments

4.1 Benchmarks

4.1.1 Network graphs.

4.1.2 Arc weights and thresholds.

4.2 Comparing to exact solution

Table 1. Results on the comparison of evolutionary and greedy algorithms with exact solution.

4.3 Solving IM with (1 + 1)-WEA

Fig 2. Convergence plots for solving IM for p2p-Gnutella06_uni_1-1000_uni_0.75-1 via (1 + 1)-WEA.

Fig 3. Comparison of the set found when solving IM for k = 19 by the greedy algorithm (left) and the one found by (1 + 1)-WEA for a network with n = 50 vertices.

4.4 Evaluation of (1 + 1)-WEA for solving TSS

Table 2. Results on solving TSS using different evolutionary and greedy algorithms.

Fig 4. Convergence plots for solving TSS for SNAP networks.

4.5 Sensitivity analysis

Table 3. Results on solving TSS using (1 + 1)-WEA_v3 with different values of Q.

Fig 5. Illustration of how (1 + 1)-WEA_v3 improves the target set when solving TSS.

4.6 Analysis of practical efficiency

5 Related work

6 Discussion, conclusion and future work

Acknowledgments

Data Availability

Funding Statement

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases