Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2022 Jul 24;26(18):9069–9081. doi: 10.1007/s00500-022-07305-3

On the shortest path problem of uncertain random digraphs

Hao Li 1, Kun Zhang 1,
PMCID: PMC9308943  PMID: 35909949

Abstract

In the field of graph theory, the shortest path problem is one of the most significant problems. However, since varieties of indeterminated factors appear in complex networks, determining of the shortest path from one vertex to another in complex networks may be a lot more complicated than the cases in deterministic networks. To illustrate this problem, the model of uncertain random digraph will be proposed via chance theory, in which some arcs exist with degrees in probability measure and others exist with degrees in uncertain measure. The main focus of this paper is to investigate the main properties of the shortest path in uncertain random digraph. Methods and algorithms are designed to calculate the distribution of shortest path more efficiently. Besides, some numerical examples are presented to show the efficiency of these methods and algorithms.

Keywords: Shortest path problem, Distance, Uncertain random digraph, Chance theory, Uncertainty theory

Introduction

In the field of graph and network optimization, the shortest path problem is a very important problem. The shortest path problem of graph aims to find the minimum of all path lengths from the starting point to the end point. The length of a path refers to the sum of the weights of all arcs on the path or the number of arcs. The shortest path problem has a wide range of applications, including the lowest cost problems, the shortest time problems, the minimum flow problems and so on. For certain graphs and networks, there are many classical algorithms for shortest path problems, such as: Dijkstra Algorithm, Bellman–Ford Algorithm, SPFA Algorithm and Floyd–Warshall Algorithm (Bondy and Murty 2008).

However, practical networks are always changing instead of keeping static. As a result, shortest path problems seem to become more complicated. To satisfy the need of researching shortest path problems under network changes, indeterministic factors such as random and uncertain factors have been taken into consideration in the research of graph theory.

To deal with the problems of indeterminacy, Kolmogoroff (1933) established the systems of probability theory in 1933. As we all know, probability needs a lot of historical data to make sure the statistical rules can be sufficiently generated. In 1959, two models of random graphs were proposed by Erdos and Renyi (1959). Of course, the theory of probability is never a master key to deal with all the problems in the field of indeterminacy. Just as what we mentioned above, probability theory is data-dependent, i.e., if there’s no sufficient data to estimate the distribution, the theory will be invalid to solve the practical problems.

Besides probability theory, fuzzy theory also provides an important model framework for the study of the shortest path problem. Zadeh (1965) proposed the definition of fuzzy set in 1965. When using fuzzy theory to solve the shortest path problem, there are two main methods: direct solution methods and heuristic methods. In the work of using the direct solution methods, Deng et al. (2012) extended classical Dijkstra algorithm to deal with the fuzzy shortest path problems. Dubois and Prade (1980) found new methods to extended Floyd algorithms to solve fuzzy shortest path problems. Okada and Soper (2000) proposed a method to find all Pareto optimal paths from one certain node to all the other nodes with fuzzy numbers. As for heuristic methods, Chuang and Kung (2005) proposed a heuristic procedure to find the FSP length among all possible paths in a network. Hernandes et al. (2007) uses genetic algorithm to solve the fuzzy shortest path problem. Ebrahimnejad et al. (2016, 2015, 2020) proposed artificial bee colony (ABC) algorithm, particle swarm optimization algorithm, lexicographic optimization method to solve the fuzzy shortest problems with fuzzy arc weights.

Many excellent results have been obtained by using fuzzy theory to study the shortest path problem. However, fuzzy set theory is lack of duality axiom, which may lead to wrong results in practice. In order to overcome this defect of fuzzy theory, uncertainty theory was founded by Liu (2007). Shortest path problem with uncertain arc lengths was discussed by Gao (2011). In 2013, uncertain graph was proposed by Gao and Gao (2013). Some of uncertain graphs were soon discussed, such as Euler index (Zhang and Peng 2012), diameter index (Gao and Gao 2013), cycle index (Gao 2013), regularity index (Gao 2014), tree index (Gao 2016). In addition, some traditional problems of graph theory were discussed in uncertain graphs. More information can be found in Li et al. (2018), Rosyida et al. (2018), Zhou et al. (2014a), Gao et al. (2015), Gao and Qin (2016).

In recent years, networks became more and more complex. Different kinds of indeterminate factors appear at the same time. Random and uncertain factors in particular, appear in most cases. In order to solve this problem, Liu (2013b) proposed the chance theory in 2013 which includes concepts of uncertain random variable and chance measure. In addition, expected value and variance of an uncertain random variable were proposed. Chance theory has been applied to many optimization networks problems, such as uncertain random programming (Liu 2013a), goal programming (Qin 2018), multi-objective programming (Zhou et al. 2014b).

Uncertain random graph was proposed by Liu (2014). In an uncertain random graph, all edges are independent, and some edges exist with degrees in probability measure while other edges exist with degrees in uncertain measure. Liu (2014) discussed the diameter index of an uncertain random graph. Sheng and Gao (23016); Sheng and Mei (2020) gave the EVSPM, SPM and MCMSPM models of shortest path problems in uncertain random graph. In 2016, the Euler index of an uncertain random graph was discussed by Zhang et al. (2017). In 2018, the cycle index of an uncertain random graph was discussed by Chen et al. (2018). Li and Zhang (2020) discussed the edge-connectivity of uncertain random graph, and the vertex-connectivity of uncertain random graph was investigated by Li and Gao (2020).

The uncertain random graph model provides a new idea for us to study the shortest path problem of networks in reality. In real life, when we describe an uncertain event, we must first generate the distribution function of the event in advance. If this distribution is close enough to the frequency, we can safely use the probability model for modeling. If not, we will use uncertainty theory for modeling and analysis. However, the network in real life is often very complex. Some arcs can have enough historical data for probability modeling, while others may have a large difference between the distribution and the actual situation due to the lack of historical data, so the uncertain theory needs to be used. To sum up, we can find that the uncertain random graph model is more suitable for the description of real networks than the traditional probability model. It is a better research model of real networks and their shortest path problems.

In this paper, we will propose the model of uncertain random digraph, and discuss the shortest path problem. Definition of distance from one vertex to another in a random uncertain digraph will be given. Then, we will discuss the shortest path distribution and give an efficient algorithm to calculate the distribution. And we will use the method of this paper to establish the spreading model of COVID-19 in social networks. In addition, we will illustrate that our algorithm will greatly reduce the complexity compared with existing methods.

The remainder of the paper is organized as follows. In Sect. 2, we will introduce some necessary notations of digraph, and give a brief introduction of chance theory. In Sects. 3 and 4, we will discuss the shortest path problem in uncertain digraph and uncertain random digraph, respectively, including the form of the distribution of shortest path and algorithms of simplifying the process of calculation. Examples will be shown to illustrate the efficiency of these algorithms. The last section will conclude this paper with a brief summary.

Preliminaries

In this section, we first introduce some necessary definitions and notations of digraph. Then, we introduce some preliminary knowledge about chance theory.

Notations of digraphs

A digraph D is an ordered pair (VA) consisting of a set V of vertices and a set A of arcs (directed edges). Without loss of generality, in the rest of this paper, we assume V={1,2,3,,n}. An ordered pair (i,j)A means there exist an arc from vertex i to j, where i and j are called the tail and the head of arc (ij), respectively. Here, we will only consider simple digraphs.

A directed walk with length k is an alternating sequence W=i1i2ikik+1 such that {i1,i2,,ik}V and (ij,ij+1)A for j=1,2,,k. If all the vertices of W are distinct, then W is called a directed path.

The adjacency matrix of digraph D, denoted by M(D), is a n×n matrix such that

M(D)=α11α12α13α1nα21α22α23α2nα31α32α33α3nαn1αn2αn3αnn,

where

αij=1,if(i,j)A0,otherwise.

And if there is an arc from vertex i to j, then αij=1, otherwise αij=0. A well-known proposition is proposed in the following to find out whether there exists a directed walk of length k(k=1,2,3,,n-1) from i to j.

Proposition 1

Let D be a digraph and let X be the adjacency matrix of D. For two distinct vertices i, j, there exists a directed walk of length k from i to j if and only if aij(Xk)1, where aij(Xk) is the (ij) entry of matrix Xk.

For any vertices i,jV, if there exist a directed path from vertex i to vertex j, then the distance from i to j, denoted by d(ij), is the minimal length of all the directed paths from i to j. If there is no directed path from i to j, which means i and j are in different components, then d(ij) is usually defined as . In the field of graph theory, d(ij) can be calculated by Dijkstra Algorithm Bondy and Murty (2008), the complexity of which is O(n2). As all directed paths are directed walks, a proposition can be proposed in the following.

Proposition 2

Let D be a digraph and X be the adjacency matrix of D. For two distinct vertices i, j and k=1,2,3,,n-1,

d(i,j)kifandonlyifαij(I+X+X2++Xk)>0,

where αij is the (ij)-entry of matrix I+X+X2++Xk.

Chance theory

Let (Γ,L,M) and (Ω,A,Pr) be an uncertainty space and a probability space, respectively. The product (Γ,L,M)×(Ω,A,Pr) is called a chance space. The product σ-algebra L×A is the smallest σ-algebra containing all measurable rectangles of the form Λ×A, where ΛL and AA. For each ΘL×A, Θ is called an event of the chance space. The chance measure of event Θ was defined by Liu (2013b) as

Ch{Θ}=01Pr{ωΩ|M{γΓ|(γ,ω)Θ}x}dx.

An uncertain random variable is a function η from a chance space (Γ,L,M)×(Ω,A,Pr) to the set of real numbers such that {ηB} is an event in L×A for any Borel set B. A random variable is called a Boolean random variable if it takes values from {0,1}. Similarly, an uncertain variable or an uncertain random variable is called a Boolean uncertain variable or a Boolean uncertain random variable, respectively, if it takes values from {0,1}. A function with n variables is called a Boolean function if it is a mapping from {0,1}n to {0,1}.

Theorem 1

(Liu 2013a) Assume that η1,η2,,ηm are independent Boolean random variables, i.e.,

ηi=1withprobabilitymeasureai0withprobabilitymeasure1-ai

for i=1,2,,m, and the variables τ1,τ2,,τn are independent Boolean uncertain variables, i.e.,

τj=1withuncertainmeasurebj0withuncertainmeasure1-bj

for j=1,2,,n. If f is a Boolean function, then η=f(η1,η2,,ηm,τ1,τ2,,τn) is a Boolean uncertain random variable such that

Ch{η=1}=(x1,,xm){0,1}m(i=1mμi(xi))f(x1,,xm),

where

f(x1,,xm)=supf(x1,,xm,y1,,yn)=1min1jnvj(yj),ifsupf(x1,,xm,y1,,yn)=1min1jnvj(yj)<0.51-supf(x1,,xm,y1,,yn)=0min1jnvj(yj),ifsupf(x1,,xm,y1,,yn)=1min1jnvj(yj)0.5,μi(xi)=aiifxi=11-aiifxi=0(i=1,2,,m),vj(yj)=bjifyj=11-bjifyj=0(j=1,2,,n).

The shortest path problem in uncertain digraphs

In this section, we will introduce the definition of uncertain digraph. Then, we will discuss the shortest path problem in uncertain digraphs and give the distribution of shortest path.

Problem description

We first give the definition of uncertain digraph and some necessary notations.

Definition 1

Let V be a set of n vertices, and write V={1,2,,n}. We call

A=α11α12α1nα21α22α2nαn1αn2αnn

an uncertain adjacency matrix, if αij represent the uncertain measures that the arcs(i,j) exist, i,j= 1,2,,n,, respectively. The ordered pair (V,A) is called an uncertain digraph, which is denoted by D.

Remark 1

Since D is simple, αii=0,fori=1,2,,n. Note that A is normally asymmetric, i.e., it’s possible that αijαji.

Write

X=x11x12x1nx21x22x2nxn1xn2xnn

and

X=Xxij=0or1,i,j=1,2,,nxii=0,i=1,2,,n. 1

In D=(V,A), all the arcs (ij) exist with the uncertain measure αij. For any XX defined by (1), xij=1 represents that the arc from i to j exists. By the definition of αij, the uncertain measure that the event {xij=1} happens is αij.

As D has n vertices, there are n(n-1) uncertain arcs. Then, there are 2n(n-1) possible realizations of arcs. Each realization of arcs and corresponding vertices form a deterministic digraph D. Since a digraph could be fully characterized by its adjacency matrix. Once a realization of arcs is given, there exists an unique matrix X in X satisfying such that X is the adjacency matrix of D. By the product axiom of uncertain measure (Liu 2007), the uncertain measure of the event that D appears is

min1i,jnμij(X),

where

μij(X)=αijifxij=11-αijifxij=0.

For any two distinct vertices i and j in D, the length of the shortest path from vertex i to j, denoted by d(ij), is a function of all xij. Since xij is an uncertain variable, d(ij) is an uncertain variable as well. Once X is given, we could calculate d(ij) by Dijkstra Algorithm (Bondy and Murty 2008). Therefore, we will naturally study the distribution and the properties of uncertain variable d(ij).

Distribution of shortest path in uncertain digraph

Aiming at dealing with the problem in the last subsection, we first give a lemma and some notations of uncertain theory.

Lemma 1

(Operational Law of Boolean System) (Liu 2013a) Assume that the variables ξ1,ξ2,,ξn are independent Boolean uncertain variables, i.e.,

ξj=1withuncertainmeasurebj0withuncertainmeasure1-bj

for j=1,2,,n. If f is a Boolean function, then ξ=f(ξ1,ξ2,,ξn) is a Boolean uncertain variable such that

M{ξ=1}=supf(y1,,yn)=1min1jnvj(yj),ifsupf(y1,,yn)=1min1jnvj(yj)<0.51-supf(y1,,yn)=0min1jnvj(yj),ifsupf(y1,,yn)=1min1jnvj(yj)0.5,

where yj take values either 0 or 1, and vj are defined by

vj(yj)=bjifyj=11-bjifyj=0(j=1,2,,n)

for i=1,2,,n, respectively.

According to Proposition 2, the property d(i,j)k can be fully characterized by the adjacency matrix of a digraph. So we give the characteristic function of this property in the following.

For any n×n Boolean matrix X and any integer k (k=1,2,,n-1), we define

fk(X)=1,ifαij(I+X++Xk)>00,otherwise, 2

where αij is the (ij)-entry of matrix I+X+X2++Xk. And by Lemma 1, we have the following Theorem.

Theorem 2

Let D=(V,A) be an uncertain digraph. For distinct vertices i, j V, the uncertain measure that the distance from i to j is no more than k is denoted by ηijk(D). And we have

ηijk(D)=supXX,fk(X)=1min1i,jnμij(X),ifsupXX,fk(X)=1min1i,jnμij(X)<0.51-supXX,fk(X)=0min1i,jnμij(X),ifsupXX,fk(X)=0min1i,jnμij(X)0.5.

where

μij(X)=αij,ifxij=11-αij,ifxij=0 3

and X is the set of matrices satisfying (1).

Remark 2

According to the duality axiom, the uncertain measure that i and j are in two different components is denoted by ηij(D). And we have ηij(D)=1-ηijn-1(D).

According to Theorem 2, it is easy to calculate the distribution function of d(ij).

Theorem 3

Let D=(V,A) be an uncertain digraph. For two distinct vertices i and j, let Φ(i,j)(x) be the distribution function of uncertain variable d(ij). Then,

Φ(i,j)(x)=0,ifx<1ηij[x](D),if1x<n-1ηijn-1(D),ifxn-1.

Maximum dijk-digraph

In D, given distinct vertices i and j, the uncertain distribution function of d(ij) is given by Theorem 2 and 3. However, it’s impractical to calculate the distribution function by Theorem 2 and 3, as we have to concern all the realization digraphs. For examples, if D has n uncertain arcs, then it has 2n(n-1)2 uncertain arcs and 2n(n-1)2 realization digraphs. The complexity of computing fk is O(n3). So the complexity is 2n(n-1)2O(n3) in total, which is not polynomial. In next part, we will propose a more efficient method to calculate ηijk(D), and present a polynomial algorithm.

Definition 2

Let D=(V,A) be an uncertain digraph. A digraph D is called a maximum dijk digraph if all the following hold.

(1)    V(D)=V;

(2)    d(i,j)k;

(3)    min(i,j)A(D)αij is maximum.

Maximum dijk digraph given by Definition 2 could be used to calculated uncertain measure ηijk(D), which is shown in the following Theorem.

Theorem 4

Let D=(V,A) be an uncertain digraph and i, j be the two vertices in V. Let D=(V,A) be a maximum dijk digraph of D. Then,

ηijk(D)=min(i,j)A(D)αij.

Proof

We assume min(i,j)A(D)αij=a0. Let D0=(V,A0) be an uncertain digraph, where A0=(bij)n×n and

bij=αij,if(i,j)A(D)0,if(i,j)A(D)for1i,jn.

We will first prove ηijk(D)a0. Note that D could be viewed as a realization digraph of D0, and the uncertain measure that it appears is a0. We have ηijk(D)min(i,j)A(D)αij=a0. As D has more arcs than D0, it’s easier to find a directed path from i to j. Then, we have

ηijk(D)ηijk(D0)min(i,j)A(D)αij=a0. 4

We then show that ηijk(D)a0. As the number of realization digraphs is finite, there exists a realization digraph D1 of D whose adjacency matrix X1 satisfies fk(X1)=1 and

supXX,fk(X)=1min1i,jnμij(X)=min1i,jnμij(X1).

fk and μij are defined by (2) and (3), respectively. By the definition of D, min(i,j)A(D1)μija0. Thus,

supXX,fk(X)=1min1i,jnμij(X)=min1i,jnμij(X1)min(i,j)A(D1)μij=min(i,j)A(D1)αijmin(i,j)A(D1)(1-αij)min(i,j)A(D1)αija0. 5

Let D2=(V,A2) be a realization digraph of D such that A2=A(D){(i,j)|αij>a0}\{(i,j)|αij=a0}. Let X2 be the adjacency matrix of D2. Therefore, min(i,j)A(D2)αij>a0. By the choice of digraph D2, d(i,j)>k holds in D2, i.e., fk(X2)=0.

If supXX,fk(X)=1min1i,jnαij(X)<0.5, according to Theorem 2 and (5), then we have

ηijk(D)=supXX,fk(X)=1min1i,jnμij(X)a0. 6

If supXX,fk(X)=1min1i,jnαij(X)0.5, then by (5), a00.5. According to Theorem 2,

ηijk(D)=1-supXX,fk(X)=0min1i,jnμij(X)1-min1i,jnμij(X2)=1-min(i,j)A(D2)αijmin(i,j)A(D2)(1-αij)1-min(i,j)A(D2)αij(1-a0)1-a0(1-a0).

Since a00.51-a0, we have

ηijk(D)1-(1-a0)=a0. 7

By inequalities (6) and (7), ηijk(D)a0. And by inequalities (4), ηijk(D)=a0 is proved to be correct.

Algorithm and example

By Theorem 4, in an uncertain digraph D=(V,A), once ijk are given, the uncertain measure ηijk(D) can be calculated by finding a maximum dijk digraph of D. Although such a maximum digraph may not be unique, we can always use a greedy algorithm to find a maximum dijk digraph of D.

Algorithm 1

Greedy Algorithm for calculating ηijk(D).

Note that D=(V,A) be an uncertain digraph. Let A(D) be the set of uncertain arcs. Let i and j be the vertices in V and k be a constant taking values from {1,2,n-1}.

Step 1. Set A=.

Step 2. Choose (i,j)A(D) such that αij is maximum. Set A=A{(i,j)}, and A(D)= A(D)\{(i,j)}. Calculate d(ij) using Dijkstra Algorithm(if there’s no path from i to j, return to n) in D=(V,A).

Step 3. If d(i,j)k, stop the iteration. The digraph D=(V,A) is a maximum dijk digraph. Thus, by Theorem 4, ηijk(D)=αij. Otherwise, go to Step 4.

Step 4. If A(D), then go to Step 2. If A(D)=, then stop and ηijk(D)=0.

Using algorithm 1 and Theorem 3, distribution function Φ(i,j)(x) could be calculated easily by the following Algorithm.

Algorithm 2

Maximum Digraph Algorithm(MDA) for calculating Φ(i,j)(x).

Note that D=(V,A) be an uncertain digraph. Let A(D) be the set of uncertain arcs. Let i and j be the vertices in V.

Step 1. Set A=, k=n-1.

Step 2. Choose (i,j)A(D) such that αij is maximum. Set A=A{(i,j)}, and A(D)= A(D)\{(i,j)}. Calculate d(ij) using Dijkstra Algorithm(if there’s no path from i to j, return to n) in D=(V,A).

Step 3. If d(i,j)k, then the digraph D=(V,A) is a maximum dijk digraph. Thus, by Theorem 4, ηijk(D)=αij. Let k=k-1 and then go to Step 3; otherwise, go to Step 4.

Step 4. If A(D), then go to Step 2. If A(D)=, then stop and ηijk(D)=0.

Step 5. According to Theorem 3, we could calculate the distribution function Φ(i,j)(x).

Next, we will discuss the complexity of Algorithm 2. Let m be the number of different uncertain values with which arcs exist. In Step 2,  we need to calculate d(ij) by Dijkstra Algorithm, which the complexity is O(n2). As the Algorithm has m interactions of Step 2, the complexity of Algorithm 1 is O(mn2). When the number of vertices is small, the algorithm is quite efficient. Here is an example.

Example 1

Let D={V,A} be an uncertain digraph. V={1,2,3,4,5} and

A=0000.90.4100.900.310.300.6000000.80000.30.

We calculate the value of η25k, k{1,2,3,4} using Algorithm 1.

The uncertain measures of all arcs are listed from high to low as follows: 1,0.9,0.8,0.6,0.4,0.3 (Fig. 1).

Fig. 1.

Fig. 1

Iterations in Example 1

In the first iteration, k=4, and then we add the arcs with uncertain measure 1. It can be easily verified that d(2,5)=+>4. According to the algorithm, we continue the iteration.

In the second iteration, k=4, then we add the arcs with uncertain measure 0.9. It can be found that d(2,5)=+>4. We continue the iteration.

In the third iteration, k=4, then we add the arcs with uncertain measure 0.8. We find it that d(2,5)=34. D is the maximum d254 digraph, thus η254=0.8. Let k=3 and we find that d(2,5)3 still holds. So η253=0.8. Let k=2, and we continue the iteration.

In the fourth iteration, k=2, then we add the arcs with uncertain measure 0.6. By Dijkstra Algorithm, d(2,5)=3>2, so we continue.

In the fifth iteration, k=2, then we add the arcs with uncertain measure 0.4. We find that d(2,5)=22, so η252=0.4. Let k=1, and we continue the iteration.

In the sixth iteration, k=1, then we add the arcs with uncertain measure 0.3. d(2,5)=11, so η251=0.3. All the directed edges have been added, so we stop the iteration. By the iteration process, distribution function of d(2, 5) is

Φ(2,5)(x)=0,ifx<10.3,if1x<20.4,if2x<30.8,ifx3,

whose image is shown in Fig.2.

Fig. 2.

Fig. 2

Distribution function of d(2, 5)

Effectiveness of MDA algorithm

In order to illustrate the effectiveness of this algorithm, we choose two classical algorithms for solving the shortest path problem: genetic algorithm (GA) and ant colony algorithm (ACO) to compare with the maximum directed graph algorithm (MDA) proposed in this paper. Each algorithm performs ten rounds of operation, and the final results are as follows. We select an uncertain digraph D with 20 vertices and 39 arcs, and the connection of its vertices is shown in Fig.3.

Fig. 3.

Fig. 3

Uncertain digraph D

We study the shortest path distribution of d(1, 20) in uncertain digraph D. The three algorithms run 10 times, respectively, and the final distributions are all the same. The final results of convergence iteration times and running time are shown in Fig. 4. We can find the following points:

Fig. 4.

Fig. 4

Model comparison

(1) According to Fig. 4, the average running time of MDA algorithm is only 1.452s, which is less than 14.806s of GA algorithm and 22.565s of ACO algorithm. Compared with the two classical algorithms, the running time can be greatly shortened.

(2) The average convergence time of MDA algorithm is 0.854s, which is greatly improved compared with 9.577s of GA algorithm and 12.911s of ACO algorithm.

(3) The variance of average running time and average convergence time of MDA algorithm are 0.008996 and 0.003384, respectively, which are far less than GA algorithm and ACO algorithm. This algorithm has good stability.

Above all, since GA algorithm and ACO algorithm are heuristic algorithms, their running speed and results are greatly affected by initial parameters. MDA algorithm does not depend on the selection of initial parameters, but only related to the structure of uncertain digraph. It runs stably and efficiently. So, it is an efficient algorithm to solve the shortest path distribution problems in uncertain digraphs.

Numerical simulation of the shortest path model on COVID-19 transmission in social networks

The transmission of COVID-19 has severely threatened people’s lives since 2019. Therefore, research about the transmission of COVID-19 has its theoretical and practical significance. According to the World Health Organization(WHO), COVID-19 has distinct community transmission characteristics. It can easily spread through the social network and the spreading rate of COVID-19 is much higher than the other viruses. In this part, we simulate the spread of viruses in social networks.

We use a vertex to represent a person. For any two person i and j, if COVID-19 is transmitted from i to j, there is an arc between the corresponding vertices, otherwise it does not exist. Due to the strong dependence of probability model on the amount of data. Since the virus transmission is affected by indeterminate factors, which are lack of supporting historical data, the distribution obtained by models of probability theory will be far from the actual situation. Then, we have to ask experts in relevant fields to give the experience distribution. As a result, we assume all the arcs exist with a certain measure. As the measures that COVID-19 transmit from i to j and from j to i are different, it can only be characterized by directed graphs instead of undirected graphs. At the same time, due to the lack of historical data of interpersonal communication, it is difficult for us to build a model by probability theory. As a result, we assume all the arcs exist with degree of uncertain measure.

When a small number of infected people appear in a social network, they have infected some people around because of the incubation period. At this time, the distance of uncertain digraph can be used to estimate the required isolation range. One reason why random graphs are not suitable here is that the probability measures of arcs in the graph are difficult to be obtained objectively. The other reason is that according to the independence of probability measure, the probability measure of a product event is equal to the product of all the probability measures. According to the little probability event principle, the transmission range is generally very small. However, this estimation is sometimes inconsistent with the reality, since the transmission range will be much larger due to the incubation period of the virus.

Then, we simulate how a person infects others around him. We assume the number of vertices in the transmission networks is 100. According to sociological research, we assume that the out-degrees and in-degrees of each vertex are around 25 originally. We consider the transmission range of infected person 1. Choose one of the remaining 99 healthy people, assuming the 100th one. We will explore how many people the virus needs to transmit from the first person to the 100th person. This is equivalent to calculating the distance between them in an uncertain digraph.

Let D={V,A} be an uncertain digraph with 100 vertices. All the arcs exist with degree of uncertain measure that two people have close contact. We calculate the value of η1,100k and the distribution function of uncertain variable d(1, 100) using Algorithm 1 and 2. The original social network of 100 vertices is shown in Fig. 5. By algorithm 1, the value of η1,1002 is 0.1250. The same is true of the calculation of η1,100k. So we can get distribution function of d(1, 100). Distribution function of d(1, 100) is

Φ(1,100)(x)=0,ifx<10.1250,if1x<20.3631,if2x<30.5781,if3x<40.8340,if4x<50.9006,if5x<60.9580,if6x<70.9681,ifx7,

whose image is shown in Fig. 6. Through calculation, we find that d(1, 100) does not exceed 7 with an uncertain measure of 0.9580. In other words, patient 1 will infect the healthy person 100 through up to six times of transmission. In fact, if we choose other healthy people, the calculation result is almost the same, which means that an infected person will infect the whole community by seven rounds of transmission at most.

Fig. 5.

Fig. 5

Social network of 100 nodes

Fig. 6.

Fig. 6

Distribution function of d(1, 100)

Shortest path problem in uncertain random digraph

In this section, we will propose the model of uncertain random digraph, and discuss the shortest path problem in uncertain random digraphs.

Problem description

In an uncertain random digraph with n vertices, all arcs are independent, and some arcs exist with degrees in probability measure while other arcs exist with degrees in uncertain measure.

We define two disjoint collections of arcs,

U={(i,j)|1i,jnand(i,j)areuncertainarcs},R={(i,j)|1i,jnand(i,j)arerandomarcs},

with UR={(i,j)|1i,jn}. Note that deterministic arcs are regarded as special uncertain ones which exist with degrees in uncertain measure 1.

The adjacency matrix is an n×n matrix

A=α11α12α1nα21α22α2nαn1αn2αnn,

where αij represent the truth values in uncertain measure or probability measure if the arc from i to j exist, i,j=1,2,,n. As digraphs considered in this paper are simple, αii=0, for i=1,2,,n, and A is normally asymmetric matrix, i.e., αijαji, for ij.

Definition 3

Assume V is the collection of vertices, U is the collection of uncertain arcs, R is the collection of random arcs, and A is the adjacency matrix. Then, the quartette D=(V,U,R,A) is said to be an uncertain random digraph.

For an uncertain random digraph D=(V,U,R,A), write

X=x11x12x1nx21x22x2nxn1xn2xnn

and

X=X|xij=0or1,if(i,j)Rxij=0,if(i,j)Uxii=0,i=1,2,,n. 8

For any XX, the extension class of X is defined by

X=Y|yij=xij,if(i,j)Ryij=0or1,if(i,j)Uyii=0,i=1,2,,n. 9

As there are n(n-1) possible arcs, there are 2n(n-1) possible realization of arcs. Each one of them could be represented by a deterministic digraph, which is called a realization digraph. Since a digraph could be fully characterized by its adjacency matrix. For every XX and YX, such that Y is the adjacency matrix of a realization digraph H. The chance measure of the event that the realization digraph H appears, is

(i,j)Rwij(Y)min(i,j)Uwij(Y), 10

where

wij(Y)=αij,ifyij=11-αij,ifyij=0.

Example 2. Let D=(V,U,R,A) be an uncertain random digraph (shown in Fig.7), A={(1,2),(1,3),(1,5),(2,3),(3,1),(3,4),(4,5),(5,2)}. where V={1,2,3,4,5}, R={(1,2),(1,3),(3,4)}, U={(1,5),(2,3),(3,1),(4,5),(5,2)}, and

A=00.20.900.8000.4000.1000.7000000.300.4000.

As D has 8 edges, it has 28 realizations graphs, three of which are shown in Fig.7. The chance measure of the event that H1 appears is

0.2×0.9×0.7×min{0.6,0.9,0.4,0.8,0.3}=0.0378.

Similarly, the chance measure of the events that H2 and H3 appear are 0.0504 and 0.0014, respectively.

Fig. 7.

Fig. 7

Uncertain random digraph D and its realization digraphs

Remark 3

If U=, an uncertain random digraph D=(V,U,R,A) becomes a random digraph. Then,

X=X|xij=0or1,i,j=1,2,,nxii=0,i=1,2,,n. 11

For any XX, X is the adjacency matrix of a realization digraph, which appears with probability

1i,jnwij(X).

If R=, an uncertain random digraph D=(V,U,R,A) becomes an uncertain digraph in Definition 1.

The shortest path length from i to j, denoted by d(ij), is a function of all xij. Since xij is a uncertain random digraph, d(ij) is an uncertain random variable. Therefore, we will naturally study the chance distribution and the properties of uncertain random variable d(ij).

Distribution of shortest path in uncertain random digraph

In this section, we will give the formula of the shortest path distribution function in uncertain random digraphs. According to Theorem 1, we have the following theorem.

Theorem 5

Let D=(V,U,R,A) be an uncertain random digraph. For any vertices i, j V, X is the class of matrices satisfying (8), X is the extension class of X satisfying (9). The chance measure that the distance from i to j is at most k is denoted by ηijk(D). And we have

ηijk(D)=XX((i,j)Rwij(X))fk(X),

where

fk(X)=supYX,fk(Y)=1min(i,j)Uwij(Y),ifsupYX,fk(Y)=1min(i,j)Uwij(Y)<0.51-supYX,fk(Y)=0min(i,j)Uwij(Y),ifsupYX,fk(Y)=1min(i,j)Uwij(Y)0.5.

and

wij(X)=αij,ifxij=11-αij,ifxij=0. 12

Remark 4

According to the duality axiom, the chance measure that i and j are in two different components is denoted by ηij. And we have ηij=1-ηijk(D).

Corollary 1

Let D=(V,A) be a random digraph. X is the class of matrices satisfying (8), X is the extension class of X satisfying (9). wij satisfies (12). For any vertices i, j V, we have

1. the probability measure that the distance from i to j is at most k is

P{d(i,j)k}=XX,gk(Y)=11i,jnwij(X).

2. the probability measure that i and j are in two different components is

P{d(i,j)=}=1-P{d(i,j)n-1},

According to Theorem 5 and the definition of distribution function, it is easy to calculate the distribution function of d(ij) by the following theorem.

Theorem 6

Let D=(V,U,R,A) be an uncertain random digraph. For any two distinct vertices i and j, Φ(i,j)(x) is the chance distribution function of uncertain random variable d(ij), and we have

Φ(i,j)(x)=0,ifx<1ηij[x](D),if1x<n-1ηijn-1(D),ifxn-1.

Although the method of formulating chance distribution function has been given, it seems to be theoretical and quite complicated because we have to concern all the realization digraphs. So we need a simplified formula to calculate final result. According to Theorem 5, after random arcs are fixed, the uncertain random digraph will become an uncertain digraph. As a result, we can calculated f by Algorithm 1. In next part, we will propose an efficient method to calculate ηijk(D).

Algorithm and example

According to Algorithm 1 and Theorem 5, we give an efficient Algorithm to calculate ηijk(D) of an uncertain random digraph.

Algorithm 3

Algorithm for calculating the ηijk(D)of an uncertain random digraph

Step 1. Divide the realization digraph of uncertain random digraph D=(V,U,R,A) into l parts by the arcs with degree of random measure, where l=2|R|. For each part of the D, we assume there exist only arcs with degree of random measure from the beginning. Let j=1.

Step 2. Calculate fk by Algorithm 1. Let j=j+1.

Step 3. If j=l, stop and calculate the ηijk(D) by Theorem 5; if j<l, set j=j+1 and go to Step 2.

Next, we will discuss the complexity of Algorithm 3. We assume |R(D)|=r, m be the number of different uncertain values with which arcs exist. In Step 2,  we need to calculate fk by Algorithm 1, whose the complexity is O(mn2). Since there are 2r iterations, the complexity of Step 2 is O(2rmn2). In Step 3, there are 2r multiplications and each multiplication has complexity of O(r). Then, we have to calculate 2r additions. Thus, the complexity of Algorithm 3 is

O(2rmn2)×(O(r2r)+2r)=4rrmn2.

Example 2

D={V,U,R,A} be an uncertain random digraph. R={(3,1),(2,1)} and other arcs exist with degree of uncertain measure. We calculate the value of η253(D) using Algorithm 3 (Figs. 8, 9).

Fig. 8.

Fig. 8

Uncertain random digraph D

Fig. 9.

Fig. 9

The result of the four cases

Since there are 2 arcs exist with degree of probability measure, we divide the realization of digraph into 22=4 cases. We calculate the fk, respectively. We give the solution procedure of the case 1, another 3 cases can be solved by the same method. According to Algorithm 1, four maximum dijk digraph are found and fk equals to 0.8, 0.6, 0.8, 0.6, respectively. By Theorem 5, η253(D) is given by the following equation

η253(D)=0.336+0.168+0.144+0.072=0.72.

As a result, the distribution function Φ(2,5)(x) is

Φ(2,5)(x)=0,ifx<10.3,if1x<20.36,if2x<30.72,if3x<40.776,ifx4,

whose image is shown in Fig. 10.

Fig. 10.

Fig. 10

Distribution function of d(2, 5)

Conclusions

In this paper, we discussed the shortest path problem in uncertain digraphs and uncertain random digraphs. Key properties of d(i,j)k have been discussed. We gave the formulas for calculating ηijk(D) and distribution function of d(ij). And an efficient polynomial algorithm was proposed to calculate ηijk(D). When the number of vertices is small, the algorithm performs well and can greatly reduce the amount of computation. We also used this model to study the transmission of covid-19 in social networks.

Due to the strong dependence of probability model on the amount of data. If there is no or lack of historical data, the distribution obtained by using the model of probability theory will be far from the actual situation. In this case, the theory of uncertainty will have great theoretical value and practical significance. Of course, if there is enough historical data, the classical probability model may have better results.

Further research will focus on the following aspects. First, MDA algorithm is a more efficient algorithm. With the help of the property that the shortest path increases with the increase in edges in directed graphs, this method can be considered when studying the index properties of edge increase in other uncertain directed graphs in the future. Second, important indices, such as diameter, radius could be considered in the frame of uncertain random digraphs. Finally, similar models could be introduced, such as uncertain fuzzy graph.

Funding

The authors have not disclosed any funding.

Data availibility

Enquiries about data availability should be directed to the authors.

Declarations

Conflict of interest

The authors have not disclosed any competing interests.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Hao Li, Email: hlimath@ruc.edu.cn.

Kun Zhang, Email: zk-math2020@ruc.edu.cn.

References

  1. Abbaszadeh Sori A, Ebrahimnejad A, Motameni H. The fuzzy inference approach to solve multi-objective constrained shortest path problem. J Intell Fuzzy Syst. 2020;38(4):4711–4720. doi: 10.3233/JIFS-191413. [DOI] [Google Scholar]
  2. Bollobás B (2011) Random graphs, 2nd edn. Cambridge University Press, Cambridge
  3. Bondy A, Murty USR. Graph theory. Berlin: Springer; 2008. [Google Scholar]
  4. Chen L, Peng J, Rao C, Rosyida I. Cycle index of uncertain random graph. J Intell Fuzzy Syst. 2018;34:4249–4259. doi: 10.3233/JIFS-17373. [DOI] [Google Scholar]
  5. Chuang TN, Kung JY. The fuzzy shortest path length and the corresponding shortest path in a network. Comput Oper Res. 2005;32(6):1409–1428. doi: 10.1016/j.cor.2003.11.011. [DOI] [Google Scholar]
  6. Deng Y, Chen Y, Zhang Y, Mahadevan S. Fuzzy Dijkstra algorithm for shortest path problem under uncertain environment. Appl Soft Comput. 2012;12(3):1231–1237. doi: 10.1016/j.asoc.2011.11.011. [DOI] [Google Scholar]
  7. Dubois D, Prade H. Fuzzy sets and systems: theory and applications. New York: Academic Press; 1980. [Google Scholar]
  8. Ebrahimnejad A. An acceptability index based approach for solving shortest path problem on a network with interval weights. RAIRO Oper Res. 2021;55(1):1767–1787. doi: 10.1051/ro/2020033. [DOI] [Google Scholar]
  9. Ebrahimnejad A, Karimnejad Z, Alrezaamiri H. Particle swarm optimisation algorithm for solving shortest path problems with mixed fuzzy arc weights. Int J Appl Decis Sci. 2015;8(2):203–222. [Google Scholar]
  10. Ebrahimnejad A, Madjid Tavava, Hamnidreza Alrezaamiri. A novel artificial bee colony algorithm for shortest path problems with fuzzy arc weights. Measurement. 2016;93:48–56. doi: 10.1016/j.measurement.2016.06.050. [DOI] [Google Scholar]
  11. Ebrahimnejad A, Tabatabaei S, Santos-Arteaga FJ. A novel lexicographic optimization method for solving shortest path problems with interval-valued triangular fuzzy arc weights. J Intell Fuzzy Syst. 2020;39(1):1277–1287. doi: 10.3233/JIFS-192176. [DOI] [Google Scholar]
  12. Erdos P, Renyi A. On random graphs. Publicationes Mathematicae. 1959;6:290–297. [Google Scholar]
  13. Esfahanian H (2008) On the evolution of graph connectivity algorithms
  14. Gao Y. Shortest path problem with uncertain arc lengths. Comput Math Appl. 2011;62(6):2591–2600. doi: 10.1016/j.camwa.2011.07.058. [DOI] [Google Scholar]
  15. Gao X. Cycle index of uncertain graph. Information. 2013;15(12):270–277. [Google Scholar]
  16. Gao X. Regularity index of uncertain graph. J Intell Fuzzy Syst. 2014;27:1671–1678. doi: 10.3233/IFS-141133. [DOI] [Google Scholar]
  17. Gao X. Tree index of uncertain graphs. Soft Comput. 2016;20(4):1449–1458. doi: 10.1007/s00500-015-1597-5. [DOI] [Google Scholar]
  18. Gao X, Gao Y. Connectedness index of uncertain graph. Int J Uncertain Fuzz Knowl Based Syst. 2013;21(1):127–137. doi: 10.1142/S0218488513500074. [DOI] [Google Scholar]
  19. Gao Y, Qin Z. On computing the edge-connectivity of an uncertain graph. IEEE Trans Fuzzy Syst. 2016;24(4):981–991. doi: 10.1109/TFUZZ.2015.2500267. [DOI] [Google Scholar]
  20. Gao R, Yao K. Importance index of components in uncertain random systems. Knowl Based Syst. 2016;109(1):208–217. doi: 10.1016/j.knosys.2016.07.006. [DOI] [Google Scholar]
  21. Gao Y, Yang L, Li S, Kar S. On distribution function of the diameter in uncertain graph. Inf Sci. 2015;296:61–74. doi: 10.1016/j.ins.2014.10.048. [DOI] [Google Scholar]
  22. Gao R, Sun Y, Ralescu DA. Order statistics of uncertain random variables with application to k-out-of-n system. Fuzzy Optim Decis Mak. 2016;16(2):1–23. [Google Scholar]
  23. Gao X, Guo C, Yin X, Yu X. The computation on α-connectedness index of uncertain graph. Clust Comput. 2019;22(3):5691–5701. doi: 10.1007/s10586-017-1471-6. [DOI] [Google Scholar]
  24. Gilbert EN. Random graphs. Ann Math Stat. 1959;30(4):1141–1144. doi: 10.1214/aoms/1177706098. [DOI] [Google Scholar]
  25. Hernandes F, Lamata MT, Verdegay JL, Yamakami A. The shortest path problem on networks with fuzzy parameters. Fuzzy Sets Syst. 2007;158(14):1561–1570. doi: 10.1016/j.fss.2007.02.022. [DOI] [Google Scholar]
  26. Hou Y, Peng W. Distance between uncertain random variables. Math Model Eng Probl. 2014;1:17–24. [Google Scholar]
  27. Ke H, Su T, Ni Y. Uncertain random multilevel programming with application to production control problem. Soft Comput. 2015;19(6):1739–1746. doi: 10.1007/s00500-014-1361-2. [DOI] [Google Scholar]
  28. Kolmogoroff A. Grundbegriffe der Wahrscheinlichkeitsrechnung. Berlin: Julius Springer; 1933. [Google Scholar]
  29. Li H, Gao X. On the vertex-connectivity of an uncertain random graph. IEEE Access. 2020;8:85504–85514. doi: 10.1109/ACCESS.2020.2990574. [DOI] [Google Scholar]
  30. Li H , Zhang H. On the edge-connectivity of an uncertain random graph. IEEE Access. 2020;99:1. [Google Scholar]
  31. Li H, Zhang B, Peng J. On the matching number of an uncertain graph. Iran J Fuzzy Syst. 2018;15(2):89–108. [Google Scholar]
  32. Liu B (2007) Uncertainty theory, 2nd edn. Springer, Berlin
  33. Liu B. Some research problems in uncertainy theory. J Uncertain Syst. 2009;3:3–10. [Google Scholar]
  34. Liu Y (2013a) Uncertain random programming with applications. Fuzzy Optim Decis Mak 12(2):153–169
  35. Liu Y (2013b) Uncertain random variables: a mixture of uncertainty and randomness. Soft Comput 17(4):625–634
  36. Liu B. Uncertain random graph and uncertain random network. J Uncertain Syst. 2014;8(1):3–12. [Google Scholar]
  37. Liu Y, Ralescu DA. Risk index in uncertain random risk analysis. Int J Uncertain Fuzz Knowl Based Syst. 2014;22(4):491–504. doi: 10.1142/S021848851450024X. [DOI] [Google Scholar]
  38. Matula DW (1987) Determining edge connectivity in 0 (nm). In: Symposium on Foundations of Computer Science
  39. Okada S, Soper T. A shortest path problem on a network with fuzzy arc lengths. Fuzzy Sets Syst. 2000;109(1):129–140. doi: 10.1016/S0165-0114(98)00054-2. [DOI] [Google Scholar]
  40. Qin Z. Uncertain random goal programming. Fuzzy Optim Decis Mak. 2018;17(4):375–386. doi: 10.1007/s10700-017-9277-9. [DOI] [Google Scholar]
  41. Rosyida I, Peng J, Chen L, Widodo W, Indrati ChR, Sugeng KA. An uncertain chromatic number of an uncertain graph based on α-cut coloring. Fuzzy Optim Decis Mak. 2018;17(1):103–123. doi: 10.1007/s10700-016-9260-x. [DOI] [Google Scholar]
  42. Sheng Y , Gao Y. Shortest path problem of uncertain random network. Comput Ind Eng. 2016;99:66. doi: 10.1016/j.cie.2016.07.011. [DOI] [Google Scholar]
  43. Sheng Y, Mei X. Uncertain random shortest path problem. Soft Comput. 2020;24(4):66. doi: 10.1007/s00500-018-03714-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Soifer A. The mathematical coloring book. New York: Springer; 2009. [Google Scholar]
  45. Wen M, Kang R. Reliability analysis in uncertain random system. Fuzzy Optim Decis Mak. 2016;15(4):491–506. doi: 10.1007/s10700-016-9235-y. [DOI] [Google Scholar]
  46. Yao K, Gao J. Law of large numbers for uncertain random variables. IEEE Trans Fuzzy Syst. 2016;24(3):615–621. doi: 10.1109/TFUZZ.2015.2466080. [DOI] [Google Scholar]
  47. Zadeh LA. Fuzzy sets. Inf Control. 1965;8:338–353. doi: 10.1016/S0019-9958(65)90241-X. [DOI] [Google Scholar]
  48. Zhang B, Peng J. Euler index in uncertain graph. Appl Math Comput. 2012;218(20):10279–10288. [Google Scholar]
  49. Zhang B, Peng J, Li S. Euler index of uncertain random graph: concepts and properties. Int J Comput Math. 2017;94:217–229. doi: 10.1080/00207160.2015.1100720. [DOI] [Google Scholar]
  50. Zhou J, Yang F, Wang K (2014a) An inverse shortest path problem on an uncertain graph. J Netw 9(9):2353–2359
  51. Zhou J, Yang F, Wang K (2014b) Multi-objective optimization in uncertain random environments. Fuzzy Optim Decis Mak 13(4):397–413

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Enquiries about data availability should be directed to the authors.


Articles from Soft Computing are provided here courtesy of Nature Publishing Group

RESOURCES