Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2021 Dec 26;155:111660. doi: 10.1016/j.chaos.2021.111660

A symbiosis between cellular automata and dynamic weighted multigraph with application on virus spread modeling

Somaye Moghari 1,, Maryam Ghorani 1
PMCID: PMC8710307  PMID: 34975234

Abstract

The pattern of coronavirus spread at different geographical scales verifies that travel or shipment by air, sea or road are potential to transmit viruses from one location to somewhere far away in a very short time. Simulation and analysis of such a situation requires the development of models that support long distance transmission of viruses. Cellular Automata (CA) are a family of spatiotemporal computational models frequently employed in analysis of biomedical systems. A CA consists of a topological combination of units called cells as well as a transition function that propagates the configuration of cells locally and step by step. In this paper, we first present some patterns that show the local interaction between CA cells is not sufficient for virus spread modeling, especially at large spatial scales. Then, we generalize the concept of CA by providing a symbiosis between the neighborhood relationship of cells and the transmission channels represented by a dynamic weighted multigraph. Furthermore, we characterize the capabilities of the proposed modeling tool in simulation of the virus spread, and estimating the risk control during the movement restrictions and related health protocols. Finally, we simulate the coronavirus outbreak in the five study areas including three states and two countries. Our experiments using the proposed model verify that the proposed model is capable of formulating different ways of virus transmission, including long-distance transmission, and supports high-precision simulation of the pandemic.

Keywords: COVID-19, Epidemic modeling, Cellular automata, Dynamic neighborhood configuration

1. Introduction

The large amounts of data and information generated due to the spread and longevity of viruses and infectious diseases provide an opportunity for scientists to extract the knowledge needed for making decisions in similar situations [1], [2], [3], [4], [5], [6], [7], [8] as well as control measure issues [9], [10], [11], [12]. Modeling devices, as a result of knowledge representation, offer several advantages by their representational, computational and analytical potentials. Models can be developed, tested and adapted for different applications, especially when data on events that occurred over a wide range of time and space are available.

Cellular Automata (CA) [13] are a family of spatiotemporal mathematical and computational tools employed in a variety of modeling and simulation contexts such as epidemic, growth, flow, propagation, evacuation and transport dynamics [14], [15], [16], [17], [18], [19], [20], [21], [22]. The successful applications of CA shows its high potential for simulating and analyzing how changes spread locally over time [23].

A CA consists of a collection of cells arranged in a multi-dimensional space and a dynamical rule (transition function) which updates their configuration synchronously. The transition function calculates the state of each cell based on the state of its neighbors where, these simple local interactions and computations between cells, form a complex global behavior [24], [25], [26]. The cells usually represent real spaces in various sizes ranging from atomic structure of matter to urban texture components or even larger [27], [28]. In physical space modeling / simulation, the migration of agents or the spread of phenomena among the cells causes change in their state through a number of discrete time steps [19], [21]. Therefore, the dynamics of CA depend on the set of states, the geometry of cells, the definition of neighborhood relationship, and the state calculation method of transition function.

Cells are the basic blocks of CA that are located in a topological arrangement. A cell has a number of configuration parameters that change over time. Each configuration is called a state, and at each time step, it is calculated according to state of the neighboring cells. In a CA, the set of neighbors of each cell is determined based on the topological relationships of the cells and the parameter of the neighboring radius. Figure 1 shows tree sample CA with different topologies. Here, the cell numbered by 0 is considered as the base cell and other cells are numbered with their neighborhood radius from it.

Fig. 1.

Fig. 1

Three sample CA and neighbor cells around a base cell indicted by 0.

Various geometries and distributions are possible for cells, each of them affects the interaction between cells [23], [13]. Interaction between cells occurs passively through the transition function, in such a way that updates state of all cells in parallel. Figure 2 illustrates the general structure of a transition function in classic definition of CA. Here, Si,t corresponds to the state of cell i in time step t, Dj,i represents the distance of cell i from cell j, and r is the neighboring radius. Accordingly, the transition function calculates the next state of cell i based on the current state of itself and its neighbors within a maximum radius of r.

Fig. 2.

Fig. 2

Inputs and output of a transition function for cell i in a classic CA.

In the classic sense, to define cell neighborhood, the focus of attention is on the absolute location of cells and a neighborhood radius that the higher the value, the more cells can interact with each other [23]. This model does not support relational conceptions of space as well as remote transmission of agents or data without intermediaries. On the contrary, by focusing on the interaction between cells through transmission pathways, the adjacency relationship can be represented using graphs or vector layers (in Geospatial Information Science), and even the neighborhood radius can be considered on these structures [29], [30], [31], [32], [33]. However, providing a communication network between cells in the form of a 0/1 graph allows the modeling of direct communication between two non-adjacent cells, it is not capable of modeling the intensity of communication between cells. For example, in applications such as virus spread modeling that population areas are considered as CA cells, the amount of traffic between them cannot be modeled.

Modeling the dynamic of heterogeneous spaces using CA leads to defining the weights of neighborhood to reflect the varying impacts of nearby cells [34], [35], [36]. The weighted adjacency presentation can increase the modeling power of CA, especially in cases neighboring cells do not necessarily interact and influence each other similarly. In Fig. 3 , the transition function of a heterogeneous CA is presented. Here, Wj,i corresponds to the weight of communication between cells j and i. The weighted relationship of cells provides modeling the heterogeneous transport of virus-transmitting agents, but it is not enough for manipulating the changes of health or political protocols.

Fig. 3.

Fig. 3

Inputs and output of a transition function for cell i in a heterogeneous CA.

Another feature that make CA more flexible is the ability to adjust the connectivity strength for cells during simulation [36], [37], [38]. This feature is very useful for modeling and analysis of control measures such as blocking the entry of imported cases by mandatory quarantine upon arrival, an entry ban for nonresidents and requiring a negative test result from arrivals, passenger monitoring and other protective protocols applied dynamically during the outbreak of COVID-19 [39], [40], [41], [42], [43], [44], [45]. Figure 4 , presents the transition function of a heterogeneous CA that is capable of dynamic adjusting the weights. Here, Wj,i,t indicates the weight of communication between cells j and i in time step t. Although this model includes powerful features for modeling the dynamics of systems and propagating changes in them, it is not sufficiently abstract to support the interaction of CA cells through different communication layers. In other words, for cases where several types of neighborhoods need to be defined between two cells, the model should be more capable.

Fig. 4.

Fig. 4

Inputs and output of a transition function for cell i in a heterogeneous CA with dynamic connectivity.

The aim of this research is study of spatiotemporal pattern of new coronavirus spread at different geographical scales to discover pathways that have accelerated the pandemic [46] of COVID-19, and use the resulting knowledge to develop a generalized CA that support required features for virus spread modeling and simulation. In this direction, we focus on the sequence of first case reported in the subdivisions of study regions and extract the patterns corresponding to the virus transmission channels. Then, we develop the proposed CA by employing a multigraph with dynamic weights as the interaction model of CA cells. As well, we show the impact of controls and constraints related to health protocols are supported. In addition, we compare the results of the simulation of the corona virus outbreak in the five study areas with the actual data and show the high capability of the proposed model in predicting the spread.

2. COVID-19 spread pattern

We investigate th pattern of coronavirus spread at different levels of political divisions of the world. At each spatial scale, we extract the desired patterns according to the first case reported timeline of the political divisions of the study area. We also consider the researches that realize the potential virus transmission pathways as well as the impact of monitoring, controls and restrictions for mitigating virus spread, using field studies or laboratory activities [40], [45], [47], [48], [49].

Figure 5 presents a timeline of the first confirmed cases of COVID-19 on the worldwide scale during December 2019 and January 2020. This timeline is extracted from the information presented in Wold Health Organization (WHO) coronavirus disease dashboard [50]. On December 2019, the first case was observed in China, and then on January 13, 2020, the first case outside of China was reported in Thailand. The detection trend of COVID-19 in other countries is also shown on the chart and indicates that in January, the rate of virus spread in the world was very high. Now, considering a long-distance virus transmission pattern in a short time, On January 20, the first case was reported in North America (United States of America), on January 24, the first case was reported in Europe (France), on January 25, the first case was reported in Australia, and on January 29, the first case was reported in Middle East (United Arab Emirates). The countries included in this pattern are colored in red. Also, the short time of formation of this pattern on a global scale indicates the spread of the virus through the airlines.

Fig. 5.

Fig. 5

A timeline of COVID-19 developments during December 2019 and January 2020. The countries marked in red, form a long-distance virus spread pattern. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

We study the pattern of COVID-19 spread in the USA because of its vastness, the large number of infected people, wide extent of virus spread, and the availability of the required information through online dashboards of health departments of different states. Figure 6 illustrates the state by state development of COVID-19 in USA where, the default color of states is white, the yellow mean the first case reported on the same day and the red mean the case reported in the previous days. According to maps Fig. 6a to d, which run from January 21, to February 1, the pattern of development through neighboring states has not occurred; especially on January 24, (Illinois marked in yellow) and February 1, (Massachusetts marked in yellow) when there is a great physical distance from the previously infected states. Infected cases were also reported in Florida and Texas on March 1, and 4, respectively, while no cases were reported in neighboring states. On the other hand, the first reported case in the state of Nevada is on March 5, while the two neighboring states, California and Arizona, have been infected with the corona virus since January 26, and also in the other neighboring state, Oregon, since February 28, the first Infection was reported. As another example, focusing on West Virginia, the last state that report the first case of COVID-19 (Fig. 6); the first neighboring states were infected on March 5, (Fig. 6), the other two neighboring states on March 6, (Fig. 6), and by March 10, (Fig. 6) all neighboring states were infected. In addition, the state of New Mexico reported the first case on March 11, while in neighboring Arizona, the first case was reported about six weeks ago on January 26. The information leads us to the idea that at a country scale, the proximity of provinces has less effect on the spread of the virus than communication lines such as railroads and airlines. Intuitively, this idea seems plausible because train stations and airports are crowded, and in each transfer, a significant number of people on the train or aircraft are in contact with each other.

Fig. 6.

Fig. 6

The evolution map of COVID-19 first case report in USA during January, February and March 2020. The map is simplified and all states and areas are not included. Areas are colored in white (default), yellow (first case report) or red (some cases are reported in previous days). (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

We generated a state level virus spread map based on data extracted from Indiana COVID-19 dashboard [51]. This dashboard is updated daily with information reported to Indiana State Department of Health (ISDH) in the last 24 h. Figure 7 shows the spread of coronavirus in the state of Indiana, until March 12. As shown in Fig. 7a, the first case of COVID-19 reported in Marion County on March 6. It is the most populous county in Indiana and its administrative center is Indianapolis, the state capital and largest city. According to the maps from March 8, to 12, two different distribution patterns can be seen in the state. The first pattern is related to the spread in the counties adjacent to Marion. Since the state capital is located in this county and has a crowded connections with other cities, this pattern of virus spread seems natural. The second pattern is related to non-adjacent counties, where at the time of reporting the first case in each county, no case of COVID-19 was reported in any of the neighboring counties. These patterns show that population flow communications are more important than neighborhood communications in modeling the spread of the virus at the state level and by county division.

Fig. 7.

Fig. 7

Coronavirus spread in the state of Indiana, USA during March 2020.

The patterns of coronavirus spread at different macroscopic scales such as worldwide, countrywide and state level suggest that important factors in addition to the geographical neighborhood of the regions are effective in transmission of coronavirus. Since COVID-19 is a highly transmittable and pathogenic viral infection [52], travel or shipment by air, sea or road are potential to transmit the virus from one location to somewhere far away in a very short time. Therefore, in the next section, we will consider the transmission pathways as well as how to control their impact on the spread of viruses in the development of the proposed model.

3. Symbiosis between cellular automata and dynamic weighted multigraph

In this section, we introduce a Dynamic Weighted Multigraph-based Cellular Automaton (DWMCA) that is adapted for modeling the propagation of changes by different type of transmission channels with dynamic capacity.

Definition 3.1

A weighted graph is an ordered pair G=(V,E), where:

  • V is a set of vertices called nodes.

  • E:V×V[0,1] is called the set of weighted edges.

Definition 3.2

A dynamic weighted multigraph is defined as G=(V,Y,I,D), where:

  • V is the set of nodes.

  • Y is the set of weighted layers, such that for each layer EY the ordered pair G=(V,E) is a weighted graph. Also, by |Y| we indicate the number of layers.

  • I:Y[0,1] is called the impact factor of layers.

  • D is the graph dynamic and in each time tZ, updates Yt and It to Yt+1 and It+1, respectively.

Definition 3.3

Let G=(L,Y,I,D) be a dynamic weighted multigraph. A dynamic weighted multigraph-based CA is a 7-tuple DCA=(L,S,c,c0,G,f,fa), where:

  • -

    L is a lattice of cells.

  • -

    S is a set of states.

  • -

    c:L×ZS indicates the configuration of CA (the states of all cells) in time tZ. In other words, ct returns the state of all cells int time t.

  • -

    c0 is the initial configuration of CA.

  • -

    G represents the dynamic weighted multi-layer neighborhood on L.

  • -

    f:L×2E×2cS for some EY, is the transition function.

  • -

    fa:2SS is the aggregation function.

As well, by transition T:c×f×fac we mean the simultaneously application of the transition function f and aggregation function fa to all cells and calculating ct+1 from ct for some tZ. We note that this paper does not deal with the CA with memory [53], where calculating new configuration relies on not only current configuration but also past configurations.

Figure 8 illustrates the general structure of a transition on cell i in a k-layer DWMCA, where k=|Y|. Here, Dj,i,t(r) and Wj,i,t(r) are the distance and the weight of the connection between cells j and i through layer r at time t, respectively. The transition function generates a tuple of k states S^i,t+1=Si,t+1(1),Si,t+1(2),,Si,t+1(k), where Si,t+1(r) is the next state of cell i calculated through layer r{1,,k}. The aggregation function calculates the next state Si,t+1 from the tuple S^i,t+1.

Fig. 8.

Fig. 8

Inputs and output of a transition for cell Ci in DWMCA.

In fact, DWMCA is the resulted of symbiosis between CA and DWM, so that the DWM represents cell neighborood relation, and the transitin function is adapted accordingly. Therefore, in DWMCA, the cell neighborhood graph is defined by DWM which results to the following properties rather than CA:

  • -

    Each DWMCA represents a set of neighborhood relations (called layers), each of which is modeled by a dynamic weighted graph.

  • -

    The neighborhood relations are dynamic and may change over time.

  • -

    Each layer has an impact factor that may change over time.

  • -

    Compared to CA, the definition of transition function is extended to a pair of transition function and aggregation function to support manipulating multiple neighborhood relations.

Now, we define a DWMCA model for simulating covid-19 outbreak. Of course, our goal is not determining the degree of infection for areas, and we consider the first case reported. In this model, the geographic regions such as countries or provinces are considered as set of cells L. Also, we assume that dynamic weighted multigraph G=(L,Y,I,D) is defined on different pathways (such as airlines, roads, railways and waterways) between these areas. Therefore, to define the desired automaton, we specify the set of states, the transition function and the aggregation function.

  • The set of states:S=[0,1].

  • The transition function: To define the transition function, we consider the neighborhood radius to be 1. Therefore, the next state of each cell is calculated based on the current state of itself and the cells directly related to it (in different layers). Also, we consider a nonnegative growing factor gR for the dynamics of each cell. Hence, the transition function for each cell lL at time tZ is defined as follows:
    EYt:fE(l)=1((ct(l)×g)lLct(l)×E(l,l)) (1)
  • The aggregation function: At first, the set of most effective layers for the cell at time t is defined by:
    Rt(l)=EYtIt(E)×fE(l) (2)
    Yt(l)={EYt|It(E)×fE(l)=Rt(l)} (3)
    Then, the amount of propagation data received through the most effective layers is calculated using the aggregation function as follows:
    fa(l)=EYtfE(l) (4)

This model has three free parameters c0 (the initial configuration), g (the growing factor) and G (the DWM corresponding to neighboring relationship) that can be adjusted according to the application. Also, an important issue in simulation with this automaton is the interpretation of states. For example, a threshold can be used to divide states into two classes infected and non-infected. It should be noted that in the proposed model, we have considered the state of each cell to mean the degree of risk in that cell, and different procedures can be used to analyze ct (the set of states in time step t). The parameter g can also be considered smaller for situations where control measures within cells are greater. To model neighborhood relationships by DWM, the weight of layers can be adjusted to model the degree of risk of that layer. For example, in some areas (such as the USA) most travel is by air, and in other areas (such as Iran) the volume of road and rail travel is significant. Also, the degree of boundary control measures for blocking the entry of imported cases can be manipulated by updating the weight of the DWM edges. In the following, we investigate some of the behavioral features of this automaton.

Proposition 3.4

Supposeg>0and in the time stept, celllis in statect(l)>0. Then, every neighborlof celllwill be in statect+1(l)>0in time stept+1.

Proof

According to Eq. (1), existence of expression lLct(l)×E(l,l) guaranties that if E(l,l)>0 and ct(l)>0, then fE(l)>0. Thus, Eq. (2) gives the result that Rt(l)>0. Therefore, according to Eq. (3), the set of effective layers Yt(l) is not empty and contains layers through which the transition function calculates non-zero states. Hence, Eq. (4) results in fa(l)>0. □

Proposition 3.5

Supposeg>1and in the time stept, celllis in statect(l)>0. Then, there exists a time stept+Δsuch that celllwill eventually be in statect(l)=1for alltt+Δ.

Proof

According to Eq. (1), existence of expression ct(l)×g guaranties that fE(l)ct(l)×gΔ in time step t+Δ. Hence, fE(l) is ascending over time for g>1. So, it can be simply proved that with increasing time, the limit of fE(l) is 1 and as a result, the value of the aggregation function will also be 1. □

Corollary 3.6

Supposeg>1and in the time stept, celllis in statect(l)>0. Then, there exists a time stept+Δsuch that every neighborlof celllwill in statect(l)=1for alltt+Δ.

Proof

This is a direct result of Propositions 3.4 and 3.5. □

Therefore, in this model, two important factors in the contamination of a cell (reaching the risk level of the contamination threshold) are border control measures (weight of edges as well as impact of communication layers) and also internal control measures (parameter g).

4. Experimental results

We consider three states Colorado, Indiana and Utah in the United States, as well as two countries United States and Iran as the study areas and generate evolution maps of spreading coronavirus and compare them with the maps derived from Colorado [54], Indiana [51], Pourghasemi et al. [55], USA [56]. Table 1 presents the specification of our study areas where, the first column is assigned to the area ID, the second column specifies the name of the area, the third column specifies the area type, the fourth column indicates the CA cell granulation level for area modeling, and the fifth column indicates the number of CA cells corresponding to that area.

Table 1.

Specifications of the study areas and the corresponding models.

ID AreaName AreaType CellType Cells
CO Colorado State County 64
IN Indiana State County 92
UT Utah State County 29
USA USA Country State 49
IR Iran Country Province 31

We implemented DWMCA by C++ programming language and defined a threshold parameter τinf=0.6 to generate virus development maps. Then, for each cell l with ct(l)τinf, we concluded that it is infected. This value has been obtained experimentally and is fixed in all our experiments. Also, to define the weighted multigraph in the simulation, the transportation and related layers such as airlines, railroads, Eisenhower Interstate System (EIS) and other National Highway System (NHS) of roads are extracted from On [57], Open [58]. We presented each layer as well as adjacency graph of cells (counties, states or provinces) in the form of a separated adjacency matrix (layer) of the weighted multigraph. During the simulation, we did not change the weight of the edges, but slightly increased the impact factor of the layers except for the airline layer. Table 2 presents the initial impact factor of the layers airlines, railroad, EIS, NHS and adjacency graph of cells for the study areas.

Table 2.

The Default Impact Factor of Layers for the study areas.

ID Airline Railroad EIS NS Adj
CO 0.9 0.5 0.4 0.3 0.1
IN 0.9 0.5 0.4 0.3 0.1
UT 0.9 0.5 0.4 0.3 0.1
USA 0.9 0.6 0.4 0.3 0.1
IR 0.5 0.7 - 0.7 0.2

In addition, we used the population of cells [51], [56] to calculate the weight of the edges between them. Here, Eq. (5) calculates the weight of each existing edge between cells i and j in layer E.

E(i,j)=populationipopulationi+populationj (5)

Also, the growing factor is set to g=1.1, and for each cell i, we used two variables d and a to keep the values needed for calculating the state. The value of d changes by transmission of data from other cells, but a is a boolean variable with value true only for the counties that have airports and flights outside the state. The function c is defined by Eq. (6).

ct(l)={dl+0.01ifa=true,dlotherwise. (6)

To analyze the simulation results of each of the study areas, we have prepared virus development maps in which the maps for each day include a pair of maps in which white means uninfected areas, orange means infected spots based on real data and blue means contaminated points based on simulation information. We have also prepared graphs and statistical tables corresponding to these maps. Table 3 explains the symbols used in this section. For example, True+ indicates the number of cells that are reported infected in both the simulation and the actual data. As well, False+ indicates the number of cells reported infected in the simulation but not in the actual data. The same definitions can be extended to True and False.

Table 3.

The explanation of the symbols used in the tables and graphs in this section.

Symbol Title SimulationReport RealDataReport
True+ truepositive infected infected
False+ falsepositive infected notinfected
True truenegative notinfected notinfected
False falsenegative notinfected infected

To measure the accuracy of the simulation results, we use the %True, which is calculated according to Eq. (7).

%True=(True+)+(True)numberofcells×100 (7)

Each chart also has two axes; The vertical axis corresponds to the number of cells and the horizontal axis corresponds to the sequence of coronavirus spread evolution maps, which are sorted by date in ascending order.

4.1. Colorado

Figure 9 provides a comparison between six simulated maps and the corresponding maps obtained from real data reported in [54], [56]. The default state of the cells is not infected, which is marked in white. The orange cells are areas where COVID-19 infection is reported, and the blue cells are areas infected based on the simulation results. It is notable that the initial map and the probability of infection in the simulation was prepared based on the information on the number of cell flights out of the state. In Fig. 9a, one cell (Denver) is declared correctly infected, but another cell (Adams) is mistakenly declared infected. Thus, the simulation on 64 cells detected one true positive and one false positive infection case. Here, the number of true negative cases is 61 and there is one false negative case (Chaffee). These statistics are shown in Fig. 10 in column a, where the accuracy of this map is 97%. In addition, the county of Adams reported the first case of infection on March 9, [56] which is only three days away from the simulation result.

Fig. 9.

Fig. 9

The simulation results of daily coronavirus spread in the state of Colorado during March and April 2020. White: not infected, Orange: infection reported in [6,50], Blue: infection reported by simulation. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Fig. 10.

Fig. 10

The statistical results of coronavirus spread simulation for the state of Colorado during March and April 2020.

After one week, according to Fig. 9, 19 infected cells are reported by the simulation where the result includes 13 true positive and 6 false positive cases. Also, the number of true negative and false negative cases are 44 and 1, respectively. Also, according to column f in Fig. 10 the accuracy of this map is 89%.

Figure 9 presents the virus spread and its corresponding simulation map at the beginning of the third week. Here, 29 infected cells are reported by the simulation where it includes 22 true positive and 7 false positive cases, and the number of true negative and false negative cases are 32 and 2, respectively. Also, based on column j in Fig. 10 the accuracy of this map is 86%. Furthermore, based on columns p and v in Fig. 10, the accuracy of the simulation results for the next two weeks is 80% and 83%, respectively.

Our experimental results using the proposed model to simulate the spread of coronavirus in the state of Colorado confirm that the accuracy obtained in this state is at least 80%. The simulation maps have also reported all state counties infected since April 2, while about 19% of states have not yet been infected. One of the reasons for this error is the lack of transmission restrictions data in the simulation.

4.2. Indiana

Figure 11 provides a comparison between 30 simulated maps of Indiana and the corresponding maps obtained from real data reported in [51]. Based on the maps, the pattern of coronavirus spread in the state of Indiana is close in both our simulations and the reported data.

Fig. 11.

Fig. 11

The simulation results of daily coronavirus spread in the state of Indiana during March and April 2020. White: not infected, Orange: infection reported in [50], Blue: infection reported by simulation. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

In Fig. 11a, one cell (Marion) is declared correctly infected, but another cell (Adams) is mistakenly declared infected. Thus, the simulation on 92 cells detected one true positive and one false positive infection case. Here, the number of true negative cases is 90 and false negative is 0. As well, the county of Adams reported the first case of infection on March 9, [51] which is only three days away from the simulation result (Fig. 11).

Figure 12 includes the statistical data derived from virus spread evolution maps of Fig. 11. The data verifies that the minimum accuracy of the simulation results is 79%. Of course, the minimum accuracy occurred on April 25, and two days later, on April 27, the accuracy reached 88% and increased in the following days.

Fig. 12.

Fig. 12

The statistical results of coronavirus spread simulation for the state of Indiana during March and April 2020.

4.3. Utah

The first reported case in Utah was on March 7, [56]. As shown in Fig. 13 a, the first infected county is Utah in simulation but Salt Lake in [56]. However, they are neighbors and in later maps the spread patterns are close in the simulation and the actual data. Also, the simulation results in the next map, which is related to March 9, show the Salt Lake as an infected county.

Fig. 13.

Fig. 13

The simulation results of daily coronavirus spread in the state of Utah during March and April 2020. White: not infected, Orange: infection reported in [50], Blue: infection reported by simulation. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

According to Fig. 13c, d, g and i, the simulation based on the proposed model successfully identified two long-distance transmission patterns. The first pattern belongs to Washington where, simulation reported the first case on March 12, and [56] reported this county is infected on March 15. The second pattern is about Grand County where detected by simulation on March 18, and based on [56] this pattern occurred on March 27.

Figure 14 presents comparison statistics between the simulation results and [56] data based on evolution maps illustrated in Fig. 13. Here, the minimum accuracy of the simulation results (according to row %True) is at least 79%, which verifies that the spread patterns in the simulation are close to the real patterns.

Fig. 14.

Fig. 14

The statistical results of coronavirus spread simulation for the state of Utah during March and April 2020.

4.4. United states

To simulate the spread of coronavirus across USA, we defined cells at the state level (with a slight tolerance) so that the model has 49 cells. Also, there are two considerations about data and simulation:

  • Maritime traffic is not included in the simulation, but when defining the neighborhood graph for cells, all the western states that have a blue border are considered neighbors, and all the eastern states that have a blue border are considered neighbors.

  • We extracted the traffic information of the international flights of the states from Bureau [59] and then increased the probability of infection for the states that have the most international flights. The infection status of the origin of flights is ignored here.

According to Fig. 15 , the first case of coronavirus was reported on January 21, 2020 in Washington state, but simulation maps show three other states. In fact, the initial maps, which are based on the risk probability calculated from international flight traffic of states, have different initial values. However, from January 26, (Fig. 15) the COVID-19 outbreak maps (simulation / actual data) become more similar which indicates that the error of the calculated probabilities is negligible. Also, the development maps such as Fig. 15c, d, e, g and j show that DWMCA detects the spread of the virus in areas whose neighbors are not yet infected because it is capable of modeling and simulating the long-distance transmission of viruses.

Fig. 15.

Fig. 15

The simulated maps of coronavirus spread in USA during January and March 2020. White: not infected, Orange: infection based on [50], Blue: infection reported by simulation. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

The statistical data in Fig. 16 , shows that the accuracy of the simulation maps is more than 70%. Of course, the lowest accuracy is belong to the map illustrated in Fig. 15, which is 71%. Also, the number of its false positive cells is greater than the number of its true positive cells, but in the next few days, a significant number of false positive cells have become true positive. Therefore, considering that the incubation period of COVID-19 is more than one week, we can say that the actual accuracy of the map in Fig. 15 is more than 71%.

Fig. 16.

Fig. 16

The statistical results of coronavirus spread simulation for USA during January and March 2020.

4.5. Iran

To generate simulation maps of COVID-19 outbreak in Iran, its provinces are considered as cells. Unlike countries such as USA, where a large volume of travels are done by air, in Iran most travels are done through railroads as well as intercity highways. Therefore, the weight of the airline layer was less and the weight of railroads and highways was considered more, compared to the four previously studied areas.

In our experiments, we consider the highest probability of infection for the Iranian capital, Tehran because the most active international airport in the country is located there. However, the first case in Iran was reported on February 19, 2020 in Qom province, which is a neighbor of Tehran and the largest international airport in the country is located between them. Of course, two days later, on February 21, the first case was reported in Tehran. In Fig. 17 , the virus outbreak maps for both the simulation case and [55] have approximately the similar patterns, relating to railroads and intercity highways. In fact, the country’s busiest routes are from Tehran to the recreational provinces in the north, the religious province in the northeast of the country and the Tehran highway to the south (Bushehr province, where the country’s largest port is located).

Fig. 17.

Fig. 17

The simulated maps of daily coronavirus spread in Iran from February 19, 2020 to March 03, 2020. White: not infected, Orange: infection reported in [38], Blue: infection reported by simulation. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

According to Pourghasemi et al. [55], the outbreak of COVID-19 occurred in all the provinces over a period of two weeks, which makes it difficult to measure the accuracy of the simulation. Fig. 18 contains diagrams and tables of statistical data related to simulation maps. The data show that the simulation accuracy on February 24, and 25 is 77% and 71%, respectively, but in other maps the accuracy is more than 80%.

Fig. 18.

Fig. 18

The statistical results of coronavirus spread simulation for Iran during February and March 2020.

5. Conclusion

We studied the spread of COVID-19 at different spatial scales and extracted some patterns from the first cases reported in different places. Based on the extracted knowledge, we considered patterns of local as well as remote transmission of infectious disease / viruses to develop the proposed model. In this direction, we provided a symbiosis between the concept of neighborhood in cellular automata and dynamic weighted multigraph to develop a model called DWMCA. The model is capable of simulating the outbreak of infectious diseases under various transmission restrictions. Also it provides multi-layer modeling of communication channels and configuring them dynamically. Finally, we implemented a simple version of DWMCA by C++ to simulate the spread of COVID-19 in five study areas including three states Colorado, Indiana and Utah in USA as well as two countries USA and Iran. Although we did not take into account some data such as travel bans and the effects of health protocols in this simulation, the virus development maps obtained are highly consistent with the maps derived from actual data and reached accuracy above 77% at the state-level and above 70% at the country-level. Furthermore, comparing the simulation results with real data verifies that DWMCA is capable of successfully simulating virus transmission over long distances.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  • 1.An L., Bacon E., Hawley S., Yang P., Russell D., Huffman S., et al. Relationship between coronavirus-related ehealth literacy and COVID-19 knowledge, attitudes, and practices among US adults: web-based survey study. J Med Internet Res. 2021;23(3):e25042. doi: 10.2196/25042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Chakraborty T., Ghosh I. Real-time forecasts and risk assessment of novel coronavirus (COVID-19) cases: a data-driven analysis. Chaos Solitons Fractals. 2020:109850. doi: 10.1016/j.chaos.2020.109850. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Fang Y., Nie Y., Penny M. Transmission dynamics of the COVID-19 outbreak and effectiveness of government interventions: a data-driven analysis. J Med Virol. 2020;92(6):645–659. doi: 10.1002/jmv.25750. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Singh H., Kumar Y. Cellular automata based model for e-healthcare data analysis. Int J Inf SystModel Des (IJISMD) 2019;10(3):1–18. [Google Scholar]
  • 5.White S.H., Del Rey A.M., Sánchez G.R. modeling epidemics using cellular automata. Appl Math Comput. 2007;186(1):193–202. doi: 10.1016/j.amc.2006.06.126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Zhang Q., Xu L. 2012 Fifth international conference on information and computing science. IEEE; 2012. Simulation of the spread of epidemics with individuals contact using cellular automata modeling; pp. 60–62. [Google Scholar]
  • 7.Zhang S., Diao M., Yu W., Pei L., Lin Z., Chen D. Estimation of the reproductive number of novel coronavirus (COVID-19) and the probable outbreak size on the diamond princess cruise ship: a data-driven analysis. Int J Infect Dis. 2020;93:201–204. doi: 10.1016/j.ijid.2020.02.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zlojutro A., Rey D., Gardner L. A decision-support framework to optimize border control for global outbreak mitigation. Sci Rep. 2019;9(1):1–14. doi: 10.1038/s41598-019-38665-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Choi W., Shim E. Optimal strategies for social distancing and testing to control COVID-19. J Theor Biol. 2021;512:110568. doi: 10.1016/j.jtbi.2020.110568. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Li Y., Li M., Rice M., Zhang H., Sha D., Li M., et al. The impact of policy measures on human mobility, COVID-19 cases, and mortality in the US: a spatiotemporal perspective. Int J Environ Res Public Health. 2021;18(3):996. doi: 10.3390/ijerph18030996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.MacIntyre C.R., Nguyen P.-Y., Chughtai A.A., Trent M., Gerber B., Steinhofel K., et al. mask use, risk-mitigation behaviours and pandemic fatigue during the COVID-19 pandemic in five cities in Australia, the UK and USA: a cross-sectional survey. Int J Infect Dis. 2021;106:199–207. doi: 10.1016/j.ijid.2021.03.056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Nouvellet P., Bhatia S., Cori A., Ainslie K.E.C., Baguelin M., Bhatt S., et al. Reduction in mobility and COVID-19 transmission. Nat Commun. 2021;12(1):1–9. doi: 10.1038/s41467-021-21358-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Sutner K. Classification of cellular automata. Encycl Complexity Syst Sci. 2009;2009:755–768. [Google Scholar]
  • 14.Fu S., Milne G. Proceedings of the Australian conference on artificial life. 2003. Epidemic modelling using cellular automata. [Google Scholar]
  • 15.Karim M.F.A., Ismail A.I.M., Ching H.B. Cellular automata modelling of hantarvirus infection. Chaos Solitons Fractals. 2009;41(5):2847–2853. [Google Scholar]
  • 16.Li Y., Chen M., Dou Z., Zheng X., Cheng Y., Mebarki A. A review of cellular automata models for crowd evacuation. Physica A. 2019;526:120752. [Google Scholar]
  • 17.Mallet D.G., De Pillis L.G. A cellular automata model of tumor_immune system interactions. J Theor Biol. 2006;239(3):334–350. doi: 10.1016/j.jtbi.2005.08.002. [DOI] [PubMed] [Google Scholar]
  • 18.Mier J.A., Sánchez R., Newman D.E. Tracer particle transport dynamics in the diffusive sandpile cellular automaton. Chaos Solitons Fractals. 2020;140:110117. [Google Scholar]
  • 19.Mutthulakshmi K., Wee M.R.E., Wong Y.C.K., Lai J.W., Koh J.M., Acharya U.R., et al. Simulating forest fire spread and fire-fighting using cellular automata. Chin J Phys. 2020 [Google Scholar]
  • 20.Pourhasanzade F., Sabzpoushan S.H. A cellular automata model of chemotherapy effects on tumour growth: targeting cancer and immune cells. Math Comput Model Dyn Syst. 2019;25(1):63–89. [Google Scholar]
  • 21.Shi M., Lee E.W.M., Ma Y. A dynamic impatience-determined cellular automata model for evacuation dynamics. Simul Modell Pract Theory. 2019;94:367–378. [Google Scholar]
  • 22.Sirakoulis G.C., Karafyllidis I., Thanailakis A. A cellular automaton model for the effects of population movement and vaccination on epidemic propagation. Ecol Modell. 2000;133(3):209–223. [Google Scholar]
  • 23.Bhattacharjee K., Naskar N., Roy S., Das S. A survey of cellular automata: types, dynamics, non-uniformity and applications. Nat Comput. 2020;2(19):433–461. [Google Scholar]
  • 24.Dennunzio A., Formenti E., Grinberg D., Margara L. Dynamical behavior of additive cellular automata over finite abelian groups. Theor Comput Sci. 2020;843:45–56. [Google Scholar]
  • 25.Guiso S., Caprio D.D., de Lamare J., Gwinner B. Influence of the grid cell geometry on 3D cellular automata behavior in intergranular corrosion. J Comput Sci. 2021;53:101322. [Google Scholar]
  • 26.Li Y., Chen M., Zheng X., Dou Z., Cheng Y. Relationship between behavior aggressiveness and pedestrian dynamics using behavior-based cellular automata model. Appl Math Comput. 2020;371:124941. [Google Scholar]
  • 27.Chidyagwai P., Reiter C.A. A local cellular model for growth on quasicrystals. Chaos Solitons Fractals. 2005;3(24):803–812. [Google Scholar]
  • 28.Yang J., Gong J., Tang W., Liu C. Patch-based cellular automata model of urban growth simulation: integrating feedback between quantitative composition and spatial configuration. Comput Environ Urban Syst. 2020;79:101402. [Google Scholar]
  • 29.Barreira-González P., Gómez-Delgado M., Aguilera-Benavente F. From raster to vector cellular automata models: anew approach to simulate urban growth with the help of graph theory. Comput Environ Urban Syst. 2015;54:119–131. [Google Scholar]
  • 30.Dridi S., Yacoubi S.E., Bagnoli F., Fontaine A. A graph theory approach for regional controllability of boolean cellular automata. Int J Parallel Emergent Distrib Syst. 2020;5(35):499–513. [Google Scholar]
  • 31.O’Sullivan D. Graph-cellular automata: a generalised discrete urban and regional model. Environ Plann B. 2001;28(5):687–705. [Google Scholar]
  • 32.Ruhomally Y.B., Dauhoo M.Z., Dumas L. A graph cellular automaton with relation-based neighbourhood describing the impact of peer influence on the consumption of marijuana among college-aged youths. J Dyn Games. 2021;8(3):277. [Google Scholar]
  • 33.Sayama H., Pestov I., Schmidt J., Bush B.J., Wong C., Yamanoi J., et al. Modeling complex systems with adaptive networks. Comput Math Appl. 2013;65(10):1645–1664. [Google Scholar]
  • 34.Silva D., Rafael M., Mendes C.F.O., Manchein C. Scrutinizing the heterogeneous spreading of COVID-19 outbreak in large territorial countries. Phys Biol. 2021;18(2):025002. doi: 10.1088/1478-3975/abd0dc. [DOI] [PubMed] [Google Scholar]
  • 35.Feng Y., Tong X. Incorporation of spatial heterogeneity-weighted neighborhood into cellular automata for dynamic urban growth simulation. GISci Remote Sens. 2019;56(7):1024–1045. [Google Scholar]
  • 36.Silva E., Pereira F.H., Schimit P.H.T. Information spreading in a population modeled by continuous asynchronous probabilistic cellular automata. ComputCommun. 2020 [Google Scholar]
  • 37.Wang L., Fang G., Qian L. Modeling of dynamic recrystallization of magnesium alloy using cellular automata considering initial topology of grains. Mater Sci Eng A. 2018;711:268–283. [Google Scholar]
  • 38.Yao Y., Liu X., Li X., Liu P., Hong Y., Zhang Y., Mai K. Simulating urban land-use changes at a large scale by integrating dynamic land parcel subdivision and vector-based cellular automata. Int J Geograph Inf Sci. 2017;31(12):2452–2479. [Google Scholar]
  • 39.Chen S., Yang J., Yang W., Wang C., Bärnighausen T. COVID-19 control in China during mass population movements at new year. Lancet. 2020;10226(395):764–766. doi: 10.1016/S0140-6736(20)30421-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Chinazzi M., Davis J.T., Ajelli M., Gioannini C., Litvinova M., Merler S., Piontti A.P.y. The effect of travel restrictions on the spread of the 2019 novel coronavirus (COVID-19) outbreak. Science. 2020;6489(368):395–400. doi: 10.1126/science.aba9757. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Jithesh P.K. A model based on cellular automata for investigating the impact of lockdown, migration and vaccination on COVID-19 dynamics. Comput Methods Programs Biomed. 2021:106402. doi: 10.1016/j.cmpb.2021.106402. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Kucharski A.J., Russell T.W., Diamond C., Liu Y., Edmunds J., Funk S., et al. Early dynamics of transmission and control of COVID-19: a mathematical modelling study. Lancet Infect Dis. 2020 doi: 10.1016/S1473-3099(20)30144-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Normile D. Why airport screening won’t stop the spread of coronavirus. Science. 2020 [Google Scholar]
  • 44.Schimit P.H. A model based on cellular automata to estimate the social isolation impact on COVID-19 spreading in Brazil. Comput Methods Programs Biomed. 2021;200:105832. doi: 10.1016/j.cmpb.2020.105832. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Tian H., Liu Y., Li Y., Wu C.-H., Chen B., Kraemer M.U.G., Li B. An investigation of transmission control measures during the first 50 days of the COVID-19 epidemic in China. Science. 2020;368(6491):638–642. doi: 10.1126/science.abb6105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.World Health Organization. Coronavirus disease 2019 (COVID-19): situation report, 51. 2020.
  • 47.Candido D.S., Claro I.M., de Jesus J.G., Souza W.M., Moreira F.R.R., Dellicour S., Mellan T.A. Evolution and epidemic spread of SARS-CoV-2 in Brazil. Science. 2020 doi: 10.1126/science.abd2161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Fauver J.R., Petrone M.E., Hodcroft E.B., Shioda K., Ehrlich H.Y., Watts A.G., Vogels C.B.F. Coast-to-coast spread of SARS-CoV-2 during the early epidemic in the United States. Cell. 2020 doi: 10.1016/j.cell.2020.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Gonzalez-Reiche A.S., Hernandez M.M., Sullivan M.J., Ciferri B., Alshammary H., Obla A., Fabre S. Introductions and early spread of SARS-CoV-2 in the New York City area. Science. 2020 doi: 10.1126/science.abc1917. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.World Health Organization Coronavirus Disease (COVID-19) Dashboard. 2020. https://covid19.who.int.
  • 51.Indiana COVID-19 dashboard. 2020. https://www.coronavirus.in.gov/2393.htm.
  • 52.Shereen M.A., Khan S., Kazmi A., Bashir N., Siddique R. COVID-19 infection: Origin, transmission, and characteristics of human coronaviruses. J Adv Res. 2020 doi: 10.1016/j.jare.2020.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Goncu E., Yalcin M.E. Cellular automata with random memory and its implementations. Int J Bifurcation Chaos. 2017;27(05):1730017. [Google Scholar]
  • 54.Colorado Department of Public Health & Environment. https://covid19.colorado.gov.
  • 55.Pourghasemi H.R., Pouyan S., Heidari B., Farajzadeh Z., Shamsi S.R.F., Babaei S., et al. Spatial modeling, risk mapping, change detection, and outbreak trend analysis of coronavirus (COVID-19) in Iran (days between february 19, and june 14, 2020) Int J Infect Dis. 2020;98:90–108. doi: 10.1016/j.ijid.2020.06.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.USA Facts. 2020. https://usafacts.org/visualizations/coronavirus-covid-19-spread-map.
  • 57.On Theworldmap. 2020. http://ontheworldmap.com.
  • 58.Open Street maps. 2020. https://www.openstreetmap.org.
  • 59.Bureau of transportation statistics. 2020. https://cms.bts.gov/topics/national-transportation-statistics.

Articles from Chaos, Solitons, and Fractals are provided here courtesy of Elsevier

RESOURCES