Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2021 Oct 29;27:100275. doi: 10.1016/j.bdr.2021.100275

Epidemic Spreading in Trajectory Networks

Tilemachos Pechlivanoglou 1,, Jing Li 1, Jialin Sun 1, Farzaneh Heidari 1, Manos Papagelis 1
PMCID: PMC8553416

Abstract

Epidemics of infectious diseases, such as the one caused by the rapid spread of the coronavirus disease 2019 (COVID-19), have tested the world's more advanced health systems and have caused an enormous societal and economic damage. The mechanism of contagion is well understood. As people move around, over time, they regularly engage in social interactions. The spatiotemporal network representing these interactions constitutes the backbone on which an epidemic spreads, causing outbreaks. At the same time, advanced technological responses have claimed some success in controlling the epidemic based on digital contact tracing technologies. Motivated by these observations, we design, develop and evaluate a stochastic agent-basedSEIRmodel of epidemic spreading in spatiotemporal networks informed by mobility data of individuals (trajectories). The model focuses on individual variation in mobility patterns that affects the degree of exposure to the disease. Understanding the role that individual nodes play in the process of disease spreading through network effects is fundamental as it allows to (i) assess the risk of infection of individuals, (ii) assess the size of a disease outbreak due to specific individuals, and (iii) assess targeted intervention strategies that aim to control the epidemic spreading. We perform a comprehensive analysis of the model employing COVID-19 as a use case. The results indicate that simple individual-based intervention strategies that exhibit significant network effects can effectively control the spread of an epidemic. We have also demonstrated that targeted interventions can outperform generic intervention strategies. Overall, our work provides an evidence-based data-driven model to support decision making and inform public policy regarding intervention strategies for containing or mitigating the epidemic spread.

Keywords: Epidemic modeling, Trajectory network, Individual variability, Risk of infection, SEIR model, COVID-19

1. Introduction

From the Plague of Athens (430 to 426 BC) [1], [2] to the Spanish Flu (1918) [3], [4], pandemics have had a significant impact on human society [5]. In the last 20 years alone, the world has seen many infectious disease outbreaks. Notorious examples include the pandemics caused by the Severe Acute Respiratory Syndrome coronavirus (SARS-CoV) [6], the influenza A virus subtype H1N1 (swine flu) [7], the Middle East Respiratory Syndrome coronavirus (MERS-CoV) [8], the Ebola virus (EVD) [9], the Zika virus (ZIKV) [10], and most recently the Severe Acute Respiratory Syndrome coronavirus 2 (SARS-CoV-2) [11]. These pandemics have tested the world's more advanced health systems and have caused an enormous societal and economic damage. Conventional methods to address the rapid spread of an infectious disease include physical distancing, confinement measures and human-based contact tracing of infected individuals. These describe some of the common policies imposed by governing authorities and jurisdictions that aim to contain or slow down the spread of the virus to levels that can be managed by healthcare units and socio-political institutions. While these easily understood policies can be effective in controlling the spread of the disease and saving lives [12], [13], they have well-known drawbacks: (i) they are imposing extreme restrictions or limitations on individuals' activities or freedom, leading to a slowdown of social and economic activities of the community and to socioeconomic side-effects for the individuals themselves; (ii) they depend on human-based contact tracing of infected individuals that are cumbersome, expensive, slow and inaccurate; and (iii) they do not provide the means of a controlled transition to an immune community through well-defined intervention strategies that can easily translate to health policy and potentially ameliorate the socioeconomic impact.

More recently, advanced technological responses to the problem based on digital contact tracing have claimed some success in controlling the epidemic [14]. Digital contact tracing or proximity tracing, enabled by GPS-enabled devices, mobile apps [15] and beyond [16], represents the ability to track and reconstruct the close contacts that an individual had with other people within a time period. The way that proximity tracing can have an impact in containing or slowing down the disease spread is straightforward. Individuals that are known to be infected, can inform (via cloud services) recent close contacts that have been digitally traced, who can then take precautions and avoid further contacts with other people by isolating themselves or seeking expert advice. The process can involve third parties, such as governing authorities and/or health experts responsible for the containment of the disease.

The focus of the current research is on utilization of GPS-enabled digital contact traces of individuals (i.e., mobility data or trajectories) to inform a more comprehensive analysis and modeling of disease spreading through methods of graph mining [17] and trajectory data mining [18]. In particular, we present a data-driven model for the spread of the disease in a community that take into account the mobility patterns of individuals. As people move in cities, they engage in different types of interaction with other people, resulting in different mobility patterns. As such, the relative risk of them being infected or infecting others can be substantially different. We systematically study the effect of the individual variability of mobility behavior to the risk of infection of an individual. This observation can have significant consequences to a model's accuracy of how the disease propagates in a community, as well as to the intervention strategies that can be designed to control the epidemic.

Contributions. Motivated by the feasibility of digital contact tracing technologies [19] and the inherent limitations of traditional epidemiological models (see Section 2), this paper presents data-driven models of infectious disease spreading that incorporate individual variability due to individuals' mobility patterns. Our study aims to clarify how differences in mobility patterns can inform infectious epidemic dynamics and determine the impact of various intervention strategies. In summary, the major contributions of this work are as follows:

  • we present novel data-driven models for assessment of the risk of infection of an individual based on mobility patterns and the amount of time they spend in proximity with others (“individual risk assessment”);

  • we present a stochastic agent-based Susceptible-Exposed-Infected-Removed (SEIR) network model for infectious disease spreading in trajectory networks (“community risk assessment”);

  • we design and evaluate novel individual-based intervention strategies for containing (or mitigating to an acceptance rate) the spread of an infectious disease in trajectory networks (“containment intervention strategies”);

  • we design and evaluate novel individual-based immunization strategies for providing a controlled and safe transition to an immune community (“targeted immunization”);

  • we present a large-scale case study using model parameter values that resemble the recent COVID-19 outbreak and realistic synthetic mobility data in a real urban environment (large University campus and surroundings) that allows for many human-human interactions; the model and algorithms presented generalize to other similar infectious diseases;

  • we provide source code and data to encourage reproducibility of results.

The remainder of this paper is organized as follows: Section 2 provides background information, introduces notation and provides definitions of the technical problems of interest in this paper. Section 3 presents our epidemic model, algorithmic details of epidemic spreading in trajectory networks and descriptions of disease containment intervention strategies. Section 4 presents an experimental evaluation of the different models and methods for varying settings. We review the related work in Section 5 and conclude in Section 6.

2. Background and problem definitions

In this section, we introduce notation and preliminaries of the problem of interest, as well as formal problem definitions. The background mostly relates to the definition of a contact network or a trajectory network as defined in [20]. We also provide background information related to the basic reproductive number R0 and its limitations, as well as information about the SEIR epidemic model and its variations, as we employ it in our study.

2.1. Observation area: Earth surface versus Euclidean space

We assume monitoring of mobility data of individuals within a finite observation area A. For the needs of our study, this area typically represents the administrative boundaries of a city or a city neighborhood where daily human contacts occur. Since A is a relatively small region, the Earth surface it represents has a low curvature and is close to flat. We can therefore, for simplicity and without loss of generality, assume that individuals move in a finite 2-dimensional Euclidean space R2 and not on the surface of the Earth. This assumption allows to approximate geodesic distances on Earth with Euclidean distances in R2, a common practice in many real-world algorithms and services.

2.2. Contacts and events

Consider a set of objects N={u1,u2,,uN} moving in an observation area A, defined as a finite 2-dimensional Euclidean space R2 for a finite observation time interval [0,T], forming a set of trajectories P. We formally define a trajectory as follows.

Definition 2.1 Trajectory —

A trajectory Pu of a moving object uN is a sequence Pu={(x1,y1,t1),(x2,y2,t2),,(xT,yT,tT)}, where ti[0,T] and (x,y)AR2 represent latitude and longitude coordinates in the 2D Cartesian system. We assume that an object might appear and disappear multiple times during the observation time interval [0,T].

As individuals move in A, they can encounter each other, forming contacts. Following Pechlivanoglou and Papagelis [20], we define a contact as follows.

Definition 2.2 Contact —

A contact cu,v between two moving individuals u,vN occurs when their physical proximity (spatial distance) du,v is smaller than or equal to a threshold τ (i.e. du,vτ).

Several approaches exist to estimate the spatial distance of two points in Euclidean plane. We employ its simplest form, the Euclidean distance, given by:

du,v=(xuxv)2+(yuyv)2

where (xu,yu) and (xv,yv) are the spatial coordinates of individuals u and v at a time t, where 0tT respectively. The two individuals u and v are considered to be in contact for as long as their spatial distance remains consistently smaller than a proximity threshold τ. We extend the concept of a contact to include its temporal dimension and formally define an event as follows.

Definition 2.3 Event —

An event eu,v between two moving objects u,vN, represents a contact cu,v that lasted for a time interval [ts,te], where ts represents the time point of the beginning of the contact and te represents the time point the contact ended. An event is represented by the quadruple eu,v=(u,v,ts,te). We also define the duration of the event as δ(eu,v)=tets.

Note that, in our setting, we do not preclude the case that two individuals are in contact multiple times. In this case, the contact information between two moving individuals u and v is represented by a sequence of events Eu,v={eu,v1,eu,v2,,eu,vn} or Eu,v={(u,v,ts1,te1),(u,v,ts2,te2),,(u,v,tsn,ten)}. We also define the duration of all events as Δ(Eu,v)=inδeu,vi=in(teitsi). Furthermore, in this paper we employ a universal proximity threshold τ, so the contacts will always be reciprocal, meaning that (u,v,ts,te) is equivalent to (v,u,ts,te). Indeed, this is sufficient for the case of human-to-human interactions we examine in this work.

2.3. Trajectory networks and node importance

A network that is constructed by connecting pairs of individuals that are close to each other based on physical proximity is called a proximity network. However, a proximity network is static and does not capture well the idea of individuals moving in space. When individuals are moving, the temporal dimension of interactions must be considered, and the resulting network can be thought of as a temporal network, also referred to as a time-varying network. Most characterizations of temporal networks discretize time by grouping together temporal information into a sequence of T network “snapshots” Gt(Vt,Et),t{1,2,,T}. Each snapshot contains the vertices Vt and edges Et, representing the individuals and their contacts, respectively, within a basic time unit t (e.g., second, minute, hour, etc.). The resulting data structure can be thought of as either a single aggregation graph with varying vertices and edges, or a sequence of proximity graphs. In either case, we refer to Gt(Vt,Et) as a trajectory network G(V,E) for the rest of the manuscript.

There are many possible metrics to determine the importance (or influence) of an individual (or node) in a temporal network. Note that the term node centrality refers to node importance that is common in static network analysis, and isn't applicable for trajectory networks. This is because measures of node centrality in the traditional setting of a static network are commonly based on shortest paths (e.g., betweenness centrality [21], [22]), but shortest paths in temporal networks take a different character [23]. For example, in [24], the authors define minimum temporal paths to capture the different characterizations of time-constraint shortest paths including cases of earliest-arrival paths, latest-departure paths, or fastest paths. It is possible to evaluate a notion of temporal betweenness [25], but in our setting, we focus on notions of importance that are critical in the context of epidemic modeling in the trajectory network. Similarly to Pechlivanoglou and Papagelis [20], we define metrics that relate to the temporal node degree and the duration of events, and use these metrics to construct node profiles that describe the behavior of each individual.

Definition 2.4 Trajectory network node degree —

We define the following metrics related to node degree in the trajectory network:

  • Cu: a set of all contacts of u during the observation time interval [0,T].

  • Ddegu(k): a distribution that represents the fraction of the time steps ti[0,T] that u has node degree k.

2.4. Problem definitions

In this paper, we are interested in the assessment and mitigation of the risk of infectious disease spreading in trajectory networks based on mobility data. In particular, we aim to address the following problems:

Problem 1

Given an observation area A, an observation time interval [0,T], a set of individuals N and their trajectories P, determine the risk of infection risku of each individual in the trajectory network G(V,E).

Problem 2

Given a trajectory network G(V,E), a seed set of initially infected individuals I0N and the risk risku of any individual uI0, determine the size of the epidemic spread ITN at time t=T, where I0IT.

Problem 3

Given a trajectory network G(V,E), and the parameters of an emerging infectious disease, determine the impact of various epidemic containment intervention strategies that can easily translate to health policy. The focus is on comparative analysis of the impact of targeted individual-based interventions against a null model (informed by less sophisticated horizontal measures).

2.5. Limitations of the basic reproductive number R0

The basic reproductive number R0 (sometimes called basic reproduction ratio), is the most widely used parameter in epidemiology. It can be thought of as the expected number of new infections caused by a single infected individual. Commonly used epidemiological models suggest that R0=1 is a critical value. When R0<1, each infected person produces less than one new case in expectation, therefore the size of the outbreak is constantly trending downwards, until eventually the disease dies off. On the other hand, when R0>1, each infected person produces more than one new cases in expectation, therefore the size of the outbreak is constantly trending upwards. In principle, the larger the value of R0, the more challenging it is to control the epidemic.

Despite its usefulness as an approximate indication of the spreading power of the disease, many studies have stressed the limitations of R0. An underlying assumption of R0 is that the disease is spreading in a perfect mixing network (i.e., a complete graph) or a regular tree network – a special type of a network topology that has no cycles and each internal node has a constant number of children, defined by a branching factor d. However real-world communities do not resemble a complete graph or regular trees, since some people have more contacts than others and it is common for people to have common friends (forming triangles or cycles). It is also easy to see how the basic computation of R0 breaks down when we consider transmission of infection to be a stochastic process involving discrete individuals [26].

For the purposes of this work, when we refer to R0 for individuals, we define it as “the expected number of secondary cases produced, in a completely susceptible population, produced by a typical infected individual” [27].

2.6. The SEIR epidemic model

Compartmental models of epidemic modeling divide the population into separate divisions (compartments) and people transition between them based on their health status during an epidemic. For instance, in the classic SIR model [28], [29], people progress between three compartments: susceptible (S), infectious (I) and removed/recovered (R). For many infectious diseases, there is a significant latent period (incubation) during which susceptible individuals have been infected, but are not yet infectious themselves. During this period an individual is considered to be in a compartment labeled as exposed (E), and the model is known as SEIR. The current research employs the SEIR model for modeling the spread of a virus in the community. Depending on assumptions of population structure and transmission progression, there are two main classes of the SEIR model studied in the literature.

Homogeneous population. The first class, assumes a large, homogeneously mixing population where individuals move between compartments at certain transition rates described by ordinary differential equations [30], [31]:

dSdt=βISNdEdt=βISNσEdIdt=σEγIdRdt=γI

where β is the transmission rate, σ is the incubation rate, and γ is the recovery rate, respectively. This is a deterministic model, so for a fixed set of parameter values and SEIR model initialization (t=0), it produces the same outcome at each simulation. This model can inform about the state of the epidemic spread in the community and provide insights about future trends as well as inform health policy at large [32]. However, there are certain limitations of this model. Its results and usefulness are limited by the inherent assumption that all individuals share the same characteristics.

Heterogeneous population. The second class, assumes heterogeneity of population and is based on an agent-based SEIR model, where each agent is representing an individual [33], [34]. This approach allows to model individual characteristics and behavior towards the epidemic. In our research we focus on heterogeneity that is attributed to different mobility and contact patterns of individuals, over time. Different mobility patterns, lead to complex spatio-temporal social interactions between people in the community [20], [35]. These models are more challenging to analyze and interpret as they depend on a stochastic (probabilistic) process of epidemic spreading that increases the complexity [36]. However, they are more realistic and can help to better understand the emergence of a disease due to different individual behaviors. In addition, since they operate (simulate) on individual-level behavior, they provide an opportunity to design targeted intervention strategies that can more easily translate to health policy. In section 3 we present the details of the agent-based SEIR model.

3. Methodology

Recent studies on epidemic modeling highlight the importance of individual variability in modeling the spread of an infectious disease in a community and predicting its relevant outcome. For example, Gomes et al. [37] studied the effect of the biological variation in susceptibility of individuals and their physical exposure to infection. And, Britton et al. [38] studied how population heterogeneity affects herd immunity. In our research, we study the effect of individual variability in epidemic modeling that is due to mobility patterns. We first present a method for computing the risk of infection of any individual in the community, as a result of their spatiotemporal interactions. Then, we present a stochastic agent-based epidemic model (and optimizations) that can better capture the dynamic disease spreading in a community.

3.1. Modeling individual risk of infection

We integrate individual variation by modeling the risk of infection of an individual in relation to its mobility patterns and contacts over a time period. Intuitively, we would like to model that the more contacts an individual has and the more time they spent with each other, the higher the risk of infection. Formally, given a trajectory network G(V,E), an individual uN and its contacts Cu during [0,T], we model the risk of infection risku of an individual by the following three methods, each offering a different level of analysis.

risku(1)=i=1|Cu|1=|Cu| (1)
risku(2)=i=1|Cu|Δ(Eu,i) (2)
risku(3)=i=1|Cu|(1(1β)Δ(Eu,i)) (3)

Out of the three definitions, risku(1) is the simplest one as it is based on the node degree in the aggregation network (i.e., the network defined by aggregating the edges of a temporal network over [0,T]); risku(2) takes into consideration both the number of contacts of u and the total duration of these contacts (due to potentially multiple events); risku(3) models the risk of infection as a probability of getting infected by any of its contacts factoring the total duration of these contacts (due to potentially multiple events), where β is the transmission probability of the disease. In particular, we use a geometric function to represent the risk attributed to each distinct contact. The outcome is a regularized metric for risk (capped at 1), so that specific contacts with a very long duration do not dominate the overall risk of an individual.

Relative risk of infection: While the actual value of an individual's risk of infection does not hold any natural interpretation, it is important for our analysis to represent the relative riskrrisku of u to other individuals in the network. We therefore normalize each risk metric by the aggregated risk of all N individuals in the network to get the relative risk of uN, as follows:

rrisku(i)=risku(i)u=1Nrisku(i) (4)

Note that it is rrisku(1)[0,1], rrisku(2)[0,1], rrisku(3)[0,1] and that u=1Nrrisku(i)=1. We utilize the relative risk in our experimental analysis.

3.2. Epidemic spreading in trajectory networks

We present a stochastic agent-based SEIR network model for epidemic spreading in a trajectory network, where nodes represent individuals and edges represent contacts of nodes. According to the epidemic model, each node can be at one of the following infection states, at any discrete time t:

  • Susceptible (S). This is the initial state of all nodes; a node can get exposed to the infection by any of its infected neighbors with probability β, per time step.

  • Exposed (E). A node is in this state if it has been infected by one of its neighbors, but it is not yet infectious itself. A node stays in this state for as long as the incubation period of the disease lasts, which for simplicity we model as a constant that lasts If time steps. After that period, the node becomes infectious and switches to state I with certainty. Depending, on the disease we aim to model, the certainty can be relaxed by incorporating a parameter to control the probability of a node switching to I (or to S).

  • Infected (I). A node is in this state if it is infectious, therefore can transmit the disease to any of its neighbors with probability β.

  • Removed (R). A node is in this state if it has been removed, meaning either has passed away or has recovered. Nodes that are in I will be removed after Ir time steps with a recovery probability γ. The recovered nodes are neither infectious anymore nor susceptible to the infection.

Given a trajectory network G(V,E) and parameter values β, γ, If and Ir the model allows us to monitor the state of every individual over time. Given a population of N individuals, the cumulative number of individuals within each of the disease states at time t is given by S(t), E(t), I(t), and R(t), respectively. We also define two special sets of infected nodes: (i) the initial seed set of infected nodes I0=I(0), and (ii) the set of the infected nodes at the end of the process IT=I(T), which represents the size of the epidemic spread.

3.2.1. Algorithmic details of the stochastic model

We describe here algorithmic details of the stochastic model. Recall that individuals move between infection states S, E, I and R based on a stochastic process. At each discrete time step t, each S(usceptible) node has a chance to switch to E(xposed), E(xposed) nodes might switch to I(nfected), and I(nfected) nodes might be R(emoved). Formally, let uS and let Nu be the set of neighbors of u at time t. Each neighbor vNu such that vI, flips a biased coin with a bias equal to the transmission probability β to determine whether it will infect u. If u is infected, then it switches its infection state to E(xposed), otherwise its infection state remains S(usceptible). Similarly, an E(xposed) node will switch to I(infected) after If steps and an I(nfected) node will switch to R(ecovered) after Ir steps, with a probability γ. The pseudocode of the stochastic model of epidemic spreading is given in Algorithm 1 . In our analysis, each time step in the discrete time simulation corresponds to a minute (60 secs), so negligible contacts (interactions of less than a minute) are not considered.

Algorithm 1.

Algorithm 1

Epidemic spreading in trajectory networks.

Studies on infectious diseases have showed that prolonged exposure of a susceptible node to an infected node increases the likelihood of infection [39]. It is easy to see that in the epidemic model presented in Algorithm 1, an infected node u has multiple chances to propagate the disease. Formally, given a trajectory network G(N,V), the probability pu,v of a susceptible node uV being infected by a neighboring infected node vV after k independent trials is given by the cumulative distribution function of the geometric distribution:

pu,v=1(1β)k (5)

where β is the transmission probability of the disease. Eq. (5) represents the complementary probability of u not being infected after k independent trials. It is easy to see that k depends on the duration of the contact between an infected node v and u (i.e., one chance per time unit) and that 0pu,v1.

3.3. Conversion of the stochastic model to a deterministic model

The epidemic spreading model we described in Algorithm 1, is a stochastic process that possesses some inherent randomness. Starting with the same initial conditions (i.e., the same sets of Susceptible and Infected nodes) and parameter values, multiple independent simulations of the epidemic spreading process can produce outputs that vary a lot, in terms of the total number of nodes infected at the end of the process. This is because the final outcome depends on flipping a biased coin at every time step to decide whether the disease will diffuse from one node to another in the network.

Interestingly, there is an equivalent deterministic model that offers a static view of the network and is more practical, as it allows for faster simulations than the stochastic model. We describe here a method that given a stochastic model of epidemic spreading in the trajectory network, converts it to a deterministic model based on percolation theory [40], [41]. In mathematics and physics, percolation theory is used to explain the flow of fluids through certain types of porous material. Similarly, in network science, it is used to describe the behavior of a network when nodes or links are removed.

To utilize this idea in the epidemic spreading model, recall that each infected node in the network has a probability β to infect each of its neighboring nodes at every time step t, by flipping a biased coin with a probability β. At the end of the interaction, the infected node has either infected the neighboring node, in which case we consider the edge to be “active”, or not, in which case we consider the edge to be “removed or blocked”. The idea of percolation is that instead of deferring the decision of whether an edge will be “active” or “removed” at runtime, we can make a decision for each edge of the trajectory network G(V,E) at the very beginning of the whole process. In practice, for each edge in the network, we just need to flip a biased coin with probability β as many times as the duration of the contact (expressed in time units), and decide whether to keep it or remove it from the network. At the end of the process a smaller network G(V,E) is constructed, such that EE.

In terms of the correctness of the epidemic spreading process itself, it does not matter if the decision to keep or remove an edge is made at runtime or early in the process. In terms of runtime cost, percolation allows to work on a smaller network (since many edges are already removed) and allows simulations to finish faster. We therefore employ percolation in the relevant set of experiments.

3.4. Containment intervention strategies

In this section, we explore various network-based intervention strategies that aim to contain an epidemic [42]. These interventions change the structure of the trajectory network — the backbone on which an epidemic spreads over time — and eventually affect the size of the set IT of the infected nodes at the end of the process. The strategies relate to node immunization (network node removal) or breaking of social ties (network edge removal) and are actuated either at a network-level (governing authority decision) or a node-level (individual decision). Our goal is to design targeted models of intervention and evaluate them against sensible null models. Details of these strategies and models are presented below, along with discussion on their feasibility and their implications to health policy.

Strategy 1: node immunization. Based on this strategy, we remove a fraction αn of all nodes in the network. Formally, given a set SV of nodes to be removed, where |S|=αn|V|, the infectious disease now spreads in the induced subgraph G(V,E) of G whose vertex set is V=VS and whose edge set E consists of all of the edges in E that have both endpoints in V. The real-world interpretation of this strategy is that some individuals are quarantined (i.e., they are in a state of isolation where no contacts occur) or develop immunization because of a vaccine. The network effect is that a contagious disease cannot spread through their contacts anymore.

Null model: A fraction αn of nodes is removed uniformly at random.

Targeted model (removing high-risk individuals): Nodes are ranked based on their relative risk of infection rrisku, in a descending order. Then, a fraction αn of the nodes with the largest risk rrisku are removed. Ties are resolved uniformly at random. An illustration of this intervention can be seen on Fig. 1 .

Fig. 1.

Fig. 1

Example of removing high-risk individuals.

It is important to note that this is a network-level intervention strategy, where a national authority determines a set of individuals to immune (or request to quarantine) based on an estimate of their relative risk rrisku. Such an intervention, is resource-intensive, but also might infringe the privacy of individuals. It also carries a risk of discriminating against individuals with specific mobility patterns (i.e., super-spreaders). As a result, the feasibility of this intervention strategy is rather weak for large communities.

Strategy 2: breaking of social ties. Based on this strategy, we remove a fraction αe of edges adjacent to each node (contacts). Formally, given a node uV and its set of neighbors Γ(u), we remove |Hu|=αe|Γ(u)| edges, where HuΓ(u). The total number of edges removed from the network is |H|=i=1N|Hi|, HE. The infectious disease now spreads in the subgraph G(V,E) of G, where E=EH. The real-world interpretation of this strategy is that individuals have some understanding of the mobility patterns of their contacts and they can make decisions about who to avoid. The network effect is that a contagious disease cannot spread through some specific contacts anymore.

Null model: For each node u, a fraction αe of its contacts to neighboring nodes Γ(u) are removed, uniformly at random.

Targeted model A (removing high-risk contacts): Nodes vΓ(u) in the neighborhood of u are ranked based on their relative risk of infection rriskv,vΓ(u), in a descending order. Then, a fraction αe of contacts to the neighboring nodes with the largest risk rriskv are removed from the network. Ties are resolved uniformly at random. An illustration of this intervention can be seen on Fig. 2 .

Fig. 2.

Fig. 2

Example of removing high-risk contacts of individuals.

It is important to note that this is an individual-level intervention strategy, where each individual makes a local decision about who to avoid, based on some understanding of the relative risk rrisku associated with each of its contacts. The model assumes that individuals are in position to understand that they should avoid contacts that are frequently and regularly interacting with many others (e.g., due to their occupation or mobility habits). Such an intervention is easier and not resource-intensive to implement, due to its distributed nature and does not infringe on the privacy of individuals. As a result, the feasibility of this intervention strategy is rather high for large communities.

Targeted model B (removing non-community contacts): Nodes vΓ(u) in the neighborhood of u are ranked based on the number of shared friends s(u,v)=|Γ(u)Γ(v)|, in an ascending order. Then, a fraction αe of edges to the neighbors with the smallest s(u,v) is removed from the network. Ties are resolved uniformly at random. An illustration of this intervention can be seen on Fig. 3 .

Fig. 3.

Fig. 3

Example of removing non-community contacts of individuals.

This model resembles a “social bubble” policy practiced by many, where an individual maintains contact with only family members and a few close friends. This way, potential “network bridges” between different well-knit communities in the network are eliminated and the infectious disease finds it hard to cross between them. This is an individual-level intervention strategy, where each individual makes a local decision about who to keep in its social bubble, based on some understanding of how many friends they have in common. This is a relatively easier assumption to make (than the one made by the targeted model A). Such an intervention is once again easier and not resource-intensive to implement, due to its distributed nature and does not infringe on the privacy of individuals. As a result, the feasibility of this intervention strategy is rather high for large communities.

4. Experimental evaluation

In this section, we provide details of our experimental evaluation. We first present our synthetic data generator and describe the characteristics of the contact networks produced. Then, we present a COVID-19 use case, by specifying the parameters and refining the research questions we aim to explore. For each research question, we outline the experimental scenario and process followed to effectively address it. Finally, we discuss the results and any implications.

4.1. Data generation

In order to evaluate our stochastic agent-based SEIR epidemic model, we had to rely on large-size data representing trajectories of individuals or their spatiotemporal contacts. Moreover, for simulations to be reliable, the data needs to be (almost) complete; if significant amount of information about people's mobility or contacts is missing, then any underlying analysis related to community structure and individual behavior could be significantly affected. At the same time, mobility data is highly sensitive; many contact tracing applications rely on privacy-preserving proximity data, making the collection of real-world data impossible. With these factors in mind, we opted to use synthetically generated data. On the other hand, the benefit of generating synthetic data is that all parameters could be tuned and therefore analysis can be more comprehensive.

We generated synthetic data that simulates the activity of people living and working within a specified urban area over the course of a month. We defined an observation area A of approximately 1 km2 including the York University Keele campus and surrounding neighborhoods in Toronto, Canada. Each individual in the simulation is randomly assigned a home location and frequents a number of favorite places (out of a predefined set of places), following a normal distribution. Moreover, each person is assigned an activity level parameter that determines how “active” they are by controlling the number of hours they may spend outside their home every day and the number of places they are likely to visit. Based on existing research on daily activity [43], each individual was assigned between 0 and 12 active daily hours, determined by their activity level. Table 1 presents the parameters of the data generator and Fig. 4 presents descriptive analytics of the generated individual mobility data, including the distribution of activity levels, the distribution of places visited and the hourly activity over the course of a month by individuals of different activity level.

Table 1.

Summary of trajectory data.

Parameter Value
#individuals 2-10k
area (km2) 1
observation time (second) 2,592,000
#points of interest 1,330
activity distribution AlevelN(μ=5.5,σ2=2.5),Alevel[0,10]

Fig. 4.

Fig. 4

Indicators and distribution of generated individuals' activity.

We combine all previous parameters to generate a set of destinations and daily schedules for a specified number of individuals. Afterwards, the exact movement and trajectory traces of these people are simulated using Eclipse Simulation of Urban MObility (SUMO) [44], an open source, highly portable, microscopic and continuous multi-modal traffic simulation package. SUMO is capable of modeling accurate and highly realistic movement of vehicles but also pedestrians, including movement through pedestrian crossings and crowded sidewalks. The end result is synthetic but reliable, complete datasets representing the daily movement of individuals in the observation area A, over a period of a month.

4.2. Use case: parameters for the COVID-19 epidemic

We used the synthetic data generator to model a population of 2,000, 3,000, 5,000 and 10,000 individuals moving in the same campus area. Of course, the resulting datasets correspond to different population densities. This allows us to examine the progression of an epidemic in urban areas with different population densities while controlling the rest of the parameters in the problem.

Focusing on the COVID-19 epidemic use case, we can transform the trajectory traces we obtained into trajectory networks to model the spread of infection. To do this, we need to determine a specific distance threshold where two individuals are considered in contact. Prior research on SARS-CoV-2 transmission through air droplets has shown that individuals at a close physical distance of ≤1-2 m have a high probability of transmission, while there still exists a lower probability when within 2-9 m [39], [45]. As we require a single cutoff value, we selected the conservative threshold value τ=2m. We used the tools developed by [20] to construct the corresponding trajectory networks. The properties of the generated network datasets can be seen in Table 2 .

Table 2.

Summary of datasets.

Name 2k 3k 5k 10k
#individuals 2,000 3,000 5,000 10,000
#data points 172,800,000 259,200,000 432,000,000 864,000,000
#contacts 230,170 1,493,344 9,890,294 56,232,058
clustering coeff. 0.1896 0.2737 0.5899 0.6914

A factor that is somewhat uncertain in related research is the duration required for two individuals to be considered in contact and, subsequently, the transmission probability per time unit β. Studies that examine definitions of contact duration typically consider the case of 1-2 m for 15 minutes or more [19]. With the 12.8% transmission probability from [39], this would result in β0.85% per minute. Furthermore, there are studies of transmission times in different environments such as airplanes [46] or ventilated spaces [47]. These provide values of 1.8% per minute (quadrupled for conservative results) when within 1 m and 1% per minute when in a well-ventilated space without masks, respectively. In our work, we use β=1% per minute in most experiments, but we also explore the progress of an epidemic with different values of β.

Regarding the infection's progress, we follow the example of well-established prior research on COVID-19 [32] and utilize the SEIR model with exposure period of 3 days, infectious period of 6 days and recovery period of 10 days. The recovery probability γ helps to understand the severity of a disease in long term, since together with transmission rate β, it determines R0. However, analysis of varying values of parameter γ is out of the scope of our model that focuses on heterogeneity due to mobility patterns and targeted intervention strategies. In the experiments, we therefore fix the recovery probability to γ=1 (i.e., 100%).

With these parameters selected, we aim to answer the following questions:

  • Q1 Estimation of infection riskrisku. What is the distribution of relative infection risks, for each of the proposed estimation metrics?

  • Q2 Properties of infection seedI0. How does the estimated risk or the number of initial infected individuals affect the progress of the epidemic?

  • Q3 Effect of transmission probabilityβ. How does the probability of transmission influence the progress of the epidemic?

  • Q4 Effect of quarantineq. How does the quarantine of infectious individuals influence the progress of the epidemic?

  • Q5 Effect of intervention strategies. When each intervention strategy is applied, how is the progress of the epidemic affected?

  • Q6 Comparative analysis of intervention strategies. When the same number of contacts are removed, how does the effect of different intervention strategies compare?

Throughout the experimental evaluation, experiments are repeated for 10 iterations with different random seeds, reporting the average values of results.

4.3. Estimation of infection risk

We utilize each of the three proposed methods to estimate the relative risk of infection for the population sample in all datasets. The resulting distribution of risks can be seen in Fig. 5 . As can be seen, the duration-based risku(2) gives a higher risk to a smaller number of individuals than the degree-based risku(1). The geometric interaction-based risku(3) produces an estimate that is balanced between the two other metrics, and we use this in all remaining experiments. The reason why we employ risku(3) is that it naturally captures the dynamics of the infection transmission process. In particular, the risk model needs to capture the following characteristics:

  • the more contacts an individual has the higher the risk;

  • the longer the duration of an interaction, the higher the risk; and

  • the risk of infection due to a singular contact should not increase infinitely but it should plateau once it reaches a probability close to 1 (i.e., certain infection).

risku(3) uses a geometric function to naturally represent the risk due to these characteristics. Note that the probability of u infecting v after n attempts is increasing for every time unit (i.e., minute), demonstrates diminishing returns and it is eventually plateauing out as it approaches to 1 (i.e., 100%). If we were not considering a geometric function (or similar diminishing returns function), then the risk of certain individuals would grow continuously as a factor of the duration of the contact and would lead to disproportional large risk to certain individuals (due to certain lengthy interactions).

Fig. 5.

Fig. 5

Infection risk distribution of individuals.

4.4. Infection transmission characteristics

Fig. 6a presents the progress of the SEIR model over the period of 30 days for the case of the 10k dataset; we report the number of individuals that are found in each of the four compartments of the model (Susceptible, Exposed, Infected and Recovered, respectively).

Fig. 6.

Fig. 6

Infection progress and characteristics.

Furthermore, Fig. 6b shows the basic reproductive number R0 over time. It can be seen that the R0 fluctuates over time with values ranging from 0.0 to 2.5, while its 30-day moving average is equal to 0.939 (dashed line). Note that the R0 is assuming a perfect mixing network (i.e., a complete graph). However, real-world communities do not resemble a complete graph. Our micro-scale analysis of infections allows to monitor the direct and secondary infections attributed to an individual u, and therefore allows to report the reproductive number R0u of uN. In contrast to the basic reproduction number R0 that represents the expected number of infections directly generated by an infectious individual, the R0u represents the exact number of people infected by the specific individual u. Now, instead of relying on the R0 we are in position to provide the distribution of the individual R0u values in the population. In Fig. 6c, we show the distribution of R0u for the 10k dataset, along with the mean R0=uNR0u. It is evident that there is significant variation between individuals, something that the mean R0 fails to capture.

4.5. Properties of infection seed

In Fig. 7 a we can see the infected count I over the period of a month for an initial “seed” risku(3), all of them with high, medium, low, or random risks risku(3). In the case of low-risk individuals, the infection never spreads to other people as the initial ones have very limited or no contact with anyone else. In all other cases however, we can see that after a month the vast majority of the population has been infected, with that conclusion arriving faster or slower depending on the initial seed risk. Similarly, in Fig. 7b we can see the final infected counts IT for initial seeds of random risk but different size IO, for all 3 datasets. While there is some reduction in final infections when IO=1, all other seed sizes lead to the same result, determined by the population density.

Fig. 7.

Fig. 7

Effect of initial seed type and size.

An explanation for this behavior can be found when examining the contribution of each individual in the spread of the infection, and the role of super-spreaders. As mentioned above, we use R0u to define the set of individuals that were directly infected by their contact with individual u. We define as R1u those infected by any person vR0u, R2u those infected by vR1u, and so on. Furthermore, we define Ru={R0u,R1u,}, i.e. all individuals that were infected directly or indirectly because of u. In Fig. 8 a we can see the distribution of R0u for the entire population, along with each person's relative risk rrisku(2). As expected, high-risk individuals are responsible for the vast majority of direct disease transmissions. However, in Fig. 8b we display the equivalent distribution for Ru. There, we can see that many medium or low-risk individuals are actually responsible for a lot of the secondary, indirect infections. This means that, even when a person with few contacts is infected, a single contact with a high-risk super-spreader is enough for the disease to quickly propagate across the community.

Fig. 8.

Fig. 8

Direct and indirect infection counts for different individuals.

The rest of the experiments use IO=10 individuals of random risk risku(3).

4.6. Effect of transmission probability

In Fig. 9 a we can see the progress of an epidemic in a population of 10,000 when the probability of transmission β has different values. Furthermore, in Fig. 9b we can see the final infected counts for those same values of β for the different datasets. Any value above 1-2% all but guarantees the rapid infection of the entire population, and values below 1% result in significantly reduced counts when the population density isn't too high. It is evident that the transmission probability has a significant impact on the spread of an epidemic; face masks and any other means of reducing it can have critical benefits, as the vast majority of existing research also indicates [45].

Fig. 9.

Fig. 9

Effect of transmission probability β.

4.7. Effect of quarantine

Quarantine policies, which include social distancing, home confinement and centralized quarantine, have been widely used to break the transmission chain of epidemic spread [48]. Even though some human-rights and socioeconomic issues have been raised in the process, this public health measure has proved effective in controlling disease spread [49], [50]. Ideally, only those who are infected should be quarantined, while others can travel as they wish. However, this imposes another challenge as recent studies have shown that many infected individuals are asymptomatic or only have mild symptoms [51], [52]. These individuals are most likely not aware they are infected, still able to transmit the virus to others, and therefore in need of quarantine.

To evaluate the effect of quarantine, we incorporate a quarantine parameter q in the experiments that represents the proportion of infected individuals that will have no contact with any others until they fully recover; we assume that the rest 1q proportion of the infected nodes will not quarantine and will continue interacting with others (e.g., due to no symptoms). Given that the quarantine of every infectious individual corresponds with the removal of many probable-transmission contacts, the expectation for this case is that the effect of the quarantine parameter q on the epidemic will be significant. Indeed, as can be seen in Fig. 10 , even small values can result in greatly reduced infection numbers, which greatly highlights the importance of quarantine measures. For the remainder of this work we wish to examine other properties in isolation, and we therefore set the parameter q=0.

Fig. 10.

Fig. 10

Effect of quarantine on final infected (10k dataset).

4.8. Effect of intervention strategies

To answer this research question, we examine the progress of the epidemic after applying the proposed intervention strategies on the 3k dataset. For each strategy, we report intervention and null-model results of I for an intervention proportion α=0.2, and the final infected IT for different α values.

Fig. 11a shows the results of removing 20% of the network nodes, i.e. individuals based on their risk. As mentioned earlier, this corresponds with targeted immunization or isolation of selected individuals. We can see that this has a substantial effect, greatly reducing the spread of the epidemic. The result is much less pronounced when removing medium or random risk individuals, and removing low-risk individuals has almost no effect. In Fig. 11b we can see the outcome for different proportions α. In this case, simply removing 30% of the highest-risk individuals practically eliminates the spread of infection completely.

Fig. 11.

Fig. 11

Results after high risk node immunization intervention.

Figs. 12a and 12b show the same results for the high-risk contact removal intervention. As mentioned in Section 3, this is equivalent to every person avoiding high-risk individuals among their contacts. We can see that this time the targeted strategy performs only slightly better than the null model, although that difference grows for higher intervention proportions. The result is not surprising. This is because the null model employed is not necessarily representing a bad strategy; by randomly selecting nodes and removing edges, one may remove high-risk, medium-risk or low-risk contacts. Furthermore, the aggregated network of the 3k dataset represents a small-world graph. It is well-known that when only a small proportion of edges are removed from that network (i.e. alpha is small), the impact on the connectivity of the network might not be significant due to the high clustering property of this type of graph.

Fig. 12.

Fig. 12

Results of removing high-risk individuals intervention.

Finally, Figs. 13a and 13b show the α=20% progress and different-proportion final counts IT for the non-community contact removal intervention strategy. As mentioned earlier this is equivalent to the “social bubble” concept, where each person only maintains contact with their close friends and family, avoiding people from other groups. We can see that this strategy is notably more effective than the null-model one for α<40%. Above that value, it is more beneficial to simply reduce each person's contacts overall, rather than targeting the more sporadic and (brief) contacts with people outside their community.

Fig. 13.

Fig. 13

Results of non-community contacts removal intervention.

4.9. Comparative analysis of intervention strategies

In order to compare the effectiveness of the proposed intervention strategies we have to take into account their relative impact on the population. Specifically, in high-risk node immunization with αn=0.1 the percentage of removed edges αe is much higher. In order to present a fair comparison of the intervention strategies, we report their results based on the number of removed contacts αe. The results for this experiment can be seen on Fig. 14 a.

Fig. 14.

Fig. 14

Distribution of R0 value for each individual.

In order to compare the effectiveness of the proposed intervention strategies, we have to take into account their relative impact on the population. Specifically, in high-risk node immunization with αn=0.1, the percentage of removed edges αe is much higher. Even with the same strategies (e.g., breaking of social ties), the number of edges to be removed can be different. As such, in order to present a fair comparison of the intervention strategies, we normalized the number of edges being removed in the experiments and report their results based on the number of removed contacts αe. The results for this experiment can be seen on Fig. 14a.

We can see that the “social bubble” intervention yields slightly better results than the high-risk individual removal one, when 30% or fewer contacts are removed. Afterwards, targeted immunization/isolation of individuals is significantly more effective. However, as mentioned in Section 3, we also need to take into consideration the feasibility of each approach. In that case, the uncommon edge intervention isn't only the most easily implemented one, but also performs very well up until a full 50-60% of all contacts have been removed. Any intervention above such magnitudes would significantly impact the movement and interactions of the population, and may be unrealistic for larger communities.

Fig. 14b provides the R0u distributions after applying each of the interventions. This allows for a more in-depth examination of each intervention's effects and better interpretation of the performance results. In this plot, the total area under each curve corresponds to the sum of all transmissions of the infection for that case. Therefore, it is expected that all interventions will result in curves covering smaller areas than the non-intervention case. However, each intervention accomplishes this in a different way. The node immunization and high-risk edge removal interventions simply remove most of the highest-risk contacts, corresponding to many of the highest R0u values in the distribution, resulting in “shifting” the entire distribution to the left. On the other hand, the non-community edge removal intervention removes values throughout the distribution, resulting in a steeper curve.

5. Related work

Our research is related to (i) trajectory data mining, (ii) dynamic network analysis, and (iii) epidemic spreading in complex networks. These topics have been active research directions for a long time, so there is a broad spectrum of related literature. However, it is only recently that due to the technological advancement in geolocation tracking devices (e.g., GPS-enabled mobile devices), mobility data is becoming more accessible. Mining patterns in large amounts of human digital traces provides an opportunity for designing more accurate epidemic spreading models and more effective network intervention strategies than before. We cover below some of the most significant efforts relevant to that goal. Note, as well that some of the related work has already been cited throughout the manuscript to keep the discussion focused, so we omit it here.

5.1. Trajectory data mining

Computational methods for mining spatiotemporal data, including trajectory/mobility data, have been extensively studied by the data mining and database communities. Two comprehensive surveys are provided by Zheng [18] and Atluri et al. [53]. Closely related to the problem of interest in this paper are problems that focus on mining the interactions among moving objects, over time, such as detecting pedestrian groups in trajectories [54], [55], [56] or determining the node centrality of moving objects in trajectory networks [20]. More recently, deep learning approaches for learning from spatiotemporal data and spatiotemporal networks have gained increasing attention [57], [58].

Digital contact tracing: In our research, we assume that human mobile traces are available. This can be enabled by existing digital contact tracing technologies [59], [60], [61]. For instance, Aleta et al. [62] synthesized contact networks and modeled SARS-CoV-2 transmission in the Boston metropolitan area. They found that effective testing and contact tracing plays an important role in preventing second-wave spreading when complete isolation is relaxed. Other technologies of contact tracing have also been proposed [63].

Privacy concerns of digital contact tracing: Digital contact tracing enables an easy and rapid implementation of infectious disease tracing, as it requires to gather and process simple information. However, gathering sensitive information might infringe the privacy of individuals [64]. We believe that controlling an infectious disease should not lead to a weakening of the privacy of individuals. We therefore advocate for protocols and technologies for “privacy-preserving proximity tracing” that protect the privacy of individuals [65], [66]. Connectivity technology available in mobile devices and newly developed connectivity protocols can make a decisive contribution to efficiently and widely support proximity tracing enabled by bluetooth connection and/or GPS location. Both approaches protect the privacy of the user through dynamic pseudo-IDs [67]. For instance, Apple (iOS) and Google (Android) are providing privacy-preserving cross-platform contact tracing via an open API and an opt-in Bluetooth-based proximity tracking.1 In addition, the Pan-European Privacy-Preserving Proximity Tracing (PEPP-PT) protocol2 and the Decentralized Privacy-Preserving Proximity Tracing (DP-3T) [68] protocol3 have recently been proposed for providing a secure and decentralized privacy-preserving proximity tracing system.

5.2. Dynamic networks analysis

The problem of objects being dispersed in space and interacting with each other if they are in close vicinity has been intensely explored in graph theory. Graph theory concepts, such as proximity graphs [69] and geometric intersection graphs [70] are characteristic examples. For instance, relative neighbor graphs [71] and Gabriel graphs [69] connect nearest neighbors if no other vertexes are nearby, while Delaunay triangulations [72] maximize the minimum angles of all triangles formed. These graphs, however, mostly deal with static data, while in our problem we are studying cases of multiple proximity graphs, one for each time unit. There has also been significant research on dynamic networks, such as time-varying networks or otherwise temporal networks. With the addition of temporal information, several concepts of static networks are not valid anymore, so they need to be adapted/studied in the context of dynamic networks. Due to its importance, dynamic network analysis is therefore an emergent discipline of network science focusing on network dynamics. Example problems include the computation of temporal node centrality [25] or computing metrics of network reachability [73], shortest paths [24], motifs [74] and other [75]. The dynamic nature of these systems introduces additional complexity and computational challenges. Dynamic networks have also been studied using machine learning models in the context of gradually evolving networks, where nodes/edges are added/removed over time [76], [77]. A comprehensive survey of this line of research can be found in [78]. Related to the current research, Leitch et al. [79] presented a review towards epidemic thresholds on temporal networks; they pointed out that temporal networks engage dynamics of real-world contacts, so their study is of great importance for understanding disease spreading processes. Statistical approaches or computer simulations are often necessary to explore the evolution of these external processes over evolving networks.

5.3. Epidemic spreading in complex networks

Mathematical modeling of epidemic spreading in networks can help to study and control the emergence of infectious diseases in a population. Based on the traditional SIR model, Weitz et al. [80] designed epidemiological interventions that can exploit the idea of ‘shield immunity’. The main idea of the model is to deploy recovered individuals as focal points for sustaining safer interactions via interaction substitution. This method, however, cannot easily translate to health policy and/or individual level recommendations. In our study, we employ an agent-based SEIR model. The agent-based SEIR model has previously been employed to study the spread of epidemics in dynamic networks. For instance, Perez and Dragicevic [33] proposed a spatially explicit epidemiological model of infectious disease for understanding of the diffusion of a disease in a network of human contacts. In their model, human interactions are not fully dynamic, but are determined by the geographic area they are found at the same time. Yang et al. [81] proposed a flow-based edge betweenness method that detects important “bottleneck” edges in contact networks. They show that targeting those edges can contain the epidemic spread more than state-of-the-art edge betweenness methods. Their model does not easily translate to individual level policy, but offers high-level guidance for the network containment problem. Agent based epidemic spreading models are also very similar to the study of the epidemic spreading driven by random walks and one can translate to the other. Similar to the SEIR models, in the random walk based epidemic models, the infection is spread to the neighbors with a probability. This probability is determined by the transition matrix of the random walker. Pu et al. [82] proposed a biased random walk based spreading model. They show that the average node degree and homogeneity of the node degree plays an important role in the number of the infected nodes. More recently, Bestehorn et al. [83] derive an upper bound for the reproduction number based on a discrete-time Markovian random walk model of the infection spreading. New advances in deep learning and network representation learning have also been used to model the epidemics [84]. Change et al. [85] propose an epidemic model based on the embedding of the mobility network and show the role of the social and economical disparities in the spread of the epidemics and the network structure. The main limitation of these models is that they model the epidemics on a static network and evolving aspect of the network is not analyzed. Heidari and Papagelis [76], [77] have proposed evolving network representation learning methods that lie in the intersection of these two topics and can be used to model the epidemic spreading in evolving networks.

Population heterogeneity in epidemics: Britton et al. [38] studied the effect of population heterogeneity on achieving the epidemiological objective of ‘herd immunity’ to the COVID-19 disease. In their study, they found that herd immunity can be achieved with less per cent of the population being infected than it was thought to be required (i.e., 60%), if one considers the age and social activity into the model. Similar ideas of evaluating the effect of individual variability in the disease spreading process have been considered by others. Our research is individual variability based on the structural information of a network; information that is related to the differences of nodes in the network based on basic graph/network characteristics (e.g., based on node centrality in the dynamic network, etc.). Related to our main thesis, Hébert-Dufresne et al. [86] argued that using R0 alone to predict epidemic size is not enough in real-world outbreaks. They pointed out the necessity of considering heterogeneity in secondary infections to predict outbreak size. In practice, this can be done automatically, fairly cheaply, and highly accurately, by a wide-spread deployment and adoption of digital contact tracing technologies. Moreover, Lloyd-Smith et al. [87] pointed out that the basic reproduction number R0 in the traditional epidemic analyses is a population-level estimate. Through a theoretical and statistical analysis, they showed that individual variation greatly affects epidemic growth rates, and therefore targeted control interventions would be more effective than population-wide ones. On a similar basis, Changruenngam et al. [88] studied the effect of human mobility on disease transmission dynamics in two contrasted countries. In the study, they incorporated individual human mobility in the SEIR model, which helped better describe infection spreading dynamics. In particular, based on population data of two areas, human mobility was modeled by obtaining the probability for an individual exploring new locations, from which the human mobility landscape could also be captured. Rocha and Masuda [89] augmented the SIR model by incorporating an individual-based approximation that captures the evolution of the probability that an individual is infected by another individual in the network. These studies highlighted the importance of incorporating individual variation in the epidemic model.

Network containment interventions: Instead of a complete or near-complete lockdown, P. Block et al. [42] proposed more moderate distancing strategies inspired by network science, including limiting contacts to similar, community-based or repetitive contacts. This study is based on a static network, while in a real-world situation, the network is dynamic informed by how individuals move around and interact with each other. Incorporating individual variation in the model has also the advantage that network interventions can be designed at an individual's level. For instance, Zhou et al. [90] reviewed several studies related to network immunization and concluded that vaccination of targeted nodes (e.g., nodes with large node degree), outperforms random immunization. Similarly, Torres et al. [91] evaluated different immunization strategies and found that node degree is a very strong measurement in determining node importance when considering the targeted nodes. In [92], authors study the spread of epidemics on static and temporal networks. Epidemics on temporal networks is closely related to our work as it follows the same assumption in the network construction: the network evolves faster than the spread of the pathogen. However, that work assumes same degree for all the nodes in the temporal network. In our case, node degrees are determined by the mobility patterns of individuals.

6. Conclusions

Effective modeling of an emerging infectious disease has the potential to improve or save human lives. It can also minimize the societal and economic damage caused by physical distancing and confinement measures imposed by governing authorities to control an epidemic. Towards that end, we have presented a data-driven model of infectious epidemic spreading in spatiotemporal networks informed by mobility data of individuals. We designed and evaluated simple individual-based intervention strategies that exhibit network effects and can significantly control the spread of an infectious disease. We have also demonstrated that these targeted interventions can outperform generic intervention strategies. These strategies are easy to understand and translate to public health policy. While COVID-19 serves as a use case in this research, the same methodology can be used to model and mitigate any emerging infectious disease. An inherent limitation of our model is that it assumes availability of trajectories of individuals through digital tracing technologies. While these data are typically available to telecom and other third-parties through privacy-preserving techniques, our research relied only on realistic synthetic data sets.

Reproducibility: We make source code and data sets used in the experiments publicly available4 to encourage reproducibility of results.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Footnotes

References

  • 1.Smith C.F., et al. W. Heinemann; 1962. Thucydides: History of the Peloponnesian War, vol. 108. [Google Scholar]
  • 2.Papagrigorakis M.J., Yapijakis C., Synodinos P.N., Baziotopoulou-Valavani E. Dna examination of ancient dental pulp incriminates typhoid fever as a probable cause of the plague of Athens. Int. J. Infect. Dis. 2006;10(3):206–214. doi: 10.1016/j.ijid.2005.09.001. [DOI] [PubMed] [Google Scholar]
  • 3.Taubenberger J.K., Morens D.M. 1918 influenza: the mother of all pandemics. Rev. Biomed. 2006;17(1):69–79. doi: 10.3201/eid1201.050979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Morens D.M., Fauci A.S. The 1918 influenza pandemic: insights for the 21st century. J. Infect. Dis. 2007;195(7):1018–1028. doi: 10.1086/511989. [DOI] [PubMed] [Google Scholar]
  • 5.Hays J.N. ABC-CLIO; 2005. Epidemics and Pandemics: Their Impacts on Human History. [Google Scholar]
  • 6.Peiris J.S., Yuen K.Y., Osterhaus A.D., Stöhr K. The severe acute respiratory syndrome. N. Engl. J. Med. 2003;349(25) doi: 10.1056/NEJMra032498. [DOI] [PubMed] [Google Scholar]
  • 7.Butler D. Swine flu goes global: new influenza virus tests pandemic emergency preparedness. Nature. 2009;458(7242):1082–1084. doi: 10.1038/4581082a. [DOI] [PubMed] [Google Scholar]
  • 8.Zumla A., Hui D.S., Perlman S. Middle East respiratory syndrome. Lancet. 2015;386(9997):995–1007. doi: 10.1016/S0140-6736(15)60454-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Baize S., Pannetier D., Oestereich L., Rieger T., Koivogui L., Magassouba N., Soropogui B., Sow M.S., Keïta S., De Clerck H., et al. Emergence of Zaire Ebola virus disease in Guinea. N. Engl. J. Med. 2014;371(15):1418–1425. doi: 10.1056/NEJMoa1404505. [DOI] [PubMed] [Google Scholar]
  • 10.Campos G.S., Bandeira A.C., Sardi S.I. Zika virus outbreak, Bahia, Brazil. Emerg. Infect. Dis. 2015;21(10):1885. doi: 10.3201/eid2110.150847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.W.H. Organization, et al. Coronavirus disease 2019 (covid-19): situation report, 72 (2020).
  • 12.Eames K.T., Keeling M.J. Contact tracing and disease control. Proc. R. Soc. Lond. B, Biol. Sci. 2003;270(1533):2565–2571. doi: 10.1098/rspb.2003.2554. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.World Health Organization & Centers for Disease Control and Prevention (U.S.) World Health Organization; 2015. Implementation and management of contact tracing for Ebola virus disease: emergency guideline. Tech. rep. [Google Scholar]
  • 14.Ferretti L., Wymant C., Kendall M., Zhao L., Nurtay A., Abeler-Dörner L., Parker M., Bonsall D., Fraser C. Quantifying sars-cov-2 transmission suggests epidemic control with digital contact tracing. Science. 2020;368 doi: 10.1126/science.abb6936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Woodhams S. 2020. Covid-19 digital rights tracker. [Google Scholar]
  • 16.Mokbel M.F., Abbar S., Stanojevic R. Contact tracing: beyond the apps. 2020. arXiv:2006.04585 preprint.
  • 17.Chakrabarti D., Faloutsos C. Graph mining: laws, generators, and algorithms. ACM Comput. Surv. 2006;38(1):2. [Google Scholar]
  • 18.Zheng Y. Trajectory data mining: an overview. ACM Trans. Intell. Syst. Technol. 2015;6(3):1–41. [Google Scholar]
  • 19.Keeling M.J., Hollingsworth T.D., Read J.M. The efficacy of contact tracing for the containment of the 2019 novel coronavirus (covid-19) 2020. https://doi.org/10.1101/2020.02.14.20023036 [DOI] [PMC free article] [PubMed]
  • 20.Pechlivanoglou T., Papagelis M. Fast and accurate mining of node importance in trajectory networks. 2018 IEEE International Conference on Big Data; Big Data; IEEE; 2018. pp. 781–790. [Google Scholar]
  • 21.Newman M.E. A measure of betweenness centrality based on random walks. Soc. Netw. 2005;27(1):39–54. [Google Scholar]
  • 22.Brandes U. A faster algorithm for betweenness centrality. J. Math. Sociol. 2001;25(2):163–177. [Google Scholar]
  • 23.Kempe D., Kleinberg J., Kumar A. Proceedings of the Thirty-Second Annual ACM Symposium on Theory of Computing. ACM; 2000. Connectivity and inference problems for temporal networks; pp. 504–513. [Google Scholar]
  • 24.Wu H., Cheng J., Huang S., Ke Y., Lu Y., Xu Y. Path problems in temporal graphs. Proc. VLDB Endow. 2014;7(9):721–732. [Google Scholar]
  • 25.Kim H., Anderson R. Temporal node centrality in complex networks. Phys. Rev. E. 2012;85(2) doi: 10.1103/PhysRevE.85.026107. [DOI] [PubMed] [Google Scholar]
  • 26.Keeling M.J., Grenfell B.T. Individual-based perspectives on r0. J. Theor. Biol. 2000;203(1):51–61. doi: 10.1006/jtbi.1999.1064. [DOI] [PubMed] [Google Scholar]
  • 27.Diekmann O., Heesterbeek J., Metz J. On the definition and the computation of the basic reproduction ratio r 0 in models for infectious diseases in heterogeneous populations. J. Math. Biol. 1990;28(4) doi: 10.1007/bf00178324. [DOI] [PubMed] [Google Scholar]
  • 28.Anderson R.M., Anderson B., May R.M. Oxford University Press; 1992. Infectious Diseases of Humans: Dynamics and Control. [Google Scholar]
  • 29.Brauer F., Castillo-Chavez C., Castillo-Chavez C. Springer; 2012. Mathematical Models in Population Biology and Epidemiology, vol. 2. [Google Scholar]
  • 30.Hethcote H.W. The mathematics of infectious diseases. SIAM Rev. 2000;42(4):599–653. [Google Scholar]
  • 31.Earn D.J., Rohani P., Bolker B.M., Grenfell B.T. A simple model for complex dynamical transitions in epidemics. Science. 2000;287(5453) doi: 10.1126/science.287.5453.667. [DOI] [PubMed] [Google Scholar]
  • 32.Tuite A.R., Fisman D.N., Greer A.L. Mathematical modelling of covid-19 transmission and mitigation strategies in the population of Ontario, Canada. CMAJ, Can. Med. Assoc. J. 2020;192(19):E497–E505. doi: 10.1503/cmaj.200476. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Perez L., Dragicevic S. An agent-based approach for modeling dynamics of contagious disease spread. Int. J. Health Geogr. 2009;8(1):50. doi: 10.1186/1476-072X-8-50. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.N. Hoertel, M. Blachier, C. Blanco, M. Olfson, M. Massetti, F. Limosin, H. Leleu, Facing the covid-19 epidemic in nyc: a stochastic agent-based model of various intervention strategies, medRxiv.
  • 35.Yao D., Durrett R. Epidemics on evolving graphs. 2020. arXiv:2003.08534 preprint.
  • 36.Grimm V., Berger U., Bastiansen F., Eliassen S., Ginot V., Giske J., Goss-Custard J., Grand T., Heinz S.K., Huse G., et al. A standard protocol for describing individual-based and agent-based models. Ecol. Model. 2006;198(1–2):115–126. [Google Scholar]
  • 37.M.G.M. Gomes, R. Aguas, R.M. Corder, J.G. King, K.E. Langwig, C. Souto-Maior, J. Carneiro, M.U. Ferreira, C. Penha-Goncalves, Individual variation in susceptibility or exposure to sars-cov-2 lowers the herd immunity threshold, medRxiv, 2020. [DOI] [PMC free article] [PubMed]
  • 38.Britton T., Ball F., Trapman P. A mathematical model reveals the influence of population heterogeneity on herd immunity to sars-cov-2. Science. 2020 doi: 10.1126/science.abc6810. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Jones N.R., Qureshi Z.U., Temple R.J., Larwood J.P.J., Greenhalgh T., Bourouiba L. Two metres or one: what is the evidence for physical distancing in covid-19? BMJ, Br. Med. J. 2020;370 doi: 10.1136/bmj.m3223. [DOI] [PubMed] [Google Scholar]
  • 40.Barbour A., Mollison D. Stochastic Processes in Epidemic Theory. Springer; 1990. Epidemics and random graphs; pp. 86–89. [Google Scholar]
  • 41.Frisch H., Hammersley J. Percolation processes and related topics. J. Soc. Ind. Appl. Math. 1963;11(4):894–918. [Google Scholar]
  • 42.Block P., Hoffman M., Raabe I.J., Dowd J.B., Rahal C., Kashyap R., Mills M.C. Social network-based distancing strategies to flatten the covid-19 curve in a post-lockdown world. Nat. Hum. Behav. 2020:1–9. doi: 10.1038/s41562-020-0898-6. [DOI] [PubMed] [Google Scholar]
  • 43.Bhat C.R., Singh S.K. A comprehensive daily activity-travel generation model system for workers. Transp. Res., Part A, Policy Pract. 2000;34(1):1–22. doi: 10.1016/s0965-8564(98)00037-8. [DOI] [Google Scholar]
  • 44.Lopez P.A., Behrisch M., Bieker-Walz L., Erdmann J., Flötteröd Y.-P., Hilbrich R., Lücken L., Rummel J., Wagner P., Wießner E. The 21st IEEE International Conference on Intelligent Transportation Systems. IEEE; 2018. Microscopic traffic simulation using sumo.https://elib.dlr.de/124092/ [Google Scholar]
  • 45.Chu D.K., Akl E.A., Duda S., Solo K., Yaacoub S., Schünemann H.J., El-harakeh A., Bognanni A., Lotfi T., Loeb M., et al. Physical distancing, face masks, and eye protection to prevent person-to-person transmission of sars-cov-2 and covid-19: a systematic review and meta-analysis. Lancet. 2020;395(10242):1973–1987. doi: 10.1016/S0140-6736(20)31142-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Hertzberg V.S., Weiss H., Elon L., Si W., Norris S.L. Behaviors, movements, and transmission of droplet-mediated respiratory diseases during transcontinental airline flights. Proc. Natl. Acad. Sci. 2018;115(14):3623–3627. doi: 10.1073/pnas.1711611115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Augenbraun B.L., Lasner Z.D., Mitra D., Prabhu S., Raval S., Sawaoka H., Doyle J.M. Assessment and mitigation of aerosol airborne sars-cov-2 transmission in laboratory and office environments. J. Occup. Environ. Hyg. 2020;17(10):447–456. doi: 10.1080/15459624.2020.1805117. [DOI] [PubMed] [Google Scholar]
  • 48.Gensini G.F., Yacoub M.H., Conti A.A. The concept of quarantine in history: from plague to sars. J. Infect. 2004;49(4):257–261. doi: 10.1016/j.jinf.2004.03.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Tognotti E. Lessons from the history of quarantine, from plague to influenza a. Emerg. Infect. Dis. 2013;19(2):254. doi: 10.3201/eid1902.120312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Wang Y., Shi L., Que J., Lu Q., Liu L., Lu Z., Xu Y., Liu J., Sun Y., Meng S., et al. The impact of quarantine on mental health status among general population in China during the covid-19 pandemic. Mol. Psychiatry. 2021:1–10. doi: 10.1038/s41380-021-01019-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Gao Z., Xu Y., Sun C., Wang X., Guo Y., Qiu S., Ma K. A systematic review of asymptomatic infections with covid-19. J. Microbiol. Immunol. Infect. 2021;54(1):12–16. doi: 10.1016/j.jmii.2020.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Kim G.-u., Kim M.-J., Ra S.H., Lee J., Bae S., Jung J., Kim S.-H. Clinical characteristics of asymptomatic and symptomatic patients with mild covid-19. Clin. Microbiol. Infect. 2020;26(7):948–e1. doi: 10.1016/j.cmi.2020.04.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Atluri G., Karpatne A., Kumar V. Spatio-temporal data mining: a survey of problems and methods. ACM Comput. Surv. 2018;51(4) [Google Scholar]
  • 54.Sawas A., Abuolaim A., Afifi M., Papagelis M. Tensor methods for group pattern discovery of pedestrian trajectories. 2018 19th IEEE International Conference on Mobile Data Management; MDM; IEEE; 2018. [Google Scholar]
  • 55.Sawas A., Abuolaim A., Afifi M., Papagelis M. Trajectolizer: interactive analysis and exploration of trajectory group dynamics. 2018 19th IEEE International Conference on Mobile Data Management; MDM; IEEE; 2018. pp. 286–287. [Google Scholar]
  • 56.Sawas A., Abuolaim A., Afifi M., Papagelis M. A versatile computational framework for group pattern mining of pedestrian trajectories. GeoInformatica. 2019;23(4):501–531. [Google Scholar]
  • 57.Wang S., Cao J., Yu P. IEEE Transactions on Knowledge and Data Engineering. 2020. Deep learning for spatio-temporal data mining: a survey. [Google Scholar]
  • 58.Mehmood S., Papagelis M. Learning semantic relationships of geographical areas based on trajectories. 2020 21st IEEE International Conference on Mobile Data Management; MDM; IEEE; 2020. pp. 109–118. [Google Scholar]
  • 59.Farrahi K., Emonet R., Cebrian M. Epidemic contact tracing via communication traces. PLoS ONE. 2014;9(5) doi: 10.1371/journal.pone.0095133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.G. Cencetti, G. Santin, A. Longa, E. Pigani, A. Barrat, C. Cattuto, S. Lehmann, M. Salathe, B. Lepri, Digital proximity tracing in the covid-19 pandemic on empirical contact networks, 2020, medRxiv. [DOI] [PMC free article] [PubMed]
  • 61.Sahneh F.D., Scoglio C. 2011 50th IEEE Conference on Decision and Control and European Control Conference. IEEE; 2011. Epidemic spread in human networks; pp. 3008–3013. [Google Scholar]
  • 62.A. Aleta, D. Martin-Corral, A.P. y Piontti, M. Ajelli, M. Litvinova, M. Chinazzi, N. Dean, M.E. Halloran, I. Longini Jr., S. Merler, et al., Modeling the impact of social distancing, testing, contact tracing and household quarantine on second-wave scenarios of the covid-19 epidemic, 2020, medRxiv. [DOI] [PMC free article] [PubMed]
  • 63.Luo Y., Zhang C., Zhang Y., Zuo C., Xuan D., Lin Z., Champion A.C., Shroff N. Acoustic-turf: acoustic-based privacy-preserving covid-19 contact tracing. 2020. arXiv:2006.13362 preprint.
  • 64.Cho H., Ippolito D., Yu Y.W. Contact tracing mobile apps for covid-19: privacy considerations and related trade-offs. 2020. arXiv:2003.11511 preprint.
  • 65.Hoh B., Gruteser M., Xiong H., Alrabady A. Proceedings of the 14th ACM Conference on Computer and Communications Security. 2007. Preserving privacy in gps traces via uncertainty-aware path cloaking. [Google Scholar]
  • 66.Reichert L., Brack S., Scheuermann B. Privacy-preserving contact tracing of covid-19 patients. IACR Cryptol. ePrint Arch. 2020;2020:375. [Google Scholar]
  • 67.Chan J., Gollakota S., Horvitz E., Jaeger J., Kakade S., Kohno T., Langford J., Larson J., Singanamalla S., Sunshine J., et al. 2020. Pact: Privacy Sensitive Protocols and Mechanisms for Mobile Contact Tracing. [Google Scholar]
  • 68.Troncoso C., Payer M., Hubaux J.-P., Salathé M., Larus J., Bugnion E., Lueks W., Stadler T., Pyrgelis A., Antonioli D., et al. 2020. Decentralized Privacy-Preserving Proximity Tracing. [Google Scholar]
  • 69.Gabriel K.R., Sokal R.R. A new statistical approach to geographic variation analysis. Syst. Zool. 1969;18(3):259. [Google Scholar]
  • 70.Erlebach T., Jansen K., Seidel E. Polynomial-time approximation schemes for geometric intersection graphs. SIAM J. Comput. 2005;6 [Google Scholar]
  • 71.Toussaint G.T. The relative neighbourhood graph of a finite planar set. Pattern Recognit. 1980;12(4):261–268. [Google Scholar]
  • 72.Delaunay B. Sur la sphère vide. a la mémoire de Georges Voronoï. Bull. Acad. Sci. URSS. 1934;6:793–800. [Google Scholar]
  • 73.Holme P. Network reachability of real-world contact sequences. Phys. Rev. E. 2005;71(4) doi: 10.1103/PhysRevE.71.046119. [DOI] [PubMed] [Google Scholar]
  • 74.Kovanen L., Karsai M., Kaski K., Kertész J., Saramäki J. Temporal motifs in time-dependent networks. J. Stat. Mech. Theory Exp. 2011;2011(11) [Google Scholar]
  • 75.Nicosia V., Tang J., Mascolo C., Musolesi M., Russo G., Latora V. Temporal Networks. Springer; 2013. Graph metrics for temporal networks. [Google Scholar]
  • 76.Heidari F., Papagelis M. International Conference on Complex Networks and Their Applications. Springer; 2018. Evonrl: evolving network representation learning based on random walks; pp. 457–469. [Google Scholar]
  • 77.Heidari F., Papagelis M. Evolving network representation learning based on random walks. Appl. Netw. Sci. 2020;5(1):1–38. doi: 10.1007/s41109-020-00257-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Zhang Z., Cui P., Zhu W. IEEE Transactions on Knowledge and Data Engineering. 2020. Deep learning on graphs: a survey. [Google Scholar]
  • 79.Leitch J., Alexander K.A., Sengupta S. Toward epidemic thresholds on temporal networks: a review and open questions. Appl. Netw. Sci. 2019;4(1):105. [Google Scholar]
  • 80.Weitz J.S., Beckett S.J., Coenen A.R., Demory D., Dominguez-Mirazo M., Dushoff J., Leung C.-Y., Li G., Măgălie A., Park S.W., et al. Modeling shield immunity to reduce covid-19 epidemic spread. Nature. 2020 doi: 10.1038/s41591-020-0895-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Yang S., Senapati P., Wang D., Bauch C.T., Fountoulakis K. Targeted pandemic containment through identifying local contact network bottlenecks. 2020. arXiv:2006.06939 preprint. [DOI] [PMC free article] [PubMed]
  • 82.Pu C., Li S., Yang J. Epidemic spreading driven by biased random walks. Physica A. 2015;432:230–239. [Google Scholar]
  • 83.Bestehorn M., Riascos A.P., Michelitsch T.M., Collet B.A. Continuum Mechanics and Thermodynamics. 2021. A markovian random walk model of epidemic spreading; pp. 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Hamilton W.L., Ying R., Leskovec J. Representation learning on graphs: methods and applications. 2017. arXiv:1709.05584 preprint.
  • 85.Chang S., Pierson E., Koh P.W., Gerardin J., Redbird B., Grusky D., Leskovec J. Mobility network models of covid-19 explain inequities and inform reopening. Nature. 2021;589(7840):82–87. doi: 10.1038/s41586-020-2923-3. [DOI] [PubMed] [Google Scholar]
  • 86.L. Hébert-Dufresne, B.M. Althouse, S.V. Scarpino, A. Allard, Beyond r0: heterogeneity in secondary infections and probabilistic epidemic forecasting, medRxiv, 2020. [DOI] [PMC free article] [PubMed]
  • 87.Lloyd-Smith J., Schreiber S., Kopp P.E., Getz W. Superspreading and the effect of individual variation on disease emergence. Nature. 2005;438 doi: 10.1038/nature04153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Changruenngam S., Bicout D.J., Modchang C. How the individual human mobility spatio-temporally shapes the disease transmission dynamics. Sci. Rep. 2020;10(1):1–13. doi: 10.1038/s41598-020-68230-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Rocha L.E., Masuda N. Individual-based approach to epidemic processes on arbitrary dynamic contact networks. Sci. Rep. 2016;6 doi: 10.1038/srep31456. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Tao Z., Zhongqian F., Binghong W. Epidemic dynamics on complex networks. Prog. Nat. Sci. 2006;16(5):452–457. [Google Scholar]
  • 91.Torres L., Chan K.S., Tong H., Eliassi-Rad T. Node immunization with non-backtracking eigenvalues. 2020. arXiv:2002.12309 preprint.
  • 92.Pastor-Satorras R., Castellano C., Van Mieghem P., Vespignani A. Epidemic processes in complex networks. Rev. Mod. Phys. 2015;87 [Google Scholar]

Articles from Big Data Research are provided here courtesy of Elsevier

RESOURCES