Skip to main content
IEEE - PMC COVID-19 Collection logoLink to IEEE - PMC COVID-19 Collection
. 2021 Feb 9;9:26196–26207. doi: 10.1109/ACCESS.2021.3058206

Leveraging Network Science for Social Distancing to Curb Pandemic Spread

Satyaki Roy 1,, Andrii Cherevko 2, Sayak Chakraborty 3, Nirnay Ghosh 3, Preetam Ghosh 2
PMCID: PMC8545212  PMID: 34812379

Abstract

COVID-19 has irreversibly upended the course of human life and compelled countries to invoke national emergencies and strict public guidelines. As the scientific community is in the early stages of rigorous clinical testing to come up with effective vaccination measures, the world is still heavily reliant on social distancing to curb the rapid spread and mortality rates. In this work, we present three optimization strategies to guide human mobility and restrict contact of susceptible and infective individuals. The proposed strategies rely on well-studied concepts of network science, such as clustering and homophily, as well as two different scenarios of the SEIRD epidemic model. We also propose a new metric, called contagion potential, to gauge the infectivity of individuals in a social setting. Our extensive simulation experiments show that the recommended mobility approaches slow down spread considerably when compared against several standard human mobility models. Finally, as a case study of the mobility strategies, we introduce a mobile application, MyCovid, that provides periodic location recommendations to the registered app users.

Keywords: Social distancing, network science, clustering, optimization, homophily

I. Introduction

COVID-19 has had an indelible imprint on human life and upended public health standards and social and economic order [1]. Nearly 1.7 million people have been reported dead worldwide by December 2020. However, the actual death numbers are likely to be greater due to want of testing, reporting and problems identifying cause of death [2], as some countries consider hospital deaths, while others factor in deaths at homes. The unprecedented mass hysteria surrounding this ongoing pandemic has bled into the businesses around the world, as millions of enterprises are projected to face extinction and families file for unemployment [3].

The road to a bonafide vaccination for COVID is long and arduous, particularly because they entail months, and sometimes years, of testing. However, the scientists are racing against time to come up with a vaccine by next year. At present 48 vaccines are under clinical trials on humans and nearly 88 being subjected to animal testing for best outcomes [4], making extensive social distancing our best bet to mitigate contagion. Health officials are continuing to harp on the role of physical distancing and self-quarantine measures towards flattening the curve [5]. There is a consensus in the scientific community regarding the possibility of contagion mitigation through the use of face coverings, personal hygiene, and by avoiding crowded and poorly ventilated places [6]. The clusters of infected populations were shown to be more likely in occupational or community settings, lending further credence to the significance of physical distancing [7]. The white collar jobs are relying on work-from-home and virtual communication, while the blue collar workers are being advised to abide by the 6 feet distance rule. There is a negative association between the number of days of lockdown and the reported COVID-19 cases per million [8].

A. Related Works

The lack of prior knowledge on COVID-19 leaves the policymakers ill-equipped to design mitigation strategies. The research community of epidemiologists, clinicians and computer scientists are applying their expertise to seek out factors and their implications on contagion as well as economic downturn [9]. First, attempts are being made to apply machine learning (ML) to build prediction models on epidemiological and clinical data. Given existing clinical data, prediction models [10] and therapeutic approaches can help identify vulnerable groups [11], [12]. Epidemiologists are trying to identify spread dynamics of COVID-19. Holmdahl and Buckee [13] analyze the pros and cons of forecasting models that make predictions through curve fitting or mechanistic models, while supervised and unsupervised ML is helping trace the trends in infection dynamics [14]. Khan et al. used regression tree analysis, cluster analysis and principal component analysis on Worldometer infection count data to gauge the variability and effect of testing in prediction of confirmed cases [15]. Roy et al. perform regression analysis to identify pre-lockdown factors that affect the post-lockdown pandemic numbers [16] and topic modeling to find the least and worst affected economic sectors in the US [17].

Second, there have been efforts to modify the SEIRD model to study the effects of demography, immunity and social distancing on infection spread. SEIRD assumes that the susceptible person is exposed when he is in contact with an infected person (see Sec. II-A), implying that its accuracy depends on the correctness of knowledge of the epidemiological status of the individuals. However, with regard to COVID-19, (1) it may be hard to pinpoint when the susceptible person transitions to exposed; (2) a person is not deemed infected until tested positive. These factors can mislead SEIRD estimates. Gharakhanlou employed the SEIRD model to create an agent-based simulation to demonstrate the effects of social contact and propose mitigation measures to contain the spread of COVID-19 in Urmia city, Iran [18]. Bedi et al. proposed a modified SEIRD that considers a certain section of the exposed population to be infectious. They compare the COVID-19 projections made by their model on the different states of India against those from the Long Short-Term Memory (LSTM) model [19]. Ghanam et al. discuss a bayesian approach to estimate the parameters for the SEIRD model and quantify the impact of government intervention measures on infection spread [20]. Lattanzio et al. studied the relationship of lockdown and mobility in Lombardy and London as well as the ill-effects of flouting social distancing regulations [21]. Third, tracing contact using mobile apps has emerged as an approach to enforce physical distancing. Kretzschmar et al. evaluate the importance of timely contact tracing using a stochastic mathematical model with explicit time delays [22]. Ferretti et al. argue that app-based contact tracing together with virus-testing programs may help restrict further spread [23]. Ahmed et al. discuss the workings of the existing contact tracing apps based on proximity and duration of contact with infected individuals [24]. Campbell et al. designed a puzzle-game on top of an interactive learning environment, where players prepare for an outbreak on a social contact network and subsequently quarantine people to quell the epidemic [25]. Nadini et al. [26] created a mobile application that combines the features of InfluenzaNet [27] and Flutracking [28]. InfluenzaNet and Flutracking both perform an online survey to create a repository of symptoms of patients from geographic locations, with the objective to monitor spread and identify risk factors from symptoms.

B. Contributions

Given a closed region (say, grocery store, queue at a bus stop, auditorium, stadium, etc.) where individuals are prone to high physical contact, we propose three optimization strategies to recommend new locations of individuals to curb pandemic spread. These optimizations operate on the same social contact network, yet vary on the basis of the underlying network science principle and epidemic model. While optimizations 1 and 2 minimize infection by eliminating contact and network clustering among susceptible and infected individuals in the SEIRD epidemic model, optimization 3 employs homophily on a modified SEIRD model (inspired from Bedi et al. discussed in Sec. I-A) where a population of the exposed asymptomatic (or untested) persons are spreaders of infection. Second, we introduce a new metric, called contagion potential, that quantifies the infectivity of an individual.

We carry out extensive simulation experiments on a small region in New York City to demonstrate that the three social distancing optimization strategies curb the spread of infection when compared against a random and two standard human mobility models, namely least action trip planning and social network theory. We also study how well these approaches preserve the network science principles of homophily and clustering as well as the effect of the epidemic parameters on the overall performance. Finally, we introduce a mobile application, called MyCovid, that presents a case study on the three optimization strategies and guide the registered users’ mobility to minimize contagion. With prior user permission, it can also create a repository of mobility traces and enable research on informing mobility during future outbreaks.

This paper is organized as follows. In Sec. 2 we cover preliminary concepts and system model. In Sec. 3, we present the three optimization strategies and the notion of contagion potential – a new metric to gauge infectivity. Sec. 4 has been dedicated to the experimental results, where we analyze the performance of the proposed optimization w.r.t. epidemic models, human mobility, scalability, parametric variations, etc., and introduce the features of the MyCovid app. Finally, we conclude the paper and discuss future works in Sec. 5.

II. Preliminary Concepts and System Model

We first discuss the SEIRD epidemic model and key network science concepts used in the paper (viz., network clustering and homophily), followed by the experimental scenario.

A. SEIRD Epidemic Model

We adapt the susceptible-exposed-infected-recovered-death (SEIRD) epidemic model [29]. The susceptible (S) class comprises individuals who are not exposed to the infection. Once exposed to infected individuals, they may transfer to the exposed (E) category, and this transition is controlled by a rate Inline graphic. The E class are asymptomatic or untested individuals, who transition to the (tested) infected (I) class with probability Inline graphic. The individuals in Inline graphic transition to another state with a probability Inline graphic; this other state can be either recovered (R) or dead (D) with probabilities Inline graphic and Inline graphic, respectively, as shown below. Note that Inline graphic, where Inline graphic is the basic reproduction number that has a median value of 3, but can be equal to 5.7 or even more as per previous literature [30], [31]. Thus, unlike, (Inline graphic), Inline graphic is not a transition probability.

A.

Modified SEIRD Model: We classify the exposed population Inline graphic into Inline graphic and Inline graphic (i.e., Inline graphic), where the individuals in Inline graphic are the population of asymptomatic (or untested) individuals that do not transition to Inline graphic or Inline graphic, while Inline graphic are asymptomatic (or untested) individuals who transition to Inline graphic or Inline graphic states. Moreover, the susceptible individuals (Inline graphic) may transition to exposed (Inline graphic) category, if they come in contact with either infected (Inline graphic) or spreaders or vectors, i.e., Inline graphic individuals, as we show below:

A.

Eqs. 3, 4 are common for both SEIRD models. In case of modified SEIRD, we replace Eqs. 1, 2 of the original SEIRD with Eqs. 5 and 6. It is still difficult to identify the exposed individuals who act as vectors (i.e., Inline graphic), and with more testing, some of them may be identified as infected.

B. Key Network Science Concepts

The proposed social distancing optimization measures (delineated in Sec. III) are built upon two concepts of network science, particularly social network analysis, as follows.

1). Clustering

It is a tendency of nodes to form tightly knit groups [32]. In an undirected graph Inline graphic, clustering coefficient of any node Inline graphic is calculated as:

1).

In the above equation Inline graphic is the number of triangles node Inline graphic participates in and Inline graphic is its degree. The average clustering coefficient (Inline graphic) of the undirected graph Inline graphic is given by –

1).

Given two groups (or labels) of nodes marked in red and blue, Fig. 1 shows three clusters demarcated in dotted boundaries. On a scale of 0 and 1, this network has Inline graphic.

FIGURE 1.

FIGURE 1.

Clustering and Homophily. Three clusters demarcated by boxes, where a node belongs to either group 1 or 2 (differentiated using red and blue colors, respectively).

2). Homophily

It is the tendency of a node to share links with other nodes with similar characteristics (i.e., groups or labels). Homophily (literally meaning love of the same) causes nodes to preferentially attach to similar nodes resulting in relationships like geographical proximity, friendship, etc. [33]. While network homophilicity is often verified using metrics such as dyadicity and heterophilicity [34], E-I index can be a simple measure for homophily [35]. It is calculated as the difference between between-group ties and within-group ties, divided by the total number of links in the network. Complete heterophily is quantified by an E-I index of 1, and complete homophily by a E-I score of −1. The network in Fig. 1 has E-I index = −0.6, suggesting that it is highly homophilic.

C. Scenario

We simulate a region (of dimension Inline graphic square units) where mobile individuals Inline graphic are confined. Each individual must initially belong to one of these epidemic states: susceptible or infected. From time to time, these individuals may come within some preassigned contact threshold Inline graphic (say, 6 ft. for COVID-19 [36]) of one another, allowing for the susceptible person to get exposed to the infection from an already infected person. The location of each person is known, and it is possible to capture the dynamics of physical contact between people by creating a contact graph (refer Sec. III), where the individuals are vertices (or nodes) and bidirectional links exist between any two persons within distance Inline graphic at any given time Inline graphic. The system administrator may apply the proposed social distancing optimization approaches (discussed in Sec. III) on the time-varying contact graph at time Inline graphic and suggest a new position of a person within a distance threshold Inline graphic of its current location, in order to minimize infection spread. A person may or may not abide by the recommendation of the administrator.

D. Human Mobility Model

1). Least Action Trip Planning

This mobility model is based on the notion that humans tend to consider distance to be a crucial criterion for deciding the next destination, termed waypoint [37]. In other words, for any individual, the likelihood of choosing a certain waypoint is directly proportional to the proximity to his current location. Given a current waypoint Inline graphic, the probability of selecting waypoint Inline graphic is:

1).

Here, Inline graphic is the Euclidean distance between Inline graphic and Inline graphic, and Inline graphic is the weighing factor, a positive constant, that characterizes the preference to waypoints. If Inline graphic, a waypoint has an equal likelihood of being visited, while higher Inline graphic causes the closer waypoints to get a higher likelihood. For our experiment, we consider Inline graphic; this is based on the fact that LATP is shown to produce mobility traces that match the real GPS traces very well when Inline graphic lies between 1 and 3 [38].

2). Social Network Based Mobility

People incorporate social interactions into their mobility decisions. A person visits areas where he may meet his kin. We implement the social network theoretic (SNT) model [39] for which we generate a (undirected) friendship network where the friends of a node are its 1-hop neighbor. Any node Inline graphic will choose its next waypoint with likelihood, on the basis of its the social affinity Inline graphic, by the formula Inline graphic. Given individual nodes Inline graphic located at waypoint Inline graphic, the social affinity for waypoint Inline graphic having Inline graphic nodes, is calculated as follows:

2).

In the above equation, Inline graphic is the ratio between the social affinity to the total population of Inline graphic. Social affinity of a person to a zone is measured as the number of friends (i.e., 1-hop neighbors in the friendship graph) who are currently located in that given zone.

III. Approach

Consider a set of individuals Inline graphic are in moving in a closed region of dimension Inline graphic square units and intermittently coming in close contact. Given Inline graphic time slots, we create a contact graph Inline graphic for Inline graphic, where the nodes Inline graphic are the individuals and edges Inline graphic denote contact between (the individuals represented as) nodes Inline graphic that are within a contact threshold of Inline graphic for at least a prespecified duration of time within the current time slot Inline graphic. The neighbor-list of a node Inline graphic, Inline graphic, is the set of individuals that are within distance Inline graphic at time Inline graphic. Each individual Inline graphic must belong to exactly one of Inline graphic states, where Inline graphic.

A. Optimization Formulation

We discuss the intuition and formulation of the three social distancing optimization strategies. The first two approaches opt-1 and opt-2 apply the standard SEIRD model, while opt-3 makes use of the modified SEIRD (refer Sec. II-A).

1). Objective

Given the current location at time Inline graphic, these optimizations calculate new locations for each individual to minimize contact that may potentially cause infection spread.

2). Constraint

The optimizations have a common constraint (Eqs. 11. 13, 15) to ensure that the recommended location is not unrealistically far from current location of an individual. We define a distance threshold Inline graphic that dictates the maximum distance between the current and recommended locations.

B. Approach 1

In the SEIRD model, a susceptible person may get exposed only upon contact with the infected individual, making the latter the only spreaders of infection in the contact network at time Inline graphic, Inline graphic. Hence, in opt-1 we attempt to curb contagion by placing the nodes in a manner that we may minimize contact between the susceptible and infected people in the network.

B.

Here Inline graphic if Inline graphic and 0 otherwise, and Inline graphic denotes the set of exposed individuals at time Inline graphic. We consider susceptible as well as exposed in objective (Expression 10), because one cannot differentiate between susceptible from the asymptomatic (or untested) exposed individuals, i.e., they are indistinguishable for the optimizer. Expression 11 ensures that recommended new location is within distance threshold Inline graphic from the present location (Inline graphic) of an individual Inline graphic.

C. Approach 2

We expect the clusters in Inline graphic to have higher contact leading to greater contagion. Recall from Sec. II-B1, the clustering tendency of a node is proportional to the number of triangles it participates in. In this approach, we attempt to minimize the triangles involving at least 1 susceptible/exposed and 1 infected individual. Fig. 2a shows the four triangle configurations, that the following optimization eliminates.

C.

Here Inline graphic if the following conditions hold:

  • 1)

    Inline graphic, and

  • 2)

    Inline graphic and Inline graphic

Inline graphic otherwise.

FIGURE 2.

FIGURE 2.

Social distancing optimizations. (a) the four triangular configurations involving at least one susceptible (or exposed) and one infected individual, (b) links between susceptible (Inline graphic) and infected (Inline graphic) individuals are eliminated in opt-1 (top), whereas links showing a high difference in CP are removed in opt-3 (bottom).

Exp. 12 invokes function Inline graphic to minimize the occurrence of triangles with at least one susceptible/exposed and infected.

D. Approach 3

In the modified SEIRD model (discussed in Sec. II-A), we define the exposed population as Inline graphic, where the individuals in Inline graphic are exposed, yet act as vectors of infection. Since we have no definitive knowledge of the individuals in Inline graphic, it becomes imperative to devise a metric to gauge the likelihood that a person may act as spreader.

1). Contagion Potential (CP)

It is the ability of an individual to act as the spreader of infection. The instantaneous CP of Inline graphic (s.t., Inline graphic) at time Inline graphic is proportional to the number of infected people in its current neighborhood (Inline graphic):

1).

Here, Inline graphic is the maximum number of neighbors of any node at time Inline graphic. The overall CP till time Inline graphic, Inline graphic, is calculated as the mean over instantaneous values, as follows:

1).

2). Intuition

Considering the modified SEIRD model, in Fig. 3 we illustrate an individual node at different timepoints Inline graphic with a different set of neighboring nodes. Initially the reference node (shown as a large circle) encounters very few infected neighbors (marked red) and is less likely to have contracted the infection and has a low CP (hence colored green). Over time, it transitions to high CP score upon contact with more high CP individuals. We would like to point out that CP can have significant implications on the accuracy of the traditional SEIRD model where we consider a binary infection status. Due to imperfections in the testing method or lack of testing, an infected individual is mistakenly deemed susceptible (or exposed). Under such a circumstance, CP, which gauges his encounters with other high CP or infected nodes, can serve as an alternative measure to quantify the likelihood of his being infected (and therefore, “infective”).

FIGURE 3.

FIGURE 3.

Evolution of contagion potential in opt-3. Each panel shows the location of individuals at a certain timepoint Inline graphic (Inline graphic) and the colors green and red show their CP.

In the third optimization (opt-3), we calculate the CP for every individual not in the recovered or dead category. We intuit that contagion can be contained if the difference between the CP of any pair of individuals Inline graphic in contact (i.e., Inline graphic) is low. In other words, high Inline graphic may imply that the individual with the lower CP (say Inline graphic) may contract the infection from Inline graphic. Going back to the discussion in Sec. II-B2, if two individuals having same CP are considered to belong to the same groups (and vice versa), the current optimization attempts to create a homophilic network by eliminating links between individuals with disparate CP (see Expr. 14). In other words, we put nodes with similar CPs into the same groups and minimize edges between different groups having dissimilar CPs.

2).

Above expression is a generalization of opt-1 where a person holds a binary status of infected or not infected (as shown in Fig. 2b top). In effect, opt-3 turns the infection status into a continuous variable in range Inline graphic (see Fig. 2b bottom).

IV. Result

We create a simulation environment in Python to validate the proposed methods. Let us discuss the experimental results in the following subsection: (1) effect of infection spread and contagion potential, (2) clustering and homophily, (3) parametric variations, (4) scalability, (5) human mobility, (6) MyCovid app, and (7) effects of flouting recommendations.

TABLE 1. Default Parameter Values.

Parameter Notation Value
Number of iterations 5
Population size Inline graphic 100
Simulation area Inline graphic 100 ft, 100 ft
Simulation duration Inline graphic 100 minutes
SEIRD parameters [30] Inline graphic Inline graphic
Contact threshold Inline graphic 6 ft
Distance threshold Inline graphic 6 ft
No. of waypoints 40
Edge prob. friend. net. Inline graphic 0.1
LATP parameter Inline graphic 1.2

Default Parameters: We perform the experiments, each of duration 100 minutes, on a population of 100 individuals and contact rate Inline graphic. We plot the mean curve from 5 iterations, showing the cumulative count, which we measure as the sum of infected, recovered and dead individuals at any given time. To ensure fairness of comparison, the individuals have the same initial starting location and epidemic status in each run of the experiment. The contact threshold is Inline graphic ft. and individuals move within distance threshold Inline graphic ft. on an average at every minute. Strategies opt-1, opt-2 and opt-3 are run on SEIRD and modified SEIRD, respectively. Recall from our discussion in Sec. III-D1, contagion potential (CP) is calculated over a period of time. Hence, for the experiments on opt-3 utilizing CP (on the modified SEIRD model), we follow random mobility till Inline graphic minutes. This allows each individual to achieve a steady CP, before opt-3 is invoked.

A. Effect on Infection Spread and Contagion Potential

1). Contagion Potential

Recall from Sec. II-A, the modified SEIRD model allows a fraction of exposed individuals (Inline graphic) to act as vectors. In Sec. III-D1, we introduce contagion potential (CP) (Sec. III-D1) to gauge how likely an exposed or infected individual is to act as the vector. Fig. 4 shows the CP of each individual in blue lines and the mean CP of the population in red in the modified SEIRD. Note, the mean CP declines over time as more people recover or die and their respective CPs become 0.

FIGURE 4.

FIGURE 4.

Contagion potential (CP) of an individual (shown in blue) and declining mean CP (shown in red) over time.

We show the efficacy of CP by introducing another simple measure, called infectivity, measured as the ratio between the number of times an (exposed or infected) individual transmits infection to the total number of contacts with susceptible individuals. Fig. 5 depicts that the overall infectivity of an individual correlates with mean CP, suggesting that it can be an effective metric to identify latent spreaders of infection.

FIGURE 5.

FIGURE 5.

Change in infectivity with contagion potential.

B. Clustering and Homophily

We discuss how the social distancing optimization opt-2 minimizes significant clusters, while opt-3 maximizes homophily in the contact network – both of which contribute towards containment of contagion (refer Sec. II-B2). Fig. 6a shows that opt-2 has fewer number of significant clusters (i.e., triangles containing at least one infected and one susceptible/exposed). Similarly, to gauge whether opt-3 achieves homophilic contact network with respect to CP, we assume two individuals Inline graphic to be “similar” only if Inline graphic. In other words, if Inline graphic, it is deemed to similar to Inline graphic and Inline graphic with Inline graphic and Inline graphic. We compare the E-I index (see Sec. II-B2) of random vs opt3 at each time point. Fig. 6b shows that, not considering the recovered and dead, opt-3 has lower E-I index (i.e., it creates homophilic networks with a lower difference in CP between the individuals) than random.

FIGURE 6.

FIGURE 6.

Verification of network science principles. Comparison of (a) number of clusters (or triangles) with at least one infected and one susceptible/exposed in the contact graph for random and opt-2, (b) E-I index for random and opt-3.

C. Parametric Variations

We discuss in Sec. II-A that the modified SEIRD model allows a set of exposed individuals to also act as potential spreaders. In this section, we analyze how opt-3 performs upon variation of the parameters in the SEIRD epidemic model, namely, (1) number of initial infected persons, (2) susceptible to vector transition probability Inline graphic and (3) contact rate (Inline graphic).

Number of initial infected persons and susceptible to exposed vector (Inline graphic) transition probability. Fig. 7 shows that the cumulative count increases with initial infected population. Note that opt-3 outperforms random mobility for all the three values of initial infected fraction Inline graphic. Fig. 8 shows the variation in Inline graphic for fixed initial infected fraction, where opt-3 again outperforms random mobility. It is noteworthy that Inline graphic curve (shown in green) shows lower cumulative Inline graphic. This is because, Inline graphic causes a higher number of individuals to become part of the exposed (Inline graphic) and not feature in the cumulative count comprising Inline graphic epidemic states.

FIGURE 7.

FIGURE 7.

Cumulative count for varying number of initial infected individuals in random and opt-3 approaches.

FIGURE 8.

FIGURE 8.

Cumulative count for varying susceptible to vector transition probability in random and opt-3 approaches.

Contagion for Varying Contact Rates: For two separate contact rate Inline graphic, we compare the infection spread in opt-3 against random individual mobility. Fig. 9 shows that opt-3 exhibits a lower infection count than their random counterpart in the modified SEIRD model. On the other hand, for the modified SEIRD with branching timepoint set at Inline graphic minutes.

FIGURE 9.

FIGURE 9.

Cumulative count for varying contact rates Inline graphic in random and opt-3 approaches.

D. Scalability

Let us analyze the performance of opt-1, opt-2 and opt-3 for larger populations. We consider population sizes of Inline graphic and record the simulation time needed for the cumulative count of infected, recovered and dead individuals to reach 50% and 75% of the overall population sizes.

Scalability and computational cost are critical factors involving large population scenarios; we are trying to study this phenomenon with the MyCovid app (introduced in Sec. IV-F and Fig. 10). Experimentally, we show that social distancing strategies slow contagion down significantly as compared to the random counterparts. Figs. 11a, 11b, 12a and 12b show that while the random mobility results (blue bars) in cumulative count reaching 50% and 75% of total population in less than 25 minutes, the green and red bars corresponding to opt-1, opt-2 and opt-3 correspond to 100 minutes, suggesting that cumulative count does not reach 50% and 75% within the stipulated duration of 100 minutes.

FIGURE 10.

FIGURE 10.

MyCovid App. User (a) signs up on the registration page, (b) enters epidemic status (susceptible, infected or recovered), (c) locations (through GPS), (d) summary of user information on the app, and (e) Firebase Realtime Database schema.

FIGURE 11.

FIGURE 11.

Time in minutes taken by the opt-1, opt-2 and random to reach 50% and 75% cumulative count for variation in population sizes.

FIGURE 12.

FIGURE 12.

Time in minutes taken by the opt-3 and random to reach 50% and 75% cumulative count for variation in the population sizes.

E. Human Mobility

For the experiments on human mobility, we recreate the map of New York City, which comprises 5 boroughs – Manhattan, Bronx, Brooklyn, Queens and Staten Island. List of NYC boroughs and districts are taken from the official website of New York city [40] and the latitude and longitude of the 5 boroughs and 59 districts (or neighborhoods) are taken from the Python library for geocoding services, called GeoPy [41]. We select 40 locations (green dots in Fig. 13) in and around 2 neighborhoods (blue dots) selected as cluster centers from Manhattan, NYC (red cross). Distance between pair of points is calculated using geodesic distance function of GeoPy. An individual moves to any of the 40 locations on the map.

FIGURE 13.

FIGURE 13.

The 40 locations (represented as green dots) in and around the 2 neighborhoods (shown in blue) of Manhattan, New York City selected as cluster centers. Each person must be placed at one the 40 points and have 90% contact probability with other persons at the same location at the same time.

We compare the performance of the social distancing optimization strategies against two human mobility models, namely LATP and SNT (discussed in Sec. II-D). Figs. 14 and 15 shows that the cumulative for SNT and LATP is significantly higher than that of the random mobility models. The SNT mobility, which relies on social interaction, requires a friendship graph with bidirectional link between friends of the same number of nodes as the total population (see Sec. II-D). We model the friendship graph as undirected Erdos-Renyi random graph [42] with edge probability Inline graphic. The results show that cumulative for the optimized mobility strategy, particularly opt-3, hit the plateau much earlier compared to the human mobility models. Overall, the proposed optimizations exhibit a notably lower rate of contagion than extant mobility approaches. It is worth mentioning that both SNT allows people to travel to waypoints with more social ties, while LATP causes individuals to prefers shorter trips. Either of these two conditions can result in notably higher social contact, necessitating the application of the proposed social distancing measures.

FIGURE 14.

FIGURE 14.

Comparison of the cumulative count for opt-1 and opt-2 against the SNT and LATP mobility models.

FIGURE 15.

FIGURE 15.

Comparison of the cumulative count for opt-3 against the SNT and LATP mobility models.

F. MyCovid APP

MyCovid App is a mobile platform that constructs the temporal contact graph from the locations of the users and invokes one of the three optimization strategies (discussed in Sec. III-BIII-D) to inform their next location. Unlike the standard applications ([26][28]) discussed in Sec. I that construct a surveillance repository based on socio-demographic and medical characteristics, MyCovid App works at a granular level by minimizing the social ties between individuals that may potentially lead to spread of infection.

The user can register and use the fully functional MyCovid App by applying the following steps: (1) each user signs up on the registration page of MyCovid; (2) the app periodically transfers the last updated epidemic status (susceptible, infected or recovered) and location (lat-long coordinates through GPS) to the Firebase Realtime Database [43] – a cloud-hosted Google platform – connected to the server; (3) the server retrieves the information and runs the optimizer; (4) the respective new location recommended by the optimization is then sent back to each user. This location coordinate is not saved in the database, since there is no certainty that the user will follow the recommendation besides also reducing the load on the central server.

Fig. 10 shows the features of the MyCovid app. With express permission of the registered user, MyCovid creates a repository of user mobility data over time, which will serve as a testing set and help refine the proposed optimization strategies. It employs google API (fusedLocationProviderApi with settings of LocationRequest.PRIORITY_HIGH_ACCURACY and permission ACCESS_FINE_LOCATION) in the high accuracy mode that operates on WiFi/cellular in conjunction with GPS to minimize error in location tracking – a standard technology for location identification on android. One possible limitation of MyCovid is that the precision of locating smart devices can be 20 meters in worst case [44] (especially in indoor settings), although it is generally more accurate [45]. At present, MyCovid works on Android devices only, but we are working towards MyCovid on iOS platforms. We shared the source files for MyCovid app, with documentation and demonstrative video, on GitHub (https://github.com/satunr/ COVID-19/tree/master/Network%20Science) so that the network administrator can employ any location API of his choice to achieve the best results.

G. Comparison of the Strategies

The MyCovid app is customized to invoke any one of the three optimization strategies, and the choice of optimization is dependent on the underlying epidemic model as well as the contact network. As discussed in Sec. I-B, the opt-1 and opt-2 approaches both work on the SEIRD epidemic model. Fig. 11 shows that opt-1 outperforms opt-2 for most population sizes. This can be explained by considering a simple contact network shown in Fig. 16a, where the social ties that can cause contagion are marked red. The objective function of opt-1 (Expression 10) tries to eliminate all red ties between susceptible and infected individuals (Fig. 16b). On the other hand, opt-2 only eliminates red ties that belong to a cluster (Fig. 16c). It is worth noting that the MyCovid app may select opt-2 (Expression 12) when there are multiple or large groups in the contact graph that have the triangles (discussed in Sec. II-B1) as their building blocks. Finally, we do not compare opt-1 and opt-2 against opt-3 because unlike the first two, the latter operates on the modified SEIRD model where a part of the exposed population are perennially infectious. Also, opt-3 is essentially a generalization of opt-1 strategy, since it assumes that the infectivity of an individual is a continuous value between 0 and 1 (and not a binary like in opt-1).

FIGURE 16.

FIGURE 16.

Comparison of opt-1 and opt-2 strategies (with susceptible and infected nodes colored green and blue, respectively). (a) input contact network, (b) opt-1, and (c) opt-2. The re-positioned individuals are labeled as rp.

H. Effects of Flouting Recommendations

We know that the registered user of MyCovid app. are free to accept or reject the recommendation of the social distancing approaches. We study the cumulative count for two scenarios where each individual ignore the recommendation 5% and 50% of the times (and adopt random mobility) for opt-1 and opt-2. Fig. 17 shows that, in either case, flouting the recommendations 50% of the times result in higher contagion manifested in overall increase in the cumulative count.

FIGURE 17.

FIGURE 17.

Cumulative count for opt-1 and opt-2 where people ignore the recommendation 5% and 50% of the times.

V. Limitations of the Study

One possible limitation in the proposed optimization strategies is the computational cost associated with scalability. In a population of Inline graphic individuals, the optimizer tunes Inline graphic parameters to achieve the socially distanced placement of individuals. To demonstrate this, we estimate the running time (in seconds) (for the optimization strategies) for a population of 50 - 250 individuals on a Mac OS Intel(R) Core(TM) i7-7820HQ CPU, 2.90GHz and 16 GM RAM system. Fig. 18 shows the near-exponential growth in the running time in seconds, necessitating scalable versions of these strategies in the future. We are in the early stages of designing greedy strategies that employ Markov Chain Monte Carlo methods and grid-based area partition to achieve time-efficient convergence.

FIGURE 18.

FIGURE 18.

The running time (in seconds) of the three optimization strategies for population of 50 - 250 individuals.

VI. Conclusion

In this work we leverage network science principles, namely network clustering and homophily to conceive network optimization strategies to enable socially distanced human mobility in two different SEIRD epidemic model scenarios. Based on current location, the optimizations recommend new locations of individuals, in order to minimize contact potentially resulting in contagion. We present a new measure of infectivity, termed contagion potential (CP), that addresses some of the challenges faced by the SEIRD model. Extensive simulation experiments show that our proposed approaches slow contagion better than several standard human mobility models. Finally, we present a mobile app., MyCovid, as a case study that employs one of the three strategies to inform mobility of registered users. With express permission from the user, the app will also build a repository of human mobility that would aid the future extensions of this work.

There are a few future directions in this line of work. First, we will adapt the greedy heuristics to different stages of the epidemic cycle. For instance, in the early stages of outbreak, the objective may be to enforce rigorous social distancing to clamp down on contagion. Conversely, during the later stages of the outbreak the restrictions may be gradually and strategically relaxed. Second, we are currently utilizing the MyCovid app to build a comprehensive repository of the mobility traces of the registered individuals that will be used to test the efficacy of the proposed approximation methods. Finally, we are going to explore the data-sharing and privacy concerns of MyCovid app. users. Different behavioral studies may emerge from the fact that users exhibit variable levels of adherence to the location recommendations of the app.

Competing Interests

The authors have declared that no competing interests exist.

Biographies

graphic file with name roy-3058206.gif

Satyaki Roy received the Ph.D. degree in computer science from the Missouri University of Science and Technology, USA, in 2019. He is currently a Postdoctoral Research Associate with the Department of Genetics, University of North Carolina, Chapel Hill, NC, USA. His research interests include computational biology, network science and optimization, wireless sensor networks, epidemiology, machine learning, and parallel computing.

graphic file with name chere-3058206.gif

Andrii Cherevko received the B.S. and M.S. degrees in computer science from the Kharkiv Polytechnic Institute, Eastern Ukraine. He is currently pursuing the Ph.D. degree with the Biological Networks Laboratory, Department of Computer Science, Virginia Commonwealth University. He works with a variety of biological networks where he develops algorithms to sample and study specific network connections in network samples of different sizes.

graphic file with name chakr-3058206.gif

Sayak Chakraborty is currently pursuing the B.Tech. degree with IIEST Shibpur. His research interests include machine learning and deep learning and has participated in quite a few projects on related topics. He is exploring competitive coding and proficient in data structures and algorithms.

graphic file with name nghos-3058206.gif

Nirnay Ghosh was a Research Fellow with the Research & Security Innovation Lab for IoT (ReSILIoT), iTrust Research Center for Cyber Security, Singapore University of Technology and Design (SUTD), where he worked on privacy and access control issues related to IoT systems. He is currently an Assistant Professor with the Department of Computer Science and Technology (CST), Indian Institute of Engineering Science and Technology Shibpur (IIESTS). His research interests include IoT security, participatory sensing, and cloud computing.

graphic file with name pghos-3058206.gif

Preetam Ghosh received the B.S. degree in computer science from Jadavpur University, Kolkata, India, and the M.S. and Ph.D. degrees in computer science and engineering from The University of Texas at Arlington. He is currently a Professor with the Biological Networks Laboratory, Department of Computer Science, Virginia Commonwealth University. His research interests include algorithms, stochastic modeling and simulation, network science and machine learning related approaches in systems biology and computational epidemiology and mobile computing related issues in pervasive grids that has resulted in more than 170 conference and journal articles and several federally funded research projects from NSF, NIH, DoD, and US-VHA. He also serves as the Secretary/Treasurer of ACM SIGBio.

Funding Statement

This work was supported in part by the National Science Foundation under Grant CBET-1802588.

References

  • [1].(2020). Impact of COVID-19 on People’s Livelihoods, Their Health and Our Food Systems. [Online]. Available: https://www.who.int/news/item/13-10-2020-impact-of-covid-19-on-people's%-livelihoods-their-health-and-our-food-systems
  • [2].(2020). What is the Total Number of Confirmed Deaths. [Online]. Available: https://ourworldindata.org/covid-deaths
  • [3].(2020). The Effects of COVID-19 on International Labor Markets: An Update. [Online]. Available: https://www.brookings.edu/research/the-effects-of-covid-19-on-internati%onal-labor-markets-an-update/
  • [4].(2020). Coronavirus Vaccine Tracker. [Online]. Available: https://www.nytimes.com/interactive/2020/science/coronavirus-vaccine-tr%acker.html
  • [5].(2020). Coronavirus, Social and Physical Distancing and Self-Quarantine. [Online]. Available: https://www.hopkinsmedicine.org/health/conditions-and-diseases/coronavi%rus/coronavirus-social-distancing-and-self-quarantine
  • [6].Alwan N., Burgess R. A., Ashworth S., Beale R., Bhadelia N., Bogaert D., Dowd J., Eckerle I., Goldman L. R., Greenhalgh T., and Gurdasani D., “Scientific consensus on the COVID-19 pandemic: We need to act now,” Lancet, vol. 396, no. 10260, pp. e71–e72, 2020, doi: 10.1016/S0140-6736(20)32153-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [7].Dawood F., Ricks P., Njie G. J., Daugherty M., Davis W., Fuller J. A., Winstead A., McCarron M., Scott L. C., Chen D., and Blain A. E., “Observations of the global epidemiology of COVID-19 from the prepandemic period using Web-based surveillance: A cross-sectional analysis,” Lancet Infectious Diseases, vol. 20, no. 11, pp. 1255–1262, 2020, doi: 10.1016/S1473-3099(20)30581-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Hopman J. and Mehtar S., “Country level analysis of COVID-19 policies,” EClinicalMedicine, vol. 25, Aug. 2020, Art. no. 100500, doi: 10.1016/j.eclinm.2020.100500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Adhikari S. P., Meng S., Wu Y.-J., Mao Y.-P., Ye R.-X., Wang Q.-Z., Sun C., Sylvia S., Rozelle S., Raat H., and Zhou H., “Epidemiology, causes, clinical manifestation and diagnosis, prevention and control of coronavirus disease (COVID-19) during the early outbreak period: A scoping review,” Infectious Diseases Poverty, vol. 9, no. 1, p. 29, Dec. 2020, doi: 10.1186/s40249-020-00646-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [10].Wynants L., Van Calster B., Collins G. S., Riley R. D., Heinze G., Schuit E., Bonten M. M., Dahly D. L., Damen J. A., Debray T. P., and de Jong V. M., “Prediction models for diagnosis and prognosis of COVID-19: Systematic review and critical appraisal,” BMJ, vol. 369, Apr. 2020, Art. no. m1328, doi: 10.1136/bmj.m1328. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [11].Alimadadi A., Aryal S., Manandhar I., Munroe P. B., Joe B., and Cheng X., “Artificial intelligence and machine learning to fight COVID-19,” Physiolog. Genomics, vol. 52, no. 4, pp. 200–202, Apr. 2020, doi: 10.1152/physiolgenomics.00029.2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Randhawa G. S., Soltysiak M. P. M., El Roz H., de Souza C. P. E., Hill K. A., and Kari L., “Machine learning using intrinsic genomic signatures for rapid classification of novel pathogens: COVID-19 case study,” PLoS ONE, vol. 15, no. 4, Apr. 2020, Art. no. e0232391, doi: 10.1371/journal.pone.0232391. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Holmdahl I. and Buckee C., “Wrong but useful—What covid-19 epidemiologic models can and cannot tell us,” New England J. Med., vol. 383, no. 4, pp. 303–305, Jul. 2020, doi: 10.1056/NEJMp2016822. [DOI] [PubMed] [Google Scholar]
  • [14].Wang P., Zheng X., Li J., and Zhu B., “Prediction of epidemic trends in COVID-19 with logistic model and machine learning technics,” Chaos, Solitons Fractals, vol. 139, Oct. 2020, Art. no. 110058, doi: 10.1016/j.chaos.2020.110058. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Khan N., Naushad M., Fahad S., Faisal S., and Muhammad A., “COVID-2019 and world economy,” J. Health Econ., Forthcoming, Apr. 2020, doi: 10.2139/ssrn.3566632. [DOI]
  • [16].Roy S. and Ghosh P., “Factors affecting COVID-19 infected and death rates inform lockdown-related policymaking,” PLoS ONE, vol. 15, no. 10, Oct. 2020, Art. no. e0241165, doi: 10.1371/journal.pone.0241165. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Roy S., Dutta R., and Ghosh P., “Recreational and philanthropic sectors are the worst-hit US industries in the COVID-19 aftermath,” Social Sci. Humanities Open, vol. 3, no. 1, 2021, Art. no. 100098, doi: 10.1016/j.ssaho.2020.100098. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Mahdizadeh Gharakhanlou N. and Hooshangi N., “Spatio-temporal simulation of the novel coronavirus (COVID-19) outbreak using the agent-based modeling approach (case study: Urmia, Iran),” Informat. Med. Unlocked, vol. 20, Jan. 2020, Art. no. 100403, doi: 10.1016/j.imu.2020.100403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Bedi P., Shivani, Gole P., Gupta N., and Jindal V., “Projections for COVID-19 spread in India and its worst affected five states using the modified SEIRD and LSTM models,” 2020, arXiv:2009.06457. [Online]. Available: http://arxiv.org/abs/2009.06457
  • [20].Ghanam R., Boone E. L., and Abdel-Salam A.-S. G., “SEIRD model for qatar COVID-19 outbreak: A case study,” 2020, arXiv:2005.12777. [Online]. Available: http://arxiv.org/abs/2005.12777
  • [21].Lattanzio S. and Palumbo D., “Lifting restrictions with changing mobility and the importance of soft containment measures: A seird model of COVID-19 dynamics,” COVID-19 Econ. Res. Univ. Cambridge, May 2020. [Online]. Available: http://covid.econ.cam.ac.uk/lattanzio-palumbo-importance-of-soft-conta%nment-measures
  • [22].Kretzschmar M., Rozhnova G., Bootsma M. C., van Boven M., van de Wijgert J. H., and Bonten M. J., “Impact of delays on effectiveness of contact tracing strategies for COVID-19: A modelling study,” Lancet Public Health, vol. 5, no. 8, pp. e452–e459, 2020, doi: 10.1016/S2468-2667(20)30157-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Ferretti L., Wymant C., Kendall M., Zhao L., Nurtay A., Abeler-Dörner L., Parker M., Bonsall D., and Fraser C., “Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing,” Science, vol. 368, no. 6491, May 2020, Art. no. eabb6936, doi: 10.1126/science.abb6936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [24].Ahmed N., Michelin R. A., Xue W., Ruj S., Malaney R., Kanhere S. S., Seneviratne A., Hu W., Janicke H., and Jha S. K., “A survey of COVID-19 contact tracing apps,” IEEE Access, vol. 8, pp. 134577–134601, 2020, doi: 10.1109/ACCESS.2020.3010226. [DOI] [Google Scholar]
  • [25].(2014). Vax. [Online]. Available: https://github.com/digitalepidemiologylab/VaxGame
  • [26].Nadini M., Richmond S., Huang J., Rizzo A., and Porfiri M., “Design and feasibility study of the mobile application StopTheSpread,” IEEE Access, vol. 8, pp. 172105–172122, 2020, doi: 10.1109/ACCESS.2020.3022740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Koppeschaar C. E., Colizza V., Guerrisi C., Turbelin C., Duggan J., Edmunds W. J., Kjelsø C., Mexia R., Moreno Y., Meloni S., Paolotti D., Perrotta D., van Straten E., and Franco A. O., “InfluenzaNet: Citizens among 10 countries collaborating to monitor influenza in europe,” JMIR Public Health Surveill., vol. 3, no. 3, p. e66, Sep. 2017, doi: 10.2196/publichealth.7429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Dalton C., Durrheim D., Fejsa J., Francis L., Carlson S., d’Espaignet E. T., and Tuyl F., “Flutracking: A weekly australian community online survey of influenza-like illness in 2006, 2007 and 2008,” Communicable Diseases Intell. Quart. Rep., vol. 33, no. 3, pp. 316–322, 2009. [Online]. Available: https://www1.health.gov.au/internet/main/publishing.nsf/Content/cda-cdi%3303g.htm [PubMed] [Google Scholar]
  • [29].Hethcote H. W., “The mathematics of infectious diseases,” SIAM Rev., vol. 42, no. 4, pp. 599–653, Jan. 2000, doi: 10.1137/S0036144500371907. [DOI] [Google Scholar]
  • [30].Korolev I., “Identification and estimation of the SEIRD epidemic model for COVID-19,” J. Econometrics, vol. 220, no. 1, pp. 63–65, 2021, doi: 10.1016/j.jeconom.2020.07.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [31].Sanche S., Lin Y. T., Xu C., Romero-Severson E., Hengartner N., and Ke R., “Early release-high contagiousness and rapid spread of severe acute respiratory syndrome coronavirus 2,” Emerg. Infectious Diseases, vol. 26, no. 7, pp. 1470–1477, 2020, doi: 10.3201/eid2607.200282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Holland P. W. and Leinhardt S., “Transitivity in structural models of small groups,” Comparative Group Stud., vol. 2, no. 2, pp. 107–124, May 1971, doi: 10.1177/104649647100200201. [DOI] [Google Scholar]
  • [33].Kim K. and Altmann J., “Effect of homophily on network formation,” Commun. Nonlinear Sci. Numer. Simul., vol. 44, pp. 482–494, Mar. 2017, doi: 10.1016/j.cnsns.2016.08.011. [DOI] [Google Scholar]
  • [34].Karimi F., Génois M., Wagner C., Singer P., and Strohmaier M., “Homophily influences ranking of minorities in social networks,” Sci. Rep., vol. 8, no. 1, Dec. 2018, Art. no. 11077, doi: 10.1038/s41598-018-29405-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35].Bojanowski M. and Corten R., “Measuring segregation in social networks,” Social Netw., vol. 39, pp. 14–32, Oct. 2014, doi: 10.1016/j.socnet.2014.04.001. [DOI] [Google Scholar]
  • [36].(2020). Centers for Disease Control and Prevention—Social Distancing. [Online]. Available: https://www.cdc.gov/coronavirus/2019-ncov/prevent-getting-sick/social-d%istancing.html
  • [37].Lee K., Hong S., Kim S. J., Rhee I., and Chong S., “SLAW: Self-similar least-action human walk,” IEEE/ACM Trans. Netw., vol. 20, no. 2, pp. 515–529, Apr. 2012, doi: 10.1109/TNET.2011.2172984. [DOI] [Google Scholar]
  • [38].Solmaz G. and Turgut D., “A survey of human mobility models,” IEEE Access, vol. 7, pp. 125711–125731, 2019, doi: 10.1109/ACCESS.2019.2939203. [DOI] [Google Scholar]
  • [39].Musolesi M. and Mascolo C., “Designing mobility models based on social network theory,” ACM SIGMOBILE Mobile Comput. Commun. Rev., vol. 11, no. 3, pp. 59–70, Jul. 2007, doi: 10.1145/1317425.1317433. [DOI] [Google Scholar]
  • [40].(2020). Nyc—The Official Website of New York City. [Online]. Available: https://www1.nyc.gov/site/doh/data/data-publications/profiles.page
  • [41].(2020). Geopy: Geocoding Library for Python. [Online]. Available: https://github.com/geopy/geopy, 2020.
  • [42].Erdős P. and Rényi A., “On the evolution of random graphs,” in Publication of the Mathematical Institute of the Hungarian Academy OF Sciences. Budapest, Hungary: Hungarian Academy of Sciences, 1960, pp. 17–61. [Google Scholar]
  • [43].(2020). Firebase Realtime Database. [Online]. Available: https://firebase.google.com/docs/database
  • [44].(2020). Find and Improve Your Location’s Accuracy. [Online]. Available: https://support.google.com/maps/answer/2839911?co=GENIE.Platform%3DAndr%oid&hl=en
  • [45].(2020). Fused Location Provider. [Online]. Available: https://lembergsolutions.com/blog/fused-location-provider

Articles from Ieee Access are provided here courtesy of Institute of Electrical and Electronics Engineers

RESOURCES