Abstract
This paper is based on the observation that, during Covid-19 epidemic, the choice of which individuals should be tested has an important impact on the effectiveness of selective confinement measures. This decision problem is closely related to the problem of optimal sensor selection, which is a very active research subject in control engineering. The goal of this paper is to propose a policy to smartly select the individuals to be tested. The main idea is to model the epidemics as a stochastic dynamic system and to select the individual to be tested accordingly to some optimality criteria, e.g. to minimize the probability of undetected asymptomatic cases. Every day, the probability of infection of the different individuals is updated making use of the stochastic model of the phenomenon and of the information collected in the previous days. Simulations for a closed community of 10’000 individuals show that the proposed technique, coupled with a selective confinement policy, can reduce the spread of the disease while limiting the number of individuals confined if compared to the simple contact tracing of positive and to an off-line test selection strategy based on the number of contacts.
1. Introduction
During the Covid-19 epidemic, one of the limiting factors that affected the capability to handle the spread of the disease was the limited number of available tests. This lack of information has created major issues in several countries and promoted the idea that testing is essential in the control of an epidemic (Salath et al., 2020).
Recent research works support the importance of testing to effectively control an epidemic, see Brotherhood et al., 2020, Eichenbaum et al., 2020 and Wang (2020). In this regard, the selection of the individuals to be tested has become a major concern in many countries. However, to the best of the authors’ knowledge, research on how to define these testing policies is still at a very early stage (Nowzari, Preciado, & Pappas, 2016).
This observation is testified by the de facto policies applied by decision makers during the Covid-19 epidemic. Among the various policies we can mention the use of contact tracing of individuals exposed to positive cases (Cereda et al., 2020), contact tracing combined with additional random testing (Shim, Tariq, Choi, Lee, & Chowell, 2020), the use of exhaustive control of new arrivals in isolated communities (Wang, Ng, & Brook, 2020), and the testing of people with high number of human interaction such as health care personnel (Padula, 2020). It is worth to note that most of these strategies rely on the appearance of symptomatic cases and required the use of hard lockdown policies to be effective.
Interestingly enough, the selection of individuals to test has important similarities with the problem of sensor selection for state estimation in the context of Wireless Sensor Networks. In both cases only a limited amount of information on a partially unknown process can be retrieved due to a limited amount of resources, i.e. the number of available tests or the channel bandwidth, respectively. The objective is to optimize where to collect the measurements based on the available information and on the model of the process. Sensor selection has been an active field in the last two decades: a method based on convex optimization is proposed by Joshi and Boyd (2008), a stochastic policy is studied by Gupta, Chung, Hassibi, and Murray (2006) and the optimal periodic policy for two sensors is given by Shi, Cheng, and Chen (2011). In the case of a general number of sensors the problem has been explored by Vitus, Zhang, Abate, Hu, and Tomlin (2012) over a finite horizon, by Mo, Garone, and Sinopoli (2014) over the infinite horizon, and for a general number of independent dynamical systems by Han, Wu, Zhang, and Shi (2017). However most of available works on sensor selection focus on real-valued dynamical systems, while the case where the process state assumes values from a finite set is at the best of our knowledge still largely unexplored.
The first step to propose an effective smart testing is the selection of an adequate model to monitor the epidemic. Compartmental epidemic models proved to provide accurate estimations of the dynamics of an epidemic (Brauer, 2008). These models can be divided in deterministic models, governed by differential equations (McCluskey, 2010), or stochastic models, where the heterogeneity of small communities can be better represented (Bøjrnstad et al., 2002, Lopez-Herrero and Amador, 2017). New models tailored for the Covid-19 case have been developed seeking for more suitable approaches for the design of control strategies, e.g. Casella, 2020, Franco, 2020 and Giordano et al. (2020). However, the nature of compartmental models implies an homogeneously distributed population with random mixing between individuals, which does not inform about the granular distribution of the disease. To model the granularity of the spread of a disease, network diffusion models provide a better insight of the population’s distribution and allow to identify the critical clusters of the spreading.
The most common network diffusion models are based on Stochastic Cellular Automata (SCA), where the spread of the disease depends on the interaction between neighbouring cells (Mikler et al., 2005, White et al., 2007). This idea has been lately extended to more complex network topologies (Keeling and Eames, 2005, Li et al., 2014). In these complex network models the interactions between individuals are modelled as the edges of a graph. This representation makes it possible to also model time-varying interactions, as well as selective quarantine policies (e.g. by removing the connections of certain individual with the rest of the population). From the theoretical viewpoint it is possible to prove that any SCA model is equivalent to a Markov chain (Ruhi & Hassibi, 2015). As we will discuss later on in this paper, this fact, although important from the theoretical viewpoint, is however not very useful in practice as the resulting Markov chain has a number of states that is exponential in the number of the states of SCA.
It is important to mention that while the use of network models has been often overlooked due to the difficulty to monitor and define the interactions in real communities, in the authors’ opinion the conception of more advanced tracking systems during the last pandemic leads naturally to this kind of approaches.
The problem of estimating the state of partially observable dynamic networks has been object of only a few studies in the last few years. One of the most studied problems is the estimation of the source of an information spread in networks using only limited observations. Zhu and Ying, 2014, Zhu and Ying, 2016 propose a sample path algorithm to estimate the location of a source of information or a disease. Alexandru and Dragotti (2019) extend this idea to the case where multiple rumours are spread and the time of the origin of the information is unknown. These works provide interesting idea that can be possibly adapted for the estimation of the evolution of an epidemic over a network.
An alternative approach to the surveillance of epidemics within networks can be found on the use of a sentinel system to estimate the evolution of the epidemic as done by Braeye, Quoilin, and Hens (2019). A sentinel system involves a limited network of selected reporting sites monitoring the disease in small portions of the population. The obtained data is used to estimate the behaviour of the entire network. Souty and Boëlle (2016) estimate the total number of cases of influenza based on the population density associated to each reporting site. Although this approach uses the density of population to improve the estimation of the state of the epidemic, the total population is still divided into clusters with homogeneous distribution and interactions.
At the time of the writing of this paper, some early work presenting attempts to define smart testing and quarantine policies have been just published. In particular (Berger, Herkenhoff, & Mongey, 0000) propose a policy based on conditional quarantine and random testing. However, the model based on partial observations assumes that tested negatives are “tagged” and they remain observable after a single test. In another recent paper on the subject by Kasy and Teytelboym (2020), the trade-off between quarantine and testing is regarded by defining a certain threshold based on the infection probability and related to the cost of testing or quarantining an individual. In this case, the partial information is inferred based on the social group of the individual rather than its interactions within the network.
The main contribution of this paper is to propose a smart testing strategy to select the individuals to be tested based on the estimated probability of infection of each individual. As a first step we propose a method to make an approximated estimation of the current state of the epidemic which is computationally inexpensive. On the basis of this estimation, the testing policy is defined as a constrained optimization problem. This testing policy is coupled with a selective confinement policy which allows to only confine few individuals of the population based on the outcome of the tests. We compare the proposed strategy with the current best practice, namely contact tracing of positives, and a suitable topology-based strategy, where individuals to test are selected according to their number of contacts. Numerical simulations show the advantage of this approach both in terms of number of infected individuals and in terms of number of individuals put in quarantine at each time. In particular, on a population of 10’000 individuals, the total number of infected is 8 times less and the total amount of days spent in quarantine is 5 times less with respect to the current best practice, and the improvement with respect to the topology-based strategy is even more evident. These results also show that tracing of contacts is crucial to keep under control the epidemic but it can be largely improved by using the algorithms proposed in this paper. The proposed algorithms can be used in a centralized way (e.g. by a decision maker) but they are also suitable to work in a distributed privacy-aware fashion and to integrate with tracing devices.
The remainder of the paper is organized as follows. In Section 2 the proposed model of the epidemic is presented. Sections 3, 4 introduce the exact and the approximated estimations of the evolution of the epidemic. Section 5 defines the testing strategy and Section 6 the quarantine actions. In Section 7 several simulations demonstrate the performance of the proposed strategies. Section 8 provides conclusions and future works.
2. Model
Consider a population of individuals where a disease is spreading. Each individual can be susceptible, infected, or removed. The spreading of the epidemic is modelled according to the following assumption.
Assumption 1
A susceptible individual can be infected by other infected individuals of the population with whom he had a direct contact. Once an individual is infected, the individual will eventually become removed and cannot be infected a second time.
The exposure to an infected individual is a necessary but not-sufficient condition for a susceptible individual to become infected. Indeed, the contagion actually takes place if some events (e.g. exchange of body fluids for flu-like illnesses) have occurred and thus it is intrinsically stochastic. Motivated by these considerations, we model the transmission of the disease through random variables. Similarly, also recovery is modelled as a random variable to capture the uncertainty of the recovery process.
Mathematically, each individual has at fixed time instants, say every day, a state that can take three logical states:
-
•
S - susceptible, the individual is healthy and was never infected before, so it is susceptible of being infected;
-
•
I - infected, the individual is infected and can infect others;
-
•
R - removed, the individual has been infected in the past and cannot be infected anymore (because immune or dead).
We denote with the binary stochastic input representing the stochastic contagion event at time . The variable takes value if the th individual has been infected between time and time , and otherwise.
To characterize we introduce the transmission variables which takes value if the infection is transmitted from to between time and time given that the individual was infected and the individual was susceptible. The same way denotes the binary stochastic variable representing the stochastic recovery event at time . In particular, if the th individual becomes removed between time and time , and otherwise. Note that the recovery variable indicates the moment that individual is not infected anymore, due to immunity or death.
Finally the state of each individual evolves according to the following equation
(1) |
with
(2) |
The state evolution of each individual is depicted by Fig. 1. From the last equation it is clear that an individual can be infected by individual if individual was infected, i.e. , and if the transmission occurred, i.e. . The modelling of variable is summarized by the following assumption.
Assumption 2
The transmission of the disease from an infected individual to a susceptible individual is a Bernoulli random variable with mean , such as . Moreover is independent of and of the initial state . The mean values are symmetric, i.e. . For any pair of individuals that have no contacts .
The variable is modelled according to the following assumption,
Assumption 3
The recovery is a Bernoulli random variable with mean constant over time. Moreover is independent of , of , and of the initial state .
In general the system is partially observable as symptoms only appear in a small percentage of the population. The appearance of symptoms is modelled by the random variable taking value if th individual is infected and shows symptoms between time and time , and otherwise. We model it according to the following assumption.
Assumption 4
The appearance of symptoms is a Bernoulli random variable with mean constant over time. Moreover is independent of and , of , and of the initial state .
2.1. Problem formulation
In this paper we consider the case in which only a limited amount of tests are available at each time . We assume that when a test is performed on the th individual at time we obtain the information if or not. No other information is provided by the test, so it is not possible to distinguish if an individual is susceptible or recovered. We can formally introduce the auxiliary state , taking value if , and otherwise, that represents the binary variable accessed by the test. For Covid-19 the value of can be retrieved by exploiting several different kind of tests. To date, even if false negatives are not completely avoided, PCR tests are widely considered to be very accurate (it is the only recommended method for the identification of Covid-19, see World Health Organization (2020a), and the reference standard for many medical studies, see e.g. Dinnes et al. (2020)). Based on this consideration, the paper assumes that available tests are ideal, which is in line with an ample part of the literature dealing with testing strategies (Berger et al., 0000, Piguillem and Shi, 0000).
To model the testing phase, we introduce the selection variable taking value if individual is selected to be tested at time and otherwise. The variable is thus a controlled variable that can be managed to tackle the disease diffusion. Beside tested individuals, we also consider that additional information is provided by symptomatic individuals. In the context of this work, a symptomatic individual is assumed to be an infected person who spontaneously visits a medical centre with clear symptoms and it is diagnosed with the disease. Let be the set containing the indices of the individuals who are tested at the time instant and of the individuals who show first symptoms at time . Note that the cardinality of the set is time-dependent since the number of symptomatic individuals is not constant. The observed output at time can be then expressed as
(3) |
while the set of the observed outputs up to time instant is
(4) |
and it represents the information available at time instant . Beside the testing phase, the system evolution is affected by the quarantine mechanism represented by the control variable , taking value if individual is in quarantine at time and otherwise. The variable is the control variable that governments use to tackle the epidemic.
The goal of this paper is the definition of a policy to select the individuals to be tested that, in conjunction with a selective quarantine policy, is able to reduce the spread of the disease while keeping a limited number of individuals in quarantine. To do so, we tackle the problem by proposing the closed-loop structure reported in Fig. 2 consisting of three stages: (1) estimation of the states using the information available from the feedback of the outputs; (2) selection of the individuals to test by optimizing a reward function ; and (3) based on the output , execution of control actions through selective quarantine.
The following sections focus on the derivation of a proper state estimation given the information available and the definition of a suitable reward .
3. Exact estimate: Hidden Markov model
The state of the whole population is defined by the vector
(5) |
Under Assumptions 1, 2, 3, at any time , the next state of the population depends only on the current state of the population . Accordingly, in line of principle, the stochastic process describing the evolution of the epidemic satisfies the Markov property and can be represented by a Markov chain.
To model the dynamics of the Markov chain, we have to derive the transition matrix whose entries are
(6) |
where represent two possible states of the network, and represent the indices of the transition matrix associated to them. Under Assumptions 1, 2, 3, the next states of two individuals are independent given the previous state of the population. It follows that
(7) |
allowing to compute the transition probabilities of the network as a derivation of the transition probabilities between the states of each single individual. Since only the transition from susceptible to infected depends on the state of other individuals, the following simplification holds
(8) |
where .
The state transition probability of any individual at time can be computed as
(9) |
while the probability of remaining in a given state is
(10) |
All other transitions have probability 0.
A major difficulty in our setting is that the evolution of the system can be only observed by symptomatic individuals and selective tests on the population. Since , the Markov model is hidden and can be only partially observed through the output. The complete characterization of the state given the available information is provided by the joint distribution . For a given , the joint distribution can be represented by a vector of dimension with entries . In the case of Hidden Markov Models, this joint distribution can be easily computed by means of the forward algorithm (see Blunsom (2004) and Rabiner (1989)), providing the following expression
(11) |
The computation of is then easy: if the state gives the output , and 0 otherwise, namely if is not a possible output for the state .
From the joint distribution, the conditional distribution is
(12) |
where we used the Bayes rule and the law of total probability. Recall now that the optimal estimate of a random variable corresponds to the expected value given the observations, see Anderson and Moore (2012). Then the optimal estimate of the -individual is
(13) |
where we used the definition of expected value for a binary variable and the law of total probability. This procedure allows to obtain the probability of each individual to be infected at time given the complete vector of observations . However, in spite of allowing to compute the exact probability, this approach requires the computation of all the transition probabilities of the matrix and the use of a vector variable of size , which is not computationally feasible even for small populations.
4. An approximated state estimate
Due to the prohibitive burden of an exact probability computation, in this paper we propose an approximated low-computational algorithm to estimate such probability. The proposed approximated estimation is based on the idea of temporal and spatial truncation of the updates and is also suitable for decentralized implementations More precisely, we propose to propagate the information from testing only to individuals that are the most correlated to the tested individuals, namely the ones that have direct contact with tested people, while for the remaining part of the population the update is performed based on previous estimates and the topology of the network representing the population. In the same way, only a limited amount of past state estimates are assumed to be affected by the new information. This approximation allows to retrieve the information regarding the individuals that are most affected by the result of each test while keeping a limited computational time.
4.1. Approximated state estimation definitions
We define the estimate of the state of the individual as
(14) |
where the local information set for the th individual is defined as
(15) |
The local information set consists of the local state estimates of direct contacts, updated at the time instant of the interaction, and the state estimates of tested individuals, updated just after the test. In order to keep the computations associated to the th individual limited, we define a local approximated estimation which can be retrieved based only a partial knowledge of the network. More precisely, the state of each individual is estimated under the assumption that only its contacts are known, and no information on the connections between any other individuals is assumed to be available. In this sense, we will focus only on individuals with whom the th individual was in contact: in the case of untested individuals we will use only the previous local estimates while in the case of tested individuals we will use the current state and the updated estimation of past states.
Denote by the (random) vector collecting all the random variables , namely . With a little misuse of notation denotes the case where all the past states , are equal to . We assume that the random variables , are conditional independent given . Similar assumptions are made by Boguná, Castellano, and Pastor-Satorras (2013) and Valdano, Ferreri, Poletto, and Colizza (2015). The rationale is that, if the individual has always been healthy, the coupling between two of his neighbours and is negligible, as in the case when individual is the only connection between and or, even if , contacts of are enough different from contacts of . It follows that
(16) |
To further simplify the estimation algorithm we will simplify the stochastic recovery with a deterministic one based on the average recovery time . Then we have
(17) |
We compute and as
(18) |
where the last equality holds since are conditional independent given . To obtain the numerical value of would require to compute that in turn would require and so on. Since this propagation is very computationally expensive we make the approximation that
(19) |
The underlying assumption is that the state of an individual and those of its neighbours are independent. The accuracy of this assumption has been explored by Gleeson, Melnik, Ward, Porter, and Mucha (2012) where it has been shown that the dynamics are well approximated if the degrees of closest neighbours are high. At the same way, the assumption holds when the underlying network of contacts is time-varying, but the results can be less accurate if pairs of individuals have frequent interactions, and many contacts in common. Since this happens in real life (think of relatives and colleagues), we introduce a correction factor , for simplicity denoted by , that accounts for the coupling of individuals and due to the interactions before
(20) |
In line of principle, is smaller when more interactions have occurred between and in the past. In fact, the probability that is infected given that has been healthy (namely the left hand side of Eq. (19)) is lower than the probability that is infected without any knowledge on the past states of (namely the right hand side of Eq. (19)). An efficient way to compute is defined in Section 7. We can conveniently incorporate the correction factor in the term as . We finally obtain the following update rule
(21) |
To keep a limited number of computations, we also make the following approximation
(22) |
with initialization . Roughly speaking, if individual has direct contact with a tested individual and individual has direct contact with but not with , the state estimates of will be corrected based on the outcome while the state estimates of will use the old estimation of , as derived without the knowledge of the outcome. This means that we use the information regarding the outcome from the tests to only update the direct contacts of a tested individual.
4.2. State estimation update
Since the update of each individual uses only knowledge from local connections, new information can be used differently for tested individuals, individuals with a direct contact with them, and the remaining of the population.
4.2.1. Tested individuals
Let denote the outcome of the test to the individual . Then we have
(23) |
(24) |
If , no contagion happened in , namely
(25) |
for , and
(26) |
for as no additional information on past states is given by a negative outcome. As , may be equal to 1 only if a contagion occurs in the interval , therefore the infection probability is updated as
(27) |
for and
(28) |
for .
For the case of a positive result, , we have
(29) |
for , while for it holds that
(30) |
If , is equal to 1 only if the contagion occurred in the interval . Knowing that , we can compute the infection probability for these individuals as
(31) |
for and
(32) |
for .
4.2.2. Neighbours of tested individuals
Formally the neighbours of a tested individual are defined by the set which represents the set of individuals that has been in contact at least once with at least a tested individual. According to the definition of local information set, the update of the estimation exploits also the updated estimate of the past states of tested individual.
The probability relative to the initial time instant is not changed
(33) |
By using the information from the contacts that have been tested at time instant we can update the probabilities starting from (21) as
(34) |
using (22), where
(35) |
(36) |
Note that the previous update takes advantage from the knowledge of the update estimate of the past state of tested individuals. The last equality holds only if individual has not been tested before, otherwise would be different according to the update relative to a tested individual. In that case, the correction procedure starts from the instant where the individual was tested. The correction procedure works if more than one neighbour have been tested even in different time instants. Note that if and if . Finally, the infection probability at time is computed as
(37) |
4.2.3. Open-loop state estimation
For each individual not having direct contact with any tested individual, the open-loop estimate is computed as based on the previous estimates provided by its contacts as
(38) |
using (21), (22). Other required values are updated according to
(39) |
(40) |
(41) |
4.3. Overall estimation scheme
The state estimation scheme proposed above performs a hierarchical update of the infection probability. This update is structured around individuals that are tested at time , the neighbours of the tested individuals and the remaining of the population. At each time instant, the estimation is thus divided into levels of update based on the derivations obtained in the previous subsection:
-
•
First level: Tested individuals, using the output from the performed tests
-
•
Second level: Neighbours of tested individuals, including the update estimate from the first level and the previous estimates
-
•
Third level: Rest of the population (open loop), using only the previous estimates .
This scheme is depicted in Fig. 3.
In line of principle, buffers of increasing length are needed to store past probabilities. In the spirit of a temporal truncation of the updates, since the current test outcomes bring little information on the oldest states except for positive tested individuals, we assume that for untested individuals past probabilities older than are not affected by the new outcomes, i.e.
(42) |
Under this approximation, in terms of information storage, the local update of the current state estimate requires the storage of the following two buffers of information for each individual, namely the Susceptibility buffer
(43) |
and, the Infection probability buffer
(44) |
It is worth to note that the complexity of the proposed estimator is much lower than the optimal estimation devised in the previous section. At each update, the open-loop state estimation requires for each individual the product of at most real factors, see (38). Since only effective contacts (namely those with ) affect the product, the number of required multiplications drastically falls and scales with the node degree. The second level update requires, for each contact of a tested individual, the correction of the last values and involves simple multiplications of scalar quantities, see (34). Similarly, at the first level, for each tested individual, the update of the last estimates requires only elementary operations using already available quantities (see (27), (30)).
An interesting feature of the proposed approach is that it is not only computationally efficient to be used in a centralized way for a given community, but that it can be also implemented in a decentralized manner. This is the case where each individual is equipped with a smart device (e.g. a smartphone) provided with small computational capability and able to communicate with other devices and with a central testing unit, see Fig. 5.
Contact tracing mechanisms have already been applied by many countries during the Covid-19 epidemic and software applications are already available in the market. With respect to them, our algorithm can be implemented based on the same hardware and with a larger amount of transmitted data. In particular, when individuals get in contact during the day, their previous estimate has to be exchanged. The outcomes of the tests are provided to tested individuals who compute and communicate the updated estimates of the previous states to the server. Then remote communications of those updated estimates are performed once per day from the main server to the population. An explanatory representation is given in Fig. 4. Note that no information on interactions is communicated neither to the central unit nor to other individuals, so that privacy is preserved and vulnerability of a central data collector is avoided. Then, each individual transmits the updated estimates to the server which decides who to test the next day and convokes them.
5. Testing policy
Similarly to the literature on sensor selection, it is possible to formulate the test selection problem as a constrained optimization problem based on the state estimate. Formally, we introduce the binary control variable taking value if th individual is selected at time instant to be tested at the next time instant , and otherwise, while we denote . Then the test selection problem can be formulated as
(45) |
(46) |
where is a suitable reward function.
Several possibilities exist for the choice of the cost function. Differently from most of the works on sensor selection for remote estimation, we avoid to adopt the error covariance matrix because it is computationally infeasible for large . More suitable cost functions can be computed based on different metrics of the current state of the population or the topology and characteristics of the network. Namely, intuitive choices would focus on the expected number of detected people, the expected number of infections at the next time instant or targeting individuals with high number of contacts (e.g. first-line health workers). In this context, different cost functions may provide different results based on the time of application of their actions, the number of available tests or the applied quarantine actions.
Based on good preliminary results, in this paper we propose to maximize the expected value of the number of detected positive individuals, that is
(47) |
This policy is equivalent to select the individuals whose probability of being infected is the highest.
It should be noted that the proposed cost function is a primary attempt to define an efficient metric in line with the presented framework. Nonetheless, the selection of the optimal cost function is out of the scope of this paper and remains an open problem for further research.
6. Quarantine actions
The outcomes of the tests are exploited to act on the population through a selective quarantine. Formally, we introduce the control variable such that if th individual is selected to be quarantined at time instant , and otherwise. In this paper for any positive we propose to quarantine the closest neighbours, i.e. the individuals with the highest transmission probability . The parameter can be properly tuned to trade-off between the total number of quarantined for positive and the expected number of infected (but not detected) that are quarantined because they have a direct contact with a positive. We consider that individuals will leave quarantine after days.
Note that in line of principle other quarantine strategies can be designed based on probabilities of infection of the neighbours of a positive tested, as well as preventive quarantine based only on the state estimate, and they will be the subject of future investigations.
7. Numerical simulations
This section shows, through numerical simulations, the effectiveness of the proposed solution by comparing it to current approaches.
7.1. Setting
The simulation setup considers a closed population of 10’000 individuals with the following parameters regarding the spread of the disease
-
•
. This value is equivalent to a virus with high spreading, e.g. the Covid-19, when no social distance measures are adopted (Giordano et al., 2020, Salath et al., 2020).
-
•
0.1% of the population is initially infected.
-
•
20% of new infected present symptoms of the disease before the recovery, in agreement with Ing, Cocks, and Green (2020) and Lavezzo, Franchin, Ciavarella, Cuomo-Dannenburg, Barzon, Del Vecchio, et al. (2020).
-
•
0.5% of the population can be tested at each day, corresponding to . This value is similar to the percentage of daily tests in South Korea or in USA at December 2020, see https://covidtracking.com/data.
-
•
The closest individuals of each individual with positive test are put in quarantine for days. When in quarantine, all the transmission probability are reduced to of their normal value.
The population distribution can be conveniently representedthrough a weighted undirected graph, where each node represents an individual, an edge between two nodes represents an interaction between two individuals, and the weight is set equal to the probability of transmission . The graph topology has been generated to emulate a small-world network. This kind of graphs are characterized by the presence of clusters, which are subgraphs that are (almost) complete, and of short paths connecting (almost) any pair of nodes. They have been introduced to capture the evidence of human connections and have been widely studied in the literature, see de Sola Pool and Kochen (1978) and Watts and Strogatz (1998). In our case, each individual belongs to more clusters (at least 2, up to 6) to mimic families, offices, habitual relations and activities, etc. The dimensions of the clusters are uniformly distributed and the range depends on the kind of relationship that they capture: for example the dimensions of households randomly vary from 1 to 8, while dimensions of offices vary from 4 to 40. Random links are also added to the network. The resulting graph is then heterogeneous and possibly unbalanced. The average weights are set in a realistic way, e.g. the average weights in a household are four times the ones in a small office. For the sake of simplicity the graph is assumed to be time-invariant, except for the effects of quarantine actions.
Initial conditions , i.e. which individuals are initially infected, are stochastically generated based on the initial probability of each node to be infected. To test the robustness of the proposed strategy, we assume that the probability distribution of the initial conditions is perturbed up to the 10%. It is also assumed that 10% of the arcs of the graph are unknown.
The presented simulations compare three different scenarios:
-
•Test and trace (T&T). This policy traces the contacts (based on the knowledge of the network) of symptomatic and detected cases (see Dar, Lone, Zahoor, Khan, and Naaz (2020) and Ferretti et al. (2020)). More formally we define the set of individuals that have been detected at a generic time as . Then for any individual we retrieve the set of recent contacts , and we refine it by removing already detected individuals and individuals that have been recently tested. From the set we select the individuals that have been more in contact with the individual . Among the different possibilities for doing so, we choose such that
(48) (49)
where denotes the cardinality of the set. Less rigorously, we can say that contains the contacts of with which the last interaction has been the most dangerous, while the number of individuals in each set is chosen such that tests are allocated as uniformly as possible among the sets of contacts of detected positive. Please note that, since in the following simulations the graph is fixed, implies that for any if has not been in quarantined. Remaining tests are used to randomly explore other parts of the graph in order to model test selection policies that are not based on the probabilities of infection, as it is done in reality where tests are also partially allocated to the employs of interested companies. Test and trace strategy is a well-known policy which has provided good results in several countries and it has been considered the best practice by the medical community (World Health Organization, 2020b).(50) -
•Topology-based testing. This scenario presents a policy where, based on the topology of the graph, i.e. the number of contacts, certain individuals are periodically tested. In particular we choose a period days and we solve the constrained optimization problem
(51)
Individuals such that are then randomly sorted and tested accordingly. The periodical testing campaign is delayed on-line in order to allocate tests to the closets neighbours of a new detected. However we consider only a partial tracing, assuming that at most 8 contacts are provided. The application of topology-based centrality metrics for test selection is quite new but they have been studied to select the edges to remove (see Doostmohammadian, Rabiee, and Khan (2020) and references therein).(52) -
•Smart testing (T&EST). This scenario follows the proposed control scheme where individuals are selected according to the probability of being infected, Eq. (47), and the outcomes of the tests are used to update the state estimate according to Section 4. The correction term is set as
with(53)
so is the number of days in the last with a contact between individual and . We set parameter by fitting the time evolution of the mean probability computed by the proposed estimator to the incidence of cases obtained by simulating a graph of a similar topology in the case of no action on the system.(54)
Given the stochastic nature of the model, simulations have been generated for each scenario. The spread of the infectious disease is monitored for a time span of days.
7.2. Results
As we can see in Fig. 6, the proposed control mechanism (testing based on estimation and conditional quarantine) is effective in reducing the total number of infected people in a given temporal window both with respect to the topology-based strategy and test-and-trace. It is important to note that the test-and-trace strategy clearly outperforms the topology-based strategy even if the latter, when no new known positives are present, allocates tests on crucial points of the network instead of on random individuals. This result confirms how important is to trace contacts of positive individuals in order to keep the epidemic under control. On the other hand, the comparison between T&T and T&EST shows that the performances of pure tracing can be largely improved by using control methodologies. In particular, when many positive individuals have been detected, it is impossible to test all of their contacts due to the limited number of available tests. In that case, with the proposed strategy, the contacts that should be tested naturally come up among all the other contacts. When no new positive is detected, T&EST selects who to test using updated information on the state of the population. Thus, the proposed strategy takes into account both information on the graph topology (for individuals that have not been in contact with tested individuals, estimation is affected by the number of interactions) and the information from tests. The obtained results show that including dynamics provides better performances than simpler off-line strategies.
The proposed strategy is effective also in mitigating the epidemic outbreak by avoiding any peak of active cases, as shown in Fig. 7. This is an important result since it is fundamental to have a low number of active cases to avoid the health-care system maximum capacity to be reached. It is important to remark that the presented approach has shown to be very effective when applied to cases where the initial number of infected individuals is small compared to the total size of the population. In such a case, a fast identification of clusters of infection is essential, providing enough tracing to detected positive individuals but also exploring new areas of the network. This improvement in the performances can be better appreciated in Fig. 8.
The number of people in quarantine at each time instant is depicted in Fig. 9. Although not intuitive, this plot shows lower numbers of people in quarantine for the smart testing policy, indicating that the improvement in performance does not require a greater number of people in quarantine but that actually can be achieved with less but better focused quarantines. In these simulations the number of people in quarantine for the T&EST is almost negligible, showing that an efficient testing policy can have a great impact also in the required control actions. This is a very promising result especially from an economic point of view since it would limit the social and economical impact of the measures.
A synoptic overview of the numerical simulations is reported in Table 1. The results show the clear improvement on the containment of the epidemic, in terms of both active cases and people in quarantine, by using a testing and quarantine policy based on the presented probability estimation algorithm.
Table 1.
Scenario | Peak of active cases | Total infected | Work days lost |
---|---|---|---|
Test and trace | 116 | 1472 | 2564 |
Topology-based strategy | 365 | 3859 | 5847 |
Smart testing | 20 | 169 | 444 |
An important aspect of the presented strategy is the assumption of a good knowledge of the network topology. In this sense, Fig. 10 provides the variation in the performances of the presented strategy with respect to the number of individual interactions known. From this plot it can be seen that within the range of 80%–100% of knowledge of the network, the results are very similar and promising. For a percentage of unknown interactions superior to 20%, a threshold behaviour can be seen, where the performance is clearly worse and a more evident peak of infection can be seen. However, it must be noted that even if the performance is clearly poorer, the results are still much better than the other strategies.
8. Conclusions
In this paper we presented a novel testing strategy to smartly select the individuals to be tested during an epidemic. This policy is based on a decentralized state estimation of the status of the epidemic obtained from the outcome of the tests.
The testing policy is defined as an optimization problem based on the state estimation. The proposed estimation algorithm is computationally inexpensive and can even be implemented in a distributed fashion.
The numerical results based on Monte Carlo simulations demonstrate that the use of the proposed scheme, testing and selective quarantine, significantly reduces the total number of infected people as well as the peak of active case and the number of people put in quarantine.
Future works will focus on the link between the test selection objective functions and the quarantine policies. The case where the reliability of the tests is considered is another subject of study in future research.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Footnotes
This work has been supported by the Fonds de la Recherche Scientifique-FNRS under the Grant number 40003443 (“Smart Testing”) and by the European Commission under the Grant Agreement number 774571 (project PANTHEON, “Precision farming of hazelnut orchards”).
References
- Alexandru R.I., Dragotti P.L. In: Wavelets and Sparsity XVIII. Lu Y.M., Papadakis M., Van De Ville D., editors. SPIE; San Diego, United States: 2019. Diffusion source detection in a network using partial observations; p. 20. [DOI] [Google Scholar]
- Anderson B.D., Moore J.B. Courier Corporation; 2012. Optimal filtering. [Google Scholar]
- Berger, D., Herkenhoff, K., & Mongey, S. (0000). An SEIR infectious disease model with testing and conditional quarantine, URL: https://www.nber.org/papers/w26901.
- Blunsom P. Hidden markov models. Lecture notes, August. 2004;15(18–19):48. URL: http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.123.1016&rep=rep1&type=pdf. [Google Scholar]
- Boguná M., Castellano C., Pastor-Satorras R. Nature of the epidemic threshold for the susceptible-infected-susceptible dynamics in networks. Physical Review Letters. 2013;111(6) doi: 10.1103/PhysRevLett.111.068701. [DOI] [PubMed] [Google Scholar]
- Bøjrnstad O.N., Finkenstädt B.F., Grenfell B.T. Dynamics of measles epidemics : Estimating scaling of transmission rates using a time series SIR model. Ecological Monographs. 2002;72(2):169–184. doi: 10.1890/0012-9615(2002)072[0169:DOMEES]2.0.CO;2. [DOI] [Google Scholar]
- Braeye T., Quoilin S., Hens N. Incidence estimation from sentinel surveillance data; a simulation study and application to data from the Belgian laboratory sentinel surveillance. BMC Public Health. 2019;19(1):982. doi: 10.1186/s12889-019-7279-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brauer F. In: Mathematical Epidemiology. Brauer F., van den Driessche P., Wu J., editors. Springer Berlin Heidelberg; Berlin, Heidelberg: 2008. Compartmental models in epidemiology; pp. 19–79. [DOI] [Google Scholar]
- Brotherhood L., Kircher P., Santos C., Tertilt M. 2020. An economic model of the COVID-19 epidemic: The importance of testing and age-specific policies. URL https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3618840. [Google Scholar]
- Casella F. Can the covid-19 epidemic be controlled on the basis of daily test reports? IEEE Control Systems Letters. 2020;5(3):1079–1084. [Google Scholar]
- Cereda D., Tirani M., Rovida F., Demicheli V., Ajelli M., Poletti P., et al. 2020. The early phase of the COVID-19 outbreak in lombardy, Italy. arXiv preprint arXiv:2003.09320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dar A.B., Lone A.H., Zahoor S., Khan A.A., Naaz R. Applicability of mobile contact tracing in fighting pandemic (COVID-19): Issues, challenges and solutions. Computer Science Review. 2020 doi: 10.1016/j.cosrev.2020.100307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- de Sola Pool I., Kochen M. Contacts and influence. Social Networks. 1978;1(1):5–51. [Google Scholar]
- Dinnes J., Deeks J.J., Adriano A., Berhane S., Davenport C., Dittrich S., et al. Rapid, point-of-care antigen and molecular-based tests for diagnosis of SARS-CoV-2 infection. Cochrane Database of Systematic Reviews. 2020;(8) doi: 10.1002/14651858.CD013705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Doostmohammadian M., Rabiee H.R., Khan U.A. Centrality-based epidemic control in complex social networks. Social Network Analysis and Mining. 2020;10(1):32. doi: 10.1007/s13278-023-01070-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eichenbaum M., Rebelo S., Trabandt M. National Bureau of Economic Research; Cambridge, MA: 2020. The Macroeconomics of Testing and Quarantining: Technical Report w27104. [DOI] [Google Scholar]
- Ferretti L., Wymant C., Kendall M., Zhao L., Nurtay A., Abeler-Dörner L., et al. Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science. 2020;368(6491) doi: 10.1126/science.abb6936. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Franco E. 2020. A feedback SIR (fSIR) model highlights advantages and limitations of infection-based social distancing. arXiv preprint arXiv:2004.13216. [Google Scholar]
- Giordano G., Blanchini F., Bruno R., Colaneri P., Di Filippo A., Di Matteo A., et al. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nature Medicine. 2020;26(6):855–860. doi: 10.1038/s41591-020-0883-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gleeson J.P., Melnik S., Ward J.A., Porter M.A., Mucha P.J. Accuracy of mean-field theory for dynamics on real-world networks. Physical Review E. 2012;85(2) doi: 10.1103/PhysRevE.85.026106. [DOI] [PubMed] [Google Scholar]
- Gupta V., Chung T.H., Hassibi B., Murray R.M. On a stochastic sensor selection algorithm with applications in sensor scheduling and sensor coverage. Automatica. 2006;42(2):251–260. [Google Scholar]
- Han D., Wu J., Zhang H., Shi L. Optimal sensor scheduling for multiple linear dynamical systems. Automatica. 2017;75:260–270. [Google Scholar]
- Ing A.J., Cocks C., Green J.P. COVID-19: in the footsteps of Ernest Shackleton. Thorax. 2020 doi: 10.1136/thoraxjnl-2020-215091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Joshi S., Boyd S. Sensor selection via convex optimization. IEEE Transactions on Signal Processing. 2008;57(2):451–462. [Google Scholar]
- Kasy M., Teytelboym A. Adaptive targeted infectious disease testing. Oxford Review of Economic Policy. 2020 doi: 10.1093/oxrep/graa018. [DOI] [Google Scholar]
- Keeling M.J., Eames K.T. Networks and epidemic models. Journal of the Royal Society Interface. 2005;2(4):295–307. doi: 10.1098/rsif.2005.0051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lavezzo E., Franchin E., Ciavarella C., Cuomo-Dannenburg G., Barzon L., Del Vecchio C., et al. Suppression of COVID-19 outbreak in the municipality of Vo’, Italy. Nature. 2020;584(7821):425–429. doi: 10.1038/s41586-020-2488-1. [DOI] [PubMed] [Google Scholar]
- Li C.-H., Tsai C.-C., Yang S.-Y. Analysis of epidemic spreading of an SIRS model in complex heterogeneous networks. Communications in Nonlinear Science and Numerical Simulation. 2014;19(4):1042–1054. doi: 10.1016/j.cnsns.2013.08.033. [DOI] [Google Scholar]
- Lopez-Herrero M., Amador J. Cumulative and maximum epidemic sizes for a nonlinear SEIR stochastic model with limited resources. Discrete and Continuous Dynamical Systems. Series B. 2017;22(5):18. doi: 10.3934/dcdsb.2017211. [DOI] [Google Scholar]
- McCluskey C.C. Complete global stability for an SIR epidemic model with delay — distributed or discrete. Nonlinear Analysis. Real World Applications. 2010;11(1):55–59. doi: 10.1016/j.nonrwa.2008.10.014. [DOI] [Google Scholar]
- Mikler A.R., Venkatachalam S., Abbas K. Modeling infectious diseases using global stochastic cellular automata. Journal of Biological Systems. 2005;13(04):421–439. doi: 10.1142/S0218339005001604. [DOI] [Google Scholar]
- Mo Y., Garone E., Sinopoli B. On infinite-horizon sensor scheduling. Systems & Control Letters. 2014;67:65–70. [Google Scholar]
- Nowzari C., Preciado V.M., Pappas G.J. Analysis and control of epidemics: A survey of spreading processes on complex networks. IEEE Control Systems Magazine. 2016;36(1):26–46. [Google Scholar]
- Padula W.V. Why only test symptomatic patients? Consider random screening for COVID-19. Applied Health Economics and Health Policy. 2020;18(3):333–334. doi: 10.1007/s40258-020-00579-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Piguillem, F., & Shi, L. (0000). The Optimal COVID-19 quarantine and testing policies, CEPR Discussion Paper No. DP14613, URL: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3594243.
- Rabiner L.R. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE. 1989;77(2):257–286. doi: 10.1109/5.18626. [DOI] [Google Scholar]
- Ruhi N.A., Hassibi B. 2015 54th IEEE Conference on Decision and Control (CDC) IEEE; Osaka: 2015. SIRS epidemics on complex networks: Concurrence of exact Markov chain and approximated models; pp. 2919–2926. [DOI] [Google Scholar]
- Salath M., Althaus C.L., Neher R., Stringhini S., Hodcroft E., Fellay J., et al. COVID-19 epidemic in Switzerland: on the importance of testing, contact tracing and isolation. Swiss Medical Weekly. 2020 doi: 10.4414/smw.2020.20225. [DOI] [PubMed] [Google Scholar]
- Shi L., Cheng P., Chen J. Optimal periodic sensor scheduling with limited resources. IEEE Transactions on Automatic Control. 2011;56(9):2190–2195. [Google Scholar]
- Shim E., Tariq A., Choi W., Lee Y., Chowell G. Transmission potential and severity of COVID-19 in south Korea. International Journal of Infectious Diseases. 2020;93:339–344. doi: 10.1016/j.ijid.2020.03.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Souty C., Boëlle P.-Y. Improving incidence estimation in practice-based sentinel surveillance networks using spatial variation in general practitioner density. BMC Medical Research Methodology. 2016;16(1):156. doi: 10.1186/s12874-016-0260-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Valdano E., Ferreri L., Poletto C., Colizza V. Analytical computation of the epidemic threshold on temporal networks. Physical Review X. 2015;5(2) [Google Scholar]
- Vitus M.P., Zhang W., Abate A., Hu J., Tomlin C.J. On efficient sensor scheduling for linear dynamical systems. Automatica. 2012;48(10):2482–2493. [Google Scholar]
- Wang Y. An analytical SIR model of epidemics and a sustainable suppression policy: Testing. SSRN Electronic Journal. 2020 doi: 10.2139/ssrn.3573979. [DOI] [Google Scholar]
- Wang C.J., Ng C.Y., Brook R.H. Response to COVID-19 in Taiwan: Big data analytics, new technology, and proactive testing. JAMA. 2020;323(14):1341. doi: 10.1001/jama.2020.3151. [DOI] [PubMed] [Google Scholar]
- Watts D.J., Strogatz S.H. Collective dynamics of ‘small-world’networks. Nature. 1998;393(6684):440–442. doi: 10.1038/30918. [DOI] [PubMed] [Google Scholar]
- White S.H., del Rey A.M., Sánchez G.R. Modeling epidemics using cellular automata. Applied Mathematics and Computation. 2007;186(1):193–202. doi: 10.1016/j.amc.2006.06.126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- World Health Organization, WHO . 2020. Advice on the use of point-of-care immunodiagnostic tests for COVID-19: Technical documents. URL https://www.who.int/news-room/commentaries/detail/advice-on-the-use-of-point-of-care-immunodiagnostic-tests-for-covid-19. [Google Scholar]
- World Health Organization, WHO . 2020. Contact tracing in the context of COVID-19. Publications, URL https://www.who.int/publications/i/item/contact-tracing-in-the-context-of-covid-19. [Google Scholar]
- Zhu K., Ying L. A robust information source estimator with sparse observations. Computational Social Networks. 2014;1(1):3. doi: 10.1186/s40649-014-0003-2. [DOI] [Google Scholar]
- Zhu K., Ying L. Information source detection in the SIR model: A sample-path-based approach. IEEE/ACM Transactions on Networking. 2016;24(1):408–421. [Google Scholar]