Skip to main content
PLOS One logoLink to PLOS One
. 2021 Nov 18;16(11):e0259969. doi: 10.1371/journal.pone.0259969

Modelling digital and manual contact tracing for COVID-19. Are low uptakes and missed contacts deal-breakers?

Andrei C Rusu 1,*, Rémi Emonet 2,#, Katayoun Farrahi 1,#
Editor: Ivan Kryven3
PMCID: PMC8601513  PMID: 34793526

Abstract

Comprehensive testing schemes, followed by adequate contact tracing and isolation, represent the best public health interventions we can employ to reduce the impact of an ongoing epidemic when no or limited vaccine supplies are available and the implications of a full lockdown are to be avoided. However, the process of tracing can prove feckless for highly-contagious viruses such as SARS-CoV-2. The interview-based approaches often miss contacts and involve significant delays, while digital solutions can suffer from insufficient adoption rates or inadequate usage patterns. Here we present a novel way of modelling different contact tracing strategies, using a generalized multi-site mean-field model, which can naturally assess the impact of manual and digital approaches alike. Our methodology can readily be applied to any compartmental formulation, thus enabling the study of more complex pathogen dynamics. We use this technique to simulate a newly-defined epidemiological model, SEIR-T, and show that, given the right conditions, tracing in a COVID-19 epidemic can be effective even when digital uptakes are sub-optimal or interviewers miss a fair proportion of the contacts.

1 Introduction

1.1 Problem overview

The epidemic started in Wuhan, China by the SARS-CoV-2 virus has uncontrollably spread through communities from all around the world, rapidly becoming a major global threat, which was responsible for more than 237 million infections and 4.8 million deaths by October 2021 [1]. Prompted by the scale of this epidemic, cross-disciplinary teams started working against the clock to develop reliable pathogen spreading models that could be used to assess the effectiveness of different public health interventions. Since imposing a general lockdown has proven economically unbearable for most countries, the attention significantly shifted to less restrictive yet partially successful measures, such as educating the public to socially distance, deploying large-scale testing and quarantining contacts through various tracing mechanisms [2]. The latter proved rather challenging for the traditional interview-based approaches, mostly due to significant delays, staffing issues and a generally poor recollection exhibited by the interviewees. As a result, digital alternatives were quickly sought after by several governments. These were or are currently being deployed in many states, most of them being reliant on either a Bluetooth solution, such as the Exposure Notification (GAEN) system [3], or a geolocation-based software, similar to the Integrated Disease Surveillance Programme in India [4]. That being said, the efficiency of these strategies remains largely dependent on the application adoption rates and the behavioral patterns of their userbase (i.e. self-isolation compliance, respecting the usage guidance, keeping the tracing device turned on etc.). Although some have suggested an application uptake of at least 50% would be needed at the population level to contain the epidemic [5], others showed via simulations that 60% would be enough to stop the spread without requiring further interventions [6]. That being said, the adoption levels generally quoted in the literature as “sufficient” remain mostly unattainable due to privacy concerns and internet access limitations. The picture gets even more intricate when the aforementioned behavioral issues are widespread in the users’ communities or if inadequate testing regimes and manual tracing procedures are employed.

Motivated by the limited evidence we have of the efficacy exhibited by contact tracing methods in the face of such challenges, we developed a multi-site mean-field model that can simulate the joint effects of these variables on the evolution of an epidemic, and used it to study COVID-19 via a new disease-specific compartmental formulation—SEIR-T. Our methodology draws inspiration from the work of [7], but it enables the simulation of more varied scenarios involving digital tracing at different uptake levels r, manual tracing with various network overlaps Γ, or both procedures combined. Additionally, we propose separating the “traced” status from the infection states, thus allowing for a node to get isolated at all times (unless it has reached an end state, i.e. recovered or dead), while also ensuring self-isolation can end due to non-compliance or term expiration, all without impacting an individual’s standard disease progression. This feature also makes our approach directly compatible with any other compartmental model. As is customary, all our code was made publicly available (see S1 File).

The experiments we conducted confirm that the potency of contact tracing not only depends on the accuracy of the tracing network, but also on several other variables (i.e. testing rates, tracing reliability, staffing and delays, public-health communiqués, isolation conformity etc.), an optimal configuration of these given a country’s epidemiological situation being essential for a swift viral containment. Even when lower uptakes are registered (r < 0.4) or the interviewing process misses many contacts (Γ ≤ 0.5), our simulations suggest that significant reductions in the peak of infections and the total number of deaths can still be achieved given small tracing delays and the appropriate levels of testing and self-isolation compliance. What is more, the combined effects of manual and digital tracing can drive the effective reproduction number R below 1 even when neither is very efficient. We validate our results on numerous parameter configurations and several classic network topologies, including random Erdős–Rényi [8], scale-free [911], small-world [1113], and a real social network [14].

1.2 Related work

In recent years, modelling epidemics has mainly been achieved via either of two paradigms: agent-based (ABM) or equation-based models (EBM). The first represents a bottom-up approach in which a set of behaviors are attributed to each agent in a topological system. These behaviors dictate every individual’s action patterns through this topology, and ultimately determine the execution of different discrete interaction events (e.g. infection spreading, tracing notification broadcasting etc). ABMs tend to be relatively complex and resource-intensive to simulate, the involved cost being often justified by their exhibited level of granularity and their ability to monitor public interventions at the individual level [15]. Government-advising groups in the UK decided to employ this paradigm early on in the COVID-19 pandemic to estimate the effects of such interventions [16, 17]. A more recent Oxford study looked at the combined effects of manual tracing with digital solutions, at various application uptakes, via a rich yet scalable ABM fitted to mobility data from different counties in Washington [18]. We consider their findings the strongest modelling evidence to date that digital tracing can be effective even at low adoption rates.

On the other hand, EBMs define a set of equations that express the evolution of certain continuous observables over time. These generally represent system states (called compartments) showing how a disease progresses through a population. The SIR process, a widely-known EBM, utilizes three ordinary differential equations to model a generic epidemic [19]. Extensions of SIR were subsequently used to simulate the transmission of many pathogens, including Zika [20], Ebola [21], and most recently SARS-CoV-2—e.g. SIDARTHE [22], SUQC [23]. The present study employs a variation of the compartmental model designed by the French National Institute of Health and Medical Research (Inserm) to study the impact of lockdown exit strategies on the spread of COVID-19 [24].

At the intersection of these two paradigms lies the category of multi-site mean-field models which combine the mathematical rigour and the superior generalizability of EBMs with the ability to leverage locality information regarding every individual. Similarly to ABMs, the infection spreads over a predefined network that can either be random [25] or inferred from real data [7], yet unlike ABMs, the dynamics are fully characterized by state transition equations. Huerta and Tsimring first employed this technique for modelling contact tracing in a generic epidemic [26, 27]. Farrahi et al. took the idea a step further by restricting the tracing propagation to a subset of the infection network, thus accounting for the inherently noisy nature of this process [7]. Even though both of these exhibited powerful modelling capabilities, they were limited by their underlying compartmental formulation (SIRT) which made several unrealistic assumptions that do not generalize to real viral diseases: inter alia, the recovery was conditioned on tracing, susceptibles could not be wrongfully isolated, a traced person remained noninfectious for the full duration of the epidemic. Our modelling approach fixes these issues by separating the traced/isolated status from the infection state, therefore allowing for all the “active” nodes (i.e. not hospitalized, recovered or dead) to become traced or exit self-isolation after a certain amount of time without changing their corresponding disease progression. Concurrently, this modification enables one to simulate the effects of contact tracing, independently of the compartmental model used.

For the sake of completeness, we would also like to mention that branching process models for epidemics have become increasingly popular in the last few years [28, 29]. One such model, concerned with studying the effects of manual contact identification together with digital tracing solutions at various uptakes on the COVID-19 pandemic, has recently been proposed [30]. Simulations conducted with it show that effective manual tracing needs to be coupled with an application uptake of at least 75% to achieve containment, although smaller adoption rates can decrease the reproduction number R if combined with other public health interventions. Our results are in accordance with the latter observation, but they also show that, given the right testing and tracing regimes (including good self-isolation compliance), lower and achievable adoption levels are actually enough to significantly reduce the viral spread, subject to the social network’s connectivity patterns.

2 Materials and methods

2.1 Compartmental model outline

Motivated by growing evidence that simple SIR frameworks are inefficient at capturing the dynamics of SARS-CoV-2 epidemics [31], we developed a new compartmental model that accounts for many of its particular features. Each state transition represented in Fig 1 model is labeled with its corresponding time-dependent probability, an end configuration being reached when all non-susceptible nodes become either recovered (R) or dead (D). A description of the model parameters, together with the values we consider for each of them, can be consulted in Table 1.

Fig 1. The SEIR-T compartmental model for COVID-19.

Fig 1

Each node has 2 allocated variables: an infection state and a tracing status. The infection states from top to bottom are: S—susceptible; E—exposed but not infectious; Ip—infectious, presymptomatic; Ia—infectious, asymptomatic; Is—infectious, symptomatic; H—hospitalized; R—recovered / removed; D—dead. At any point in time, a node’s tracing status can either be T (traced and isolated) or N (not traced/isolated or non-compliant). Each state transition has a certain time-dependent probability pS1→S2; the edge labels here represent both pS1S2Δt, and the λ rate of the corresponding exponential to sample from in the continuous-time simulations.

Table 1. Compartmental model parameters.

Parameter Value(s) Description
β 0.0791 Transmission rate corresponding to R0 = 3.18. According to maximum likelihood estimation performed by [24].
K X R Function mapping nodes to the total number/weight of connections to neighboring nodes in state X ∈ {Ip, Ia, Is, T} for a given network.
r I 0.5 Relative infectiousness of Ip and Ia compared to Is. This is still disputed: ≈0.5 according to [24, 32], but weak evidence as per [33].
ϵ −1 3.7 Latency period, measured in days. Source: [24].
p a 0.2 / 0.5 Probability of being asymptomatic. This is still disputed: 0.2 used by [24, 34], but 0.5 according to [35, 36].
μp-1 1.5 Presymptomatic period, measured in days. Source: [37].
p h 0.1 Probability of being hospitalized for adults (can be considerably different for children/seniors). Equivalent to pss in [24].
γ −1 2.3 Infectious period considering the mean generation time 6.6 days. Source: [24].
λHR 0.083 Daily rate of recovery for adults (different for children/seniors). Source: [38].
λHD 0.0031 Daily rate of deaths for adults (different for children/seniors). Source: [38].
τ t [0–0.5] Contact tracing rate. Encompasses multiple related phenomena: the tracing latency/efficiency due to staffing/server reliability, depending on the type of tracing; the likelihood of remaining isolated given the number of traced neighbors. Ranges from no tracing (0) to every 2 days on average (0.5).
τ r (0–0.5] Testing / Random tracing rate. Ranges from almost no testing (0.001) to every 2 days on average (0.5).
r T 0.8 Relative probability for Ia to be tested positive (against Is). Assume testing E and Ip rarely happens or results in false negatives most of the time.
η 0 / 0.001 Non-compliance / Self-isolation exit rate. Scaled by the time elapsed since beginning the isolation: tcurrentttrace.

2.2 Network propagation mechanism

Our propagation model consists of a predefined network on which the infection spreads, and one subnetwork ascribed to each type of contact tracing (manual or digital). This mechanism allows for simulating either one tracing strategy in isolation (dual topology, example in Fig 2) or both in tandem (triad topology, Fig 3). Connected vertices in the true infection network are to be considered “close contacts”, as defined by institutions like the CDC [39].

Fig 2. Final state of an epidemic simulation over a dual topology.

Fig 2

Infection spreads with respect to the neighborhoods of the first network (here a SF graph); the second network corresponds to digital tracing at uptake r = 0.5.

Fig 3. Final state of an epidemic simulation over a triad topology.

Fig 3

Infection spreads with respect to the neighborhoods of the first network (here a SW graph); the second network corresponds to digital tracing at uptake r = 0.5, while the third involves manual tracing with overlap Γ = 0.5.

The tracing graphs are usually subset views of the true contacts network, where missing edges correspond to application misuse in the digital setting or contacts not recalled in the manual interviewing process, while isolated vertices are used to represent individuals that never run a government’s digital solution or are effectively unreachable. Be that as it may, people can at times overestimate the number or the duration of their social interactions [40], and thus it is possible that tracers are occasionally pursuing erroneous links. Even though our model can simulate “false” contacts, similarly to [7], we consider their occurrence quite rare during a global pandemic (and thus negligible), since the public health personnel is particularly well-trained and the general public is more attentive. We control the subsetting of the infection graph via two interlinked parameters: the degree of overlap Γ=K-ZremK and the uptake rate r=N-NutnN, variables which ultimately determine the values of Nutn and Nute (refer to Eq 1). To be more explicit, the inputted Γ and the infection network’s mean degree K are utilized to calculate Zrem, the average number of edges per node to get removed from a tracing view. The latter effectively corresponds to marking as untraceable Nute=N·Zrem2 of the edges in the interaction graph. Similarly, the selected r and the total number of nodes N are used to establish how many vertices are to be made completely untraceable in a particular tracing subnetwork: Nutn = N ⋅ (1 − r). This work showcases simulations in which the first of these two parameters describes the accuracy of manual tracing, whereas the second quantifies the adoption of a digital solution. That being said, our model supports exploring more complex scenarios, where both the overlap and the uptake can be varied for a single tracing view. A full description of the network-related variables involved in our modelling procedure can be consulted in Table 2.

Nutn=N·(1-r)Nute=N·Zrem2=N·K·(1-Γ)2 (1)

Table 2. Network-generation parameters.

Parameter Value(s) Description
N N Population size of the infection network.
K 10/20 Average degree of the infection network. We fix this for ER graphs.
m 10 Random edges to add for each new node in Holme-Kim networks [11].
p 0.2 Probability of completing a triangle after adding a random edge.
Γ [.1, 1] Degree of overlap between infection network and a tracing subgraph. Used to calculate Zrem, which in turn gives Nute (number of untraceable links).
r [.1, 1] Uptake rate (between infection network and a tracing subgraph). Used to calculate Nutn (number of untraceable nodes).

Throughout our experiments, we assume a “traced” individual (i.e. in state T) automatically enters self-isolation, so infecting or getting infected remains impossible until it becomes “non-isolating” (N). This can happen either legally (after 14 days) or unlawfully (with a probability of η scaled by the time elapsed since isolating). In addition, we presume that a node’s probability to get infected proportionately increases with the amount of infectious neighbors it has in the contacts network, while the likelihood of being traced and compliant with self-isolation recommendations is directly proportional to the number of adjacent T nodes it features in each tracing subnetwork.

2.3 Simulation overview

The baseline simulations in this study were run over Erdős–Rényi random graphs (ER), featuring different population sizes and average degrees. It is worth mentioning that, although the epidemiological literature has widely adopted it, this type of graph model tends to be unsuitable for capturing the interaction patterns of many real social networks [41]. In spite of this, Tsimring and Huerta concluded that the SIRT-induced epidemic dynamics stays “qualitatively similar” between ER and the empirically motivated class of small-world graphs, since the realizations of both these models feature a well-defined epidemic threshold [27]. This result should also hold for our framework, considering that we model tracing in a fairly analogous fashion. What is more, the ER’s inherent ability to accommodate the characteristics of randomly mixed populations [42] makes it an adequate vehicle for studying outbreaks in public places, such as stores or mass transit conveyances [18]. Random mixing models, in turn, were shown to offer acceptable estimates of the total epidemic size when the transmission probability is high or the infectious period is relatively small [43], conditions that are usually satisfied in the case of COVID-19 breakouts. Nonetheless, several experiments involving more realistic small-world (SW) and scale-free networks (SF) are the focus of a more detailed exploration in Secton 3.4. We note here that ABM simulations, mobility or contact tracing datasets could be utilized in conjunction with the configuration model to obtain even more accurate predictions for particular lifelike scenarios [44, 45], but these do not offer any generalization guarantee.

It is a known fact that the SARS-CoV-2 virus is an overdispersed infectious agent [46, 47], and like many other pathogens with a high epidemic potential [48], the disease diffusion is largely driven by “superspreading” events [49]. As such, SFs like the Barabási-Albert networks [9] tend to offer a sounder representation of the transmission chain since superspreaders can be adequately modelled as hubs in a specific social graph [50, 51], while the latter naturally arise as a result of the preferential attachment process that underpins SFs. On the other hand, SWs more closely resemble interactions in social networks due to their larger clustering coefficient, while clusters, in turn, have been shown to be an important catalyst of the COVID-19 pandemic [52]. We believe our modelling technique of preferentially sampling nodes with a higher traced neighbor count for undergoing quarantine to be similar in nature to the frequency-based contact tracing procedure employed in [51], and thus we expect superspreaders inside SFs to get swiftly targeted by our control framework, subject to the strength of the contact tracing rate. Moreover, as we mentioned earlier, the tracing-imbued epidemic dynamics over SWs is akin to that of ERs, and thus similar levels of tracing efficiency are to be expected for both these graph models. It is therefore sensible to assume that the modelling mechanism we employ remains suitable for assessing the effects of tracing over a broader range of network types (other than ERs).

In contrast to the above, Secton 3.5 investigates the effects of digital and manual tracing in a viral outbreak simulated over a real social network, representing a tightly-connected community of 74 students and graduates from MIT who agreed to have their location and interactions monitored via WLAN and Bluetooth scans over an entire academic year (detailed exploratory analyses of the dataset can be examined in [7, 14]). In our simulations, this dynamic network changes daily over a period of 31 weeks, its links being weighted by the aggregated number of Bluetooth proximities recorded between their corresponding corner points on each particular day. In the static settings presented thus far, KX represents a function mapping nodes to the total number of neighbors in state X ∈ {Ip, Ia, Is, T} (see Fig 1). To account for dynamic weights, however, all KX terms get replaced by a time-dependent function KtX, given by Eq 2, where Knorm = 10 is a normalization factor that ensures the average function value remains above 1, wtX(n) is the sum of edge weights incoming from those neighbors of node n which are in state X at time t, while < W > is the overall static average weight. The latter represents an average over days of the average total weight per node, calculated using Eq 3, where D = 216 is the number of days within the considered 31-week period, N = 74 is the number of nodes for which we have contacts data, and wt(n) is the total weighted degree of node n at time t (i.e. irrespective of state).

KtX(n)=Knorm·wtX(n)<W> (2)
<W>=tDnNwt(n)D·N (3)

In this work, the time intervals between two state changes of the same kind are assumed to form an exponential distribution, with the λ rate equal to the corresponding transition label displayed in Fig 1. Choosing this distribution for timing the infection propagation, in particular, keeps our approach in line with many previous epidemiological works relying on compartmental formulations [7, 53, 54], while also being in accordance with the findings of different cohort studies involving wearable tracking devices that reported roughly-exponential decays in their participants’ histogram of interactions [55, 56]. Similar cohort studies found heavier-tailed distributions based on power laws to be more compatible with the time intervals between successive interactions, citing the bursty nature of social dynamics as the determining factor [57, 58], yet the corresponding data fit was often imperfect while extensive comparisons against exponentials were not performed. In the epidemiological setting, several authors have argued for a shift towards more realistic and flexible Gamma (more commonly Erlang [59, 60]) or Weibull distributions [6163] for the infection waiting times, emphasizing the non-Markovian behavior that epidemics occasionally exhibit. That being said, exponentials have been shown to provide a particularly good fit to epidemiological data when the mean generation time is correctly fixed [59] or the mean infection duration is smaller [64]. Both of these conditions constitute sensible assumptions in our case.

For efficiency, we simulate the COVID-19 outbreaks using Gillespie’s algorithm [65], which has been shown to be stochastically exact to and faster than the Monte Carlo method (MC) for both static and dynamic network-based diffusion processes [66]. Compared to a continuous-time MC simulation, which entails sampling the next transition for all the possible state changes, discarding all but the most “recent” event [56], Gillespie’s procedure directly draws the time elapsed until the next transition and identifies the state change most likely to have taken place within that period. A detailed pseudocode for event sampling in our work is provided in Fig 4.

Fig 4. Pseudocode for event sampling in the SEIR-T model.

Fig 4

We adapt Gillespie’s algorithm for our network-based simulations, thus sampling the minimum event time directly. The list of rates is updated at each iteration only for the last updated node and its neighbors. The procedure returns an event dictionary which is then used to update the network states, neighbor counts and running statistics.

2.4 Metrics under consideration

Aside from scrutinizing the number of individuals in each compartment over time (please also refer to S1 File for more such evaluations), we assess the efficacy of different contact tracing strategies (Cθ;τt, under different τt) by looking at their achieved peak suppression (Psup) throughout all our simulations, thus comparing them against the corresponding no-tracing scenario (Cθ;τt = 0) in which all parameters θ (but τt) are left unchanged. Mathematically, this can be expressed through Eq 4, where Imax is a function mapping parameter configurations Cθ to the average peak of infections recorded across multiple runs.

Psup=Imax(Cθ;τt=0)-Imax(Cθ;τt) (4)

Since the inception of the COVID-19 pandemic, the majority of the literature on epidemiological modelling and public-health messages alike have scrutinized different nonpharmaceutical interventions in relation with their impact on R, the effective reproduction number [24, 67]. For the latter more realistic scenarios (i.e. Sections 3.4 and 3.5), we also estimate the R value after t = 7 days since t0. To do so, we input the recorded exponential growth rate λ to Eq 5, thus following the Wallinga and Lipsitch methodology [68]. The generation time distribution for our SARS-CoV-2 epidemics is assumed to be Gamma(α = 1.87, β = 0.28) [69], its moment-generating function being denoted with M(.). To calculate λ from the incidence rate c(t) recorded within the time window [t0, t0 + t], we use Eq 6 together with the initial number of infected c(t0).

R=1M(-λ)=1(1--λβ)-α=(1+λβ)α (5)
c(t)=c(t0)eλtλ=logc(t)-logc(t0)t (6)

3 Results and discussion

3.1 Variation induced by population size

Initial simulations using ER graphs suggested the degree of variability across runs scales with the number of nodes. In order to verify this hypothesis, we design an experiment in which we vary the population size: N ∈ {200, 500, 1000, 2000, 5000, 10000, 20000}, while keeping the other parameters fixed at: average degree K = 10, dual network tracing with uptake r = 0.5 (overlap Γ implicitly derived), asymptomatic probability pa = 0.2, contact tracing rate τr = 0.1, and testing rate τr = 0.1, with one infectious individual set for time t0. We note that, as τr = τt > β, contact tracing is expected to engulf the infection percolation in the limit. However, by choosing an uptake value considerably smaller than 1, we ensure our variance analysis remains significant since many of the randomly-generated tracing views end up producing a much slower discontinuation of otherwise quickly-contained infection cascades. This results in a high probability for finite outbreaks to occur during the early stages of the simulations (i.e. above the epidemic threshold for enough time).

The statistics in Fig 5 size represent averages over several simulations conducted with each of the 10 different network initializations picked by a random sampler, filtering out those iterations which registered less than three overall infected (for a total of 80–100 simulations overall per each value of N). The data confirm the variance in peaks of infection increases as the network expands, aspect which can be explained by the growing difference between early-stopped and full-blown outbreaks. In contrast, the uncertainty in estimating the relative percentage of these maximal points expectedly decays with the size (≈ according to 1N), a choice of N = 1000 resulting in a tolerable standard deviation of almost 3%, while N = 10000 leads to an even smaller variability of <1% across runs. Consequently, we consider these two values representative for our model’s expressive power given a randomly-mixed population, and, as such, we use them both in our experiments. We account for the corresponding difference in variances by simulating 7 different networks with 15 random seeds each for N = 10000, but 50 networks and 15 seeds in the case of N = 1000. We note this result may not hold in the case of structured populations, yet nevertheless we reproduce the latter of the two setups in the SF-SW experiment for consistency.

Fig 5. Uncertainty of simulation results with regard to the infection peak.

Fig 5

Values from 80–100 runs plotted for different population sizes, K = 10, τt = τr = 0.1, pa = 0.2. On top, boxplots with quartiles represented via whiskers, medians via red dotted lines, and averages via red diamonds; the standard deviations σ are given below. The left-hand side displays absolute values, whereas on the right all the variation levels are scaled down by N.

3.2 Tracing overlap in larger populations

Going forward, we want to assess the effect of varying a tracing network’s accuracy (i.e. overlap) in an outbreak involving a large community of N = 10000 individuals. To achieve this, we use the following parameter configuration: an average degree K = 10, dual network tracing with overlap Γ ∈ {0.11, 0.22, …, 1} (uptake r is implicitly derived), asymptomatic probability pa = 0.2, a tracing rate τt ∈ {0.01, 0.04, 0.07, 0.1}, a testing rate τr ∈ {.001, 0.01, 0.04, 0.07, 0.1}, and a non-compliance rate η = 0 (assuming everybody self-isolates until they are no longer infectious), with a single Ip node sampled at time t0. The resulting statistics get averaged over 105 runs, as previously described.

Fig 6 shows that a sub-optimal test rate, such as τr = 0.001, leads to inconclusive results, where the variance induced by the stochasticity of the process shadows any benefit obtained through contact tracing. With better testing, clearer patterns start to emerge: The higher the contact tracing rate, the better the peak suppression is and the faster it gets approached (see Fig 7). As τr becomes even more effective, smaller tracing network overlaps are needed to swiftly reduce that maximum point. Looking at the tracing rate, a moderate value of τt ∈ {0.04, 0.07} achieves a delay in the peak for smaller Γ, but this can occasionally lead to a prolonged epidemic, especially for overlaps in the “noise” region like Γ = 0.11, since initially-uninfected regions may get incorrectly traced, so the epidemic has the chance to gain momentum once those individuals exit self-isolation. In contrast, noticeable reductions with no such side effect can be observed for Γ ≥ 0.5. On the other hand, a small value of τt = 0.01 seems unable to produce a positive outcome. In real life, the latter scenario would occur if the tracing programme was very slow, missing too many contacts as a result, or if the digital contacts application failed to promptly notify many of its active users. Another noteworthy occurrence in Fig 7 is the bimodality of some of the curves. This effect has been previously noted for larger tracing rates and overlaps [7], being a rare artefact of fast incidence reductions that cannot be sustained by a τt < β any further.

Fig 6. ER network—Peak suppression (left) and the time of peak (right) at various tracing network overlaps.

Fig 6

Values are averaged over 105 runs, representing results for N = 10000, K = 10, pa = 0.2. The suppression is calculated by subtracting the average maximal infected point given by each parameter configuration from the average point obtained with no contact tracing (τt = 0). Apart from τr = 0.001 and τt = 0.01 which produce inconclusive results that we regard as noise, the effectiveness of an epidemic containment strategy expectedly scales with the testing and the tracing rates.

Fig 7. ER network—Epidemic evolution over time given a less efficient (left) and a more effective (right) testing regime.

Fig 7

Results averaged over 105 simulations, obtained for N = 10000, K = 10, pa = 0.2. As the contact tracing rate increases, the accuracy of the network given by Γ becomes more important for “flattening” the curves. The case with no contact tracing (τt = 0) is colored in black.

Aside from outlining the effects of different testing strategies and tracing network overlaps, this experiment also hints at which parts of τr’s and τt’s parameter spaces are more relevant for exploration. To aid our search, we plot heatmaps of these parameter’s achieved peak suppression for different levels of overlap (see Fig 8), and observe, as a result, that significant outcomes (i.e. distinguishable from simulation noise for Γ ≈ 0.5 and beyond) are obtained when τr ≥ 0.1 and τt ≥ 0.04, while values ≥0.1 should fall within the “adequate” region of a large spectrum of Γ values.

Fig 8. ER network—Heatmaps of achieved peak suppression for different testing and tracing rates.

Fig 8

N = 10000, K = 10, pa = 0.2. Averaged over 105 runs.

3.3 Effects of average degree and app uptake

Further, we analyse the impact of the application uptake in scenarios with different average degrees (K ∈ 10, 20), and more appropriate testing and tracing strategies—i.e. τt, τr ∈ {0.05, 0.1, 0.2, 0.5}. For this trial, we set N = 1000, the asymptomatic probability to pa = 0.2, and the non-compliance rate to η = 0.001 (with automatic isolation exit after 14 days), selecting a single Ip node as the infection seed. The results are averaged over 750 simulations to reduce the variance induced by the smaller N.

Fig 9 shows the peak suppression achieved by each strategy given a specific adoption level. For τt = 0.05, uptakes r ≤ 0.5 generally give results within the noise region. Improving the contact tracing rate, however, leads to a noticeable decrease of this maximal point, even at smaller adoption levels. This is particularly true in the larger average degree case. Interestingly, deploying a wider-scale testing programme alone (τr = 0.5) seems to lead to a considerable spread reduction which makes contact tracing less beneficial at achievable uptakes (even entirely profitless in the K = 10 situation).

Fig 9. ER network—Uptake rate r against peak suppression.

Fig 9

Suppression is difference in peak to no tracing, i.e. τt = 0. N = 1000, pa = 0.2, η = .001. K = 10 given on the left, K = 20 on the right. The case with no contact tracing (τt = 0) is colored in black. All lines were plotted with the 95% confidence intervals resulted from 750 runs.

Our findings suggest that a testing rate of τr = 0.1 remains suitable in conjunction with contacts isolation not only for the previous experiment with N = 10000, but also in these smaller scale scenarios featuring different average degrees. Consequently, we decided to examine further the effect of such a testing regime on the evolution of the spread (Fig 10), the number of total deaths (Fig 11) and hospitalizations (S1 Fig in S1 File) for K = 20. The first chart below illustrates how the epidemic curves significantly “flatten” for uptakes r ≥ 0.4, the effect being more apparent as the contact tracing rate increases. The second diagram puts these results into perspective by showing that the number of deaths can be reduced even with lower uptakes, while at the other end of the spectrum many simulations have ended with notably fewer deceased (even none for an effective tracing τt ≥ 0.2).

Fig 10. ER network—Epidemic evolution over time for τr = 0.1 N = 1000 and K = 20.

Fig 10

Results averaged over 750 runs. The case with no tracing (τt = 0) is colored in black.

Fig 11. ER network—Total deaths over time for τr = 0.1 N = 1000 and K = 20.

Fig 11

The 95% confidence intervals resulted from 750 runs are displayed around each line. The case with no contact tracing (τt = 0) is colored in black.

Interestingly, these figures indicate that a higher uptake does not always guarantee a better epidemic outcome (e.g. r = 1 ends up with a higher peak than r = 0.8 in the case of τr = 0.1 and τt ≥ 0.2). This is a direct consequence of isolating too many susceptibles early on in the outbreak (scenario similar to a partial lockdown), making their eventual self-isolation exit an unpredictable impact factor for the transmission chain.

3.4 Combining digital tracing with an imperfect manual tracing process

In this section, we study a more realistic scenario in which digital solutions complement an inherently imperfect interview-based tracing system. To that end, a triad network topology is employed, with digital tracing happening at a rate of 1τt days on average, over one subgraph given by the uptake r ∈ {0.1, 0.25, 0.4, 0.55, 0.7, 0.85, 1}, while the manual process gets carried at a slower pace of 2+1τt days on average, over a third network view whose edges have been randomly removed according to the degree of overlap Γ ∈ {0.1, 0.25, 0.4, 0.55, 0.7, 0.85, 1}. For the purpose of this experiment, we make use of a more representative graph structure for the SARS-CoV-2 transmission based on the Holme and Kim (HK) model [11], which features both a SF degree distribution and a SW clustering coefficient. The network parameters chosen here are: N = 1000, m = 10 (number of random edges to add for each new node; this replaces K in Eq 1 for calculating Nute) and p = 0.2 (probability of making a triangle after adding a random edge). To avoid runs in which the epidemic gets quickly contained by chance, the simulation starts with 10% of the nodes in the Ip state—c(t0) = 10% of N. The other parameters remain unchanged from the previous section, including the number of total runs.

The first aspect to notice in Figs 12 and 13 is that all curves remain monotonic with respect to r, while the dissimilarities between different τt contact rates become more apparent than what could be observed in the preceding experiment. This is a direct consequence of the increased number of infected people selected for time t0, which prevents simulations from averaging over too many early-stopped runs. Considering the scale this pandemic has reached and the unavoidable presence of a delay between the infection onset and the debut of tracing, scenarios such as this one are more likely to occur, and therefore of a greater interest [17, 70].

Fig 12. HK network—Uptake rate r against peak suppression.

Fig 12

Suppression is difference in peak to no tracing, i.e. τt = 0. The results here correspond to a Holme-Kim network with N = 1000, m = 10, p = 0.2, pa = 0.2, η = .001. On the left, we have a scenario in which only digital tracing was conducted, whereas the next 3 columns represent simulations with a combination of digital tracing on a second network, and manual tracing over a third network with various overlaps: 0.1, 0.55, 1. The 95% confidence intervals are displayed. The case with no tracing (τt = 0) is colored in black.

Fig 13. HK network—Uptake rate r against the effective reproduction number R.

Fig 13

Suppression represents the difference to no tracing, i.e. τt = 0. The results here correspond to a Holme-Kim network with N = 1000, m = 10, p = 0.2, pa = 0.2, η = .001. On the left, we have a scenario in which only digital tracing was conducted, whereas the next columns represent simulations with a combination of digital tracing on a second network, and manual tracing over a third network with various overlaps: 0.1, 0.55, 1. The case with no contact tracing (τt = 0) is colored in black.

Fig 12 shows the degree of peak suppression achieved by utilizing digital and manual tracing solutions when compared to a scenario in which no contact tracing was performed. These results suggest that, as the efficacy of the interview-based process increases (i.e. less contacts get missed), lower and achievable application adoption rates (20–50%) are sufficient to effectively reduce the maximal point of the epidemic. When the tracers are eventually able to “see” the full network of contacts (Γ = 1), varying r no longer impacts the spread significantly, as should be expected. In contrast, a very good testing regime (τr ≥ 0.2) can partially compensate for an inefficient manual tracing system (Γ = 0.1) within the aforementioned uptake range.

Our estimate of R = 3.20 for minimal interventions (i.e. τr = 0.05 and no tracing) during this scenario’s first week falls within the confidence interval of the basic reproduction number R0 ∈ [3.09, 3.24] derived in Di Domenico et al. [24] by applying the next-generation approach [71] on a model fairly similar to ours. Fig 13 demonstrates that with good testing regimes (τr ≥ 0.1) and a reasonable manual tracing in place (Γ ≥ 0.5), achievable uptake levels are enough to limit this R to a value close to 1. In contrast, digital tracing alone fails to significantly reduce the spread unless both the testing and the adoption rates are very high. Similarly to what could be observed in Fig 12, uptakes play a minor role in the infection proliferation if tracers are able to track the whole contact network eventually, yet this scenario is rather unlikely in real life. Interestingly, most of the other trends outlined in the peak suppression charts are faithfully mirrored by the evolution of R in the first week of the simulation. This reinforces the fact that efficient contact tracing in the early stages of an outbreak is essential for containing a virus like SARS-CoV-2 [72].

Even though peak suppression remains a good metric for assessing the benefits of public interventions, policy makers are more often interested in what combinations of these measures can quickly bring R to acceptable levels. In light of this, we plotted the contour lines of the R values produced by various degrees of interview-based network overlaps, testing and digital tracing adoption rates (see Fig 14, but also S6 Fig in S1 File). With an estimated uptake of around 40% in Finland and Ireland, 30% in the UK, or 27% in Germany and Norway at the time of writing [73], an effective testing regime (τr ≥ 0.2) coupled with an efficient contact tracing rate (τt = 0.5) can drive R below 1 even when tracers miss up to half the contacts (Γ ≥ 0.5). Should this adoption improve to 50%, the aforementioned effect would be obtained with a testing rate half as good. In contrast, a moderate tracing rate only becomes effective if a large-scale testing programme gets deployed (τr = 0.5) or bigger uptakes are achieved within a population (r > 50%). We note the quoted uptakes were approximated using the total number of application downloads, therefore due diligence should be exercised when interpreting these statistics because new downloads do not always convert to new active users.

Fig 14. HK network—Contour plots of R based on the level of manual tracing overlap Γ and digital tracing uptake r.

Fig 14

The results here are for a Holme-Kim network with N = 1000, m = 10, p = 0.2, pa = 0.2, η = .001. Each line represents a different testing level τr, while the columns correspond to a moderate (left) and efficient (right) average level of tracing engagement and isolation compliance given by τt.

3.5 Contact tracing efficiency in a real social network

Lastly, we evaluate the ability of digital tracing to curb an epidemic simulated over a real social network, in the presence or the absence of manual contact tracing. In this scenario, both the population size N and the average degree K are data-driven, with the latter also changing dynamically (N = 74, Kt0=5.62 at time t0). Given that the network represents a tightly-knit community (static average degree Kstatic > 60), we investigate a broader range of testing and tracing rates: τr ∈ {0.1, 0.2, 0.5, 1, 1.5}, τt ∈ {0.1, 0.2, 0.5, 1, 1.5, 2}. The uptakes r, the overlaps Γ, and the initial incidence c(t0) are left unchanged from the last passage, while the relative delay between digital and manual tracing is kept at 2 days on average. The probability of becoming an asymptomatic case following exposure is fixed at pa = 0.2 for the purpose of our initial discussion, but a comparison to the case in which pa = 0.5 can be consulted at the end of this section.

The first thing to note about both Figs 15 and 16 is that each presents qualitatively similar trends to their counterpart figures from the previous experiment (i.e Figs 12 and 13, respectively). Namely, the better the testing and tracing rates are, the higher the benefit. At the same time, lower uptakes, in conjunction with an adequate overlap Γ ≥ 0.5, consistently achieve significant peak reductions, driving the R estimate of the first week below 1.5, even when the testing rate is smaller than 0.5. In contrast, with higher uptake values, the degree of overlap becomes less relevant for the epidemic outcome. Interestingly, the benefits of increasing the tracing effort τt beyond the value of 1 remain minimal across parameter configurations, and therefore we restrict further analyses to the range [0.1, 1].

Fig 15. Social evolution—Uptake rate r against peak suppression.

Fig 15

Suppression is difference in peak to no tracing, i.e. τt = 0. The results here correspond to the real Social Evolution network, dynamic over the studied period of 31 weeks, pa = 0.2, η = .001. On the left, we have a scenario in which only digital tracing was conducted, whereas the next 3 columns represent simulations with a combination of digital tracing on a second network, and manual tracing over a third network with various overlaps: 0.1, 0.55, 1. The 95% confidence intervals are displayed. The case with no tracing (τt = 0) is colored in black.

Fig 16. Social evolution—Uptake rate r against the effective reproduction number R.

Fig 16

Suppression represents the difference to no tracing, i.e. τt = 0. The results here correspond to the data-driven Social Evolution network, pa = 0.2, η = .001. On the left, we have a scenario in which only digital tracing was conducted, whereas the next columns represent simulations with a combination of digital tracing on a second network, and manual tracing over a third network with various overlaps: 0.1, 0.55, 1. The case with no contact tracing (τt = 0) is colored in black.

Fig 17 presents the 2D contours of the estimated R value (during the initial 7 days of the simulation) for the whole range of parameters. When comparing these results to Fig 14, we can see that a significantly faster testing strategy would be needed in this case to swiftly contain the epidemic and force R < 1. This is a consequence of dealing with an outbreak in such a lively and highly-interactive community, where the virus spreads too rapidly to afford testing at a lower rate than 0.5 (or even 1 in some cases) if the objective is to keep R subunitary. Similarly, an efficient τt ≥ 0.5 is needed for achievable uptakes to attain (or be close to) the aforementioned goal. Where limited public health resources are available, locking down or restricting the movements within such hubs is therefore recommendable.

Fig 17. Social evolution—Contour plots of R based on the level of manual tracing overlap Γ and digital tracing uptake r.

Fig 17

The results here correspond to the real Social Evolution network, dynamic over the studied period of 31 weeks, pa = 0.2, η = .001. Each line represents a different testing level τr, while the columns showcase a less efficient (far left), a moderate (center-left), an efficient (center-right) and a very efficient (far right) average level of tracing engagement and isolation compliance given by τt.

Finally, we investigate whether the efficacy of tracing appreciably changes when different pa values are considered. An asymptomatic node is assumed to be less infectious—rI = 0.5, but also less likely to get tested positive—rT = 0.8, so the epidemic dynamics should significantly differ when varying this probability. Remarkably, however, we observe the benefits of contact tracing do not fluctuate across the two studied values in the majority of the scenarios under scrutiny (see Figs 18 and 19). As shown in Fig 19, the most apparent differences in R were recorded when less accurate tracing networks (Γ, r < 0.5) and less effective testing rates (τr < 0.5) were employed. That is to say a suboptimal “test and trace” policy leads to more people getting infected when pa = 0.2, yet this higher rate of infectiousness can be offset by the smaller likelihood of nodes testing positive in the pa = 0.5 scenario, ultimately leading to minimal dissimilarities for the more adequate policies.

Fig 18. Social evolution—Uptake rate r against peak suppression, for different pa values.

Fig 18

Suppression is difference in peak to no tracing, i.e. τt = 0. The results here correspond to the real Social Evolution network, dynamic over the studied period of 31 weeks, η = .001, and either pa = 0.2 (on the left of each pair) or pa = 0.5 (on the right of each pair). The left-quadrant pairs represent a triad network scenario with manual overlap Γ = 0.1, while the right quadrant showcases Γ = 0.5. The 95% confidence intervals are displayed. The case with no tracing (τt = 0) is colored in black.

Fig 19. Social evolution—Contour plots of R based on the level of manual tracing overlap Γ and digital tracing uptake r, for different pa values.

Fig 19

The results here correspond to the data-driven Social Evolution network when η = .001, with each pair of charts describing pa = 0.2 on the left and pa = 0.5 on the right. Each line represents a different testing level τr, while the columns showcase combinations of one pa value together with a level of tracing effort given by τt.

4 Conclusions and future work

This paper demonstrated how a novel methodology for modelling the effects of different “test and trace” strategies can be applied to study the transmission dynamics of a complex viral epidemic, such as COVID-19. Following a comprehensive analysis of the model’s parameters, the procedures described here can be utilised to predict how the SARS-CoV-2 virus would spread through those communities where some indication of the interview-based network overlap and/or the digital tracing uptake exists. To facilitate such endeavors, we made our entire codebase open-source (refer to S1 File).

The approach we propose can address from a modelling perspective four of the open questions formulated by Anglemyer et al. in their Cochrane Review [74]: the combined effects of digital and manual tracing can be studied via the triad network topology, populations with poor access to the internet may be factored in by the degree of overlap Γ, individuals that have privacy concerns or accessibility issues can be represented in the system via the application adoption rate r, while the ethical and economical repercussions of balancing false positives and false negatives of tracing can be assessed through the statistics our simulations readily capture (for more details, consult S2 and S3 Figs in S1 File). Consequently, the model we put forward is already powerful enough to answer a large spectrum of research and policy-related questions.

The simulations we conducted show that digital tracing remains a viable solution for reducing the peak of an outbreak, as well as the effective reproduction number R, even when its adoption levels are lower. At the same time, a less efficient interview-based process, which misses up to half the contacts, can still contain the spread if coupled with 30–40% application uptakes and large-scale testing regimes. For highly-connected communities, the latter condition becomes even more essential for swift containment. The peak reduction seems ubiquitously tied to how fast the tracing is conducted, as well as how impactful the public-health messages are in making the involved communities more compliant with the self-isolation recommendations, as soon as more and more of each individual’s contacts get traced and isolated (aspects encompassed in the τt rate).

We would like to emphasize that the parameter ranges under scrutiny in this study are by no means exhaustive. Therefore, we leave for future exploration studying the effects of extensively varying the average degrees of the random networks, the non-isolation rates, the initial infections, or the variant-specific asymptomatic and hospitalization probabilities. Looking at such diverse scenarios would allow one to better estimate the shortfalls of contact tracing when different variants of concern are circulating, discover factors that may have introduced significant inefficiencies into the strategies adopted by many countries (e.g. higher non-compliance [75]), while also ensuring the variability induced by early-stopped simulations is curbed.

Next, we envision leveraging several mobility datasets in subsequent endeavors to infer a broader range of network structures, and derive time-dependent estimates of the transmission rate, as previously described in Liu et al. [60]. Other parameters in our model could be tailored to the epidemiological situation of different countries by fitting them to governmental data reporting on the number of COVID-19 deaths registered within each region of interest.

Finally, the random nature of testing and deriving static tracing views in this study may not provide the most realistic setup. Strategically targeting mass-testing campaigns to hubs or dynamically intensifying tracing efforts in highly-affected regions could significantly improve the outcome of an outbreak. To finely control the network dynamics in such an informed fashion, we expect future studies to utilize a combination of graph neural networks and gradient-based reinforcement learning techniques, possibly leveraging the setup recently proposed by Meirom et al. for prioritizing the viral testing allocation [45].

Supporting information

S1 File. Supporting material.

Contains a link to the repository that maintains our open-source model, further discussions on other epidemic statistics we captured, and more charts illustrating the effects induced by varying the contact tracing parameters.

(PDF)

Acknowledgments

We would like to extend our gratitude to Professor Niranjan Mahesan for his novel ideas, as well as his continuous effort and support throughout this project. We also acknowledge the use of the IRIDIS High Performance Computing Facility, and its support services at the University of Southampton, in the completion of this work.

Data Availability

All relevant data are to be found within the paper and the Supporting information files.

Funding Statement

AR is an UKRI-funded PhD student. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.WHO. WHO Coronavirus Disease (COVID-19) Dashboard; 2021. Available from: https://covid19.who.int [cited 2021-10-13].
  • 2. Dighe A, Cattarino L, Cuomo-Dannenburg G, Skarp J, Imai N, Bhatia S, et al. Response to COVID-19 in South Korea and Implications for Lifting Stringent Interventions. BMC Medicine. 2020;18(1):321. doi: 10.1186/s12916-020-01791-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Google, Apple. Exposure Notifications API | Google API for Exposure Notifications; 2020. Available from: https://developers.google.com/android/exposure-notifications/exposure-notifications-api [cited 2021-10-13].
  • 4. Garg S, Bhatnagar N, Gangadharan N. A Case for Participatory Disease Surveillance of the COVID-19 Pandemic in India. JMIR Public Health and Surveillance. 2020;6(2):e18795. doi: 10.2196/18795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ferretti L. Quantifying Dynamics of SARS-CoV-2 Transmission Suggests That Epidemic Control Is Feasible through Instantaneous Digital Contact Tracing; 2020. Available from: https://perso.math.univ-toulouse.fr/cattiaux/files/2020/04/slides_COVID19AppBasedContactTracing_Ferretti_30min.pdf [cited 2021-07-13].
  • 6.Hinch R, Probert W, Nurtay A, Kendall M, Wymant C, Hall M, et al. Digital Contact Tracing Can Slow or Even Stop Coronavirus Transmission and Ease Us out of Lockdown; 2020. Available from: https://www.research.ox.ac.uk/Article/2020-04-16-digital-contact-tracing-can-slow-or-even-stop-coronavirus-transmission-and-ease-us-out-of-lockdown [cited 2020-10-28].
  • 7. Farrahi K, Emonet R, Cebrian M. Epidemic Contact Tracing via Communication Traces. PLoS ONE. 2014;9(5):e95133. doi: 10.1371/journal.pone.0095133 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Erdös P, Rényi A. On Random Graphs I. Publicationes Mathematicae Debrecen. 1959;6:290. [Google Scholar]
  • 9. Barabási AL, Albert R. Emergence of Scaling in Random Networks. Science. 1999;286(5439):509–512. doi: 10.1126/science.286.5439.509 [DOI] [PubMed] [Google Scholar]
  • 10.Barabási AL. Network Science Random Networks; 2015. Available from: https://barabasi.com/f/624.pdf.
  • 11. Holme P, Kim BJ. Growing Scale-Free Networks with Tunable Clustering. Physical Review E. 2002;65(2):026107. doi: 10.1103/PhysRevE.65.026107 [DOI] [PubMed] [Google Scholar]
  • 12. Watts DJ, Strogatz SH. Collective Dynamics of ‘Small-World’ Networks. Nature. 1998;393(6684):440–442. doi: 10.1038/30918 [DOI] [PubMed] [Google Scholar]
  • 13. Wang XF, Chen GR. Complex Networks: Small-World, Scale-Free and Beyond. IEEE Circuits and Systems Magazine. 2003;3(1):6–20. doi: 10.1109/MCAS.2003.1228503 [DOI] [Google Scholar]
  • 14. Madan A, Cebrian M, Moturu S, Farrahi K, Pentland AS. Sensing the “Health State” of a Community. IEEE Pervasive Computing. 2012;11(4):36–45. doi: 10.1109/MPRV.2011.79 [DOI] [Google Scholar]
  • 15.Sukumar SR, Nutaro JJ. Agent-Based vs. Equation-Based Epidemiological Models: A Model Selection Case Study. In: 2012 ASE/IEEE International Conference on BioMedical Computing (BioMedCom); 2012. p. 74–79.
  • 16.Ferguson N, Laydon D, Nedjati Gilani G, Imai N, Ainslie K, Baguelin M, et al. Report 9: Impact of Non-Pharmaceutical Interventions (NPIs) to Reduce COVID19 Mortality and Healthcare Demand. Imperial College London; 2020. Available from: http://spiral.imperial.ac.uk/handle/10044/1/77482 [cited 2020-11-21].
  • 17.Hinch R, Probert W, Nurtay A, Kendall M, Wymant C, Hall M, et al. Effective Configurations of a Digital Contact Tracing App: A Report to NHSX; 2020. Available from: https://cdn.theconversation.com/static_files/files/1009/Report_-_Effective_App_Configurations.pdf?1587531217 [cited 2021-10-13].
  • 18. Abueg M, Hinch R, Wu N, Liu L, Probert W, Wu A, et al. Modeling the Effect of Exposure Notification and Non-Pharmaceutical Interventions on COVID-19 Transmission in Washington State. npj Digital Medicine. 2021;4(1):1–10. doi: 10.1038/s41746-021-00422-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Kermack WO, McKendrick AG, Walker GT. A Contribution to the Mathematical Theory of Epidemics. Proceedings of the Royal Society of London Series A, Containing Papers of a Mathematical and Physical Character. 1927;115(772):700–721. doi: 10.1098/rspa.1927.0118 [DOI] [Google Scholar]
  • 20. Aik LE, Kiang LC, Hong TW, Abu MS. The SIR Model of Zika Virus Disease Outbreak in Brazil at Year 2015. AIP Conference Proceedings. 2017;1847(1):020004. doi: 10.1063/1.4983859 [DOI] [Google Scholar]
  • 21. Berge T, Lubuma JMS, Moremedi GM, Morris N, Kondera-Shava R. A Simple Mathematical Model for Ebola in Africa. Journal of Biological Dynamics. 2017;11(1):42–74. doi: 10.1080/17513758.2016.1229817 [DOI] [PubMed] [Google Scholar]
  • 22. Giordano G, Blanchini F, Bruno R, Colaneri P, Di Filippo A, Di Matteo A, et al. A SIDARTHE Model of COVID-19 Epidemic in Italy. Nature Medicine. 2020;26(6):855–860. doi: 10.1038/s41591-020-0883-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Zhao S, Chen H. Modeling the Epidemic Dynamics and Control of COVID-19 Outbreak in China. Quantitative Biology (Beijing, China). 2020; p. 1–9. doi: 10.1007/s40484-020-0199-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Di Domenico L, Pullano G, Sabbatini CE, Boëlle PY, Colizza V. Impact of Lockdown on COVID-19 Epidemic in Île-de-France and Possible Exit Strategies. BMC Medicine. 2020;18(1):240. doi: 10.1186/s12916-020-01698-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Rozhnova G, Nunes A. SIRS Dynamics on Random Networks: Simulations and Analytical Models. In: Zhou J, editor. Complex Sciences. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering. Berlin, Heidelberg: Springer; 2009. p. 792–797. [Google Scholar]
  • 26. Huerta R, Tsimring LS. Contact Tracing and Epidemics Control in Social Networks. Physical Review E. 2002;66(5):056115. doi: 10.1103/PhysRevE.66.056115 [DOI] [PubMed] [Google Scholar]
  • 27. Tsimring LS, Huerta R. Modeling of Contact Tracing in Social Networks. Physica A: Statistical Mechanics and its Applications. 2003;325(1):33–39. doi: 10.1016/S0378-4371(03)00180-8 [DOI] [Google Scholar]
  • 28. Jacob C. Branching Processes: Their Role in Epidemiology. International Journal of Environmental Research and Public Health. 2010;7(3):1186–1204. doi: 10.3390/ijerph7031204 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Lashari AA, Trapman P. Branching Process Approach for Epidemics in Dynamic Partnership Network. Journal of Mathematical Biology. 2018;76(1):265–294. doi: 10.1007/s00285-017-1147-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Plank MJ, James A, Lustig A, Steyn N, Binny RN, Hendy SC. Potential Reduction in Transmission of COVID-19 by Digital Contact Tracing Systems. Infectious Diseases (except HIV/AIDS); 2020. 10.1101/2020.08.27.20068346. Available from: http://medrxiv:org/lookup/doi/10:1101/2020:08:27:20068346 [cited 2020-11-15]. [DOI]
  • 31. Moein S, Nickaeen N, Roointan A, Borhani N, Heidary Z, Javanmard SH, et al. Inefficiency of SIR Models in Forecasting COVID-19 Epidemic: A Case Study of Isfahan. Scientific Reports. 2021;11(1):4725. doi: 10.1038/s41598-021-84055-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Grassly NC, Pons-Salort M, Parker EPK, White PJ, Ferguson NM, Ainslie K, et al. Comparison of Molecular Testing Strategies for COVID-19 Control: A Mathematical Modelling Study. The Lancet Infectious Diseases. 2020;0(0). doi: 10.1016/S1473-3099(20)30630-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. McEvoy D, McAloon C, Collins A, Hunt K, Butler F, Byrne A, et al. Relative Infectiousness of Asymptomatic SARS-CoV-2 Infected Persons Compared with Symptomatic Individuals: A Rapid Scoping Review. BMJ open. 2021;11(5):e042354. doi: 10.1136/bmjopen-2020-042354 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Mizumoto K, Kagaya K, Zarebski A, Chowell G. Estimating the Asymptomatic Proportion of Coronavirus Disease 2019 (COVID-19) Cases on Board the Diamond Princess Cruise Ship, Yokohama, Japan, 2020. Eurosurveillance. 2020;25(10). doi: 10.2807/1560-7917.ES.2020.25.10.2000180 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Keller J, Liyanage SE, Hingorani M, Hingorani A. Probability of Encountering Covid-19 Patients Based on Prevalence and Testing during Resumption of Ophthalmology Services. Eye. 2020;July:1–2. doi: 10.1038/s41433-020-1089-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Oran DP, Topol EJ. Prevalence of Asymptomatic SARS-CoV-2 Infection. Annals of Internal Medicine. 2020;173(5):362–367. doi: 10.7326/M20-3012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Ferretti L, Wymant C, Kendall M, Zhao L, Nurtay A, Abeler-Dörner L, et al. Quantifying SARS-CoV-2 Transmission Suggests Epidemic Control with Digital Contact Tracing. Science. 2020;368. doi: 10.1126/science.abb6936 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Etalab. Données Hospitalières Relatives à l’épidémie de COVID-19; 2020. Available from: https://www.data.gouv.fr/en/datasets/donnees-hospitalieres-relatives-a-lepidemie-de-covid-19/ [cited 2020-05-25].
  • 39.CDC. Health Departments; 2020. Available from: https://www.cdc.gov/coronavirus/2019-ncov/php/contact-tracing/contact-tracing-plan/appendix.html [cited 2021-10-13].
  • 40. Mastrandrea R, Fournet J, Barrat A. Contact Patterns in a High School: A Comparison between Data Collected Using Wearable Sensors, Contact Diaries and Friendship Surveys. PLOS ONE. 2015;10(9):e0136497. doi: 10.1371/journal.pone.0136497 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Newman MEJ, Watts DJ, Strogatz SH. Random Graph Models of Social Networks. Proceedings of the National Academy of Sciences. 2002;99(suppl 1):2566–2572. doi: 10.1073/pnas.012582999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Keeling MJ, Eames KTD. Networks and Epidemic Models. Journal of The Royal Society Interface. 2005;2(4):295–307. doi: 10.1098/rsif.2005.0051 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Smieszek T, Fiebig L, Scholz RW. Models of Epidemics: When Contact Repetition and Clustering Should Be Included. Theoretical Biology and Medical Modelling. 2009;6(1):11. doi: 10.1186/1742-4682-6-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Newman M. Networks: An Introduction. Oxford University Press; 2010. Available from: https://oxford.universitypressscholarship.com/view/10.1093/acprof:oso/9780199206650.001.0001/acprof-9780199206650 [cited 2021-06-22].
  • 45.Meirom E, Maron H, Mannor S, Chechik G. Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks. In: Proceedings of the 38th International Conference on Machine Learning. PMLR; 2021. p. 7565–7577. Available from: https://proceedings.mlr.press/v139/meirom21a.html.
  • 46. Endo A, Abbott S, Kucharski AJ, Funk S. Estimating the Overdispersion in COVID-19 Transmission Using Outbreak Sizes Outside China. Wellcome Open Research. 2020;5. doi: 10.12688/wellcomeopenres.15842.3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Adam DC, Wu P, Wong JY, Lau EHY, Tsang TK, Cauchemez S, et al. Clustering and Superspreading Potential of SARS-CoV-2 Infections in Hong Kong. Nature Medicine. 2020;26(11):1714–1719. doi: 10.1038/s41591-020-1092-0 [DOI] [PubMed] [Google Scholar]
  • 48. Lloyd-Smith JO, Schreiber SJ, Kopp PE, Getz WM. Superspreading and the Effect of Individual Variation on Disease Emergence. Nature. 2005;438(7066):355–359. doi: 10.1038/nature04153 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Lewis D. Superspreading Drives the COVID Pandemic—and Could Help to Tame It. Nature. 2021;590(7847):544–546. doi: 10.1038/d41586-021-00460-x [DOI] [PubMed] [Google Scholar]
  • 50. Zenk L, Steiner G, Pina e Cunha M, Laubichler MD, Bertau M, Kainz MJ, et al. Fast Response to Superspreading: Uncertainty and Complexity in the Context of COVID-19. International Journal of Environmental Research and Public Health. 2020;17(21):7884. doi: 10.3390/ijerph17217884 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Kojaku S, Hébert-Dufresne L, Mones E, Lehmann S, Ahn YY. The Effectiveness of Backward Contact Tracing in Networks. Nature Physics. 2021;17(5):652–658. doi: 10.1038/s41567-021-01187-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Liu T, Gong D, Xiao J, Hu J, He G, Rong Z, et al. Cluster Infections Play Important Roles in the Rapid Evolution of COVID-19 Transmission: A Systematic Review. International Journal of Infectious Diseases. 2020;99:374–380. doi: 10.1016/j.ijid.2020.07.073 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Kröger M, Schlickeiser R. Analytical Solution of the SIR-Model for the Temporal Evolution of Epidemics. Part A: Time-Independent Reproduction Factor. Journal of Physics A: Mathematical and Theoretical. 2020;53(50):505601. doi: 10.1088/1751-8121/abc65d [DOI] [Google Scholar]
  • 54. Ma J. Estimating Epidemic Exponential Growth Rate and Basic Reproduction Number. Infectious Disease Modelling. 2020;5:129–141. doi: 10.1016/j.idm.2019.12.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Stehlé J, Voirin N, Barrat A, Cattuto C, Colizza V, Isella L, et al. Simulation of an SEIR Infectious Disease Model on the Dynamic Contact Network of Conference Attendees. BMC Medicine. 2011;9(1):87. doi: 10.1186/1741-7015-9-87 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Farrahi K, Emonet R, Cebrian M. Predicting a Community’s Flu Dynamics with Mobile Phone Data. In: Computer-Supported Cooperative Work and Social Computing. Vancouver, Canada; 2015. Available from: https://hal.archives-ouvertes.fr/hal-01146198.
  • 57. Cattuto C, den Broeck WV, Barrat A, Colizza V, Pinton JF, Vespignani A. Dynamics of Person-to-Person Interactions from Distributed RFID Sensor Networks. PLOS ONE. 2010;5(7):e11596. doi: 10.1371/journal.pone.0011596 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Starnini M, Baronchelli A, Pastor-Satorras R. Modeling Human Dynamics of Face-to-Face Interaction Networks. Physical Review Letters. 2013;110(16):168701. doi: 10.1103/PhysRevLett.110.168701 [DOI] [PubMed] [Google Scholar]
  • 59. Krylova O, Earn DJD. Effects of the Infectious Period Distribution on Predicted Transitions in Childhood Disease Dynamics. Journal of the Royal Society Interface. 2013;10(84). doi: 10.1098/rsif.2013.0098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Liu L, Vikram S, Lao J, Ben X, D’Amour A, O’Banion S, et al. Estimating the Changing Infection Rate of COVID-19 Using Bayesian Models of Mobility. Epidemiology; 2020. Available from: http://medrxiv.org/lookup/doi/10.1101/2020.08.06.20169664 [cited 2020-11-24]. [Google Scholar]
  • 61. Streftaris G, Gibson GJ. Non-Exponential Tolerance to Infection in Epidemic Systems—Modeling, Inference, and Assessment. Biostatistics. 2012;13(4):580–593. doi: 10.1093/biostatistics/kxs011 [DOI] [PubMed] [Google Scholar]
  • 62. Lipsitch M, Cohen T, Cooper B, Robins JM, Ma S, James L, et al. Transmission Dynamics and Control of Severe Acute Respiratory Syndrome. Science (New York, NY). 2003;300(5627):1966–1970. doi: 10.1126/science.1086616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Van Mieghem P, Liu Q. Explicit Non-Markovian Susceptible-Infected-Susceptible Mean-Field Epidemic Threshold for Weibull and Gamma Infections but Poisson Curings. Physical Review E. 2019;100(2):022317. doi: 10.1103/PhysRevE.100.022317 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64. Vergu E, Busson H, Ezanno P. Impact of the Infection Period Distribution on the Epidemic Spread in a Metapopulation Model. PLoS ONE. 2010;5(2). doi: 10.1371/journal.pone.0009371 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Gillespie DT. Exact Stochastic Simulation of Coupled Chemical Reactions. The Journal of Physical Chemistry. 1977;81(25):2340–2361. doi: 10.1021/j100540a008 [DOI] [Google Scholar]
  • 66. Vestergaard CL, Génois M. Temporal Gillespie Algorithm: Fast Simulation of Contagion Processes on Time-Varying Networks. PLOS Computational Biology. 2015;11(10):e1004579. doi: 10.1371/journal.pcbi.1004579 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Kajitani Y, Hatayama M. Explaining the Effective Reproduction Number of COVID-19 through Mobility and Enterprise Statistics: Evidence from the First Wave in Japan. Cold Spring Harbor Laboratory Press; 2020. Available from: https://www.medrxiv.org/content/10.1101/2020.10.08.20209643v2 [cited 2021-01-12]. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Wallinga J, Lipsitch M. How Generation Intervals Shape the Relationship between Growth Rates and Reproductive Numbers. Proceedings of the Royal Society B: Biological Sciences. 2007;274(1609):599–604. doi: 10.1098/rspb.2006.3754 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Cereda D, Tirani M, Rovida F, Demicheli V, Ajelli M, Poletti P, et al. The Early Phase of the COVID-19 Outbreak in Lombardy, Italy; 2020. Available from: http://arxiv.org/abs/2003.09320 [cited 2021-09-01]. [DOI] [PMC free article] [PubMed]
  • 70. Hellewell J, Abbott S, Gimma A, Bosse NI, Jarvis CI, Russell TW, et al. Feasibility of Controlling COVID-19 Outbreaks by Isolation of Cases and Contacts. The Lancet Global Health. 2020;8(4):e488–e496. doi: 10.1016/S2214-109X(20)30074-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Diekmann O, Heesterbeek JAP, Metz JAJ. On the Definition and the Computation of the Basic Reproduction Ratio R0 in Models for Infectious Diseases in Heterogeneous Populations. Journal of Mathematical Biology. 1990;28(4):365–382. doi: 10.1007/BF00178324 [DOI] [PubMed] [Google Scholar]
  • 72.Shah C, Dehmamy N, Perra N, Chinazzi M, Barabási AL, Vespignani A, et al. Finding Patient Zero: Learning Contagion Source with Graph Neural Networks; 2020. Available from: http://arxiv.org/abs/2006.11913 [cited 2021-01-13].
  • 73.GPAW. Global Pandemic App Watch (GPAW): COVID-19 Exposure Notification & Contact Tracing—CRAiEDL; 2020. Available from: https://craiedl.ca/gpaw/ [cited 2021-02-13].
  • 74. Anglemyer A, Moore TH, Parker L, Chambers T, Grady A, Chiu K, et al. Digital Contact Tracing Technologies in Epidemics: A Rapid Review. Cochrane Database of Systematic Reviews. 2020;1(8). 10.1002/14651858.CD013699 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Lewis D. Why Many Countries Failed at COVID Contact-Tracing—but Some Got It Right. Nature. 2020;588(7838):384–387. doi: 10.1038/d41586-020-03518-4 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Ivan Kryven

Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

20 May 2021

PONE-D-21-07031

Modelling digital and manual contact tracing for COVID-19. Are low uptakes and missed contacts deal-breakers?

PLOS ONE

Dear Dr. Rusu,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jul 04 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Ivan Kryven

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: N/A

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors present a model combining the spread of an epidemic with a mitigation strategy based on contact tracing and isolation. The model is a multi-site mean-field model combining equations for the evolution of compartment size and the use of an underlying network of contacts between agents. Exploring the range of parameters, they test several scenarios of tracing efficiency and their impact on outbreaks. They find that imperfect tracing can nonetheless have a sufficient impact to significantly lower the epidemic peak.

The topic is interesting and addresses the very actual challenge of quantifying the impact of realistic mitigating strategies on epidemic outbreaks. As such, the study is perfectly valid and presents interesting results that could inform health policies.

I have however some issues with the model and some claims the authors make:

- The compartmental model used for describing the epidemic process is much more complicated than the classic ones. It is understandable, as the authors aim at describing realistic outbreaks of SARS-COV-2, in particular with the use of the Ip (pre-contagious), Ia (asymptomatic) and Is (symptomatic) compartments. However, the separation between H (hospitalised), R (recovered) and D is not necessarily useful, as D will only be a fraction from H set by lambda_H-D, and all remaining agents in the H compartment end up in the R one.

- The mechanisms for generating the tracing networks are not described. How are the links chosen, when, and by which mechanism? Is the tracing connected to new infections or not? If not (as this is a mean-field approach), the model might have an intrinsic limitation due to the absence of correlation between the infection process and the network.

- On the same topic, at line 127 the authors say: "The tracing graphs are usually subsets of the first network." When is this not the case? From the model described in ref 21, I get that tracing might generate "false" contacts, but this is not said at any point in the manuscript.

- The authors validate most of their findings using Erdös-Rényi networks, claiming that "it tends to offer acceptable approximations most of the time" (l146). I must argue that it does not, and in fact the authors themselves then use a scale-free/high clustering model in section 3.4. Why not directly use this one, which is indeed more realistic? They also could use empirical datasets, which are easily found in the community.

- Another problem is the temporality of the spreading. Line 155, the authors state: "The time intervals between two state changes of the same kind are assumed to form an exponential distribution". This is very unrealistic. As shown in a wide range of empirical studies, human interactions exhibit bursty behaviour, with distributions of temporal properties having typical heavy-tails. It has been further shown that such properties condition strongly the spread of an epidemic (see for example Lambiotte et al EPJB 86 (2013) 320).

- I suggest the author define the measures they used to quantify the effect of tracing in the text. In particular, what is "peak suppression"? I would recommend a clearer measure, such as a normalised peak reduction (N_(no tracing) - N_tracing)/(N_(no tracing)). Same approach could be used for time of peak.

- l203 : The authors select "good" values for tau_r and tau_t, but it seems to me that there is a correlation between the threshold values of tau_r and tau_t. It would be interesting to see heatmaps combining both dependencies.

- I have a question about the results presented in Fig 5, top right quadrant. It seems that the larger the population, the smaller the maximum infected fraction of the population, with a limit going to 0 as N increases. This would indicate that the model does not generate large cascades of infections, while spreading processes —being analog to percolation— should always reach a non-zero fraction of the population when the infection parameter is above the epidemic threshold. If accurate, doesn't it indicate an intrinsic unrealistic property of the model? It could also be that the parameters chosen for the study in section 3.1 lead to a spreading under the epidemic threshold, but in that case the analysis of the variation induced by population size is irrelevant.

- The study would be much stronger if the model was explored and validated for a larger range of spreading parameters. Only two values for p_a and one for p_h are considered. I understand that the authors have used estimated values from the empirical studies, but since the model for interactions is not realistic, it might be that these values are not suitable to reproduce the desired spreading properties (see my previous comment).

- Similarly, only two values for K are considered.

- I understand that the model is complex and the parameter space is huge, but the current results merely rely on eye-balling "relevant" values from a minimal set of tries. Deriving quantitative, meaningful results from such an evaluation of "proper" values for the parameters seems too optimistic.

- Manual tracing and digital tracing seem to be a sort of redundant system, to increase the global tracing efficiency. Shouldn't the use of both be equivalent to having a single, higher value of either digital tracing or manual tracing?

- I would appreciate that the authors discuss two features from Fig 7:

1. How can contact tracing lead to situations in which the outbreak lasts longer?

2. What generates the bimodality of some curves?

Furthermore, I have some minor issues with the manuscript:

- I would advise for replacing ref 21 with "R. Huerta, L. Tsimring, Phys. Rev. E 66 (2002) 056 115". I am not an expert on multi-site mean-field models, and I found that the latter reference contains a much clearer description of the approach than the one given by the authors.

- l196 : please move the reference to Fig 7 at the relevant location ; the current one is a sentence about Fig 6.

Reviewer #2: The paper presents a novel (SEIR-type) compartmental model of epidemic spread that accounts for possible manual and/or digital contact tracing by describing the individuals’ state by a pair of variables, one that describes its epidemic state and a second that describes whether it is traced (and isolating) or not. The authors explore the properties of the model using simulations, and show on synthetic data that combined manual and digital contact tracing may be effective in curbing the simulated spread of SARS-COV2 even with either of the two tracing modalities is suboptimal.

The paper is generally technically sound and the writing is clear and understandable. Though, given the large number of different parameters and tests performed it is sometimes difficult for the reader to recall everything and follow the story. I recommend the manuscript for publication in PLOS ONE after the authors have addressed several, mostly technical, points and some optional suggestions that I think may help the reader when reading the paper.

The conclusions of the simulation study are weakened by the fact that they rely solely on simulated contact networks which lack many realistic features of real world contact patterns. I think the model in itself and the present study merit publication on their own, but it would be a big plus if the authors could motivate that the networks’ parameters are realistic and that the missing features do not change the conclusions, and/or perform simulations on empirical contact networks.

The model has a lot of different variables and parameters and it is difficult to keep track of them all. It would be very helpful to list all of them in a table, similar to Table 1. Namely: \\Gamma, K, r, t_rem. It might also helpful to list the different compartments (states) of the model.

In the same manner, it would make the manuscript easier to read if the names, or short definitions, were given for each of the parameters when their ranges considered are listed in each subsection of the Results, e.g., on lines 187 and 188 recall that \\Gamma is the degree of overlap, \\tau_t the contact tracing rate, etc.

Add values of all relevant parameters to figure captions. This will make the figures easier to read.

It is unintuitive to use S_i for general states and S for the susceptible state. I suggest the authors use another symbol to denote a general state, e.g. X_i.

The fact that the relative error is ~3% for networks of size N=1000 and ~1% for N=10000 is corresponds to a relative error of 1/\\sqrt{N}, consistent with central limit type arguments for a well mixed population. The result may not hold however in a structured population or near a critical point and thus cannot necessarily be extrapolated to general conditions.

The authors refer to some parameters of their model as “hyperparameters”. This term is generally reserved for parameters of an inference procedure, not a model, e.g., of the learning procedure in machine learning or of prior distributions in Bayesian inference.

The authors compare the uptake rate in the simulations to the digital tracing adoption rates reported for different countries. However, the adoption rates cited are typically calculated as numbers of download relative to the population, which is not the same as the fraction of population using the app correctly.

In Section 3.4, if I understood correctly, the overlap between edges in the digital contact tracing network and the true contact network is assumed to be perfect, while the uptake in the manual tracing network is assumed to be 100%. Both of these assumptions seem overly optimistic to me, e.g., the manual contact tracing system will saturate at a given prevalence as was seen in most western countries, while digital contacts are only proxies for real (possibly disease transmitting) contacts.

Please corroborate the claim (on lines 144-146) that the Erdos-Renyi graph model tends to offer acceptable approximations most of the time.

It seems like an error that symbols in Table 1 are in boldface.

The authors may want to discuss the implications of the crossover effects seen in Fig. 10, i.e., the final outbreak size can be larger for higher uptake.

Can the authors comment on how realistic the required values of \\Gamma and \\tau_t for combined tracing to be effective alone are? Have the values of these parameters been estimated empirically?

In Fig. 5 add description of what symbols and boxes and whiskers represent.

Line 74: Change “SIR” to “The SIR process”

Line 80: add abbreviation “(Inserm)”

line 121: “onto” → “on”

Lines 140-141: should “scales with” be replaced by “is proportional to”?

Line 205: Specify what is meant by “meaningful results”.

Line 212: “N” → “small N”.

Line 302: “encapsulated” → “model’s”

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Nov 18;16(11):e0259969. doi: 10.1371/journal.pone.0259969.r002

Author response to Decision Letter 0


6 Jul 2021

Replies to editor:

Following the editor’s comments, we amended the supporting information references to respect the format – i.e. S1 Fig instead of Fig S1. We also limited the use of italics to very important information only. Finally, we changed the formatting of the authors section to match the suggested styling.

Replies to reviewer #1:

1. The compartmental model used for describing the epidemic process is much more complicated than the classic ones. It is understandable, as the authors aim at describing realistic outbreaks of SARS-COV-2, in particular with the use of the Ip (pre-contagious), Ia (asymptomatic) and Is (symptomatic) compartments. However, the separation between H (hospitalised), R (recovered) and D is not necessarily useful, as D will only be a fraction from H set by lambda_H-D, and all remaining agents in the H compartment end up in the R one.

Our goal from the very beginning was to make simulations that could capture a broad range of effects stemmed from the viral spread, including the evolution of hospitalizations and deaths. We also wanted to show that, unlike previous works, our contact tracing approach can be used in conjunction with virtually any complex compartmental formulation (many of which have been proposed for COVID-19 given the inefficacy of SIR [1]). That being said, this modeling decision is in line with many different works which explicitly separated these states [2, 3], including the study we used as reference when designing our proposed model [4]. It is true that, given the setup described in our paper, D are but a fraction from H. However, having the capability of directly capturing those metrics from different compartments is not only convenient, but also more realistic and useful when visualizing the specific effects of different measures (running our code allows for visual explorations of which node gets infected/hospitalized/dead and when). Having separate states with different transition times could also be exploited in the dynamical/temporal sense to infer noteworthy trends, although such an analysis is beyond the scope of the current paper. Lastly, this setup allows for a node’s behavior within a simulation to change based on its state (for example, one may want to explore a scenario in which hospitalized/recovered people can still ping their contacts through a tracing application, whereas a deceased node would be unable to aid the tracing process).

[1] Moein S, Nickaeen N, Roointan A, et al. Inefficiency of SIR models in forecasting COVID-19 epidemic: a case study of Isfahan. Scientific Reports. 2021;11(1):4725. doi:10.1038/s41598-021-84055-6

[2] Balabdaoui F, Mohr D. Age-stratified discrete compartment model of the COVID-19 epidemic with application to Switzerland. Scientific Reports. 2020;10(1):21306. doi:10.1038/s41598-020-77420-4

[3] Leontitsis A, Senok A, Alsheikh-Ali A, Al Nasser Y, Loney T, Alshamsi A. SEAHIR: A Specialized Compartmental Model for COVID-19. Int J Environ Res Public Health. 2021;18(5). doi:10.3390/ijerph18052667

[4] Di Domenico L, Pullano G, Sabbatini CE, Boëlle P-Y, Colizza V. Impact of lockdown on COVID-19 epidemic in Île-de-France and possible exit strategies. BMC Medicine. 2020;18(1):240. doi:10.1186/s12916-020-01698-4

2. The mechanisms for generating the tracing networks are not described. How are the links chosen, when, and by which mechanism? Is the tracing connected to new infections or not? If not (as this is a mean-field approach), the model might have an intrinsic limitation due to the absence of correlation between the infection process and the network.

We thank the reviewer for pointing out the generation mechanism was not intuitively explained. We have revised Section 2.2 of the paper to address this point. We now describe in more detail how the tracing networks are generated, based on the 2 inter-linked parameters: “degree of overlap” and “uptake”. On one hand, the degree of overlap gets converted to Z_rem via the specified formula, quantity which represents the average number of edges that get randomly removed from the infection network to render a tracing network (this effectively means N * Z_rem / 2 edges get removed, and we have now made this explicit in the revision). On the other hand, the uptake dictates how many nodes should have all their edges removed (again at random). We average over many such tracing scenarios to reduce the potential noise induced by these random processes (hence why we mention in Section 3.1 that we simulate X different networks with Y random seeds). Consequently, the tracing networks are preset, so new infections do not technically influence their structure, but this should not pose a limitation because, in practice, we can assume this ‘static’ structure has been generated from aggregating all the potentially-observed dynamic links for any given node (i.e. a random process corresponding to tracers/apps monitoring the contacts neighborhood of any new detected infections), thus actually correlating the entire procedure to new infections. Given that presumption, the tracing propagation system becomes, in fact, directly influenced by any new detected infection / isolated node, since the latter contributes to the additive probability of encouraging its neighborhood to self-isolate (hence the multi-site nature of our mean-field approach), while the neighborhood itself gets partially observed from the aforementioned pseudo-dynamic process.

That being said, removing the random nature of this operation altogether, and thus actively tying new infections to the tracing view substructure, is indeed an interesting avenue worth exploring (possibly via targeting the testing and the neighborhood exploration processes with reinforcement learning techniques, like [1] - a potential future direction for us as indicated in the revised Section 4), yet this may come at the cost of adding more bias: Currently, we assume contact tracers have a limited time to conduct their research and they do not favor any node when looking for contacts, whereas each individual’s memory constitutes an unpredictable, seemingly-random factor to model. Hence, the simplifying presumption that contacts tend to be missed at random fits these assumptions and should be less biased. This very presumption enables us to view the precomputed static subnetwork as a dynamic aggregate of randomly-removed edges.

[1] Meirom EA, Maron H, Mannor S, Chechik G. How to Stop Epidemics: Controlling Graph Dynamics with Reinforcement Learning and Graph Neural Networks. arXiv:201005313 [cs]. Published online October 26, 2020. Accessed May 15, 2021. http://arxiv.org/abs/2010.05313

3. On the same topic, at line 127 the authors say: "The tracing graphs are usually subsets of the first network." When is this not the case? From the model described in ref 21, I get that tracing might generate "false" contacts, but this is not said at any point in the manuscript.

Thank you for pointing out this inconsistency. It is true that the referenced paper studies such a scenario in more detail. Our model is also capable of simulating “false” contacts, but we have decided not to study such a situation further in this paper since we deemed it less likely to happen in our global pandemic scenario, where contact tracers are generally very well trained and the public is much more aware of who they meet and for how long. We have modified Section 2.2 to better reflect this.

4. The authors validate most of their findings using Erdös-Rényi networks, claiming that "it tends to offer acceptable approximations most of the time" (l146). I must argue that it does not, and in fact the authors themselves then use a scale-free/high clustering model in section 3.4. Why not directly use this one, which is indeed more realistic? They also could use empirical datasets, which are easily found in the community.

We use ER networks as our baseline due to their wide adoption in the epidemic literature (including the studies we used as reference points for the spreading model [1, 2]), as well as their well-behaved epidemic dynamics in the limit, caused by the presence of epidemic thresholds (from [1], the study which introduced the SIRT model we extended, “the epidemic dynamics in SW networks remains qualitatively similar to random graphs, since they possess a well-defined epidemic threshold”). In addition to this, both real networks and ER graphs (under the studied K values) generally exhibit a single giant component, sparse densities, small average shortest paths [3,4]. Even though ER networks make a few unrealistic assumptions about the degree distribution (nodes with too many or too few connections are heavily restricted) and the independence of interactions in closed communities (where some inherent dependence certainly exists), they are still a good tool for modelling infections in randomly mixed populations [5] (e.g. in stores, public transportation etc. [6]). The first few experiments we conducted on the basis of ERs played a pivotal role in exemplifying the overall dynamics induced by different variables in our system, which we preferred to observe (for ourselves) and evoke (to the readers) in the most generalized framework, just as the predecessor papers did in their initial simulations. Indeed, the reviewer was kind to notice that we made a note of the unsuitability such a model may exhibit for social network tasks, hence why we focused the main experiments in the study on a more realistic network model, for which we actually derived our final conclusions and plotted the most representative figures on the efficacy of various contact tracing strategies w.r.t. the peak suppression and the reproduction number. We agree with the reviewer that our original statement about ERs was overly optimistic, and hence we have modified the section accordingly. We have also added Section 3.5 to the paper which details experiments we ran over a real dynamic social network.

[1] Tsimring LS, Huerta R. Modeling of contact tracing in social networks. Physica A: Statistical Mechanics and its Applications. 2003;325(1):33-39. doi:10.1016/S0378-4371(03)00180-8

[2] Farrahi K, Emonet R, Cebrian M. Epidemic Contact Tracing via Communication Traces. Lambiotte R, ed. PLoS ONE. 2014;9(5):e95133. doi:10.1371/journal.pone.0095133

[3] D. Du, “Social Network Analysis: Lecture 3-Network Characteristics,” Sep. 28, 2016. [Online]. Available: http://www2.unb.ca/~ddu/6634/Lecture_notes/Lec3_network_statistics_handout.pdf

[4] A. L. Barabási, Network Science Random Networks. 2015. [Online]. Available: https://barabasi.com/f/624.pdf

[5] M. J. Keeling and K. T. D. Eames, “Networks and epidemic models,” Journal of The Royal Society Interface, vol. 2, no. 4, pp. 295–307, Sep. 2005, doi: 10.1098/rsif.2005.0051.

[6] M. Abueg et al., “Modeling the combined effect of digital exposure notification and non-pharmaceutical interventions on the COVID-19 epidemic in Washington state,” in medRxiv, Sep. 2020, p. 2020.08.29.20184135. doi: 10.1101/2020.08.29.20184135.

5. Another problem is the temporality of the spreading. Line 155, the authors state: "The time intervals between two state changes of the same kind are assumed to form an exponential distribution". This is very unrealistic. As shown in a wide range of empirical studies, human interactions exhibit bursty behaviour, with distributions of temporal properties having typical heavy-tails. It has been further shown that such properties condition strongly the spread of an epidemic (see for example Lambiotte et al EPJB 86 (2013) 320).

We thank the reviewer for opening up this important discussion on the temporality involved in our simulations, which has prompted us to briefly explain our rationale in Section 2.3 of the revision. Our choice here was motivated by the exponential’s wide adoption in previous epidemiological works relying on SIR [1, 2, 4], as well as evidence of exponentially-distributed numbers of interactions over time observed in 2 reference studies for our work, which analyzed human behavior using tracking devices [2, 3]. We agree with the reviewer that our assumption can be too restrictive, especially surrounding infection times, and more flexible gamma (e.g. Erlang) or Weibull distributions represented via temporal sub-compartments could provide more realistic measurements. However, standard exponentials have been shown to offer acceptable approximations in several cases (e.g. when the mean infection duration is smaller [5], or the mean-generation time is correctly estimated [6]). As to other time intervals involved in the model (e.g. the infectious period), an older study showed that major epidemic persistence and dynamic problems could be introduced in SIR/SEIR simulations when using more realistic gamma distributions, amounting to worse results overall [7]. That is not to say using such distributions should be discouraged in SIR, but a very thoughtful consideration should be put into making this temporality choice nonetheless, carefully weighing the benefits against the potential limitations. We excluded this analysis for the purpose of the current paper, but it is definitely an interesting avenue worth exploring in subsequent studies.

[1] J. Ma, “Estimating epidemic exponential growth rate and basic reproduction number,” Infectious Disease Modelling, vol. 5, pp. 129–141, Jan. 2020, doi: 10.1016/j.idm.2019.12.009.

[2] K. Farrahi, R. Emonet, and M. Cebrian, “Predicting a Community’s Flu Dynamics with Mobile Phone Data,” Vancouver, Canada, Mar. 2015. doi: 10.1145/2675133.2675237.

[3] J. Stehlé et al., “Simulation of an SEIR infectious disease model on the dynamic contact network of conference attendees,” BMC Medicine, vol. 9, no. 1, p. 87, Jul. 2011, doi: 10.1186/1741-7015-9-87.

[4] M. Kröger and R. Schlickeiser, ‘Analytical solution of the SIR-model for the temporal evolution of epidemics. Part A: time-independent reproduction factor’, J. Phys. A: Math. Theor., vol. 53, no. 50, p. 505601, Nov. 2020, doi: 10.1088/1751-8121/abc65d.

[5] E. Vergu, H. Busson, and P. Ezanno, “Impact of the Infection Period Distribution on the Epidemic Spread in a Metapopulation Model,” PLoS One, vol. 5, no. 2, Feb. 2010, doi: 10.1371/journal.pone.0009371.

[6] O. Krylova and D. J. D. Earn, “Effects of the infectious period distribution on predicted transitions in childhood disease dynamics,” J R Soc Interface, vol. 10, no. 84, Jul. 2013, doi: 10.1098/rsif.2013.0098.

[7] A. L. Lloyd, ‘Realistic Distributions of Infectious Periods in Epidemic Models: Changing Patterns of Persistence and Dynamics’, Theoretical Population Biology, vol. 60, no. 1, pp. 59–71, Aug. 2001, doi: 10.1006/tpbi.2001.1525.

6. I suggest the author define the measures they used to quantify the effect of tracing in the text. In particular, what is "peak suppression"? I would recommend a clearer measure, such as a normalised peak reduction (N_(no tracing) - N_tracing)/(N_(no tracing)). Same approach could be used for time of peak.

Thank you for this insightful comment. We have addressed this point by adding a clear mathematical explanation of the peak suppression in the newly added Section 2.4. We would also like to mention that the original purpose of our peak suppression plots was to emphasize the benefit of contact tracing in absolute terms (scrutinized in direct reference to the total population size, which remains fixed at either 1000 or 10000 throughout the study, making it easy for the reader to visualize the full effect). While we agree with the reviewer that normalised charts would offer a valuable alternative view of our simulation data, an unnormalized reporting directly referencing the achieved peak suppression better fits with our initial plan.

7. l203 : The authors select "good" values for tau_r and tau_t, but it seems to me that there is a correlation between the threshold values of tau_r and tau_t. It would be interesting to see heatmaps combining both dependencies.

Thank you very much for this suggestion. We have now added heatmaps of the 2 parameters for different degrees of overlap to the revised Section 3.4.

8. I have a question about the results presented in Fig 5, top right quadrant. It seems that the larger the population, the smaller the maximum infected fraction of the population, with a limit going to 0 as N increases. This would indicate that the model does not generate large cascades of infections, while spreading processes —being analog to percolation— should always reach a non-zero fraction of the population when the infection parameter is above the epidemic threshold. If accurate, doesn't it indicate an intrinsic unrealistic property of the model? It could also be that the parameters chosen for the study in section 3.1 lead to a spreading under the epidemic threshold, but in that case the analysis of the variation induced by population size is irrelevant.

As illustrated by the unnormalized values in the left quadrant, the average of the peak of infection is monotonic on the studied range, and hence, for larger populations, the disease did manage to percolate at an increased rate (in absolute terms) as expected, making the variation analysis hold (at least on this very range). In spite of this, as indicated in the right quadrant and correctly pointed out by the reviewer, the peak clearly decreases in relative importance w.r.t. the total population (and the above-named monotony is not enough to ensure a non-zero relative average in the limit). Even so, the purpose of this experiment was not to study the dynamics in the limit, but rather to check if the variability across runs scales with the number of nodes (as initially suspected), and to settle on a sensible range of values for the population size (which would not be plagued by large variances, at least relative to that size). Indeed, as the reviewer correctly intuited, the limiting behavior of the percolation process is the result of analyzing a scenario in which both tau_t and tau_r are marginally larger than the infection rate beta (0.1 against 0.0791). Should the uptake have been 100% and considering that both the infection and contact tracing “spread” in a similar fashion, the simulation would have been deterministically contained (and below the epidemic threshold with a greater probability). However, given that the uptake in this experiment is set to 50%, this is now not always the case, as some instances do get quickly contained, while others continue to percolate for longer periods, depending on the network subsetting seed. It is also worth noting here that Section 3.1 reports on the variance of peaks, rather than overall epidemic sizes. While tracing can indeed be effective at “flattening the curve”, this does not guarantee the final size actually follows the same trend (or has the same decaying shape in the limit).

On a side note, our choice of studying \\tau_t=\\tau_r=0.1 here stemmed from empirical observations of this particular assignment resulting in an “average” efficacy w.r.t. preventing full-blown outbreaks, across many different parameter configurations (possibly due to the aforementioned marginal difference compared to the transmission rate). As reported in the paper, this assignment belongs to the higher-mid range of our initial setups, but also lies within the lower-mid range in the main experiments.

9. The study would be much stronger if the model was explored and validated for a larger range of spreading parameters. Only two values for p_a and one for p_h are considered. I understand that the authors have used estimated values from the empirical studies, but since the model for interactions is not realistic, it might be that these values are not suitable to reproduce the desired spreading properties (see my previous comment). Similarly, only two values for K are considered. I understand that the model is complex and the parameter space is huge, but the current results merely rely on eye-balling "relevant" values from a minimal set of tries. Deriving quantitative, meaningful results from such an evaluation of "proper" values for the parameters seems too optimistic.

As the reviewer kindly noted, the parameter space is indeed very large, and it is very difficult to find the most realistic values here. We concentrated our experiments on the most agreed-upon values, which were previously derived by trusted institutions and used to inform real governmental policies (by the CDC [1], by the The French National Institute of Health and Medical Research [2]). Our compartmental model was deliberately designed to be “in line” with [2], and therefore we were very composed with parameter changes (in fact, modifications were conducted only based on estimates from other trusted sources). This makes our results meaningful for the scenario in question. That being said, we consider our main contribution to be the modelling technique itself (aspect reflected in our decision to submit to this particular journal), and hence the range of the parameters was constrained to the most sensible values for presentation. A wider range of tau_r and tau_t values was, however, studied in our experiments, some significantly below and above the epidemic threshold, with the final ranges reported in the final piece being indicative of all the observed trends. We agree with the reviewer that a more thorough analysis of the parameters should be carried away in future work, especially in relation with K, and this has been marked accordingly in Section 4 (Conclusions and future work).

[1] M. A. Johansson et al., “SARS-CoV-2 Transmission From People Without COVID-19 Symptoms,” JAMA Netw Open, vol. 4, no. 1, p. e2035057, Jan. 2021, doi: 10.1001/jamanetworkopen.2020.35057.

[2] L. Di Domenico, G. Pullano, C. E. Sabbatini, P.-Y. Boëlle, and V. Colizza, “Impact of lockdown on COVID-19 epidemic in Île-de-France and possible exit strategies,” BMC Medicine, vol. 18, no. 1, p. 240, Jul. 2020, doi: 10.1186/s12916-020-01698-4.

10. Manual tracing and digital tracing seem to be a sort of redundant system, to increase the global tracing efficiency. Shouldn't the use of both be equivalent to having a single, higher value of either digital tracing or manual tracing?

The two tracing systems operate on different assumptions (please refer back to Section 3.4). Namely, both conduct tracing over different networks (a sensible assumption since not all interviewees will have a COVID tracing application to notify neighbors, and there may be discrepancies between what each service “sees”). In manual tracing, the overlap controls missed contacts, while in digital tracing both an overlap (inadequate use) and an uptake rate can influence the topology (the latter does not make sense in the manual tracing case). Finally, there is an inherent delay of tracing a contact in the interview-based approach (controlled by differences in \\tau_t rates; assumed to be equal to two days in our study, but with direct support in the code for varying that delay).

11. I would appreciate that the authors discuss two features from Fig 7:

Thank you for the questions below. First, we would like to note that we have changed Fig 7 in the revision to match the given caption, since in the initial submission we have imported by mistake an older version of that figure (with less scenarios depicted). In the new figure, the 4 quadrants visible in the originally-submitted picture are all placed on the right column. We have also removed the case with overlap=0 since this theoretically corresponds to the same scenario as \\tau_t=0. That being said, the questions below still hold since this figure update is based on the same data and scenarios.

11.1. How can contact tracing lead to situations in which the outbreak lasts longer?

First of all, we must distinguish all cases from the first row, which we regard as giving mostly noisy results of the contact tracing process due to the ineffective value of \\tau_t. In some of the more-meaningful configurations, very small values of the overlap (<.33) still manage to fall within the “noise region” because \\tau_t is not large enough to make their effect distinguishable from the noise induced by the inherent stochasticity of the simulations. Such a poor representation of the infection network can also lead to situations in which only incorrect regions of the graph get isolated (i.e. many false positives), which in turn can cause the epidemics to gain momentum at later stages (e.g. an obvious example is overlap=0.11 in the third quadrant). A brief explanation has been added to the revision of Section 3.2.

11.2. What generates the bimodality of some curves?

The effect of multiple peaks has also been noted for the SIR formulation in [1]: “with larger tracing effort, the epidemic is reduced significantly rapidly, leaving a great deal of the population susceptible for a second peak of infections[…] This is due to the contact tracing becoming so effective that the number of cases drops rapidly, resulting in tracing becoming less effective. Note, this effect is much more attenuated in the dual network case” (the “dual network case” refers to a scenario with a single contact tracing network in which the overlap is very small: 0.08). Indeed, in our case this bimodality is a rarer phenomenon which only affects a few of the simulations that exhibit large values of the overlap.

[1] Farrahi K, Emonet R, Cebrian M. Epidemic Contact Tracing via Communication Traces. Lambiotte R, ed. PLoS ONE. 2014;9(5):e95133. doi:10.1371/journal.pone.0095133

12. Furthermore, I have some minor issues with the manuscript:

- I would advise for replacing ref 21 with "R. Huerta, L. Tsimring, Phys. Rev. E 66 (2002) 056 115". I am not an expert on multi-site mean-field models, and I found that the latter reference contains a much clearer description of the approach than the one given by the authors.

- l196 : please move the reference to Fig 7 at the relevant location ; the current one is a sentence about Fig 6

We thank the reviewer for spotting these minor issues. Indeed, the initial paper by Huerta and Tsimring includes more details on multi-site mean-field models, but the latter features more experiments and contains further conclusions derived from them. This is why we decided to keep both references in the revision. As to Fig 7, as mentioned above, the caption was in fact correct but we imported an older version of that figure onto the system. This has now been amended.

Replies to reviewer #2:

1. The conclusions of the simulation study are weakened by the fact that they rely solely on simulated contact networks which lack many realistic features of real world contact patterns. I think the model in itself and the present study merit publication on their own, but it would be a big plus if the authors could motivate that the networks’ parameters are realistic and that the missing features do not change the conclusions, and/or perform simulations on empirical contact networks.

We thank the reviewer for this important suggestion, and we agree that the study would benefit from showcasing the impact of contact tracing on real social networks. To that end, we have added Section 3.5 (with background explanations added to Section 2.3) where we explore a real data-driven scenario.

2. The model has a lot of different variables and parameters and it is difficult to keep track of them all. It would be very helpful to list all of them in a table, similar to Table 1.

Namely: \\Gamma, K, r, t_rem. It might also be helpful to list the different compartments (states) of the model.

Thank you for this suggestion. A table detailing the network parameters used in our simulations has been added to the revision. A diagram of the compartments together with the available transitions has been supplied in Fig 1, with the caption detailing their meaning.

3. In the same manner, it would make the manuscript easier to read if the names, or short definitions, were given for each of the parameters when their ranges considered are listed in each subsection of the Results, e.g., on lines 187 and 188 recall that \\Gamma is the degree of overlap, \\tau_t the contact tracing rate, etc.

Thank you for this thoughtful suggestion. The names of the different parameters have now been made apparent at the beginning of each section.

4. Add values of all relevant parameters to figure captions. This will make the figures easier to read.

We thank the reviewer for this suggestion. The values of the relevant parameters have now been added to the figure captions in the revised study.

5. It is unintuitive to use S_i for general states and S for the susceptible state. I suggest the authors use another symbol to denote a general state, e.g. X_i.

We have amended the table according to this suggestion, using X to refer to different states.

6. The fact that the relative error is ~3% for networks of size N=1000 and ~1% for N=10000 is corresponds to a relative error of 1/\\sqrt{N}, consistent with central limit type arguments for a well mixed population. The result may not hold however in a structured population or near a critical point and thus cannot necessarily be extrapolated to general conditions.

We thank the reviewer for this insightful comment. We have now made a note of the limitations our variance analysis has.

7. The authors refer to some parameters of their model as “hyperparameters”. This term is generally reserved for parameters of an inference procedure, not a model, e.g., of the learning procedure in machine learning or of prior distributions in Bayesian inference.

The term hyperparameters was chosen to distinguish network parameters from compartmental model parameters. We have now modified the nomenclature accordingly to avoid any confusion.

8. The authors compare the uptake rate in the simulations to the digital tracing adoption rates reported for different countries. However, the adoption rates cited are typically calculated as numbers of download relative to the population, which is not the same as the fraction of population using the app correctly.

The adoption rates cited are estimated from application downloads, which can actually represent significant overestimations of the true uptake (this is now apparent in Section 3.4 of the revised text). As to modelling the fraction of the population using the app correctly per se, our simulations should technically rely on the degree of overlap \\Gamma instead of the uptake r. The former, however, was kept to 1 in the digital tracing network, as was correctly observed by the reviewer in the next question.

8. In Section 3.4, if I understood correctly, the overlap between edges in the digital contact tracing network and the true contact network is assumed to be perfect, while the uptake in the manual tracing network is assumed to be 100%. Both of these assumptions seem overly optimistic to me, e.g., the manual contact tracing system will saturate at a given prevalence as was seen in most western countries, while digital contacts are only proxies for real (possibly disease transmitting) contacts.

Indeed, we are only varying the overlap for manual tracing and the uptake for digital tracing, but the two parameters are interlinked within a single network (varying one can directly impact the other). That being said, the “uptake” in manual tracing does not have a clear real-life correspondent, unless we are referring to memory-impaired individuals who are unable to identify any of their contacts (but for the purpose of our study we assumed such special circumstances can be covered by the randomness of link removal attributed to diversifying the degree of overlap). In contrast, varying the “overlap” in digital tracing corresponds to inadequate usage patterns of the application, and was made entirely possible in our codebase. However, showcasing the effects of sampling the entire parameter grid becomes combinatorically harder, so we decided to illustrate only a targeted subset of the scenarios our model is actually capable of running.

8. Please corroborate the claim (on lines 144-146) that the Erdos-Renyi graph model tends to offer acceptable approximations most of the time.

From [1], the study which introduced the SIRT model we extended, “the epidemic dynamics in SW networks remains qualitatively similar to random graphs, since they possess a well-defined epidemic threshold”. In addition to this, both real networks and ER graphs (under the studied K values) generally exhibit a single giant component, sparse densities, small average shortest paths [3,4]. Even though ER networks make a few unrealistic assumptions about the degree distribution (nodes with too many or too few connections are heavily restricted) and the independence of interactions in closed communities (where some inherent dependence must actually exist), they are still a good tool for modelling infections in randomly mixed populations [5] (e.g. in stores, public transportation etc. [6]). That being said, we do agree our original statement is overly optimistic, and therefore we amended our formulation in the revision.

[1] Tsimring LS, Huerta R. Modeling of contact tracing in social networks. Physica A: Statistical Mechanics and its Applications. 2003;325(1):33-39. doi:10.1016/S0378-4371(03)00180-8

[2] Farrahi K, Emonet R, Cebrian M. Epidemic Contact Tracing via Communication Traces. Lambiotte R, ed. PLoS ONE. 2014;9(5):e95133. doi:10.1371/journal.pone.0095133

[3] D. Du, “Social Network Analysis: Lecture 3-Network Characteristics,” Sep. 28, 2016. [Online]. Available: http://www2.unb.ca/~ddu/6634/Lecture_notes/Lec3_network_statistics_handout.pdf

[4] A.-L. Barabási, Network Science Random Networks. 2015. [Online]. Available: https://barabasi.com/f/624.pdf

[5] M. J. Keeling and K. T. D. Eames, “Networks and epidemic models,” Journal of The Royal Society Interface, vol. 2, no. 4, pp. 295–307, Sep. 2005, doi: 10.1098/rsif.2005.0051.

[6] M. Abueg et al., “Modeling the combined effect of digital exposure notification and non-pharmaceutical interventions on the COVID-19 epidemic in Washington state,” in medRxiv, Sep. 2020, p. 2020.08.29.20184135. doi: 10.1101/2020.08.29.20184135.

8. It seems like an error that symbols in Table 1 are in boldface.

This was done on purpose to highlight the afferent symbols, but in the new version of the table we removed this styling.

9. The authors may want to discuss the implications of the crossover effects seen in Fig. 10, i.e., the final outbreak size can be larger for higher uptake.

A discussion on the rare “crossover effects” has been added to the revised manuscript in Section 3.3.

10. Can the authors comment on how realistic the required values of \\Gamma and \\tau_t for combined tracing to be effective alone are? Have the values of these parameters been estimated empirically?

The parameter values we indicated as being “effective” have been estimated empirically through inspecting the simulation results. Unfortunately, it is a rather challenging task to find real-world estimates of either of these two parameters. This is why we conduct an extensive study of the trends emerged from running simulations over a large range of values. The UK government SAGE has recommended that at least 80% of close contacts be reached for the system to be deemed effective – overlap=0.8: https://www.health.org.uk/news-and-comment/charts-and-infographics/nhs-test-and-trace-performance-tracker. This percentage is highly influenced, however, by the recalled contacts (only 74% provided at least one close contact in the last month) and the possibility to reach those contacts (84% identified and asked to isolate in the last month). This would bring the actual overlap to a value significantly lower than 0.8 (the quoted value of 0.5 being reasonable to assume as possible to achieve).

11. In Fig. 5 add description of what symbols and boxes and whiskers represent.

Line 74: Change “SIR” to “The SIR process”

Line 80: add abbreviation “(Inserm)”

Line 121: “onto” → “on”

Lines 140-141: should “scales with” be replaced by “is proportional to”?

Line 205: Specify what is meant by “meaningful results”.

Line 212: “N” → “small N”.

Line 302: “encapsulated” → “model’s”

Thank you for these helpful suggestions. We have accommodated these in the revision.

Attachment

Submitted filename: Response to Reviewers.pdf

Decision Letter 1

Ivan Kryven

20 Sep 2021

PONE-D-21-07031R1Modelling digital and manual contact tracing for COVID-19. Are low uptakes and missed contacts deal-breakers?PLOS ONE

Dear Dr. Rusu,

Thank you for submitting your manuscript to PLOS ONE.  We invite you to submit a revised version one more time.

Immediate acceptance is possible afterwards. Please focus on answering Second Reviewer's question about the waiting time distribution and implement their minor comments.

Please submit your revised manuscript by Nov 04 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Ivan Kryven

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: N/A

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #2: I apologize for the delay in submitting my report.

The authors have addressed all my previous criticism in their revision. I am happy to recommend the publication of their paper in PLOS ONE.

The authors have added a discussion of the modeling assumption that state transitions take place with constant rates (exponentially distributed waiting times). Which is great since this common assumption is often not satisfied in empirical data. However, they discuss only the case where waiting time distributions are less skewed than exponentials. Though they may also be more skewed, notably due to intercontact times and edge weights following heavy-tailed distributions as is often the case in physical proximity networks often (see e.g. Starnini et al. “Modeling human dynamics of face-to-face interaction networks” PRL 2013).

One has to read Eqs. 1 and 2 to understand how $\\Gamma$ and $r$ are defined, while the definitions of $Z_{rem}$ and $N_{utn}$ are clear from the text. I suggest moving the first part of Eqs. 1 and 2 (before the $\\Rightarrow$ sign) up to where $\\Gamma$ and $r$ are introduced. The second parts of Eqs. 1 and 2 may be removed as they are obtained by a simple arithmetic inversion of the first parts, or the authors may keep them at their current place.

Change: ‘contacts network’ to ‘contact network’ on page 6 (no line numbering) and on page 7, line 137.

Change: ‘newly isolated’ to ‘isolated’ on page 6.

Change: ‘testing regimes’ to ‘testing regime’ in Fig. 7 caption.

Change: ‘$tau_r$’ to ‘$\\tau_r$’ in line 252.

Change ‘First aspect’ to ‘The first aspect’ in line 304.

Change: ‘experiments’ to ‘simulations’ in line 371.

Reviewer #3: The manuscript presents a compartmental model that can explicitly capture the manual/digital contact tracing and study contact tracing strategies in various situations. Overall, I believe that the work mostly satisfies the publication criteria of PLOS ONE, providing a solid study of the proposed model.

However, I still have two comments. First, as other reviewers mentioned, the network structure and temporal dynamics are somewhat overlooked in the paper. Although I would not argue that the authors should perform extra simulations, I think it is important to acknowledge it more thoroughly in the discussion. It has been recognized that super-spreading is a rather universal characteristic of many epidemics [1] and COVID-19 is argued to be driven primarily by such super-spreading events. Furthermore, recent studies (e.g., [2]) have shown that when spreading is driven by such super-spreading events, the details of contact tracing implementation may matter a lot. In this context, I believe that the paper should expand the discussion to provide a better context to the readers.

Second, I think the plots can be improved a lot by carefully choosing colors and by limiting the number of lines/objects that each figure shows. Many figures have numerous (~10) lines with random colors associated with each line, making them very difficult to parse. I believe that most of the figures will not lose much information by reducing the number of lines to ~5. In addition, as each of these lines show a range of parameter values (i.e., they can be ordered), a linear colormap (e.g., sampling colors across the "viridis" colormap) would make them much easier to read. Also, the heatmap figure uses green-to-red colormap, which can be understood by a significant fraction of population who has colorblindness. Furthermore, the colormap used introduces an arbitrary cut-off point (between 750 and 1000) that introduces an artifact. Again, I believe that a linear, perceptually uniform colormap should be used here. Although this is probably not "critical" regarding PLOS ONE's publication criteria, I believe that this simple improvement in the figures will make the paper much more accessible.

[1] Lloyd-Smith, J. O., Schreiber, S. J., Kopp, P. E. & Getz, W. M. Superspreading and the effect of individual variation on disease emergence. Nature 438, 355–359 (2005).

[2] Kojaku, S., Hébert-Dufresne, L., Mones, E., Lehmann, S., & Ahn, Y. Y. (2021). The effectiveness of backward contact tracing in networks. Nature Physics, 17(5), 652-658.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Nov 18;16(11):e0259969. doi: 10.1371/journal.pone.0259969.r004

Author response to Decision Letter 1


26 Oct 2021

Replies to editor

1. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

We reverified our References list to ensure no retracted papers got cited. At the same time, we added a few more references to support our discussion on waiting times and network structures, in light of the suggestions made by the reviewers. In addition, we supplemented the information provided for references 5, 6, 10, 16 (previously 15), 17 (previously 16) with links to the corresponding cited web resources. Finally, we modified references 18 (previously 17) and 45 (previously 65) to reflect these papers’ recent publication.

2. While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

All our figures have been verified using the PACE system, and no issues were reported, except for Fig8.eps in which the tool detected bitmap images with resolution < 300 DPI. We have inspected the apparently-problematic figure but found no issues with its rendering. Please also note that most of our figures were regenerated using a different colormap in light of the comments made by Reviewer #3, and adequate margins have now been added.

Replies to Reviewer #2

1. The authors have added a discussion of the modeling assumption that state transitions take place with constant rates (exponentially distributed waiting times). Which is great since this common assumption is often not satisfied in empirical data. However, they discuss only the case where waiting time distributions are less skewed than exponentials. Though they may also be more skewed, notably due to intercontact times and edge weights following heavy-tailed distributions as is often the case in physical proximity networks often (see e.g. Starnini et al. “Modeling human dynamics of face-to-face interaction networks” PRL 2013).

We thank the reviewer for pointing out that intercontact times may often be gracefully modelled via heavier-tailed distributions, such as power laws. We have now added a comment on this aspect in the manuscript. The decision we took for our contact tracing model was based on the findings of 2 cohort studies that reported exponential decays in the number of interaction events over time, while also having in mind the fact that this is a very common assumption in the epidemiological literature. The work of Starnini et al. [1] found power laws to be excellent approximators for the contact duration, but a power law fit to waiting times was found to be moderately less optimal, as the authors remark themselves. We would also like to emphasize that the figures presented in the aforementioned study utilize a log-log scale, which may sometimes hide exponential trends. In fact, the authors of this paper do not provide the parameters for the fitted power law to the intercontact times (although they do so for contact duration), so it is harder to judge the extent of the inferred exponent. At the same time, the fitted curve exhibits a concave curvature in the log-log plot, which may actually correspond to a roughly-exponential trend if translated to a linear scale. Sadly, no comparison with an exponential fit is provided in this text either. We acknowledge, however, that some interaction datasets do feature more skewed waiting times distributions than Gamma, and hence we believe that exploring the impact of utilizing longer-tailed distributions is an interesting avenue worth exploring in future endeavors.

[1] Starnini et al. “Modeling human dynamics of face-to-face interaction networks” PRL 2013

2. One has to read Eqs. 1 and 2 to understand how $\\Gamma$ and $r$ are defined, while the definitions of $Z_{rem}$ and $N_{utn}$ are clear from the text. I suggest moving the first part of Eqs. 1 and 2 (before the $\\Rightarrow$ sign) up to where $\\Gamma$ and $r$ are introduced. The second parts of Eqs. 1 and 2 may be removed as they are obtained by a simple arithmetic inversion of the first parts, or the authors may keep them at their current place.

Thank you for suggesting this easier-to-comprehend rephrasing and reordering of Section 2.2 and its equations. The manuscript has been amended to include in-line equations for all the network parameters involved, while full equations for $N_{utn}$ and $N_{ute}$ have been provided below.

3. Change: ‘contacts network’ to ‘contact network’ on page 6 (no line numbering) and on page 7, line 137. Change: ‘newly isolated’ to ‘isolated’ on page 6.

Change: ‘testing regimes’ to ‘testing regime’ in Fig. 7 caption.

Change: ‘$tau_r$’ to ‘$\\tau_r$’ in line 252.

Change ‘First aspect’ to ‘The first aspect’ in line 304.

Change: ‘experiments’ to ‘simulations’ in line 371.

Thank you for signaling these difficult-to-spot phrasing issues and mistakes. These suggestions have now been accommodated in the final manuscript.

Replies to Reviewer #3

1. First, as other reviewers mentioned, the network structure and temporal dynamics are somewhat overlooked in the paper. Although I would not argue that the authors should perform extra simulations, I think it is important to acknowledge it more thoroughly in the discussion. It has been recognized that super-spreading is a rather universal characteristic of many epidemics [1] and COVID-19 is argued to be driven primarily by such super-spreading events. Furthermore, recent studies (e.g., [2]) have shown that when spreading is driven by such super-spreading events, the details of contact tracing implementation may matter a lot. In this context, I believe that the paper should expand the discussion to provide a better context to the readers.

[1] Lloyd-Smith, J. O., Schreiber, S. J., Kopp, P. E. & Getz, W. M. Superspreading and the effect of individual variation on disease emergence. Nature 438, 355–359 (2005).

[2] Kojaku, S., Hébert-Dufresne, L., Mones, E., Lehmann, S., & Ahn, Y. Y. (2021). The effectiveness of backward contact tracing in networks. Nature Physics, 17(5), 652-658.

We thank the reviewer for pointing out that our discussion on superspreading, a very important mechanism underpinning the spread of COVID-19, was minimal in the previous submission. As such, we have now included a larger paragraph on superspreading, preferential attachment and small-world networks (see Section 2.3). In the same section, we have also commented on why we believe our contact tracing modelling technique remains suitable in the face of challenges like superspreading in scale-free networks or large clustering coefficients in small-world graphs, taking into consideration the findings of Kojaku et al. [1] and Tsimring and Huerta [2], respectively, as well as the similarities between our modelling technique and theirs.

[1] Kojaku, S., Hébert-Dufresne, L., Mones, E., Lehmann, S., & Ahn, Y. Y. (2021). The effectiveness of backward contact tracing in networks. Nature Physics, 17(5), 652-658.

[2] Tsimring LS, Huerta R. Modeling of contact tracing in social networks. Physica A: Statistical Mechanics and its Applications. 2003;325(1):33-39. doi:10.1016/S0378-4371(03)00180-8

2. Second, I think the plots can be improved a lot by carefully choosing colors and by limiting the number of lines/objects that each figure shows. Many figures have numerous (~10) lines with random colors associated with each line, making them very difficult to parse. I believe that most of the figures will not lose much information by reducing the number of lines to ~5. In addition, as each of these lines show a range of parameter values (i.e., they can be ordered), a linear colormap (e.g., sampling colors across the "viridis" colormap) would make them much easier to read. Also, the heatmap figure uses green-to-red colormap, which can be understood by a significant fraction of population who has colorblindness. Furthermore, the colormap used introduces an arbitrary cut-off point (between 750 and 1000) that introduces an artifact. Again, I believe that a linear, perceptually uniform colormap should be used here. Although this is probably not "critical" regarding PLOS ONE's publication criteria, I believe that this simple improvement in the figures will make the paper much more accessible.

Indeed, the original color schemes used for our plots were sometimes counterintuitive and probably difficult to comprehend for people with color blindness. To fix this issue, we have regenerated all our charts, including the heatmaps, using the linear colormap ‘inferno’. What is more, we have also reduced the number of lines in all the overlap/uptake figures to a maximum of 7 different rates, thus making them more comprehendible.

Attachment

Submitted filename: ResponsetoReviewers-Stage2.pdf

Decision Letter 2

Ivan Kryven

2 Nov 2021

Modelling digital and manual contact tracing for COVID-19. Are low uptakes and missed contacts deal-breakers?

PONE-D-21-07031R2

Dear Dr. Rusu,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Ivan Kryven

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Ivan Kryven

8 Nov 2021

PONE-D-21-07031R2

Modelling digital and manual contact tracing for COVID-19 Are low uptakes and missed contacts deal-breakers?

Dear Dr. Rusu:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Ivan Kryven

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File. Supporting material.

    Contains a link to the repository that maintains our open-source model, further discussions on other epidemic statistics we captured, and more charts illustrating the effects induced by varying the contact tracing parameters.

    (PDF)

    Attachment

    Submitted filename: Response to Reviewers.pdf

    Attachment

    Submitted filename: ResponsetoReviewers-Stage2.pdf

    Data Availability Statement

    All relevant data are to be found within the paper and the Supporting information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES