Effectiveness of isolation measures with app support to contain COVID-19 epidemics: a parametric approach

Andrea Maiorana; Marco Meneghelli; Mario Resnati

doi:10.1007/s00285-021-01660-9

. 2021 Oct 2;83(5):46. doi: 10.1007/s00285-021-01660-9

Effectiveness of isolation measures with app support to contain COVID-19 epidemics: a parametric approach

Andrea Maiorana ^1,^✉, Marco Meneghelli ¹, Mario Resnati ¹

PMCID: PMC8486969 PMID: 34599662

Abstract

In this study, we analyze the effectiveness of measures aimed at finding and isolating infected individuals to contain epidemics like COVID-19, as the suppression induced over the effective reproduction number. We develop a mathematical model to compute the relative suppression of the effective reproduction number of an epidemic that such measures produce. This outcome is expressed as a function of a small set of parameters that describe the main features of the epidemic and summarize the effectiveness of the isolation measures. In particular, we focus on the impact when a fraction of the population uses a mobile application for epidemic control. Finally, we apply the model to COVID-19, providing several computations as examples, and a link to a public repository to run custom calculations. These computations display in a quantitative manner the importance of recognizing infected individuals from symptoms and contact-tracing information, and isolating them as early as possible. The computations also assess the impact of each variable on the mitigation of the epidemic.

Keywords: Contact tracing, COVID-19, Epidemic models

Introduction

Main concepts and goals

This study aims to develop a probabilistic model to predict the effectiveness of containing an epidemic such as COVID-19 with measures aimed at finding and isolating infected individuals. More precisely, we are interested in modeling such “isolation measures,” by which we mean finding and isolating infected people via their symptoms and contact tracing, to predict the impact of these measures on the effective reproduction number of the epidemic. Special attention is dedicated to the case in which contact tracing is achieved, for a part of the population, through a mobile application.

Studies such as Ferretti et al. (2020) have underlined the role of asymptomatic and presymptomatic transmission in the COVID-19 outbreak, and the consequent importance of using a mobile application for efficient contact tracing. This insight has also led to the development of models to quantitatively assess the impact of a contact tracing app on the epidemic, primarily through agent-based approaches like in Pathogen Dynamics Group (2020).

In this paper, we propose an analytical approach to answer the following questions: How is the effective number $R_{t}$ of an epidemic impacted when isolation measures are in place versus when they are not, and what are the main factors contributing to the reduction in $R_{t}$ ? We take the effective reproduction number in the absence of isolation measures, denoted by $R_{t}^{0}$ , as an input of our model, which is thus independent of any underlying epidemic model. Moreover, our approach is parametric in that we concentrate the quantitative description of the isolation measures into relatively few, comprehensible parameters that comprise the input of the model. These parameters include the share of the population using an app, the share of people who self-isolate upon testing positive, and more.

Previous studies concerning the impact on the epidemic of isolating infected individuals include (Müller et al. 2000), which proposes a generative stochastic model of SIR-type, and Fraser et al. (2004), which uses an analytical method more similar to our own. The subject has also been addressed recently in Scarabel et al. (2021) using a deterministic dynamical model.

The starting point of our analysis is the effective reproduction number $R_{t}^{0}$ in the absence of isolation measures,1 that we consider as given. When discussing modeling “isolation measures,” we refer to policies focused on selectively isolating infected individuals after these individuals have been found through contact tracing or because they have displayed symptoms. We do not refer to generalized actions like imposing a lockdown, whose impact on the epidemic is considered already known and encompassed in $R_{t}^{0}$ .

$R_{t}^{0}$ is defined, for any absolute time t, as the expected number of cases generated by a random individual who was infected at time t during their lifetime. This quantity can be written as an integral

\begin{matrix} R_{t}^{0} = \int_{[0, + \infty)} β_{t}^{0} (τ) d τ, \end{matrix}

where $β_{t}^{0}$ is the infectiousness (also called effective contact rate): $β_{t}^{0}$ is a function describing the expected number of cases generated by an individual infected at time t, per unit of infectious age, that is the period of time (measured in days) elapsed from the time of infection of the individual. So, for example, the number

\begin{matrix} \int_{[1, 3)} β_{t}^{0} (τ) d τ \end{matrix}

is the expected number of people infected between 24 and 72 h from the infector’s moment of infection. Note that the normalization $β_{t}^{0} / R_{t}^{0}$ is the PDF of the generation time, the time taken by an individual infected at t to infect a different individual.2

In this study, we set up a methodology and a model to analyze changes in the reproduction number when the population is subject to isolation measures, including the support of an app for individuals who have tested positive, and depending on some parameters of simple interpretation. We denote by

\begin{matrix} R_{t} = \int_{[0, + \infty)} β_{t} (τ) d τ \end{matrix}

the effective reproduction number in presence of isolation measures, and we compute $R_{t}$ as a function of $R_{t}^{0}$ ,3 other epidemiological data such as the symptom onset distribution, and some parameters describing the isolation measures, such as the probability that an infected, symptomatic individual gets a test, or the probability that a recipient of the infection gets notified when their infector receives a positive test. We only model how isolation measures work and how they affect the epidemic,4 without assuming anything about how the epidemic itself develops. In particular, our model is agnostic of any particular form for $β_{t}^{0}$ and $R_{t}^{0}$ .

The final goal of the model we propose is to understand the most important leverages that may facilitate optimization to better direct efforts of decision-makers, scientists, and developers. Such factors include app efficiencies, timeliness of notifications, app adoption in the population, and others.

The assumptions of the model and outline of the paper

The model developed in Sect. 2 is the translation into mathematical terms of the following assumptions, that describe an idealized schema in which infected individuals acknowledge their illness and take measures to avoid infecting others.

An infected individual who shows symptoms is immediately5 notified that they should take a test (which does not discount the possibility that they acknowledge this necessity independent of an external input). This process does not always necessarily occur, but does so with a probability $s^{s}$ .
Given a infector–infectee pair, when the infector tests positive after the contagion, the infectee is immediately notified to take a test, with probability $s^{c}$ .
In either scenario, after an infected individual is notified to take a test, they take a test which will return a positive result after a time from the notification, which is distributed according to a given distribution $Δ^{A \to T}$ (possibly reaching $+ \infty$ to account for the case in which the individual is never tested or never receives the positive outcome).
Immediately upon receiving the positive outcome of the test, an average infected individual will self-isolate with probability $ξ$ . Put differently, the number of individuals they infect from this moment is reduced by a factor $1 - ξ$ compared to the scenario in which they do not take any isolation measures.

The equations derived from these hypotheses produce an algorithm that computes the time evolution of the key quantities. This is summarized in Sect. 2.5.

Note that in our model we are only considering forward contact tracing, i.e., infectees are notified of the positive result of their infectors, but not vice-versa. Doing otherwise would significantly complicate the discussion. This is probably the main limitation of the model, which may thus underestimate the effectiveness of the isolation measures: While backward contact tracing is in general less effective when timeliness in isolating infected individuals is key, it must be noted that its effect may be significant for epidemics for which super-spreaders, i.e. individuals that infect a large number of people, have a major impact on the contagion. Such individuals may be identified more easily thanks to backward tracing. A treatment of backward tracing in the context of a generative model is covered in Müller et al. (2000, §3.1).

Subsequently, in Sect. 3 we consider a more complex model. Instead, we assume that the population is split into two groups, depending on whether or not they use a mobile application for epidemic control. The parameters $s^{s}$ and $s^{c}$ are different, depending on whether they refer to individuals who use the app.

Finally, in Sect. 4, we apply these models by computing the suppression of $R_{t}$ for specific choices of the input parameters, particularly to assess the importance of such parameters. As for the input parameters that describe the epidemic, we use data relative to COVID-19. All these data are taken for a single source (Ferretti et al. 2020). It should be noted that these quantities are still preliminary, have quite large uncertainties, and are not necessarily the most up-to-date. However, we stress that these data are only used as inputs in all our computations, which can be easily reproduced and extended by using the code available in the open repository (Maiorana and Meneghelli 2021). It would be immediate to redo the computations with different inputs, to reflect any new understandings the scientific community should gain on COVID-19. In addition to this, in Sect. 4.2.3 we briefly check the robustness of our results with respect to changes in some epidemic data, namely the share of infected individuals that are asymptomatic, the contribution of those individuals to the reproduction number, and the generation time distribution.

The paper includes an Appendix where the main steps of the mathematical model are proven rigorously, in a framework where the hypotheses can be formulated precisely using the language of probability theory.

Discussion of the results

Summing up, this paper introduces a model of targeted isolation measures—with special attention paid to those based on contact tracing—in the context of an epidemic with given dynamics. It studies the impact of these, measured as the change in the key indicators of the epidemics (first of all, the reproduction number) with respect to the situation without measures. It presents a methodology to turn the assumptions defining the model into mathematical equations, without assuming an underlying model explaining the time evolution of the epidemic. In particular, the formalism developed in the Appendix allows a careful and exact development of the theory, in which all the interdependencies of the involved quantities are clarified. We end up with with a set of equations that express the relevant quantities in terms of those relative to previous times, giving a deterministic time evolution.

These equations (summarized in Sect. 2.5 for the “homogeneous” setting) are quite complex, reflecting the non-triviality of the assumptions about how isolation measures work. This makes it hard to analyze them analytically, for example, to study the asymptotic behaviour of the solutions, as was done in Fraser et al. (2004). On the other hand, our treatment allows us to refrain from making strong and unrealistic independence assumptions about the involved quantities, and leaves us greater freedom in setting up the hypotheses of how contact tracing works (for example, the isolation of contact-traced individuals is not assumed to be certain, nor immediate). And, notably, it allows us to numerically compute, with arbitrary precision, the time evolution of the reproduction number $R_{t}$ (and, hence, of the epidemic size) starting from the “default” reproduction number $R_{t}^{0}$ , other epidemiological data, and the parameters introduced in Sect. 1.2 describing the isolation measures.

We stress that, despite our extensive use of the language of probability theory, our model of the isolation measures is deterministic: It works as if the full history of the epidemic, with or without isolation measures, is given, and uses some parameters describing the mean efficacy of the isolation measures on the population. It then expresses $R_{t}$ in terms of $R_{t}^{0}$ and these parameters.

Note also that, in this paper, we always refer to $R_{t}$ as the case reproduction number. Sometimes, the instantaneous reproduction number is instead used in the literature when monitoring the evolution of an epidemic.6 Our choice is also connected to the way in which we formulate our mitigation hypotheses in Sect. 2 in terms of parameters $s^{s}$ , $s^{c}$ , $ξ$ , which we consider depending on absolute infection times rather than notification and isolation times. An alternative formulation following the latter option would add some slightly more cumbersome formulae but otherwise no essential complications of note to the treatment.

A limitation to the model comes from our homogeneous-mixing hypotheses regarding contact tracing and isolation policies: The only heterogeneity taken into account is the separation between individuals who do or do not use an app in Sect. 3. For example, the fact that, in reality, individuals belonging to the same household are more easily traced (in addition to being more easily infected by each other) is not taken into account. Besides the absence of backward contact tracing, mentioned in Sect. 1.2, other limitations may be attributed to the specific form of the hypotheses. However, many changes to the assumptions could be taken into account within the same mathematical framework: Features such as a different delay in testing for symptomatic or contact-traced individuals, or the existence of a targeted quarantine for potential infected individuals (even before they get tested) could be modeled without adding conceptual complications.

By using the model in Sect. 4 to compute the reduction in $R_{t}$ , we can recognize how isolation measures, particularly app-mediated isolation measures, can play an important role in mitigating epidemics like COVID-19. However, our results show how the impact of such measures is strongly sensitive to parameters describing their efficiency and timeliness: For example, the reduction in $R_{t}$ quickly becomes insignificant as the time taken to get a positive test result (and then to start isolating) grows past a few days (see Fig. 2).

Fig. 2 — ${Eff}_{\infty}$ as a function of the time $Δ^{A \to T}$ from notification to positive testing

The computations relative to the case in which an app is used show the importance of having an app which is effective at spotting infections, maximizing the fraction of true-positives.7 Past studies like Bendavid et al. (2021) and Li et al. (2020) suggest that “standard” contact tracing measures used by healthcare systems may be less efficient (fewer truly infected individuals are recognized) and slower when compared to an app (usually, several days elapse between symptom onset, the first medical visit, and the test outcome). In the computations, we model this fact by setting different parameters for people using an app and people who don’t, with the latter parameters left to reasonably low values. We analyze how the impact on the epidemics depends on these parameters and the app adoption rate (Fig. 9), showing how these are all key factors in reaching satisfactory epidemic mitigation levels.

Fig. 9 — ${Eff}_{\infty}$ as a function of app adoption $ϵ_{app}$

The mathematical model in the homogeneous population setting

In this section, we develop the core mathematical model of the paper. We do so with a simplified scenario in which the same isolation measures apply to the entire population, thus eliminating the need to distinguish between those who do and who do not use an app. Some mathematical derivations require extra care, and their complete proofs have been moved to the Appendix to prevent this section from being loaded with many formulae and a heavier formalism.

Notations and conventions

We consider random variables on the sample space of all infected individuals, describing (absolute) times at which certain events happen: $t^{I}$ (time of infection), $t^{S}$ (time of symptom onset), $t^{A}$ (time of infection notification), $t^{T}$ (time of positive test). These variables can take $+ \infty$ as a value to express the cases in which an event never takes place (this is useful when writing relations between them).

As we want to relate these variables to the reproduction number $R_{t}$ , which measures the average number of people infected by an individual infected at a given time t, it is logical that all these variables refer to the infectious age (that is, the time from the infection) of the average individual infected at t: so we have, for example, the relative time of symptom onset, which is the $[0, + \infty]$ -valued random variable

\begin{matrix} τ_{t}^{S} = (t^{S} - t^{I}) {|_{t^{I} = t} = t^{S} |}_{t^{I} = t} - t . \end{matrix}

We can assume that this variable is independent of the contagion time t. Hence, we denote it by $τ^{S}$ . Analogously, we have the random variables $τ_{t}^{A}$ (time of notification for an individual infected at t, measured since t), $τ_{t}^{T}$ (time of positive test for an individual infected at t, measured since t).

In this section we need to understand how to describe the random variables $τ^{S}$ , $τ_{t}^{A}$ , $τ_{t}^{T}$ , and their relation to the reproduction number

\begin{matrix} R_{t} = \int_{[0, + \infty)} β_{t} (τ) d τ, \end{matrix}

based on the assumptions of Sect. 1.2. The finite parts of these random variables are described using improper CDFs, denoted by $F^{S}$ , $F_{t}^{A}$ , and $F_{t}^{T}$ respectively, whose limit for $τ \to + \infty$ (representing the probability that each time is less than infinite) may be less than 1. So, for example, $F_{t}^{T} (τ)$ denotes the probability that an individual infected at t tests positive within a time $τ$ from the time of infection. ${lim}_{τ \to + \infty} F_{t}^{T} (τ)$ is the probability that the same individual eventually tests positive.

Further auxiliary variables are introduced later on.

The suppression model for $R_{t}$

Recall from Sect. 1.2 how we assume that self-isolation works: If an infected individual tests positive, then they immediately self-isolate, resulting in a reduction, on average, of the number of people they subsequently infect by a multiplicative factor $1 - ξ_{t}$ , which we assume given, and possibly depending on the time t at which the individual was infected.8

We can then determine a relation between the “default” reproduction number density $β_{t}^{0}$ , its correction $β_{t}$ as a result of the isolation measures, and the distribution of the relative time $τ_{t}^{T}$ at which individuals infected at t receive a positive test result. This relation holds for any t greater or equal to the time $t_{0}$ at which the isolation measures are enacted.

For simplicity, let’s assume for a moment that receiving a positive test and infecting someone (assuming no isolation measures) at a given infectious age $τ$ are independent events. By $τ$ , an individual who was infected at t has already received a test with probability $P (τ_{t}^{T} < τ) = F_{t}^{T} (τ)$ . In such a case, the number of people they infect per unit time is $(1 - ξ_{t}) β_{t}^{0} (τ)$ . Alternatively, if the individual has not received a test by $τ$ (which happens with probability $1 - F_{t}^{T} (τ)$ ), they do not self-isolate, and the average number of people they infect per unit time is just $β_{t}^{0} (τ)$ . In summary, we have, for any $τ \in [0, + \infty)$ ,

\begin{matrix} β_{t} (τ) = & F_{t}^{T} (τ) (1 - ξ_{t}) β_{t}^{0} (τ) + (1 - F_{t}^{T} (τ)) β_{t}^{0} (τ) \\ = & β_{t}^{0} (τ) (1 - ξ_{t} F_{t}^{T} (τ)) . \end{matrix}

This is analogous to Eq. 6 in Fraser et al. (2004). To illustrate further, suppose that all infected individuals test positive at the same infectious age $τ_{T}$ , i.e. $F_{t}^{T} (τ)$ is a Heaviside function with step at $τ_{T}$ : then we have $β_{t} (τ) = β_{t}^{0} (τ)$ for $τ < τ_{T}$ and $β_{t} (τ) = (1 - ξ_{t}) β_{t}^{0} (τ)$ for $τ \geq τ_{T}$ .

However, the above result relies on the assumption of independence between testing positive and the number of people the individual would infect without isolation. In practice, this is not an adequate reflection of what occurs. For example, with COVID-19, it is known that a significant proportion of the infected population is asymptomatic, and less contagious—see e.g. Mizumoto et al. (2020) and Ferretti et al. (2020). Given the lack of symptoms, this population has a lower probability of self-isolating. To overcome this factor, we introduce a new random variable G, which has a finite range ${g_{1}, . . ., g_{n}}$ that describes the severity of symptoms of an infected individual. It is assumed to be independent of the time $τ^{S}$ of symptom onset, but it is related to the number of infected people and the probability of the individual recognizing their own symptoms. Then, to write a relation between $F_{t}^{T}$ and $R_{t}$ , we restrict the relevant random variables to each possible value of G: for any $g = g_{1}, . . ., g_{N}$ we denote by

\begin{matrix} F_{t, g}^{T} (τ) \end{matrix}

the probability that an individual infected at time t and with severity g has tested positive by $τ$ . Similarly, we denote by

\begin{matrix} R_{t, g} = \int_{[0, + \infty)} β_{t, g} (τ) d τ \end{matrix}

the average number of people infected by an individual infected at t and with severity g, and by $R_{t, g}^{0}$ the analogous quantity in absence of isolation measures. Assuming now that for a given g the number of people infected (without isolation) and the event of being tested are independent, we write our “suppression formula” as

\begin{matrix} \begin{matrix} β_{t, g} (τ) = β_{t, g}^{0} (τ) (1 - ξ_{t} F_{t, g}^{T} (τ)) . \end{matrix} \end{matrix}

In Sect. A.3 we include a careful derivation of this formula. Note that the relations with the aggregate variables are

\begin{matrix} F_{t}^{T} = \sum_{g} p_{g} F_{t, g}^{T}, R_{t} = \sum_{g} p_{g} R_{t, g}, \end{matrix}

where $p_{g} = P (G = g)$ is the probability that an infected individual has symptoms with severity g.9

Also, in Sect. 4 we always take G to assume the values 0 and 1 only, to describe asymptomatic versus symptomatic infected individuals. However, this formalism allows for a greater diversification of $R_{t}^{0}$ , according to the severity of the illness.

We end this subsection with an example of an application of (2) in a simplified scenario. Suppose that G only takes the values 0 and 1, describing asymptomatic and symptomatic infected individuals, and that each constitutes half of the population. Suppose also that $ξ_{t} = 1$ , and that asymptomatic individuals are never tested, so that $F_{t, 0}^{T} = 0$ , while symptomatic individuals are tested immediately after infection, so that $F_{t, 1}^{T} (τ) = θ (τ)$ , where $θ$ is the Heaviside function. Then, we have $R_{t, 0} = R_{t, 0}^{0}$ and $R_{t, 1} = 0$ , so that $R_{t} = R_{t, 0}^{0} / 2$ . Had we used Eq. (1) instead, we would have ended up with $R_{t} = R_{t}^{0} / 2$ , which does not take into account the fact that isolating symptomatic individuals has a greater impact on the reduction of $R_{t}$ than isolating the same proportion of randomly chosen individuals.

First considerations on the variables $τ^{S}$ , $τ_{t}^{A}$ , and $τ_{t}^{T}$

The distribution of the time $τ^{S}$ of symptom onset is independent of the isolation policy and is considered as given throughout the paper, although its specific shape is irrelevant in this section.10

The description of $τ_{t}^{A}$ is addressed in the next subsection. Here, we only consider its relation with $τ_{t}^{T}$ : Having assumed that the time between notification and testing positive is described by a given random variable $Δ^{A \to T}$ , which is independent from $τ_{t}^{A}$ and for simplicity constant in absolute time, we have

\begin{matrix} τ_{t}^{T} = τ_{t}^{A} + Δ^{A \to T} . \end{matrix}

The relation still holds if we restrict it to individuals with a given severity g, and hence

\begin{matrix} F_{t, g}^{T} (τ) = \int_{[0, + \infty)} F_{t, g}^{A} (τ - τ^{'}) d F^{A \to T} (τ^{'}), \end{matrix}

where $F^{A \to T}$ is the improper CDF of $Δ^{A \to T}$ .

Describing $τ_{t}^{A}$

In this subsection, we consider the random variable $τ_{t}^{A}$ and study the relations with it that formalize the assumptions of Sect. 1.2, namely:

When an infected individual shows symptoms, they receive an immediate notification to get tested, with probability $s_{t, g}^{s}$ depending on the severity g of symptoms, and possibly on the infection time t.
Immediately after an infector tests positive, each infectee is notified of the risk, with probability $s_{t}^{c}$ . If the contagion takes place after the positive test, then the infectee is never notified.

We introduce two new random variables relative to individuals infected at a given time t, describing the receiving of a notification for either cause:

We denote by $τ_{t, g}^{A,s}$ the time from infection at which an individual infected at t and with severity g is notified because of symptoms. We assume that this happens with probability $s_{t, g}^{s}$ at the time $τ^{S}$ of the symptom onset, so its improper CDF is simply11
$\begin{matrix} F_{t, g}^{A,s} = s_{t, g}^{s} F^{S} . \end{matrix}$ 4
We denote by $τ_{t}^{A,c}$ the time from infection at which an individual infected at t receives a notification resulting from the positive test of their infector. Below, we see how to describe this.

The relation between these new variables and $τ_{t, g}^{A}$ is

\begin{matrix} τ_{t, g}^{A} = min (τ_{t, g}^{A,s}, τ_{t}^{A,c}) . \end{matrix}

In terms of improper CDFs, and assuming independence of the two notification times, this gives

\begin{matrix} F_{t, g}^{A} = F_{t, g}^{A,s} + F_{t}^{A,c} - F_{t, g}^{A,s} F_{t}^{A,c} . \end{matrix}

Describing $τ_{t}^{A,c}$ requires the introduction of an additional random variable $τ_{t}^{σ}$ , that gives, for any individual infected at t, the time elapsed between the the infection time of their infector and t. In particular, we need the joint distribution of $τ_{t}^{σ}$ and the severity G, that can be described in terms of improper CDFs $F_{t}^{σ, g}$ : Let

\begin{matrix} F_{t}^{σ, g} (τ) \end{matrix}

denote the probability that, given an individual infected at t, their infector has severity g and was infected at a time $t^{'} \geq t - τ$ . Note that these improper CDFs satisfy a normalization condition

\begin{matrix} lim_{τ \to + \infty} \sum_{g} F_{t}^{σ, g} (τ) = 1, \end{matrix}

and they are completely determined by quantities relative to times preceding t, namely the number of infected people and the infectiousness (more details on how they are computed are deferred to Sect. A.5).

Now, the notification time $τ_{t}^{A,c}$ of an individual infected at t is by hypothesis equal to the testing time $τ_{t^{'}}^{T}$ of the infector minus the generation time $τ_{t}^{σ}$ , but only if the notification actually occurs, which happens with probability $s_{t}^{c}$ provided that the contagion took place before $τ_{t^{'}}^{T}$ . Hence, to get the improper CDF $F_{t}^{A,c}$ we should first average $F_{t - τ}^{T}$ , translated to the left by $τ = τ_{t}^{σ}$ , over all possible values of $τ > 0$ , each weighted by the probability of the generation time being $τ$ . In doing this we should also treat separately the different severity levels that the infector may have, as these impact the testing time distribution. So $F_{t}^{A,c} (ρ)$ should look like a sum

\begin{matrix} s_{t}^{c} \sum_{g} \int_{(0, + \infty)} F_{t - τ, g}^{T} (ρ + τ) d F_{t}^{σ, g} (τ) . \end{matrix}

This formula doesn’t take into account that by assumption the notification can only occur after the contagion time, meaning that $F_{t}^{A,c}$ must be supported on positive numbers. This is considered by replacing the integrand with the probability

\begin{matrix} P (τ < τ_{t - τ, g}^{T} \leq ρ + τ) = F_{t - τ, g}^{T} (ρ + τ) - F_{t - τ, g}^{T} (τ) . \end{matrix}

Also, in averaging the CDFs $F_{t - τ}^{T}$ we should take into account the fact that the testing time of the infector is not distributed like the testing time of an arbitrary individual: Having infected someone at the infectious age $τ$ , the infector is more likely than average to be tested after $τ$ , or to never receive a test. As we will show carefully in the Appendix, to take this into account we need to divide the integrand by the same suppression factor $1 - ξ_{t} F_{t, g}^{T} (τ)$ that appears in Eq. (2), evaluated at $t - τ$ . We conclude that, for any $ρ \geq 0$ , we have

\begin{matrix} F_{t}^{A,c} (ρ) = s_{t}^{c} \sum_{g} \int_{(0, + \infty)} \frac{F_{t - τ, g}^{T} (ρ + τ) - F_{t - τ, g}^{T} (τ)}{1 - ξ_{t - τ} F_{t - τ, g}^{T} (τ)} d F_{t}^{σ, g} (τ) . \end{matrix}

This result is proven rigorously in Sect. A.6.

Summary and discrete-time algorithm

In this section, we have translated the hypotheses made in Sect. 1.2 into mathematical equations describing a dynamical system. In doing this, we added a few natural assumptions of independence between the variables under considerations, namely:

the assumption in Sect. 2.2 that the testing time of an individual with given severity is independent from their default infectiousness
the assumption of independence between notification times $τ_{t, g}^{A,s}$ and $τ_{t}^{A,c}$ and the testing delay $Δ^{A \to T}$

Putting all the equations together, we see that we can compute, at any time t, the suppressed infectiousness $β_{t, g}$ in terms of the parameters $s_{t, g}^{s}$ , $s_{t}^{c}$ , $ξ_{t}$ , $Δ^{A \to T}$ of the model, the default infectiousness $β_{t, g}^{0}$ and the other known epidemiological quantities, and the distributions relative to previous times $t^{'} < t$ .

To add an initial condition to the dynamical system, we assume that the isolation measures start at a given absolute time $t_{0}$ , so that $s_{t}^{s} = s_{t}^{c} = ξ_{t} = 0$ for $t < t_{0}$ .12 Hence, all individuals infected at $t < t_{0}$ will never take a test (even after $t_{0}$ ) and never self-isolate. As a consequence, the effective reproduction number is $R_{t}^{0}$ for $t < t_{0}$ , while it gets reduced according to Eq. (2) for $t \geq t_{0}$ . In particular, individuals infected at $t = t_{0}$ can only be notified of the need to take a test through symptoms, so that $F_{t_{0}}^{A,c} = 0$ .

Our set of equations can be approximated with arbitrary precision to a discrete-time algorithm that computes how the epidemic evolves, given the above data.13 This is the algorithm used in the calculations of Sect. 4.

Summing up, for each time $t \geq t_{0}$ , the algorithm works as follows:

Compute the number $ν_{t}$ of individuals infected at t and the improper CDFs $F_{t}^{σ, g}$ , from $ν_{t^{'}}$ and $β_{t^{'}, g}$ for $t^{'} < t$ (as detailed in Sect. A.5).
Compute the distribution of $τ_{t, g}^{A,s}$ as in Eq. (4):
$\begin{matrix} F_{t, g}^{A,s} = s_{t, g}^{s} F^{S} . \end{matrix}$
Compute the distribution of $τ_{t}^{A,c}$ from $F_{t}^{σ, g}$ and the distribution of $τ_{t^{'}, g}^{T}$ , for $t^{'} < t$ , using Eq. (6). If $t = t_{0}$ , just take $F_{t}^{A,c} = 0$ .
Compute the distribution of $τ_{t, g}^{A}$ using Eq. (5), that is
$\begin{matrix} F_{t, g}^{A} = F_{t, g}^{A,s} + F_{t}^{A,c} - F_{t, g}^{A,s} F_{t}^{A,c}, \end{matrix}$
and then the distribution of $τ_{t, g}^{T}$ from $F_{t, g}^{A}$ via Eq. (3).
Compute $β_{t, g}$ using the distribution of $τ_{t, g}^{T}$ , via Eq. (2):
$\begin{matrix} \begin{matrix} β_{t, g} (τ) = β_{t, g}^{0} (τ) (1 - ξ_{t} F_{t, g}^{T} (τ)) . \end{matrix} \end{matrix}$

The extended model including the use of an app for epidemic suppression

So far, we have operated under the hypothesis that the ability to inform infected people that their source has been infected can be described by a single (possibly time-dependent) parameter $s_{t}^{c}$ . Now, let’s suppose that the population is divided into people who use an app for epidemic control and people who do not. This forces us to complicate the model of Sect. 2 because, when we analyze the distribution of the notification time $τ_{t}^{A,c}$ for people with the app, we need to apply different weights to the cases in which the source of the contagion has the app or does not. We also leave open the possibility that people using the app may have a different probability $s_{t}^{s}$ of requiring a test because of their symptoms.

The generalization of the homogeneous scenario to this case is quite straightforward. In any case, some more mathematical detail has been added in Sect. A.7.

Parameters and random variables in the two-component model

A share $ϵ_{t, app}$ of the infected population, perhaps depending on the absolute time t, uses an app that may do the following:

It gives the users clear instructions on how to behave when they have symptoms indicative of the disease, assuming that this can increase the probability that an infected individual asks the health authorities to be tested because of their symptoms.
It notifies the users when they have had contact with an infected individual who also uses the app, assuming that this can increase the probability that an infected individual asks the health authorities to be tested because of contact with an infected person.

We then distinguish $s_{t, g}^{s}$ into $s_{t, g}^{s,app}$ and $s^{s,no} {app}_{t, g}$ , describing the probability that an individual infected at t, respectively with or without the app, is notified of the need to be tested given that they have symptoms with severity g. Note that

\begin{matrix} s_{t, g}^{s} = ϵ_{t, app} s_{t, g}^{s,app} + (1 - ϵ_{t, app}) s^{s,no} {app}_{t, g}, \end{matrix}

so that this distinction does not complicate the model, and is made only for adding clarity in the computations.

The increased complexity of this situation lies in the fact that $s_{t}^{c}$ now has to be replaced by two parameters $s_{t}^{c,app}$ and $s^{c,no} {app}_{t}$ , describing the probabilities that, given an infector–infectee pair, the positive testing of the infector occurred after the infection caused a notification to be sent to the infectee, respectively in the cases that both the infector and the infectee have the app, and that at least one of them does not have the app. Note that there is no relation between $s_{t}^{c,app}$ and $s^{c,no} {app}_{t}$ and the general $s_{t}^{c}$ as simple as Eq. (7).

We also distinguish each random variable between people with the app and people without it. For example, the time of notification due to contact now reads $τ_{t, app}^{A,c}$ for people with the app and $τ_{t, no app}^{A,c}$ for people without it. The relation between their improper CDFs is

\begin{matrix} F_{t}^{A,c} = ϵ_{t, app} F_{t, app}^{A,c} + (1 - ϵ_{t, app}) F_{t, no app}^{A,c} . \end{matrix}

We have analogous formulae for $τ_{t}^{T}$ and $τ_{t}^{A,s}$ , while there is no need to make a distinction for $τ^{S}$ .

Likewise, we have to separate $R_{t}$ into two components $R_{t, app}$ and $R_{t, no app}$ , namely, the average number of people infected by someone infected at t who has or does not have the app, respectively:

\begin{matrix} R_{t} = ϵ_{t, app} R_{t, app} + (1 - ϵ_{t, app}) R_{t, no app} . \end{matrix}

Analogous relations hold when restricted to individuals whose illness has a given severity g.

It is reasonable to assume that having or not having the app is independent of symptom severity, so that, for example, the fraction of individuals infected at time t using the app and with severity g is $ϵ_{t, app} p_{g}$ . Also, while of course having an app does impact the testing time distribution and the infectiousness, we can safely suppose that it is independent of the default infectiousness, i.e. the number of people an individual would have infected in the absence of measures. This is why in this scenario the suppression formula (2) simply becomes

\begin{matrix} \begin{matrix} β_{t, g, a} (τ) = β_{t, g}^{0} (τ) (1 - ξ_{t} F_{t, g, a}^{T} (τ)), \end{matrix} \end{matrix}

for $a = app, no app$ .

The mathematical relations between the random variables

Now, we can write the new relations between the random variables. Eq. (4) is replaced by

\begin{matrix} F_{t, g, app}^{A,s} = s_{t, g}^{s,app} F^{S}, F_{t, g, no app}^{A,s} = s^{s,no} {app}_{t, g} F^{S} . \end{matrix}

The relations (3), (5) immediately extend to each component.

The distributions of $τ_{t, app}^{A,c}$ and $τ_{t, no app}^{A,c}$ can be computed similarly to as we did in Sect. 2.4 for the homogeneous case. But now, for each of them Eq. (6) needs to be split into two parts, accounting for the cases in which the source of the infection has or doesn’t have the app:

\begin{matrix} \begin{matrix} F_{t, app}^{A,c} (ρ) & = & s_{t}^{c,app} \sum_{g} \int_{(0, + \infty)} \frac{F_{t - τ, g, app}^{T} (ρ + τ) - F_{t - τ, g, app}^{T} (τ)}{1 - ξ_{t - τ} F_{t - τ, g, app}^{T} (τ)} d F_{t}^{σ, g, app} (τ) \\ + s^{c,no} {app}_{t} \sum_{g} \int_{(0, + \infty)} \frac{F_{t - τ, g, no app}^{T} (ρ + τ) - F_{t - τ, g, no app}^{T} (τ)}{1 - ξ_{t - τ} F_{t - τ, g, no app}^{T} (τ)} d F_{t}^{σ, g, no app} (τ) . \end{matrix} \end{matrix}

For $F_{t, no app}^{A,c}$ we get a similar equation with $s_{t}^{c,app}$ replaced by $s^{c,no} {app}_{t}$ : In this case, it doesn’t matter whether or not the infector has the app. The equation simplifies to a form analogous to Eq. (6), namely

\begin{matrix} F_{t, no app}^{A,c} (ρ) = s^{c,no} {app}_{t} \sum_{g} \int_{(0, + \infty)} \frac{F_{t - τ, g}^{T} (ρ + τ) - F_{t - τ, g}^{T} (τ)}{1 - ξ_{t - τ} F_{t - τ, g}^{T} (τ)} d F_{t}^{σ, g} (τ) . \end{matrix}

Again, we refer to the Appendix for a greater mathematical rigor: The last two equations are derived in greater detail in Sect. A.7.

Scenarios and calculations

In this section, we use the models introduced in Sects. 2 and 3 to numerically compute the suppression of $R_{t}$ due to isolation measures in certain scenarios.

The results reported here, as well as new custom calculations, can be obtained by cloning the public Python repository (Maiorana and Meneghelli 2021).

General considerations

Some inputs of the algorithm developed are parameters or distributions describing the features of the epidemic under consideration. In this section, we focus on COVID-19, and we make the following assumptions, taking all the epidemic data from Ferretti et al. (2020) (Table 1, in particular) for convenience:

The incubation period $τ^{S}$ is distributed according to a log-normal distribution:
$\begin{matrix} F^{S} (τ) = F_{N_{0, 1}} ((log (τ) - μ) σ) \end{matrix}$
where $F_{N_{0, 1}}$ denotes the CDF of the standard normal distribution. The parameters $μ = 1.64$ and $σ = 0.36$ used here imply that the mean incubation period is $≃ 5.5$ days.
The default infectiousness distribution $β_{t}^{0}$ is assumed to depend on the absolute time t only via a global factor, so that
$\begin{matrix} β_{t}^{0} (τ) = R_{t}^{0} ρ^{0} (τ), \end{matrix}$
where $ρ^{0}$ (which also represents the default generation time distribution) integrates to 1. It is described by a Weibull distribution with mean 5.00 and variance 3.61:
$\begin{matrix} ρ^{0} (τ) = \frac{k}{λ} {(\frac{τ}{λ})}^{k - 1} e^{- {(τ / λ)}^{k}} \end{matrix}$
with $k = 2.855$ , $λ = 5.611$ .
We simplify the severity of symptoms by considering only two levels of severity: $g = sym$ and $g = asy$ , respectively for symptomatic and asymptomatic individuals. We take asymptomatic individuals as 40%, and we assume that they account for 5% of $R_{t}$ .14 In formulae, this means that the input parameters of our model are
$\begin{matrix} \begin{matrix} p_{sym} = 0.6, p_{asy} = 0.4, \\ β_{t, sym}^{0} = \frac{0.95}{0.6} R_{t}^{0} ρ^{0}, β_{t, asy}^{0} = \frac{0.05}{0.4} R_{t}^{0} ρ^{0} . \end{matrix} \end{matrix}$

All these assumptions hold throughout the whole section except for Sect. 4.2.3, where we check how the results change using different epidemic data. The other parameters of the model, describing the isolation measures, are selected later.

As Key Performance Indicators (KPIs) describing the effectiveness of the isolation measures, we look at the reduction of $R_{t}$ compared to the value $R_{t}^{0}$ it would take in the absence of measures. We call effectiveness of the isolation measures the relative reduction in $R_{t}^{0}$ :

\begin{matrix} {Eff}_{t} : = 1 - \frac{R_{t}}{R_{t}^{0}} . \end{matrix}

Thus, ${Eff}_{t} = 0$ indicates that there is no effect on $R_{t}^{0}$ , while ${Eff}_{t} = 1$ describes a complete suppression of the contagion. We will see in Sect. 4.2.3 that the dependency of ${Eff}_{t}$ on the default reproduction number $R_{t}^{0}$ is very weak.15 As such, attempts to model a realistic profile for $R_{t}^{0}$ have little relevance to our computations, as any choice of $R_{t}^{0}$ leads to almost the same ${Eff}_{t}$ . Thus, in the rest of this section we simply take

\begin{matrix} R_{t}^{0} = 1 . \end{matrix}

Another useful KPI is the probability that an individual infected at a certain time t is eventually found to be positive, namely the limit

\begin{matrix} F_{t}^{T} (\infty) : = lim_{τ \to + \infty} F_{t}^{T} (τ) . \end{matrix}

In the remainder of this section, we report the results of some selected calculations, considering both the “homogeneous” scenario of Sect. 2 and, in greater detail, the scenario of Sect. 3, in which an app for epidemic control is used. First, we study how the above KPIs evolve in time for certain input parameter choices. Then, we focus on the limits for $t \to + \infty$ of these KPIs, i.e., their “stable” values after a sufficient number of iterations, to study how these vary when we change certain input parameters, leaving the others fixed.

Reduction in $R_{t}$ with homogeneous isolation measures

First, we perform some calculations in the setting of Sect. 2, where the isolation measures are “homogeneous” within the whole population. We recall the parameters that describe this situation, some of which remain fixed in all the calculations:16

Parameter	Meaning	Value
$s_{sym}^{s}$	Probability that a symptomatic infected individual is notified of the infection because of their symptoms	Not fixed
$s_{asy}^{s}$	As above, but for the asymptomatic	0
$ξ$	Probability that someone testing positive self-isolates	Not fixed
$Δ^{A \to T}$	Time from notification to positive testing	Constant distribution, whose value is not fixed at this moment
$t_{0}$	Time at which isolation measures begin	0

Open in a new tab

Note that assuming that $Δ^{A \to T}$ is a constant random variable means that we are modeling that all individuals notified of the risk test positive, and take the same time to do so. Although unrealistic, this assumption makes little difference to the results. It is made here for simplicity, although it can be easily changed by using a more realistic $Δ^{A \to T}$ , when this datum is available.

Time evolution with isolation due to both symptoms and contact-tracing

We now choose the following parameters, describing an optimistic situation, with reasonable efficiencies in spotting infected individuals:

Parameter	Value
$s_{sym}^{s}$	0.5
$s^{c}$	0.7
$ξ$	0.9
$Δ^{A \to T}$	2

Open in a new tab

The results are shown in Fig. 1. Note that immediately at $t = 0$ $R_{t}$ drops to around 0.92, as half of the symptomatic individuals are notified as soon as they show some symptoms, and they then infect a reduced number of people. Subsequently, $R_{t}$ continues to decrease due to contact-tracing, quickly approaching its limit value $R_{\infty} ≃ 0.77$ (i.e., 84% of the value it would have had with isolation due to symptoms only).

Dependency on testing timeliness

We now focus on the limit value ${Eff}_{\infty}$ , investigating its dependency on the time $Δ^{A \to T}$ from a notification to the positive result of the test (recall that we are assuming that $Δ^{A \to T}$ is a constant random variable).

Like in Sect. 4.2.1, the other parameters are fixed as follows:

Parameter	Value
$s_{sym}^{s}$	0.5
$s^{c}$	0.7
$ξ$	0.9

Open in a new tab

The result is plotted in Fig. 2. The effectiveness of the isolation measures improves dramatically with the ability to test (and then isolate) infected individuals as soon as possible after their notification of possible infection.

Dependency on the epidemic data used

In this subsection we briefly explore what happens if we change some of the data describing the epidemic, that were introduced in Sect. 4.1 and used elsewhere in this section. This is done to see how ${Eff}_{\infty}$ depends on these data. The other parameters, describing the isolation measures, are fixed as usual:

Parameter	Value
$s_{sym}^{s}$	0.5
$s^{c}$	0.7
$ξ$	0.9
$Δ^{A \to T}$	2

Open in a new tab

First, we let the fraction $p_{sym}$ of symptomatic individuals vary, along with their contribution to $R_{t}^{0}$ —let us denote it here by $κ$ —that is elsewhere taken as $κ = 0.95$ . Recall that

\begin{matrix} R_{t, sym}^{0} = \frac{κ}{p_{sym}} R_{t}^{0}, R_{t, asy}^{0} = \frac{1 - κ}{1 - p_{sym}} R_{t}^{0} . \end{matrix}

The value of ${Eff}_{\infty}$ for a few choices of $p_{sym}$ and $κ$ is plotted in Fig. 3, where it is apparent how the result is robust with respect to changes in these input data. Note that if we fix $p_{sym}$ and let $κ$ vary, then the two components of $R_{t}^{0}$ (and hence those of $R_{t}$ ) are linearly rescaled, meaning that ${Eff}_{\infty}$ also changes linearly.

Second, we fix $p_{sym} = 0.6$ and $κ = 0.95$ as usual, and we modify instead the density $ρ^{0}$ of the default generation time, by replacing it with

\begin{matrix} ρ_{f}^{0} (τ) = \frac{1}{f} ρ^{0} (τ / f) \end{matrix}

for $f > 0$ . Note that this implies that the expected value of the default generation time (denoted here by $τ^{0, C}$ ) is multiplied by f:

\begin{matrix} E (τ^{0, C}) = f (5 days) . \end{matrix}

Figure 4 depicts the relation between $E (τ^{0, C})$ and ${Eff}_{\infty}$ , as f varies. As expected, the isolation measures become more effective as the time taken by the infection to be transmitted increases.

Fig. 4 — ${Eff}_{\infty}$ for some rescalings of the distribution of $τ^{0,C}$

Finally, Fig. 5 shows how ${Eff}_{\infty}$ changes slightly as we change the value of $R_{t}^{0}$ (for $t \geq 0$ ).

Reduction in $R_{t}$ in the case of app usage

Now, we focus on applying the model of Sect. 3 to study how $R_{t}$ is reduced when a fraction of the population uses an app for epidemic control.

In this case, we summarize the input parameters in the following table, fixing some of them to the given values for the rest of the section (unless explicitly mentioned).

Parameter	Meaning	Value
$s_{sym}^{s,app}$	Probability that a symptomatic infected individual using the app is notified of the infection because of their symptoms	Not fixed
$s^{s,no} {app}_{sym}$	As above, but for individuals without the app	0.2
$s_{asy}^{s,app}$ , $s^{s,no} {app}_{asy}$	As with the two parameters above, but for asymptomatic individuals	0, 0
$s^{c,app}$	Probability that an infected individual with the app is notified of the infection because of their source having tested positive	Not fixed
$s^{c,no} app$	Probability that an infected individual without the app is notified of the infection because of their source having tested positive	0.2
$ξ$	Probability that someone testing positive self-isolates	Not fixed
$Δ^{A \to T, app}$ , $Δ^{A \to T, no app}$	Time from notification to positive testing for people with and without the app, respectively	Constant distributions, whose values are not fixed at this moment
$ϵ_{t, app}$	Fraction of the population adopting the app at time t	Not fixed
$t_{0}$	Time at which isolation measures begin	0

Open in a new tab

Time evolution in an optimistic scenario

We start with an optimistic scenario, where the app is effective at recognizing infected individuals from symptoms and contact-tracing information. The internal predictive models that estimate the probability of an individual being infected have high efficiencies (a situation likely bound to the possibility of training the predictive models on real data, in practice). The app is adopted by a large fraction (60%) of the population, and is trusted, so that most of the people notified take a test and self-isolate. The app also helps a notified individual to get tested more quickly.17

Parameter	Value
$s_{sym}^{s,app}$	0.8
$s^{c,app}$	0.8
$ξ$	0.9
$ϵ_{app}$	0.6
$Δ^{A \to T, app}$	2
$Δ^{A \to T, no app}$	4

Open in a new tab

As we start from $R_{t}^{0} = 1$ , we reach a limit value of $R_{\infty} ≃ 0.84$ for an effectiveness of 0.16. Note also that $R_{\infty, app} ≃ 0.75$ , while $R_{\infty, no app} ≃ 0.97$ . The time evolution of $R_{t}$ , along with the other main quantities of interest, is shown in Fig. 6.

Fig. 6 — KPIs evolution in the optimistic scenario

Time evolution in a pessimistic scenario

We now run an analogous computation in a “pessimistic” scenario. The app can only recognize infected individuals from contact-tracing information, and not from symptoms ( $s_{sym}^{s,app}$ consequently defaults to the no-app value). In addition, we assume a low efficiency $s^{c,app} = 0.5$ , perhaps due to poor predictive models. Also, only 70% of those testing positive self-isolate.

Parameter	Value
$s_{sym}^{s,app}$	0.2
$s^{c,app}$	0.5
$ξ$	0.7
$ϵ_{app}$	0.6
$Δ^{A \to T, app}$	2
$Δ^{A \to T, no app}$	4

Open in a new tab

Even with a high app adoption rate (60% of the population), the effectiveness drops dramatically. We get $R_{\infty} ≃ 0.96$ and ${Eff}_{\infty} ≃ 0.06$ . Most notably, the app does not change things much with respect to “standard” isolation measures: $R_{\infty, app} ≃ 0.95$ and $R_{\infty, no app} ≃ 0.99$ .

Time evolution in the case of gradual adoption of the app

Now, we study the evolution of $R_{t}$ in a scenario whereby the fraction $ϵ_{t, app}$ of people using the app is not constant, but increasing in a linear fashion until it reaches 60% in 30 days:

\begin{matrix} ϵ_{t, app} = 0.6 t / 30 for 0 \leq t < 30, ϵ_{t, app} = 0.6 for t \geq 30 . \end{matrix}

The other parameters are chosen as in the optimistic scenario of Sect. 4.3.1:

Parameter	Value
$s_{sym}^{s,app}$	0.8
$s^{c,app}$	0.8
$ξ$	0.9
$Δ^{A \to T, app}$	2
$Δ^{A \to T, no app}$	4

Open in a new tab

As shown in Fig. 7, $R_{t}$ decreases until stabilizing again to the same value obtained in Sect. 4.3.1, although it takes more time to do so. The limit values of the KPIs are not changed by a gradual adoption of the app, compared with a prompt adoption.

Dependency of effectiveness on the efficiencies $s^{s}$ and $s^{c}$

We now focus on the study of how the limit values of the KPIs change when we vary certain parameters, starting with the app efficiencies $s_{sym}^{s,app}$ and $s^{c,app}$ . In Fig. 8, we plot ${Eff}_{\infty}$ as a function of these two parameters, while the others are fixed to the following values:

Parameter	Value
$ξ$	0.9
$ϵ_{app}$	0.6
$Δ^{A \to T, app}$	2
$Δ^{A \to T, no app}$	4

Open in a new tab

Fig. 8 — ${Eff}_{\infty}$ as a function of the efficiencies $s_{sym}^{s,app}$ and $s^{c,app}$

Dependency on the app adoption

In Fig. 9, we can observe the dependency of the effectiveness ${Eff}_{\infty}$ on the share $ϵ_{app}$ of the population using the app. The remaining parameters are fixed to these values:

Parameter	Value
$s_{sym}^{s,app}$	0.5
$s^{c,app}$	0.7
$ξ$	0.9
$Δ^{A \to T, app}$	2
$Δ^{A \to T, no app}$	4

Open in a new tab

Acknowledgements

We thank our colleagues Christy Keenan and Luca Ferrari for carefully proofing the manuscript and providing significant stylistic improvements. We are also grateful to Giorgio Guzzetta and the anonymous referees for several useful comments and suggestions about this study and its publication.

A Appendix: formalism and detailed derivations of mathematical results

The goal of this Appendix is to introduce a mathematical framework in which the hypotheses of our model can be formulated precisely and their consequences proven rigorously. In particular, we will derive the formula (2) describing the suppression of $R_{t}$ due to the testing and isolation policies, and the time evolution equation (6). Some more details about the two-component scenario of Sect. 3 are added.

A.1 Modeling a deterministic epidemics with a probability space

In this subsection, we define the fundamental components of our framework. $Ω$ denotes the set of all the individuals infected during the epidemic. It is endowed with two functions. First,

\begin{matrix} t^{I} : Ω \to [0, + \infty) \end{matrix}

associates to each individual $ω \in Ω$ the absolute time of their infection. Note that we take 0 as the initial time of the epidemics. $t^{I}$ partitions $Ω$ into a foliation

\begin{matrix} Ω = \cup_{t \in R} Ω_{t} : = \cup_{t \in R} {ω \in Ω | t^{I} (ω) = t} . \end{matrix}

We also write

\begin{matrix} Ω_{> 0} : = \cup_{t \in R^{+}} Ω_{t} \end{matrix}

for the set of individuals infected at a positive time.

Second, for any individual $ω \in Ω_{> 0}$ we denote by $σ (ω) \in Ω$ their infector. This defines a map

\begin{matrix} σ : Ω_{> 0} \to Ω . \end{matrix}

It is natural to assume that, for all $ω \in Ω_{> 0}$ ,

\begin{matrix} t^{I} (σ (ω)) < t^{I} (ω) . \end{matrix}

Fig. 10 — Schematic representation of $Ω$ , $t^{I}$ , and $σ$ . Dots represent individuals $ω \in Ω$

As is the case in the rest of the paper, we will consider other quantities referring to infected individuals and study the mathematical relations between them, with special attention to their average properties over all individuals infected at a given time t. Hence, we will study functions defined over $Ω$ , and to talk about their averages over each set $Ω_{t}$ we will introduce a probability measure P on $Ω$ . Since $Ω$ is finite and we want to weight all individuals equally, P is the uniform discrete probability measure:

\begin{matrix} P (E) = \frac{# E}{# Ω}, \end{matrix}

where $# E$ denotes the cardinality of a set $E \subset Ω$ . Relevant quantities then become random variables, and we are interested in studying their distributions. This always reduces to solving certain counting problems, and introducing P is largely a way to conveniently write formulas using the language of probability theory. Let us stress that our methodology does not involve any simulation of random processes: The history of the epidemic is completely determined by the triple $(Ω, t^{I}, σ)$ , and our study of its evolution consists of writing deterministic relations that express random variables restricted to a time slice $Ω_{t}$ in terms of random variables restricted to slices $Ω_{t^{'}}$ , for times $t^{'} < t$ .

The probability measure P “disintegrates” along $t_{I}$ , giving a uniform probability measure $P_{t}$ on each $Ω_{t}$ . Also, for any random variable $X : Ω \to R$ we will denote by

\begin{matrix} X_{t} : = X ↾_{Ω_{t}} \end{matrix}

its restriction to $Ω_{t}$ . We will be mostly interested in studying the distributions of such restrictions of random variables. In considering quantities like the expected value

\begin{matrix} E_{P_{t}} (X_{t}) = \int_{Ω_{t}} X d P_{t} = \frac{1}{# Ω_{t}} \sum_{ω \in Ω_{t}} X (ω) \end{matrix}

of $X_{t}$ , we will always make the relevant probability measure $P_{t}$ explicit to avoid confusion.

A.2 Infector–infectee pairs, generation time and the reproduction number

Let us introduce some additional notation, for future convenience: First, we define the set

\begin{matrix} \tilde{Ω} = {(ω^{'}, ω) \in Ω \times Ω_{> 0} | σ (ω) = ω^{'}} \end{matrix}

of infector–infectee pairs. This is the graph of the map $σ$ , which thus determines a bijection $Ω_{> 0} \to \tilde{Ω}$ . We also consider two functions describing the generation time: a function $τ^{C} : \tilde{Ω} \to R^{+}$ , given by

\begin{matrix} τ^{C} (ω^{'}, ω) : = t^{I} (ω) - t^{I} (ω^{'}), \end{matrix}

and a function $τ^{σ} : Ω_{> 0} \to R^{+}$ given by

\begin{matrix} τ^{σ} (ω) : = τ^{C} (σ (ω)) . \end{matrix}

Consider, for all $τ \in R^{+}$ , the random variable $n^{τ} : Ω \to N$ such that $n^{τ} (ω)$ is the number of people infected by $ω$ at $ω$ ’s infectious age $τ$ (that is, at the absolute time $t^{I} (ω) + τ$ ):

\begin{matrix} n^{τ} (ω) : = # {ω^{'} \in Ω_{> 0} | σ (ω^{'}) = ω, τ^{σ} (ω^{'}) = τ} . \end{matrix}

Also, for $τ \in [0, + \infty]$ , we denote by $N^{τ} : Ω \to N$ the random variable that counts all individuals infected within the infectious age $τ$ :

\begin{matrix} N^{τ} (ω) : = \sum_{τ^{'} \leq τ} n^{τ^{'}} (ω) = # {ω^{'} \in Ω_{> 0} | σ (ω^{'}) = ω, τ^{σ} (ω^{'}) \leq τ} . \end{matrix}

In particular, $N^{\infty} (ω)$ is the total number of people infected by $ω$ . The average values of these variables are key indicators of the speed of propagation of the epidemics. In particular, restricting to an absolute time t, we let

\begin{matrix} R_{t} : = E_{P_{t}} (N_{t}^{\infty}) \end{matrix}

be the effective reproduction number at t. Note that averaging on all infected individuals simply gives $E_{P} (N^{\infty}) = 1$ . We also consider the average values of $N_{t}^{τ}$ , for finite $τ$ :

\begin{matrix} B_{t} (τ) : = E_{P_{t}} (N_{t}^{τ}) . \end{matrix}

Notice that $B_{t}$ is an improper CDF supported on $R^{+}$ , and because of the finiteness of $Ω_{t}$ , it is a sum of finitely many step functions. It represents the cumulative infectiousness of individuals infected at t, and tends to $R_{t}$ for $τ \to + \infty$ .

For practical reasons, it is common to tacitly consider a continuum limit in which each $# Ω_{t}$ tends to infinity and all random variables become continuous. $B_{t}$ is then approximated by a smooth function, whose derivative is denoted by $β_{t}$ (as is the case in the rest of the paper):

\begin{matrix} B_{t} (τ) ≃ \int_{0}^{τ} β_{t} (τ^{'}) d τ^{'}, \end{matrix}

However, in this Appendix we always work in the discrete setting discussed so far, and then we consider the continuum limit only to get formulas for $β_{t}$ , for consistency with the standard terminology and notation. Using the formalism of measure theory, or simply writing the relations between random variables in terms of their CDFs, allows us to treat both the discrete scenario and the continuum limit in a unified notation.

A.3 The suppression formula

Here, we discuss in greater mathematical detail the content of Sect. 2.2. In particular, we will derive the suppression formula (Eq. 2) relating the reproduction number $R_{t}$ and the distribution of the random variable $τ^{T} : Ω \to [0, + \infty]$ describing the infectious age (possibly infinite) at which each individual is tested positive.

To do this, we introduce two additional random variables $n^{0, τ}, N^{0, τ} : Ω \to N$ which are analogues to $n^{τ}$ and $N^{τ}$ , but which instead count the number of individuals that each $ω \in Ω$ would have infected without isolation measures. Similarly, we denote by

\begin{matrix} B_{t}^{0} (τ) : = E_{P_{t}} (N_{t}^{0, τ}), R_{t}^{0} : = E_{P_{t}} (N_{t}^{0, \infty}), \end{matrix}

and $β_{t}^{0}$ the analogues of $B_{t}$ , $R_{t}$ and $β_{t}$ in the absence of isolation measures.

Recall that we assumed the average number of people infected by each individual $ω$ is reduced by a factor $1 - ξ_{t}$ (possibly depending on the infection time $t : = t^{I} (ω)$ ) at times $τ$ greater or equal than the testing time $τ^{T} (ω)$ . This is encoded by the following relation between the expected values of $n_{t}^{τ}$ and $n_{t}^{0, τ}$ conditioned by $τ_{t}^{T}$ : For all $τ, ρ \in [0, + \infty]$ , we postulate that

\begin{matrix} \begin{matrix} E_{P_{t}} (n_{t}^{τ} | τ_{t}^{T} = ρ) & = & (1 - ξ_{t} δ_{τ \geq ρ}) E_{P_{t}} (n_{t}^{0, τ} | τ_{t}^{T} = ρ) \\ = & \{\begin{matrix} E_{P_{t}} (n_{t}^{0, τ} | τ_{t}^{T} = ρ) & for τ < ρ \\ (1 - ξ_{t}) E_{P_{t}} (n_{t}^{0, τ} | τ_{t}^{T} = ρ) & for τ \geq ρ \end{matrix}), \end{matrix} \end{matrix}

where $δ_{τ \geq ρ} : = χ_{[ρ, + \infty]} (τ)$ is 1 when $τ \geq ρ$ and 0 otherwise. Now we would like to remove the conditioning on $τ_{t}^{T}$ from the expected values of $n_{t}^{0, τ}$ to get an expression in terms of known quantities only. If, for simplicity, we supposed that $τ^{T}$ and $n_{t}^{0, τ}$ are independent, then Eq. (12) would reduce to $(1 - ξ δ_{τ \geq ρ}) E_{P_{t}} (n_{t}^{0, τ})$ . But, as discussed in Sect. 2.2, this is not a realistic hypothesis. Instead, we only assume that $τ^{T}$ and $n_{t}^{0, τ}$ are independent when restricted to individuals having the same severity of illness, which we describe through a random variable

\begin{matrix} G : Ω \to R . \end{matrix}

In other words, we take $τ_{t}^{T}$ and $n_{t}^{0, τ}$ to be conditionally independent with respect to G.18 We assume now that the suppression formula applies equally to individuals of all degrees of severity:

\begin{matrix} E_{P_{t, g}} (n_{t, g}^{τ} | τ_{t, g}^{T} = ρ) = (1 - ξ_{t} δ_{τ \geq ρ}) E_{P_{t, g}} (n_{t, g}^{0, τ} | τ_{t, g}^{T} = ρ), \end{matrix}

having introduced

\begin{matrix} Ω_{t, g} : = {ω \in Ω_{t} | G (ω) = g} \end{matrix}

and the obvious notation for restrictions of random variables to $Ω_{t, g}$ and for the uniform probability measure $P_{t, g}$ on it. The assumption of conditional independence implies

\begin{matrix} E_{P_{t, g}} (n_{t, g}^{τ} | τ_{t, g}^{T} = ρ) = (1 - ξ_{t} δ_{τ \geq ρ}) E_{P_{t, g}} (n_{t, g}^{0, τ}) . \end{matrix}

Summing over $ρ \in [0, + \infty]$ we find

\begin{matrix} \begin{matrix} E_{P_{t, g}} (n_{t, g}^{τ}) & = & \sum_{ρ \in [0, + \infty]} P_{t, g} (τ_{t, g}^{T} = ρ) E_{P_{t, g}} (n_{t, g}^{τ} | τ_{t, g}^{T} = ρ) \\ = & E_{P_{t, g}} (n_{t, g}^{0, τ}) \sum_{ρ \in [0, + \infty]} (1 - ξ_{t} δ_{τ \geq ρ}) P_{t, g} (τ_{t, g}^{T} = ρ) \\ = & E_{P_{t, g}} (n_{t, g}^{0, τ}) (1 - ξ_{t} F_{t, g}^{T} (τ)), \end{matrix} \end{matrix}

having denoted the improper CDF of $τ_{t, g}^{T}$ by $F_{t, g}^{T}$ , as usual.

Finally, to average over all $Ω_{t}$ , we simply notice that

\begin{matrix} E_{P_{t}} (n_{t}^{τ}) = \sum_{g} p_{t, g} E_{P_{t, g}} (n_{t, g}^{τ}), \end{matrix}

where $p_{t, g} : = P_{t} (G_{t} = g)$ .19 On the other hand, summing over $τ$ gives a relation between $B_{t}$ and $B_{t, g}^{0} = E_{P_{t, g}} (N_{t, g}^{0, τ})$ :

\begin{matrix} \begin{matrix} B_{t} (τ) & = & \sum_{g} p_{t, g} \sum_{τ^{'} \leq τ} E_{P_{t, g}} (n_{t, g}^{τ}) \\ = & \sum_{g} p_{t, g} \int_{(0, τ]} (1 - ξ_{t} F_{t, g}^{T} (τ^{'})) d B_{t, g}^{0} (τ^{'}) . \end{matrix} \end{matrix}

Then, we can sum up the content of this subsection as follows:

Proposition 1

Take $t \in [0, + \infty)$ . Assuming the suppression hypothesis (13) and the conditional independence of $τ^{T}$ and $n_{t}^{0, τ}$ with respect to G, we have

\begin{matrix} \begin{matrix} E_{P_{t}} (n_{t}^{τ}) & = & \sum_{g} p_{t, g} E_{P_{t, g}} (n_{t, g}^{τ}) \\ = & \sum_{g} p_{t, g} E_{P_{t, g}} (n_{t, g}^{0, τ}) (1 - ξ_{t} F_{t, g}^{T} (τ)), \end{matrix} \end{matrix}

where $p_{t, g} : = P_{t} (G_{t} = g)$ . Moreover,

\begin{matrix} R_{t} = \sum_{g \in R} p_{t, g} R_{t, g} = \sum_{g} p_{t, g} \int_{R^{+}} (1 - ξ_{t} F_{t, g}^{T} (τ)) d B_{t, g}^{0} (τ) . \end{matrix}

Note that taking the continuum limit of Eq. (15) we retrieve Eq. (2):

\begin{matrix} β_{t} (τ) = \sum_{g} p_{t, g} β_{t, g} (τ) = \sum_{g} p_{t, g} (1 - ξ_{t} F_{t, g}^{T} (τ)) β_{t, g}^{0} (τ) . \end{matrix}

We conclude this subsection by noting, for future convenience, that we can rewrite the suppression hypothesis without referring to $n^{0, τ}$ :

\begin{matrix} E_{P_{t, g}} (n_{t, g}^{τ} | τ_{t, g}^{T} = ρ) = \frac{1 - ξ_{t} δ_{τ \geq ρ}}{1 - ξ_{t} F_{t, g}^{T} (τ)} E_{P_{t, g}} (n_{t, g}^{τ}) . \end{matrix}

A.4 Random variables technology

Given a random variable $X : Ω \to R$ , it is natural to consider the composition

\begin{matrix} \hat{X} : = X \circ σ . \end{matrix}

The main use case of this is when X represents the time at which some event related to an individual happens. For example, the infectious age $τ^{T}$ at which they get tested. In this case, $P_{t} ({\hat{τ}}_{t}^{T} = τ)$ is the probability that, given an individual infected at t, their infector is tested at the infector’s infectious age $τ$ .

In fact, we are often more interested in a slightly different distribution, namely that of the random variable $\overset{ˇ}{X}$ defined by

\begin{matrix} \overset{ˇ}{X} (ω) : = X (σ (ω)) - τ^{σ} (ω) . \end{matrix}

When $X = τ^{T}$ , then $P_{t} ({\overset{ˇ}{τ}}_{t}^{T} = τ)$ is now the probability that, given an individual infected at t, their source is tested at the individual’s infectious age $τ$ .

The next Proposition relates the distributions of ${\hat{X}}_{t}$ , ${\overset{ˇ}{X}}_{t}$ , and $X_{t}$ :

Proposition 2

Take $X : Ω \to R$ . For all $x \in R$ , we have

\begin{matrix} \begin{matrix} P_{t} ({\hat{X}}_{t} = x) = \sum_{τ \in R^{+}} P_{t - τ} (X_{t - τ} = x) E_{P_{t - τ}} (n_{t - τ}^{τ} | X_{t - τ} = x) \frac{# Ω_{t - τ}}{# Ω_{t}}, \\ P_{t} ({\overset{ˇ}{X}}_{t} = x) = \sum_{τ \in R^{+}} P_{t - τ} (X_{t - τ} = x + τ) E_{P_{t - τ}} (n_{t - τ}^{τ} | X_{t - τ} = x + τ) \frac{# Ω_{t - τ}}{# Ω_{t}} . \end{matrix} \end{matrix}

Proof

We prove both formulas at once by defining $X^{α} : Ω \to R$ as

\begin{matrix} X^{α} (ω) : = X (σ (ω)) - α τ^{σ} (ω) \end{matrix}

for $α \in {0, 1}$ , so that $X^{0} = \hat{X}$ and $X^{1} = \overset{ˇ}{X}$ .

\begin{matrix} # Ω_{t} P_{t} (X_{t}^{α} = x) = # {ω \in Ω_{t} | X (σ (ω)) = x + α τ^{σ} (ω)} \\ = # {(ω^{'}, ω) \in \tilde{Ω} | ω \in Ω_{t}, X (ω^{'}) = x + α τ^{C} (ω^{'}, ω)} \\ = \sum_{τ \in R^{+}} # {(ω^{'}, ω) \in \tilde{Ω} | ω^{'} \in Ω_{t - τ}, X (ω^{'}) = x + α τ, ω \in Ω_{t}} \\ = \sum_{τ \in R^{+}} \sum_{ω^{'} \in Ω_{t - τ} | X (ω^{'}) = x + α τ} n_{t - τ}^{τ} (ω^{'}) \\ = \sum_{τ \in R^{+}} E_{P_{t - τ}} (n_{t - τ}^{τ} | X_{t - τ} = x + α τ) # {ω^{'} \in Ω_{t - τ} | X (ω^{'}) = x + α τ} \\ = \sum_{τ \in R^{+}} E_{P_{t - τ}} (n_{t - τ}^{τ} | X_{t - τ} = x + α τ) P_{t - τ} (X_{t - τ} = x + α τ) # Ω_{t - τ} . \end{matrix}

$□$

Notice that we can also break down the right hand side of the formulae of Prop. 2 by the values of G. In particular, the second equation can be rewritten as

\begin{matrix} P_{t} ({\overset{ˇ}{X}}_{t} = x) = & \sum_{τ \in R^{+}} \sum_{g \in R} P_{t - τ, g} (X_{t - τ, g} = x + α τ) E_{P_{t - τ, g}} (n_{t - τ, g}^{τ} | X_{t - τ, g} \\ = & x + α τ) \frac{# Ω_{t - τ, g}}{# Ω_{t}} . \end{matrix}

A.5 Remarks on generation time and numbers of infected individuals

In this subsection, we study the distribution of the generation time. First, we do this considering it as a function $τ^{C} : \tilde{Ω} \to R^{+}$ of infector–infectee pairs, and in particular taking its restriction ${\tilde{τ}}_{t}^{C}$ to the set

\begin{matrix} {\tilde{Ω}}_{t} : = {(ω^{'}, ω) \in \tilde{Ω} | ω^{'} \in Ω_{t}} . \end{matrix}

As usual, on ${\tilde{Ω}}_{t}$ we put the uniform probability measure, denoted by ${\tilde{P}}_{t}$ . In this way, we find the intuitive fact that the distribution of $τ^{C}$ restricted to ${\tilde{Ω}}_{t}$ is just the normalization of the infectiousness:

Proposition 3

The CDF of the random variable ${\tilde{τ}}_{t}^{C} : {\tilde{Ω}}_{t} \to R^{+}$ is given by

\begin{matrix} {\tilde{F}}_{t}^{C} (τ) = \frac{B_{t} (τ)}{R_{t}} . \end{matrix}

Proof

\begin{matrix} \begin{matrix} {\tilde{F}}_{t}^{C} (τ) & = & {\tilde{P}}_{t} ({\tilde{τ}}_{t}^{C} \leq τ) \\ = & \frac{1}{# {\tilde{Ω}}_{t}} # {(ω^{'}, ω) \in {\tilde{Ω}}_{t} | τ^{C} (ω^{'}, ω) \leq τ} \\ = & \frac{1}{# {\tilde{Ω}}_{t}} \sum_{ω^{'} \in Ω_{t}} N_{t}^{τ} (ω^{'}) \\ = & \frac{# Ω_{t}}{# {\tilde{Ω}}_{t}} E_{P_{t}} (N_{t}^{τ}) \\ = & \frac{# Ω_{t}}{# {\tilde{Ω}}_{t}} B_{t} (τ) . \end{matrix} \end{matrix}

The limit $τ \to + \infty$ gives

\begin{matrix} # {\tilde{Ω}}_{t} = # Ω_{t} E_{P_{t}} (N_{t}^{\infty}) = # Ω_{t} R_{t}, \end{matrix}

and the claim immediately follows. $□$

Next, we focus on the generation time as a function $τ_{t}^{σ} : Ω_{t} \to R^{+}$ of the infectee. Its probability distribution is given by the formula

\begin{matrix} P_{t} (τ_{t}^{σ} = τ) = \frac{# Ω_{t - τ}}{# Ω_{t}} E_{P_{t - τ}} (n_{t - τ}^{τ}), \end{matrix}

which follows from the definitions. Summing the left-hand side over all $τ > 0$ gives 1, from which we find how to compute $# Ω_{t}$ in terms of quantities relative to previous times:

\begin{matrix} # Ω_{t} = \sum_{τ \in R^{+}} # Ω_{t - τ} E_{P_{t - τ}} (n_{t - τ}^{τ}) . \end{matrix}

The last two formulae are easily proven, and the first is an immediate consequence of the next Proposition, which describes the joint probability distribution of $τ_{t}^{σ}$ and ${\hat{G}}_{t} = {(G \circ σ)}_{t}$ :

Proposition 4

\begin{matrix} P_{t} (τ_{t}^{σ} = τ, {\hat{G}}_{t} = g) = E_{P_{t - τ, g}} (n_{t - τ, g}^{τ}) \frac{# Ω_{t - τ, g}}{# Ω_{t}} . \end{matrix}

This is the probability that the infector of someone infected at t was infected at $t - τ$ and had severity g.

Proof

\begin{matrix} \begin{matrix} # Ω_{t} P_{t} (τ_{t}^{σ} = τ, {\hat{G}}_{t} = g) & = & # {ω \in Ω_{t} | τ^{σ} (ω) = τ, \hat{G} (ω) = g} \\ = & # {(ω^{'}, ω) \in \tilde{Ω} | ω \in Ω_{t}, ω^{'} \in Ω_{t - τ}, G (ω^{'}) = g)} \\ = & \sum_{ω^{'} \in Ω_{t - τ, g}} n^{τ} (ω^{'}) \\ = & # Ω_{t - τ, g} E_{P_{t - τ, g}} (n_{t - τ, g}^{τ}) . \end{matrix} \end{matrix}

$□$

This formula for the joint probability measure will be used in the next subsection, where we will often use it in integrals. Given that in this paper we always consider G to have a given discrete range, while $τ^{σ}$ becomes continuous in the continuum limit, we will preferably write these integrals with respect to the improper CDFs

\begin{matrix} F_{t}^{σ, g} (τ) : = P_{t} (τ_{t}^{σ} \leq τ, {\hat{G}}_{t} = g) = \sum_{τ^{'} \leq τ} E_{P_{t - τ^{'}, g}} (n_{t - τ^{'}, g}^{τ^{'}}) \frac{# Ω_{t - τ^{'}, g}}{# Ω_{t}} . \end{matrix}

A.6 Time evolution

As we saw in Sect. 2.3, the key step to determining the time evolution of the system is writing the distribution of the notification time $τ_{t}^{A,c}$ in terms of that of the testing time $τ_{t^{'}}^{T}$ , for $t^{'} < t$ . Our assumption is that any infected individual $ω \in Ω_{t}$ is notified precisely at the testing time of their infector $σ (ω)$ with a certain probability $s_{t}^{c}$ , provided that such testing time follows the infection time of $ω$ (otherwise, $ω$ is never notified). Referring these instants to the infectee’s infectious age, we get that $τ^{A,c} (ω)$ is equal to

\begin{matrix} {\overset{ˇ}{τ}}^{T} (ω) = τ^{T} (σ (ω)) - τ^{σ} (ω) \end{matrix}

with probability $s_{t}^{c}$ in case it is a positive number, and $τ^{A,c} (ω) = + \infty$ in the remaining cases. This can be written synthetically as

\begin{matrix} F_{t}^{A,c} (τ) = s_{t}^{c} P_{t} ({\overset{ˇ}{τ}}_{t}^{T} \in R^{+}) = s_{t}^{c} ({\overset{ˇ}{F}}_{t}^{T} (τ) - {\overset{ˇ}{F}}_{t}^{T} (0)), \end{matrix}

where $F_{t}^{A,c}$ and ${\overset{ˇ}{F}}_{t}^{T}$ are the improper CDFs of $τ_{t}^{A,c}$ and ${\overset{ˇ}{τ}}_{t}^{T}$ , respectively.

Thus, our goal reduces to computing the distribution of ${\overset{ˇ}{τ}}^{T}$ , the (possibly negative) time elapsed from the infectee’s contagion to the infector’s testing. Applying Eq. (17) to $X = τ^{T}$ and using the suppression formula (16) we get, for any $ρ \in (- \infty, + \infty]$ ,

\begin{matrix} P_{t} ({\overset{ˇ}{τ}}_{t}^{T} = ρ) = & \sum_{τ \in R^{+}} \sum_{g \in R} P_{t - τ, g} (τ_{t - τ, g}^{T} = ρ + τ) E_{P_{t - τ, g}} (n_{t - τ, g}^{τ} | τ^{T} \\ = ρ + τ) \frac{# Ω_{t - τ, g}}{# Ω_{t}} \\ = & \sum_{g \in R} \sum_{τ \in R^{+}} P_{t - τ, g} (τ_{t - τ, g}^{T} \\ = ρ + τ) \frac{1 - ξ_{t - τ} δ_{τ \geq ρ + τ}}{1 - ξ_{t - τ} F_{t - τ, g}^{T} (τ)} E_{P_{t - τ, g}} (n_{t - τ, g}^{τ}) \frac{# Ω_{t - τ, g}}{# Ω_{t}} \\ = & \sum_{g \in R} \int_{R^{+}} P_{t - τ, g} (τ_{t - τ, g}^{T} = ρ + τ) \frac{1 - ξ_{t - τ} δ_{ρ \leq 0}}{1 - ξ_{t - τ} F_{t - τ, g}^{T} (τ)} d F_{t}^{σ, g} (τ) . \end{matrix}

Notice that in the last line we used the improper CDFs $F_{t}^{σ, g}$ introduced in Eq. (18).

Let us try to interpret this formula. The probability distribution of ${\overset{ˇ}{τ}}^{T}$ is obtained by averaging the distributions of $τ_{t^{'}, g}^{T}$ for all $t^{'} = t - τ < t$ and all g, each shifted by $τ$ to the left to account for the switch from the infector’s to the infectee’s infectious age. This averaging is done by integrating over all $t^{'}$ and g with respect to the joint distribution of the generation time $τ^{σ}$ and the infector’s severity $\hat{G}$ . But a correction factor (the fraction) appears in the integral, as the fact that the infector infects at relative time $τ$ and has severity g conditions the distribution of $τ_{t - τ, g}^{T}$ , by shifting it toward values greater than $τ$ . Indeed, the correction factor is greater than 1 for $ρ > 0$ and less than 1 otherwise. This means that, compared to a hypothetical case in which the testing time and the infectiousness are independent (which happens when $ξ$ is constantly zero), the probability $P_{t} ({\overset{ˇ}{τ}}_{t}^{T} = ρ)$ is higher after the contagion time (i.e., when $ρ > 0$ ) and lower before.

It follows now that the improper CDF of ${\overset{ˇ}{τ}}_{t}^{T}$ reads

\begin{matrix} {\overset{ˇ}{F}}_{t}^{T} (ρ) = \{\begin{matrix} \sum_{g \in R} \int_{R^{+}} F_{t - τ, g}^{T} (ρ + τ) \frac{1 - ξ_{t - τ}}{1 - ξ_{t - τ} F_{t - τ, g}^{T} (τ)} d F_{t}^{σ, g} (τ) & for ρ \leq 0, \\ {\overset{ˇ}{F}}_{t}^{T} (0) + \sum_{g \in R} \int_{R^{+}} \frac{F_{t - τ, g}^{T} (ρ + τ) - F_{t - τ, g}^{T} (τ)}{1 - ξ_{t - τ} F_{t - τ, g}^{T} (τ)} d F_{t}^{σ, g} (τ) & for 0 < ρ < + \infty . \end{matrix}) \end{matrix}

We only have to replace this equation in (19) to get the time evolution formula:

Proposition 5

Assuming the suppression hypothesis (13), the conditional independence of $τ^{T}$ and $n_{t}^{0, τ}$ with respect to G, and the notification hypothesis (19), we have

\begin{matrix} F_{t}^{A,c} (ρ) = s_{t}^{c} \sum_{g \in R} \int_{R^{+}} \frac{F_{t - τ, g}^{T} (ρ + τ) - F_{t - τ, g}^{T} (τ)}{1 - ξ_{t - τ} F_{t - τ, g}^{T} (τ)} d F_{t}^{σ, g} (τ) \end{matrix}

for $ρ > 0$ and $F_{t}^{A,c} (ρ) = 0$ otherwise.

A.7 Modifications in the case of use of a contact tracing app

The inhomogeneity in the population due to the use of a contact tracing app by a part of it can be partly addressed in an analogous way to the inhomogeneity due to different degrees of severity of the illness. Namely, we introduce a new random variable

\begin{matrix} A : Ω \to {app, no app} \end{matrix}

whose value determines whether or not an individual $ω \in Ω$ has the app. We assume that whether or not an individual has the app is independent of both their severity and their infectiousness in the absence of measures. In other words, A is independent of G and $n^{0, τ}$ , for all $τ$ . On the other hand, the infectiousness (in presence of measures) and the testing time of an individual will be different depending on whether or not they use the app.

A further partitions $Ω$ : we write

\begin{matrix} Ω_{t, g, a} : = {ω \in Ω_{t} | G (ω) = g, A (ω) = a}, n_{t, g, a}^{τ} : = n^{τ} ↾_{Ω_{t, g, a}}, \end{matrix}

and so on. The content of Sect. A.3 fully applies to this scenario, but we want now to have formulae conditioned on A. As A and $n^{0, τ}$ are independent, the previous formulae simply become

\begin{matrix} E_{P_{t, g, a}} (n_{t, g, a}^{τ}) = E_{P_{t, g}} (n_{t, g}^{0, τ}) (1 - ξ_{t} F_{t, g, a}^{T} (τ)) \end{matrix}

and

\begin{matrix} \begin{matrix} E_{P_{t, g, a}} (n_{t, g, a}^{τ} | τ_{t, g, a}^{T} = ρ) & = & (1 - ξ_{t} δ_{τ \geq ρ}) E_{P_{t, g}} (n_{t, g}^{0, τ}) \\ = & \frac{1 - ξ_{t} δ_{τ \geq ρ}}{1 - ξ_{t} F_{t, g, a}^{T} (τ)} E_{P_{t, g, a}} (n_{t, g, a}^{τ}) . \end{matrix} \end{matrix}

The suppression formula for $R_{t}$ can be broken down to

\begin{matrix} \begin{matrix} R_{t} & = & \sum_{g \in R} \sum_{a} p_{t, g} ϵ_{t, a} R_{t, g, a} \\ = & \sum_{g} \sum_{a} p_{t, g} ϵ_{t, a} \int_{R^{+}} (1 - ξ_{t} F_{t, g, a}^{T} (τ)) d B_{t, g}^{0} (τ), \end{matrix} \end{matrix}

where $ϵ_{t, a} : = P_{t} (A_{t} = a)$ .

The time evolution equation has to be treated differently, as the receipt of the notification depends on whether both the infector and the infectee use the app.

Let ${\overset{ˇ}{F}}_{t}^{T, a} (τ)$ denote the probability that, given an individual $ω \in Ω_{t}$ , the infector $σ (ω)$ is tested at a time $\leq t + τ$ and we have $\hat{A} (ω) = A (σ (ω)) = a$ .

According to our assumptions, given an infection occurred at t, the probability that the infector notifies the infectee when they test positive (provided that this happens after the infection) is $s_{t}^{c,app}$ in the case that both individuals have the app, and is $s^{c,no} {app}_{t}$ otherwise. Therefore, the contact tracing hypothesis (19) is now replaced by the following expressions for the CDFs of the time of the notification received by an individual with or without the app, respectively:

\begin{matrix} \begin{matrix} F_{t, app}^{A,c} (ρ) & = & s_{t}^{c,app} ({\overset{ˇ}{F}}_{t}^{T,app} (ρ) - {\overset{ˇ}{F}}_{t}^{T,app} (0)) + s^{c,no} {app}_{t} ({\overset{ˇ}{F}}_{t}^{T,no} app (ρ) - {\overset{ˇ}{F}}_{t}^{T,no} app (0)), \\ F_{t, no app}^{A,c} (ρ) & = & s^{c,no} {app}_{t} ({\overset{ˇ}{F}}_{t}^{T,app} (ρ) - {\overset{ˇ}{F}}_{t}^{T,app} (0) + {\overset{ˇ}{F}}_{t}^{T,no} app (ρ) - {\overset{ˇ}{F}}_{t}^{T,no} app (0)) \\ = & s^{c,no} {app}_{t} ({\overset{ˇ}{F}}_{t}^{T} (ρ) - {\overset{ˇ}{F}}_{t}^{T} (0)) . \end{matrix} \end{matrix}

Now, the improper CDFs ${\overset{ˇ}{F}}_{t}^{T, a}$ can be computed just as before, simply treating the conditioning on $\hat{A}$ as we treated the conditioning on $\hat{G}$ :

\begin{matrix} \begin{matrix} P_{t} ({\overset{ˇ}{τ}}_{t}^{T} = ρ, {\hat{G}}_{t} = g, {\hat{A}}_{t} = a) \\ = \sum_{τ > 0} P_{t - τ, g, a} (τ_{t - τ, g, a}^{T} = ρ + τ) E_{P_{t - τ, g, a}} (n_{t - τ, g, a}^{τ} | τ^{T} = ρ + τ) \frac{# Ω_{t - τ, g, a}}{# Ω_{t}} \\ = \int_{R^{+}} P_{t - τ, g, a} (τ_{t - τ, g, a}^{T} = ρ + τ) E_{P_{t - τ, g, a}} (n_{t - τ, g, a}^{τ} | τ^{T} = ρ + τ) d F_{t}^{σ, g, a} (τ), \end{matrix} \end{matrix}

where we defined $F_{t}^{σ, g, a}$ as follows, proceeding like in Sect. A.5 to compute the joint distribution of $τ_{t}^{σ}$ , ${\hat{G}}_{t}$ , and ${\hat{A}}_{t}$ :

\begin{matrix} F_{t}^{σ, g, a} (τ) : = P_{t} (τ_{t}^{σ} \leq τ, {\hat{G}}_{t} = g, {\hat{A}}_{t} = a) = \sum_{τ^{'} \leq τ} E_{P_{t - τ^{'}, g, a}} (n_{t - τ^{'}, g, a}^{τ^{'}}) \frac{# Ω_{t - τ^{'}, g, a}}{# Ω_{t}} . \end{matrix}

It is worth noting that comparing this equation with (18) we get

\begin{matrix} \frac{d F_{t}^{σ, g, a} (τ)}{1 - ξ_{t - τ} F_{t - τ, g, a}^{T}} = ϵ_{t, a} \frac{d F_{t}^{σ, g} (τ)}{1 - ξ_{t - τ} F_{t - τ, g}^{T}} . \end{matrix}

Replacing the suppression formula (20) in (22) and summing over $ρ$ , we end up with

\begin{matrix} {\overset{ˇ}{F}}_{t}^{T,a} (ρ) - {\overset{ˇ}{F}}_{t}^{T,a} (0) = \sum_{g} \int_{R^{+}} \frac{F_{t - τ, g, a}^{T} (ρ + τ) - F_{t - τ, g, a}^{T} (τ)}{1 - ξ_{t - τ} F_{t - τ, g, a}^{T} (τ)} d F_{t}^{σ, g, a} (τ) \end{matrix}

for $ρ > 0$ . Plugging this into (21) gives us $F_{t, a}^{A,c}$ in terms of $F_{t}^{σ, g, a}$ and $F_{t^{'}, g, a}^{T}$ for $t^{'} < t$ , that is the time evolution equation for the scenario with app usage. For $a = app$ , this is Eq. (9), while for $a = no app$ it simplifies to Eq. (10), since when the infectee doesn’t have the app it is irrelevant whether or not the infector has the app. This is evident from the last line of Eq. (21), which in fact could also have been used, together with the expression for ${\overset{ˇ}{F}}_{t}^{T}$ derived in Sect. A.6, to get Eq. (10). Using Eq. (24), it can be checked immediately that the two approaches give the same result.

Author Contributions

M.M. and M.R. conceived the study and its main ideas; A.M. developed the mathematical model; A.M. and M.M. developed the Python repository with the calculations and wrote the manuscript.

Declarations

Conflict of Interest

This work was supported by the authors’ employer, Bending Spoons S.p.A, which was involved in the development of the contact tracing app adopted by the Italian Government.

Footnotes

$R_{t}^{0}$ must not be confused with the basic reproduction number $R_{0}$ .

This is better explained in Sect. A.5.

We stress that the time evolution of $R_{t}^{0}$ , describing how the epidemics would have evolved without the measures we are modeling, is taken as known—our goal is to study the relative impact of the measures. In particular, we do not take into account possible second-order effects on $R_{t}^{0}$ , such as general changes in the behavior of the population, that may come as a consequence of the measures and their impact.

⁴

Recall that our model simultaneously includes the effect of isolating infected individuals recognized either through their symptoms or through contact tracing. However, as becomes apparent in the examples of Sect. 4, we will immediately be able to single out the additional impact of contact tracing only, for example.

⁵

This and the following assumption of immediate notification simplify the treatment to the extent that they avoid adding further distributions modeling some delays. In fact, they are not essential hypotheses, and such real-world delays could be also taken into account in the current setting, by including them in the distribution of the time $Δ^{A \to T}$ between notification and test, introduced below.

⁶

This typically involves modeling the generation time distribution as constant in time, which is an assumption we do not make. See Cori et al. (2013) for a detailed account of this matter.

⁷

To be trusted by its users, the app should also aim at reducing the fraction of false-positives. This is something that our study does not consider.

⁸

Equivalently, this hypothesis could be viewed as the assumption that an individual who tests positive either self-isolates completely, without infecting anyone else from that moment, or, alternatively, does nothing, with the first circumstance happening with probability $ξ_{t}$ .

⁹

Note that, according to our convention, $R_{t}$ is the weighted average of its components $R_{t, g}$ . Often, in the literature (e.g. in Ferretti et al. 2020) a different convention is used, according to which the components sum to $R_{t}$ . To switch to the latter convention, each $R_{t, g}$ should be divided by the respective probability $p_{g}$ .

¹⁰

In Sect. 4 we take $F^{S}$ to be a log-normal distribution, following the literature.

¹¹

For simplicity, we use a unique distribution $F^{S}$ for all degrees of severity. For asymptomatic individuals, $s_{t, g}^{s}$ would be equal to 0.

¹²

Note that this doesn’t prevent us from modeling isolation measures gradually put into place, which can be done simply by taking these parameters to be continuous in t.

¹³

In this discrete setting, all the integrals appearing in the equations reduce to finite sums. In fact, our approach in the Appendix is to derive the same equations starting from a discrete probabilistic model.

¹⁴

Note that this does not include pre-symptomatic transmission, which is taken into account within the group $g = sym$ .

¹⁵

This dependency is due to higher order effects: A higher $R_{t}^{0}$ means that the distribution of $τ_{t}^{σ}$ is more concentrated on small values, and hence the most recent testing time distributions have a greater weight in the time-evolution equation (6).

¹⁶

In all the examples these parameters are constant in time. Hence we remove the subscript t from them.

¹⁷

Studies such as Li et al. (2020) report that the time from symptom onset to testing through the “conventional” channels (health care system) is in the order of several days. An app is expected to have substantial chances to improve this performance, being a prompt-instrument by construction (for example, when compared with the friction of calling a doctor, inserting symptom descriptions into the app is likely easier), so that

\begin{matrix} Δ^{A \to T, app} < Δ^{A \to T, no app} . \end{matrix}

¹⁸

Remember that in this work, the joint distribution of $n_{t}^{0, τ}$ and G is assumed known, and we want to study how our assumptions on the isolation measures determine the distribution of $n_{t}^{τ}$ , and hence $R_{t}$ .

¹⁹

Note that, for simplicity, in the rest of the paper we took the distribution of G independent of absolute time, and wrote $p_{g} = p_{t, g}$ .

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Andrea Maiorana, Email: anm@bendingspoons.com.

Marco Meneghelli, Email: mm@bendingspoons.com.

Mario Resnati, Email: mr@bendingspoons.com.

References

Bendavid E, Mulaney B, Sood N, Shah S, Bromley-Dulfano R, Lai C, Weissberg Z, Saavedra-Walker R, Tedrow J, Bogan A, Kupiec T, Eichner D, Gupta R, Ioannidis JPA, Bhattacharya J (2021) COVID-19 antibody seroprevalence in Santa Clara County, California. Int J Epidemiol. 10.1093/ije/dyab010 [DOI] [PMC free article] [PubMed]
Cori A, Ferguson NM, Fraser C, Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. Am J Epidemiol. 2013;178(9):1505–1512. doi: 10.1093/aje/kwt133. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ferretti L, Wymant C, Kendall M, Zhao L, Nurtay A, Abeler-Dörner L, Parker M, Bonsall D, Fraser C. Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science. 2020 doi: 10.1126/science.abb6936. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fraser C, Riley S, Anderson RM, Ferguson NM. Factors that make an infectious disease outbreak controllable. Proc Natl Acad Sci. 2004;101(16):6146–6151. doi: 10.1073/pnas.0307506101. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, Ren R, Leung KS, Lau EH, Wong JY, et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N Engl J Med. 2020 doi: 10.1056/NEJMoa2001316. [DOI] [PMC free article] [PubMed] [Google Scholar]
Maiorana A, Meneghelli M (2021) Epidemics suppression model. https://github.com/BendingSpoons/epidemic-suppression-model
Mizumoto K, Kagaya K, Zarebski A, Chowell G. Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship, Yokohama, Japan, 2020. Eurosurveillance. 2020;25(10):2000180. doi: 10.2807/1560-7917.ES.2020.25.10.2000180. [DOI] [PMC free article] [PubMed] [Google Scholar]
Müller J, Kretzschmar M, Dietz K. Contact tracing in stochastic and deterministic epidemic models. Math Biosci. 2000;164(1):39–64. doi: 10.1016/S0025-5564(99)00061-9. [DOI] [PubMed] [Google Scholar]
Pathogen Dynamics Group OU Big Data Institute (2020) https://github.com/BDI-pathogens/OpenABM-Covid19
Scarabel F, Pellis L, Ogden NH, Wu J (2021) A renewal equation model to assess roles and limitations of contact tracing for disease outbreak control. R Soc Open Sci 8:202091. 10.1098/rsos.202091 [DOI] [PMC free article] [PubMed]

[CR1] Bendavid E, Mulaney B, Sood N, Shah S, Bromley-Dulfano R, Lai C, Weissberg Z, Saavedra-Walker R, Tedrow J, Bogan A, Kupiec T, Eichner D, Gupta R, Ioannidis JPA, Bhattacharya J (2021) COVID-19 antibody seroprevalence in Santa Clara County, California. Int J Epidemiol. 10.1093/ije/dyab010 [DOI] [PMC free article] [PubMed]

[CR2] Cori A, Ferguson NM, Fraser C, Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. Am J Epidemiol. 2013;178(9):1505–1512. doi: 10.1093/aje/kwt133. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] Ferretti L, Wymant C, Kendall M, Zhao L, Nurtay A, Abeler-Dörner L, Parker M, Bonsall D, Fraser C. Quantifying SARS-CoV-2 transmission suggests epidemic control with digital contact tracing. Science. 2020 doi: 10.1126/science.abb6936. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR4] Fraser C, Riley S, Anderson RM, Ferguson NM. Factors that make an infectious disease outbreak controllable. Proc Natl Acad Sci. 2004;101(16):6146–6151. doi: 10.1073/pnas.0307506101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, Ren R, Leung KS, Lau EH, Wong JY, et al. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N Engl J Med. 2020 doi: 10.1056/NEJMoa2001316. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] Maiorana A, Meneghelli M (2021) Epidemics suppression model. https://github.com/BendingSpoons/epidemic-suppression-model

[CR7] Mizumoto K, Kagaya K, Zarebski A, Chowell G. Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship, Yokohama, Japan, 2020. Eurosurveillance. 2020;25(10):2000180. doi: 10.2807/1560-7917.ES.2020.25.10.2000180. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] Müller J, Kretzschmar M, Dietz K. Contact tracing in stochastic and deterministic epidemic models. Math Biosci. 2000;164(1):39–64. doi: 10.1016/S0025-5564(99)00061-9. [DOI] [PubMed] [Google Scholar]

[CR9] Pathogen Dynamics Group OU Big Data Institute (2020) https://github.com/BDI-pathogens/OpenABM-Covid19

[CR10] Scarabel F, Pellis L, Ogden NH, Wu J (2021) A renewal equation model to assess roles and limitations of contact tracing for disease outbreak control. R Soc Open Sci 8:202091. 10.1098/rsos.202091 [DOI] [PMC free article] [PubMed]

PERMALINK

Effectiveness of isolation measures with app support to contain COVID-19 epidemics: a parametric approach

Andrea Maiorana

Marco Meneghelli

Mario Resnati

Abstract

Introduction

Main concepts and goals

The assumptions of the model and outline of the paper

Discussion of the results

Fig. 2.

Fig. 9.

The mathematical model in the homogeneous population setting

Notations and conventions

The suppression model for Rt

First considerations on the variables τS, τtA, and τtT

Describing τtA

Summary and discrete-time algorithm

The extended model including the use of an app for epidemic suppression

Parameters and random variables in the two-component model

The mathematical relations between the random variables

Scenarios and calculations

General considerations

Reduction in Rt with homogeneous isolation measures

Time evolution with isolation due to both symptoms and contact-tracing

Fig. 1.

Dependency on testing timeliness

Dependency on the epidemic data used

Fig. 3.

Fig. 4.

Fig. 5.

Reduction in Rt in the case of app usage

Time evolution in an optimistic scenario

Fig. 6.

Time evolution in a pessimistic scenario

Time evolution in the case of gradual adoption of the app

Fig. 7.

Dependency of effectiveness on the efficiencies ss and sc

Fig. 8.

Dependency on the app adoption

Acknowledgements

A Appendix: formalism and detailed derivations of mathematical results

A.1 Modeling a deterministic epidemics with a probability space

Fig. 10.

A.2 Infector–infectee pairs, generation time and the reproduction number

A.3 The suppression formula

Proposition 1

A.4 Random variables technology

Proposition 2

Proof

A.5 Remarks on generation time and numbers of infected individuals

Proposition 3

Proof

Proposition 4

Proof

A.6 Time evolution

Proposition 5

A.7 Modifications in the case of use of a contact tracing app

Author Contributions

Declarations

Conflict of Interest

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

The suppression model for $R_{t}$

First considerations on the variables $τ^{S}$ , $τ_{t}^{A}$ , and $τ_{t}^{T}$

Describing $τ_{t}^{A}$

Reduction in $R_{t}$ with homogeneous isolation measures

Reduction in $R_{t}$ in the case of app usage

Dependency of effectiveness on the efficiencies $s^{s}$ and $s^{c}$