Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2021 Nov 25;37:100532. doi: 10.1016/j.epidem.2021.100532

Assessing the risk of cascading COVID-19 outbreaks from prison-to-prison transfers

Todd L Parsons a, Lee Worden b,
PMCID: PMC8612764  PMID: 34861580

Abstract

COVID-19 transmission has been widespread across the California prison system, and at least two of these outbreaks were caused by transfer of infected individuals between prisons. Risks of individual prison outbreaks due to introduction of the virus and of widespread transmission within prisons due to poor conditions have been documented. We examine the additional risk potentially posed by transfer between prisons that can lead to large-scale spread of outbreaks across the prison system if the rate of transfer is sufficiently high.

We estimated the threshold number of individuals transferred per prison per month to generate supercritical transmission between prisons, a condition that could lead to large-scale spread across the prison system. We obtained numerical estimates from a range of representative quantitative assumptions, and derived the percentage of transfers that must be performed with effective quarantine measures to prevent supercritical transmission given known rates of transfers occurring between California prisons.

Our mean estimate of the critical threshold rate of transfers was 27 individuals transferred per prison per month, with standard deviation 26, in the absence of quarantine measures. Available data documents transfers occurring at a rate of 61 transfers per prison per month. At that rate, estimates of the threshold rate of adherence to quarantine precautions had mean 61%, with standard deviation 32%. While the impact of vaccination and possible decarceration measures is unclear, we include estimates of the above quantities given reductions in the probability and extent of outbreaks.

We conclude that the risk of supercritical transmission between California prisons has been substantial, requiring quarantine protocols to be followed rigorously to manage this risk. The rate of outbreaks occurring in California prisons suggests that supercritical transmission may have occurred. We stress that the thresholds we estimate here do not define a safe level of transfers, even if supercritical transmission between prisons is avoided, since even low rates of transfer can cause very large outbreaks. We note that risks may persist after vaccination, due for example to variant strains, and in prison systems where widespread vaccination has not occurred. Decarceration remains urgently needed as a public health measure.

Keywords: Household model, Two-level mixing, Disparities, Human rights, Decarceration

1. Introduction

As the COVID-19 pandemic continues in the United States, its dynamics in congregate settings of heightened transmission, including prisons, is crucial to understanding its spread, addressing racial disparities in the burden of the disease, and strategizing effective control.

Prisons are often overcrowded, unsanitary, and provide poor health care, and have been the site of many of the most concentrated and brutal outbreaks of the pandemic so far. Prevention of prison outbreaks is essential because standard control measures such as social distancing and self-isolation are not generally available to prison residents. One in five prisoners in the United States has been infected with SARS-CoV-2, compared to one in 20 in the U.S. overall, more than 1700 have died, and prisoners continue to become infected (The Marshall Project, 2020). The New York Times reported on January 29, 2021 that of the ten largest outbreaks in U.S. correctional facilities to date, six of them have been in California state prisons (The New York Times, 2021). Every single one of California’s 35 prisons has reported 200 or more cases (Covid Behind Bars, 2021).

Likely routes of introduction of the disease into prisons are via infected prison staffers, admission of infected prison residents from outside the prison system, and transfers of residents from other prisons. A widely reported outbreak at San Quentin prison in California, which infected over 2200 of the 3563 inmates and killed 28, was caused by a transfer of prisoners from the Correctional Institute for Men in Chino, California (Office of the Inspector General, 2021), and a subsequent outbreak at California Correctional Center in Susanville, California was likely caused by transfer from San Quentin. Multiple outbreaks in winter 2020 appear to have been caused by importation via staff members.

Prison outbreaks can be very large – the outbreak at San Quentin infected over 60% of the prison population, and the outbreak in California’s Avenal State Prison topped 80% – and because of well-known inequities in the criminal justice system, they contribute to racial inequity in the burden of COVID infection (Fortuna et al., 2020, Okonkwo et al., 2020, Lee and Ahmed, 2021, Franco-Paredes et al., 2020). In October 2020, the California Court of Appeals ruled that the California Department of Corrections and Rehabilitation (CDCR) has been guilty of “deliberate indifference” and that California prison populations must be reduced by half to address the ongoing risk of SARS-CoV-2 transmission (Anon, 2020a). This decision reflects the recommendation to decarcerate California prisons to 50% of capacity published by public health experts during the San Quentin outbreak (The Amend Project, 2020). The decision is undergoing appeal, substantial decarceration has not occurred, and multiple outbreaks have occurred in California state prisons in the time since the decision.

While the risks of outbreaks sparked by staff introductions or transfers and spread within prisons due to poor conditions are well known, here we examine the potential danger from another, potentially less apparent risk: the possibility that transmission from prison to prison via transfer of prison residents may be sufficient to lead to uncontrolled spread across the prison system. If such conditions should occur, the disease could be expected to spread to substantially more prisons than otherwise, and infect far more individuals (Fig. 1). The risks to prison residents, staff, and surrounding communities could be considerably increased.

Fig. 1.

Fig. 1

Scenarios for transmission between individuals and between prisons. A. When the reproduction number R between individuals – the mean number of cases caused by a case – is below the critical threshold of 1, transmission chains are short and outbreaks are small. B. When R is above the critical threshold, a large outbreak is possible. C. When the reproduction number R between prisons – the mean number of prison outbreaks caused by a prison outbreak – is below the critical threshold of 1, transmission between prisons may still occur, but spread between prisons will be relatively limited. D. When R is above the critical threshold, transmission between prisons can cascade and cause spread throughout the prison system.

The CDCR currently has quarantine and testing policies in place to prevent transfer of infective individuals. Unfortunately, adherence to CDCR policies has not always been universal, and it cannot be assumed that no risky transfers occur.

We have addressed this question by using established theory of disease transmission, specifically a patch model to be defined below, to estimate the threshold rate of transfer associated with supercritical transmission between prisons, and the rate of adherence to transfer policies needed to prevent supercritical transmission at known rates of transfer.

2. Methods

2.1. Data

Numbers of Covid-19 cases in each CDCR facility were obtained from data from California Department of Corrections and Rehabilitation, collected by the UCLA Covid Behind Bars project (Covid Behind Bars, 2021). Overall population sizes as of March 26, 2021 were extracted directly from data released by CDCR (California Department of Corrections and Rehabilitation, 2021c) (California City Correctional Facility population extracted from California Department of Corrections and Rehabilitation (2021b))

Records released in the course of ongoing legal proceedings document the rate of transfers between prisons in the period from September 21 through October 11, 2020 at about 500 individuals transferred per week (Anon, 2020b p. 7, line 8). There are 35 institutions in the California prison system, making that equivalent to approximately 61 transfers per prison per month.

2.2. Analysis

A patch model of disease transmission can model a collection of discrete populations in which transmission happens within a population, and at a separate rate between populations. One such approach, the so-called household model (Ball et al., 1997), assumes that the population is divided into many small groups (the households) in which local contacts occur frequently, whereas global contacts may occur between any two individuals in the population, albeit at a much lower rate.

In such a model, there are two types of outbreaks: local ones, in which the infection spreads widely within a single group, but remains confined to that group, and global outbreaks, in which the epidemic spreads among many groups. Global outbreaks are governed by a group-to-group reproduction number R whose value is the expected number of groups infected by transmission from a single group: the critical value is 1, and a large global outbreak is possible if the value is greater than 1 (Fig. 1).

For our purposes, we assumed that all contacts are local, within groups, except when a transfer occurs of an individual from one group to another. We assumed also that the rate of transfer is low enough that an individual will transfer to at most one group while infective. We found (see Appendix A) that the group-to-group reproduction number has the form

R=μpG,

where μ is the expected size of a major outbreak at a randomly chosen facility and pG is the probability that an individual will be transferred to a new site and cause a major outbreak in the new site (see Appendix A for details).

In order to evaluate how prison transfers affect prison-to-prison transmission, we modelled the group-to-group reproduction number in terms of the average number of individuals transferred between prisons. We expressed the probability pG in terms of the transfer rate, used empirical prison data to obtain upper and lower bound estimates for μ, and then solved for a threshold rate at which the critical value R is equal to one.

We modelled an ensemble of scenarios for these quantities, to cover the range of possibilities. We characterize these as optimistic or pessimistic according to whether they will lead to a lower or higher estimate of R within the current modelling framework.

1. Optimistic vs. pessimistic reproduction number. As an estimate of the basic reproduction number RL within a prison we used the value 8.44 (95% credible interval: 5.00–13.13) estimated from a COVID-19 outbreak in a large urban jail in the U.S. (Puglisi et al., 2020). Because this value is estimated from a setting in which a large outbreak occurred, and conditions in some prisons may be less conducive to transmission than those in which the largest outbreaks have occurred, we took the above number as a pessimistic estimate for RL. For an optimistic estimate, we calculated the probability pG that a transfer event leads to transmission between prisons using the more optimistic value of 2.87 (95% CI, 2.39–3.44) that was estimated for a basic reproduction number for COVID-19 in general community transmission (Arif Billah et al., 2020), and cut the probability in half to reflect the possibility that conditions may be better in roughly half of prisons. The factor of one half was chosen arbitrarily, to obtain a conservative bound.

2. Optimistic vs. pessimistic outbreak sizes. We constructed lower and upper bound estimates of the size-weighted mean final outbreak size μ from reported case counts in CDCR prisons. We calculated outbreak sizes to date by taking sequences of reported resident cases at a prison separated by 14 days or more of no cases as separate outbreaks. Because our model results exclude “outbreaks” that end after only a few cases, we exclude these from our estimation. In Appendix E, we estimate the mean and standard deviation of such small outbreaks by computing the total number infected in a branching process conditioned on extinction.

Using outbreaks of size 3 and larger (Figs. 23), the size-weighted mean outbreak size μ was 660 cases. Given the realities of asymptomatic infections, incomplete and imperfect testing, and under-reporting, this is likely to underestimate the true sizes of outbreaks. We thus took this value as an estimated lower bound of final outbreak sizes in California prisons. We take the mean overall population of each prison as a conservative upper bound for μ (see Appendix D), which was 3000 as of March 26, 2021.

Fig. 2.

Fig. 2

Sizes of COVID-19 outbreaks in California prisons as of March 26, 2021. Bars show the number of cases per outbreak (blue) for each outbreak of size 3 or more, and total population (grey) at each prison. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Fig. 3.

Fig. 3

Distribution of sizes of COVID-19 outbreaks in California prisons, up to size 50.

3. Optimistic vs. pessimistic secondary case distribution. Evidence is accumulating that transmission of SARS-CoV-2 has an overdispersed pattern, in which many people cause few or no infections and a relatively large number of infections are caused by a few people (Althouse et al., 2020, Adam et al., 2020, Susswein and Bansal, 2020). This pattern may make the probability pG lower than it could be, because relatively more people infect nobody, which reduces the likelihood of a major outbreak (Lloyd-Smith et al., 2005). We estimated pG given this pattern by assuming a negative binomial distribution of secondary cases (mean =RL, shape = 0.5 (Susswein and Bansal, 2020)), as is standard. However, this overdispersed pattern may be caused partially by wide variation in the number of people contacted by individuals socially (Susswein and Bansal, 2020), and it is not clear that this variation in contact structure is possible to the same degree in a prison setting, where individuals’ movements and locations are heavily constrained and regulated. For this reason, we also considered the possibility that secondary cases may be Poisson distributed within the prison setting, though they are more highly dispersed in community transmission.

4. Optimistic vs. pessimistic timing of transmission events. We also considered that the way in which the timing of transmission events is distributed can affect the probability of transmitting to another prison. If transmission tends to occur in bursts, for example driven by exceptional events when multiple people gather, such a burst might happen either before or after an individual is transferred from prison to prison. If transmission events are independent and happen all at different times, on the other hand, it is more likely that at least one of them will occur after a transfer. We model both of these cases.

We used each combination of the above assumptions to estimate a threshold transfer rate for the California prison system, above which transfers may create a risk of global spread of the coronavirus across the prison system. Calculations detailed in Appendix C use branching process approximations to express the probability pG in terms of the rate ρG of transfer between prisons per person per day for each combination of the above assumptions. These were used, together with our upper and lower bounds on μ, to numerically solve for the threshold rate ρG at which R=1 under each scenario. We transformed ρG to a threshold rate n of transfers per prison per month, using conversion factors of 30 days per month and the average number 2700 of individuals per prisons in the CDCR system as of March 26, 2021.

We note that this threshold number of transfers concerns potentially infective transfers who are exposed to the prison resident population in the facility where they arrive. Prisons, of course, have policies for quarantine of transferred residents and for testing before transfer to prevent transfer of infected individuals, and these policies are likely to reduce the risk due to transfer.

Transfers between California state prisons are regulated by a policy called the movement matrix (California Department of Corrections and Rehabilitation, 2021a). Residents are tested five days before transfer, rapid tested one day before transfer, and quarantined for 14 days after transfer. Quarantine is in celled housing with a solid door if possible, and otherwise in cohorts of no more than four people. Residents in quarantine are screened for symptoms daily, tested if symptomatic, and isolated if they test positive. All of them are tested after five days post-transfer, again after 12 to 14 days post-transfer, and then released if negative and asymptomatic. Both residents and staff are to wear N95 masks during transfer. Residents who have been diagnosed with COVID-19 and subsequently resolved are considered immune for 90 days and exempt from quarantine and testing. After 90 days they are considered susceptible again and subject to the above measures.

We assume that these procedures are likely to reduce the risk of transmission by transfer substantially. However, compliance with safety policies may not be perfect. For example, the California Inspector General has documented extensive noncompliance with mask guidelines in the California prisons, including during the San Quentin outbreak (Office of the Inspector General, 2020). If protective policies reduce the number of transfers who can potentially transmit the virus by some percentage, it is the number of unprotected transfers that must be compared to the threshold value. If transfers are occurring at a known rate nt, and the threshold transfer rate for uncontrolled transmission between prisons is n, then the percentage of transfers that must be conducted in adherence to the protective policy in order to reduce unprotected transfers to the threshold rate is a=100(ntn)/nt, or zero if n exceeds nt.

2.3. Vaccination and decarceration

Vaccination in the California prison system is underway, with CDCR reporting that 40% of prison residents have received COVID-19 vaccination. A recent legal filing reported that accounting for previously infected prisoners, 76% of incarcerated people may have immunity (Miller, 2021).

Both increasing immunity and decarceration are likely to affect the spread of infections in at least two important ways, firstly by reducing the reproduction number and relatedly the probability that an introduction leads to an outbreak, and second by reducing the sizes of outbreaks if they occur. Both of these changes will affect our estimates of the rate of transfers needed to produce cascading outbreaks.

We look at the relation between increasing immunity and the critical threshold for cascading outbreaks by estimating the threshold transfer rate and associated quarantine adherence rate, as above, while reducing the local reproduction number parameters discussed above (RL) by half (which affects our estimate of how often a transfer causes an outbreak, but not of outbreak size), reducing the characteristic outbreak size μ by half, and reducing both by half simultaneously.

3. Results

We first estimated the threshold transfer rate (n) under all combinations of the above listed model assumptions, without protective measures (Table 1, Fig. 4). The values estimated for n ranged from 3.6 to 92 individuals transferred per prison per month, with mean 27 and standard deviation 26. The generation time distribution used in these estimates was that estimated in a recent meta-analysis (Ferretti et al., 2020): a Weibull distribution with mean 5.5 days and standard deviation 1.8 days (parameters α=3.37, β=6.12).

Table 1.

Estimates of critical threshold for supercritical transmission between prisons in individuals transferred per prison per month and needed levels of adherence, given partial adherence with California’s transfer policy.

Optimistic μ Optimistic R Optimistic case
distribution
Optimistic
timing
n Threshold
adherence
Y Y Y Y 92.0 0
N Y Y Y 20.0 67
Y N Y Y 30.0 51
N N Y Y 6.2 90
Y Y N Y 48.0 21
N Y N Y 10.0 83
Y N N Y 22.0 64
N N N Y 4.2 93
Y Y Y N 77.0 0
N Y Y N 18.0 71
Y N Y N 24.0 62
N N Y N 5.6 91
Y Y N N 39.0 36
N Y N N 9.6 84
Y N N N 16.0 75
N N N N 3.6 94

Fig. 4.

Fig. 4

Estimated threshold transfer rates and levels of adherence to policy. (Left) threshold number of individuals transferred per prison per month, under multiple scenarios (Table 1), (Right) rate of adherence to transfer policy needed to reach threshold number of transfers, under the assumption of 61 total transfers per prison per month. Box plots display median and inter-quartile range.

We converted our threshold estimates to the percentage of transfers that must be conducted in compliance with the safety policy in order to achieve the threshold number of unprotected transfers or below, given a total of 61 transfers per prison per month (Table 1, Fig. 4). Our estimates of this threshold rate of adherence to quarantine precautions ranged widely but clustered in the upper third of the percentage scale, with mean 61% and standard deviation 32%.

As a look at the sensitivity of our estimates to reductions in risk due to vaccination and/or decarceration, we estimated the same quantities while reducing the probability of an outbreak, the size of outbreaks, or both, by half (Fig. 5). We estimated that while under our most optimistic assumptions the risk of cascading outbreaks is reduced substantially at 61 transfers per facility per month, to the point where quarantine measures could be ignored entirely without exceeding the estimated threshold (which could of course cause substantial risks other than cascading outbreaks), the change in the median and mean estimates is much more modest. We estimated the mean threshold transfers per prison per month (n) at 30, 54, and 61 respectively when reducing the assumed parameter R by half for estimation of the probability of an outbreak, reducing the characteristic outbreak size μ by half, and both (standard deviation 26, 53, 53). The mean estimate of threshold rate of adherence to policy was 55%, 40%, and 34% respectively, with standard deviation 33, 34, 35.

Fig. 5.

Fig. 5

Estimated threshold transfer rates and levels of adherence to policy given vaccination and/or decarceration, as in previous figure with reductions in reproduction number and/or outbreak size assumed. (Left) threshold number of individuals transferred per prison per month, under multiple scenarios as above, (Right) rate of adherence to transfer policy needed to reach threshold number of transfers, under the assumption of 61 total transfers per prison per month. Box plots display median and inter-quartile range.

4. Discussion

We have constructed a range of values of a threshold rate of mixing between prisons above which transmission between prisons is likely to be supercritical. Supercriticality between prisons means that a prison outbreak is expected to produce more than one other prison outbreak on average, potentially leading to uncontrolled spread throughout the prison system.

We estimate that the reported rate of transfers that has been occurring in the California prison system has likely exceeded this threshold. We estimate that at these rates of transfers, the quarantine precautions must be highly effective and rates of compliance must be high to avoid risk of supercritical transmission between prisons. The rate of outbreaks occurring in California prisons suggests that supercritical transmission may already have occurred or may be occurring.

We offer these estimates as a way of assessing one of the multiple risks posed by infectious disease transmission in the prison system. It is important to note that this threshold cannot be understood as providing a safe or acceptable rate of transfers, since transfer rates below the critical threshold can still cause huge outbreaks in multiple prisons. We are discussing an additional risk beyond the clear dangers of prisons’ unsafe conditions and spread due to transfers: the risk that in addition to having multiple large and deadly prison outbreaks of COVID-19, the rate of transfers could be sufficient to cause uncontrolled spread across a large portion of the prison system. This situation would likely lead to a great deal more harm to prison residents and staff than even the known risks of multiple large prison outbreaks, and could place communities throughout the state at risk as well.

The programme of vaccination that is underway in the California prison system is crucial in reducing transmission and saving lives, and will likely help to end the pandemic more broadly as prison transmission poses risks to communities beyond the prison walls. We caution that substantial risks may continue to exist in the CDCR system, as spread of the SARS-CoV-2 virus can still occur, prison conditions continue to be overcrowded and unsanitary, and the effects of variant strains are yet unknown, not to mention the other diseases currently circulating and the potential of future emerging pandemics. Our results and methods may also be relevant to other prison systems where vaccination is not yet widespread. Decarceration remains a crucial public health measure to bring disease spread under control.

While these estimates are necessarily imprecise due to limited availability of data, such that risks could in fact be lower than we have estimated, we note as well that in addition to the mechanism of transfer of prison residents considered here, transmission between prison facilities may also be occurring resulting from travel of infected staff who work at multiple facilities. For this reason, the risk of uncontrolled transmission between prisons may in fact have been higher than we have estimated here.

These results have a number of limitations. We have assumed that individuals are removed from the epidemic process at the end of their infective period, as we consider the final size of each local epidemic, and thus do not account for the possibility of reinfection. In using a branching process, we have implicitly assumed a very large number of local communities, so that at least initially, each global transmission is to a new site, and ignores the possibility of a second epidemic in the same location. This assumption is reasonable in the context of prisons, where there are indeed many sites. Branching processes are thus most relevant to modelling emerging pathogens, novel strains, and regions with lower rates of vaccination/acquired immunity. We have thus limited ourselves to retrospective conclusions about supercritical transmission in the California prison system; such methods could, however, inform decisions in the face of e.g. vaccine resistant variants. The assumption of a homogeneous rate of transfer per individual across all prisons may be limiting as heterogeneity may be important; with sufficient data on site-to-site transfer rates, this could potentially be addressed by dividing facilities into classes with class-specific transfer rates and using a multi-type branching processes, but that is beyond the scope of this study.

This approach is applicable to analysis of risk due to transmission between sites in a variety of hotspot settings of transmission including but not limited to prisons. Transfer, migration, and mixing between sites may be important sources of risk in other locations of high transmission as well, such as jails, ICE facilities, skilled nursing care facilities, meat packing plants, and other agricultural operations.

CRediT authorship contribution statement

Todd L. Parsons: Designed the research, Performed the research, Wrote the paper. Lee Worden: Designed the research, Performed the research, Wrote the paper.

Declaration of Competing Interest

Lee Worden received partial support from the office of the federal receiver, J. Clark Kelso, for research not including this project. Otherwise the authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgements

LW acknowledges NIH R01 GM130900/GM/NIGMS. We are grateful to Travis Porco, Seth Blumberg, and Jianda Monique, to Martha Lincoln, Sarah Ackley, and multiple colleagues in the Amend and CalPROTECT projects and at CCHCS and CDCR for helpful comments on this work, and to Michael Bien and Ernest Galvan at Rosen, Bien, Galvan & Grunfeld LLP for helpful information regarding transfers.

Appendix A. General household models

A patch model of disease transmission can model a collection of discrete populations in which transmission happens within a population, and at a separate rate between populations. One such approach, the so-called household model (Ball et al., 1997), assumes that the population is divided into many small groups (the households) in which local contacts occur frequently, whereas global contacts may occur between any two individuals in the population, albeit at a much lower rate. In such a model, there are two types of outbreaks, local ones, in which the infection spreads widely within a single subpopulation, but remains confined to that subpopulation, and global outbreaks, in which the epidemic spreads among many subpopulations. Here, we consider a variant of the model of Ball et al. (1997); whereas there, global contacts were assumed to occur between individuals that remain in their respective households, here we consider a pandemic caused by individuals moving between households.

A branching process approximation allows us to compute the probability that a global outbreak occurs: if there are a large number of households, at the beginning at least, the probability that the same household receives two or more infected individuals is negligible, so, we can assume to first approximation that each global contact is with a new household. Thus, each infected individual potentially starts a branching process of infected households. The so-called “merciless dichotomy” (Jagers, 1992) tells us that a branching process either goes extinct rapidly, say with probability q, or grows indefinitely, with probability p=1q. The latter corresponds to a major pandemic. If the branching process goes extinct, then necessarily, the branching processes started by each infected individual in the first household go extinct as well, which occurs independently for each branching process with probability q as well; this gives us a recursive formula for q.

Let πi be the probability that a global contact is made with a household with i individuals. Suppose that the fraction of households with i individuals is hi. If we were to choose a household at random, taking πi=hi gives the probability of choosing a household of size i. if, on the other hand we choose an individual at random, then the probability that individual comes from a household of size i, πi, is proportional to ihi (we have equal chance of choosing each individual from the same household), and households are chosen according to a size-biased distribution. The latter is the distribution used in Ball et al. (1997), where transmission is assumed to happen between individuals circulating globally but residing in fixed households, so larger households with more members are more likely to have infectious contacts. The distribution π=(π1,π2,) only enters our model in the calculation of the mean household outbreak size; we will use empirical upper and lower bounds for this mean size, so we shall not need to specify π.

Given a household of size i, let Pij be the probability that ji individuals in that household are ultimately infected by an incoming infected individual (“patient zero” to whom we associate the label 0). We then have

q=i=1πij=1iPijE[qZ0+Z1++Zj]

where Z1,,Zj are the number of households subsequently infected by each of the j infected individuals from the household into which patient zero was introduced (“household one”). We will assume that Z1,,Zj are identical and independent copies of a random variable Z, and all are independent of Z0, the number of households subsequently infected by patient zero. Because patient zero has already infected one household, we have additional information on that individual, and Z0 may be distributed differently in light of that information. For example, when an outbreak follows the arrival of patient zero, they may be subsequently quarantined and thus not relocate again during their infectious period, so Z0=0.

By independence,

q=i=1πij=1iPijE[qZ0]E[qZ]j. (A.1)

(n.b. it is here that we are assuming the number of households is very large, so that each infected individual is making contact with distinct households). Write F(q) for the expression on the right in (A.1); the function F(z) is the probability generating function (PGF) for the total number of households infected by patient zero. F(z) and F(z) are positive and increasing for z>0. F(1)=1 and F(1) is the expected number of households infected by individuals from household one (see e.g. Bartlett, 1956, Jagers, 1975 for results on PGFs for branching processes). The branching process has the possibility of growing indefinitely (i.e. a major pandemic occurs) if and only if the equation F(q)=q has a second solution 0<q<1.

Solving F(q)=q analytically is generally impossible, but we can easily derive a criterion for the existence of a second solution. Since

F(0)=i=1πiPi0

is the probability that the patient zero infects no other individuals in their household, we must have F(0)>0. Rolle’s Theorem tells us that F(q)=q has a solution q<1 if and only if F(q)=1 for some q<q<1. Since F(z) is increasing, this shows that there is a second solution if and only if F(1)>1.

Now,

F(1)=i=1πij=1iPijE[Z1]+jE[Z]=i=1πiE[Z1]+μiE[Z].

where μi=j=1ijPij is the expected number of the i individuals in a household of size i who are infected by the incoming patient zero (see Appendix D for an approximation for μii when i is large). Now let

μ=i=1πiμi,

so μ is the expected number of individuals infected in household one, when the household is drawn according to π=(π1,π2,), and

F(1)=E[Z1]+μE[Z]

gives the expected total number of households infected by the first infected individual (E[Z1]) and their contacts (each one makes E[Z] contacts), i.e. R.

R thus reflects the fact that transmission between households is proportional to the number of individuals within the house, who each individually make global contacts with other households. In the same way that the critical threshold R0=1 for the basic reproduction number separates subcritical from supercritical transmission in non-patch models, the boundary R=1 is the global critical threshold above which transmission is globally supercritical, meaning that an outbreak in one group is expected to infect more than one other group on average and can cause a large outbreak across the system of groups. To emphasize the distinction between R and the usual basic reproduction number, in what follows we will use RL to denote the basic reproduction number for a local epidemic at a single site.

The branching process approximation we have chosen has considerable flexibility beyond the application presented here. At the cost of making the parameter μ something of a black box, we need only make minimal assumptions about individual epidemic and transfer dynamics: for example, we do not need to make specific assumptions about the duration of the infectious or latent periods, and the results are equally compatible with SIR or SEIR dynamics. In particular, we do not make explicit use of the values πi, which only enter into the weighted mean outbreak size μ.

Each of these choices, however, will result in different values of μ; while the distribution of prison sizes (hi) is an empirical and observable quantity, the mean number of infections at a site will have to be either computed or observed empirically. When individual facilities are sufficiently large that an ordinary differential equation (ODE) model is reasonable approximation to the local dynamics, we can use standard compartmental models of estimate the final size of the epidemic. When local facilities are small, more computationally intensive methods are required (see Picard and Lefevre (1990)). Here, we will use facility sizes as an upper bound for the size of the local outbreak (the total fraction infected for e.g. in the SIR compartmental model, the total fraction infected is approximately 1eRL, which is very close to 1 for values of RL estimated for SARS-CoV-2), and the average size of reported epidemics as a lower bound: underreporting and asymptomatic infections ensure that reported sizes underestimate the true size of outbreaks.

In what follows, we will assume that individuals are moved sufficiently infrequently between sites that the probability that they move twice during their infectious period is negligible i.e. E[Z1]=0 whereas E[Z] is the probability that an individual moves to a new household during their infectious period and causes an outbreak in their new household, pG. As a result, we are underestimating the probability of a major pandemic.

In large populations, outbreaks tend to two extremes: they either “fizzle”, resulting in an average of fewer than 1RL new infections, or they result in a large outbreak in which almost all of the susceptible population is eventually infected. The former are likely to go undetected, so we neglect outbreaks that fizzle, and only consider large outbreaks in computing pG. Again, this results in an underestimation of the probability of a major pandemic. After an interlude explaining the approach we take to modelling individual transmission, we turn our attention to computing this probability.

Appendix B. Point processes for infections

B.1. Intensity functions and the generation interval density

The Crump-Mode-Jagers general branching process (Crump and Mode, 1968, Crump and Mode, 1969, Jagers, 1975) provides an extremely supple framework in which to investigate the initial phase of an epidemic, giving an individual-based stochastic model that corresponds to the renewal equation at the early phase of an epidemic when, in a large population, we may assume that each new contact made by an infectious individual is with a susceptible individual who has not been previously contacted. Fix t=0 as the time that a given focal individual is infected, and assume secondary infections at random times t1,t2,. The ensemble of times is an example of a point process, which is represented by its counting measure, N: given a subset A[0,),

N(A)=#{ti:tiA}.

We may similarly assign such a point process to each secondary infection, with the points ti now corresponding to the time elapsed between their infection and the tertiary infections they cause.

One can integrate an arbitrary function f with respect to N(dt)=#{ti:tidt}:

0tf(t)N(dt)=if(ti),

whenever the sum on the right hand side exists.

If no two points coincide, i.e. if the infection is a simple point process, in which case

N(dt)=1if tidt for some i, and0otherwise.

(n.b., while no two points coincide, we do allow points to be arbitrarily close). A simple point process is characterized by its intensity function

λ(t|Ht)dt=P{tidt for some i|Ht},

where Ht denotes the information available at time t, which includes the infection times tit, but may also include information about other hidden variables (we illustrate this with an example below). For a set A[0,), N(A) is a Poisson point process with rate Aλ(t|Ht)dt. When λ(t|Ht) is a random function independent of the prior infection times, then the point process is a Cox process or doubly stochastic Poisson process; when it is a deterministic function of time, it is an inhomogeneous Poisson process; finally, if the intensity function is a constant λ, then the point process is the classical Poisson process.

Example 1

The classical SIR compartmental differential equation model can be interpreted as the large population limit of a stochastic model in which, upon being infected, makes infectious contacts at rate β during an infectious period that is exponentially distributed with (recovery) rate γ. If L denotes the infectious period of the first infectious individual, and we measure time with t=0 corresponding to their time of infection, then their infection point process has intensity λ(t|Ht)=β1{Lt}, where 1{Lt} is the indicator function of their infectious period:

1{Lt}=1if Lt, and0otherwise.

This is an example of a Cox process: the intensity function is random, as it depends on the infectious period L, but the probability of an infectious contact at time t is independent of any previous contacts. Here, Ht contains the information of whether or not Lt (n.b. it does not include the value L itself unless Lt, as Ht contains no information about the future beyond time t).

In general, each individual has a distinct intensity function, which may depend on one or more hidden variables (e.g. in Example 1 above, each individual has a distinct – random – infectious period). We may also consider the expected intensity function of the point process, h(t), given by

h(t)=E[λ(t|Ht)].

h(t) the corresponds to the ensemble average intensity function over a large population. If 0h(s)ds<, then we can normalize h(t) to obtain a probability density function,

g(t)=h(t)0h(s)ds,

which is the generation interval density. In particular, the total expected number of infections caused by patient zero, the local reproduction number, is

RL=E[N([0,))]=E0N(ds)=0h(s)ds,

Thus, h(t)=RLg(t) e.g. in Example 1, we see that

h(t)=E[β1{Lt}]=βP{Lt}=βeγt,

and 0h(t)dt=βγ=RL.

Given the abundance of data regarding the generation interval, and the relative paucity of information on individual hidden variables, we will use h(t)=RLg(t) for the empirical generation interval distribution for each individual’s intensity function — approximating each individual by the population average. This allows us to make full use of available information about the timing of infection events, rather than imposing e.g. a compartmental model. It is not, however, without consequence, e.g. the total number of infections caused by “patient zero”, N([0,)), is a Poisson process with rate

0h(t)dt=RL0g(s)ds=RL.

On the other hand, if we compute the probability generating function (the PGF, which uniquely characterizes a random variable) for the SIR compartmental model of Example 1 above, where the total number of infections caused by a single individual is Poisson distributed with (random) rate

0β1{Lt}dt=βL,

we have

E[zN[0,)]=EE[zN[0,)|L]=EeβL(z1)=γγβ(z1)

i.e. the total number of infections is geometrically distributed with success probability p=γβ+γ=11+RL, which, despite having the same mean, is not Poisson distributed with rate RL.

This might otherwise might be considered a deficiency, but for us it becomes a virtue: the geometric distribution has variance to mean ratio (VMR) of 1p=1+RL>1 and the Poisson process has VMR one, whereas the VMR for empirically observed individual transmission lies between the two. As we discuss below, however, our inhomogeneous Poisson process model can be used to construct a family of processes with the same reproduction number RL and generation interval g(t) whose total number of infections per infected are negative binomially distributed, and thus include both the Poisson and geometric distributions as well as processes with VMRs taking all values in [1,).

B.2. Incorporating overdispersion

To allow for varying dispersion – as well as individual variation in contact rates – we consider a doubly stochastic Poisson (or Cox) process, drawing an individual reproductive ratio, r, for each individual from a fixed distribution with mean RL.

Given A[0,), set gA=Ag(t)dt. If we assume each individual’s r is gamma distributed with shape parameter k and scale parameter RLk for some k>0, then the probability of m infections in an interval [a,b] caused by a randomly chosen individual is

0(rgA)mm!ergArk1ekRLrdt=Γ(m+k)m!Γ(k)kRLgA+kk×RLgARLgA+km,

which we recognize as a negative binomial distribution with success probability RLgARLgA+k and k failures. The mean number of infections occurring at times in A is thus RLgA and the VMR of the number of those infections is 1+RLgAk. In particular, taking A to be the whole real line, we see that the mean total number of infections caused by a single individual is negative binomially distributed with mean RL and index of dispersion 1+RLk; taking k=1, we obtain a geometric distribution with success probability 11+RL, whereas taking k yields a Poisson distribution with rate RL.

B.3. Compound Poisson processes

More generally, we can consider the case when events occur with intensity function h(t), but now the number of infections occurring at the i th contact is given by a random variables νi, where ν1,ν2, are independent and identically distributed (i.i.d.). For example, given low-resolution temporal data, we may be unable to distinguish nearby infection times. The number of infections occurring at times in the set A is then

N~(A)=i=1N(A)νi,

where, as before, N(dt) is a simple point process and

E[N~(dt)]=E[ν]h(t)dt,

and

RL=E[N~([0,))]=E[ν]0h(t)dt.

Unlike previously, this relation does not fix the value of 0h(t)dt. Rather, for any λ(0,), we can have 0h(t)dt=λ, provided E[ν]=RL/λ. We then have h(t)=λg(t), so λ determines the number of points: an increased λ will correspond to fewer infections on average per event and vice versa. In particular, for a given RL, we can potentially have arbitrarily large clusters of infections, provided they occur sufficiently rarely.

B.3.1. Conditioning on a single event

As an extreme case, we consider a scenario in which all transmission from a given individual occurs at a single point in time. The probability of exactly one event in the inhomogeneous Poisson point process is

0h(t)dte0h(t)dt=λeλ,

whereas the joint probability of having a single point at t is

h(t)e0h(t)dt,

so the probability of a single point at t conditional on only one point is the ratio of these two probabilities,

h(t)e0h(t)dt0h(t)dte0h(t)dt=h(t)0h(t)dt=g(t).

Since we require that the expected number of infections remains equal to RL, we must have that their number, ν1, satisfies E[ν1]=RL.

Appendix C. Probability of a large local outbreak, pG

We now use the modelling framework described above to determine the compute the probability, pG from Appendix A, that an infected individual is transferred and causes a major outbreak in the new site. We will work in the context of our prison to prison model, taking the prisons as the “households” of the model. Recall that a minor outbreak occurs when the branching process approximating the initial transmission goes extinct, and a major outbreak occurs otherwise. In practice, it is more transparent to compute the probability qG=1pG. that a major outbreak does not occur, which we do below.

By assumption, an individual can infect a new site if and only if they are transferred while infectious. We will assume that transfers occur as a Poisson point process with rate ρG (transfers per individual per day), which for simplicity is assumed to sufficiently small that the probability that a given individual is transferred multiple times while infectious is negligibly small. Thus, each individual waits an exponentially distributed time with mean 1/ρG before being transferred.

To obtain a range of plausible values for qG/pG, we will consider each of the models discussed in Appendix B:

  • (i)

    a compound Poisson model in which infections happen individually and independently with a time-dependent intensity function h(t)=RLg(t),

  • (ii)

    a doubly-stochastic Poisson model, where individual reproduction number varies,

  • (iii)

    a compound Poisson process model where each event leads to multiple infections, and

  • (iv)

    a “burst” model in which all infections occur simultaneously.

C.1. Poisson distributed infections

To compute the probability of a local large outbreak under our generation interval approach, we must take care to distinguish between the patient zero who initiated the epidemic, for whom some fraction of the infectious period has already elapsed, and the newly infected individuals in the new site.

Under the assumption of a large local population, we can continue to use branching process recursive formulas to determine the probability that no locally infected individual gives rise to a significant outbreak, say qL:

qL=EqLN([0,T)). (C.1)

Here N([0,T)) is the number of individuals infected by a given infectious individual before that infectious individual is transferred at a random time T after infection. As before, T is exponentially distributed with rate ρG, whereas conditional on T, N([0,T)) is Poisson distributed with rate RL0Tg(t)dt. Now,

qL=EEqLN([0,T))|T,

where the outer expectation is for the random variable T. Recognizing the inner expectation as the probability generating function for a Poisson random variable gives us

qL=EeRL(qL1)0Tg(t)dt=0ρGeρGteRL(qL1)0tg(u)dudt.

Now consider patient zero. Let T (still exponentially distributed with rate ρG) be the time after the start of their infectious period at which patient zero was introduced into the local community. Then, they will infect N([T,)) individuals in the new site, whence the probability that there is not a major outbreak is

qG=EqLN([T,)).

Again, first taking the conditional expectation of N([T,)) given T, which is a Poisson random variable with rate RLTg(t)dt=RLG(T), and then over T, gives

qG=0ρGeρGteRL(1qL)G(t)dt (C.2)

e.g. for the Weibull distribution with shape a and scale b, G(t)=etba.

C.2. Individually varying reproductive number

More generally, as before, we can consider the possibility that each individual has an i.i.d. reproductive number, say Ri drawn according to a Gammak,RLk distribution, in which case, we may proceed as before to obtain (C.1), which we must now average over the gamma distribution to obtain

qL=EqN([0,T))=EEEqN([0,T))|Ri,T|T=EEeRi0Tg(t)dt(qL1)|T=E1RL(qL1)k0Tg(t)dtk

and, recalling G(t)=tg(u)du,

=E1RL(qL1)(1G(T))kk=0ρGeρGt1+RL(1qL)(1G(t))kkdt,

which we may solve numerically for qL. Proceeding similarly, averaging (C.2) over the gamma distribution and interchanging the order of integration yields

qG=0ρGeρGt1RLkG(t)(qL1)kdt=0ρGeρGt1+RLkG(t)pLkdt,

where we recall pL=1qL.

C.3. Compound Poisson processes

Now, suppose that at the i th time of the inhomogeneous Poisson processes N([0,T)) and N([T,)) (as previously, T and T indicate the transfer time out of, or into, the focal site respectively) the individual independently infects νi individuals, where νi is a random variable with probability generating function Pν(z)=E[zν].

Now, since the νi are independently and identically distributed and independent of their arrival times in N([0,T)),

qL=EqLi=1N([0,T))νi=Ei=1N([0,T))qLνi=Ei=1N([0,T))EqLνi|N([0,T))=Ei=1N([0,T))EqLνi=Ei=1N([0,T))Pν(qL)=EPν(qL)N([0,T)).

Now, proceeding as before,

EPν(qL)N([0,T))=EEPν(qL)N([0,T))|T=EeRL(Pν(qL)1)0Tg(t)dt=0ρGeρGteRL(Pν(qL)1)0tg(u)dudt,

giving us the relation qL=0ρGeρGteRL(Pν(qL)1)0tg(u)dudt.

Proceeding similarly gives us

qG=0ρGeρGteRLG(t)(Pν(qL)1)dt.

C.4. Bursts of infections

Suppose each individual waits a randomly distributed time with probability density function g(t) before causing a random number ν of infections, independent of the time of transmission. Fix a given individual, and suppose that they are transferred at time T, exponentially distributed with rate ρG and that their transmission event happens at time T. Then, their local chain of infection goes extinct if either they are transferred prior to transmission, or if not, if all those they infect have finite chains of infection:

qL=E1{T<T}+1{T<T}qLν=P{T<T}+P{T>T}EqLν.

Again, the expectation on the right is the probability generating function for ν evaluated at qL, Pν(qL).

The calculation thus depends on the choice of law for the random variables νi, or equivalently the choice of distribution and its probability generating function Pν(z). For example, if each νi is negatively binomially distributed with mean RL and k successes (and thus success probability p=RLk+RL) we have Pν(z)=kk+RL(1qL)k, whereas for a Poisson process with the same mean RL we have Pν(z)=eRL(z1).

On the other hand,

P{T>T}=EPT>T|T=EeρGT=0eρGtg(t)dt=gˆ(ρG),

where gˆ(p) indicates the Laplace transform of g(t). Thus,

qL=(1gˆ(ρG))+gˆ(ρG)Pν(qL).

For the initial individual, we need to take into account the possibility that the individual had their “burst” prior to being transferred to the focal site,

qG=P{T>T}+P{T<T}EqLν=gˆ(ρG)+(1gˆ(ρG))Pν(qL)=gˆ(ρG)+(1gˆ(ρG))qL(1gˆ(ρG))gˆ(ρG)=gˆ(ρG)+(1gˆ(ρG))(gˆ(ρG)pL)gˆ(ρG)

or, rearranging,

pG=pL1gˆ(ρG)gˆ(ρG).

C.5. Threshold mixing rates

Synthesizing the above results to derive a threshold rate of transfers, we recall that the criterion for a major outbreak on the congregate level is R>1, where R=μE[Z], μ is the (size-biased) mean size of a major outbreak, and Z is the number of facilities in which a given individual causes a major outbreak. Assuming that each individual is transferred at most once, then E[Z]=pG=1qG, where qG is calculated as above. To calculate the critical transfer rate ρG, one needs to solve R=1 for ρG.

Appendix D. The final size of a large well-mixed epidemic

In this appendix, we sketch an argument that, in the limit of large population sizes and under well-mixed assumptions, the final size of a closed epidemic (i.e. the fraction of susceptible hosts who have been infected over the duration of the epidemic, assuming no new individuals are introduced) is equal to that for the Kermack–McKendrick SIR epidemic for a wide variety of models, provided all individuals are indistinguishable. As will become clearer as we formulate the argument, the result applies equally well to e.g. a SIR or SEIR epidemic, but does not apply when e.g. individuals vary in their susceptibility or in the size of their contact networks.

We consider the epidemic from a retrospective vantage point. Suppose that the local population consists of n individuals, labelled (arbitrarily) 1,,n. Let 1i be an indicator variable, equal to 1 if the individual with label i is eventually infected, and 0 if they are never infected. Let Ni be the number of potential transmissions by individual i: if individual i is never infected (1i=0), then none of these potential transmissions result in a new infection; if on the other hand, i is eventually infected (1i=1), then the potential infections result in new infections if the contacted individual is susceptible. Thus, E[Ni]=RL The variables Ni and 1i are assumed to be independent (e.g. we exclude the possibility that some individuals are highly susceptible and highly transmissive, as might occur with a weakened immune system.

Let Rn()=i=1n1i denote the total number of individuals who were infected over the course of the epidemic, and let Sn()=nRn() be the number who were never infected.

Now, consider the probability, p1=P{11=0}, that the individual with label 1 was never infected; because the labelling is arbitrary, this is simply the probability that we uniformly drew the one of the Sn() individuals that were never infected, Sn()n=1Rn()n. Since all individuals are identical up to an arbitrary labelling, pi=p1 for all i.

On the other hand, the probability that the first individual was never infected can be written as the probability that they were not “chosen” as a contact by an infectious individual (under the well-mixed assumption, any individual i has a contact with individual 1 with probability 1n1):

i=2n11n1Ni1i=ei=2nNi1iln11n1=ei=2nNi1i1n1+O1n2.

Now, consider the sum i=2nNi1i. Let R~n() be the number of individuals with labels {2,,n} who were infected (thus R~n()=Rn() or R~n()1 according to whether individual 1 was infected or not). If the limit

r()=limnR~n()n

exists and is non-zero, then the central limit theorem tells us that

limni=2nNi1in1=limnR~n()n1i=21i=1nNiR~n()=r()E[Ni]=r()RL.

On the other hand, if r()=0, which would be the case if the branching process approximating the epidemic went extinct, then for any ɛ>0 Markov’s inequality tells us that

Pi=2nNi1in1>ɛ1ɛEi=2nNi1in1=1ɛRLE[R~n()]n1,

which tends to 0 as n. Since ɛ>0 is arbitrary, the sum is zero with probability 1.

Equating the two expressions for the probability that the first individual is never infected, and taking the limit as n, we get the tautology 1r()=1 if r()=0, and the Kermack–McKendrick final size formula otherwise:

1r()=eRLr()

Rewriting this in terms of the final fraction susceptible, s()=1r(), we get s()=eRL(1s())RL, or, equivalently, RLs()eRLs()=RLeRL. Thus,

s()=1RLW0RLeRL,

where W0(x) is the principal branch of Lambert’s W-function (Corless et al., 1996), the transcendental (multi-)function satisfying x=W(x)eW(x), where the principal branch is real valued and increasing on [e1,).

Moreover, W0(x)=x+O(x2), so

s()=eRL+O(RLe2RL).

For RL=4, we have that eRL0.0015, whereas s()0.0198, so the relative error in approximating s() by eRL is already less than 8%, whereas approximately 98% of the population will have been infected, thus justifying the use of the mean household size as a reasonable upper bound on the final size of the epidemic.

Appendix E. The final size of a small outbreak

In what follows, we derive an estimate for the final size of a local outbreak conditioned on extinction: for reasons of tractability, we will compute the final size of an outbreak assuming no transfers, and assuming that patient zero arrives at the time of their infection.

E.1. Conditioning on extinction

Let N=N([0,)) denote, as previously, the total number of infections caused by a focal individual in any of the models discussed in Appendix B, and let

F(z)=E[zN]=n=0P{N=n}zn

be the corresponding PGF. Then, as previously the unique solution 0<q~<1 to q~=F(q~) is the probability that the branching process of infections (i.e. as before, we ignore exhaustion of susceptible individuals) eventually goes extinct. Now, consider the process conditioned on extinction: the branching process goes extinct if and only if the branching processes initiated by all individuals infected by patient zero also go extinct; the independence of these secondary branching processes means the probability that each goes extinct is independent wit probability q~, and thus,

P{N=n|extinction}=P{N=n,extinction}P{extinction}=P{N=n}(q~)nq~.

In particular, the PGF for the total number of infections conditioned on extinction is Fext(z)=F(q~z)q~.

E.2. The PGF for the final size

In what follows, we will refer to generations of infections: the first generation consists of all individuals infected by patient zero, the second generation consists of all individuals infected by individuals in the first generation, etc. Let Xk be the number of infected individuals in the k th generation, and let

Yk=i=0kXi

be the total number infected in the first k generations. Let Gk(w)=E[wYk|extinction] be the PGF for Yk conditional on extinction. Now, suppose that patient zero infects N individuals, who each give rise to branching processes with Xk(i) offspring in the k th generation and Yk(i) total offspring by the k th generation. Then,

Gk(w)=Ew1+i=1NYk1(i)|extinction

(n.b. the term 1 counts patient zero in the total outbreak size)

=Ewi=1NwYk1(i)|extinction=EwEwYk1|extinctionN|extinction=EwGk1(w)N|extinction=wFext(Gk1(w))

Taking k, we get a functional equation for the final size of the epidemic conditioned on extinction,

G(w)=wFext(G(w)). (E.1)

E.3. The Poisson final size distribution

In general, no exact solution for (E.1) is available, although we can use Lagrange’s inversion formula to obtain the coefficients of the series for G(w) from those of Fext(z). In the special case when N is a Poisson(RL) random variable, we have F(z)=eRL(z1), so that q~=eRL(q~1). Thus, RLq~eRLq~=RLeRL, which we can solve as above using Lambert’s W function to obtain

q~=1RLW0RLeRL,

whereas Fext(z)=1q~eq~RL(z1). Then, G(w)=wq~eq~RL(w1), so RLG(w)eRLG(w)=wRLeRL, and again

G(w)=1q~RLW0wRLeRL=W0wRLeRLW0RLeRL.

Using the power series W0(x)=n=0(n)n1n!xn (obtained using Lagrange’s inversion formula), we get

G(w)=1W0RLeRLn=0nn1RLeRLnn!wn,

whence P{W=n|extinction}=nn1RLeRLnn!W0RLeRL.

While this could be used to establish a probabilistic cut-off to characterize small vs, large epidemics in the Poisson case (i.e. we could define an outbreak as large if its size, say Nout, is such that P{W=Nout|extinction}<ɛ for some small ɛ>0), we will instead use the mean size of a small epidemic to make the distinction (i.e. we will define an epidemic as large if its size is greater than some multiple of the mean small size).

E.4. The expected final size of a small outbreak

We thus turn our attention to computing the expected final size of an outbreak, which we recall is G(1) (n.b. G(1)=1). Implicitly differentiating (E.1), we have

G(w)=Fext(G(w))+wFext(G(w))G(w),

whence

G(w)=Fext(G(w))1wFext(G(w))

and G(w)=11Fext(1). Finally, Fext(z)=F(q~z), so Fext(1)=F(q~), so

G(1)=11F(q~)

(n.b. in Appendix A, we show that F(q~)<1). Recalling that G(w) gives the second factorial moment of the outbreak size, one may proceed similarly to show that the variance in the outbreak size is

q~F(q~)+F(q~)(1F(q~))(1F(q~))3.

In the Poissonian case, we have F(z)=RLeRL(z1). Recalling that W0(x)eW0(x)=x, this gives us

F(q~)=RLeRL(q~1)=RLeRLeW0RLeRL=W0RLeRL=RLq~,

whereas F(q~)=RLF(q~)=RLq~2. In particular, the mean (standard deviation) of the final size is approximately 1.25 (0.62) and 1.0018 (0.043) for our small and large values for RL, 2.87 and 8.44. Applying the 99% rule, we take outbreaks of 3 or fewer individuals to be small.

In the negative binomial case, we have F(z)=1p1pzk, where p=RLRL+k (see Appendix B). No exact solution is available for q~, but we can compute

F(z)=k1p1pzk1p(1p)(1pz)2=1p1pzkkp1pz=F(z)kp1pz (E.2)

and

F(q~)=F(q~)kp1pq~=kpq~1pq~=RLkq~RL(1q~)+kRLq~.

n.b. this inequality is not sufficient to show that the expected final size with the negative binomial distribution is smaller than that for the Poisson distribution, as the extinction probability q~ is larger for the former than for the latter (Lloyd-Smith et al., 2005). We can, however, find a condition under which this is true: recall from Appendix A that q~<q~, where F(q~)=1. We will use this to obtain an upper bound on q~. Using (E.2), we have

1pq~kp=F(q~)=1p1pq~k.

Substituting p=RLRL+k yields

RL(1q~)+kkRL=kRL(1q~)+kk,

whence

1RL=kRL(1q~)+kk+1=(F(q~))k+1k.

Since q~q~ and F(z) is increasing, we have F(q~)F(q~)=q~, giving us q~RL11k. Now, the final size of the Poisson process is larger than that for the negative binomial if and only if

RLkq~RL(1q~)+kW0RLeRL.

RLkxRL(1x)+k is an increasing function of x, so the left hand side is bounded above by RLkq~RL(1q~)+k=kRL1kRL+kRL1k, which is in turn less than kRL1kRL+k1, giving us the sufficient condition

kRL1kRL+k1W0RLeRL.

Solving for k (details omitted),

klnRL(RL1)(RL1)W0lnRLRL1RL1W0RleRL(RL1)lnRL.

The right hand side is decreasing on RL[1,) and greater than 0.54 for RL9, so we conclude that the final size under a Poisson distribution is an upper bound for the final size using a negative binomial distribution for the parameters used here (RL=2.87 and RL=8.44, k=0.5).

Finally, we note that in the burst model, N=ν1, where ν1 is an arbitrary distribution with expected value RL, and we cannot compute anything further without specifying the law of ν1; here, we took ν1 to be negative binomial distribution above, so that the final size with Poisson distribution remains an upper bound.

References

  1. Adam Dillon C., Wu Peng, Wong Jessica Y., Lau Eric H.Y., Tsang Tim K., Cauchemez Simon, Leung Gabriel M., Cowling Benjamin J. Clustering and superspreading potential of SARS-CoV-2 infections in Hong Kong. Nature Med. 2020;26(11):1714–1719. doi: 10.1038/s41591-020-1092-0. [DOI] [PubMed] [Google Scholar]
  2. Althouse Benjamin M., Wenger Edward A., Miller Joel C., Scarpino Samuel V., Allard Antoine, Hébert-Dufresne Laurent, Hu Hao. 2020. Stochasticity and heterogeneity in the transmission dynamics of SARS-CoV-2. arXiv:2005.13689 [physics, q-bio], arXiv:2005.13689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. 2020. In re Von Staich, on Habeas Corpus (A160122) https://www.courts.ca.gov/opinions/documents/A160122.PDF. (Accessed 29 January 2021). Secondary URL: https://law.justia.com/cases/california/court-of-appeal/2020/a160122.html. [Google Scholar]
  4. 2020. Joint Case Management Conference Statement (Case 4:01-cv-01351-JST Document 3477) https://rbgg.com/wp-content/uploads/Dkt-3477-PLATA-Joint-CMC-Statement-11-04-20-489-ovr.pdf. (Accessed 29 January 2021) [Google Scholar]
  5. Arif Billah Md., Mamun Miah Md., Nuruzzaman Khan Md. Reproductive number of coronavirus: A systematic review and meta-analysis based on global level evidence. PLoS One. 2020;15(11):1–17. doi: 10.1371/journal.pone.0242128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Ball Frank, Mollison Denis, Scalia-Tomba Gianpaolo. Epidemics with two levels of mixing. Ann. Appl. Probab. 1997;7(1):46–89. [Google Scholar]
  7. Bartlett, M.S., 1956. Deterministic and stochastic models for recurrent epidemics. In: Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, vol. 4, pp. 81–108.
  8. California Department of Corrections and Rehabilitation . 2021. COVID-19 Screening and testing for patient movement. https://www.cdcr.ca.gov/covid19/wp-content/uploads/sites/197/2021/01/COVID-19-Screening-and-Testing-Matrix-Final-21-01-08.pdf. (Accessed 4 February 2021) [Google Scholar]
  9. California Department of Corrections and Rehabilitation . 2021. Population COVID-19 tracking. https://www.cdcr.ca.gov/covid19/population-status-tracking/. (Accessed 26 March 2021) [Google Scholar]
  10. California Department of Corrections and Rehabilitation . 2021. Weekly report of population as of midnight March 24, 2021. https://www.cdcr.ca.gov/research/wp-content/uploads/sites/174/2021/03/Tpop1d210324.pdf. (Accessed 26 March 2021) [Google Scholar]
  11. Corless R.M., Gonnet G.H., Hare D.E.G., Jeffrey D.J., Knuth D.E. On the LambertW function. Adv. Comput. Math. 1996;5(1):329–359. [Google Scholar]
  12. Covid Behind Bars . 2021. UCLA Law COVID behind bars data. https://github.com/uclalawcovid19behindbars/data. (Accessed 26 March 2021) [Google Scholar]
  13. Crump K.S., Mode C.J. A general age-dependent branching process. I. J. Math. Anal. Appl. 1968;24(3):494–508. [Google Scholar]
  14. Crump K.S., Mode C.J. A general age-dependent branching process. II. J. Math. Anal. Appl. 1969;25(3):8–17. [Google Scholar]
  15. Ferretti Luca, Ledda Alice, Wymant Chris, Zhao Lele, Ledda Virginia, Abeler-Dörner Lucie, Kendall Michelle, Nurtay Anel, Cheng Hao-Yuan, Ng Ta-Chou, Lin Hsien-Ho, Hinch Rob, Masel Joanna, Marm Kilpatrick A., Fraser Christophe. Social Science Research Network; Rochester, NY: 2020. The Timing of COVID-19 Transmission: SSRN Scholarly Paper ID 3716879. [Google Scholar]
  16. Fortuna Lisa.R., Tolou-Shams Marina., Robles-Ramamurthy Barbara., Porche Michelle.V. Inequity and the disproportionate impact of COVID-19 on communities of color in the united states: The need for a trauma-informed social justice response. Psychol. Trauma. 2020 doi: 10.1037/tra0000889. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Franco-Paredes Carlos, Ghandnoosh Nazgol, Latif Hassan, Krsak Martin, Henao-Martinez Andres F., Robins Megan, Barahona Lilian Vargas, Poeschla Eric M. Decarceration and community re-entry in the COVID-19 era. Lancet Infectious Diseases. 2020 doi: 10.1016/S1473-3099(20)30730-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Jagers P. Wiley; London: 1975. Branching Processes with Biological Applications. [Google Scholar]
  19. Jagers P. Stabilities and instabilities in population dynamics. J. Appl. Probab. 1992:770–780. [Google Scholar]
  20. Lee Ian.J., Ahmed Nasar.U. The devastating cost of racial and ethnic health inequity in the covid-19 pandemic. J. National Med. Assoc. 2021;113(1):114–117. doi: 10.1016/j.jnma.2020.11.015. [DOI] [PubMed] [Google Scholar]
  21. Lloyd-Smith J.O., Schreiber S.J., Kopp P.E., Getz W.M. Superspreading and the effect of individual variation on disease emergence. Nature. 2005;438(7066):355–359. doi: 10.1038/nature04153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Miller Leila. Los Angeles Times. Section; California: 2021. 40% of inmates in California’s corrections system have been vaccinated for COVID-19. [Google Scholar]
  23. Office of the Inspector General . COVID–19 Review Series, Part Two: The California Department of Corrections and Rehabilitation Distributed and Mandated the Use of Personal Protective Equipment and Cloth Face Coverings; However, Its Lax Enforcement Led to Inadequate Adherence to Basic Safety Protocols. 2020. https://www.oig.ca.gov/wp-content/uploads/2020/10/OIG-COVID-19-Review-Series-Part-2-%E2%80%93-Face-Coverings-and-PPE.pdf. (Accessed 29 October 2020) [Google Scholar]
  24. Office of the Inspector General . 2021. COVID–19 Review series, part three: california correctional health care services and the California department of corrections and rehabilitation caused a public health disaster at San Quentin State prison when they transferred medically vulnerable incarcerated persons from the California Institution for men without taking proper safeguards. https://www.oig.ca.gov/wp-content/uploads/2021/02/OIG-COVID-19-Review-Series-Part-3-%E2%80%93-Transfer-of-Patients-from-CIM.pdf. (Accessed 4 February 2021) [Google Scholar]
  25. Okonkwo Nneoma E., Aguwa Ugochi T., Jang Minyoung, Barré Iman A., Page Kathleen R., Sullivan Patrick S., Beyrer Chris, Baral Stefan. COVID-19 and the US response: accelerating health inequities. BMJ Evidence-Based Med. 2020 doi: 10.1136/bmjebm-2020-111426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Picard Philippe, Lefevre Claude. A unified analysis of the final size and severity distribution in collective reed-frost epidemic processes. Adv. Appl. Probab. 1990;22(2):269–294. [Google Scholar]
  27. Puglisi Lisa B., Malloy Giovanni S.P., Harvey Tyler D., Brandeau Margaret L., Wang Emily A. Estimation of COVID-19 basic reproduction ratio in a large urban jail in the United States. Ann. Epidemiol. 2020 doi: 10.1016/j.annepidem.2020.09.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Susswein Zachary, Bansal Shweta. Characterizing superspreading of SARS-CoV-2: from mechanism to measurement. MedRxiv. 2020 https://www.medrxiv.org/content/early/2020/12/11/2020.12.08.20246082. [Google Scholar]
  29. The Amend Project . 2020. Urgent memo COVID-19 outbreak: San quentin prison. https://amend.us/wp-content/uploads/2020/06/COVID19-Outbreak-SQ-Prison-6.15.2020.pdf. (Accessed 29 January 2021) [Google Scholar]
  30. The Marshall Project . 2020. 1 in 5 prisoners in the U.S. has had COVID-19. https://www.themarshallproject.org/2020/12/18/1-in-5-prisoners-in-the-u-s-has-had-covid-19, Published in collaboration with The Associated Press. (Accessed: January 29, 2021) [Google Scholar]
  31. The New York Times . 2021. Coronavirus in the United States: Latest map and case count. https://www.nytimes.com/interactive/2020/us/coronavirus-us-cases.html. (Accessed 29 January 2021) [Google Scholar]

Articles from Epidemics are provided here courtesy of Elsevier

RESOURCES