Significance
As health officials face another wave of COVID-19, they require estimates of the proportion of infected cases that develop symptoms, and the extent to which symptomatic and asymptomatic cases contribute to community transmission. Recent asymptomatic testing guidelines are ambiguous. Using an epidemiological model that includes testing capacity, we show that many infections are nonsymptomatic but contribute substantially to community transmission in the aggregate. Their individual transmissibility remains uncertain. If they transmit as well as symptomatic infections, the epidemic may spread at faster rates than current models often assume. If they do not, then each symptomatic case generates, on average, a higher number of secondary infections than typically assumed. Regardless, controlling transmission requires community-wide interventions informed by extensive, well-documented asymptomatic testing.
Keywords: COVID-19, testing submodel, asymptomatic transmission, epidemiological model, epidemiological parameter estimates
Abstract
The contributions of asymptomatic infections to herd immunity and community transmission are key to the resurgence and control of COVID-19, but are difficult to estimate using current models that ignore changes in testing capacity. Using a model that incorporates daily testing information fit to the case and serology data from New York City, we show that the proportion of symptomatic cases is low, ranging from 13 to 18%, and that the reproductive number may be larger than often assumed. Asymptomatic infections contribute substantially to herd immunity, and to community transmission together with presymptomatic ones. If asymptomatic infections transmit at similar rates as symptomatic ones, the overall reproductive number across all classes is larger than often assumed, with estimates ranging from 3.2 to 4.4. If they transmit poorly, then symptomatic cases have a larger reproductive number ranging from 3.9 to 8.1. Even in this regime, presymptomatic and asymptomatic cases together comprise at least 50% of the force of infection at the outbreak peak. We find no regimes in which all infection subpopulations have reproductive numbers lower than three. These findings elucidate the uncertainty that current case and serology data cannot resolve, despite consideration of different model structures. They also emphasize how temporal data on testing can reduce and better define this uncertainty, as we move forward through longer surveillance and second epidemic waves. Complementary information is required to determine the transmissibility of asymptomatic cases, which we discuss. Regardless, current assumptions about the basic reproductive number of severe acute respiratory syndrome coronavirus 2 (SARS-Cov-2) should be reconsidered.
Since the emergence of the novel coronavirus in December 2019 (1), the COVID-19 pandemic has resulted in over 16 million cases and 600,000 deaths worldwide (2). Schools and universities in the United States are gradually reopening amid concerns that a second wave of the epidemic may reemerge in the fall and winter of 2020.
As they craft testing policies and intervention strategies to mitigate a second wave, public health officials need to better understand the role that symptomatic and asymptomatic individuals play in the community transmission of COVID-19 and in the development of herd immunity to the disease. However, fundamental epidemiological questions remain poorly understood, including what fraction of cases are symptomatic and how well asymptomatic cases can transmit relative to symptomatic ones. These questions are especially urgent given ambiguity in recent Centers for Disease Control and Prevention (CDC) guidelines regarding the testing of asymptomatic individuals (3).
Answering these questions can also provide further insight on the basic reproductive number of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), and how the virus would spread in a population in the absence of interventions. This number, known as , is defined as the mean number of secondary cases arising from a primary case in the absence of immunity, and is estimated on the basis of a particular epidemiological model. Mathematical models for the population dynamics of COVID-19 incorporate different features such as asymptomatic and presymptomatic transmission, superspreading, or heterogeneity in susceptibility. A considerable range of estimates has been reported, ranging from at least 1.5 (4) to 5.7 (5) in Wuhan. A much narrower range, between two and three, is frequently cited in the popular press, or assumed when simulating models (6) or fitting these to data (7, 8). This assumption may be based on the dynamics of COVID-19 in regions that implemented interventions early (9–13). A more precise estimate of from a city where substantial transmission was occurring prior to intervention, such as New York City, would provide a relevant baseline. Furthermore, if “superspreading” by a small fraction of symptomatic infections fuels COVID-19 transmission, a precise estimate of the mean number of secondary cases arising from such an individual may be just as valuable. A model that precisely estimates the fraction of symptomatic cases may help epidemiologists discern whether either the overall or symptomatic reproductive numbers are higher than assumed.
The probability that a COVID-19 infection is symptomatic is difficult to estimate (14), and a wide range of values have been suggested (14–16). Estimates from cruise ship outbreaks (17), Wuhan evacuees (18), long-term care facilities (19), and contact tracing of index cases (15) may not be representative of the general population. Increases in the testing capacity for COVID-19 over time (9, 20, 21) make population-level estimation of this probability difficult due to confounding with other parameters such as the reporting, hospitalization, and fatality rates. When the testing capacity is limited in the early stages of an outbreak, severe cases are more likely to be tested, which can bias estimates of the probability that an infection is symptomatic and of the fatality rate. Changes in testing capacity over time also confound the definition itself of asymptomatic individuals in transmission models, when these are not differentiated from unreported cases. These changes can also bias the reported deaths attributed to COVID-19.
These challenges can be improved upon by explicitly incorporating changes in testing capacity into an epidemiological process model. While some early models of the COVID-19 outbreak in Wuhan attempted to take into account changes in testing capacity (21) or differences in reporting rate during periods of the epidemic (9), the limited information on these trends in Wuhan meant that they had to be estimated on a coarse temporal scale (2- to 3-wk intervals) and had to be inferred along with other parameters in the model. In the United States, many states and municipalities such as New York City (22, 23) have published daily estimates of the number of total COVID-19 tests conducted, together with the number of positive COVID-19 tests. While these data are often used by public health officials to gauge the spread of the COVID-19 outbreak, they have yet to be incorporated explicitly into epidemiological models.
We present an epidemiological model that incorporates RT-PCR testing as an integral process informed by empirical levels. The explicit consideration of testing allows us to clearly define asymptomatic individuals as those that will never transition to displaying symptoms, and to differentiate them from those who have been unreported because they were not tested. We fit the model to PCR-confirmed COVID-19 cases in New York City, using publicly available data provided by the New York State Department of Public Health (23). The resulting model can clearly delineate symptomatic and asymptomatic infections independently from the reporting rate. We subsequently fit the model to estimates of prior exposure obtained from a recent serological study in New York City (24) to further constrain inference results.
Our model obtains a precise estimate for the symptomatic proportion of COVID-19 cases. We show that most COVID-19 infections are asymptomatic, and that these asymptomatic infections together with presymptomatic ones substantially drive community transmission, contributing 50% or more of the total force of infection. Furthermore, depending on the transmissibility of individual asymptomatic cases relative to symptomatic ones, either the overall reproductive number or the symptomatic reproductive number may be higher than typically assumed. Our results highlight the importance of testing and contact tracing of asymptomatic individuals, and of making these data publicly available as health officials prepare for and manage a second wave.
Results
We present a stochastic epidemiological model (Fig. 1) that explicitly incorporates daily changes in testing capacity and the lag between sampling and testing (see Materials and Methods). The underlying model, referred to hereafter as the SEPIAR model (Fig. 1A), has a susceptible–exposed–infectious–recovered (SEIR) structure with compartments for both severe (hospitalized) and nonsevere symptomatic infections as well as presymptomatic (P) and asymptomatic (A) infections (thus SEPIAR). We also consider two nested simplified versions: one with no presymptomatic transmission (the SEIAR model; Fig. 1B) and one with no asymptomatic transmission (the SEPIR model; Fig. 1C). By varying specific parameters weighting the transmission rate of P and A relative to that of symptomatic individuals, we can continuously move across these two extreme structures. Daily reports of the number of tests conducted in New York City are fed in as a covariate in the testing submodel (SI Appendix). The model takes into account CDC priorities in sampling and testing: All hospitalized cases are sampled and eventually tested, while nonsevere symptomatic individuals are sampled and tested only if excess capacity is available at the time of sampling. We also incorporate the retesting of hospitalized individuals as they leave the hospital. This model is fit to observed cases in New York City from March 1, 2020 to June 1, 2020 and to serological estimates of herd immunity in New York City from March 8, 2020 to April 19, 2020 (Materials and Methods and SI Appendix). We compare the full model with the two nested simplified versions. Although all three model structures are supported by the case data, the model with no asymptomatic transmission is not supported when these data are considered in conjunction with serology information (SI Appendix, Table S2).
To evaluate the strength of transmission in asymptomatic cases relative to symptomatic cases, we construct a Monte Carlo profile using the full SEPIAR model (SI Appendix, Fig. S5). We isolate parameter combinations from the profile that are supported by the case and serology data, and examine the values of those combinations. Particular parameters of interest that we focus on include the proportion of cases that are symptomatic, , the ratio of the transmission rate of asymptomatic individuals to that of symptomatic individuals, , and the reproductive numbers. We use to denote the symptomatic reproductive number (i.e., the mean number of secondary cases arising from each primary symptomatic case), and use to denote the overall reproductive number for the model (i.e., the mean number of cases arising from a primary infection, where the average considers all types of infections).
The proportion of COVID-19 cases that are symptomatic is well identified, with a CI ranging from 12.9 to 17.4% (Fig. 2). Although a wide range of parameter combinations for the proportion of symptomatic infection are supported by the case data on its own, a much narrower estimate is obtained when the case and serology data are considered together (Fig. 2 A and B). Within this range, estimates of herd immunity are consistent with the dynamics of observed serology (Fig. 2C), in particular, the rapid rise in seroprevalence over March and April 2020. We validated the inference pipeline by fitting the model to two simulated trajectories from two parameter combinations that are both supported by the case, serology, and testing data but correspond to regimes with strong or weak asymptomatic transmission. As shown in SI Appendix, Fig. S11B, models fit to both of these trajectories instead of observed cases are able to accurately estimate and recover the proportion of symptomatic cases used in the simulations.
The overall reproductive number or symptomatic reproductive number may be larger than is often assumed. From our profile of the relative asymptomatic transmission rate , we identify two main regimes of transmission that are supported by both the case and serology data (Fig. 3), in which either is higher than the two to three range often assumed for COVID-19. Notably, we find no parameter combinations in which both reproductive numbers are below three and fall within this range.
In the first regime, asymptomatic individuals transmit at almost the same rate as symptomatic individuals. That is, is large, even close to one in some parameter combinations. The overall reproductive number takes on values between 3.2 and 4.4, and asymptomatic cases substantially contribute to the overall force of infection (Fig. 4).
In the second regime, asymptomatic individuals transmit at very low rates relative to symptomatic individuals, with estimates of close to zero or, in some parameter combinations, even equal to zero. Concomitantly, the symptomatic reproductive number is much higher than frequently assumed, taking on values between 3.9 and 8.1. Nevertheless, even in this regime, presymptomatic and asymptomatic infections together contribute at least 50% of the overall force of infection at the peak of the outbreak.
In both regimes, presymptomatic individuals transmit at almost the same rate as symptomatic individuals, with estimates of close to one, also making a substantial contribution to the overall force of infection (Fig. 4).
We also observe a third regime in which both reproductive numbers are higher than assumed, but, in this regime, presymptomatic individuals transmit at a very low rate, with close to zero. Several combinations in this regime can be observed in the top right corner of Fig. 3C and in SI Appendix, Fig. S7. This is also the regime obtained in SI Appendix, Fig. S6 if one uses the SEIAR model, which assumes that presymptomatic individuals do not transmit (i.e., is fixed at zero). Given previous evidence of presymptomatic transmission of COVID-19 (25, 26), we focus on the two regimes which incorporate substantial presymptomatic transmission.
In line with previous studies (27), we estimate a large value for the initial number of infected and incubating individuals with COVID-19 in New York City at the start of the simulation on March 1. Parameter combinations that were supported by the case and serology data ranged from 9,000 to 18,000 initial infected individuals and 44,000 to 72,000 exposed individuals. A key question to consider when evaluating the plausibility of this magnitude of undetected infections is whether it is consistent with no signal of an anomalous number of hospitalizations. In other words, would this large rise in early infections result in a corresponding rise in COVID hospitalizations that may not have been detected as COVID related? We examine this question by comparing simulated daily hospitalizations from our fitted model with observed COVID-19 daily hospitalizations in New York City, as well as with syndrome surveillance reports of respiratory illness from emergency departments in New York City hospitals (Fig. 5), which we can use as an indicator for a rise in undetected hospitalizations. We show that a scenario with a large number of initial infections on March 1 is indeed consistent with the time at which observed COVID-19 hospitalizations peak, providing further support for this contention. We also find that the imposition of social distancing on March 17 and the stay-at-home order on March 22 in New York City resulted in a substantial decrease in the initial transmission rate. Parameter estimates for the ratio of the postintervention transmission rate to the preintervention transmission rate () ranged from 0.134 to 0.240, corresponding to a 75.98 to 86.62% reduction in the strength of transmission after the intervention.
Testing strategies and capacity can substantially influence estimates of the infection fatality ratio, or IFR (SI Appendix, Fig. S12). This metric of outbreak severity is generally defined as the total number of deaths divided by the total number of cases. In practice, this ratio is calculated by dividing the total number of confirmed deaths by the total number of observed cases. However, depending on the testing strategy used and the testing capacity available, not all cases will be observed. Using parameters from the fitted SEPIAR model that are supported by case and serology data, we generate a range of IFRs that would be expected under two different testing strategies. Since we do not model deaths from COVID-19 hospitalized patients in our model, we estimate the proportion of hospitalizations that result in death using confirmed COVID-19 hospitalizations and deaths in New York City during the study period. In the first testing strategy, all cases are observed; in the second one, all symptomatic cases but no asymptomatic ones are observed (red and blue shaded histograms, respectively, in SI Appendix, Fig. S12). Testing only symptomatic cases can result in at least a fourfold increase in the IFR that is calculated. Limitations in testing capacity may also impact the estimated IFR. If the testing capacity is limited at the start of the outbreak, the observed IFR measured during the epidemic (the orange vertical line in SI Appendix, Fig. S12) will be higher than the IFR expected if all symptomatic cases were tested. Variation in individual model parameters within the range supported by the case and serology data does not result in substantial variation in the IFR calculated for each testing scenario.
Discussion
With a transmission model that incorporates daily changes in RT-PCR testing capacity and is fit to observed case data and serology, we estimate that the probability that an exposed individual develops symptoms is low. Since asymptomatic infections represent a large fraction of the infected population, they contribute substantially to community transmission in the aggregate together with presymptomatic cases, even when they individually transmit at a low per capita rate. They also contribute substantially to building herd immunity.
We use testing information to estimate the probability that a new case will become symptomatic without the biases present in cruise ship (17) and traveler studies (18), or the parameter confounding present in citywide models. Early cruise ship and evacuee studies found that most COVID-19 cases were symptomatic. However, given the small number of total infections (18, 28), evacuee studies may overestimate the fraction of symptomatic cases if infections in observed severe cases (29) last longer (30) than in asymptomatic ones. Cruise ship studies may likewise overestimate this parameter if asymptomatic cases, which were tested later than symptomatic cases (17), recover prior to testing. Citywide models, which avoid these biases, indicate that most COVID-19 cases are undetected (9). They confound, however, the fraction of symptomatic cases with the reporting or hospitalization rate, as they neglect daily testing changes, and cannot distinguish between asymptomatic and undetected cases. The alternative approach of fitting the models to death data is not necessarily exempt from biases in parameter estimates, due to changes in hospital capacity over time (31, 32), comorbidities in host populations (33, 34), and the long delay between the onset of infection and death (35). Furthermore, the underreporting of cases can also bias the assumed case fatality rate (32). Our approach resolves these issues by incorporating daily testing capacity as part of the model when estimating parameters from serology and case data. Models without explicit consideration of this capacity have difficulty estimating the proportion of cases that are symptomatic from these data (36), suggesting that including testing is crucial.
If asymptomatic individuals transmit at a high rate, then the overall reproductive number preintervention in New York City is larger than the two to three range often assumed in models (6–8) and media reports (11, 37–39) based on early estimates from Wuhan (4, 40, 41). Furthermore, we find no supported parameter combinations in which both the overall and symptomatic reproductive numbers fall within this range. Early Wuhan models may underestimate by ignoring presymptomatic transmission and making restrictive assumptions, including that COVID-19 has the same incubation period and serial interval as SARS-CoV (4, 40, 41), or that most cases are symptomatic (42). Early Wuhan case data may be insufficient to precisely estimate without making these assumptions (43–45). Thus, models and intervention strategies should consider that the overall may be higher than three in certain locations (5, 46).
If asymptomatic individuals are unlikely to transmit and do so with low probability, then the small fraction of cases that are symptomatic are transmitting at a high rate, in line with recently reported “superspreading” events (47, 48). Superspreading events are instances in which a single infected individual infects a large number of people. These events can be hard to measure on a population level in the absence of detailed transmission data. In classic superspreading dynamics, most primary cases do not result in many secondary cases, while a subset of primary cases result in a large number of secondary cases (8, 49, 50). This heterogeneity in the reproductive number is, indeed, what we observe when asymptomatic individuals transmit poorly. Our model is admittedly a coarse description of this heterogeneity, since it incorporates only two different classes of infections, symptomatic or asymptomatic. Future models can build upon this framework with additional classes for age, socioeconomic status, location, or susceptibility (51), using fine-scale case data. These models could elucidate how infections in hospitals or home care settings may be contributing to the high of symptomatic cases. However, our results also indicate that, even when the symptomatic reproductive number is large, presymptomatic and asymptomatic infections contribute together to at least 50% of the overall force of infection.
It follows that community-wide interventions that account for nonsymptomatic cases should be crucial for mitigating outbreaks. If asymptomatic cases transmit poorly, then concurrent additional interventions targeting superspreading symptomatic infections may help reduce community transmission.
Resolving the nonidentifiability of the relative strength of asymptomatic transmission () would require extensive community testing and contact tracing of asymptomatic cases. Community testing on its own can provide an estimate of the total proportion of cases that are asymptomatic, but it may not provide insight on whether those asymptomatic individuals can transmit and how well they can transmit. Symptomatic and asymptomatic individuals have similar viral loads (52), but a high viral load does not necessarily imply high transmissibility. One limitation of early contact tracing studies is that estimates of transmissibility may oversample symptomatic index cases and contacts, particularly during the early phase of an epidemic (15, 53). In certain studies, only symptomatic contacts are further investigated. Ideally, one would use frequent systematic community testing for studies identifying both symptomatic and asymptomatic potential index cases for further contact tracing and testing of all contacts regardless of symptoms. Furthermore, fixing the probability that an infection becomes symptomatic based on the results of serology-informed models such as ours could increase the precision with which contact tracing studies can estimate the strength of asymptomatic transmission. Colleges that are currently reopening may be ideal test locations for this kind of combined approach, which may also help detect superspreading events.
While it cannot capture all testing intricacies, our framework illustrates how transmission models can incorporate daily changes in testing capacity and identify parameters that were previously difficult to estimate, such as the probability that an infection will become symptomatic. While we do not explicitly denote differences between laboratories, hospitals, or diagnostic tests, we account for this variation by including additional measurement noise after simulating the RT-PCR testing process. We also consider how sampling individuals without COVID-19 may deplete the daily testing capacity. In particular, hospitalized individuals with non–COVID-19–related severe respiratory disease may have a higher priority for testing than nonsevere COVID-19 cases. Our model uses syndrome surveillance reports (54–57) of respiratory illness from New York City hospitals in previous years, along with weekly influenza cases, to estimate the number of non–COVID-19 severe respiratory cases that were tested. The statistical model assumes that, in every year, only a fraction of influenza cases are confirmed and that noninfluenza respiratory cases exhibit seasonality. We use flu and syndrome surveillance estimates from previous years to estimate the fraction of influenza cases that are not confirmed and the shape of the seasonality on non–influenza-related respiratory illness. During the 2020 epidemic, COVID-19 mitigation measures that reduced urban mobility may have also reduced transmission of other respiratory diseases such as influenza. The model captures some of this decrease, since the number of severe non-COVID respiratory cases is a function of the number of confirmed flu cases in the same season. The model thus captures the impact of decreased flu cases in 2020 due to changes in mobility patterns. This framework could be used in conjunction with other epidemiological models, and extended to other municipalities or countries with location-specific testing priorities, retesting procedures, or diagnostic tests.
In cities where mobility information is available, the statistical model may include overall population mobility as a covariate. In other cities that report the daily number of hospitalized individuals with COVID-19 symptoms who were tested each day, one could subtract from this number the total COVID-19 hospitalizations estimated by the epidemiological model, to obtain the number of non–COVID-19–positive hospitalized cases that were tested. Depending on the information available for each location, future iterations of this framework could explicitly incorporate different diagnostic tests and their respective sensitivities and specificities. It could also be used to examine how altering testing strategies such as switching from symptom-based testing to community testing may improve transmission parameter inference and efficacy of control efforts. This may be an important consideration for countries that have limited testing capacity but are still in the midst of the first pandemic wave, such as India.
Given the potential role of population density and socioeconomic status on contact rates and access to care, there may be considerable heterogeneity in infection rates and seropositivity in different neighborhoods of New York City. While overdispersion in measurement error can implicitly account for this variation in our implementation, future formulations could do so explicitly with a spatial model of transmission between neighborhoods and within specific settings such as hospitals and home care networks. This level of resolution would require, however, observed cases, testing capacity, and hospitalizations within each unit. Incorporating human movement estimates into the model could enable analysis of how the infectious period of the virus may impact the clustering of cases within particular neighborhoods.
Future studies can investigate the impact of including a testing submodel on parameter estimation and the level of detail required in such a submodel. For example, one could compare the results of parameter estimation from fitting a given epidemiological model with a queue-based testing model to those that assume a fixed reporting rate and a delay in the reporting of cases. We expect the former to exhibit more uncertainty when informed by surveillance data from the beginning of the pandemic, when little testing capacity is available, but to reduce this uncertainty as the time series is extended and this capacity changes. Models that assume a fixed reporting rate may underestimate the range in uncertainty of epidemiological parameters that are heavily informed by the early part of the time series, and may even underestimate the values of the parameters themselves. Models with a queue-based testing submodel may obtain more-precise estimates of parameters that impact the end of the outbreak, such as those related to the depletion of susceptible individuals, acquisition of immunity, or, in our model, the impact of social distancing and stay-at-home orders on overall transmission. Even if including some form of testing model that takes into account changes in capacity is key to obtaining more precise parameter estimates, simpler versions of our implementation may be sufficient. For example, the more generalizable components, such as the testing of hospitalized individuals, may be more important than taking into account their resampling as they leave the hospital. Simplifying the testing model based on model selection analyses can facilitate wider adaption of the testing framework to other cities, countries, or time periods.
Future versions of the model could also capture heterogeneity in the severity of infection and the acquisition of immunity. When fitting the model, we treat seropositivity as a reasonable correlate of herd immunity. The assay used to measure seroprevalence in New York City (24) elicits neutralizing antibodies (58) and can detect seroconversion in severe, mild, and asymptomatic cases (58). We assume that this herd immunity does not decay over the course of the study, given the short duration of time from March to June 2020 and the observation that antibody responses can persist for at least 5 mo following the start of the pandemic in New York City (59). The immune response to the SARS-CoV virus consists of several components including antibody (60) and T-cell mediated (61) responses, and heterogeneity in particular pathways of the immune response can influence the severity of infections (62, 63). The severity of infection may, in turn, impact the type, strength, and duration of the immunity acquired (64, 65). As future experimental studies determine how each immune response can mitigate infection, viral shedding, and transmission, relevant aspects of the dynamics of host immunity can be incorporated into the model and corroborated with data. Understanding how host heterogeneity in immune responses may impact the infection severity and herd immunity may be valuable when considering long-term vaccination policies.
While a model with explicit within-host dynamics would be challenging to fit using case data, relevant aspects could be incorporated in several ways. For example, the model could include additional classes of infection corresponding to levels of severity. Alternatively, a distribution of susceptibility or immunity could be used to capture heterogeneity in the immune response between individuals. Finally, the model could incorporate functional forms for the acquisition and waning of immunity that are fixed based on experimental observations of serology and T-cell dynamics. Fitting these models to times series from multiple locations will improve inference, but the testing capacity and strategy in each location should be taken into account when doing so.
Our finding that many individuals were already infected by March 1 is consistent with earlier estimates that community transmission began in February in New York City (27, 55, 66). Previous studies could not explain, however, why no substantial increase in COVID-19–like illness was observed prior to February 28 in syndrome surveillance data (55). Our simulations show that the lag between infection onset and hospitalization can explain this discrepancy. Even when initialized with many infected cases on March 1, simulated hospitalizations do not rise until several weeks later, concurrent with observed COVID-19 hospitalizations (Fig. 5). Most likely, the estimated initial conditions suggest multiple parallel foci of initiation of the epidemic with multiple importations of infections. Another suggested possibility is a dosage dependence effect, wherein the severity of an individual’s infection depends on the size of the virus population that the person becomes infected with during one or more transmission events, and hence on the overall viral load of COVID-19 in the community. In this scenario, early COVID-19 cases in February and early March would be less severe. This would be consistent with the syndrome surveillance data, where we see a rise, in early March, of respiratory infection reports in the emergency departments of hospitals, but do not yet see a rise in COVID-19 hospitalizations. This phenomenon might also explain why our model slightly underestimates the peak in daily hospitalizations, even though it correctly identifies the time and shape of that peak.
We show that testing capacity and strategy can substantially affect estimates of the IFR, a quantity that is frequently used by public health officials in assessing the severity of an outbreak. Our model ignores several factors, such as nonhospital deaths from COVID-19, which may increase the true IFR, and rising trends in hospital capacity and improved treatments, which may decrease it. Nevertheless, our results underscore the importance of considering testing strategy and capacity when interpreting literature estimates of the IFR.
In conclusion, explicit consideration of changes in testing capacity allows us to infer with certainty, from case and serology data, that most new COVID-19 cases do not become symptomatic. We also inferred that the overall or symptomatic reproductive number may be larger than often assumed, depending on how well asymptomatic cases can transmit. Despite this uncertainty, the strong consistent contribution to community transmission from cases without symptoms observed across scenarios supported by the data should be considered when formulating public health intervention strategies. Making available detailed information on testing policy and data on testing capacity over time will strengthen the ability of epidemiological models to learn from the past and inform us about the future.
Materials and Methods
We examine three different model structures that have been used to characterize COVID-19 dynamics in previous studies (Fig. 1). All models are modified versions of the traditional SEIR model (67). The first model, the SEPIR model (17, 68), is the most standard extension in which transitions are between a linear chain of compartments. Its formulation adds a compartment P for presymptomatic transmission. The second one, the SEIAR model (7, 9), differs conceptually in that it includes asymptomatic individuals rather than presymptomatic ones, and defines them as distinct, in the sense that they will never transition to exhibiting symptoms. This definition implicitly recognizes that there are essentially two classes of individuals in terms of susceptibility to disease and symptoms. The third structure for the SEPIAR model (6, 26) is a combination of the first two and includes them as nested, particular, cases.
All three models include a chain of exposed classes to incorporate the total time between the onset of infection and the onset of symptoms as gamma distributed (with mean 5.5 d and standard deviation 2.25 d) (69). Symptomatic individuals are subdivided into two sequential classes, and , for practical purposes, to follow their numbers before and after some of them transition to hospitalization. Individuals spend an average of 1/S days in and 1/ days in .
The parameter represents the reproductive number experienced by symptomatic individuals. We define a baseline preintervention transmission rate in symptomatic individuals, , by dividing by the average total time that nonsevere cases transmit with symptoms. We also define a postintervention transmission rate , which is equal to the preintervention transmission rate multiplied by a scaling factor . Low values of represent a substantial reduction in the transmission rate due to interventions. Social distancing guidelines were issued by New York City starting on March 17 (70, 71), and a stay-at-home order was issued which took effect on the evening of March 22 (72). Thus, prior to the imposition of social distancing, the transmission rate of symptomatic individuals in our models, , is equal to . From March 18 through March 22, decreases linearly from to . From March 23 onward, is equal to .
In all models, a fraction of exposed individuals become symptomatic. After an average of 5 d of transmission, symptomatic cases are hospitalized with probability . Symptomatic cases that are not severe enough to require hospitalization recover at rate . Hospitalized individuals recover at rate hv = 1/13 (30) and do not transmit while hospitalized. In practice, some individuals with severe COVID-19 symptoms that required hospitalization may not have been hospitalized, due to barriers to care. However, their contribution to community transmission is unlikely to have been substantial. These individuals would remain isolated at home during the period of severe infection and avoid nonhousehold contacts, while household contacts would have been exposed for a substantial time prior to the onset of severity. We assume a fixed population size for New York City of 8 million individuals (73).
Assumptions about which infected classes are infectious and how they contribute to the transmission rate allow us to reduce the SEPIAR model to the SEPIR or SEIAR models. Presymptomatic individuals transmit for an average of about a day [0.92 d (25)] at a transmission rate equal to the baseline transmission rate multiplied by a scaling factor b = . Asymptomatic infections transmit for an average of 5 d, equal to the average duration between the onset of symptoms and hospitalization in severe cases, at a transmission rate equal to the baseline rate multiplied by scaling factor .
The models are implemented numerically via an Euler approximation of the deterministic equations to which demographic stochasticity is added. Specifically, the number of individuals making state transitions from compartments with more than one exit is drawn from an Euler-multinomial distribution (74). The number of individuals making state transitions from compartments with only one exit is drawn from a binomial distribution.
Description of Testing Model.
The model takes into account daily changes in the testing capacity, using estimates of daily tests conducted in New York City from the New York State Department of Health (23), as well as the retesting of severe and nonsevere symptomatic cases prior to leaving the hospital or quarantine. We assume that there are two categories of cases—severe (hospitalized) cases and nonsevere cases subject to different testing priorities (75): the initial testing of new hospitalized COVID-19 cases (highest priority), the retesting of those individuals when they leave the hospital, the testing of new nonsevere symptomatic COVID-19 cases, and, finally, the retesting of those symptomatic cases (lowest priority). All severe COVID-19 cases after March 1 are sampled when they enter the hospital and eventually tested once enough capacity is available. We assume that symptomatic nonsevere cases are sampled at the same time, in the course of their infection, as severe cases. However, we assume that they are not tested if they recover before enough testing capacity is available. During the early stages of the epidemic, the CDC recommended test-based strategies to determine when to conclude home isolation or hospitalization (76). Accordingly, we assume that hospitalized cases are retested twice (over a 24-h period) after the average length of time in the hospital (13 d), while nonsevere cases are likewise retested twice after the end of a 14-d quarantine period.
We also take into account the potential for non–COVID-19 severe respiratory cases to be sampled in hospitals and tested (with the same priority as hospitalized COVID-19 cases). We use confirmed influenza cases (77) and syndrome surveillance reports of respiratory disease from emergency departments in New York City hospitals in previous years (78) to estimate the number of non–COVID-19 severe respiratory cases that may have been sampled (SI Appendix). We assume that the RT-PCR testing has a sensitivity of 90% (79), that testing takes 48 h (80), and that there is an additional negative-binomial distributed dispersion after the RT-PCR testing with standard deviation . This dispersion takes into account variation in sampling and testing protocols across laboratories and hospitals, as well as variation in the sensitivity and time required for different PCR assays.
Overview of the Model Fitting and Inference Strategy.
Unless otherwise mentioned, we fit the following parameters: the recovery rate for nonsevere symptomatic infections (); the scaling factors for asymptomatic, presymptomatic, and postintervention transmission (, , and ); the symptomatic probability () and the hospitalization probability (); the reproductive number for symptomatic cases (); the dispersion parameter for RT-PCR testing (); and the initial number of infected () and exposed () individuals at the start of the simulation on March 1, 2020. We use the iterated filtering algorithm MIF (81) within the R-package POMP (for partially observed Markov process models) to fit parameter combinations by likelihood maximization. The iterated filtering algorithm is specifically designed for fitting stochastic and nonlinear models with hidden variables in the presence of both process and measurement error. We apply the sequential Monte Carlo algorithm pfilter (82) to evaluate the likelihood of the final parameter combinations obtained with the computational search. Likelihoods are estimated by simulating state variables at each observation time from an underlying Markov process model, and then calculating the likelihood of each observation given the simulated value of the state variable and a measurement model. For the analysis of the full SEPIAR model, we generate a Monte Carlo profile (83) for the relative strength of asymptomatic transmission ().
For all resulting parameter combinations within two log-likelihood units of the maximum-likelihood estimates (MLE), we then calculate the likelihood with respect to serology, using seroprevalence data previously published by ref. 24 from a screening group representative of the general population, using plasma samples from patients at Mount Sinai Hospital in New York City. In the Mount Sinai study, random, deidentified, and cross-sectional samples were obtained over the course of the outbreak from patients at obstetrics and gynecology visits and deliveries and oncology-related visits, as well as hospitalizations due elected or planned surgeries, transplant surgeries, preoperative medical assessments, and related outpatient visits, cardiology office visits, or other regular office or treatment visits whose purpose was unrelated to COVID-19 (24). The assay used to measure seroprevalence (24) elicits neutralizing antibodies (58) and can detect seroconversion in severe, mild, and asymptomatic cases (58). We treat the seroprevalence measurement at each time point as a measure of short-term herd immunity in the population, specifically, of the proportion of the population that has already recovered from COVID-19 infection. We assume that this herd immunity does not decay over the course of the study, since antibodies have been shown to persist for at least 5 mo (59). We compare the seroprevalence at each time point from the serology data to the recovered fraction of the population R/N from simulated trajectories of the epidemiological model. When calculating the likelihood of each trajectory at each observation time with respect to the seroprevalence data, we assume that the number of people who seroconverted in the Mount Sinai study is drawn from a binomial distribution with equal to the value of R/N in the simulated trajectory at that time, and equal to the total number of people sampled. We sum the log-likelihoods across all observation time points and then average over all trajectories using the logmeanexp function in the R package POMP (82) to obtain a log-likelihood for each parameter combination with respect to the serology data. We isolate all combinations supported by the serology data that have log-likelihoods within two units of the MLE.
For each combination, we examine the proportion of cases that are symptomatic, , the reproductive number in symptomatic individuals, , and the overall reproductive number for the model, . We derive the following expression for using the next-generation matrix (84):
[1] |
Calculation of the IFR.
The IFR is frequently defined as the ratio of deaths to cases. Let represent the IFR with respect to all cases,
[2] |
This is equivalent to the proportion of all cases that result in death. We can estimate this quantity using the parameters from the fitted SEPIAR model. Recall that is the probability that a case is symptomatic, and is the probability that a symptomatic case becomes hospitalized. These quantities are equivalent to the proportion of all cases that are symptomatic, and the proportion of symptomatic cases that are hospitalized. Let represent the proportion of all hospitalizations that are fatal. Since this parameter is not fitted in the SEPIAR model, we estimate it from observed data by dividing the total number of confirmed COVID-19 deaths by the total number of confirmed COVID-19 hospitalizations in New York City during the study period. We use data updated on June 15, 2020 from the New York City Health Department COVID-19 Data Portal (22), and obtain an estimate of . We can then write an expression for using the parameters estimated with the fitted SEPIAR model,
[3] |
and obtain this quantity for each parameter combination supported by case and serology data (red histogram in SI Appendix, Fig. S12).
Let represent the IFR that would be estimated if all symptomatic cases were observed but no asymptomatic ones were observed. This is equivalent to the proportion of all symptomatic cases that result in death,
[4] |
(shown in the blue histogram of SI Appendix, Fig. S12).
We compare these two quantities with the observed IFR, calculated by dividing the total deaths by the total number of PCR-confirmed COVID cases in New York City during the study period (the orange line in SI Appendix, Fig. S12).
Additional Details.
Further details of the SEPIAR equations, testing model, Monte Carlo profile of the SEPIAR model, initial grid searches, and model comparison of the SEPIR and SEIAR models, and derivation of the overall reproductive number , are provided in SI Appendix.
Supplementary Material
Acknowledgments
R.S. was supported by an NSF Research Traineeship (1735359: NRT-INFEWS: Computational data science to advance research at the energy environment nexus). We thank Aaron King for his insightful discussions. This work was completed with resources and support provided by the University of Chicago’s Research Computing Center.
Footnotes
The authors declare no competing interest.
This article is a PNAS Direct Submission.
This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2019716118/-/DCSupplemental.
Data Availability.
Code used to simulate and fit the epidemiological model has been deposited in Github at https://github.com/pascualgroup/COVID_NYC_Epi_Model (85).
References
- 1.Paules C. I., Marston H. D., Fauci A. S., Coronavirus infections—More than just the common cold. JAMA 323, 707–708 (2020). [DOI] [PubMed] [Google Scholar]
- 2.Dong E., Du H., Gardner L., An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 20, 533–534 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Centers for Disease Control and Prevention , Overview of testing for SARS-CoV-2 (COVID-19). https://www.cdc.gov/coronavirus/2019-ncov/hcp/testing-overview.html. Accessed 7 August 2020.
- 4.Majumder M., Mandl K. D., Early transmissibility assessment of a novel coronavirus in Wuhan, China. SSRN [Preprint] (2020). 10.2139/ssrn.3524675 (Accessed January 26, 2020). [DOI]
- 5.Sanche S., et al. , High contagiousness and rapid spread of severe acute respiratory syndrome coronavirus 2. Emerg. Infect. Dis. J. 26, 1470 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Moghadas S. M., et al. , The implications of silent transmission for the control of COVID-19 outbreaks. Proc. Natl. Acad. Sci. U.S.A. 117, 17513–17515 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Lourenco J., et al. , Fundamental principles of epidemic spread highlight the immediate need for large-scale serological surveys to assess the stage of the SARS-CoV-2 epidemic. medRxiv [Preprint] (2020). 10.1101/2020.03.24.20042291 (Accessed 26 March 2020). [DOI]
- 8.Goyal A., Reeves D. B., Cardozo-Ojeda E. F., Schiffer J. T., Mayer B. T., Wrong person, place and time: Viral load and contact network structure predict SARS-VoV-2 transmission and super-spreading events. medRxiv [Preprint] (2020). 10.1101/2020.08.07.20169920. (Accessed 7 August 2020). [DOI]
- 9.Li R., et al. , Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2). Science 368, 489–493 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Zhang J., et al. , Evolving epidemiology and transmission dynamics of coronavirus disease 2019 outside Hubei Province, China: A descriptive and modelling study. Lancet Infect. Dis. 20, 793–802 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Liu Y., Gayle A. A., Wilder-Smith A., Rocklöv J., The reproductive number of COVID-19 is higher compared to SARS coronavirus. J. Trav. Med. 27, taaa021 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Cowling B. J., et al. , Impact assessment of non-pharmaceutical interventions against coronavirus disease 2019 and influenza in Hong Kong: An observational study. Lancet Pub. Health 5, e279–e288 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ganyani T., et al. , Estimating the generation interval for coronavirus disease (COVID-19) based on symptom onset data, March 2020. Euro Surveill. 25, 2000257 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Centers for Disease Control and Prevention , COVID-19 pandemic planning scenarios. https://www.cdc.gov/coronavirus/2019-ncov/hcp/planning-scenarios.html. Accessed 5 August 2020.
- 15.Poletti P., et al. , Probability of symptoms and critical disease after SARS-CoV-2 infection. arXiv [Preprint] (2020). https://arxiv.org/abs/2006.08471 (Accessed 23 June 2020).
- 16.Byambasuren O., et al. , Estimating the extent of asymptomatic COVID-19 and its potential for 802 community transmission: Systematic review and meta-analysis. Off. J. Assoc. Med. Microbiol. 803 Infect. Dis. Can. 5, 223–234 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mizumoto K., Kagaya K., Zarebski A., Chowell G., Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID-19) cases on board the Diamond Princess cruise ship, Yokohama, Japan, 2020. Euro Surveill. 25, 2000180 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Nishiura H., et al. , Estimation of the asymptomatic ratio of novel coronavirus infections (COVID-19). Int. J. Infect. Dis. 94, 154–155 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Feaster M., Goh Y. Y., High proportion of asymptomatic SARS-CoV-2 infections in 9 long-term care facilities, Pasadena, California, USA, April 2020. Emerg. Infect. Dis. 26, 2416–2419 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Xie Q., et al. , Effect of large-scale testing platform in prevention and control of the COVID-19 pandemic: An empirical study with a novel numerical model. medRxiv [Preprint] (2020). 10.1101/2020.03.15.20036624 (Accessed 15 March 2020). [DOI] [PubMed]
- 21.L J., Yuan H. Y., Wu L., Pfeiffer D. U., Estimating effects of intervention measures on COVID-19 outbreak in Wuhan taking account of improving diagnostic capabilities using a modelling approach. medRxiv [Preprint] (2020). 10.1101/2020.03.31.20049387 (Accessed 31 March 2020). [DOI] [PMC free article] [PubMed]
- 22.New York City Department of Health and Mental Hygiene, COVID-19: Data. https://www1.nyc.gov/site/doh/covid/covid-19-data.page. Accessed 7 August 2020.
- 23.NewYork State Department of Health , New York State statewide COVID-19 testing. https://health.data.ny.gov/Health/New-York-State-Statewide-COVID-19-Testing/xdss-u53e. Accessed 7 August 2020.
- 24.Stadlbauer D., et al. , Repeated cross-sectional sero-monitoring of SARS-CoV-2 in New York City. Nature, 10.1038/s41586-020-2912-6 (2020). [DOI] [PubMed]
- 25.Nishiura H., Linton N. M., Akhmetzhanov A. R., Serial interval of novel coronavirus (COVID-19) infections. Int. J. Infect. Dis. 93, 284–286 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Gatto M., et al. , Spread and dynamics of the COVID-19 epidemic in Italy: Effects of emergency containment measures. Proc. Natl. Acad. Sci. U.S.A. 117, 10484–10491 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Davis J. T., et al. , Estimating the establishment of local transmission and the cryptic phase of the COVID-19 pandemic in the USA. medRxiv [Preprint] (2020). 10.1101/2020.07.06.20140285 (Accessed 6 July 2020). [DOI]
- 28.Nishiura H., et al. , The rate of underascertainment of novel coronavirus (2019-nCoV) infection: Estimation using Japanese passengers data on evacuation flights. J. Clin. Med. 9, 419 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Reuters News Agency , Three Japanese evacuees from Wuhan test positive for virus, two had no symptoms. CNBC, 30 January 2020. https://www.cnbc.com/2020/01/30/japanese-evacuees-from-wuhan-test-positive-for-virus.html. Accessed 23 August 2020.
- 30.Ferguson N., et al. , “Impact of non-pharmaceutical interventions (NPIS) to reduce COVID19 mortality and healthcare demand” (Rep. 9, Imperial College London, 2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Grasselli G., Pesenti A., Cecconi M., Critical care utilization for the COVID-19 outbreak in Lombardy, Italy: Early experience and forecast during an emergency response. JAMA 323, 1545–1546 (2020). [DOI] [PubMed] [Google Scholar]
- 32.Vincent J. L., Taccone F. S., Understanding pathways to death in patients with COVID-19. Lancet Respir. Med. 8, 430–432 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Price-Haywood E. G., Burton J., Fort D., Seoane L., Hospitalization and mortality among black patients and white patients with COVID-19. N. Engl. J. Med. 382, 2534–2543 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Richardson S., et al. , Presenting characteristics, comorbidities, and outcomes among 5700 patients hospitalized with COVID-19 in the New York City area. JAMA 323, 2052–2059 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Gostic K. M., et al. , Practical considerations for measuring the effective reproductive number, Rt. PLoS Comput. Biol. 16, e1008409 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Fox S. J., et al. , The impact of asymptomatic COVID-19 infections on future pandemic waves. medRxiv [Preprint] (2020). 10.1101/2020.06.22.20137489 (Accessed 23 June 2020). [DOI]
- 37.Adam D., A guide to R—The pandemic’s misunderstood metric. Nature 583, 346–348 (2020). [DOI] [PubMed] [Google Scholar]
- 38.Yong E., The deceptively simple number sparking coronavirus fears. The Atlantic, 28 January 2020. https://www.theatlantic.com/science/archive/2020/01/how-fast-and-far-will-new-coronavirus-spread/605632/. Accessed 23 August 2020.
- 39.Schumaker E., What is R-naught for the COVID-19 virus and why it’s a key metric for re-opening plans. ABC News, 26 May 2020. https://abcnews.go.com/Health/r0-covid-19-virus-key-metric-opening-plans/story?id=7086899. Accessed 23 August 2020.
- 40.Imai N., et al. , “Transmissibility of 2019-nCoV” (Rep. 3, Imperial College London, 2020). [Google Scholar]
- 41.Read J. M., Bridgen J. R., Cummings D. A., Ho A., Jewell C. P., Novel coronavirus 2019-nCoV: Early estimation of epidemiological parameters and epidemic predictions. medRxiv [Preprint] (2020). 10.1101/2020.01.23.20018549 (Accessed 28 January 2020). [DOI] [PMC free article] [PubMed]
- 42.Wu J. T., et al. , Estimating clinical severity of COVID-19 from the transmission dynamics in Wuhan, China. Nat. Med. 26, 506–510 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Pan A., et al. , Association of public health interventions with the epidemiology of the COVID-19 outbreak in Wuhan, China. JAMA 323, 1915–1923 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kucharski A. J., et al. , Early dynamics of transmission and control of COVID-19: A mathematical modelling study. Lancet Infect. Dis. 20, 553–558 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Riou J., Althaus C. L., Pattern of early human-to-human transmission of Wuhan 2019 novel coronavirus (2019-nCoV), December 2019 to January 2020. Euro Surveill. 25, 2000058 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Flaxman S., et al. , Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature 584, 257–261 (2020). [DOI] [PubMed] [Google Scholar]
- 47.Zhang Y., Li Y., Wang L., Li M., Zhou X., Evaluating transmission heterogeneity and super-spreading event of COVID-19 in a metropolis of China. Int. J. Environ. Res. Publ. Health 17, 3705 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Adam D., et al. , Clustering and superspreading potential of SARS-CoV-2 infections in Hong Kong. Nat. Med. 26, 1714–1719 (2020). [DOI] [PubMed] [Google Scholar]
- 49.Galvani A. P., May R. M., Dimensions of superspreading. Nature 438, 293–295 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Lloyd-Smith J. O., Schreiber S. J., Kopp P. E., Getz W. M., Superspreading and the effect of individual variation on disease emergence. Nature 438, 355–359 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Gomes M. G. M., et al. , Individual variation in susceptibility or exposure to SARS-CoV-2 lowers the herd immunity threshold, medRxiv [Preprint] (2020). 10.1101/2020.04.27.20081893 (Accessed 27 April 2020). [DOI] [PMC free article] [PubMed]
- 52.Lee S., et al. , Clinical course and molecular viral shedding among asymptomatic and symptomatic patients with SARS-CoV-2 infection in a community treatment center in the Republic of Korea. JAMA Intern. Med. 180, 1447–1452 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Park Y. J., et al. , Contact tracing during coronavirus disease outbreak, South Korea, 2020. Emerg. Infect. Dis. J. 26, 2465–2468 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Silverman J. D., Hupert N., Washburne A. D., Using influenza surveillance networks to estimate state-specific prevalence of SARS-CoV-2 in the United States. Sci. Transl. Med. 12, eabc1126 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.CDC COVID ResponseTeam et al. , Evidence for limited early spread of COVID-19 within the United States, January-February 2020. Morb. Mortal. Wkly Rep. 69, 680–684 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Hiller K. M., Stoneking L., Min A., Rhodes S. M., Syndromic surveillance for influenza in the emergency department—A systematic review. PloS One 8, e73832 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Gehret P. M., et al. , Syndromic surveillance during pandemic (H1N1) 2009 outbreak, New York, New York, USA. Emerg. Infect. Dis. J. 17, 1724 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Amanat F., et al. , A serological assay to detect SARS-CoV-2 seroconversion in humans. Nat. Med. 26, 1033–1036 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Wajnberg A., et al. , Robust neutralizing antibodies to SARS-CoV-2 infection persist for months. Science 370, 1227–1230 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Krammer F., Simon V., Serology assays to manage COVID-19. Science 368, 1060–1061 (2020). [DOI] [PubMed] [Google Scholar]
- 61.Grifoni A., et al. , Targets of T cell responses to SARS-CoV-2 coronavirus in humans with COVID-19 disease and unexposed individuals. Cell 181, 1489–1501 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Zhang Q., et al. , Inborn errors of type I IFN immunity in patients with life-threatening COVID-19. Science 370, eabd4570 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Bastard P., et al. , Autoantibodies against type I IFNs in patients with life-threatening COVID-19. Science 370, eabd4585 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Ward H., et al. , Declining prevalence of antibody positivity to SARS-CoV-2: A community study of 365,000 adults. medRxiv [Preprint] (2020). 10.1101/2020.10.26.20219725 (Accessed 27 October 2020). [DOI]
- 65.Zuo J., et al. , Robust SARS-CoV-2-specific T-cell immunity is maintained at 6 months following primary infection. bioRxiv [Preprint] (2020). 10.1101/2020.11.01.362319 (Accessed 2 November 2020). [DOI]
- 66.Fauver J. R., et al. , Coast-to-coast spread of SARS-CoV-2 during the early epidemic in the United States. Cell 181, 990–996.e5 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Anderson R. M., Anderson B., May R. M., Infectious Diseases of Humans: Dynamics and Control (Oxford University Press, 1992). [Google Scholar]
- 68.Yuan H. Y., et al. , The importance of the timing of quarantine measures before symptom onset to prevent COVID-19 outbreaks—Illustrated by Hong Kong’s intervention model. medRxiv [Preprint] (2020). 10.1101/2020.05.03.20089482 (Accessed 6 May 2020). [DOI]
- 69.Lauer S. A., et al. , The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: Estimation and application. Ann. Intern. Med. 172, 577–582 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.City of New York , Statement from Mayor de Blasio on bars, restaurants, and entertainment venues. https://www1.nyc.gov/office-of-the-mayor/news/152-20/statement-mayor-de-blasio-bars-restaurants-entertainment-venues. Accessed 29 August 2020.
- 71.City of New York , Office of the Mayor, Emergency Executive Order No. 100. https://www1.nyc.gov/assets/home/downloads/pdf/executive-orders/2020/eeo-100.pdf. Accessed 29 August 2020.
- 72.Press Office, Governor of New York , Governor Cuomo signs the ‘New York state on pause’ executive order. https://www.governor.ny.gov/news/governor-cuomo-signs-new-york-state-pause-executive-order. Accessed 29 August 2020.
- 73.US Census Bureau , Quickfacts New York City, New York. https://www.census.gov/quickfacts/fact/table/newyorkcitynewyork/POP010210#POP010210. Accessed 29 August 2020.
- 74.He D., Ionides E. L., King A. A., Plug-and-play inference for disease dynamics: Measles in large and small populations as a case study. J. R. Soc. Interface 7, 271–283 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.US Public Health Service , Priorities for testing patients with suspected COVID-19 infection. https://www.fsmb.org/siteassets/advocacy/pdf/hhs-covid-19-testing-guidance.pdf. Accessed 29 August 2020.
- 76.Centers for Disease Control and Prevention , Discontinuation of isolation for persons with COVID-19 not in healthcare settings. https://www.cdc.gov/coronavirus/2019-ncov/hcp/disposition-in-home-patients.html#previous-updates. Accessed 29 August 2020.
- 77.New York State Department of Health , Influenza laboratory-confirmed cases by county: Beginning 2009-10 season. https://health.data.ny.gov/Health/Influenza-Laboratory-Confirmed-Cases-By-County-Beg/jr8b-6gh6. Accessed 29 August 2020.
- 78.City of New York , Syndromic surveillance data. https://a816-health.nyc.gov/hdi/epiquery/visualizations?PageType=ps&PopulationSource=Syndromic. Accessed 29 August 2020.
- 79.US Food and Drug Administration , In vitro diagnostics EUAs: Individual EUAs for molecular diagnostic tests for SARS-CoV-2. https://www.fda.gov/medical-devices/coronavirus-disease-2019-covid-19-emergency-use-authorizations-medical-devices/vitro-diagnostics-euas#individual-molecular. Accessed 1 September 2020.
- 80.Appleby J., Why it takes so long to get most COVID-19 test results. NPR, 28 March 2020. https://www.npr.org/sections/health-shots/2020/03/28/822869504/why-it-takes-so-long-to-get-most-covid-19-test-results. Accessed 1 September 2020.
- 81.Ionides E. L., Nguyen D., Atchadé Y., Stoev S., King A. A., Inference for dynamic and latent variable models via iterated, perturbed Bayes maps. Proc. Natl. Acad. Sci. U.S.A. 112, 719–724 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.King A. A., Nguyen D., Ionides E. L., Statistical inference for partially observed Markov processes via the R package POMP. J. Stat. Software 69, 43 (2016). [Google Scholar]
- 83.Ionides E. L., Breto C., Park J., Smith R. A., King A. A., Monte Carlo profile confidence intervals for dynamic systems. J. R. Soc. Interface 14, 20170126 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Diekmann O., Heesterbeek J. A. P., Roberts M. G., The construction of next-generation matrices for compartmental epidemic models. J. R. Soc. Interface 7, 873–885 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Subramanian R., et al. , COVID_NYC_Epi_Model. Github. https://github.com/pascualgroup/COVID_NYC_Epi_Model. Deposited 3 February 2021.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Code used to simulate and fit the epidemiological model has been deposited in Github at https://github.com/pascualgroup/COVID_NYC_Epi_Model (85).