Abstract
Background
As the COVID-19 pandemic continues to unfold, the infection-fatality risk (ie, risk of death among all infected individuals including those with asymptomatic and mild infections) is crucial for gauging the burden of death due to COVID-19 in the coming months or years. Here, we estimate the infection-fatality risk of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in New York City, NY, USA, the first epidemic centre in the USA, where the infection-fatality risk remains unclear.
Methods
In this model-based analysis, we developed a meta-population network model-inference system to estimate the underlying SARS-CoV-2 infection rate in New York City during the 2020 spring pandemic wave using available case, mortality, and mobility data. Based on these estimates, we further estimated the infection-fatality risk for all ages overall and for five age groups (<25, 25–44, 45–64, 65–74, and ≥75 years) separately, during the period March 1 to June 6, 2020 (ie, before the city began a phased reopening).
Findings
During the period March 1 to June 6, 2020, 205 639 people had a laboratory-confirmed infection with SARS-CoV-2 and 21 447 confirmed and probable COVID-19-related deaths occurred among residents of New York City. We estimated an overall infection-fatality risk of 1·39% (95% credible interval 1·04–1·77) in New York City. Our estimated infection-fatality risk for the two oldest age groups (65–74 and ≥75 years) was much higher than the younger age groups, with a cumulative estimated infection-fatality risk of 0·116% (0·0729–0·148) for those aged 25–44 years and 0·939% (0·729–1·19) for those aged 45–64 years versus 4·87% (3·37–6·89) for those aged 65–74 years and 14·2% (10·2–18·1) for those aged 75 years and older. In particular, weekly infection-fatality risk was estimated to be as high as 6·72% (5·52–8·01) for those aged 65–74 years and 19·1% (14·7–21·9) for those aged 75 years and older.
Interpretation
Our results are based on more complete ascertainment of COVID-19-related deaths in New York City than other places and thus probably reflect the true higher burden of death due to COVID-19 than that previously reported elsewhere. Given the high infection-fatality risk of SARS-CoV-2, governments must account for and closely monitor the infection rate and population health outcomes and enact prompt public health responses accordingly as the COVID-19 pandemic unfolds.
Funding
National Institute of Allergy and Infectious Diseases, National Science Foundation Rapid Response Research Program, and New York City Department of Health and Mental Hygiene.
Introduction
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerged in late 2019 in China and has subsequently spread to more than 200 other countries. As of Oct 5, 2020, over 35 million cases of COVID-19 and over 1 million COVID-19-related deaths have been reported worldwide.1 As the pandemic continues to unfold and populations in many places worldwide largely remain susceptible, understanding the severity, and, in particular, the infection-fatality risk of the virus is crucial for gauging the full impact of COVID-19 in the coming months or years. However, estimating the infection-fatality risk of SARS-CoV-2 is challenging due to the large number of undocumented infections, fluctuating infection detection rates, and inconsistent reporting of fatalities. Furthermore, the infection-fatality risk of SARS-CoV-2 could vary by location, given differences in demographics, health-care systems, and social structures (eg, intergenerational households are the norm in some societies whereas older adults commonly reside and congregate in long-term care and adult care facilities in other societies). Most estimates of infection-fatality risk for SARS-CoV-2 to date have come from data recorded in China, the Diamond Princess cruise ship, and France.2, 3, 4, 5 As yet, the infection-fatality risk in the USA—the country currently reporting the largest number of cases1—remains unclear.
New York City, NY, USA, reported its first case of COVID-19 on March 1, 2020, in a traveller, and quickly became the epicentre of the pandemic in the country.6 Intense community transmission occurred during the following 3 months before a series of public health interventions brought the pandemic under control. In particular, public schools in the city were closed on March 16, 2020, and a citywide stay-at-home order was imposed on all non-essential workers starting the week of March 22, 2020.7 The city was able to reopen industries according to a phased schedule starting the week of June 7, 2020. By June 6, 2020, before the city's phased reopening, over 200 000 people had been diagnosed with laboratory-confirmed SARS-CoV-2 infection and more than 20 000 COVID-19-related deaths had been reported in the city. Since the beginning of the pandemic, the New York City Department of Health and Mental Hygiene (DOHMH) and the Mailman School of Public Health at Columbia University (New York City, NY) have been collaborating to generate real-time model projections in support of the city's pandemic response. Our model-inference system uses a meta-population network model to simulate SARS-CoV-2 transmission in the city's 42 United Hospital Fund neighbourhoods.8 The model is run in conjunction with the ensemble adjustment Kalman filter9 and fit simultaneously to case and mortality data for each of the 42 neighbourhoods while accounting for under-detection, delay between infection, case reporting, and death, and changing interventions (eg, physical distancing). In this analysis, we applied this network model-inference system to estimate the infection-fatality risk for five age groups (ie, <25, 25–44, 45–64, 65–74, and ≥75 years) and all ages overall, from March 1 to June 6, 2020. In the process, we also estimated infection detection rates (ie, the fraction of infections documented as confirmed cases) and the cumulative infection rate by June 6, 2020.
Research in context.
Evidence before this study
We searched PubMed for studies published from database inception to July 1, 2020, with no language restrictions, on the fatality risk of COVID-19 using the terms “COVID-19” and “fatality”. Our search returned 376 papers, from which we read the abstracts and identified 36 relevant studies. Most studies estimated the crude case–fatality risk (CFR; ie, number of deaths per number of confirmed cases) or adjusted CFR (ie, adjusting the crude CFR for delay from infection or diagnosis to death). For all ages overall, estimated crude CFR ranged 0–28% (30 estimates) and estimated adjusted CFR ranged 0·12–13·1% (24 estimates). Several studies also estimated age-specific CFR and reported higher CFR among older adults than in younger age groups. Four studies reported infection-fatality risk (ie, number of deaths among all infections). Of the four studies reporting infection-fatality risk, three also included age-specific estimates with varying age grouping.
Added value of this study
Using a comprehensive epidemic model-inference system and detailed population data of weekly cases, deaths, and mobility, we estimated the infection-fatality risk of severe acute respiratory syndrome coronavirus 2 for all ages overall and by age group in New York City, NY, USA, during the 2020 spring pandemic (March 1, to June 6, 2020). We also estimated the fluctuations in infection-fatality risk over the course of the pandemic. Our estimates addressed three main challenges in estimating the infection-fatality risk of COVID-19: age differences, under-ascertainment of deaths, and under-detection of infections.
Implications of all the available evidence
We estimated that the overall infection-fatality risk was approximately double previous estimates for elsewhere during earlier or similar periods. Our results are based on more complete ascertainment of COVID-19-associated deaths in New York City than are those from previous studies, and thus probably reflect the true, higher burden of death due to COVID-19 than previously reported elsewhere. Given this high infection-fatality risk, governments must account for and closely monitor the infection rate and population health outcomes and enact prompt public health responses accordingly as the COVID-19 pandemic unfolds.
Methods
Study design and data
In this model-based analysis, we aggregated laboratory confirmed SARS-CoV-2 infections reported to the New York City DOHMH by week of diagnosis and age group (<1, 1–4, 5–14, 15–24, 25–44, 45–64, 65–74, and ≥75 years) for each of the 42 United Hospital Fund neighbourhoods8 in New York City, according to the patient's residential address at time of reporting. We aggregated mortality data for confirmed and probable COVID-19-associated deaths from deaths registered and analysed by the New York City DOHMH. Confirmed COVID-19-associated deaths were defined as those occurring in people with laboratory-confirmed SARS-CoV-2 infection; and probable COVID-19 deaths were defined as those with COVID-19, SARS-CoV-2, or a similar term listed on the death certificate as an immediate, underlying, or contributing cause of death but did not have laboratory confirmation of SARS-CoV-2 infection.10 Due to privacy concerns, the New York City DOHMH aggregated mortality data to five coarser age groups (<18, 18–44, 45–64, 65–74, and ≥75 years) for each neighbourhood by week of death. To match with the age grouping for case data, we used the citywide fraction of deaths occurring in each of the five finer age groups (ie, <1, 1–4, 5–14, 15–24, 25–44) to apportion deaths in the younger than 18 and 18–44 year age categories. For this study, case and mortality data were both retrieved on Aug 7, 2020.
We used mobility data to model changes in the rate of SARS-CoV-2 transmission due to public health interventions implemented during the pandemic. We sourced these data from SafeGraph11, 12 and they contained counts of visitors to locations in each zip code based on mobile device locations. The released data were anonymised and aggregated in weekly intervals (with weeks defined as Sunday to Saturday). We spatially aggregated these data to the neighbourhood level.
This study was classified as public health surveillance and was exempt from ethical review and informed consent by the Institutional Review Boards of both Columbia University and New York City DOHMH.
Meta-population network transmission model
Our meta-population network model simulated intra-neighbourhood and inter-neighbourhood transmission of SARS-CoV-2 and assumed susceptible-exposed-infectious-removed dynamics, per the following equation system:
where S i is the number of susceptible individuals, E i is the number of exposed (but not yet infectious) individuals, I i is the number of infectious individuals, R i is the number of removed individuals (either recovered or deceased), and N i is the total population from a given age group in neighbourhood i. Due to model complexity and a scarcity of information for parameterising interactions among age groups, we modelled each age group separately (ie, combining all sources of infection to each age group); as such, system (1) describes the spatial transmission across neighbourhoods with no interactions among age groups. t is time, and we make this time dependence explicit for the parameters to indicate that they were estimated for each week and could vary over time due to disease seasonality or public health interventions, or both; state variables (S i, E i, I i and R i) are inherently vary with time. βcity(t) is the citywide transmission rate, which incorporated seasonal variation as observed for OC43, a betacoronavirus in humans from the same genus as SARS-CoV-2 (appendix pp 1–3, 11–14). To allow for differential transmission in each neighbourhood, we included a multiplicative factor, b i, to scale neighbourhood local transmission rates. Z is the latency period and D is the infectious period (appendix pp 4–6).
The matrix [c ij(t)] represents changes in contact rates over time and connectivity among neighbourhoods and was calculated on the basis of mobility data. Briefly, we calculated changes in contact rates (either intra-neighbourhood or inter-neighbourhood) for week-t as a ratio of the number of visitors during week-t to the number of visitors during the week of March 1, 2020 (the first week of the pandemic in New York City when no interventions were in place), and further scaled by a multiplicative factor m 1; m 1 was estimated along with other parameters (appendix pp 4–6). To calculate the connectivity among the neighbourhoods, we first divided the inter-neighbourhood mobility by the local mobility, which gave a relative measure of connectivity (eg, if two neighbourhoods are highly connected with lots of individuals travelling between them, inter-neighbourhood mobility would be closer to 1, but if they were not highly connected then inter-neighbourhood mobility would be much lower than 1); we then scaled these relative rates by a multiplicative factor m 2, which was also estimated along with other parameters (appendix pp 4–6).
Observational model
To account for delays in diagnosis and detection, we included a lag of time from infectious to detection (ie, an infection being diagnosed as a case), drawn from a gamma distribution with a mean of T m and an SD of T SD days. To account for under-detection, we included an infection detection rate (r)—ie, the fraction of infections (including subclinical or asymptomatic infections) reported as cases. To calculate the model-simulated number of new cases per week, we multiplied the model-simulated number of infections per day (including those from the previous weeks) by the infection detection rate, and further distributed these simulated cases in time per the distribution of time from infectious to detection. We then aggregated the daily lagged, simulated cases to weekly totals for model inference. Similarly, to calculate the model-simulated deaths per week and account for delays in time to death, we multiplied the simulated number of infections by the infection-fatality risk and then distributed these simulated deaths in time per the distribution of time-from-infectious-to-death lag, and aggregated these daily numbers to weekly totals. For each week, we estimated the infection detection rate (r), the mean (T m) and SD (T SD) of time from infectious to detection, and the infection-fatality risk on the basis of weekly case and mortality data. The distribution of time from diagnosis to death was based on observations of 15 686 confirmed COVID-19-related deaths in New York City as of May 17, 2020, from New York City DOHMH (gamma distribution with a mean of 9·36 days [SD 9·76]; appendix pp 4–6).
Parameter estimation
To estimate model parameters (βcity, Z, D, m 1, m 2, T m, T SD, r, infection-fatality risk and b i, for i=1,…,42) and state variables (S i, E i, and I i, for i=1,…,42) for each week, we ran the meta-population network model stochastically with a daily timestep in conjunction with the ensemble adjustment Kalman filter and fit to weekly case and mortality data from the week starting March 1, 2020, to the week ending June 6, 2020. The ensemble adjustment Kalman filter uses an ensemble of model realisations (n=500 here), each with initial set of parameters and variables randomly drawn from a prior range (appendix pp 4–6). After model initialisation, the model ensemble was integrated forwards in time for a week to calculate the model-simulated number of cases and deaths for that week; these prior estimates were then combined with the observed cases and deaths for the same week to calculate the posterior distribution of each model parameter or variable for that week per Bayes' theorem.9 Notably, the ensemble adjustment Kalman filter also models the observational errors (eg, due to imperfect sensitivity and specificity of SARS-CoV-2 RT-PCR tests for case diagnosis) over time by specifying an error structure and using this information when calculating the posterior distribution.9 We did this parameter estimation process separately for each of the eight age groups (ie, <1, up to ≥75). To include transmission from other age groups, we used measured intra-group and inter-group contacts from the POLYMOD study13 to calculate the total number of contacts made to each age group and adjusted the prior range of the transmission rate (βcity) during the first week of the pandemic for each age group accordingly. We calculated the posterior estimate on the basis of case and mortality data for each age group, which included all sources of infection. Thus, the estimated transmission rate for each age group included all sources of transmission.
To account for stochasticity in model initiation, we ran the parameter estimation process independently ten times. We combined results for each age group from these ten runs (each with 500 realisations). To combine estimates of the infection-detection rate and infection-fatality risk for those younger than 25 years or all ages overall, we weighted the age-group specific estimates (median and credible interval [CrI]) by the fraction of estimated infections from each related age group.
Model validation
As a model validation, we compared our estimates of cumulative infection rates to three independent serology datasets measuring the seroprevalence of antibodies to SARS-CoV-2 during the pandemic wave in New York City. Details of the serology data and matching by timing of measurement, age group, and location are in the appendix (pp 1, 8, 10).
Role of the funding source
The funders of the study had a role in the data collection and no role in study design, data analysis, data interpretation, or writing of the report. The corresponding author had full access to all the data in the study and had final responsibility for the decision to submit for publication.
Results
Between March 1 and June 6, 2020, 205 639 people had been diagnosed with laboratory-confirmed SARS-CoV-2 infection and 21 447 COVID-19-related deaths had been reported in New York City (table ). The epidemic timing (eg, peak of confirmed cases and mortality) varied substantially by age group and neighbourhood (appendix p 10). We were able to use our model-inference system to recreate the case and mortality time series for each age group and all ages overall (figure 1 ). For most age groups, confirmed cases peaked during the week of March 29, 2020, and the mortality rate peaked about 1 week later than the case rate, due to the time-lag from severe infection to death.
Table.
Summary estimates of cases of COVID-19 and COVID-19-related deaths in New York City, NY, USA for the period March 1 to June 6, 2020, by age group
Confirmed cases | Confirmed and probable deaths | Estimated cumulative infection rate | Estimated infection-fatality risk | |
---|---|---|---|---|
<25 years | 16 332 | 45 | 8·56% (5·66–17·5) | 0·00972% (0·00405–0·0154) |
25–44 years | 64 753 | 734 | 22·6% (16·6–31·2) | 0·116% (0·0729–0·148) |
45–64 years | 74 798 | 4732 | 22·7% (18·0–29·2) | 0·939% (0·729–1·19) |
65–74 years | 25 460 | 5181 | 15·0% (11·4–21·6) | 4·87% (3·37–6·89) |
≥75 years | 24 296 | 10 755 | 12·8% (9·92–18·6) | 14·2% (10·2–18·1) |
Overall | 205 639 | 21 447 | 17·2% (12·9–25·1) | 1·39% (1·04–1·77) |
Data are n, median cumulative infection rate with 95% CrI in parentheses, and median estimated infection-fatality risk with 95% CrI in parentheses. Data are given to three significant figures. Cases and deaths were reported by the New York City Department of Health and Mental Hygiene between March 1 and June 6, 2020. Cumulative infection rate is for all those infected by June 6, 2020. And infection-fatality risk is averaged over March 22 to June 6, 2020, with estimates for March 1–21 excluded because estimates were less accurate for these earliest weeks when zero or few deaths were reported. CrI=credible interval.
Figure 1.
Model fit for confirmed number of cases of COVID-19 (A, C, E, G, I, K) and model estimate of number of COVID-19 related deaths (B, D, F, H, J, L), by age group and overall
Boxes and whiskers show the median, 50% CrI, and 95% Crl. Red dots show the observed confirmed case rates (A, C, E, G, I, K) and observed mortality rates (B, D, F, H, J, L). CrI=credible interval.
However, there was substantial under-detection of infections, variations by age group, and fluctuations of infection-detection rates over time, in part due to changing testing criteria.14, 15 The estimated infection-detection rate for all ages overall started at a low level of 2·2% (95% CrI 0·3–4·5) during the week of March 1, 2020, and increased to 17·4% (11·3–26·1) during the week of March 15 (figure 2 ). However, due to shortages in testing and personal protective equipment, testing was restricted to severely ill patients in early April14 before it became more widely available in May.15 Consistently, the estimated infection detection rate dropped to approximately 13% in early-mid April, then gradually increased to approximately 19% in early May and stayed at similar levels through the week of May 31, 2020 (figure 2). The estimated infection detection rate was highest for the two oldest age groups and was substantially lower for younger age groups (figure 2). During the week of May 31, 2020, before the city began its phased reopening, we estimated that 29·8% (21·7–42·3) of infections among those aged 65–74 years and 36·0% (28·4–47·9) of infections among those aged 75 years and older were detected; by comparison, only 11·0% (7·2–17·6) of infections among those younger than 25 years and 16·8% (11·8–23·1) of infections among those aged 25–44 years were detected.
Figure 2.
Estimated infection rates and infection-detection rates over time by age group (A–E) and overall (F)
Black box plots show the estimated median, 50% CrI, and 95% CrI of infection rate, and the red lines show the estimated median infection-detection rate and the red shaded area shows the 50% CrI (dark red) and the 95% CrI (light red) of estimated infection detection rate. Horizontal arrows indicate the timing of two major public health intervention measures—ie, school closures starting the week of March 15, 2020, and the stay-at-home order starting the week of March 22, 2020. CrI=credible interval.
After accounting for the infection-detection rate, the epidemic peak for new infections occurred 1–2 weeks before the peak in confirmed cases, during the week of March 22, 2020, for those younger than 65 years and in all ages combined (figure 2). This peak was coincident with the timing of public health interventions in New York City (ie, public schools closing and the citywide stay-at-home order was imposed). Tallied over the entire study period, the estimated overall cumulative infection rate was 17·2% (95% CrI 12·9–25·1) by June 6, 2020 (table). However, estimated cumulative infection rates varied substantially across age groups and neighbourhoods in the city (figure 3 ). Specifically, the highest cumulative infection rates were in people aged 25–44 years and 45–64 years, and those aged 65–74 years and 75 years and older had the second highest cumulative infection rates, and those younger than 25 years had the lowest cumulative infection rate (table). Spatially, among the five boroughs in New York City, estimated cumulative infection rates were highest in neighbourhoods in the Bronx and lowest in neighbourhoods in Manhattan (figure 3).
Figure 3.
Estimated cumulative infection rates across neighbourhoods in New York City, NY, USA, by age group (A–E) and overall (F)
New York City has five boroughs (Manhattan, Bronx, Brooklyn, Queens, and Staten Island) and 42 neighbourhoods (shown by the black lines). The heat maps show the estimated median cumulative infection rates for the period March 1, to June 6, 2020, for each age group and neighbourhood.
Our model estimates of cumulative infection rates have large uncertainties. To assess the accuracy of our model, we compared our model estimates with three datasets of seroprevalence of antibodies to SARS-CoV-2 measured during three phases of the pandemic in New York City (ie, the early phase in March,16 the mid-phase in April,17 and end phase before reopening in early June;18 details on the available serological data and matching by timing of measurement, age group, and location are in the appendix [pp 1, 7–8]). Although we had large uncertainties in our estimates, our estimated cumulative infection rates were in line with corresponding measures from antibody tests for all three phases of the pandemic wave (appendix p 10). Consistent with serological data, our model-inference system estimated higher infection rates among adults aged 25–64 years than in other age groups (appendix p 10). Additionally, the spatial variation estimated by our model-inference system was in line with reported measures (ie, highest in the Bronx and lowest in Manhattan; appendix p 10). This consistency with independent serological data provides some independent validation of our model estimates.
During the period March 1 to June 6, 2020, the crude confirmed case-fatality risk was 8·23% (16 924 confirmed COVID-19-related death and 205 639 confirmed SARS-CoV-2 infections). After accounting for changing infection detection rates and excluding the first 3 weeks of the study period (ie, March 1–21, 2020, in which none or few deaths were reported, hence making the model estimates less accurate), we estimated that the overall infection-fatality risk, including both confirmed and probable deaths, was 1·39% (95% CrI 1·04–1·77) during March 22 to June 6, 2020 (table). If only confirmed COVID-19-related deaths were included, given that 16 924 (78·9%) deaths were confirmed to be due to SARS-CoV-2 infection, the overall infection-fatality risk would be around 1·10% (ie, 1·39% × 0·789).
Examining estimates by age group, the estimated infection-fatality risk was lowest in young age groups, increasing substantially with age (table; figure 4 ). These estimates were similar to infection-fatality risks reported for China for corresponding age groups.3 However, the estimated infection-fatality risk for the two oldest age groups was much higher than the younger age groups and about twice as high as the rates reported for these age groups in China.3, 4 Additionally, the estimated infection-fatality risk fluctuated substantially over time for the two oldest age groups. For those aged 65–74 years, the estimated infection-fatality risk was 6·72% (95% CrI 5·52–8·01) during the week of April 5, 2020, but decreased to 4·20% (2·22–7·01) during the week of May 31, 2020 (figure 4). For those aged 75 years and older, estimated infection-fatality risk was 19·11% (14·70–21·92) during the week of April 5, 2020, but decreased to 10·38% (6·17–14·96) during the week of May 31, 2020 (figure 4).
Figure 4.
Estimated infection-fatality risk, by age group (A–E) and overall (F)
Red lines show the estimated median infection-fatality risk with shaded areas indicating the 50% CrI (dark red) and 95% CrI (light red). For comparison, the grey bars show the number of deaths reported for each week from the week of March 1, to the week of May 31, 2020.
Discussion
In light of the large uncertainties in infection-fatality risks for SARS-CoV-2 due to under-detection of infections, we used a model-inference system, developed to support the pandemic response in New York City, to estimate local infection-fatality risks. During the 2020 spring pandemic (March 1–June 6, 2020), New York City recorded the largest number of COVID-19 cases and related deaths in the USA. Despite public health efforts to slow the pandemic (eg, via physical distancing), and to increase health-care capacity, 21 447 people died due to COVID-19 in the city in the short span of 3 months. Based on this large number of deaths, the estimated overall infection-fatality risk in New York City was 1·39% if both confirmed and probable COVID-19-related deaths were included or 1·10% if only confirmed COVID-19-related deaths were included. Both estimates were higher than previously reported elsewhere (eg, about 0·7% in both China3 and France5). Importantly, New York City has nosologists who rapidly review all death certificates and record deaths into a unified electronic reporting system (the average death certification time was 22·2 h and 95% of deaths were certified within 3·1 days during the pandemic wave; unpublished data, New York City DOHMH, Huynh M). This mortality surveillance infrastructure and enhanced nosology thus allow more rapid and complete death reporting in New York City than other places in the world. As such, our estimates here probably reflect the underlying fatality risk of SARS-CoV-2 infection more accurately than do those in previous studies. Furthermore, because the public health infrastructure and health-care systems in New York City are probably stronger than in many other places,19 the higher infection-fatality risk estimated here suggests that the fatality risk from SARS-CoV-2 might be higher in the USA and some other countries than has been previously reported. Notably, despite the large surge in cases and admissions to hospital, through quick expansion of health-care systems, most hospitals in New York City were able to meet demand for patient care during the pandemic. Because COVID-19 continues to pose pandemic risk in many places worldwide, governments must account for and closely monitor the infection rate and health outcomes, including admissions to hospital and mortality, and take prompt public health responses accordingly.
Although the infection-fatality risk we estimated here was similar to that previously reported elsewhere for younger age groups,3, 5 we found that the infection-fatality risk for individuals aged 65 years and older in New York City were about twice as high as in reports from other locations.3 These higher infection-fatality risk estimates might be in part due to differences in population characteristics, in particular, the prevalence of underlying medical conditions such as diabetes, chronic lung disease, and cardiovascular disease.20, 21 Regardless, our estimated weekly infection-fatality risk was as high as 6·7% for those aged 65–74 years and 19·1% for those aged 75 years and older. These dire estimates highlight the increased risk of COVID-19-related mortality in older populations and the importance of infection prevention in congregate settings. Thus, early detection and adherence to infection control guidance in long-term and adult care facilities should be a priority for COVID-19 response as the pandemic continues to unfold.
Over 5000 COVID-19-related deaths occurred among adults aged 25–64 years during the study period. Despite this large number of deaths, estimated cumulative infection rates in these age groups were only around 20% by the week of May 31, 2020, much lower than the 50–70% herd immunity needed to prevent large epidemics of COVID-19 (assuming the basic reproductive number for SARS-CoV-2 is around 2·0–3·5 and infection confers long-lasting immunity).5, 22, 23 By July, 2020, many places where lockdown-like measures were lifted saw increases in the number of cases of COVID-19 among young adults.24, 25, 26 These continuous infections could ignite new epidemics of COVID-19 and lead to further devastating effects in older populations and in younger adults (in particular, those aged 45–64 years) given the remaining high population susceptibility in many places and transmission across age groups. As such, young adults must strictly adhere to physical distancing and preventive measures (eg, mask wearing) in places with continuous transmission, despite their relatively low infection-fatality risk.
In this study, we incorporated multiple data sources, including age-grouped, spatially resolved case and mortality data and mobility data, to calibrate our model-inference system. Notably, the timing of the peak of the COVID-19 pandemic varied substantially among New York City neighbourhoods. For instance, peak mortality rates occurred up to 8 weeks apart among the 42 neighbourhoods. Fitting the model-inference system simultaneously to these diverse case and mortality time series thus enabled improved constraint of key model parameters (eg, infection detection rate and infection-fatality risk).
We note there remain large uncertainties in our model estimates. A full assessment of COVID-19 severity will require comprehensive serological surveys of the population by age group and neighbourhood due to the large heterogeneity of infection rates across populations and space. Additionally, we only included deaths that were laboratory confirmed as related to SARS-CoV-2 infection or explicitly coded as related to COVID-19. Previous studies have reported that excess deaths in New York City during about the same period were over 24 000,10, 27 which are more than the 21 447 COVID-19-related deaths included in this study. Furthermore, studies have reported severe sequelae of COVID-19 in children—ie, multisystem inflammatory syndrome in children.28, 29 Thus, monitoring health outcomes in younger age groups after infection is important as the pandemic unfolds, despite the low infection-fatality risk in these age groups noted to date.
Acknowledgments
Acknowledgments
This study was supported by the National Institute of Allergy and Infectious Diseases (AI145883), the National Science Foundation Rapid Response Research Program (RAPID; 2027369), and the New York City DOHMH. We thank the New York City DOHMH Bureau of Vital Statistics team and staff members in the New York City DOHMH Incident Command System Surveillance and Epidemiology Section for data management. We thank Miranda S Moore at New York City DOHMH, who wrote and maintained the code to generate weekly extracts of COVID-19 case data for provision to Columbia University under a data use agreement. We thank Jaimie Shaff at New York City DOHMH and Helen Alesbury and Kathryn Klein at the Office of Chief Medical Examiner of the City of New York for initiating and coordinating discussions on modelling COVID-19 mortality in New York City. We also thank Columbia University Mailman School of Public Health for high performance computing and SafeGraph for providing the mobility data used in this study.
Contributors
WY conceived the study with input from JS, SK, MH, SKG, GVW, and AF. MH, GVW, WL, and HTC oversaw the collection of and provided the COVID-19-associated mortality data. SKG oversaw the specification and provision of COVID-19 laboratory-confirmed case data. AY managed and provided the human endemic coronaviruses data used for estimating the seasonality of the pandemic. EM contributed to data management of confirmed cases and confirmed and probable COVID-19-associated deaths. AF oversaw all data collection processes at the DOHMH. SK complied the human mobility data from SafeGraph and wrote the initial version of data processing scripts. SKG and AF provided critical input on parameter estimation. DO provided input on the mortality pattern. WY did all modelling analyses and wrote the first draft of the manuscript. All authors critically reviewed and revised the manuscript.
Declaration of interests
JS and Columbia University disclose partial ownership of SK Analytics. JS discloses consulting for BNI and Merck. All other authors declare no competing interests.
Supplementary Material
References
- 1.WHO . World Health Organization; Geneva: Oct 5, 2020. Coronavirus disease (COVID-2019) situation reports.https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports/ [Google Scholar]
- 2.Wu JT, Leung K, Bushman M, et al. Estimating clinical severity of COVID-19 from the transmission dynamics in Wuhan, China. Nat Med. 2020;26:506–510. doi: 10.1038/s41591-020-0822-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Verity R, Okell LC, Dorigatti I, et al. Estimates of the severity of coronavirus disease 2019: a model-based analysis. Lancet Infect Dis. 2020;20:669–677. doi: 10.1016/S1473-3099(20)30243-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Russell TW, Hellewell J, Jarvis CI, et al. Estimating the infection and case fatality ratio for coronavirus disease (COVID-19) using age-adjusted data from the outbreak on the Diamond Princess cruise ship, February 2020. Euro Surveill. 2020;25 doi: 10.2807/1560-7917.ES.2020.25.12.2000256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Salje H, Tran Kiem C, Lefrancq N, et al. Estimating the burden of SARS-CoV-2 in France. Science. 2020;369:208–211. doi: 10.1126/science.abc3517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.NBC News Manhattan woman, 39, is NYC's first COVID-19 case; husband's test results are pending. NBC News. March 3, 2020. https://www.nbcnewyork.com/news/coronavirus/person-in-nyc-tests-positive-for-covid-19-officials/2308155/
- 7.New York State Department of Health . New York State Department of Health; NY: 2020. New York State on PAUSE.https://coronavirus.health.ny.gov/new-york-state-pause [Google Scholar]
- 8.New York City Department of Health and Mental Hygiene NYC UHF 42 neighborhoods. New York City, NY: New York City Department of Health and Mental Hygiene. http://a816-dohbesp.nyc.gov/IndicatorPublic/EPHTPDF/uhf42.pdf
- 9.Anderson JL. An ensemble adjustment Kalman filter for data assimilation. Mon Weather Rev. 2001;129:2884–2903. [Google Scholar]
- 10.Olson DR, Huynh M, Fine A, et al. Preliminary estimate of excess mortality during the COVID-19 outbreak - New York City, March 11–May 2, 2020. MMWR Morb Mortal Wkly Rep. 2020;69:603–605. doi: 10.15585/mmwr.mm6919e5. [DOI] [PubMed] [Google Scholar]
- 11.SafeGraph Weekly patterns: foot traffic data to understand the COVID-19 pandemic. SafeGraph. 2020. https://www.safegraph.com/weekly-foot-traffic-patterns
- 12.Lasry A, Kidder D, Hast M, et al. Timing of community mitigation and changes in reported COVID-19 and community mobility - four U.S. metropolitan areas, February 26–April 1, 2020. MMWR Morb Mortal Wkly Rep. 2020;69:451–457. doi: 10.15585/mmwr.mm6915e2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Mossong J, Hens N, Jit M, et al. Social contacts and mixing patterns relevant to the spread of infectious diseases. PLoS Med. 2008;5:e74. doi: 10.1371/journal.pmed.0050074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.New York City Department of Health and Mental Hygiene . New York City Department of Health and Mental Hygiene; New York City, NY: April 11, 2020. 2020 health alert #10: COVID-19 updates for New York City: face mask use policy, swab shortage, reporting COVID-19 related deaths and crisis communication resources.https://www1.nyc.gov/assets/doh/downloads/pdf/han/alert/2020/covid-19-update-04112020.pdf [Google Scholar]
- 15.New York City Department of Health and Mental Hygiene . New York City Department of Health and Mental Hygiene; New York City, NY: May 15, 2020. 2020 health advisory #15: updated NYC Health Department recommendations for identifying and testing patients with suspected COVID-19.https://www1.nyc.gov/assets/doh/downloads/pdf/han/advisory/2020/covid-19-provider-id-testing.pdf [Google Scholar]
- 16.Havers FP, Reed C, Lim T, et al. Seroprevalence of antibodies to SARS-CoV-2 in 10 sites in the United States, March 23–May 12, 2020. JAMA Intern Med. 2020 doi: 10.1001/jamainternmed.2020.4130. published online July 21. [DOI] [PubMed] [Google Scholar]
- 17.Government of New York State . Government of New York State; Albany, NY: May 2, 2020. Amid ongoing COVID-19 pandemic, Governor Cuomo announces results of completed antibody testing study of 15,000 people showing 12.3 percent of population has COVID-19 antibodies.https://www.governor.ny.gov/news/amid-ongoing-covid-19-pandemic-governor-cuomo-announces-results-completed-antibody-testing [Google Scholar]
- 18.Government of New York State . Government of New York State; Albany, NY: June 16, 2020. Video, audio, photos & rush transcript: Governor Cuomo announces U.S. Open to be held without fans from August 31st to September 13th.https://www.governor.ny.gov/news/video-audio-photos-rush-transcript-governor-cuomo-announces-us-open-be-held-without-fans-august [Google Scholar]
- 19.Medbelle Best hospital cities ranking 2019. Medbelle. 2019. https://www.medbelle.com/best-hospital-cities-usa
- 20.Clark A, Jit M, Warren-Gash C, et al. How many are at increased risk of severe COVID-19 disease? Rapid global, regional and national estimates for 2020. medRxiv. 2020 doi: 10.1101/2020.04.18.20064774. published online April 22. (preprint) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.CDC COVID-19 Response Team Preliminary estimates of the prevalence of selected underlying health conditions among patients with coronavirus disease 2019—United States, February 12–March 28, 2020. MMWR Morb Mortal Wkly Rep. 2020;69:382–386. doi: 10.15585/mmwr.mm6913e2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cobey S. Modeling infectious disease dynamics. Science. 2020;368:713–714. doi: 10.1126/science.abb5659. [DOI] [PubMed] [Google Scholar]
- 23.Li R, Pei S, Chen B, et al. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2) Science. 2020;368:489–493. doi: 10.1126/science.abb3221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Durkin E. New York City sees uptick in coronavirus cases among young adults. Politico. July 13, 2020. https://www.politico.com/states/new-york/albany/story/2020/07/13/new-york-city-sees-uptick-in-coronavirus-cases-among-young-adults-1300223
- 25.Soucheray S. Center for Infectious Disease Research and Policy; June 26, 2020. COVID-19 cases among US young adults spike.https://www.cidrap.umn.edu/news-perspective/2020/06/covid-19-cases-among-us-young-adults-spike [Google Scholar]
- 26.Bisserbe N, Pancevski B. Coronavirus cases rise in Europe as youth hit beaches and bars. The Wall Street Journal. Aug 2, 2020 https://www.wsj.com/articles/coronavirus-cases-rise-in-europe-as-youth-hit-beach-and-bars-11596364200 [Google Scholar]
- 27.Weinberger DM, Chen J, Cohen T, et al. Estimation of excess deaths associated with the COVID-19 pandemic in the United States, March to May 2020. JAMA Intern Med. 2020 doi: 10.1001/jamainternmed.2020.3391. published online July 1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Cheung EW, Zachariah P, Gorelik M, et al. Multisystem inflammatory syndrome related to COVID-19 in previously healthy children and adolescents in New York City. JAMA. 2020;324:294–296. doi: 10.1001/jama.2020.10374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Jiang L, Tang K, Levin M, et al. COVID-19 and multisystem inflammatory syndrome in children and adolescents. Lancet Infect Dis. 2020 doi: 10.1016/S1473-3099(20)30651-4. published online Aug 17. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.