In the December 2021 issue of AJPH, Hennessee et al. reminded readers of an important limitation of COVID-19 surveillance data.1 As they noted, there are many dates that can be captured in a time series, including dates of SARS-CoV-2 infection, symptom onset, test, and report. These dates provide information useful to public health, from calculating measures of contagion to demand on the health care system. We agree with the authors that, epidemiologically, infection date is most relevant.
However, the authors do not do justice to the complexities inherent in analyzing these data and, worse yet, may inadvertently mislead readers regarding best practices for addressing these complexities. Indeed, the authors state “many estimations methods [for infection date] homogenize substantial heterogeneities,” but they disregard this in their demonstration of recovering infection date in Figure 2: “infection dates were estimated as symptom onset dates minus a median incubation period.”1(p2129) Although this is straightforward to calculate, it ignores the “substantial heterogeneities,” and leads to invalid conclusions.2 For those interested in obtaining infection date, we would suggest deconvolution, which has been applied in infectious disease surveillance for decades,3 and has also been implemented for estimating the reproductive number of SARS-CoV-2 during the pandemic.2,4
There is another issue with these data that must be dealt with before making any inferences about the outbreak: outcome misclassification.4,5 Such errors will primarily be underreporting (i.e., suboptimal sensitivity of the surveillance program) through asymptomatic infection or those symptomatic and unable or unwilling to test, but there may also be issues with diagnostic accuracy, including both false positives and false negatives. As such, taking epidemic curves at face value is problematic.6
Methods are freely available to rigorously address both the timing of infection and flaws in the capture of cases.4 Figure 1 is our application of these methods to the Georgia Department of Public Health time series COVID-19 data used by Hennessee et al. and shows the divergence in the two approaches beyond the stochastic error of the 95% uncertainty interval, which may have implications for pandemic management.7 Methodologically rigorous adjustment for biases in the surveillance data smoothed out fluctuations and leads to a more definitive interpretation, reducing the need for qualitative judgements. Although our methods can be improved upon, we urge public health practitioners to adopt the best available methods, because both our collective credibility and, more importantly, success in combating COVID-19 rest on using tools that are state-of-the-art.
FIGURE 1—
Deconvolution of SARS-CoV-2 Infection Date From Symptom Onset Date While Also Accounting for Misclassification Error in the Case Ascertainment of COVID-19 Surveillance Data: Georgia Department of Public Health, December 1, 2020–April 11, 2021
Note. Dashed gray line depicts the double adjustment method described in Goldstein et al.,4 with corresponding 95% uncertainty interval. Solid black line depicts the method described in Hennessee et al.1 of infection date estimated as symptom onset date minus a median incubation period of 5.1 days. Tails of the distribution may be unreliable because of left and right truncation. Analytic codes are available for download from https://doi.org/10.5281/zenodo.5798398.
CONFLICTS OF INTEREST
N. D. Goldstein and I. Burstyn serve as expert witnesses on COVID-19–related litigation.
HUMAN PARTICIPANT PROTECTION
The analysis used publicly available aggregated data.
REFERENCES
- 1.Hennessee I, Clennon JA, Waller LA, Kitron U, Bryan JM. Considerations for improving reporting and analysis of date-based COVID-19 surveillance data by public health agencies. Am J Public Health. 2021;111(12):2127–2132. doi: 10.2105/AJPH.2021.306520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Gostic KM, McGough L, Baskerville EB, et al. Practical considerations for measuring the effective reproductive number, Rt [erratum in PLoS Comput Biol. 2021;17(12):e1009679] PLoS Comput Biol. 2020;16(12):e1008409. doi: 10.1371/journal.pcbi.1008409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Brookmeyer R. Reconstruction and future trends of the AIDS epidemic in the United States. Science. 1991;253(5015):37–42. doi: 10.1126/science.2063206. [DOI] [PubMed] [Google Scholar]
- 4.Goldstein ND, Quick H, Burstyn I. Effect of adjustment for case misclassification and infection date uncertainty on estimates of COVID-19 effective reproduction number. Epidemiology. 2021;32(6):800–806. doi: 10.1097/EDE.0000000000001402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Burstyn I, Goldstein ND, Gustafson P. Towards reduction in bias in epidemic curves due to outcome misclassification through Bayesian analysis of time-series of laboratory test results: case study of COVID-19 in Alberta, Canada and Philadelphia, USA. BMC Med Res Methodol. 2020;20(1):146. doi: 10.1186/s12874-020-01037-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Burstyn I, Goldstein ND, Gustafson P. It can be dangerous to take epidemic curves of COVID-19 at face value. Can J Public Health. 2020;111(3): 397–400. doi: 10.17269/s41997-020-00367-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Yuan HY, Blakemore C. The impact of multiple non-pharmaceutical interventions on controlling COVID-19 outbreak without lockdown in Hong Kong: a modelling study. Lancet Reg Health West Pac. 2022;20:100343. doi: 10.1016/j.lanwpc.2021.100343. [DOI] [PMC free article] [PubMed] [Google Scholar]