Skip to main content
Journal of Urban Health : Bulletin of the New York Academy of Medicine logoLink to Journal of Urban Health : Bulletin of the New York Academy of Medicine
. 2003 Mar;80(Suppl 1):i57–i65. doi: 10.1007/PL00022316

Biosurveillance applying scan statistics with multiple, disparate data sources

Howard S Burkom 1
PMCID: PMC3456540  PMID: 12791780

Abstract

Researchers working on the Department of Defense Global Emerging Infections System (DoD-GEIS) pilot system, the Electronic Surveillance System for the Early Notification of Community-Based Epidemics (ESSENCE), have applied scan statistics for early outbreak detection using hoth traditional and nontraditional data sources. These sources include medical data indexed byInternational Classification of Disease, 9th Revision (ICD-9) diagnosis codes, as well as less-specific, but potentially timelier, indicators such as records of over-the-counter remedy sales and of school absenteeism. Early efforts employed the Kulldorff scan statistic as implemented in the SaTScan software of the National Cancer Institute. A key obstacle to this application is that the input data streams are typically based on time-varying factors, such as consumer behavior, rather than simply on the populations of the component subregions. We have used both modeling and recent historical data distributions to obtain background spatial distributions. Data analyses have provided guidance on how to condition and model input data to avoid excessive clustering. We have used this methodology in combining data sources for both retrospective studies of known outbreaks and surveillance of high-profile events of concern to local public health authorities. We have integrated the scan statistic capability into a Microsoft Access-based system in which we may include or exclude data sources, vary time windows separately for different data sources, censor data from subsets of individual providers or subregions, adjust the background computation method, and run retrospective or simulated studies.

Keywords: Biosurveillance, Clustering, Kulldorff, Scan statistics

Full Text

The Full Text of this article is available as a PDF (200.3 KB).

References

  • 1.Kulldorff M. Spatial scan statistics: models, calculations, and applications. In: Glaz J, Balakrishnan N, editors. Scan Statistics and Applications. Boston, MA: Birkhauser; 1999. pp. 303–322. [Google Scholar]
  • 2.Kulldorff M.SaTScan [computer program]. Version 2. Available at: http://srab.cancer.gov/satscan/. Accessed on: September 1, 2002.
  • 3.Kulldorff M. A spatial scan statistic. Commun Stat Theory Meth. 1999;26:1481–1496. [Google Scholar]
  • 4.Swets JA, Pickett RM. Evaluation of Diagnostic Systems: Methods From Signal Detection Theory. New York, NY: Academic Press; 1982. [Google Scholar]
  • 5.Edgington ES. A normal curve method for combining probability values from independent experiments. J Psychol. 1972;82:85–89. [Google Scholar]

Articles from Journal of Urban Health : Bulletin of the New York Academy of Medicine are provided here courtesy of New York Academy of Medicine

RESOURCES