Abstract
Vital records data bases describe large populations over long periods of time, yet their organization and size often preclude or discourage their use. We constructed a microcomputer-based data base of all singleton births in Massachusetts, 1975-84. The original data were stored in 700,000 records, each 174 bytes long, occupying a total of over 120 megabytes (MB). By removing redundant information and unique identifiers, and packing the data, we store 21 fields of this information in a 16-byte record resulting in a data base of 11.1 MB, a saving of over 90 percent of disk space. By using programs written expressly for this data base, we can display a birth weight frequency plot of the entire data set in under 65 seconds on an IBM-compatible PC-AT. Comparable assessments in SAS-PC took over 105 minutes and in main frame SAS on an AS-9000 took over 37 CPU seconds. Implementing similar systems for state registries on births, deaths, cancers, and birth defects potentially offers investigators easy access to vast stores of information and would enable public health officials to produce timely reports, initiate a variety of surveillance activities, and respond rapidly to residents' inquiries about clusters and anomalous disease patterns.
Full text
PDFSelected References
These references are in PubMed. This may not be the complete list of references from this article.
- Archer V. E. Association of nuclear fallout with leukemia in the United States. Arch Environ Health. 1987 Sep-Oct;42(5):263–271. doi: 10.1080/00039896.1987.9935819. [DOI] [PubMed] [Google Scholar]
- Connell F. A., Diehr P., Hart L. G. The use of large data bases in health care studies. Annu Rev Public Health. 1987;8:51–74. doi: 10.1146/annurev.pu.08.050187.000411. [DOI] [PubMed] [Google Scholar]
- Dubrow R., Sestito J. P., Lalich N. R., Burnett C. A., Salg J. A. Death certificate-based occupational mortality surveillance in the United States. Am J Ind Med. 1987;11(3):329–342. doi: 10.1002/ajim.4700110309. [DOI] [PubMed] [Google Scholar]
- Gittelsohn A. M. On the distribution of underlying causes of death. Am J Public Health. 1982 Feb;72(2):133–140. doi: 10.2105/ajph.72.2.133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Milne K. L., Sandler D. P., Everson R. B., Brown S. M. Lung cancer and occupation in Alameda County: a death certificate case-control study. Am J Ind Med. 1983;4(4):565–575. doi: 10.1002/ajim.4700040410. [DOI] [PubMed] [Google Scholar]
- Thomas T. L., Fontham E. T., Norman S. A., Stemhagen A., Hoover R. N. Occupational risk factors for brain tumors. A case-referent death-certificate analysis. Scand J Work Environ Health. 1986 Apr;12(2):121–127. doi: 10.5271/sjweh.2168. [DOI] [PubMed] [Google Scholar]