Skip to main content
Proceedings of the AMIA Symposium logoLink to Proceedings of the AMIA Symposium
. 1998:805–809.

Validation of clinical problems using a UMLS-based semantic parser.

H S Goldberg 1, C Hsu 1, V Law 1, C Safran 1
PMCID: PMC2232212  PMID: 9929330

Abstract

The capture and symbolization of data from the clinical problem list facilitates the creation of high-fidelity patient resumes for use in aggregate analysis and decision support. We report on the development of a UMLS-based semantic parser and present a preliminary evaluation of the parser in the recognition and validation of disease-related clinical problems. We randomly sampled 20% of the 26,858 unique non-dictionary clinical problems entered into OMR (Online Medical Record) between 1989 and August, 1997, and eliminated a series of qualified problem labels, e.g., history-of, to obtain a dataset of 4122 problem labels. Within this dataset, the authors identified 2810 labels (68.2%) as referring to a broad range of disease-related processes. The parser correctly recognized and validated 1398 of the 2810 disease-related labels (49.8 +/- 1.9%) and correctly excluded 1220 of 1312 non-disease-related labels (93.0 +/- 1.4%). 812 of the 1181 match failures (68.8%) were caused by terms either absent from UMLS or modifiers not accepted by the parser; 369 match failures (31.2%) were caused by labels having patterns not recognized by the parser. By enriching the UMLS lexicon with terms commonly found in provider-entered labels, it appears that performance of the parser can be significantly enhanced over a few subsequent iterations. This initial evaluation provides a foundation from which to make principled additions to the UMLS lexicon locally for use in symbolizing clinical data; further research is necessary to determine applicability to other health care settings.

Full text

PDF
805

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Chute C. G., Elkin P. L. A clinically derived terminology: qualification to reduction. Proc AMIA Annu Fall Symp. 1997:570–574. [PMC free article] [PubMed] [Google Scholar]
  2. Claus P. L., Carpenter P. C., Chute C. G., Mohr D. N., Gibbons P. S. Clinical care management and workflow by episodes. Proc AMIA Annu Fall Symp. 1997:91–95. [PMC free article] [PubMed] [Google Scholar]
  3. Safran C., Rury C., Rind D. M., Taylor W. C. A computer-based outpatient medical record for a teaching hospital. MD Comput. 1991 Sep-Oct;8(5):291–299. [PubMed] [Google Scholar]
  4. Sager N., Lyman M., Bucknall C., Nhan N., Tick L. J. Natural language processing and the representation of clinical data. J Am Med Inform Assoc. 1994 Mar-Apr;1(2):142–160. doi: 10.1136/jamia.1994.95236145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Zelingher J., Rind D. M., Caraballo E., Tuttle M. S., Olson N. E., Safran C. Categorization of free-text problem lists: an effective method of capturing clinical data. Proc Annu Symp Comput Appl Med Care. 1995:416–420. [PMC free article] [PubMed] [Google Scholar]

Articles from Proceedings of the AMIA Symposium are provided here courtesy of American Medical Informatics Association

RESOURCES