Validation of clinical problems using a UMLS-based semantic parser

H S Goldberg; C Hsu; V Law; C Safran

. 1998:805–809.

Validation of clinical problems using a UMLS-based semantic parser.

H S Goldberg ¹, C Hsu ¹, V Law ¹, C Safran ¹

PMCID: PMC2232212 PMID: 9929330

Abstract

The capture and symbolization of data from the clinical problem list facilitates the creation of high-fidelity patient resumes for use in aggregate analysis and decision support. We report on the development of a UMLS-based semantic parser and present a preliminary evaluation of the parser in the recognition and validation of disease-related clinical problems. We randomly sampled 20% of the 26,858 unique non-dictionary clinical problems entered into OMR (Online Medical Record) between 1989 and August, 1997, and eliminated a series of qualified problem labels, e.g., history-of, to obtain a dataset of 4122 problem labels. Within this dataset, the authors identified 2810 labels (68.2%) as referring to a broad range of disease-related processes. The parser correctly recognized and validated 1398 of the 2810 disease-related labels (49.8 +/- 1.9%) and correctly excluded 1220 of 1312 non-disease-related labels (93.0 +/- 1.4%). 812 of the 1181 match failures (68.8%) were caused by terms either absent from UMLS or modifiers not accepted by the parser; 369 match failures (31.2%) were caused by labels having patterns not recognized by the parser. By enriching the UMLS lexicon with terms commonly found in provider-entered labels, it appears that performance of the parser can be significantly enhanced over a few subsequent iterations. This initial evaluation provides a foundation from which to make principled additions to the UMLS lexicon locally for use in symbolizing clinical data; further research is necessary to determine applicability to other health care settings.

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

Chute C. G., Elkin P. L. A clinically derived terminology: qualification to reduction. Proc AMIA Annu Fall Symp. 1997:570–574. [PMC free article] [PubMed] [Google Scholar]
Claus P. L., Carpenter P. C., Chute C. G., Mohr D. N., Gibbons P. S. Clinical care management and workflow by episodes. Proc AMIA Annu Fall Symp. 1997:91–95. [PMC free article] [PubMed] [Google Scholar]
Safran C., Rury C., Rind D. M., Taylor W. C. A computer-based outpatient medical record for a teaching hospital. MD Comput. 1991 Sep-Oct;8(5):291–299. [PubMed] [Google Scholar]
Sager N., Lyman M., Bucknall C., Nhan N., Tick L. J. Natural language processing and the representation of clinical data. J Am Med Inform Assoc. 1994 Mar-Apr;1(2):142–160. doi: 10.1136/jamia.1994.95236145. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zelingher J., Rind D. M., Caraballo E., Tuttle M. S., Olson N. E., Safran C. Categorization of free-text problem lists: an effective method of capturing clinical data. Proc Annu Symp Comput Appl Med Care. 1995:416–420. [PMC free article] [PubMed] [Google Scholar]

[OCR_00530] Chute C. G., Elkin P. L. A clinically derived terminology: qualification to reduction. Proc AMIA Annu Fall Symp. 1997:570–574. [PMC free article] [PubMed] [Google Scholar]

[OCR_00560] Claus P. L., Carpenter P. C., Chute C. G., Mohr D. N., Gibbons P. S. Clinical care management and workflow by episodes. Proc AMIA Annu Fall Symp. 1997:91–95. [PMC free article] [PubMed] [Google Scholar]

[OCR_00519] Safran C., Rury C., Rind D. M., Taylor W. C. A computer-based outpatient medical record for a teaching hospital. MD Comput. 1991 Sep-Oct;8(5):291–299. [PubMed] [Google Scholar]

[OCR_00525] Sager N., Lyman M., Bucknall C., Nhan N., Tick L. J. Natural language processing and the representation of clinical data. J Am Med Inform Assoc. 1994 Mar-Apr;1(2):142–160. doi: 10.1136/jamia.1994.95236145. [DOI] [PMC free article] [PubMed] [Google Scholar]

[OCR_00545] Zelingher J., Rind D. M., Caraballo E., Tuttle M. S., Olson N. E., Safran C. Categorization of free-text problem lists: an effective method of capturing clinical data. Proc Annu Symp Comput Appl Med Care. 1995:416–420. [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Validation of clinical problems using a UMLS-based semantic parser.

H S Goldberg

C Hsu

V Law

C Safran

Abstract

Full text

Selected References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Validation of clinical problems using a UMLS-based semantic parser.

H S Goldberg

C Hsu

V Law

C Safran

Abstract

Full text

Selected References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases