Proceedings of the AMIA Annual Fall Symposium. 1996:542–546.

Identification of suspected tuberculosis patients based on natural language processing of chest radiograph reports.

N L Jain, C A Knirsch, C Friedman, G Hripcsak
PMCID: PMC2233236  PMID: 8947725

Abstract

Identifying eligible patients from electronically available patient data is a key difficulty in computerizing clinical practice guidelines because much of the relevant data is stored as free text. We have been using MedLEE (Medical Language Extraction and Encoding System), a natural language processing system, to encode the clinical information in all chest radiograph and mammogram reports. This paper describes a retrospective study to determine whether MedLEE can identify patients at risk for having tuberculosis (TB) based on their admission chest radiographs. Reports of 171 adult inpatients with culture-positive TB during 1992 and 1993 were manually coded by a TB specialist using seven terms suggestive of TB, and were also encoded by MedLEE. Using manual coding as the gold standard, MedLEE agreed on the classification of 152/171 (88.9%) reports: 129/142 (90.8%) suspicious for TB and 23/29 (79.3%) not suspicious for TB. At the term level, it agreed on 1072/1197 (89.6%) terms indicative of TB. Analysis showed that most of the discrepancies were caused by MedLEE not finding the location of the infiltrate. When the location of the infiltrate was ignored, agreement rose to 157/171 (91.8%) reports and 946/1026 (92.2%) terms. Thus, natural language processing offers a practical alternative for using free-text reports to determine patient eligibility for computerized clinical practice guidelines.
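The agreement figures in the abstract can be recomputed from the reported counts. The following is a minimal sketch, not the authors' analysis code: the counts come from the abstract, while the dictionary labels and the function names `percent` and `summarize` are illustrative assumptions. It also checks that the term denominators equal 171 × 7 and 171 × 6, which is consistent with seven terms per report and six once infiltrate location is ignored.

```python
# Counts (agreed, total) taken from the abstract; labels are illustrative.
COUNTS = {
    "reports, all": (152, 171),
    "reports, suspicious for TB": (129, 142),
    "reports, not suspicious for TB": (23, 29),
    "terms, all": (1072, 1197),
    "reports, location ignored": (157, 171),
    "terms, location ignored": (946, 1026),
}


def percent(agreed, total):
    """Return agreement as a percentage rounded to one decimal place."""
    return round(100 * agreed / total, 1)


def summarize(counts):
    """Map each label to its agreement percentage."""
    return {label: percent(a, t) for label, (a, t) in counts.items()}


# Internal consistency of the reported counts.
assert 129 + 23 == 152 and 142 + 29 == 171  # subgroups sum to the totals
assert 171 * 7 == 1197  # seven terms per report
assert 171 * 6 == 1026  # six terms once location is dropped

if __name__ == "__main__":
    for label, pct in summarize(COUNTS).items():
        agreed, total = COUNTS[label]
        print(f"{label}: {agreed}/{total} = {pct}%")
```

Because manual coding is treated as the gold standard, the 129/142 and 23/29 figures can be read as sensitivity-like and specificity-like rates for the "suspicious for TB" label, although the abstract itself reports them only as agreement.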



Articles from Proceedings of the AMIA Annual Fall Symposium are provided here courtesy of American Medical Informatics Association
