Comparison of Methods for Evaluation of Medical Terminological Systems

DGT Arts; R Cornet; E de Jonge; NF de Keizer

. 2003;2003:779.

Comparison of Methods for Evaluation of Medical Terminological Systems

DGT Arts ^1,², R Cornet ¹, E de Jonge ², NF de Keizer ¹

PMCID: PMC1480163 PMID: 14728284

Abstract

The importance of terminological systems (TS) for the medical domain is widely recognized. The usability of such a system depends primarily on its content. We have designed four methods to evaluate the content of TS and applied them in a case study.

INTRODUCTION

Several developments in health care have led to an increase in the need for accurate, detailed and structured registration of medical data. To support standardized and structured registration of patient information many terminological systems (TS) have been and are still being developed. A TS relates concepts and terms belonging to a specific domain ¹. The aspects of TS that can be evaluated and the methods that can be applied are various. This study provides a comparison of four methods for evaluation of the content of TS by applying them in a case study on a TS for the domain of intensive care, DICE (Diagnoses for Intensive Care Evaluation).

METHODS

As in most recent publications examining TS we will focus on the evaluation of the content of TS. ‘Content’ here refers to the concepts that are included in the TS, their terms and (hierarchical and non-hierarchical) relations. We have designed four methods for the evaluation of the domain completeness and the correctness of the content of TS and applied them in a case study on DICE.

Method A: Field testing

The TS is used in real practice by the intended users of the system, e.g. physicians, to register patient infomation. The user assigns a ‘match score’ to each coded concept, indicating the extent to which it matches the concept he has in mind.

Method B: Research queries

A set of concepts, e.g. clustering criteria, is obtained from studies that are related to the domain of the TS. The concepts are given a ‘match score’ indicating the extent to which they are represented in the content and structure of the TS.

Method C: Description logics

The semantics of the TS are represented using a description logic (DL) formalism. Reasoning with the knowledge in DL reveals inconsistencies that lead to the uncovering of missing or incorrect relations.

Method D: Expert knowledge

Experts in the domain of the TS manually review the concepts and their relations and tems.

EXPERIENCES

Table 1 displays the extent to which each method uncovered incompleteness or incorrectness in the content of the TS. Methods A and B differ only in the way of retrieving the concepts that are being matched to the TS. Method B appeared to be more suitable to retrieve rare concepts. Disadvantage of method B was that clustering criteria are often vaguely described. It appeared to be a time consuming effort to uncover the causes of the inconsistencies that were identified by method C. However this method has the potential to support maintenance and evaluation of TS through an automated approach. Method D, although time consuming, appeared to be very valuable, since concepts that were included in this TS had not yet been evaluated by an editorial board.

Table 1.

The extent to which each method identified the types of incompleteness and incorrectness.

		A	B	C	D
Incomplete	Concepts	+	+	−	+/−
	Terms	+/−	+/−	−	+/−
	Non-hierarchical rel.	+/−	+/−	+	+
	Hierarchical rel.	−	−	+	+/−
Incorrect	Terms	+/−	+/−	−	+
	Non-hierarchical rel.	−	−	+	+
	Hierarchical rel.	−	−	+	+

Open in a new tab

CONCLUSION

All four methods appear to be valuable and feasible for evaluating the content of TS. Which method to apply depends on the focus of the evaluation. Table 1 can be a useful reference. Research on other issues, such as user interface and maintenance is ongoing.

Footnotes

This study was supported by the Dutch Intensive Care Evaluation (NICE) foundation. (www.stichting-nice.nl)

REFERENCES

1.de Keizer NF, Abu-Hanna A, Zwetsloot-Schonk JH. Understanding terminological systems I: Terminology and typology. Methods Inf Med. 2000 Mar;39(1):16–21. [PubMed] [Google Scholar]

[b1-163] 1.de Keizer NF, Abu-Hanna A, Zwetsloot-Schonk JH. Understanding terminological systems I: Terminology and typology. Methods Inf Med. 2000 Mar;39(1):16–21. [PubMed] [Google Scholar]

PERMALINK

Comparison of Methods for Evaluation of Medical Terminological Systems

DGT Arts, MSc

R Cornet, MSc

E de Jonge, PhD

NF de Keizer, PhD

Abstract

INTRODUCTION

METHODS

Method A: Field testing

Method B: Research queries

Method C: Description logics

Method D: Expert knowledge

EXPERIENCES

Table 1.

CONCLUSION

Footnotes

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Comparison of Methods for Evaluation of Medical Terminological Systems

DGT Arts, MSc

R Cornet, MSc

E de Jonge, PhD

NF de Keizer, PhD

Abstract

INTRODUCTION

METHODS

Method A: Field testing

Method B: Research queries

Method C: Description logics

Method D: Expert knowledge

EXPERIENCES

Table 1.

CONCLUSION

Footnotes

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases