Skip to main content
Proceedings of the AMIA Symposium logoLink to Proceedings of the AMIA Symposium
. 2001:746–750.

Developing a test collection for biomedical word sense disambiguation.

M Weeber 1, J G Mork 1, A R Aronson 1
PMCID: PMC2243574  PMID: 11825285

Abstract

Ambiguity, the phenomenon that a word has more than one sense, poses difficulties for many current Natural Language Processing (NLP) systems. Algorithms that assist in the resolution of these ambiguities, i.e. which make unambiguous a word, or more generally, a text string, will boost performance of these systems. To test such techniques in the biomedical language domain, we have developed a Word Sense Disambiguation (WSD) test collection that comprises 5,000 unambiguous instances for 50 ambiguous UMLS Metathesaurus strings.

Full text

PDF
746

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Aronson A. R., Bodenreider O., Chang H. F., Humphrey S. M., Mork J. G., Nelson S. J., Rindflesch T. C., Wilbur W. J. The NLM Indexing Initiative. Proc AMIA Symp. 2000:17–21. [PMC free article] [PubMed] [Google Scholar]
  2. Aronson A. R. The effect of textual variation on concept based information retrieval. Proc AMIA Annu Fall Symp. 1996:373–377. [PMC free article] [PubMed] [Google Scholar]
  3. Friedman C. A broad-coverage natural language processing system. Proc AMIA Symp. 2000:270–274. [PMC free article] [PubMed] [Google Scholar]
  4. Nadkarni P., Chen R., Brandt C. UMLS concept indexing for production databases: a feasibility study. J Am Med Inform Assoc. 2001 Jan-Feb;8(1):80–91. doi: 10.1136/jamia.2001.0080080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Rindflesch T. C., Aronson A. R. Ambiguity resolution while mapping free text to the UMLS Metathesaurus. Proc Annu Symp Comput Appl Med Care. 1994:240–244. [PMC free article] [PubMed] [Google Scholar]
  6. Swanson D. R. Migraine and magnesium: eleven neglected connections. Perspect Biol Med. 1988 Summer;31(4):526–557. doi: 10.1353/pbm.1988.0009. [DOI] [PubMed] [Google Scholar]
  7. Weeber M., Klein H., Aronson A. R., Mork J. G., de Jong-van den Berg L. T., Vos R. Text-based discovery in biomedicine: the architecture of the DAD-system. Proc AMIA Symp. 2000:903–907. [PMC free article] [PubMed] [Google Scholar]

Articles from Proceedings of the AMIA Symposium are provided here courtesy of American Medical Informatics Association

RESOURCES