Abstract
Categorization by Reference is a novel text classification technique that examines the existing classifications of the citations found in an as-yet unclassified text to determine what terms should be assigned to that text. The existence of the Medical Subject Headings and MEDLINE make the biomedical domain a prime candidate for application of this technique. We describe our approach and implementation of a prototype, presenting some results of our initial tests. We further discuss refinements that could improve the precision of the technique, and describe its possible use in categorizing portions of the World-Wide Web.
Full text
PDF




Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Bernstein L. M., Williamson R. E. Testing of a natural language retrieval system for a full text knowledge base. J Am Soc Inf Sci. 1984 Jul;35(4):235–247. doi: 10.1002/asi.4630350407. [DOI] [PubMed] [Google Scholar]
