Abstract
An effective and efficient learning method, Expert Network (ExpNet), is introduced in this paper. ExpNet predicts the related categories of an arbitrary text based on a search of its nearest neighbors in a set of training texts, and a reasoning from the expert-assigned categories of these neighbors. Evaluations in patient-record text classification and MEDLINE document indexing show a performance of ExpNet in recall and precision comparable to the Linear Least Squares Fit (LLSF) mapping method, and significantly better than other methods tested. We also observed that ExpNet is much more efficient than LLSF in computation. The total training and testing time on the patient-record text collection (6134 texts) was 4 minutes for ExpNet versus 96 minutes for LLSF; on the MEDLINE document collection (2344 documents), the total time was 15 minutes for ExpNet versus 4.6 hours for LLSF. It is evident in this study that human knowledge of text categorization can be statistically learned without expensive computation, and that ExpNet is such a solution.
Full text
PDF




Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Haynes R. B., McKibbon K. A., Walker C. J., Ryan N., Fitzgerald D., Ramsden M. F. Online access to MEDLINE in clinical settings. A study of use and usefulness. Ann Intern Med. 1990 Jan 1;112(1):78–84. doi: 10.7326/0003-4819-112-1-78. [DOI] [PubMed] [Google Scholar]
- Hersh W., Hickam D. H., Haynes R. B., McKibbon K. A. Evaluation of SAPHIRE: an automated approach to indexing and retrieving medical literature. Proc Annu Symp Comput Appl Med Care. 1991:808–812. [PMC free article] [PubMed] [Google Scholar]
- Salton G. Developments in automatic text retrieval. Science. 1991 Aug 30;253(5023):974–980. doi: 10.1126/science.253.5023.974. [DOI] [PubMed] [Google Scholar]
- Yang Y., Chute C. G. Words or concepts: the features of indexing units and their optimal use in information retrieval. Proc Annu Symp Comput Appl Med Care. 1993:685–689. [PMC free article] [PubMed] [Google Scholar]