Abstract
Multiple-word biomedical terms may point to concepts in bibliographic citations with greater precision than individual words. The barrier word method detects multiple-word terms at first encounter in narrative text. Words with low biomedical information content (prepositions, articles, etc.) are designated as barrier words; each word sequence occurring between consecutive barrier words is a candidate multiple-word term. In 1407 consecutive titles and abstracts listed under DNA, RECOMBINANT (D13.444.308.460), there were 1,275 barrier words, and 13,548 multiple-word terms were selected. Results demonstrate an effective method for detecting multiple-word terms in molecular biology narrative text.
Full text
PDF




Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Condon E. U. STATISTICS OF VOCABULARY. Science. 1928 Mar 16;67(1733):300–300. doi: 10.1126/science.67.1733.300. [DOI] [PubMed] [Google Scholar]
- Matheson N. W. Medical libraries and computers. The role of medical libraries in medical informatics. West J Med. 1986 Dec;145(6):859–863. [PMC free article] [PubMed] [Google Scholar]
- Moore G. W., Boitnott J. K., Miller R. E., Eggleston J. C., Hutchins G. M. Integrated pathology reporting, indexing, and retrieval system using natural language diagnoses. Mod Pathol. 1988 Jan;1(1):44–50. [PubMed] [Google Scholar]
- Wirth T., Staudt L., Baltimore D. An octamer oligonucleotide upstream of a TATA motif is sufficient for lymphoid-specific promoter activity. Nature. 1987 Sep 10;329(6135):174–178. doi: 10.1038/329174a0. [DOI] [PubMed] [Google Scholar]
