Abstract
The entire collection of 11.5 million MEDLINE abstracts was processed to extract 549 million noun phrases using a shallow syntactic parser. English language strings in the 2002 and 2001 releases of the UMLS Metathesaurus were then matched against these phrases using flexible matching techniques. 34% of the Metathesaurus names (occurring in 30% of the concepts) were found in the titles and abstracts of articles in the literature. The matching concepts are fairly evenly chemical and non-chemical in nature and span a wide spectrum of semantic types. This paper details the approach taken and the results of the analysis.
Full text
PDF![727](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e07/2244184/e77294866509/procamiasymp00001-0768.png)
![728](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e07/2244184/50e3720c2a9e/procamiasymp00001-0769.png)
![729](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e07/2244184/2168c980eacc/procamiasymp00001-0770.png)
![730](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e07/2244184/20ff0944cfb4/procamiasymp00001-0771.png)
![731](https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1e07/2244184/31afcc476289/procamiasymp00001-0772.png)
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Aronson A. R., Rindflesch T. C. Query expansion using the UMLS Metathesaurus. Proc AMIA Annu Fall Symp. 1997:485–489. [PMC free article] [PubMed] [Google Scholar]
- Bennett N. A., He Q., Powell K., Schatz B. R. Extracting noun phrases for all of MEDLINE. Proc AMIA Symp. 1999:671–675. [PMC free article] [PubMed] [Google Scholar]
- Kim W., Wilbur W. J. Corpus-based statistical screening for phrase identification. J Am Med Inform Assoc. 2000 Sep-Oct;7(5):499–511. doi: 10.1136/jamia.2000.0070499. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McCray A. T., Burgun A., Bodenreider O. Aggregating UMLS semantic types for reducing conceptual complexity. Stud Health Technol Inform. 2001;84(Pt 1):216–220. [PMC free article] [PubMed] [Google Scholar]
- van Mulligen E. M. UMLS-based access to CPR data. Stud Health Technol Inform. 1998;52(Pt 1):166–170. [PubMed] [Google Scholar]