Skip to main content
Comparative and Functional Genomics logoLink to Comparative and Functional Genomics
. 2004 Dec;5(8):648–654. doi: 10.1002/cfg.445

Overview and Utilization of the NCI Thesaurus

Gilberto Fragoso 1,, Sherri de Coronado 1, Margaret Haber 2, Frank Hartel 1, Larry Wright 2
PMCID: PMC2447470  PMID: 18629178

Abstract

The NCI Thesaurus is a reference terminology covering areas of basic and clinical science, built with the goal of facilitating translational research in cancer. It contains nearly 110 000 terms in approximately 36000 concepts, partitioned in 20 subdomains, which include diseases, drugs, anatomy, genes, gene products, techniques, and biological processes, among others, all with a cancer-centric focus in content, and originally designed to support coding activities across the National Cancer Institute. Each concept represents a unit of meaning and contains a number of annotations, such as synonyms and preferred name, as well as annotations such as textual definitions and optional references to external authorities. In addition, concepts are modelled with description logic (DL) and defined by their relationships to other concepts; there are currently approximately 90 types of named relations declared in the terminology. The NCI Thesaurus is produced by the Enterprise Vocabulary Services project, a collaborative effort between the NCI Center for Bioinformatics and the NCI Office of Communications, and is part of the caCORE infrastructure stack (http://ncicb.nci.nih.gov/NCICB/core). It can be accessed programmatically through the open caBIO API and browsed via the web (http://nciterms.nci.nih.gov). A history of editing changes is also accessible through the API. In addition, the Thesaurus is available for download in various file formats, including OWL, the web ontology language, to facilitate its utilization by others.

Full Text

The Full Text of this article is available as a PDF (142.2 KB).

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Bakken S., Parker J., Konicek D., Campbell K. E. An evaluation of ICNP intervention axes as terminology model components. Proc AMIA Symp. 2000:42–46. [PMC free article] [PubMed] [Google Scholar]
  2. Campbell K. E., Cohn S. P., Chute C. G., Rennels G., Shortliffe E. H. Gálapagos: computer-based support for evolution of a convergent medical terminology. Proc AMIA Annu Fall Symp. 1996:269–273. [PMC free article] [PubMed] [Google Scholar]
  3. Campbell K. E., Cohn S. P., Chute C. G., Shortliffe E. H., Rennels G. Scalable methodologies for distributed development of logic-based convergent medical terminology. Methods Inf Med. 1998 Nov;37(4-5):426–439. [PubMed] [Google Scholar]
  4. Covitz Peter A., Hartel Frank, Schaefer Carl, De Coronado Sherri, Fragoso Gilberto, Sahni Himanso, Gustafson Scott, Buetow Kenneth H. caCORE: a common infrastructure for cancer informatics. Bioinformatics. 2003 Dec 12;19(18):2404–2412. doi: 10.1093/bioinformatics/btg335. [DOI] [PubMed] [Google Scholar]
  5. Hartel Francis W., Fragoso Gilberto, Ong Kim L., Dionne Robert. Enhancing quality of retrieval through concept edit history. AMIA Annu Symp Proc. 2003:279–283. [PMC free article] [PubMed] [Google Scholar]

Articles from Comparative and Functional Genomics are provided here courtesy of Wiley

RESOURCES