Table 7.
A comparison of popular knowledge resources typically used in TM for the life sciences
Name | Type | Domain | Size | Format | License |
---|---|---|---|---|---|
UniProt<sup>a</sup> | Knowledge base | Proteomics | 63 million sequences | Own, RDF, FASTA | CC |
UMLS<sup>b</sup> | Thesaurus | Biomedical | 3.2 million concepts | Own | Proprietary |
Gene Ontology<sup>c</sup> | Ontology | Genetics | 44 000 terms | OBO | CC |
Agrovoc<sup>d</sup> | Thesaurus | Agriculture | 32 000 concepts | RDF | CC |
HPO<sup>e</sup> | Vocabulary | Human phenotype | 10 000 terms | OBO, OWL, RDF | Free to use |
CNO<sup>f</sup> | Vocabulary | Neuroscience | 395 classes | OWL, RDF | CC |
CARO<sup>g</sup> | Ontology | Anatomy | 96 classes | OBO, OWL | Unspecified |
These resources differ in type, domain and intended use. Because their base elements also differ (sequences, concepts, terms or classes), their sizes are difficult to compare directly. Nonetheless, the table is ordered approximately by size, from largest to smallest.