Table 2.
Comparison of the number of plain text values for the fields disease, sex, and tissue with the number of ontology terms obtained by applying our Semantic Annotation Pipeline (SAP) to those values. The annotation ratio is the number of plain text values divided by the number of ontology terms.
Field | Description | Plain text: values | Plain text: examples | Ontology terms: values | Ontology terms: examples (preferred labels) | Annotation ratio |
---|---|---|---|---|---|---|
disease | Disease diagnosed | 1,064 | Lung carcinoma, carcinoma of lung | 261 | lung carcinoma | 4.07 |
sex | Sex of sampled organism | 16 | female, Female, f, F, FEMALE | 2 | female | 8 |
tissue | Type of tissue the sample was taken from | 604 | liver, Liver, liver tissue, liver biopsy, Liver biopsy tissue | 171 | liver | 3.53 |
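
The annotation ratio column can be reproduced from the two count columns. The following is a minimal sketch, assuming the ratio is simply the number of plain text values divided by the number of ontology terms (an assumption that matches all three rows); the names `counts` and `annotation_ratio` are hypothetical and are not part of the SAP.

```python
# Counts copied from Table 2: (plain text values, ontology terms).
counts = {
    "disease": (1064, 261),
    "sex": (16, 2),
    "tissue": (604, 171),
}


def annotation_ratio(plain_values: int, ontology_terms: int) -> float:
    """Average number of plain text values that map to one ontology term.

    Assumption: the ratio is a plain quotient of the two counts.
    """
    return plain_values / ontology_terms


# Compute the ratio for every field in the table.
ratios = {field: annotation_ratio(*pair) for field, pair in counts.items()}

for field, ratio in ratios.items():
    print(f"{field}: {ratio:.2f}")
```

For disease the quotient is about 4.077, which rounds to 4.08; the 4.07 reported in the table is consistent with truncating to two decimals rather than rounding.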