Table 1. Gene similarities used in the gene-to-gene search and the respective similarity measure.
Similarity | Data source | Similarity measure | |
---|---|---|---|
1. | Homology (SHOM) | Ensembl Compara | Sequence identity |
2. | InterPro protein domain (SIPD) | Ensembl Core | Cosine |
3. | Gene variant related publications (SVP) | Ensembl Variation | Cosine |
4. | Swiss-Prot protein feature (SSPF) | UniProtKB/Swiss-Prot | Cosine |
5. | GO cellular component (SCC) | Ensemble Core | Resnik-BMA |
6. | GO molecular function (SMF) | Ensemble Core | Resnik-BMA |
7. | GO biological process (SBP) | Ensemble Core | Resnik-BMA |
8. | Normal tissue expression profile (SNEX) | Human Protein Atlas | Spearman |
9. | HUGO gene symbol (SHGS) | HGNC | Prefix distance |