Table 1.
Database | Domain | Interpretation of the term ‘duplicate’ |
---|---|---|
(29) | biomolecular interaction network | repeated interactions between protein to protein, protein to DNA, gene to gene; same interactions but in different organism-specific files |
(30) | gene annotation | (near) identical genes; fragments; incomplete gene duplication; and different stages of gene duplication |
(31) | gene annotation | near or identical coding genes |
(32) | gene annotation | same measurements on different tissues for gene expression |
(33) | genome characterization | records with same meta data; same records with inconsistent meta data; same or inconsistent record submissions |
(34) | genome characterization | create a new record with the configuration of a selected record |
(35) | ligand for drug discovery | records with multiple synonyms; for example, same entries for TR4 (Testicular Receptor 4) but some used a synonym TAK1 (a shared name) rather than TR4 |
(36) | peptidase cleavages | cleavages being mapped into wrong residues or sequences |
Databases in the same domain, for example gene annotation, may be specialized for different perspectives, such as annotations on genes in different organisms or different functions, but they arguably belong to the same broad domain.