2017 Jan 10;2017:baw163. doi: 10.1093/database/baw163

Table 1.

Definitions of ‘duplicate’ in genomic databases from 2009 to 2015

Database | Domain | Interpretation of the term ‘duplicate’
(29) | biomolecular interaction network | repeated protein-protein, protein-DNA or gene-gene interactions; the same interactions appearing in different organism-specific files
(30) | gene annotation | (near) identical genes; fragments; incomplete gene duplication; different stages of gene duplication
(31) | gene annotation | near-identical or identical coding genes
(32) | gene annotation | the same gene expression measurements on different tissues
(33) | genome characterization | records with the same metadata; the same records with inconsistent metadata; the same or inconsistent record submissions
(34) | genome characterization | creating a new record with the configuration of a selected record
(35) | ligand for drug discovery | records with multiple synonyms; for example, the same entries for TR4 (Testicular Receptor 4), some of which use the synonym TAK1 (a shared name) rather than TR4
(36) | peptidase cleavages | cleavages mapped to the wrong residues or sequences

Databases in the same domain, for example gene annotation, may be specialized for different perspectives, such as annotations on genes in different organisms or different functions, but they arguably belong to the same broad domain.
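
To make one of these definitions concrete, the synonym case in entry (35) can be sketched in Python. This is only an illustration under stated assumptions, not a method taken from any of the cited databases: the synonym map, the record identifiers and the function names are all hypothetical.

```python
from collections import defaultdict

# Hypothetical synonym map (alias -> canonical name), for illustration only.
SYNONYMS = {"TAK1": "TR4"}


def canonical_name(name, synonyms=SYNONYMS):
    """Return the canonical form of a target name (case-insensitive lookup)."""
    key = name.strip().upper()
    return synonyms.get(key, key)


def group_synonym_duplicates(records, synonyms=SYNONYMS):
    """Group record IDs by canonical target name; keep groups with more than one record."""
    groups = defaultdict(list)
    for record_id, target in records:
        groups[canonical_name(target, synonyms)].append(record_id)
    return {name: ids for name, ids in groups.items() if len(ids) > 1}


# Hypothetical (record ID, target name) pairs.
records = [("L001", "TR4"), ("L002", "TAK1"), ("L003", "ERalpha")]
duplicates = group_synonym_duplicates(records)
```

Because the table notes that TAK1 is a shared name, a blind mapping like this could merge unrelated records, so the groups it returns are better treated as candidates for manual review than as confirmed duplicates.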