Table 2. Real datasets of DNA motifs
| Database | Link | Description |
|---|---|---|
| TRANSFAC | http://gene-regulation.com/pub/databases.html | A database of eukaryotic TFs, their genomic binding sites, and DNA-binding profiles |
| JASPAR | http://jaspar.genereg.net/ | A public dataset of motifs for multicellular eukaryotes |
| PROSITE | http://prosite.expasy.org/ | PROSITE includes documentation sections describing protein domains, families and functional sites in addition to related patterns and profiles to recognize them |
| YEASTRACT | http://www.yeastract.com/ | It contains predicted TFs for S. cerevisiae |
| SCPD | http://rulai.cshl.edu/SCPD/ | The promoter database of Saccharomyces cerevisiae |
| RegulonDB | http://regulondb.ccg.unam.mx/ | Provides curated information on the transcriptional regulatory network of E. coli and contains both computational and experimental data on predicted objects |
| CisBP | http://cisbp.ccbr.utoronto.ca/ | It contains a list of >160,000 predicted TFs from >300 species |
| DBTBS | http://dbtbs.hgc.jp/ | It contains TFs for Bacillus subtilis |