Abstract
With the goal of providing a comprehensive, high-quality resource for both plant transcription factors (TFs) and their regulatory interactions with target genes, we upgraded plant TF database PlantTFDB to version 4.0 (http://planttfdb.cbi.pku.edu.cn/). In the new version, we identified 320 370 TFs from 165 species, presenting a more comprehensive genomic TF repertoires of green plants. Besides updating the pre-existing abundant functional and evolutionary annotation for identified TFs, we generated three new types of annotation which provide more directly clues to investigate functional mechanisms underlying: (i) a set of high-quality, non-redundant TF binding motifs derived from experiments; (ii) multiple types of regulatory elements identified from high-throughput sequencing data; (iii) regulatory interactions curated from literature and inferred by combining TF binding motifs and regulatory elements. In addition, we upgraded previous TF prediction server, and set up four novel tools for regulation prediction and functional enrichment analyses. Finally, we set up a novel companion portal PlantRegMap (http://plantregmap.cbi.pku.edu.cn) for users to access the regulation resource and analysis tools conveniently.
INTRODUCTION
Transcription factors (TFs) temporarily and spatially turn on or off the transcription of their target genes through binding certain upstream elements. After the first attempt to identify the TF repertoire in Arabidopsis thaliana genome in 2000 (1), several dedicated TF databases for either individual or multiple species (2–10) have been publicly available, effectively advancing the study of plant transcriptional regulatory system. In response to the demand of systematical identification and annotation of plant TFs, we have developed three species-specific TF databases (DATF (11) for Arabidopsis, DRTF (12) for rice, and DPTF (13) for poplar) and three versions of comprehensive plant TF database (PlantTFDB) (14–17) during the past decade. These resources receive tens of millions web hits annually and have been widely used for the functional and evolutionary study of plant TFs and the identification and annotation of TFs in newly sequenced plants (18,19).
The regulatory interactions between TFs and target genes provide crucial information to investigate functional mechanisms of TFs. Recently, the DNA binding motifs of hundreds of plant TFs have been determined (20–22), combined with genomic regulatory elements identified from ChIP-seq and DNase-seq (23), providing us an opportunity to systematically map downstream regulatory interactions of TFs. To meet the urgent demand of our user community for transcriptional regulation information, we upgraded PlantTFDB to version 4.0 (http://planttfdb.cbi.pku.edu.cn/). Compared with previous versions, PlantTFDB 4.0 covers more species and more TFs with updated abundant functional and evolutionary annotation. Moreover, three new types of annotation including binding motifs of TFs, regulatory elements and interactions between them have been added. In addition, we upgraded previous TF prediction server, and set up four novel tools for regulation prediction and functional enrichment analyses (Figure 1). Finally, we set up a new portal PlantRegMap (http://plantregmap.cbi.pku.edu.cn) for users to access the regulation resource and analysis tools conveniently.
Figure 1.
The flowchart for construction of PlantTFDB 4.0. Items marked with ‘(W)’ are web services. RBH means BLAST reciprocal best hit, and OG means orthologous group.
We believe that PlantTFDB 4.0 provides users a comprehensive resource of plant TFs, regulatory elements and regulatory interactions between them as well as useful analysis tools, advancing the study in the community.
RESULTS AND DISCUSSION
More representative genomic TF repertoires of green plants
By incorporating 89 newly sequenced genomes released during past three years, we totally collected 156 species with genome sequences. Using our established TF prediction pipeline (16) and protein sequences downloaded from Joint Genome Institute and other plant genome sequencing and annotation projects (see http://planttfdb.cbi.pku.edu.cn/help_datasrc.php for detailed data source), we identified 315 099 TFs (266 849 loci) from 6 381 676 proteins (5 555 839 loci) of 156 species (Supplementary Table S1). In addition, 5271 TFs from 9 species in version 3.0 remaining without genome sequences were kept.
By incorporating the 89 new species with genome sequences, we provide genomic TF repertoires covering two more early-diverged lineages, charophyta and marchantiophyta, of green plants (Figure 2A) and present a refined origination profiling of TF families (Figure 2B) than previous studies (24). In addition, the inclusion of more wild species for domestic species (e.g., Oryza rufipogon and Oryza nivara for Oryza sativa) also facilitates the study of crop domestication at transcriptional regulation level and further crop breeding.
Figure 2.
Summary of TFs identified from 156 species with genome sequences. (A) Average number of TFs in different taxonomic lineages. (B) The origination stage of TF families. The number in the parentheses (e.g., Eudicots (95)) is the number of species used to date the origination stage for TF families according to the protocol described in (24).
Updated pipelines for evolutionary and functional annotation
As the continuous effort to improve annotation coverage and quality, we not only updated all existing annotation from multiple canonical sources (24–27) but also refined the annotation pipelines for Orthologous Group (OG) inference and Gene Ontology (GO) term assignment.
Orthologous genes usually possess similar function and are widely used to infer function interspecies. Given the recent reported methodology bias for OrthoMCL (28), we employed a new tool OrthoFinder (28) to infer orthologous groups (OGs) for proteins from 156 species with genome sequences (Figure 3), and got 2368 TF OGs for all species, 800 TF OGs for 17 representative species (Supplementary Table S2), 209, 1898, 1261, 1846 and 1900 TF OGs for the clades of Chlorophytae, Monocots, Asterids, Fabids and Malvids, respectively. Meanwhile, we also identified orthologous pairs under a stricter criterion, BLAST reciprocal best hits (RBHs), for all species pairs among 156 species with genome sequences, which serve as a basis for transferring annotation interspecies (e.g., from well-studied model plants like Arabidopsis thaliana to poor-annotated ones).
Figure 3.
Workflow for parsing BLAST reciprocal best hits (RBHs) and inferring orthologous groups.
Gene Ontology is widely used to describe gene function. Taking an integrative strategy by combining sequence-based prediction (29), orthologous-based projection, and collection of existing annotation in canonical sources (25,26), we successfully annotated 93.2% (293 696 of 315 099) TFs in 165 species. Moreover, to provide a comprehensive reference dataset for follow-up functional enrichment analysis, we further annotated 59.3% (3 374 258 of 5 691 627) genes in these species, increasing the coverage of GO annotation to 9.1 times on average (Supplementary Table S3, see Supplementary Text 1 for details on the annotation pipeline).
Genome-wide regulatory maps of green plants
TFs regulate the transcription of their target genes through binding certain cis-elements. Mapping regulatory elements is essential for the identification of regulatory interactions and further functional mechanisms of TFs. High-throughput assays like ChIP-seq (30) and genome-wide TF footprinting (23,31) have been efficiently employed to identify genome-wide TF occupancy sites. Here, we identified 345 920 TF binding peaks for 14 TFs based on 26 public ChIP-seq datasets as well as 4 794 773 TF footprints based on 13 public DNase-seq datasets according to the protocol described in (31) (Table 1, Supplementary Tables S4 and S5 and Supplementary Text 2). As histone modification and nucleosome positioning also provide useful clues for the study of transcriptional regulation, such information is also incorporated (Table 1, Supplementary Tables S6–S8 and Supplementary Text 2). In addition, we set up a genome browser for users to access these regulatory elements conveniently (http://plantregmap.cbi.pku.edu.cn/cis-map.php).
Table 1. Summary of integrated regulation information identified from high-throughput sequencing.
Type | Species | Organ | Peak |
---|---|---|---|
DNase I hypersensitive site | 2 | 10 | 1 026 748 |
DNase I genomic TF footprints | 1 | 4 | 4 794 773 |
Histone modification | 5 | 7 | 765 371 |
Nucleosome positioning | 1 | 2 | 15 348 541 |
TF binding peak (ChIP-seq) | 2 | 4 | 345 920 |
The DNA binding motifs of TFs are crucial to identifying their downstream target genes. During the past three years, binding motifs of hundreds of plant TFs have been determined and publicly available (20–22). We firstly collected 1790 binding motifs derived from experiments from multiple databases (including PlantCistromeDB (20), CIS-BP (21), JASPAR (32), UniPROBE (33), TRANSFAC (public 7.0) (34)), literature (22) and motifs discovered from ChIP-seq peaks using MEME-ChIP (35). For a TF with more than one motif, we manually curated the best one preferentially for the motif determined in vivo and presenting more similarity with other motifs of this TF. After filtering out low-quality ones (these with information content < 4.5), we got 674 non-redundant, high-quality binding motifs, covering 77.6% (45 of 58) TF families in PlantTFDB. To take full advantage of this resource, we projected these motifs to other species using the RBHs identified above (only RBHs belong to the same TF family were used) and adjusted the motifs using the nucleic acid background in promoter regions (TSS −500 bp to +100bp). Finally, we got sets of species-specific binding motifs for 156 species (Figure 4 and Supplementary Table S9).
Figure 4.
Curation and projection of TF binding motifs. (A) Workflow for curation and projection of high-quality TF binding motifs derived from experiments. (B) The visualization of TF binding motifs in PlantTFDB.
The regulatory interactions between TFs and target genes provide the most direct evidences to investigate the functional mechanisms of TFs. We collected transcriptional regulatory interactions in following three ways: (i) through mining transcriptional regulatory interactions from literature and then manually curated each interaction, we got 1431 high-confident transcriptional regulatory interactions, covering 388 TFs from 47 families, in A. thaliana (24), which were integrated in this update; (ii) identified 9340 experiment-based regulatory interactions identified for six TFs with at least two ChIP-seq replications; (iii) in silico predicted 342 000 regulatory interactions by combining TF binding motifs and footprint data (Table 2, Supplementary Tables S4 and S5 and Supplementary Text 2). As the high-quality interactions are mainly limited to A. thaliana, we further inferred regulatory interactions through in silico searching potential binding sites of TFs using binding motifs in gene promoter regions for 132 species with fine annotation of gene positions (Table 2, Supplementary Table S10 and Supplementary Text 2). Users are encouraged to make use of the less reliable interactions combined with other assistant information such as expression information and conserved elements.
Table 2. Summary of the regulatory interactions at the new portal PlantRegMap.
Method | Species | Organ | Interaction |
---|---|---|---|
Literature mining and manual curation | 1 | – | 1 431 |
ChIP-seq | 1 | 2 | 9 340 |
Motif + TF footprint | 1 | 4 | 342 005 |
Motif | 132 | – | 50 850 582 |
Novel online tools for transcriptional regulation prediction and analyses
For users to take full advantage of our rich data and pipelines, we upgraded the TF prediction server and set up four novel servers for TF identification, regulation prediction and functional enrichment analyses (see Supplementary Text 3 for more details).
TF prediction: According to the requirement of users on TF prediction from newly identified transcripts, support for nucleic acid sequences was added. ESTScan (36) is employed to identify CDS regions of input nucleic acid sequences and translate them to protein sequences. In addition, the limitation on the size of uploaded file is relaxed to 100M based on users’ feedback.
Binding site prediction: Based on the sets of species-specific, high-quality TF binding motifs integrated above, FIMO (37) is employed to scan for TF binding sites from the input sequences.
Regulation prediction and enrichment analysis: Different from ‘Binding site prediction’ server, this tool focuses on inferring regulatory interactions between TFs and the input promoters, and searches for the enriched upstream regulators in the input gene set.
GO enrichment: Based on our compiled GO annotation for 165 species generated (see Supplementary Text 1 for details on the annotation pipeline), this tool identifies the significantly enriched GO terms in the input gene set based on Fisher's exact tests (38).
TF enrichment: Based on our pre-computed regulatory interactions, this tool allows user to search for enriched upstream regulators in the input gene set.
Resource availability
For users to access our resource conveniently, we set up a new portal PlantRegMap (http://plantregmap.cbi.pku.edu.cn/) for PlantTFDB (http://planttfdb.cbi.pku.edu.cn/), with PlantTFDB focusing on TF identification and annotation and the new portal for regulation prediction and analyses (Table 3). All of data in our database can be freely accessed and downloaded online.
Table 3. Resources available at the new portal PlantRegMap.
Data | Tool |
---|---|
Regulatory elements | Binding site prediction |
Regulatory interactions | Regulation prediction |
GO annotation for all genes | GO enrichment |
OGs and RBHs for all genes | TF enrichment |
FURTHER DIRECTION
We have upgraded our PlantTFDB to version 4.0 with a new portal PlantRegMap, which provides genomic TF repertoires covering the main lineages of green plants, sets of high-quality binding motifs, genome-wide regulatory interactions and rich analysis tools. The information for TFs and transcriptional regulation are useful resources for users to explore the functional mechanisms of TFs and plant transcriptional regulatory system (24). We will continue working on this project to embrace more useful annotation, and improve the regulation prediction through introducing the conserved elements.
Acknowledgments
We thank Joint Genome Institute for the genome annotation of 13 unpublished species, AGI for six rice species and Leersia perrieri, THGP for Castanea mollissima, BTI for Nicotiana benthamiana and JHU for Juglans regia. We specially thank our users for their invaluable suggestions and comments. The main analyses of this work were performed on the HPC of Center for Life Sciences of Peking University and we thank Dr. Fangjin Chen for his help.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
National Natural Science Foundation of China [1470330]; International Collaboration Program for Proteome Biological Big Data and Standard System [2014DFB30030]; State Key Laboratory of Protein and Plant Gene Research; China Postdoctoral Science Foundation Grant [2014M560017, 2015T80015 to J.J.]; Postdoctoral Fellowship at Peking-Tsinghua Center for Life Sciences (to J.J.); National Program for Support of Top-notch Young Professionals (to G.G.). Funding for open access charge: China Postdoctoral Science Foundation Grant [2015T80015 to J.J.].
Conflict of interest statement. None declared.
REFERENCES
- 1.Riechmann J.L., Heard J., Martin G., Reuber L. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes. Science. 2000;290:2105. doi: 10.1126/science.290.5499.2105. [DOI] [PubMed] [Google Scholar]
- 2.Yilmaz A., Mejia-Guerra M.K., Kurz K., Liang X., Welch L., Grotewold E. AGRIS: the Arabidopsis gene regulatory information server, an update. Nucleic Acids Res. 2011;39:D1118–D1122. doi: 10.1093/nar/gkq1120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Iida K., Seki M., Sakurai T., Satou M., Akiyama K., Toyoda T., Konagaya A., Shinozaki K. RARTF: database and tools for complete sets of Arabidopsis transcription factors. DNA Res. 2005;12:247–256. doi: 10.1093/dnares/dsi011. [DOI] [PubMed] [Google Scholar]
- 4.Pérez-Rodríguez P., Riaño-Pachón D.M., Corrêa L.G.G., Rensing S.A., Kersten B., Mueller-Roeber B. PlnTFDB: updated content and new features of the plant transcription factor database. Nucleic Acids Res. 2010;38:D822–D827. doi: 10.1093/nar/gkp805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Fredslund J. DATFAP: a database of primers and homology alignments for transcription factors from 13 plant species. BMC Genomics. 2008;9:140. doi: 10.1186/1471-2164-9-140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mochida K., Yoshida T., Sakurai T., Yamaguchi-Shinozaki K., Shinozaki K., Tran L.-S.P. TreeTFDB: an integrative database of the transcription factors from six economically important tree crops for functional Predictions and comparative and functional genomics. DNA Res. 2013;20:151–162. doi: 10.1093/dnares/dss040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Yilmaz A., Nishiyama M.Y., Jr, Fuentes B.G., Souza G.M., Janies D., Gray J., Grotewold E. GRASSIUS: a platform for comparative regulatory genomics across the grasses. Plant Physiol. 2009;149:171. doi: 10.1104/pp.108.128579. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mochida K., Yoshida T., Sakurai T., Yamaguchi-Shinozaki K., Shinozaki K., Tran L.S. LegumeTFDB: an integrative database of Glycine max, Lotus japonicus and Medicago truncatula transcription factors. Bioinformatics. 2010;26:290–291. doi: 10.1093/bioinformatics/btp645. [DOI] [PubMed] [Google Scholar]
- 9.Rushton P.J., Bokowiec M.T., Laudeman T.W., Brannock J.F., Chen X., Timko M.P. TOBFAC: the database of tobacco transcription factors. BMC Bioinformatics. 2008;9:53. doi: 10.1186/1471-2105-9-53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Romeuf I., Tessier D., Dardevet M., Branlard G., Charmet G., Ravel C. wDBTF: an integrated database resource for studying wheat transcription factor families. BMC Genomics. 2010;11:185. doi: 10.1186/1471-2164-11-185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Guo A., He K., Liu D., Bai S., Gu X., Wei L., Luo J. DATF: a database of Arabidopsis transcription factors. Bioinformatics. 2005;21:2568. doi: 10.1093/bioinformatics/bti334. [DOI] [PubMed] [Google Scholar]
- 12.Gao G., Zhong Y., Guo A., Zhu Q., Tang W., Zheng W., Gu X., Wei L., Luo J. DRTF: a database of rice transcription factors. Bioinformatics. 2006;22:1286. doi: 10.1093/bioinformatics/btl107. [DOI] [PubMed] [Google Scholar]
- 13.Zhu Q.H., Guo A.Y., Gao G., Zhong Y.F., Xu M., Huang M., Luo J. DPTF: a database of poplar transcription factors. Bioinformatics. 2007;23:1307. doi: 10.1093/bioinformatics/btm113. [DOI] [PubMed] [Google Scholar]
- 14.Guo A.Y., Chen X., Gao G., Zhang H., Zhu Q.H., Liu X.C., Zhong Y.F., Gu X., He K., Luo J. PlantTFDB: a comprehensive plant transcription factor database. Nucleic Acids Res. 2008;36:D966–D969. doi: 10.1093/nar/gkm841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Zhang H., Jin J., Tang L., Zhao Y., Gu X., Gao G., Luo J. PlantTFDB 2.0: update and improvement of the comprehensive plant transcription factor database. Nucleic Acids Res. 2011;39:D1114–D1117. doi: 10.1093/nar/gkq1141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Jin J., Zhang H., Kong L., Gao G., Luo J. PlantTFDB 3.0: a portal for the functional and evolutionary study of plant transcription factors. Nucleic Acids Res. 2014;42:D1182–D1187. doi: 10.1093/nar/gkt1016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Jin J., Guo A., He K., Zhang H., Zhu Q., Chen X., Gao G., Luo J. Classification, prediction and database construction of plant transcription factors. Biotechnol. Bull. 2015;31:68–77. [Google Scholar]
- 18.Jia J., Zhao S., Kong X., Li Y., Zhao G., He W., Appels R., Pfeifer M., Tao Y., Zhang X. Aegilops tauschii draft genome sequence reveals a gene repertoire for wheat adaptation. Nature. 2013;496:91–95. doi: 10.1038/nature12028. [DOI] [PubMed] [Google Scholar]
- 19.Zhang T., Hu Y., Jiang W., Fang L., Guan X., Chen J., Zhang J., Saski C.A., Scheffler B.E., Stelly D.M., et al. Sequencing of allotetraploid cotton (Gossypium hirsutum L. acc. TM-1) provides a resource for fiber improvement. Nat. Biotechnol. 2015;33:531–537. doi: 10.1038/nbt.3207. [DOI] [PubMed] [Google Scholar]
- 20.O'Malley R.C., Huang S.S., Song L., Lewsey M.G., Bartlett A., Nery J.R., Galli M., Gallavotti A., Ecker J.R. Cistrome and epicistrome features shape the regulatory DNA landscape. Cell. 2016;165:1280–1292. doi: 10.1016/j.cell.2016.04.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Weirauch M.T., Yang A., Albu M., Cote A.G., Montenegro-Montero A., Drewe P., Najafabadi H.S., Lambert S.A., Mann I., Cook K., et al. Determination and inference of eukaryotic transcription factor sequence specificity. Cell. 2014;158:1431–1443. doi: 10.1016/j.cell.2014.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Franco-Zorrilla J.M., López-Vidriero I., Carrasco J.L., Godoy M., Vera P., Solano R. DNA-binding specificities of plant transcription factors and their potential to define target genes. Proc. Natl. Acad. Sci. U.S.A. 2014;111:2367–2372. doi: 10.1073/pnas.1316278111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Sullivan A.M., Arsovski A.A., Lempe J., Bubb K.L., Weirauch M.T., Sabo P.J., Sandstrom R., Thurman R.E., Neph S., Reynolds A.P., et al. Mapping and dynamics of regulatory DNA and transcription factor networks in A. thaliana. Cell Rep. 2014;8:2015–2030. doi: 10.1016/j.celrep.2014.08.019. [DOI] [PubMed] [Google Scholar]
- 24.Jin J., He K., Tang X., Li Z., Lv L., Zhao Y., Luo J., Gao G. An Arabidopsis transcriptional regulatory map reveals distinct functional and evolutionary features of novel transcription factors. Mol. Biol. Evol. 2015;32:1767–1773. doi: 10.1093/molbev/msv058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Lamesch P., Berardini T.Z., Li D., Swarbreck D., Wilks C., Sasidharan R., Muller R., Dreher K., Alexander D.L., Garcia-Hernandez M. The Arabidopsis Information Resource (TAIR): improved gene annotation and new tools. Nucleic Acids Res. 2012;40:D1202–D1210. doi: 10.1093/nar/gkr1090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Consortium U. UniProt: a hub for protein information. Nucleic Acids Res. 2015;43:D204–D212. doi: 10.1093/nar/gku989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Brown G.R., Hem V., Katz K.S., Ovetsky M., Wallin C., Ermolaeva O., Tolstoy I., Tatusova T., Pruitt K.D., Maglott D.R., et al. Gene: a gene-centered information resource at NCBI. Nucleic Acids Res. 2015;43:D36–D42. doi: 10.1093/nar/gku1055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Emms D.M., Kelly S. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol. 2015;16:1. doi: 10.1186/s13059-015-0721-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Jones P., Binns D., Chang H.-Y., Fraser M., Li W., McAnulla C., McWilliam H., Maslen J., Mitchell A., Nuka G. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014;30:1236–1240. doi: 10.1093/bioinformatics/btu031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Moyroud E., Minguet E.G., Ott F., Yant L., Pose D., Monniaux M., Blanchet S., Bastien O., Thevenon E., Weigel D., et al. Prediction of regulatory interactions from genome sequences using a biophysical model for the Arabidopsis LEAFY transcription factor. Plant Cell. 2011;23:1293–1306. doi: 10.1105/tpc.111.083329. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Neph S., Vierstra J., Stergachis A.B., Reynolds A.P., Haugen E., Vernot B., Thurman R.E., John S., Sandstrom R., Johnson A.K., et al. An expansive human regulatory lexicon encoded in transcription factor footprints. Nature. 2012;489:83–90. doi: 10.1038/nature11212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Mathelier A., Fornes O., Arenillas D.J., Chen C.Y., Denay G., Lee J., Shi W., Shyr C., Tan G., Worsley-Hunt R., et al. JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 2016;44:D110–D115. doi: 10.1093/nar/gkv1176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Hume M.A., Barrera L.A., Gisselbrecht S.S., Bulyk M.L. UniPROBE, update 2015: new tools and content for the online database of protein-binding microarray data on protein-DNA interactions. Nucleic Acids Res. 2015;43:D117–D122. doi: 10.1093/nar/gku1045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Matys V., Kel-Margoulis O.V., Fricke E., Liebich I., Land S., Barre-Dirrie A., Reuter I., Chekmenev D., Krull M., Hornischer K. TRANSFAC® and its module TRANSCompel®: transcriptional gene regulation in eukaryotes. Nucleic Acids Res. 2006;34:D108–D110. doi: 10.1093/nar/gkj143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Machanick P., Bailey T.L. MEME-ChIP: motif analysis of large DNA datasets. Bioinformatics. 2011;27:1696–1697. doi: 10.1093/bioinformatics/btr189. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Iseli C., Jongeneel C.V., Bucher P. ESTScan: a program for detecting, evaluating, and reconstructing potential coding regions in EST sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol. 1999:138–148. [PubMed] [Google Scholar]
- 37.Grant C.E., Bailey T.L., Noble W.S. FIMO: scanning for occurrences of a given motif. Bioinformatics. 2011;27:1017–1018. doi: 10.1093/bioinformatics/btr064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Alexa A., Rahnenfuhrer J. topGO: Enrichment Analysis for Gene Ontology. 2010. R package version 2.22.0.