Abstract
EpoDB is a database designed for the study of gene regulation during differentiation and development of vertebrate red blood cells. In building EpoDB, we have taken the in advance approach to the data integration problem: we have extracted data relevant to red blood cells from GenBank, SWISS-PROT, TRRD (transcriptional regulation data) and GERD (expression levels data) to create a single integrated, highly curated view. Tools have been developed to automate data extraction from online resources, cleanse data of errors, enter information manually from the primary literature, generate a uniform, canonical representation of information and maintain data currency. The database is organized around biological features, e.g., genes, rather than sequences, which are supported by a controlled and consistent vocabulary for gene names and gene family names. Beyond the standard database queries, the functionality of EpoDB includes the ability to extract features and subsequences, display sequences and features graphically using bioWidget viewers and integrated analysis tools. EpoDB may be accessed at: http://cbil.humgen.upenn.edu/epodb/
Full Text
The Full Text of this article is available as a PDF (27.8 KB).
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Bairoch A., Apweiler R. The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1998. Nucleic Acids Res. 1998 Jan 1;26(1):38–42. doi: 10.1093/nar/26.1.38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dong S., Searls D. B. Gene structure prediction by linguistic methods. Genomics. 1994 Oct;23(3):540–551. doi: 10.1006/geno.1994.1541. [DOI] [PubMed] [Google Scholar]
- Overton G. C., Aaronson J. S., Haas J., Adams J. QGB: a system for querying sequence database fields and features. J Comput Biol. 1994 Spring;1(1):3–14. doi: 10.1089/cmb.1994.1.3. [DOI] [PubMed] [Google Scholar]
- Wingender E., Kel A. E., Kel O. V., Karas H., Heinemeyer T., Dietze P., Knüppel R., Romaschenko A. G., Kolchanov N. A. TRANSFAC, TRRD and COMPEL: towards a federated database system on transcriptional regulation. Nucleic Acids Res. 1997 Jan 1;25(1):265–268. doi: 10.1093/nar/25.1.265. [DOI] [PMC free article] [PubMed] [Google Scholar]