Abstract
SBASE 3.0 is the third release of SBASE, a collection of annotated protein domain sequences. SBASE entries represent various structural, functional, ligand-binding and topogenic segments of proteins as defined by their publishing authors. SBASE can be used for establishing domain homologies using different database-search tools such as FASTA [Lipman and Pearson (1985) Science, 227, 1436-1441], and BLAST3 [Altschul and Lipman (1990) Proc. Natl. Acad. Sci. USA, 87, 5509-5513] which is especially useful in the case of loosely defined domain types for which efficient consensus patterns can not be established. The present release contains 41,749 entries provided with standardized names and cross-referenced to the major protein and nucleic acid databanks as well as to the PROSITE catalogue of protein sequence patterns. The entries are clustered into 2285 groups using the BLAST algorithm for computing similarity measures. SBASE 3.0 is freely available on request to the authors or by anonymous 'ftp' file transfer from < ftp.icgeb.trieste.it >. Individual records can be retrieved with the gopher server at < icgeb.trieste.it > and with a www-server at < http:@www.icgeb.trieste.it >. Automated searching of SBASE by BLAST can be carried out with the electronic mail server < sbase@icgeb.trieste.it >. Another mail server < domain@hubi.abc.hu > assigns SBASE domain homologies on the basis of SWISS-PROT searches. A comparison of pertinent search strategies is presented.
Full text
PDFSelected References
These references are in PubMed. This may not be the complete list of references from this article.
- Altschul S. F., Lipman D. J. Protein database searches for multiple alignments. Proc Natl Acad Sci U S A. 1990 Jul;87(14):5509–5513. doi: 10.1073/pnas.87.14.5509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Appel R. D., Sanchez J. C., Bairoch A., Golaz O., Miu M., Vargas J. R., Hochstrasser D. F. SWISS-2DPAGE: a database of two-dimensional gel electrophoresis images. Electrophoresis. 1993 Nov;14(11):1232–1238. doi: 10.1002/elps.11501401185. [DOI] [PubMed] [Google Scholar]
- Bairoch A., Boeckmann B. The SWISS-PROT protein sequence data bank, recent developments. Nucleic Acids Res. 1993 Jul 1;21(13):3093–3096. doi: 10.1093/nar/21.13.3093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bairoch A. The PROSITE dictionary of sites and patterns in proteins, its current status. Nucleic Acids Res. 1993 Jul 1;21(13):3097–3103. doi: 10.1093/nar/21.13.3097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barker W. C., George D. G., Mewes H. W., Pfeiffer F., Tsugita A. The PIR-International databases. Nucleic Acids Res. 1993 Jul 1;21(13):3089–3092. doi: 10.1093/nar/21.13.3089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barker W. C., Hunt L. T., George D. G. Identifying domains in protein sequences. Protein Seq Data Anal. 1988;1(5):363–373. [PubMed] [Google Scholar]
- Baron M., Norman D. G., Campbell I. D. Protein modules. Trends Biochem Sci. 1991 Jan;16(1):13–17. doi: 10.1016/0968-0004(91)90009-k. [DOI] [PubMed] [Google Scholar]
- Benson D., Lipman D. J., Ostell J. GenBank. Nucleic Acids Res. 1993 Jul 1;21(13):2963–2965. doi: 10.1093/nar/21.13.2963. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bernstein F. C., Koetzle T. F., Williams G. J., Meyer E. F., Jr, Brice M. D., Rodgers J. R., Kennard O., Shimanouchi T., Tasumi M. The Protein Data Bank: a computer-based archival file for macromolecular structures. J Mol Biol. 1977 May 25;112(3):535–542. doi: 10.1016/s0022-2836(77)80200-3. [DOI] [PubMed] [Google Scholar]
- Devereux J., Haeberli P., Smithies O. A comprehensive set of sequence analysis programs for the VAX. Nucleic Acids Res. 1984 Jan 11;12(1 Pt 1):387–395. doi: 10.1093/nar/12.1part1.387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ghosh D. Status of the transcription factors database (TFD). Nucleic Acids Res. 1993 Jul 1;21(13):3117–3118. doi: 10.1093/nar/21.13.3117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hegyi H., Pongor S. Predicting potential domain homologies from FASTA search results. Comput Appl Biosci. 1993 Jun;9(3):371–372. doi: 10.1093/bioinformatics/9.3.371. [DOI] [PubMed] [Google Scholar]
- Henikoff S., Henikoff J. G. Automated assembly of protein blocks for database searching. Nucleic Acids Res. 1991 Dec 11;19(23):6565–6572. doi: 10.1093/nar/19.23.6565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lipman D. J., Pearson W. R. Rapid and sensitive protein similarity searches. Science. 1985 Mar 22;227(4693):1435–1441. doi: 10.1126/science.2983426. [DOI] [PubMed] [Google Scholar]
- Pongor S., Skerl V., Cserzö M., Hátsági Z., Simon G., Bevilacqua V. The SBASE domain library: a collection of annotated protein segments. Protein Eng. 1993 Jun;6(4):391–395. doi: 10.1093/protein/6.4.391. [DOI] [PubMed] [Google Scholar]
- Pongor S., Skerl V., Cserzö M., Hátsági Z., Simon G., Bevilacqua V. The SBASE protein domain library, release 2.0: a collection of annotated protein sequence segments. Nucleic Acids Res. 1993 Jul 1;21(13):3111–3115. doi: 10.1093/nar/21.13.3111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rice C. M., Fuchs R., Higgins D. G., Stoehr P. J., Cameron G. N. The EMBL data library. Nucleic Acids Res. 1993 Jul 1;21(13):2967–2971. doi: 10.1093/nar/21.13.2967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roberts R. J., Macelis D. REBASE--restriction enzymes and methylases. Nucleic Acids Res. 1993 Jul 1;21(13):3125–3137. doi: 10.1093/nar/21.13.3125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith R. F., Smith T. F. Automatic generation of primary sequence patterns from sets of related protein sequences. Proc Natl Acad Sci U S A. 1990 Jan;87(1):118–122. doi: 10.1073/pnas.87.1.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sonnhammer E. L., Kahn D. Modular arrangement of proteins as inferred from analysis of homology. Protein Sci. 1994 Mar;3(3):482–492. doi: 10.1002/pro.5560030314. [DOI] [PMC free article] [PubMed] [Google Scholar]