Abstract
The Blocks Database World Wide Web (http://www.blocks.fhcrc.org) and Email (blocks@blocks.fhcrc.org) servers provide tools for detecting and analyzing protein homology based on alignment blocks, which represent conserved regions of proteins. During the past year, searching has been augmented by supplementing the Blocks Database with blocks from the Prints Database, for a total of 4754 blocks from 1163 families. Blocks from both databases, as well as blocks constructed from sequences submitted to Block Maker, can be used for blocks-versus-blocks searching of these databases with LAMA and for viewing logos and bootstrap trees. Sensitive searches of up-to-date protein sequence databanks are carried out via direct links to the MAST server, using position-specific scoring matrices, and to the BLAST and PSI-BLAST servers, using consensus-embedded sequence queries. Using the trypsin family to evaluate performance, we illustrate the superiority of blocks-based tools over expert pairwise searching or Hidden Markov Models.
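To make the idea of searching with a position-specific scoring matrix concrete, the following is a minimal Python sketch of how an ungapped block might be turned into a log-odds matrix and slid along a query sequence. It is an illustration only and not the servers' actual procedure: the function names `block_to_pssm` and `best_hit`, the add-one pseudocounts, the uniform background frequencies, the toy block, and the toy query are all assumptions introduced here, and no sequence weighting is applied.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def block_to_pssm(block, pseudocount=1.0):
    """Convert equal-width aligned segments into per-column log-odds scores.

    Assumptions for illustration: add-`pseudocount` smoothing and a uniform
    background frequency of 1/20 for every residue.
    """
    width = len(block[0])
    if any(len(segment) != width for segment in block):
        raise ValueError("all segments in a block must have the same width")
    background = 1.0 / len(AMINO_ACIDS)
    total = len(block) + pseudocount * len(AMINO_ACIDS)
    pssm = []
    for col in range(width):
        counts = Counter(segment[col] for segment in block)
        pssm.append({
            aa: math.log2(((counts[aa] + pseudocount) / total) / background)
            for aa in AMINO_ACIDS
        })
    return pssm


def best_hit(pssm, sequence):
    """Score every ungapped window; return (score, offset) of the best one.

    Residues outside the 20 standard amino acids contribute 0.0.
    Returns None if the sequence is shorter than the matrix.
    """
    width = len(pssm)
    best = None
    for start in range(len(sequence) - width + 1):
        score = sum(
            pssm[i].get(sequence[start + i], 0.0) for i in range(width)
        )
        if best is None or score > best[0]:
            best = (score, start)
    return best


# Hypothetical toy block and query, made up for this example.
toy_block = ["GDSGGP", "GDSGGP", "GDSGGA", "GDSGSP"]
toy_query = "MKTAYGDSGGPLVCQ"

score, offset = best_hit(block_to_pssm(toy_block), toy_query)
print(f"best window starts at offset {offset} with score {score:.2f}")
```

A real search would score a query against every block in the database and assess the statistical significance of the resulting scores, which this sketch does not attempt.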