Abstract
HSSP is a derived database merging structural (3-D) and sequence (1-D) information. For each protein of known 3-D structure from the Protein Data Bank (PDB), the database has a multiple sequence alignment of all available homologues and a sequence profile characteristic of the family. The list of homologues is the result of a database search in SwissProt using a position-weighted dynamic programming method for sequence profile alignment (MaxHom). The database is updated frequently. The listed homologues are very likely to have the same 3-D structure as the PDB protein to which they have been aligned. As a result, the database is not only a database of aligned sequence families, but also a database of implied secondary and tertiary structures covering 29% of all SwissProt-stored sequences.
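The two ideas named above, a family sequence profile and a dynamic programming alignment against it, can be illustrated with a minimal sketch. This is not the MaxHom implementation: the function names `column_profile` and `local_profile_score`, the frequency-based match score, and the `mismatch` and `gap` values are all assumptions chosen for illustration, and a realistic method would use a substitution matrix and position-dependent gap penalties.

```python
from collections import Counter


def column_profile(alignment):
    """Return one dict per alignment column mapping residue -> relative frequency.

    `alignment` is a list of equal-length aligned sequences; the gap
    characters '-' and '.' are ignored when counting.
    """
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("all aligned sequences must have the same length")
    profile = []
    for col in range(length):
        counts = Counter(row[col] for row in alignment if row[col] not in "-.")
        total = sum(counts.values())
        profile.append({aa: n / total for aa, n in counts.items()} if total else {})
    return profile


def local_profile_score(profile, sequence, mismatch=-1.0, gap=-2.0):
    """Best local alignment score of `sequence` against `profile`.

    Smith-Waterman style dynamic programming with a linear gap penalty.
    The match score at a profile position is the observed frequency of the
    residue there (an illustrative choice, not the published scoring scheme);
    residues never seen at that position score `mismatch`.
    """
    rows, cols = len(profile), len(sequence)
    score = [[0.0] * (cols + 1) for _ in range(rows + 1)]
    best = 0.0
    for i in range(1, rows + 1):
        for j in range(1, cols + 1):
            freq = profile[i - 1].get(sequence[j - 1], 0.0)
            match = freq if freq > 0 else mismatch
            score[i][j] = max(
                0.0,                          # start a new local alignment
                score[i - 1][j - 1] + match,  # align residue to profile column
                score[i - 1][j] + gap,        # skip a profile column
                score[i][j - 1] + gap,        # skip a sequence residue
            )
            best = max(best, score[i][j])
    return best


# Hypothetical toy family of three aligned sequences.
family = ["ACD", "ACE", "A-D"]
profile = column_profile(family)
print(local_profile_score(profile, "ACD"))
```

Because each profile column carries its own residue frequencies, the same residue can score differently at different positions, which is the sense in which such a search is position-weighted.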