Abstract
The FSSP database presents a continuously updated structural classification of three-dimensional protein folds. It is derived using an automatic structure comparison program (Dali) for the all-against-all comparison of over 6000 three-dimensional coordinate sets in the Protein Data Bank (PDB). Sequence-related protein families are covered by a representative set of 813 protein chains. Hierachical clustering based on structural similarities yields a fold tree that defines 253 fold classes. For each representative protein chain, there is a database entry containing structure-structure alignments with its structural neighbours in the PDB. The database is accessible online through World Wide Web browsers and by anonymous ftp (file transfer protocol). The overview of fold space and the individual data sets provide a rich source of information for the study of both divergent and convergent aspects of molecular evolution, and define useful test sets and a standard of truth for assessing the correctness of sequence-sequence or sequence-structure alignments.
Full Text
The Full Text of this article is available as a PDF (97.0 KB).
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Bairoch A., Boeckmann B. The SWISS-PROT protein sequence data bank. Nucleic Acids Res. 1992 May 11;20 (Suppl):2019–2022. doi: 10.1093/nar/20.suppl.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bairoch A. PROSITE: a dictionary of sites and patterns in proteins. Nucleic Acids Res. 1992 May 11;20 (Suppl):2013–2018. doi: 10.1093/nar/20.suppl.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bernstein F. C., Koetzle T. F., Williams G. J., Meyer E. F., Jr, Brice M. D., Rodgers J. R., Kennard O., Shimanouchi T., Tasumi M. The Protein Data Bank: a computer-based archival file for macromolecular structures. J Mol Biol. 1977 May 25;112(3):535–542. doi: 10.1016/s0022-2836(77)80200-3. [DOI] [PubMed] [Google Scholar]
- Gibrat J. F., Madej T., Bryant S. H. Surprising similarities in structure comparison. Curr Opin Struct Biol. 1996 Jun;6(3):377–385. doi: 10.1016/s0959-440x(96)80058-3. [DOI] [PubMed] [Google Scholar]
- Hobohm U., Scharf M., Schneider R., Sander C. Selection of representative protein data sets. Protein Sci. 1992 Mar;1(3):409–417. doi: 10.1002/pro.5560010313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Holm L., Sander C. Mapping the protein universe. Science. 1996 Aug 2;273(5275):595–603. doi: 10.1126/science.273.5275.595. [DOI] [PubMed] [Google Scholar]
- Holm L., Sander C. Protein structure comparison by alignment of distance matrices. J Mol Biol. 1993 Sep 5;233(1):123–138. doi: 10.1006/jmbi.1993.1489. [DOI] [PubMed] [Google Scholar]
- Holm L., Sander C. The FSSP database: fold classification based on structure-structure alignment of proteins. Nucleic Acids Res. 1996 Jan 1;24(1):206–209. doi: 10.1093/nar/24.1.206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Islam S. A., Luo J., Sternberg M. J. Identification and analysis of domains in proteins. Protein Eng. 1995 Jun;8(6):513–525. doi: 10.1093/protein/8.6.513. [DOI] [PubMed] [Google Scholar]
- Murzin A. G., Brenner S. E., Hubbard T., Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol. 1995 Apr 7;247(4):536–540. doi: 10.1006/jmbi.1995.0159. [DOI] [PubMed] [Google Scholar]
- Orengo C. A., Flores T. P., Taylor W. R., Thornton J. M. Identification and classification of protein fold families. Protein Eng. 1993 Jul;6(5):485–500. doi: 10.1093/protein/6.5.485. [DOI] [PubMed] [Google Scholar]
- Sander C., Schneider R. Database of homology-derived protein structures and the structural meaning of sequence alignment. Proteins. 1991;9(1):56–68. doi: 10.1002/prot.340090107. [DOI] [PubMed] [Google Scholar]
- Sayle R. A., Milner-White E. J. RASMOL: biomolecular graphics for all. Trends Biochem Sci. 1995 Sep;20(9):374–374. doi: 10.1016/s0968-0004(00)89080-5. [DOI] [PubMed] [Google Scholar]
- Scharf M., Schneider R., Casari G., Bork P., Valencia A., Ouzounis C., Sander C. GeneQuiz: a workbench for sequence analysis. Proc Int Conf Intell Syst Mol Biol. 1994;2:348–353. [PubMed] [Google Scholar]
- Suzuki M., Sugimoto H., Nakagawa A., Tanaka I., Nishihira J., Sakai M. Crystal structure of the macrophage migration inhibitory factor from rat liver. Nat Struct Biol. 1996 Mar;3(3):259–266. doi: 10.1038/nsb0396-259. [DOI] [PubMed] [Google Scholar]