Abstract
As the number of protein molecules with known, high-resolution structures increases, it becomes necessary to organize these structures for rapid retrieval, comparison, and analysis. The Protein Data Bank (PDB) currently contains nearly 5,000 entries and is growing exponentially. Most new structures are similar structurally to ones reported previously and can be grouped into families. As the number of members in each family increases, it becomes possible to summarize, statistically, the commonalities and differences within each family. We reported previously a method for finding the atoms in a family alignment that have low spatial variance and those that have higher spatial variance (i.e., the "core" atoms that have the same relative position in all family members and the "non-core" atoms that do not). The core structures we compute have biological significance and provide an excellent quantitative and visual summary of a multiple structural alignment. In order to extend their utility, we have constructed a library of protein family cores, accessible over the World Wide Web at http:/ /www-smi.stanford.edu/projects/helix/LPFC/. This library is generated automatically with publicly available computer programs requiring only a set of multiple alignments as input. It contains quantitative analysis of the spatial variation of atoms within each protein family, the coordinates of the average core structures derived from the families, and display files (in bitmap and VRML formats). Here, we describe the resource and illustrate its applicability by comparing three multiple alignments of the globin family. These three alignments are found to be similar, but with some significant differences related to the diversity of family members and the specific method used for alignment.
Full Text
The Full Text of this article is available as a PDF (1.9 MB).
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Altman R. B., Hughes C., Gerstein M. B. Methods for displaying macromolecular structural uncertainty: application to the globins. J Mol Graph. 1995 Jun;13(3):142-52, 109-2. doi: 10.1016/0263-7855(95)00002-n. [DOI] [PubMed] [Google Scholar]
- Bairoch A., Boeckmann B. The SWISS-PROT protein sequence data bank. Nucleic Acids Res. 1992 May 11;20 (Suppl):2019–2022. doi: 10.1093/nar/20.suppl.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gerstein M., Altman R. B. Average core structures and variability measures for protein families: application to the immunoglobulins. J Mol Biol. 1995 Aug 4;251(1):161–175. doi: 10.1006/jmbi.1995.0423. [DOI] [PubMed] [Google Scholar]
- Gerstein M., Lesk A. M., Chothia C. Structural mechanisms for domain movements in proteins. Biochemistry. 1994 Jun 7;33(22):6739–6749. doi: 10.1021/bi00188a001. [DOI] [PubMed] [Google Scholar]
- Gerstein M., Schulz G., Chothia C. Domain closure in adenylate kinase. Joints on either side of two helices close like neighboring fingers. J Mol Biol. 1993 Jan 20;229(2):494–501. doi: 10.1006/jmbi.1993.1048. [DOI] [PubMed] [Google Scholar]
- Gerstein M., Sonnhammer E. L., Chothia C. Volume changes in protein evolution. J Mol Biol. 1994 Mar 4;236(4):1067–1078. doi: 10.1016/0022-2836(94)90012-4. [DOI] [PubMed] [Google Scholar]
- Lesk A. M., Chothia C. How different amino acid sequences determine similar protein structures: the structure and evolutionary dynamics of the globins. J Mol Biol. 1980 Jan 25;136(3):225–270. doi: 10.1016/0022-2836(80)90373-3. [DOI] [PubMed] [Google Scholar]
- Murzin A. G., Brenner S. E., Hubbard T., Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol. 1995 Apr 7;247(4):536–540. doi: 10.1006/jmbi.1995.0159. [DOI] [PubMed] [Google Scholar]
- Overington J. P., Zhu Z. Y., Sali A., Johnson M. S., Sowdhamini R., Louie G. V., Blundell T. L. Molecular recognition in protein families: a database of aligned three-dimensional structures of related proteins. Biochem Soc Trans. 1993 Aug;21(3):597–604. doi: 10.1042/bst0210597. [DOI] [PubMed] [Google Scholar]
- Sali A., Overington J. P. Derivation of rules for comparative protein modeling from a database of protein structure alignments. Protein Sci. 1994 Sep;3(9):1582–1596. doi: 10.1002/pro.5560030923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stein P., Chothia C. Serpin tertiary structure transformation. J Mol Biol. 1991 Sep 20;221(2):615–621. doi: 10.1016/0022-2836(91)80076-7. [DOI] [PubMed] [Google Scholar]