Abstract
PRINTS is a database of protein family 'fingerprints' offering a diagnostic resource for newly-determined sequences. By contrast with PROSITE, which uses single consensus expressions to characterise particular families, PRINTS exploits groups of motifs to build characteristic signatures. These signatures offer improved diagnostic reliability by virtue of the mutual context provided by motif neighbours. To date, 800 fingerprints have been constructed and stored in PRINTS. The current version, 17.0, encodes approximately 4500 motifs, covering a range of globular and membrane proteins, modular polypeptides, and so on. The database is accessible via the UCL Bioinformatics World Wide Web (WWW) Server at http://www. biochem.ucl.ac.uk/bsm/dbbrowser/ . We have recently enhanced the usefulness of PRINTS by making available new, intuitive search software. This allows both individual query sequence and bulk data submission, permitting easy analysis of single sequences or complete genomes. Preliminary results indicate that use of the PRINTS system is able to assign additional functions not found by other methods, and hence offers a useful adjunct to current genome analysis protocols.
Full Text
The Full Text of this article is available as a PDF (537.2 KB).
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Altschul S. F., Gish W., Miller W., Myers E. W., Lipman D. J. Basic local alignment search tool. J Mol Biol. 1990 Oct 5;215(3):403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- Attwood T. K., Avison H., Beck M. E., Bewley M., Bleasby A. J., Brewster F., Cooper P., Degtyarenko K., Geddes A. J., Flower D. R. The PRINTS database of protein fingerprints: a novel information resource for computational molecular biology. J Chem Inf Comput Sci. 1997 May-Jun;37(3):417–424. doi: 10.1021/ci960468e. [DOI] [PubMed] [Google Scholar]
- Attwood T. K., Beck M. E., Bleasby A. J., Degtyarenko K., Michie A. D., Parry-Smith D. J. Novel developments with the PRINTS protein fingerprint database. Nucleic Acids Res. 1997 Jan 1;25(1):212–217. doi: 10.1093/nar/25.1.212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Attwood T. K., Beck M. E., Bleasby A. J., Parry-Smith D. J. PRINTS--a database of protein motif fingerprints. Nucleic Acids Res. 1994 Sep;22(17):3590–3596. [PMC free article] [PubMed] [Google Scholar]
- Attwood T. K., Beck M. E. PRINTS--a protein motif fingerprint database. Protein Eng. 1994 Jul;7(7):841–848. doi: 10.1093/protein/7.7.841. [DOI] [PubMed] [Google Scholar]
- Attwood T. K., Findlay J. B. Design of a discriminating fingerprint for G-protein-coupled receptors. Protein Eng. 1993 Feb;6(2):167–176. doi: 10.1093/protein/6.2.167. [DOI] [PubMed] [Google Scholar]
- Attwood T. K., Findlay J. B. Fingerprinting G-protein-coupled receptors. Protein Eng. 1994 Feb;7(2):195–203. doi: 10.1093/protein/7.2.195. [DOI] [PubMed] [Google Scholar]
- Bairoch A., Apweiler R. The SWISS-PROT protein sequence data bank and its supplement TrEMBL in 1998. Nucleic Acids Res. 1998 Jan 1;26(1):38–42. doi: 10.1093/nar/26.1.38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bairoch A., Bucher P., Hofmann K. The PROSITE database, its status in 1997. Nucleic Acids Res. 1997 Jan 1;25(1):217–221. doi: 10.1093/nar/25.1.217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barker W. C., Garavelli J. S., Haft D. H., Hunt L. T., Marzec C. R., Orcutt B. C., Srinivasarao G. Y., Yeh L. S., Ledley R. S., Mewes H. W. The PIR-International Protein Sequence Database. Nucleic Acids Res. 1998 Jan 1;26(1):27–32. doi: 10.1093/nar/26.1.27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bleasby A. J., Akrigg D., Attwood T. K. OWL--a non-redundant composite protein sequence database. Nucleic Acids Res. 1994 Sep;22(17):3574–3577. [PMC free article] [PubMed] [Google Scholar]
- Flower D. R., North A. C., Attwood T. K. Structure and sequence relationships in the lipocalins and related proteins. Protein Sci. 1993 May;2(5):753–761. doi: 10.1002/pro.5560020507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fábián P., Murvai J., Hátsági Z., Vlahovicek K., Hegyi H., Pongor S. The SBASE protein domain library, release 5.0: a collection of annotated protein sequence segments. Nucleic Acids Res. 1997 Jan 1;25(1):240–243. doi: 10.1093/nar/25.1.240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henikoff J. G., Pietrokovski S., Henikoff S. Recent enhancements to the Blocks Database servers. Nucleic Acids Res. 1997 Jan 1;25(1):222–225. doi: 10.1093/nar/25.1.222. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Michie A. D., Jones M. L., Attwood T. K. DbBrowser: integrated access to databases worldwide. Trends Biochem Sci. 1996 May;21(5):191–191. [PubMed] [Google Scholar]
- Murzin A. G., Brenner S. E., Hubbard T., Chothia C. SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol. 1995 Apr 7;247(4):536–540. doi: 10.1006/jmbi.1995.0159. [DOI] [PubMed] [Google Scholar]
- Parry-Smith D. J., Attwood T. K. ADSP--a new package for computational sequence analysis. Comput Appl Biosci. 1992 Oct;8(5):451–459. doi: 10.1093/bioinformatics/8.5.451. [DOI] [PubMed] [Google Scholar]
- Parry-Smith D. J., Attwood T. K. SOMAP: a novel interactive approach to multiple protein sequences alignment. Comput Appl Biosci. 1991 Apr;7(2):233–235. doi: 10.1093/bioinformatics/7.2.233. [DOI] [PubMed] [Google Scholar]
- Pattabiraman N., Namboodiri K., Lowrey A., Gaber B. P. NRL-3D: a sequence-structure database derived from the protein data bank (PDB) and searchable within the PIR environment. Protein Seq Data Anal. 1990 Oct;3(5):387–405. [PubMed] [Google Scholar]
- Perkins D. N., Attwood T. K. XFINGER: a tool for searching and visualising protein fingerprints and patterns. Comput Appl Biosci. 1996 Apr;12(2):89–94. doi: 10.1093/bioinformatics/12.2.89. [DOI] [PubMed] [Google Scholar]
- Sonnhammer E. L., Eddy S. R., Durbin R. Pfam: a comprehensive database of protein domain families based on seed alignments. Proteins. 1997 Jul;28(3):405–420. doi: 10.1002/(sici)1097-0134(199707)28:3<405::aid-prot10>3.0.co;2-l. [DOI] [PubMed] [Google Scholar]