Abstract
From its origin the Protein Information Resource (http://www-nbrf. georgetown.edu/pir/) has supported research on evolution and computational biology by designing and compiling a comprehensive, quality controlled, and well-organized protein sequence database. The database has been produced and updated on a regular schedule since 1984. Since 1988 it has been maintained collaboratively by the PIR-International, an association of data collection centers engaged in international cooperation for the development of this research resource during a period of explosive acquisition of new data. As of June 1997, essentially all sequence entries have been classified into families, allowing the efficient application of methods to propagate and standardize annotation among related sequences. The databases are available through the Internet by the World-Wide Web and FTP, or on CD-ROM and magnetic media.
Full Text
The Full Text of this article is available as a PDF (59.1 KB).
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Barker W. C., George D. G., Mewes H. W., Pfeiffer F., Tsugita A. The PIR-International databases. Nucleic Acids Res. 1993 Jul 1;21(13):3089–3092. doi: 10.1093/nar/21.13.3089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barker W. C., Pfeiffer F., George D. G. Superfamily classification in PIR-International Protein Sequence Database. Methods Enzymol. 1996;266:59–71. doi: 10.1016/s0076-6879(96)66006-6. [DOI] [PubMed] [Google Scholar]
- Biemann K., Scoble H. A. Characterization by tandem mass spectrometry of structural modifications in proteins. Science. 1987 Aug 28;237(4818):992–998. doi: 10.1126/science.3303336. [DOI] [PubMed] [Google Scholar]
- Blattner F. R., Plunkett G., 3rd, Bloch C. A., Perna N. T., Burland V., Riley M., Collado-Vides J., Glasner J. D., Rode C. K., Mayhew G. F. The complete genome sequence of Escherichia coli K-12. Science. 1997 Sep 5;277(5331):1453–1462. doi: 10.1126/science.277.5331.1453. [DOI] [PubMed] [Google Scholar]
- Bult C. J., White O., Olsen G. J., Zhou L., Fleischmann R. D., Sutton G. G., Blake J. A., FitzGerald L. M., Clayton R. A., Gocayne J. D. Complete genome sequence of the methanogenic archaeon, Methanococcus jannaschii. Science. 1996 Aug 23;273(5278):1058–1073. doi: 10.1126/science.273.5278.1058. [DOI] [PubMed] [Google Scholar]
- Dayhoff M. O. The origin and evolution of protein superfamilies. Fed Proc. 1976 Aug;35(10):2132–2138. [PubMed] [Google Scholar]
- Fleischmann R. D., Adams M. D., White O., Clayton R. A., Kirkness E. F., Kerlavage A. R., Bult C. J., Tomb J. F., Dougherty B. A., Merrick J. M. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995 Jul 28;269(5223):496–512. doi: 10.1126/science.7542800. [DOI] [PubMed] [Google Scholar]
- Fraser C. M., Gocayne J. D., White O., Adams M. D., Clayton R. A., Fleischmann R. D., Bult C. J., Kerlavage A. R., Sutton G., Kelley J. M. The minimal gene complement of Mycoplasma genitalium. Science. 1995 Oct 20;270(5235):397–403. doi: 10.1126/science.270.5235.397. [DOI] [PubMed] [Google Scholar]
- George D. G., Barker W. C., Hunt L. T. The protein identification resource (PIR). Nucleic Acids Res. 1986 Jan 10;14(1):11–15. doi: 10.1093/nar/14.1.11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- George D. G., Dodson R. J., Garavelli J. S., Haft D. H., Hunt L. T., Marzec C. R., Orcutt B. C., Sidman K. E., Srinivasarao G. Y., Yeh L. S. The Protein Information Resource (PIR) and the PIR-International Protein Sequence Database. Nucleic Acids Res. 1997 Jan 1;25(1):24–28. doi: 10.1093/nar/25.1.24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goffeau A., Barrell B. G., Bussey H., Davis R. W., Dujon B., Feldmann H., Galibert F., Hoheisel J. D., Jacq C., Johnston M. Life with 6000 genes. Science. 1996 Oct 25;274(5287):546, 563-7. doi: 10.1126/science.274.5287.546. [DOI] [PubMed] [Google Scholar]
- Himmelreich R., Hilbert H., Plagens H., Pirkl E., Li B. C., Herrmann R. Complete sequence analysis of the genome of the bacterium Mycoplasma pneumoniae. Nucleic Acids Res. 1996 Nov 15;24(22):4420–4449. doi: 10.1093/nar/24.22.4420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kaneko T., Sato S., Kotani H., Tanaka A., Asamizu E., Nakamura Y., Miyajima N., Hirosawa M., Sugiura M., Sasamoto S. Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions. DNA Res. 1996 Jun 30;3(3):109–136. doi: 10.1093/dnares/3.3.109. [DOI] [PubMed] [Google Scholar]
- Pattabiraman N., Namboodiri K., Lowrey A., Gaber B. P. NRL-3D: a sequence-structure database derived from the protein data bank (PDB) and searchable within the PIR environment. Protein Seq Data Anal. 1990 Oct;3(5):387–405. [PubMed] [Google Scholar]
- Pearson W. R., Lipman D. J. Improved tools for biological sequence comparison. Proc Natl Acad Sci U S A. 1988 Apr;85(8):2444–2448. doi: 10.1073/pnas.85.8.2444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takao T., Yoshino K., Suzuki N., Shimonishi Y. Analysis of post-translational modifications of proteins by accurate mass measurement in fast atom bombardment mass spectrometry. Biomed Environ Mass Spectrom. 1990 Nov;19(11):705–712. doi: 10.1002/bms.1200191109. [DOI] [PubMed] [Google Scholar]
- Tomb J. F., White O., Kerlavage A. R., Clayton R. A., Sutton G. G., Fleischmann R. D., Ketchum K. A., Klenk H. P., Gill S., Dougherty B. A. The complete genome sequence of the gastric pathogen Helicobacter pylori. Nature. 1997 Aug 7;388(6642):539–547. doi: 10.1038/41483. [DOI] [PubMed] [Google Scholar]
- Yates J. R. Protein structure analysis by mass spectrometry. Methods Enzymol. 1996;271:351–377. doi: 10.1016/s0076-6879(96)71017-0. [DOI] [PubMed] [Google Scholar]