Abstract
A set of programs was developed for searching nucleic acid and protein sequence data bases for sequences similar to a given sequence. The programs, written in FORTRAN 77, were optimized for vector processing on a Hitachi S810-20 supercomputer. A search of a 500-residue protein sequence against the entire PIR data base Ver. 1.0 (1) (0.5 M residues) takes 45 sec of CPU time. An exhaustive search of a 1500-base nucleotide sequence against all mammalian sequences (1.2 M bases) in GenBank Ver. 29.0 requires about 4 min. A faster version of the programs reduces the CPU time to about a quarter.
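
The abstract does not say which similarity measure the programs compute. As a rough illustration only, the sketch below assumes a Smith-Waterman style local alignment score (one of the methods in the reference list) with a linear gap penalty, and scans a small in-memory data base with it. The function names `local_score` and `search`, the scoring values, and the dictionary layout are hypothetical. It is plain Python and does not reproduce the vectorized FORTRAN 77 programs described above.

```python
def local_score(query, target, match=2, mismatch=-1, gap=-2):
    """Best local alignment score between two sequences.

    Assumed scoring scheme (illustrative, not from the paper):
    +2 for a match, -1 for a mismatch, -2 per gap position.
    """
    prev = [0] * (len(target) + 1)  # previous row of the score matrix
    best = 0
    for q in query:
        curr = [0]  # column 0 is always 0 for local alignment
        for j, t in enumerate(target, start=1):
            diag = prev[j - 1] + (match if q == t else mismatch)
            # Scores never drop below 0, so an alignment can start anywhere.
            score = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(score)
            if score > best:
                best = score
        prev = curr
    return best


def search(query, database, top=3):
    """Score the query against every entry and return the best hits.

    `database` is a hypothetical dict mapping entry names to sequences.
    """
    hits = [(local_score(query, seq), name) for name, seq in database.items()]
    hits.sort(key=lambda h: (-h[0], h[1]))  # highest score first, then name
    return hits[:top]


if __name__ == "__main__":
    toy_db = {"entry_a": "TTACGTTT", "entry_b": "GGGG"}
    for score, name in search("ACGT", toy_db):
        print(name, score)
```

If every query residue were compared with every data base residue in this way, the protein search quoted above would involve about 500 × 0.5 M = 2.5 × 10^8 cell updates, which gives a sense of why vector processing is attractive for this workload.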

Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Fitch W. M., Smith T. F. Optimal sequence alignments. Proc Natl Acad Sci U S A. 1983 Mar;80(5):1382–1386. doi: 10.1073/pnas.80.5.1382.
- Gotoh O. An improved algorithm for matching biological sequences. J Mol Biol. 1982 Dec 15;162(3):705–708. doi: 10.1016/0022-2836(82)90398-9.
- Naharro G., Robbins K. C., Reddy E. P. Gene product of v-fgr onc: hybrid protein containing a portion of actin and a tyrosine-specific protein kinase. Science. 1984 Jan 6;223(4631):63–66. doi: 10.1126/science.6318314.
- Sanchez-Pescador R., Power M. D., Barr P. J., Steimer K. S., Stempien M. M., Brown-Shimer S. L., Gee W. W., Renard A., Randolph A., Levy J. A. Nucleotide sequence and expression of an AIDS-associated retrovirus (ARV-2). Science. 1985 Feb 1;227(4686):484–492. doi: 10.1126/science.2578227.
- Smith T. F., Waterman M. S. Identification of common molecular subsequences. J Mol Biol. 1981 Mar 25;147(1):195–197. doi: 10.1016/0022-2836(81)90087-5.
- Waterfield M. D., Scrace G. T., Whittle N., Stroobant P., Johnsson A., Wasteson A., Westermark B., Heldin C. H., Huang J. S., Deuel T. F. Platelet-derived growth factor is structurally related to the putative transforming protein p28sis of simian sarcoma virus. Nature. 1983 Jul 7;304(5921):35–39. doi: 10.1038/304035a0.
- Wilbur W. J., Lipman D. J. Rapid similarity searches of nucleic acid and protein data banks. Proc Natl Acad Sci U S A. 1983 Feb;80(3):726–730. doi: 10.1073/pnas.80.3.726.
