Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 1990 Jul;87(14):5509–5513. doi: 10.1073/pnas.87.14.5509

Protein database searches for multiple alignments.

S F Altschul 1, D J Lipman 1
PMCID: PMC54354  PMID: 2196570

Abstract

Protein database searches frequently can reveal biologically significant sequence relationships useful in understanding structure and function. Weak but meaningful sequence patterns can be obscured, however, by other similarities due only to chance. By searching a database for multiple as opposed to pairwise alignments, distant relationships are much more easily distinguished from background noise. Recent statistical results permit the power of this approach to be analyzed. Given a typical query sequence, an algorithm described here permits the current protein database to be searched for three-sequence alignments in less than 4 min. Such searches have revealed a variety of subtle relationships that pairwise search methods would be unable to detect.

Full text

PDF
5509

Selected References

These references are in PubMed. This may not be the complete list of references from this article.

  1. Abeles A. L., Friedman S. A., Austin S. J. Partition of unit-copy miniplasmids to daughter cells. III. The DNA sequence and functional organization of the P1 partition region. J Mol Biol. 1985 Sep 20;185(2):261–272. doi: 10.1016/0022-2836(85)90402-4. [DOI] [PubMed] [Google Scholar]
  2. Gribskov M., McLachlan A. D., Eisenberg D. Profile analysis: detection of distantly related proteins. Proc Natl Acad Sci U S A. 1987 Jul;84(13):4355–4358. doi: 10.1073/pnas.84.13.4355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Ishioka N., Takahashi N., Putnam F. W. Amino acid sequence of human plasma alpha 1B-glycoprotein: homology to the immunoglobulin supergene family. Proc Natl Acad Sci U S A. 1986 Apr;83(8):2363–2367. doi: 10.1073/pnas.83.8.2363. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Karlin S., Altschul S. F. Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc Natl Acad Sci U S A. 1990 Mar;87(6):2264–2268. doi: 10.1073/pnas.87.6.2264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Kelley R. L., Yanofsky C. Mutational studies with the trp repressor of Escherichia coli support the helix-turn-helix model of repressor recognition of operator DNA. Proc Natl Acad Sci U S A. 1985 Jan;82(2):483–487. doi: 10.1073/pnas.82.2.483. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Lipman D. J., Pearson W. R. Rapid and sensitive protein similarity searches. Science. 1985 Mar 22;227(4693):1435–1441. doi: 10.1126/science.2983426. [DOI] [PubMed] [Google Scholar]
  7. Miller J. L., Anderson S. K., Fujita D. J., Chaconas G., Baldwin D. L., Harshey R. M. The nucleotide sequence of the B gene of bacteriophage Mu. Nucleic Acids Res. 1984 Nov 26;12(22):8627–8638. doi: 10.1093/nar/12.22.8627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Needleman S. B., Wunsch C. D. A general method applicable to the search for similarities in the amino acid sequence of two proteins. J Mol Biol. 1970 Mar;48(3):443–453. doi: 10.1016/0022-2836(70)90057-4. [DOI] [PubMed] [Google Scholar]
  9. Patthy L. Detecting homology of distantly related proteins with consensus sequences. J Mol Biol. 1987 Dec 20;198(4):567–577. doi: 10.1016/0022-2836(87)90200-2. [DOI] [PubMed] [Google Scholar]
  10. Power M. D., Marx P. A., Bryant M. L., Gardner M. B., Barr P. J., Luciw P. A. Nucleotide sequence of SRV-1, a type D simian acquired immune deficiency syndrome retrovirus. Science. 1986 Mar 28;231(4745):1567–1572. doi: 10.1126/science.3006247. [DOI] [PubMed] [Google Scholar]
  11. Smith R. F., Smith T. F. Automatic generation of primary sequence patterns from sets of related protein sequences. Proc Natl Acad Sci U S A. 1990 Jan;87(1):118–122. doi: 10.1073/pnas.87.1.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Steitz T. A., Ohlendorf D. H., McKay D. B., Anderson W. F., Matthews B. W. Structural similarity in the DNA-binding domains of catabolite gene activator and cro repressor proteins. Proc Natl Acad Sci U S A. 1982 May;79(10):3097–3100. doi: 10.1073/pnas.79.10.3097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Stokes H. W., Hall B. G. Sequence of the ebgR gene of Escherichia coli: evidence that the EBG and LAC operons are descended from a common ancestor. Mol Biol Evol. 1985 Nov;2(6):478–483. doi: 10.1093/oxfordjournals.molbev.a040373. [DOI] [PubMed] [Google Scholar]
  14. Taylor W. R. Identification of protein sequence homology by consensus template alignment. J Mol Biol. 1986 Mar 20;188(2):233–258. doi: 10.1016/0022-2836(86)90308-6. [DOI] [PubMed] [Google Scholar]
  15. Toh H., Kikuno R., Hayashida H., Miyata T., Kugimiya W., Inouye S., Yuki S., Saigo K. Close structural resemblance between putative polymerase of a Drosophila transposable genetic element 17.6 and pol gene product of Moloney murine leukaemia virus. EMBO J. 1985 May;4(5):1267–1272. doi: 10.1002/j.1460-2075.1985.tb03771.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Valentin-Hansen P., Larsen J. E., Højrup P., Short S. A., Barbier C. S. Nucleotide sequence of the CytR regulatory gene of E. coli K-12. Nucleic Acids Res. 1986 Mar 11;14(5):2215–2228. doi: 10.1093/nar/14.5.2215. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Williams A. F., Barclay A. N. The immunoglobulin superfamily--domains for cell surface recognition. Annu Rev Immunol. 1988;6:381–405. doi: 10.1146/annurev.iy.06.040188.002121. [DOI] [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES