Skip to main content
Oxford University Press - PMC COVID-19 Collection logoLink to Oxford University Press - PMC COVID-19 Collection
. 1992 Sep;5(6):479–488. doi: 10.1093/protein/5.6.479

Construction of a dictionary of sequence motifs that characterize groups of related proteins

Atsushi Ogiwara 1, Ikuo Uchiyama 1, Yasuhiko Seto 1, Minoru Kanehisa 1,2
PMCID: PMC7528547  PMID: 1438158

Abstract

An automatic procedure is proposed to identify, from the protein sequence database, conserved amino acid patterns (or sequence motifs) that are exclusive to a group of functionally related proteins. This procedure is applied to the PIR database and a dictionary of sequence motifs that relate to specific superfamilies constructed. The motifs have a practical relevance in identifying the membership of specific superfamilies without the need to perform sequence database searches in 20% of newly determined sequences. The sequence motifs identified represent functionally important sites on protein molecules. When multiple blocks exist in a single motif they are often close together in the 3-D structure. Furthermore, occasionally these motif blocks were found to be split by introns when the correlation with exon structures was examined.

Keywords: consensus patterns, genome analysis, homology search, multiple sequence alignment, prediction of protein function


Articles from Protein Engineering are provided here courtesy of Oxford University Press

RESOURCES