Abstract
We have compiled a list of all the inteins (protein splicing elements) whose sequences have been published or were available from on-line sequence databases as of September 18, 1996. Analysis of the 36 available intein sequences refines the previously described intein motifs and reveals the presence of another intein motif, Block H. Furthermore, analysis of the new inteins reshapes our view of the conserved splice junction residues, since three inteins lack the penultimate His seen in prior examples. Comparison of intein sequences suggests that, in general, (i) inteins present in the same location within extein homologs from different organisms are very closely related to each other in paired sequence comparison or phylogenetic analysis, and we suggest that they be considered intein alleles; (ii) multiple inteins present in the same gene are no more similar to each other than to inteins present in different genes; and (iii) inteins are so divergent that phylogenetic trees with statistically significant branches cannot be generated except for intein alleles.