Abstract
Little knowledge exists about branch points in plants; it has even been claimed that plant introns lack conserved branch point sequences similar to those found in vertebrate introns. A putative branch point consensus sequence for Arabidopsis thaliana resembling the well known metazoan consensus sequence has been proposed, but this is based on search of sequences similar to those in yeast and metazoa. Here we present a novel consensus sequence found by a non-circular approach. A hidden Markov model with a fixed A nucleotide was trained on sequences upstream of the acceptor site. The consensus found by the Markov model shares features with the metazoan consensus, but differs in its details from the consensus proposed earlier. Despite the fact that branch point consensus sequences in plants are weak, we show that a prediction scheme incorporating them leads to a substantial improvement in the recognition of true acceptor sites; the false positive rate being reduced by a factor of 2. We take this as an indication that the consensus found here is the genuine one and that the branch point does play a role in the proper recognition of the acceptor site in plants.
Full Text
The Full Text of this article is available as a PDF (90.4 KB).
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Abovich N., Liao X. C., Rosbash M. The yeast MUD2 protein: an interaction with PRP11 defines a bridge between commitment complexes and U2 snRNP addition. Genes Dev. 1994 Apr 1;8(7):843–854. doi: 10.1101/gad.8.7.843. [DOI] [PubMed] [Google Scholar]
- Brown J. W. A catalogue of splice junction and putative branch point sequences from plant introns. Nucleic Acids Res. 1986 Dec 22;14(24):9549–9559. doi: 10.1093/nar/14.24.9549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown J. W., Smith P., Simpson C. G. Arabidopsis consensus intron sequences. Plant Mol Biol. 1996 Nov;32(3):531–535. doi: 10.1007/BF00019105. [DOI] [PubMed] [Google Scholar]
- Gozani O., Feld R., Reed R. Evidence that sequence-independent binding of highly conserved U2 snRNP proteins upstream of the branch site is required for assembly of spliceosomal complex A. Genes Dev. 1996 Jan 15;10(2):233–243. doi: 10.1101/gad.10.2.233. [DOI] [PubMed] [Google Scholar]
- Green M. R. Biochemical mechanisms of constitutive and regulated pre-mRNA splicing. Annu Rev Cell Biol. 1991;7:559–599. doi: 10.1146/annurev.cb.07.110191.003015. [DOI] [PubMed] [Google Scholar]
- Hebsgaard S. M., Korning P. G., Tolstrup N., Engelbrecht J., Rouzé P., Brunak S. Splice site prediction in Arabidopsis thaliana pre-mRNA by combining local and global sequence information. Nucleic Acids Res. 1996 Sep 1;24(17):3439–3452. doi: 10.1093/nar/24.17.3439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoffman B. E., Grabowski P. J. U1 snRNP targets an essential splicing factor, U2AF65, to the 3' splice site by a network of interactions spanning the exon. Genes Dev. 1992 Dec;6(12B):2554–2568. doi: 10.1101/gad.6.12b.2554. [DOI] [PubMed] [Google Scholar]
- Hughey R., Krogh A. Hidden Markov models for sequence analysis: extension and analysis of the basic method. Comput Appl Biosci. 1996 Apr;12(2):95–107. doi: 10.1093/bioinformatics/12.2.95. [DOI] [PubMed] [Google Scholar]
- Korning P. G., Hebsgaard S. M., Rouze P., Brunak S. Cleaning the GenBank Arabidopsis thaliana data set. Nucleic Acids Res. 1996 Jan 15;24(2):316–320. doi: 10.1093/nar/24.2.316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Krogh A., Brown M., Mian I. S., Sjölander K., Haussler D. Hidden Markov models in computational biology. Applications to protein modeling. J Mol Biol. 1994 Feb 4;235(5):1501–1531. doi: 10.1006/jmbi.1994.1104. [DOI] [PubMed] [Google Scholar]
- Lamond A. I. The spliceosome. Bioessays. 1993 Sep;15(9):595–603. doi: 10.1002/bies.950150905. [DOI] [PubMed] [Google Scholar]
- Liu H. X., Filipowicz W. Mapping of branchpoint nucleotides in mutant pre-mRNAs expressed in plant cells. Plant J. 1996 Mar;9(3):381–389. doi: 10.1046/j.1365-313x.1996.09030381.x. [DOI] [PubMed] [Google Scholar]
- Matthews B. W. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta. 1975 Oct 20;405(2):442–451. doi: 10.1016/0005-2795(75)90109-9. [DOI] [PubMed] [Google Scholar]
- Michaud S., Reed R. A functional association between the 5' and 3' splice site is established in the earliest prespliceosome complex (E) in mammals. Genes Dev. 1993 Jun;7(6):1008–1020. doi: 10.1101/gad.7.6.1008. [DOI] [PubMed] [Google Scholar]
- Penotti F. E. Human DNA TATA boxes and transcription initiation sites. A statistical study. J Mol Biol. 1990 May 5;213(1):37–52. doi: 10.1016/S0022-2836(05)80120-2. [DOI] [PubMed] [Google Scholar]
- Query C. C., Moore M. J., Sharp P. A. Branch nucleophile selection in pre-mRNA splicing: evidence for the bulged duplex model. Genes Dev. 1994 Mar 1;8(5):587–597. doi: 10.1101/gad.8.5.587. [DOI] [PubMed] [Google Scholar]
- Schneider T. D., Stephens R. M. Sequence logos: a new way to display consensus sequences. Nucleic Acids Res. 1990 Oct 25;18(20):6097–6100. doi: 10.1093/nar/18.20.6097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schneider T. D., Stormo G. D., Gold L., Ehrenfeucht A. Information content of binding sites on nucleotide sequences. J Mol Biol. 1986 Apr 5;188(3):415–431. doi: 10.1016/0022-2836(86)90165-8. [DOI] [PubMed] [Google Scholar]
- Simpson C. G., Clark G., Davidson D., Smith P., Brown J. W. Mutation of putative branchpoint consensus sequences in plant introns reduces splicing efficiency. Plant J. 1996 Mar;9(3):369–380. doi: 10.1046/j.1365-313x.1996.09030369.x. [DOI] [PubMed] [Google Scholar]
- Smith C. W., Chu T. T., Nadal-Ginard B. Scanning and competition between AGs are involved in 3' splice site selection in mammalian introns. Mol Cell Biol. 1993 Aug;13(8):4939–4952. doi: 10.1128/mcb.13.8.4939. [DOI] [PMC free article] [PubMed] [Google Scholar]
