Abstract
A database of 210 Schizosaccharomyces pombe DNA sequences (524,794 bp) was extracted from GenBank (release number 81.0) and examined by a number of methods in order to characterize statistical features of these sequences that might serve as signals or constraints for messenger RNA splicing. The statistical information compiled includes splicing signal (donor, acceptor and branch site) profiles, translational initiation start profile, exon/intron length distributions, ORF distribution, CDS size distribution, codon usage table, and 6-tuple distribution. The information content of the various signals are also presented. A rule-based interactive computer program for finding introns called INTRON.PLOT has been developed and was used to successfully analyze 7 newly sequenced genes.
Full text
PDFImages in this article
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Bennetzen J. L., Hall B. D. Codon selection in yeast. J Biol Chem. 1982 Mar 25;257(6):3026–3031. [PubMed] [Google Scholar]
- Berget S. M., Moore C., Sharp P. A. Spliced segments at the 5' terminus of adenovirus 2 late mRNA. Proc Natl Acad Sci U S A. 1977 Aug;74(8):3171–3175. doi: 10.1073/pnas.74.8.3171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brennwald P., Porter G., Wise J. A. U2 small nuclear RNA is remarkably conserved between Schizosaccharomyces pombe and mammals. Mol Cell Biol. 1988 Dec;8(12):5575–5580. doi: 10.1128/mcb.8.12.5575. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown J. D., Plumpton M., Beggs J. D. The genetics of nuclear pre-mRNA splicing: a complex story. Antonie Van Leeuwenhoek. 1992 Aug;62(1-2):35–46. doi: 10.1007/BF00584461. [DOI] [PubMed] [Google Scholar]
- Brown J. W., Feix G., Frendewey D. Accurate in vitro splicing of two pre-mRNA plant introns in a HeLa cell nuclear extract. EMBO J. 1986 Nov;5(11):2749–2758. doi: 10.1002/j.1460-2075.1986.tb04563.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cigan A. M., Donahue T. F. Sequence and structural features associated with translational initiator regions in yeast--a review. Gene. 1987;59(1):1–18. doi: 10.1016/0378-1119(87)90261-7. [DOI] [PubMed] [Google Scholar]
- Claverie J. M., Sauvaget I., Bougueleret L. K-tuple frequency analysis: from intron/exon discrimination to T-cell epitope mapping. Methods Enzymol. 1990;183:237–252. doi: 10.1016/0076-6879(90)83017-4. [DOI] [PubMed] [Google Scholar]
- Fickett J. W., Tung C. S. Assessment of protein coding measures. Nucleic Acids Res. 1992 Dec 25;20(24):6441–6450. doi: 10.1093/nar/20.24.6441. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fields C. A., Soderlund C. A. gm: a practical tool for automating DNA sequence analysis. Comput Appl Biosci. 1990 Jul;6(3):263–270. doi: 10.1093/bioinformatics/6.3.263. [DOI] [PubMed] [Google Scholar]
- Fields C. Information content of Caenorhabditis elegans splice site sequences varies with intron length. Nucleic Acids Res. 1990 Mar 25;18(6):1509–1512. doi: 10.1093/nar/18.6.1509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galas D. J., Eggert M., Waterman M. S. Rigorous pattern-recognition methods for DNA sequences. Analysis of promoter sequences from Escherichia coli. J Mol Biol. 1985 Nov 5;186(1):117–128. doi: 10.1016/0022-2836(85)90262-1. [DOI] [PubMed] [Google Scholar]
- Gatermann K. B., Hoffmann A., Rosenberg G. H., Käufer N. F. Introduction of functional artificial introns into the naturally intronless ura4 gene of Schizosaccharomyces pombe. Mol Cell Biol. 1989 Apr;9(4):1526–1535. doi: 10.1128/mcb.9.4.1526. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Green M. R. Pre-mRNA splicing. Annu Rev Genet. 1986;20:671–708. doi: 10.1146/annurev.ge.20.120186.003323. [DOI] [PubMed] [Google Scholar]
- Guigó R., Knudsen S., Drake N., Smith T. Prediction of gene structure. J Mol Biol. 1992 Jul 5;226(1):141–157. doi: 10.1016/0022-2836(92)90130-c. [DOI] [PubMed] [Google Scholar]
- Hawkins J. D. A survey on intron and exon lengths. Nucleic Acids Res. 1988 Nov 11;16(21):9893–9908. doi: 10.1093/nar/16.21.9893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hutchinson G. B., Hayden M. R. The prediction of exons through an analysis of spliceable open reading frames. Nucleic Acids Res. 1992 Jul 11;20(13):3453–3462. doi: 10.1093/nar/20.13.3453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Iida Y. Quantification analysis of 5'-splice signal sequences in mRNA precursors. Mutations in 5'-splice signal sequence of human beta-globin gene and beta-thalassemia. J Theor Biol. 1990 Aug 23;145(4):523–533. doi: 10.1016/s0022-5193(05)80486-2. [DOI] [PubMed] [Google Scholar]
- Jacob M., Gallinaro H. The 5' splice site: phylogenetic evolution and variable geometry of association with U1RNA. Nucleic Acids Res. 1989 Mar 25;17(6):2159–2180. doi: 10.1093/nar/17.6.2159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kozak M. Point mutations define a sequence flanking the AUG initiator codon that modulates translation by eukaryotic ribosomes. Cell. 1986 Jan 31;44(2):283–292. doi: 10.1016/0092-8674(86)90762-2. [DOI] [PubMed] [Google Scholar]
- Kozak M. Structural features in eukaryotic mRNAs that modulate the initiation of translation. J Biol Chem. 1991 Oct 25;266(30):19867–19870. [PubMed] [Google Scholar]
- Mizukami T., Chang W. I., Garkavtsev I., Kaplan N., Lombardi D., Matsumoto T., Niwa O., Kounosu A., Yanagida M., Marr T. G. A 13 kb resolution cosmid map of the 14 Mb fission yeast genome by nonrandom sequence-tagged site mapping. Cell. 1993 Apr 9;73(1):121–132. doi: 10.1016/0092-8674(93)90165-m. [DOI] [PubMed] [Google Scholar]
- Moore M. J., Sharp P. A. Evidence for two active sites in the spliceosome provided by stereochemistry of pre-mRNA splicing. Nature. 1993 Sep 23;365(6444):364–368. doi: 10.1038/365364a0. [DOI] [PubMed] [Google Scholar]
- Mount S. M. A catalogue of splice junction sequences. Nucleic Acids Res. 1982 Jan 22;10(2):459–472. doi: 10.1093/nar/10.2.459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mount S. M., Burks C., Hertz G., Stormo G. D., White O., Fields C. Splicing signals in Drosophila: intron size, information content, and consensus sequences. Nucleic Acids Res. 1992 Aug 25;20(16):4255–4262. doi: 10.1093/nar/20.16.4255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mount S. M., Pettersson I., Hinterberger M., Karmas A., Steitz J. A. The U1 small nuclear RNA-protein complex selectively binds a 5' splice site in vitro. Cell. 1983 Jun;33(2):509–518. doi: 10.1016/0092-8674(83)90432-4. [DOI] [PubMed] [Google Scholar]
- Nakata K., Kanehisa M., DeLisi C. Prediction of splice junctions in mRNA sequences. Nucleic Acids Res. 1985 Jul 25;13(14):5327–5340. doi: 10.1093/nar/13.14.5327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Newman A. J., Norman C. U5 snRNA interacts with exon sequences at 5' and 3' splice sites. Cell. 1992 Feb 21;68(4):743–754. doi: 10.1016/0092-8674(92)90149-7. [DOI] [PubMed] [Google Scholar]
- Osawa S., Jukes T. H., Watanabe K., Muto A. Recent evidence for evolution of the genetic code. Microbiol Rev. 1992 Mar;56(1):229–264. doi: 10.1128/mr.56.1.229-264.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Padgett R. A., Grabowski P. J., Konarska M. M., Seiler S., Sharp P. A. Splicing of messenger RNA precursors. Annu Rev Biochem. 1986;55:1119–1150. doi: 10.1146/annurev.bi.55.070186.005351. [DOI] [PubMed] [Google Scholar]
- Porter G., Brennwald P., Wise J. A. U1 small nuclear RNA from Schizosaccharomyces pombe has unique and conserved features and is encoded by an essential single-copy gene. Mol Cell Biol. 1990 Jun;10(6):2874–2881. doi: 10.1128/mcb.10.6.2874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Prabhala G., Rosenberg G. H., Käufer N. F. Architectural features of pre-mRNA introns in the fission yeast Schizosaccharomyces pombe. Yeast. 1992 Mar;8(3):171–182. doi: 10.1002/yea.320080303. [DOI] [PubMed] [Google Scholar]
- Reich C. I., VanHoy R. W., Porter G. L., Wise J. A. Mutations at the 3' splice site can be suppressed by compensatory base changes in U1 snRNA in fission yeast. Cell. 1992 Jun 26;69(7):1159–1169. doi: 10.1016/0092-8674(92)90637-r. [DOI] [PubMed] [Google Scholar]
- Russell P., Nurse P. Schizosaccharomyces pombe and Saccharomyces cerevisiae: a look at yeasts divided. Cell. 1986 Jun 20;45(6):781–782. doi: 10.1016/0092-8674(86)90550-7. [DOI] [PubMed] [Google Scholar]
- Sakuraba H., Eng C. M., Desnick R. J., Bishop D. F. Invariant exon skipping in the human alpha-galactosidase A pre-mRNA: Ag+1 to t substitution in a 5'-splice site causing Fabry disease. Genomics. 1992 Apr;12(4):643–650. doi: 10.1016/0888-7543(92)90288-4. [DOI] [PubMed] [Google Scholar]
- Schneider T. D., Stormo G. D., Gold L., Ehrenfeucht A. Information content of binding sites on nucleotide sequences. J Mol Biol. 1986 Apr 5;188(3):415–431. doi: 10.1016/0022-2836(86)90165-8. [DOI] [PubMed] [Google Scholar]
- Senapathy P. Origin of eukaryotic introns: a hypothesis, based on codon distribution statistics in genes, and its implications. Proc Natl Acad Sci U S A. 1986 Apr;83(7):2133–2137. doi: 10.1073/pnas.83.7.2133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shapiro M. B., Senapathy P. RNA splice junctions of different classes of eukaryotes: sequence statistics and functional implications in gene expression. Nucleic Acids Res. 1987 Sep 11;15(17):7155–7174. doi: 10.1093/nar/15.17.7155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sharp P. M., Cowe E. Synonymous codon usage in Saccharomyces cerevisiae. Yeast. 1991 Oct;7(7):657–678. doi: 10.1002/yea.320070702. [DOI] [PubMed] [Google Scholar]
- Sharp P. M., Tuohy T. M., Mosurski K. R. Codon usage in yeast: cluster analysis clearly differentiates highly and lowly expressed genes. Nucleic Acids Res. 1986 Jul 11;14(13):5125–5143. doi: 10.1093/nar/14.13.5125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith C. W., Chu T. T., Nadal-Ginard B. Scanning and competition between AGs are involved in 3' splice site selection in mammalian introns. Mol Cell Biol. 1993 Aug;13(8):4939–4952. doi: 10.1128/mcb.13.8.4939. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith M. W. Structure of vertebrate genes: a statistical analysis implicating selection. J Mol Evol. 1988;27(1):45–55. doi: 10.1007/BF02099729. [DOI] [PubMed] [Google Scholar]
- Snyder E. E., Stormo G. D. Identification of coding regions in genomic DNA sequences: an application of dynamic programming and neural networks. Nucleic Acids Res. 1993 Feb 11;21(3):607–613. doi: 10.1093/nar/21.3.607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Steitz J. A. Splicing takes a holliday. Science. 1992 Aug 14;257(5072):888–889. doi: 10.1126/science.1386941. [DOI] [PubMed] [Google Scholar]
- Stormo G. D. Consensus patterns in DNA. Methods Enzymol. 1990;183:211–221. doi: 10.1016/0076-6879(90)83015-2. [DOI] [PubMed] [Google Scholar]
- Teem J. L., Abovich N., Kaufer N. F., Schwindinger W. F., Warner J. R., Levy A., Woolford J., Leer R. J., van Raamsdonk-Duin M. M., Mager W. H. A comparison of yeast ribosomal protein gene DNA sequences. Nucleic Acids Res. 1984 Nov 26;12(22):8295–8312. doi: 10.1093/nar/12.22.8295. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Uberbacher E. C., Mural R. J. Locating protein-coding regions in human DNA sequences by a multiple sensor-neural network approach. Proc Natl Acad Sci U S A. 1991 Dec 15;88(24):11261–11265. doi: 10.1073/pnas.88.24.11261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weiner A. M. mRNA splicing and autocatalytic introns: distant cousins or the products of chemical determinism? Cell. 1993 Jan 29;72(2):161–164. doi: 10.1016/0092-8674(93)90654-9. [DOI] [PubMed] [Google Scholar]
- Woolford J. L., Jr Nuclear pre-mRNA splicing in yeast. Yeast. 1989 Nov-Dec;5(6):439–457. doi: 10.1002/yea.320050604. [DOI] [PubMed] [Google Scholar]
- Woolford J. L., Jr Nuclear pre-mRNA splicing in yeast. Yeast. 1989 Nov-Dec;5(6):439–457. doi: 10.1002/yea.320050604. [DOI] [PubMed] [Google Scholar]
- Zhang M. Q., Marr T. G. A weight array method for splicing signal analysis. Comput Appl Biosci. 1993 Oct;9(5):499–509. doi: 10.1093/bioinformatics/9.5.499. [DOI] [PubMed] [Google Scholar]