Abstract
At the DNA/RNA level, biological signals are defined by a combination of spatial structures and sequence motifs. Until now, few attempts had been made in writing general purpose search programs that take into account both sequence and structure criteria. Indeed, the most successful structure scanning programs are usually dedicated to particular structures and are written using general purpose programming languages through a complex and time consuming process where the biological problem of defining the structure and the computer engineering problem of looking for it are intimately intertwined. In this paper, we describe a general representation of structures, suitable for database scanning, together with a programming language, Palingol, designed to manipulate it. Palingol has specific data types, corresponding to structural elements-basically helices-that can be arranged in any way to form a complex structure. As a consequence of the declarative approach used in Palingol, the user should only focus on 'what to search for' while the language engine takes care of 'how to look for it'. Therefore, it becomes simpler to write a scanning program and the structural constraints that define the required structure are more clearly identified.
Full Text
The Full Text of this article is available as a PDF (124.8 KB).
Selected References
These references are in PubMed. This may not be the complete list of references from this article.
- Cotmore S. F., Tattersall P. An asymmetric nucleotide in the parvoviral 3' hairpin directs segregation of a single active origin of DNA replication. EMBO J. 1994 Sep 1;13(17):4145–4152. doi: 10.1002/j.1460-2075.1994.tb06732.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dandekar T., Hentze M. W. Finding the hairpin in the haystack: searching for RNA motifs. Trends Genet. 1995 Feb;11(2):45–50. doi: 10.1016/s0168-9525(00)88996-9. [DOI] [PubMed] [Google Scholar]
- Dandekar T., Stripecke R., Gray N. K., Goossen B., Constable A., Johansson H. E., Hentze M. W. Identification of a novel iron-responsive element in murine and human erythroid delta-aminolevulinic acid synthase mRNA. EMBO J. 1991 Jul;10(7):1903–1909. doi: 10.1002/j.1460-2075.1991.tb07716.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Devereux J., Haeberli P., Smithies O. A comprehensive set of sequence analysis programs for the VAX. Nucleic Acids Res. 1984 Jan 11;12(1 Pt 1):387–395. doi: 10.1093/nar/12.1part1.387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Duret L., Mouchiroud D., Gouy M. HOVERGEN: a database of homologous vertebrate genes. Nucleic Acids Res. 1994 Jun 25;22(12):2360–2365. doi: 10.1093/nar/22.12.2360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eddy S. R., Durbin R. RNA sequence analysis using covariance models. Nucleic Acids Res. 1994 Jun 11;22(11):2079–2088. doi: 10.1093/nar/22.11.2079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fichant G. A., Burks C. Identifying potential tRNA genes in genomic DNA sequences. J Mol Biol. 1991 Aug 5;220(3):659–671. doi: 10.1016/0022-2836(91)90108-i. [DOI] [PubMed] [Google Scholar]
- Gautheret D., Major F., Cedergren R. Pattern searching/alignment with RNA primary and secondary structures: an effective descriptor for tRNA. Comput Appl Biosci. 1990 Oct;6(4):325–331. doi: 10.1093/bioinformatics/6.4.325. [DOI] [PubMed] [Google Scholar]
- Grundy F. J., Henkin T. M. Conservation of a transcription antitermination mechanism in aminoacyl-tRNA synthetase and amino acid biosynthesis genes in gram-positive bacteria. J Mol Biol. 1994 Jan 14;235(2):798–804. doi: 10.1006/jmbi.1994.1038. [DOI] [PubMed] [Google Scholar]
- Grundy F. J., Henkin T. M. tRNA as a positive regulator of transcription antitermination in B. subtilis. Cell. 1993 Aug 13;74(3):475–482. doi: 10.1016/0092-8674(93)80049-k. [DOI] [PubMed] [Google Scholar]
- Hentze M. W., Caughman S. W., Casey J. L., Koeller D. M., Rouault T. A., Harford J. B., Klausner R. D. A model for the structure and functions of iron-responsive elements. Gene. 1988 Dec 10;72(1-2):201–208. doi: 10.1016/0378-1119(88)90145-x. [DOI] [PubMed] [Google Scholar]
- Laferrière A., Gautheret D., Cedergren R. An RNA pattern matching program with enhanced performance and portability. Comput Appl Biosci. 1994 Apr;10(2):211–212. doi: 10.1093/bioinformatics/10.2.211. [DOI] [PubMed] [Google Scholar]
- Leibold E. A., Laudano A., Yu Y. Structural requirements of iron-responsive elements for binding of the protein involved in both transferrin receptor and ferritin mRNA post-transcriptional regulation. Nucleic Acids Res. 1990 Apr 11;18(7):1819–1824. doi: 10.1093/nar/18.7.1819. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lisacek F., Diaz Y., Michel F. Automatic identification of group I intron cores in genomic DNA sequences. J Mol Biol. 1994 Jan 28;235(4):1206–1217. doi: 10.1006/jmbi.1994.1074. [DOI] [PubMed] [Google Scholar]
- Mehldau G., Myers G. A system for pattern matching applications on biosequences. Comput Appl Biosci. 1993 Jun;9(3):299–314. doi: 10.1093/bioinformatics/9.3.299. [DOI] [PubMed] [Google Scholar]
- Melefors O., Hentze M. W. Translational regulation by mRNA/protein interactions in eukaryotic cells: ferritin and beyond. Bioessays. 1993 Feb;15(2):85–90. doi: 10.1002/bies.950150203. [DOI] [PubMed] [Google Scholar]
- Michel F., Westhof E. Modelling of the three-dimensional architecture of group I catalytic introns based on comparative sequence analysis. J Mol Biol. 1990 Dec 5;216(3):585–610. doi: 10.1016/0022-2836(90)90386-Z. [DOI] [PubMed] [Google Scholar]
- Moszer I., Glaser P., Danchin A. SubtiList: a relational database for the Bacillus subtilis genome. Microbiology. 1995 Feb;141(Pt 2):261–268. doi: 10.1099/13500872-141-2-261. [DOI] [PubMed] [Google Scholar]
- Médigue C., Moszer I., Viari A., Danchin A. Analysis of a Bacillus subtilis genome fragment using a co-operative computer system prototype. Gene. 1995 Nov 7;165(1):GC37–GC51. doi: 10.1016/0378-1119(95)00636-k. [DOI] [PubMed] [Google Scholar]
- Saurin W., Marlière P. Matching relational patterns in nucleic acid sequences. Comput Appl Biosci. 1987 Jun;3(2):115–120. doi: 10.1093/bioinformatics/3.2.115. [DOI] [PubMed] [Google Scholar]
- Shapiro B. A., Zhang K. Z. Comparing multiple RNA secondary structures using tree comparisons. Comput Appl Biosci. 1990 Oct;6(4):309–318. doi: 10.1093/bioinformatics/6.4.309. [DOI] [PubMed] [Google Scholar]
- Sibbald P. R., Sommerfeldt H., Argos P. Overseer: a nucleotide sequence searching tool. Comput Appl Biosci. 1992 Feb;8(1):45–48. doi: 10.1093/bioinformatics/8.1.45. [DOI] [PubMed] [Google Scholar]
- Spedding G., Gluick T. C., Draper D. E. Ribosome initiation complex formation with the pseudoknotted alpha operon messenger RNA. J Mol Biol. 1993 Feb 5;229(3):609–622. doi: 10.1006/jmbi.1993.1067. [DOI] [PubMed] [Google Scholar]
- Woese C. R., Gutell R., Gupta R., Noller H. F. Detailed analysis of the higher-order structure of 16S-like ribosomal ribonucleic acids. Microbiol Rev. 1983 Dec;47(4):621–669. doi: 10.1128/mr.47.4.621-669.1983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zuker M. Prediction of RNA secondary structure by energy minimization. Methods Mol Biol. 1994;25:267–294. doi: 10.1385/0-89603-276-0:267. [DOI] [PubMed] [Google Scholar]
- d'Aubenton Carafa Y., Brody E., Thermes C. Prediction of rho-independent Escherichia coli transcription terminators. A statistical analysis of their RNA stem-loop structures. J Mol Biol. 1990 Dec 20;216(4):835–858. doi: 10.1016/s0022-2836(99)80005-9. [DOI] [PubMed] [Google Scholar]