BMC Research Notes. 2009 Jul 7;2:125. doi: 10.1186/1756-0500-2-125

DNA watermarks in non-coding regulatory sequences

Dominik Heider 1,3, Martin Pyka 2, Angelika Barnekow 1
PMCID: PMC2713970  PMID: 19583865

Abstract

Background

DNA watermarks can be applied to identify the unauthorized use of genetically modified organisms. It has been shown that coding regions can be used to encrypt information into living organisms by using the DNA-Crypt algorithm. Yet, if the sequence of interest is a non-coding DNA sequence, either the function of a resulting functional RNA molecule or that of a regulatory sequence, such as a promoter, could be affected. For our studies we used the small cytoplasmic RNA 1 in yeast and the lac promoter region of Escherichia coli.

Findings

The lac promoter was deactivated by the integrated watermark. In addition, the RNA molecules displayed altered configurations after introducing a watermark, but surprisingly were functionally intact, which has been verified by analyzing the growth characteristics of both wild type and watermarked scR1 transformed yeast cells. In a third approach we introduced a second overlapping watermark into the lac promoter, which did not affect the promoter activity.

Conclusion

Even though the watermarked RNA and one of the watermarked promoters did not show any significant differences compared to the wild type RNA and wild type promoter region, respectively, it cannot be generalized that other RNA molecules or regulatory sequences behave accordingly. Therefore, we do not recommend integrating watermark sequences into regulatory regions.

Background

DNA watermarks can be used for authenticating genetically modified organisms or, in the future, for labeling animals in breeding [1,2]. It has already been shown in silico and in vivo that these watermarks do not affect the translation of proteins [1-4]. These findings only apply to coding regions; thus, the insertion of watermarks into regulatory sequences, such as promoter regions or regulatory RNA molecules, had to be examined. Watermarks integrated into regulatory regions like promoter or enhancer sequences can affect their functionality. We therefore integrated watermark sequences into a widely known bacterial promoter, the lac promoter of Escherichia coli, to examine whether the watermarks affect the promoter activity. The lac operon of Escherichia coli consists of a promoter, three operators and three genes (lacY, lacZ and lacA), coding for the β-galactoside permease, β-galactosidase and β-galactoside transacetylase, respectively, which are required for the lactose metabolism in Escherichia coli [5]. The β-galactosidase cleaves lactose into glucose and galactose [5]. The promoter sequence of the lac operon contains two highly conserved regions called the -35 (TTTACA) and -10 (TATGTT) regions, upstream of the transcription start. It has already been shown that single point mutations within conserved regions can affect or even abolish the activity of different promoters (Additional file 1) [6-10].

In a second approach we integrated a watermark sequence into a regulatory RNA molecule called small cytoplasmic RNA 1 (scR1) from yeast (scR1 gene on chromosome V of Saccharomyces cerevisiae, http://www.yeastgenome.org/). The scR1 consists of 519 nucleotides and represents the homologous RNA molecule in Saccharomyces cerevisiae to the mammalian 7SL RNA, which is part of the signal recognition particle (SRP), necessary for the targeting of proteins into the endoplasmic reticulum (ER) [11-13]. The SRP complex is a ribonucleoprotein consisting of six polypeptides and the 7SL RNA. In Saccharomyces cerevisiae the six polypeptides are SRP14, SRP19, SRP21, SRP54, SRP68 and SRP72 [14]. The SRP displays a GTPase activity [15]. The 7SL molecule is a conserved functional RNA constituting the backbone of the SRP complex and has been shown to be essential for all organisms tested, except Saccharomyces cerevisiae cells. Saccharomyces cerevisiae cells are viable without scR1, but have a reduced reproduction rate [11,12,14,16-21].

Results and discussion

The analyses of the watermarked DNA sequences of both the scR1 and the lac promoter with the DNA-Crypt fuzzy controller reveal that the use of a correction code is not recommended.

We inserted a watermark sequence starting at position 898 in the lac promoter of pBluescriptII KS+ plasmid. The watermark sequence deactivated the lac promoter, which was verified by qualitative β-galactosidase assay using X-Gal (5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside) as a substrate (Table 1).

Table 1.

Qualitative β-galactosidase assays

Promoter    Qualitative β-galactosidase assay
lac wt      +
lac K5      +
lac II      -

lac wt: the wild type lac promoter

lac K5: the lac promoter containing the watermark sequence WWU

lac II: the lac promoter containing the watermark sequence 42

+: promoter is active

-: promoter is inactive

In 2001, Dieci et al. showed that after transformation of YRA130 ΔscR1 yeast cells with the wild type scR1 Yep352 plasmid, a normal division rate was achieved and the wild type phenotype could be rescued [12]. For our studies we transformed the YRA130 ΔscR1 yeast cells with the Yep352 plasmid containing a watermarked scR1 gene (Yep352-SCR1-TB). The watermark sequence was integrated into the wild type scR1 gene starting at position 471 (see mutagenic primer sequences). The secondary structure predictions of the watermarked scR1 using the ViennaRNA-1.5 web interface revealed significant changes within the secondary structure compared to the structure of the wild type scR1 (Figure 1) [22]. The 3D model was created using Blender http://www.blender.org.

Figure 1.

Secondary structure of the wild type scR1 and the watermarked scR1. The secondary structure predictions were performed using the ViennaRNA-1.5 web interface. The 3D model was created using Blender. A: wild type scR1 of Saccharomyces cerevisiae; B: watermarked scR1 (see section methods)

Although the predicted secondary structure of the watermarked scR1 differs from the wild type structure, the resulting RNA molecule proved to be functionally active. This was demonstrated indirectly: YRA130 cells transformed with either Yep352-SCR1 or Yep352-SCR1-TB displayed no significant differences in their growth characteristics (Figure 2).

Figure 2.

Growth characteristics. Growth characteristics of strain YRA130 and the Yep352-SCR1 and Yep352-SCR1-TB transformed YRA130 strains in YPD full medium (A) and SD-ura medium (B). Three independent experiments with a standard deviation <0.03 are shown. All data show p-values <0.05, except YRA130 cells transformed with Yep352-SCR1 compared with YRA130 cells transformed with Yep352-SCR1-TB.

In addition, we inserted a second, overlapping watermark sequence into the lac promoter region. This watermark sequence was integrated into the wild type lac promoter sequence starting at position 903 in the pBluescriptII KS+ plasmid and overlapped with the aforementioned deactivating watermark sequence. The resulting watermarked lac promoter proved to be active, as verified by the qualitative β-galactosidase assay using X-Gal. Furthermore, it did not display any significant differences compared to the wild type lac promoter in the quantitative β-galactosidase assay with ONPG (ortho-nitrophenyl-β-galactoside) (Table 1, Figure 3).

Figure 3.

Quantitative β-galactosidase assay with ONPG. Three independent analyses were performed.

Conclusion

Our results show that the integration of watermarks is not suitable for regulatory regions such as promoters or regulatory RNA molecules. We have shown that these watermarks can deactivate promoter regions and can alter the predicted secondary structure of regulatory RNA molecules. Although the watermarked scR1 was not deactivated by the integration of a watermark sequence, the secondary structure prediction displayed an altered structure. These results cannot be generalized to other RNA molecules.

The effect of integrated watermarks in non-coding regulatory regions cannot be generalized and has to be tested in every single case. Therefore, we propose not to integrate watermark sequences into regulatory sequences.

Methods

Watermark design

We wanted to integrate a watermark into the lac promoter containing the answer to life, the universe, and everything [23]. Furthermore, we wanted to encrypt the watermark 'TB' into the scR1 gene and 'WWU' into the lac promoter sequence. To demonstrate the flexibility of our algorithm we used different translation codes. The first one only used the binary representation. The second translation code, used for the scR1 experiments, differs only slightly from the standard one used in DNA-Crypt [1]. The third one imitates the official international Morse code. In all cases, however, the binary encoding tables described in Heider and Barnekow 2007 were used [1]. The inserted DNA watermark sequences are

42 → 101010₂ → CCC

TB → 1001100001₂ → CGCTG

and

WWU → .--.--..- → 0110110011₂ → GCATA
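The three examples above are consistent with a simple two-bits-per-base mapping (00 → T, 01 → G, 10 → C, 11 → A). This mapping is inferred from the listed sequences only; it is an assumption for illustration and not necessarily the translation table used by DNA-Crypt. A minimal Python sketch under that assumption (the names `BITS_TO_BASE` and `bits_to_dna` are hypothetical):

```python
# Two-bit-to-base mapping inferred from the three examples in the text.
# Assumption: illustrative only; it may differ from the DNA-Crypt tables.
BITS_TO_BASE = {"00": "T", "01": "G", "10": "C", "11": "A"}


def bits_to_dna(bits: str) -> str:
    """Translate an even-length bit string into DNA, two bits per base."""
    if len(bits) % 2 != 0:
        raise ValueError("bit string length must be even")
    return "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, len(bits), 2))


if __name__ == "__main__":
    examples = {
        "42": format(42, "b"),  # plain binary representation of the number 42
        "TB": "1001100001",     # bit string as given in the text
        "WWU": "0110110011",    # bit string as given in the text
    }
    for label, bits in examples.items():
        print(label, bits, bits_to_dna(bits))
```

Under this assumed mapping the sketch reproduces the sequences CCC, CGCTG and GCATA listed above. It does not model how the bases are placed into the artificial reading frames of the target sequence.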

Because reading frames do not exist in non-coding regions, we created artificial reading frames within the scR1 gene and the lac promoter sequence in order to apply the DNA-Crypt algorithm. For cost-benefit reasons, we scanned the DNA sequences of the scR1 gene and the lac promoter manually for the best location for integrating authenticating watermarks. The two watermark sequences in the lac promoter region overlap.

DNA-Crypt fuzzy controller

The watermark sequences were analyzed with the DNA-Crypt fuzzy controller with standard settings. The life time was set at 1000 cycles [1]. Using its rule base, the DNA-Crypt fuzzy controller recommends whether or not to use a specific mutation correction code, based on three input dimensions: the length of the watermark sequence, the individual mutation rate and the life time [1].

Secondary structure prediction

The secondary structure predictions of the RNA molecules were performed using the ViennaRNA-1.5 web interface with energy parameters rescaled to 30°C [22]. The three-dimensional models were created with Blender.

Site-directed mutagenesis

Site-directed mutagenesis was used to integrate the watermark sequences into the wild type DNA. We used a modified QuikChange® Site-Directed Mutagenesis Kit protocol from Stratagene (Stratagene, Amsterdam, The Netherlands) described in Heider and Barnekow 2008 [4]. For the scR1 studies we used the Yep352-SCR1 plasmid carrying the wild type sequence of the scR1 gene (a kind gift of Giorgio Dieci, Universita di Parma, Italy) [12]. For our lac promoter studies we used the pBluescriptII KS+ plasmid (Stratagene, Amsterdam, The Netherlands). The site-directed mutagenesis was performed with the following primer sequences:

scR1 – forward primer:

5'-GCACATTGTGGCCGTGCCCTCTGGGATGGAGTGTGTC-3'

scR1 – reverse primer:

5'-GACACACTCCATCCCAGAGGGCACGGCCACAATGTGC-3'

lac – forward primer:

5'-GTTAGCTCACTCATTTGGGACCCCAGGCTTTACA-3'

lac – reverse primer:

5'-TGTAAAGCCTGGGGTCCCAAATGAGTGAGCTAAC-3'

lac2 – forward primer:

5'-CTCACTCATTAGGCCCCCCAGGCCTTACACTTTATGC-3'

lac2 – reverse primer:

5'-GCATAAAGTGTAAGGCCTGGGGGGCCTAATGAGTGAG-3'

The bold letters represent the watermark sequences. The mutagenesis was confirmed by sequencing with the M13 primer 5'-GTAAAACGACGGCCAGT-3' and the lac-SEQ primer 5'-CCGCCTCTCCCCGCG-3', respectively.
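Each reverse primer listed above should be the reverse complement of its forward primer, as is usual for this type of mutagenesis protocol. The following Python sketch checks this for the three primer pairs; the sequences are copied from the list above, while the helper names `reverse_complement` and `check_primer_pairs` are hypothetical.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Primer pairs (forward, reverse) as listed in the text, 5' to 3'.
PRIMERS = {
    "scR1": ("GCACATTGTGGCCGTGCCCTCTGGGATGGAGTGTGTC",
             "GACACACTCCATCCCAGAGGGCACGGCCACAATGTGC"),
    "lac": ("GTTAGCTCACTCATTTGGGACCCCAGGCTTTACA",
            "TGTAAAGCCTGGGGTCCCAAATGAGTGAGCTAAC"),
    "lac2": ("CTCACTCATTAGGCCCCCCAGGCCTTACACTTTATGC",
             "GCATAAAGTGTAAGGCCTGGGGGGCCTAATGAGTGAG"),
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def check_primer_pairs(primers: dict) -> dict:
    """Map each primer name to True if reverse == revcomp(forward)."""
    return {name: reverse_complement(fwd) == rev
            for name, (fwd, rev) in primers.items()}


if __name__ == "__main__":
    for name, ok in check_primer_pairs(PRIMERS).items():
        print(name, "complementary" if ok else "MISMATCH")
```

A check of this kind is a cheap way to catch transcription errors in primer lists before ordering oligonucleotides.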

Transformation of bacteria

For our lac promoter studies we used the Escherichia coli DH5α strain. 20 ml 2YT were inoculated with 200 μl overnight culture of DH5α bacteria and incubated at 37°C to an OD600 of 0.3 to 0.5. The cells were centrifuged for 10 minutes at 2000 × g and mixed with 10 ml ice-cold sterile 0.1 M CaCl2. After incubation for 30 minutes on ice, the cells were centrifuged for 10 minutes at 4°C and 2000 × g and resuspended in 2 ml ice-cold 0.1 M CaCl2. 200 μl of the cells were mixed with 50 ng DNA. After incubation for 30 minutes on ice, the cells were heat-shocked for 45 seconds at 42°C and cooled down for 2 minutes on ice. After mixing with 500 μl 2YT the cells were incubated for 45 minutes at 37°C and 220 rpm. Finally, 200 μl of the cell suspension were plated and grown overnight on 2YT plates containing ampicillin.

Transformation of yeast

For our scR1 experiments we used the YRA130 ΔscR1::His3 yeast strain, which has a reduced reproduction rate (a kind gift of Peter Walter, University of California, San Francisco, USA). The yeast strain YRA130 was transformed using the lithium acetate method and grown on SD-ura plates [24].

β-galactosidase assay

The activity of the lac promoter was verified by qualitative and quantitative β-galactosidase assays, using 5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside (X-Gal, Carl Roth GmbH, Karlsruhe, Germany) and ONPG (Carl Roth GmbH, Karlsruhe, Germany), respectively. For the qualitative β-galactosidase assay we pipetted 40 μl 100 mM IPTG (Carl Roth GmbH, Karlsruhe, Germany) and 40 μl X-Gal (20 mg/ml) onto a 2YT agar plate containing ampicillin. After drying the plates for one hour, 20 μl of overnight cultures were plated and incubated overnight at 37°C. We performed the quantitative analyses using the method established by Miller in 1972, with the modifications published by Zhang and Bremer and some additional modifications [25,26]. We used 100 μl of IPTG-induced overnight culture for each assay and froze the samples for 30 minutes at -20°C before adding the permeabilization buffer. In addition, we used a substrate buffer containing 3 mg/ml instead of 1 mg/ml ONPG to guarantee saturation of the β-galactosidase.
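For orientation, the classic Miller calculation normalizes ONPG hydrolysis to reaction time, culture volume and cell density. The Python sketch below implements that textbook formula only; the modified protocol described above may compute activity differently, and the function name `miller_units` and all example readings are hypothetical.

```python
def miller_units(od420: float, od550: float, od600: float,
                 minutes: float, volume_ml: float) -> float:
    """Classic Miller formula: 1000 * (OD420 - 1.75 * OD550) / (t * v * OD600).

    od420, od550: absorbance of the reaction mixture
    od600:        cell density of the culture before the assay
    minutes:      reaction time t
    volume_ml:    culture volume v used in the assay
    """
    if minutes <= 0 or volume_ml <= 0 or od600 <= 0:
        raise ValueError("time, volume and OD600 must be positive")
    return 1000.0 * (od420 - 1.75 * od550) / (minutes * volume_ml * od600)


if __name__ == "__main__":
    # Hypothetical readings, not data from this study.
    print(miller_units(od420=0.5, od550=0.02, od600=0.4,
                       minutes=10, volume_ml=0.1))
```

The OD550 term corrects for light scattering by cell debris; protocols that clear the sample before reading often drop it.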

Growth characteristics

The growth characteristics of YRA130, and the Yep352-SCR1 and Yep352-SCR1-TB transformed YRA130 cells were analyzed by measuring optical densities at 600 nm every 60 minutes for nine hours (Pharmacia LKB Novaspec II).
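Hourly OD600 readings of this kind are commonly summarized as a doubling time during exponential growth. The following Python sketch shows that standard calculation from two readings; it is not necessarily the analysis used for Figure 2, and the function name `doubling_time` and the example values are hypothetical.

```python
import math


def doubling_time(od_start: float, od_end: float,
                  elapsed_minutes: float) -> float:
    """Doubling time in minutes, assuming exponential growth between readings.

    Uses t_d = elapsed * ln(2) / ln(od_end / od_start).
    """
    if od_start <= 0 or od_end <= od_start or elapsed_minutes <= 0:
        raise ValueError("need 0 < od_start < od_end and a positive interval")
    return elapsed_minutes * math.log(2) / math.log(od_end / od_start)


if __name__ == "__main__":
    # Hypothetical readings: OD600 rises from 0.1 to 0.4 within three hours.
    print(doubling_time(od_start=0.1, od_end=0.4, elapsed_minutes=180))
```

The estimate is only meaningful for readings taken within the exponential phase, so lag-phase and stationary-phase time points should be excluded.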

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

DH: conception, structure predictions, sequence alignments, mutagenesis, transformation of yeast and bacteria cells, microscopy, β-galactosidase assays, growth characteristics, figure preparation, manuscript preparation. MP: secondary structure visualization. AB: conception, manuscript preparation. All authors read and approved the final manuscript.

Supplementary Material

Additional file 1

Table S1 – Conserved promoter regions. lac: the lac promoter; hexokinase: Rat type III hexokinase promoter; H2B: histone H2B promoter; AD4: adenovirus type 4; AD2: adenovirus type 2; SV40: SV40 enhancer

Acknowledgments

The authors want to thank Peter Walter (University of California, San Francisco, USA) for the YRA130 strain, Giorgio Dieci (Universita di Parma, Italy) for the Yep352-SCR1 plasmid and Sarah-Jane Barnard for critically reading the manuscript. This work is part of the PhD thesis of D.H.

Contributor Information

Dominik Heider, Email: dominik.heider@uni-due.de.

Martin Pyka, Email: martin.pyka@uni-muenster.de.

Angelika Barnekow, Email: barneko@uni-muenster.de.

References

  1. Heider D, Barnekow A. DNA-based watermarks using the DNA-Crypt algorithm. BMC Bioinformatics. 2007;8:176. doi: 10.1186/1471-2105-8-176.
  2. Heider D, Kessler D, Barnekow A. Watermarking sexually reproducing diploid organisms. Bioinformatics. 2008;24:1961–1962. doi: 10.1093/bioinformatics/btn342.
  3. Heider D, Barnekow A. DNA watermarks and sexual transmission. Europ J Cell Biol. 2008;87S1:58.
  4. Heider D, Barnekow A. DNA watermarks: A proof of concept. BMC Molecular Biology. 2008;9:40. doi: 10.1186/1471-2199-9-40.
  5. Beckwith JR. Regulation of the lac operon. Recent studies on the regulation of lactose metabolism in Escherichia coli support the operon model. Science. 1967;156:597–604. doi: 10.1126/science.156.3775.597.
  6. Rothmel RK, LeClerc JE. Mutational analysis of the lac regulatory region: second-site changes that activate mutant promoters. Nucleic Acids Research. 1989;17:3909–3925. doi: 10.1093/nar/17.10.3909.
  7. Depto AS, Stenberg RM. Regulated Expression of the Human Cytomegalovirus pp65 Gene: Octamer Sequence in the Promoter Is Required for Activation by Viral Gene Products. Journal of Virology. 1989;63:1232–1238. doi: 10.1128/jvi.63.3.1232-1238.1989.
  8. Bentley K, Deacon N, Sonza S, Zeichner S, Churchill M. Mutational analysis of the HIV-1 LTR as a promoter of negative sense transcription. Arch Virol. 2004;149:2277–2294. doi: 10.1007/s00705-004-0386-8.
  9. Romano G, Suon S, Jin H, Donaldson AE, Iacovitti L. Characterization of Five Evolutionary Conserved Regions of the Human Tyrosine Hydroxylase (TH) Promoter: Implications for the Engineering of a Human TH Minimal Promoter Assembled in a Self-Inactivating Lentiviral Vector System. J Cell Physiol. 2005;204:666–677. doi: 10.1002/jcp.20319.
  10. Almeida AM, Murakami Y, Layton DM, Hillmen P, Sellick GS, Maeda Y, Richards S, Patterson S, Kotsianidis I, Crawford LMDH, Baker A, Ferguson M, Roberts I, Houlston R, Kinoshita T, Karadimitris A. Hypomorphic promoter mutation in PIGM causes inherited glycosylphosphatidylinositol deficiency. Nature Medicine. 2006;12:846–851. doi: 10.1038/nm1410.
  11. Felici F, Cesareni G, Hughes JM. The Most Abundant Small Cytoplasmic RNA of Saccharomyces cerevisiae Has an Important Function Required for Normal Cell Growth. Mol Cell Biol. 1989;9:3260–3268. doi: 10.1128/mcb.9.8.3260.
  12. Dieci G, Giuliodori S, Catellani M, Percudani R, Ottonello S. Intragenic Promoter Adaptation and Facilitated RNA Polymerase III Recycling in the Transcription of SCR1, the 7SL RNA Gene of Saccharomyces cerevisiae. J Biol Chem. 2001;277:6903–6914. doi: 10.1074/jbc.M105036200.
  13. Wild K, Weichenrieder O, Strub K, Sinning I, Cusack S. Towards the structure of the mammalian signal recognition particle. Curr Opin Struct Biol. 2002;12:72–81. doi: 10.1016/s0959-440x(02)00292-0.
  14. Rosenblad AM, Gorodkin J, Knudsen B, Zwieb C, Samuelsson T. SRPDB (Signal Recognition Particle Database). Nucleic Acids Res. 2003;31:363–364. doi: 10.1093/nar/gkg107.
  15. Borschbach M, Hauke S, Pyka M, Heider D. Opportunities and limitations of a principle component analysis optimized machine learning approach for the identification and classification of cancer involved proteins. Communications of the SIWN. 2009;6:85–89.
  16. Hann BC, Walter P. The signal sequence recognition particle in S. cerevisiae. Cell. 1991;67:131–144. doi: 10.1016/0092-8674(91)90577-l.
  17. Stirling CJ, Hewitt EW. The S. cerevisiae SEC65 gene encodes a component of yeast signal recognition particle with homology to human SRP19. Nature. 1992;356:534–537. doi: 10.1038/356534a0.
  18. Ogg SC, Poritz MA, Walter P. Signal sequence recognition particle receptor is important for cell growth and protein secretion in Saccharomyces cerevisiae. Mol Biol Cell. 1992;3:895–911. doi: 10.1091/mbc.3.8.895.
  19. Brown J, Hann BC, Medzihradszky KF, Niwa M, Burlingame AL, Walter P. Subunits of the Saccharomyces cerevisiae signal recognition particle required for its functional expression. EMBO J. 1994;13:4390–4400. doi: 10.1002/j.1460-2075.1994.tb06759.x.
  20. Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22:4673–4680. doi: 10.1093/nar/22.22.4673.
  21. Kin T, et al. fRNAdb: a platform for mining/annotating functional RNA candidates from non-coding RNA sequences. Nucleic Acids Res. 2007;35:D145–D148. doi: 10.1093/nar/gkl837.
  22. Vienna RNA Secondary Structure Prediction. A web interface to the RNAfold program, version 1.5. http://rna.tbi.univie.ac.at/cgi-bin/RNAfold.cgi
  23. Adams D. The Hitchhiker's Guide to the Galaxy. Ballantine Books; 1979.
  24. Kail M, Barnekow A. Identification and characterization of interacting partners of Rab GTPases by yeast two-hybrid analyses. Methods Mol Biol. 2008;440:111–125. doi: 10.1007/978-1-59745-178-9_9.
  25. Miller JH. Assay of β-Galactosidase. In: Experiments in Molecular Genetics. Cold Spring Harbor Laboratory; 1972. pp. 352–355.
  26. Zhang X, Bremer H. Control of the Escherichia coli rrnB P1 Promoter Strength by ppGpp. J Biol Chem. 1995;270:11181–11189. doi: 10.1074/jbc.270.19.11181.
