PLoS One. 2012 Aug 8;7(8):e42465. doi: 10.1371/journal.pone.0042465

Table 1. Summary of the genes tested. Wildtype gene sequences optimized for expression in different organisms¹ were used for embedding various messages.

| Protein | Protein Length | Optimization [Encoding] | Message | Message Length | Δ opt:msg | CAI opt | CAI msg | GC opt | GC msg | Rel. Expression (msg:opt), mean ± SD (p-value) | Assay System |
|---|---|---|---|---|---|---|---|---|---|---|---|
| GST-T7RNAP | 1134 aa | E. coli [msg] | GENEART AG, GERMANY/THE GENE OF YOURCHOICE/MARCH 19TH 2008/WAGNER & LISS…. | 83 char, 498 bit | 351 bp, 10% | 0.90 | 0.85 | 50% | 47% | 1.05±0.21 (0.80) | E. coli |
| GFP | 239 aa | S. cerevisiae [msg] | AEQUOREA VICTORIA. | 18 char, 108 bit | 52 bp, 7% | 0.95 | 0.86 | 35% | 34% | 0.87±0.20 (0.37) | S. cerevisiae |
| GFP | 239 aa | A. thaliana [msg] | AEQUOREA VICTORIA. | 18 char, 108 bit | 69 bp, 10% | 0.92 | 0.89 | 44% | 44% | 0.94±0.39 (0.67); 1.03±0.22 (0.91) | N. benthamiana; in vitro - wheat |
| HIVgag | 513 aa | H. sapiens [msg] | GENE DESIGNED BY MARCUS GRAF/GENEART 2008. | 45 char, 270 bit | 167 bp, 11% | 0.99 | 0.90 | 65% | 59% | 1.03±0.05 (0.40) | HEK 293 |
| EMG1 | 252 aa | H. sapiens [msg] | GENEART AG PAT US1234567 | 24 char, 144 bit | 92 bp, 12% | 0.79 | 0.87 | 64% | 59% | 0.92±0.12 (0.35) | HEK 293 |
| EMG1 | 252 aa | H. sapiens [msg enc] | :JQWF&G%DY%$4Y#′XE%87G;K (Pwd “Secret” → GENEART AG PAT US1234567) | 24 char, 144 bit | 93 bp, 12% | 0.79 | 0.87 | 64% | 59% | 0.67±0.27 (0.19) | HEK 293 |
| GFP | 239 aa | H. sapiens [msg] | AEQUOREA VICTORIA. | 18 char, 108 bit | 63 bp, 9% | 0.97 | 0.91 | 59% | 57% | 1.12±0.31 (0.61); 1.04±0.25 (0.89) | HEK 293; in vitro - rabbit |
| GFP | 239 aa | H. sapiens [msg enc] | 4JT′T&8F#(NWGTU[FB (Pwd “Secret” → AEQUOREA VICTORIA.) | 18 char, 108 bit | 68 bp, 9% | 0.97 | 0.90 | 59% | 55% | 1.00±0.22 (0.91); 1.07±0.37 (0.88) | HEK 293; in vitro - rabbit |
| GFP | 239 aa | H. sapiens [msg + cut] | AEQUOREA VICTORIA. | 18 char, 108 bit | 63 bp, 9% | 0.97 | 0.91 | 59% | 57% | 0.92±0.07 (0.20); 0.91±0.26 (0.54) | HEK 293; in vitro - rabbit |
| GFP | 239 aa | H. sapiens [msg long] | GREEN FLUORESCENT PROTEIN GENEART 2008 | 38 char, 228 bit | 126 bp, 18% | 0.97 | 0.83 | 59% | 48% | 0.81±0.17 (0.23); 0.71±0.43 (0.33) | HEK 293; in vitro - rabbit |

[msg] = Only codons for amino acids with ≥4 alternative codons (AGPTVLRS) were used for data storage.
[msg enc] = The text message was keyed with the password “Secret” using the polyalphabetic Vigenère cipher.
[msg + cut] = The codon usage table for Homo sapiens, used for encryption, was added in a 35 bp sequence after the stop codons of the ORF.
[msg long] = Codons for additional amino acids (CDEFHIKNQY) were used for message embedding, doubling the capacity of the container gene.
Δ opt:msg = Total number and percentage of nucleotides altered by message deposition.
CAI and GC = Codon adaptation index and GC content of the optimized (opt) and watermarked (msg) genes.
Rel. Expression = Mean ratio of watermarked to optimized gene expression ± standard deviation, followed by the Student's t-test p-value.
Assay systems = Constructs were expressed in vivo in E. coli BL21, S. cerevisiae strain AH109, and human HEK293 cells, and in vitro in wheat germ and rabbit reticulocyte lysates. Protein expression was determined by Western blot and ELISA.
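The [msg enc] entries and the Message Length column can be illustrated with a minimal Python sketch. The table does not state the cipher alphabet or how the password is mapped to shifts, so both are assumptions here: a 64-symbol alphabet covering ASCII codes 32 to 95 (which matches the 6 bit per character ratio in the table), and a per-character shift equal to the password character's ASCII code minus 32, taken modulo 64. The function names `vigenere` and `message_bits` are hypothetical. Under these assumptions the sketch reproduces both ciphertexts listed in the table, with the prime mark (′) read as an ASCII apostrophe.

```python
ALPHABET_START = 32  # assumption: the alphabet begins at the ASCII space
ALPHABET_SIZE = 64   # assumption: 64 symbols, i.e. 6 bits per character


def vigenere(text, password, decrypt=False):
    """Vigenere cipher over the assumed 64-symbol alphabet (ASCII 32..95)."""
    sign = -1 if decrypt else 1
    out = []
    for i, ch in enumerate(text):
        value = ord(ch) - ALPHABET_START
        if not 0 <= value < ALPHABET_SIZE:
            raise ValueError(f"character {ch!r} is outside the assumed alphabet")
        # Assumption: the shift is the password character's offset from the
        # space character, reduced modulo the alphabet size.
        key_char = password[i % len(password)]
        shift = (ord(key_char) - ALPHABET_START) % ALPHABET_SIZE
        out.append(chr((value + sign * shift) % ALPHABET_SIZE + ALPHABET_START))
    return "".join(out)


def message_bits(message):
    """Message length in bits at 6 bits per character, as in the table."""
    return 6 * len(message)


cipher = vigenere("AEQUOREA VICTORIA.", "Secret")
print(cipher)                                    # 4JT'T&8F#(NWGTU[FB
print(vigenere(cipher, "Secret", decrypt=True))  # AEQUOREA VICTORIA.
print(message_bits("AEQUOREA VICTORIA."))        # 108
```

Decryption applies the same shifts with the opposite sign, so anyone who knows the password and the alphabet can recover the plaintext from the decoded message.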