Skip to main content
Biochemistry and Biophysics Reports logoLink to Biochemistry and Biophysics Reports
. 2020 Mar 5;22:100749. doi: 10.1016/j.bbrep.2020.100749

The information isn't lost in gene expression

Massimo Fioranelli 1,∗∗, Maria Grazia Roccia 1
PMCID: PMC7063240  PMID: 32181372

Abstract

The present question is the possibility of information loss in gene expression? Information loss in the gene expression disrupts the cellular dynamics and can lead to serious defects, including cancer. Using Gottesman and Preskill method for calculating information loss in black holes,a mechanism for calculating the amount of information transformation in gene expression is proposed. In this proposal, there are three different Hilbert spaces that belong to degrees of freedom of DNA, RNA, and protein. The genetic sequence of the DNA is transcribed into protein at two stages. At first stage it is shown that the internal stationary state of the cell can be represented by a maximally entangled two-mode squeezed state of DNA and mRNA.At second stage, the state of the cell is described by a maximally entangled two-mode squeezed state of mRNA and protein. The amount of information transformation can be obtain by projecting the state at first stage on the state at second stage. Evidently for all finite values of the transcription factor concentration y, binding energy E and free energy F of the transcription factor, the information isn't lost in gene expression.

Keywords: Gene expression, Information loss, DNA, RNA, Protein

Highlights

  • Using Gottesman and Preskill method,a mechanism for calculating the amount of information transformation in gene expression is proposed.

  • The genetic sequence of the DNA is transcribed into protein at two stages.

  • At first stage, the internal stationary state of the cell can be represented by a maximally entangled two-mode squeezed state of DNA and mRNA.

  • At second stage, the state of the cell is described by a maximally entangled two-mode squeezed state of mRNA and protein.

  • The amount of information transformation can be obtain by projecting the state at first stage at second stage.

1. Introduction

The information necessary for the functioning of a given organism is encoded in. its DNA [1,2]. Gene expression is a process by which this information is extracted from the DNA in order to synthesize proteins that carry out specific functions in the cell. The expression of genes in cells is controlled mainly by binding and unbinding of regulatory proteins, called tran-scription factors (TFs), to sepecific short DNA sequences, called binding sites [[3], [4], [5], [6]]. These regulatory proteins can act either as activators, which means they increase the rate of expression of the genes, or as repressors that decrease the rate of expression of the regulated genes. The genetic sequence of the DNA is transcribed into mRNA by a holoenzyme called RNA polymerase. Activators often act by recruiting the polymerase, whereas repressors often act by sterically blocking the polymerase from binding. Ribosomes translate mRNA strands into proteins [6].

The present question is the possibility of information loss in gene expression? Information loss in the gene expression disrupts the cellular dynamics and can lead to serious defects, including cancer. Using Gottesman and Preskill method [7] we suggest a mechanism to calculate the amount of information transformation in gene expression. In this proposal, there are three different Hilbert spaces that belong to degrees of freedom of DNA, RNA, and protein. We define annihilation and creation operators for information in each space and obtain the relation between these operators during transcribing genetic sequence of the DNA into mRNA and then transcribing information of the mRNA into protein. We derive the entangled state on DNA and mRNA spaces at first stage and the entangled state on mRNA and protein spaces at second stage. The amount of information loss is obtained by projection the first state onto the second state.

The outline of the paper is as follows. In section II we obtain the entangled two-mode squeezed states on DNA and mRNA Hilbert spaces of cells. Then we study the entangled two-mode squeezed states on mRNA and protein Hilbert spaces of the cells in section III. Finally we calculate the information transformation from DNA to protein in section IV. The last section is devoted to summary and conclusion.

2. The entangled two-mode squeezed states on DNA and mRNA Hilbert spaces of cells

The primary step in the production of a protein is the transcription of a gene from the DNA template to a m(essenger)-RNA molecule by activating RNA-polymerase by transcription factors. Transcription factors are proteins which bind to specific sites on DNA. These binding sites are located in the so-called regulatory region of a gene, which for yeast typically extends over several hundred nucleotides before the starting point of transcription. A fully assembled set of transcription factors will attract and activate the machinery (called RNA-polymerase) which transcribes the DNA template to an mRNA molecule. When one site is occupied by transcription factors, the regulated gene will get transcribed into mRNA.Thus the probability of this transcribing is equal to the probability for a site to be occupied.

Assuming that a transcription factor may either be bound at a binding site, or be suspended in solution or bound non-specifically elsewhere on DNA, the probability of a given site being occupied is given by elementary statistical mechanics [5,6] as

noccupied(y)=yeβEyeβE+eβF (1)

and depends on the transcription factor concentration y, binding energy E and a free energy F of the transcription factor in solution or bound elsewhere.

Now we want to consider the transcribing information of DNA into mRNA.To this end we construct two Hilbert spaces with a set of operators of creation/annihilation that have the same commutation properties. The total Hilbert space of cell at this stage is the tensor product of the two spaces Hcell=HDNAHmRNA, where in this case Hcell, HDNA and HmRNA denote the physical quantum states spaces of cell, DNA and mRNA respectively.

The commutation relations satisfied by the various operators in these spaces are:

[αE,F,yDNA,αE,F,ymRNA]=0,[αE,F,yDNA,αE,F,yDNA]=δE,EδF,Fδy,y,[αE,F,ymRNA,αE,F,ymRNA]=δE,EδF,Fδy,y[αE,F,yDNA,αE,F,ymRNA]=0,[αE,F,yDNA,αE,F,yDNA]=0,[αE,F,ymRNA,αE,F,ymRNA]=0 (2)

Where αE,F,yDNA,αE,F,yDNA are annihilation and creation operators that act on DNA space of cells respectively. Also αE,F,ymRNA,αE,F,ymRNA are annihilation and creation operators that act on mRNA space of cells respectively. The indices E,F,y are binding energy, free energy and concentration of transcription factor respectively. The Bogoliubov transformation between these operators can be written as following:

αE,F,ycell=cosh(rE,F,y)αE,F,yDNA+sinh(rE,F,y)αE,F,ymRNA (3)

where αE,F,ycell is annihilation operator that acts on the Hilbert space of cell. By using the Bogoliubov transformation, the condition

αE,F,ycell|cellDNAmRNA=0 (4)

give rise the so called thermal state conditions:

cosh(rE,F,Y)αE,F,YDNA+sinh(rE,F,y)αE,F,YmRNA|cellDNAmRNA (5)

or equivalently

αE,F,yDNA+tanh(rE,F,y)αE,F,ymRNA|cellDNAmRNA=0 (6)

Now, we assume that when information is transcribed of DNA into mRNA, the cell state |cellDNAmRNA is related to the cell vacuum |0 by

|cellDNAmRNA=F(αE,F,yDNA,αE,F,ymRNA)|0 (7)

where F is some function to be determined later.

From [αE,F,yDNA,αE,F,ymRNA]=1 , we obtain [αE,F,yDNA,(αE,F,ymRNA)m]=αE,F,yDNA(αE,F,ymRNA)m and [αE,F,yDNA,F]=FαE,F,yDNA. Then using equations (6) and (7), we get the following differential equation for F.

(FαE,F,yDNAtanh(rE,F,y)αE,F,ymRNAF)=0 (8)

and the solution is given by

F=etanh(rE,F,y)αE,F,yDNAαE,F,ymRNA (9)

By substituting (6), (7) and by properly normalizing the state vector, we get

|cellDNAmRNA=etanh(rE,F,y)αE,F,yDNAαE,F,ymRNA|0=1coshrE,F,ymtanhmrE,F,y|m(E,F,y)DNA|m(E,F,y)mRNA (10)

where |m(E,F,y)DNAand|m(E,F,y)mRNA are orthonormal bases (normal mode solutions) for HDNA and HmRNA respectively. Now we intend to determine tanh(rE,F,y) by calculating the probability for transcribing information into mRNA and comparing it with equation (1). We can calculate this probability as the following:

nmRNA=cell|DNAmRNAαE,F,ymRNAαE,F,ymRNA|cellDNAmRNA=m(E,F,y)|mRNAm(E,F,y)|DNA1cosh2rE,F,yαE,F,ymRNAαE,F,ymRNAn=0tanh2n(rE,F,y)|m(E,F,y)DNA|m(E,F,y)mRNA=m(E,F,y)1|mRNAm(E,F,y)|DNA1cosh2(rE,F,y)n=0tanh2n(rE,F,y)(n)|m(E,F,y)DNA|m(E,F,y)1mRNA=1cosh2rE,F,yn=0tanh2n(rE,F,y)(n)=sinh2(rE,F,y) (11)

As mention before when one site of DNA is occupied, it's information will get transcribed into mRNA.Thus the probability of transcribing information into mRNA is equal to the probability for a site to be occupied.

nmRNA=noccupiedsinh2(rE,F,y)=yeβEyeβE+eβFtanh(rE,F,y)=111+yeβEyeβE+eβF (12)

Equation (10) shows that the internal stationary state of the cell can be represented by a maximally entangled two-mode squeezed state of DNA and mRNA.The entanglement between these molecules depends on binding energy, free energy and concentration of transcription factor.

3. The entangled two-mode squeezed states on mRNA and protein Hilbert spaces of cell

Now we consider the second step in extracting information from the DNA in order to synthesize proteins. At this stage, the mRNA molecule will be translated to a protein. The proposed reaction scheme for this translation is shown in Fig.(1) [8]Before protein production, mRNA molecules may bind to form a complex with rate α, the dissociation rate for this complex is β. The probability for translating one protein from a single mRNA before it decays can be obtained as [8]:

nprotein=kp1(μc+β)+kp2αμm(μc+β)+μcα (13)

Fig. 1.

Fig. 1

Kinetic scheme for regulation of protein production by a mRNA-binding regulator [8].

In this scheme, the parameters kp1 and kp2 are the rates of protein production from the mRNA in free and bound states and μm and μc are the corresponding decay rates.

Now we discuss the transcribing information of mRNA into protein. For this purpose we construct two Hilbert spaces with a set of operators of creation/annihilation that have the same commutation properties. The total Hilbert space of cell at this stage is the tensor product of the two spaces Hcell=HmRNAHprotein, where in this case Hprotein and HmRNA denote the physical quantum states spaces of protein and mRNA respectively.

The commutation relations satisfied by the various operators in these spaces are:

[αkp1,kp2,μm,μcprotein,αkp1,kp2,μm,μcmRNA]=0,[αkp1,kp2,μm,μcprotein,αkp1,kp2,μm,μcprotein]=δkp1,kp1δkp2,kp2δμm,μmδμc,μc,[αkp1,kp2,μm,μcmRNA,αkp1,kp2,μm,μcmRNA]=δkp1,kp1δkp2,kp2δμm,μmδμc,μc (14)

where αkp1,kp2,μm,μcprotein,αkp1,kp2,μm,μcprotein are annihilation and creation operators that act on protein space of cells respectively. Also αkp1,kp2,μm,μcmRNA,αkp1,kp2,μm,μcmRNA are annihilation and creation operators that act on mRNA space of cells respectively. The Bogoliubov transformation between these operators can be written as following:

αkp1,kp2,μm,μccell=cosh(rkp1,kp2,μm,μc)αkp1,kp2,μm,μcmRNA+sinh(rkp1,kp2,μm,μc)αkp1,kp2,μm,μcprotein (15)

where αkp1,kp2,μm,μccell is annihilation operator that acts on the Hilbert space of cell. By using the Bogoliubov transformation, the condition

αkp1,kp2,μm,μccell|cellmRNAprotein=0 (16)

Give rise the so called thermal state conditions:

cosh(rkp1,kp2,μm,μc)αkp1,kp2,μm,μcmRNA+sinh(rkp1,kp2,μm,μc)αkp1,kp2,μm,μcprotein|cellmRNAprotein=0 (17)

or equivalently

αkp1,kp2,μm,μcmRNA+tanh(rkp1,kp2,μm,μc)αkp1,kp2,μm,μcprotein|cellmRNAprotein=0 (18)

With similar calculations for the first stage we calculate the stationary state of cell at second stage which is a maximally entangled two-mode state on mRNA and protein Hilbert spaces as:

|cellmRNAprotein=etanh(rkp1,kp2,μm,μc)αkp1,kp2,μm,μcprotein|0S=1coshrkp1,kp2,μm,μcmtanhmrkp1,kp2,μm,μc|m(kp1,kp2,μm,μc)mRNA|m(kp1,kp2,μm,μc)protein (19)

where |m(kp1,kp2,μm,μc)mRNAand|m(kp1,kp2,μm,μc)protein are orthonormal bases (normal mode solutions) for HmRNA and Hprotein respectively. We can obtain tanhrkp1,kp2,μm,μc by calculating the probability for transcribing information to protein and comparing it with equation (13). We can obtain this probability as following:

nprotein=cell|proteinmRNAαkp1,kp2,μm,μcproteinαkp1,kp2,μm,μcprotein|cellmRNAprotein=m|proteinm|mRNA1cosh2rkp1,kp2,μm,μcαkp1,kp2,μm,μcproteinαkp1,kp2,μm,μcproteinn=0tanh2n(rkp1,kp2,μm,μc)|mmRNA|mprotein=m1|proteinm|mRNA1cosh2(rkp1,kp2,μm,μc)n=0tanh2n(rkp1,kp2,μm,μc)(n)|mmRNA|m1protein=1cosh2rkp1,kp2,μm,μcn=0tanh2n(rkp1,kp2,μm,μc)(n)=sinh2(rkp1,kp2,μm,μc) (20)

Using equations (13) and (20) we can calculate tanh(rkp1,kp2,μm,μc) as following:

nprotein=sinh2(rkp1,kp2,μm,μc)=kp1(μc+β)+kp2αμm(μc+β)+μcαtanh(rkp1,kp2,μm,μc)=11kp1(μc+β)+kp2αμm(μc+β)+μcα (21)

4. IV.The information loss for DNA in cells

Now we can calculate the resulting information transformation from DNA to protein. At first stage, there is an entanglement between the DNA and mRNA spaces of cell. At second stage, we can describe the state of the cell as an entangled state of both mRNA and protein spaces of cell.

Using Horowitz and Maldacena mechanism for black holes, we describe the unknown effects of transcribing genetic sequences of DNA by an additional unitary transformation S [9]. Also by using Gottesman and Preskill idea we introduce a unitary transformation U [7] to describe interactions between DNA and mRNA molecules.

cell|DNAmRNA=cell|DNAmRNA(SI)U (22)

By extending Gottesman and Preskill method to gene expression, we calculate the information transformation from DNA to protein.

Tgene=DNAmRNAcell'|cellmRNAprotein=1cosh(rkp1,kp2,μm,μc)cosh(rE,F,y)tanhm(rkp1,kp2,μm,μc)tanhm(rE,F,y)m|mRNAm|DNASU|mmRNA|mprotein (23)
f=|Tgene|2=1cosh2(rkp1,kp2,μm,μc)cosh2(rE,F,y)tanhm+m'(rkp1,kp2,μm,μc)tanhm+m'(rE,F,y)m'|proteinm'|mRNAm|mRNAm|DNASSUU|mmRNA|mprotein|m'DNA|m'mRNA=1cosh2(rkp1,kp2,μm,μc)cosh2(rE,F,y)tanhm+m'(rkp1,kp2,μm,μc)tanhm+m'(rE,F,y)m'|mproteinproteinm'|mmRNAmRNAm|m'mRNAmRNAm|m'DNADNA=1cosh2(rkp1,kp2,μm,μc)cosh2(rE,F,y)tanh2m(rkp1,kp2,μm,μc)tanh2m(rE,F,y)=1cosh2(rkp1,kp2,μm,μc)cosh2(rE,F,y)11tanh2(rkp1,kp2,μm,μc)11tanh2(rE,F,y)=cosh2(rkp1,kp2,μm,μc)cosh2(rE,F,y)cosh2(rkp1,kp2,μm,μc)cosh2(rE,F,y)=1 (24)

If information transformation from the collapsing matter to the state of outgoing Hawking radiation be complete, the value of f should be one. Evidently for all finite values of the transcription factor concentration y, binding energy E and free energy F of the transcription factor, the value of f is unity and so information isn't lost in gene expression.

5. Summary and conclusion

In this manuscript, we extend the Gottesman and Preskil method for calculating information loss in black holes to gene expression. We calculate the amount of information transformation from DNA to protein in cells. To this end, we introduce three Hilbert spaces that belong to degrees of freedom of DNA, RNA, and protein. At first stage we show that the internal stationary state of the cell can be represented by a maximally entangled two-mode squeezed state of DNA and mRNA.At second stage, the state of the cell can be described by a maximally entangled two-mode squeezed state of mRNA and protein. The amount of information transformation can be obtain by projecting the state at first stage on the state at second stage. We observe that information isn't lost in gene expression.

Information loss has many effects on the activity of cells and gene expression within the nucleus. For example, if during replication, some errors occur in genetic sequences or choosing base pairs, some new DNAs are emerged which are different respect to original ones. This causes that number of binding sites for polymerase, RNAs and proteins increases or decreases and the velocity of transcription, translation and replication grows or decreases. Consequently, the velocity of producing new cells grows or decreases and some diseases like the cancers are emerged. For melanocytes, information loss causes to some destructions in the structure of produced DNAs during replication. These destructions may be similar to those seen in exposed cells to ultraviolet ray and cause to melanoma.

Author statement

This is a paper regarding new mechanism for exchanging information between genes. There is no conflict of interest.

Declaration of competing interest

There is no conflict of interest.

References

  • 1.Broach J.R. Cell. 2004;119:583. doi: 10.1016/j.cell.2004.11.029. [DOI] [PubMed] [Google Scholar]
  • 2.Roeder R.G. Nat. Med. 2003;9:1239. doi: 10.1038/nm938. Lewin. ,B Genes IX (2007), Jones and Bartlett Publishers,Boston, MA. [DOI] [PubMed] [Google Scholar]
  • 3.Lee T.I. Science. 2002;298:799. doi: 10.1126/science.1075090. Harbison C. T. et al., Nature , 431 (2004) 99. [DOI] [PubMed] [Google Scholar]
  • 4.Stormo G.D. DNA binding sites: representation and discovery. Bioinformatics. 2000;16:16–23. doi: 10.1093/bioinformatics/16.1.16. JHertz G. , G. Stormo, Bioinformatics , 15 (1999)563. [DOI] [PubMed] [Google Scholar]
  • 5.Vilar Jose M.G., Saiz Leonor. Phys. Rev. Lett. 2006;96:238103. doi: 10.1103/PhysRevLett.96.238103. [DOI] [PubMed] [Google Scholar]
  • 6.Berg, Johannes Revised Version to Appear in Europhys. Lett.) arXiv: 0712. 3791) . Tkacik Gasper , Aleksandra M. Walczak,arXiv:1101.4240.
  • 7.Gottesman D., Preskill J. J. High Energy Phys. 2004;3:26. [Google Scholar]
  • 8.Jia Tao, Kulkarni Rahul V. Phys. Rev. Lett. 2010;105 doi: 10.1103/PhysRevLett.105.018101. [DOI] [PubMed] [Google Scholar]
  • 9.Horowitz G.T., Maldacena J. J. High Energy Phys. 2003 008 (2004) arXiv:hep-th/0310281v2. [Google Scholar]

Articles from Biochemistry and Biophysics Reports are provided here courtesy of Elsevier

RESOURCES