Abstract
In recent years, physical elements of transcription have emerged as central in our understanding of gene expression. Recent work has been done introducing a simple description of the basic physical elements of transcription where RNA elongation, RNA polymerase (RNAP) rotation and DNA supercoiling are coupled (1). Here we generalize this framework to accommodate the behavior of many RNAPs operating on multiple genes on a shared piece of DNA. The resulting framework is combined with well-established stochastic processes of transcription resulting in a model which characterizes the impact of the mechanical properties of transcription on gene expression and DNA structure. Transcriptional bursting readily emerges as a common phenomenon with origins in the geometric nature of the genetic system and results in the bounding of gene expression statistics. Properties of a multiple gene system are examined with special attention paid to the role that genome composition (gene orientation, size and intergenic distance) plays in the ability of genes to transcribe. The role of transcription in shaping DNA structure is examined and the possibility of transcription driven domain formation is discussed.
INTRODUCTION
Though the study of transcription has played a central role in modern molecular biology, much of its physical foundation and behavior is just now being appreciated (2). Due to the helical nature of DNA the elongation of nascent RNA must obey important physical and topological constraints. Through the use of single-molecule techniques a number of important stochastic and physical aspects of transcription have emerged including the discovery of transcriptional bursting (3–5) and its role in gene expression (6,7), wide-spread RNA polymerase (RNAP) pausing (8), site-specific transcriptional dependence (9) and the interplay between chromosome structure and function (10). In this article, we will trace the origin of aspects of these phenomena to the physical act of transcription, offering insights into many open problems in biology.
The essential physical and topological constraints placed on transcription due to the helical nature of DNA is conceptualized in the twin-domain model (11) where it was first articulated that transcription and replication cause over-twisting and under-twisting of DNA. The over- or under-twisting of DNA is referred to as supercoiling (SC) and a number of experimental observations have revealed its central role in transcription (12). In particular, it can serve as the source of transcriptional bursting (13) and domain formation in bacteria (14).
A recent theoretical framework (1) has turned this decades old conceptual description into a physical model which characterizes the relative amount of RNA polymerase rotation and DNA super-coiling that occurs during RNA elongation. So far, this framework has been applied only to the case of a single RNAP. In this work we will extend this framework so that it can consider the case of multiple RNAPs operating on a common piece of DNA. Additionally, central stochastic elements such as RNAP initiation and mRNA degradation are added to the model. This will allow us to characterize the role of mechanics in gene expression for both isolated as well as interacting genes. The results presented in this work offer plausible, albeit preliminary, explanations for many currently unexplained phenomena in the biology of transcription.
MATERIALS AND METHODS
Basic construction of multiple-RNAP multiple-gene system
This article will focus on the characterization of the mechanical properties of transcription and their role in gene expression and DNA organization. The three key physical elements are DNA rotation, RNA polymerase (RNAP) rotation and RNA elongation. Due to the helical nature of DNA, linear RNA elongation is coupled to rotational motion of both RNAP and connected nascent RNA. We will refer to RNAP and nascent RNA collectively as the RNA complex (RNAC). With this in mind, we will need to keep track of both the linear as well as the rotational position of the RNACs as they move along DNA during transcription.
Let us imagine a series of N genes. At a given time there are Mn transcribing polymerases on each gene n. The basic coordinates for a given polymerase complex is the distance x of the RNAC along the DNA from a transcription start site and the relative rotation of the RNAC θ from that site. In the case of multiple RNACs on gene n, each one has a position and rotation coordinate . As there are in general many genes present, the start sites are labeled by the coordinate sn and hence the absolute position of the ith RNAC located on the nth gene is ; the two signs reflect the two possible gene orientations and thus directions of RNAC movement. A critical variable is the extra DNA twist ϕ(Z); we will refer to the value of this twist at a particular RNAC location, , as a linking number constraint (LNC). For an RNAC to move along DNA the topological condition.
(1) |
must be obeyed, where ω0 = 1.85 nm−1 encodes the natural linking number of DNA. Dots denote derivatives with respect to time. Again, the two signs refer to the direction of transcription. The relative difficulty in twisting the DNA (because of opposing torque) or difficulty rotating the RNAC (because of drag) determines the balance between changing θ or ϕ. For most cases, where the DNA is not completely free to rotate, the relative ratio during transcription can be determined by the balance between DNA torque τ and RNAC drag as
(2) |
where we have introduced a twisting mobility for the DNA χ and the torque (which is in principle a functional of the entire ϕ field) is evaluated at the RNAC position. As each RNAC moves along DNA its position changes, resulting in changes to the dynamic linking number constraint (LNC) at this point.
The mechanical nature of the rotating RNAC is largely unknown however it is clear that RNA elongation plays a key role (15,16). Even though this is a critical factor, the coefficient and functional dependence of transcript length on the rotational drag Γ are not known at this time. We will posit an RNAC viscous rotational drag which is linear in the rotation speed with a power-law dependence on the transcript length as where is the angular speed of the RNAC and η an unknown coefficient of friction. A length independent drag of the RNAP can be added, however nascent RNA plays the dominant role in generating SC (15,16) so we will not consider that here. Additional barriers to rotation of the RNAC, such as co-transcriptional translation, can be accounted for by altering this term. The effects of varying the parameters are examined in the SM and simple convenient choices have been made for the simulations conducted in this work.
Combining (Equations 1 and 2) results in coupled dynamic equations for all the different LNCs ϕi. First, we can use the RNAC viscous rotational drag term outlined above to derive a dynamic equation for ϕ
(3) |
Substituting in the topological constraint (Equation 1) we can write
(4) |
so that we have an equation for the DNA twist in time as
(5) |
The denominator for both terms is an attenuating factor which incorporates the drag of the RNAP rotation as well as resistance to rotation of the DNA.
Supercoiling and DNA mechanical dynamics occur on a sub-second time-scale (17,18) whereas typical speeds for transcription are (19). This means that for genes on the order of 1 kbp, transcriptional dynamics happen on the second and minute time-scales. Additionally, RNAP operation is robust against sub-second torque fluctuations (20). Subsequently, we expect the locally produced supercoiling at the LNC locations to spread throughout the allowed DNA segment on a time-scale faster than that on which transcription occurs. Thus, the torsional response of DNA at the point at which an RNAC is operating is determined by the state of DNA on either side. More generally, we might expect to solve a supercoiling transport equation with both twisting and writhing degrees of freedom.
With this assumption of complete relaxation, the supercoiling density (SCD) is constant in regions between LNCs. In particular in front or back of a given RNAC we have
(6) |
This labeling reflects the fact that the nearest polymerases are the ones with the closest indices. For notational consistency, we must have conditions for when the interaction is between RNACs on different genes. Thus, in the case when the neighboring genes to gene n as well as gene n itself are oriented in the positive direction, we have
(7) |
These of course assume that there is at least one RNAC at the neighboring gene; if not we just skip genes until the next RNAC is found. There are analogous expressions (listed for completeness in the SM) for all possible orientations. Finally at the boundaries of the entire system we define M0 = MN+ 1 = 1, , and . We use a combination of fixed and free boundary conditions which correspond to the introduction of an additional LNCs at the leftmost and rightmost edges of the DNA under consideration. The values of ϕ at these edges will depend on whether we assume fixed or free boundary conditions.
The above formula incorporates the torsional response from DNA SC both in front τF of and behind τB the RNAC
(8) |
Once these functions are specified, we have a closed form equation for the LNCs.
Due to the previously discussed time-scale separation the torsional (τ(σ)) response of DNA between two LNCs will be that of steady-state supercoiled DNA. In this framework super-coiled DNA can exist in a purely twisted, purely plectonemic or a mixed state. Following the phenomenological approach given by Marko (21) the torque in a given piece of DNA held at a constant force f is specified by the SCD as
(9) |
where the coefficients S, τo, P and SC transition values are given by DNA mechanical constants and are a function of applied force (given in (21)). This framework accounts for the structural changes naked DNA undergoes as it buckles forming plectonemes. In this work we use the torsional response of naked DNA, however the torsional response of DNA with histones displays similar mechanical behavior (22,23). Incorporating the torsional response of DNA with histones can be accomplished by using an altered τ. It is worth noting that the introduction of a well-defined applied force is at this time cloudy from an in vivo perspective, its experimental implementation in vitro is straightforward. Additionally, one can interchange the force f for a constraint on the average end-to-end distance of the DNA (24).
The simplest example of this system gives rise to an equation for the dynamics of the super-coiling for a simple isolated RNAC operating against a fixed barrier a distance L away and with open (zero torque) condition at Z = 0; this was discussed in our previous work (1). Having a fixed barrier ahead of the the RNAC means that ϕ = 0 at Z = L and having a free rotation behind the elongating polymerase means that and hence σB = 0. To simplify the notation we can use and σ = σF. In the limit that the gene length is short compared to the barrier distance, the substitution ϕ → σLω0 yields a dynamic SC equation
(10) |
In the limit of very large RNAC drag the DNA relaxation is highly attenuated and so that . In the opposite limit, the torque injection term dominates so that . These basic behaviors put bounds on the amount of SC which can be added to a piece of DNA by RNAP during transcription. A more complete examination of this system was recently published (1) and methods for experimentally determining the unknown mechanical parameters of the RNAC drag were outlined.
Creating a full model
To make a complete model of the transcriptional process, and hence to allow for the simulation of many RNACs on multiple genes, we need to add a few more ingredients. So far, we have not discussed the velocity of the RNACs which is obviously needed in the above system of equations. It is well-known that the velocity can depend on the accumulation of SCD in regions between operating dynamic RNACs as well as between RNACs and static boundaries. We will use a simple empirical form of this relationship that qualitatively matches the existent data and accounts for the relatively persistent motion of the polymerase up to some critical value of the opposing torque. Afterwards, we will add additional pieces to allow for stochastic transcription initiation, SC relaxation via topoisomerases and RNA decay.
The mechanical properties of RNA polymerase itself are well characterized and it displays constant velocity (19) behavior over a wide range of torque (−12 to +12 pN nm) (20). Following the previously used RNAP behavior given by (20) we can incorporate a supercoiling dependent velocity by using an expression with logistic-like dependence for each RNAP. The torque dependent velocity observations for SC in front of and behind an RNAP were studied separately; we will consider their simultaneous effects here as
(11) |
For both negative and positive SC the cutoff torque is near τc = 12 (pN nm) when applied separately (20). Though the cutoff for positive and negative SC torque are the same, the severity of the cutoff seems to be more intense for negative SC than for positive but we will ignore that here. Additionally, there are stochastic properties of the stalling that we will not consider here but could be incorporated into future work.
It is clear that the mechanics alone will not stop a trailing RNAP from “passing through” a forward RNAP which is stalled. To prevent passing we have explicitly implemented a hard-core RNAP repulsion so that RNACs cannot come within a distance δ of one another. This distance could arise because of a number of physical interactions, however, at this time there is no experimental data to constrain this interaction. We will take the cutoff distance to be the physical displacement caused by the space occupied by RNAP on DNA during transcription. A similar cutoff has been added for the initiation of new RNAP so that another cannot start until the most recent one has moved out of the transcription start site (TSS). Both are fixed at δ = 15 (nm) for all results presented. The effects of changing the nature and size of the distance δ are examined in the SM.
In addition to the mechanical elements outlined above we must include other stochastic elements of transcription to examine the statistical properties of gene expression. We will include stochastic RNAP initiation, decay and topoisomerase action. A new RNAP will start transcribing with a stochastic rate r and begin moving from the TSS with a velocity governed by RNAP mechanics given above. A transcript which has been made in its entirety (by an RNAC moving all the way along the length of the gene) will decay stochastically with rate λ. Both the initiation and degradation steps are modeled as simple Poissonian processes.
There are generally speaking two types of topoisomerases: topo1 which most effectively removes negative SC and topo2 which primarily removes positive SC (25). Topo1 breaks a single strand of DNA and allows the DNA to unwind using no ATP while topo2 uses a double strand passing method which uses ATP and is much slower than topo1 (25). The precise microscopic nature of the relaxation of built up SC, as caused by topoisomerases, is not well known at this time. Consequentially, we use a coarse-grained method for topoisomerase action. For simplicity we remove all the SC between two genes (or between a gene and a static barrier) randomly with rate g by replacing the DNA in this region with DNA with SCD that matches the two LNC on both sides of the two genes selected for relaxation. This means that if genes β and β + 1 are chosen for relaxation the LNC and position of the ’end’ RNAC (the RNAC closest or furthest from the TSS depending on the orientation) for genes β − 1 and β + 2 are used to set the SC level. Explicitly we set
(12) |
(13) |
for all i ∈ Mβ, Mβ + 1 RNAPs on genes β, β + 1. For a system with one or two genes this step results in replacing the DNA in the system with completely relaxed DNA with no SC. This is done for two reasons. The first reason is that due to (equation 5) the differences in SC between RNACs within a gene tends to be small (see Figure 2). The second reason is because the intergenic regions are much larger and homogeneous than the regions containing genes for the systems considered in this article. Thus the action of the topoisomerase is most likely to take place in-between genes. The creation of more sophisticated models of SC relaxation which incorporate more precise modeling of topoisomerase action is left for future work.
RESULTS
Bursting statistics for isolated genes
The widespread observations of the stalling of RNAPs and bursting transcription in in vitro and in vivo systems have been the subject of much research (13,26–28). Though much progress has been made, the source and control of these related phenomena is still a matter of debate. In this section we will examine how the mechanical properties of transcription, namely gene and barrier lengths, alter the ability of a gene to transcribe. To do this we will simulate the dynamical motion of multiple RNACs using the framework constructed above for a simple isolated gene with length GL with a static LNCs a distance L from the TSS as shown schematically in Figure 1. At this time we will not specify the precise nature of the torsional constraint and imagine it as a generic SC barrier. In this configuration positive SC will build up in the region of DNA between active transcription and the barrier.
As positive SC builds the motion of the RNACs is inhibited. Consequentially, RNACs form clusters of various sizes and spacing controlled through random initiation, SC dependant velocities and hard core repulsion. A typical configuration of the system is shown in Figure 2. The clustering of transcribing RNAPs has been observed in a number of experimental studies (29,30) but a satisfactory mechanism for their formation has not previously been developed. Additionally, this behavior limits the ability of the RNACs to reach the end of a gene and thus affects the production of mature mRNA. Due to this, the production of mature mRNA occurs in burst with periods of active and inactive transcription production emerging in simulations of the system (Figure 3). This effect is present only with mechanically regulated transcription (red lines) as opposed to transcription with only stochastic elements (blue lines). Thus, RNAP clustering as well as mRNA bursting robustly emerge from this framework with a common origin in gene composition and mechanics. The size of the bursts vary, because relaxation events can occur between arresting, but happen with a typical value which we will refer to as mc.
The properties of bursting are controlled by a number of intrinsic and extrinsic parameters that are outlined in the previous sections. However, not all parameters effect the properties of stalling in the same manner. The mechanical form of the drag, through the coefficient of friction η and nascent RNA length dependence α, influence the quantitative properties of the stalling and burst size mc. Though their values are unknown at this time, we should imagine that they are fixed for all genes. We shall fix them for all the results presented in the article. Consequentially, the role of gene specific parameters, namely gene length (GL) and barrier distance (L), as well as external parameters such as force, are of much greater interest as they vary throughout the living world. Unless otherwise specified the values η = 1/20, χ = 10, vo = 20, α = 1 have been used. The effects of changing these intrinsic mechanical parameters are examined in the SM.
We will first examine the role of gene length in bursting. In Figure 4A the number of transcripts which are made between stalling (the burst size distribution) is given for three genes with various lengths transcribing against a fixed barrier. Burst size distributions for genes of lengths 500, 1000 and 1500 bp are shown. The characteristic burst size, mc, emerges as clear mode in the statistical distribution. The data shows a clear relationship between gene length and burst size. As the gene size increases, mc decreases and there is also a sharpened burst size distribution. These results arise due to the fact that stalling becomes less frequent for short genes with large mc because the characteristic time to arrest becomes significantly less than the time between relaxation events. Thus, genes with large mc remove the competition between stalling and relaxation resulting in Poissonian (constitutive) gene expression. In agreement with this, one can recover a sharper distribution at higher mc by lowering the relaxation rate g; this is shown in Figure 4B. Whenever this competition becomes relevant, we obtain regular bursting which can then dictate gene expression statistics, as will be examined later.
In addition to changing the gene length the distance to the barrier can be altered. Doing this will also change the burst properties of the gene. The characteristic burst size mc, as given by the mode of the burst size distributions, is shown in Figure 5 for a wide range gene lengths transcribing toward a fixed barrier at various distances. As expected, increasing gene length decreases mc while increasing the barrier distance increases mc. As the value of mc becomes very large (due to short gene length or large barrier distance) bursting becomes infrequent. These properties demonstrate that geometric properties of genes (their length and spacing) play a fundamental role in their operation.
To get a qualitative handle on how mc changes as a function of gene and barrier length we can imagine that each transcript introduces Δσ supercoiling during its elongation and that the stalling torque τc occurs at a corresponding level of SC, σc. In this case the gene can make mc = σc/Δσ transcripts between relaxation events. Writing this more explicitly, the change in SC is given by Δσ ∼ ϕ/ω0L where ϕ is the amount of twist added during the making of a single transcript and L is the distance to the barrier. For a long gene, of length GL, most of the elongation goes into DNA twist over RNAC rotation so that ϕ ∼ ω0GL. This is due to the fact that longer genes have RNACs which are more difficult to rotate (equation 5). Then mc ∼ L/GL matching the qualitative behavior of Figure 5. In general, however, an exact analytical calculation of mc is not possible and will depend on the precise values of the mechanical properties of transcription outlined in the previous section (28).
The generic changes in gene expression patterns caused by arrangement of the genes, as well as differences in susceptibility in gene expression under varying external constraints, might offer an explanation for a number of known gene expression phenomena. These include mechanical transduction of gene expression and cell differentiation (31), diseases related to changes in mechanical properties of the cell (32) as well as direct changes in the constraints and structure of chromatin itself such as been recently observed in cancer (33).
As the external force is increased, genes in the system are affected differentially. This is shown in Figure 6 where short genes undergo small changes as the force is increased while long genes undergo large changes. Since one can interchange the force constraint f for a constraint on the average end-to-end distance of the DNA (24) the mechanical consequences of the external constraints as well as of their positions can be examined. The model and results presented in this article are at this time too crude to quantitatively examine any specific examples of this type of phenomena. However, the results clearly demonstrate the ability of external physical and mechanical changes to a cell and its DNA to alter its gene expression.
Gene expression statistics
As is shown in Figure 5 the burst size for fixed mechanical properties is determined through the gene and barrier lengths. The gene specific stochastic rates controlling RNAP initiation r and degradation do not influence mc (shown in SM). However, changing the nature of the mechanical properties of the drag associated with RNACs rotation changes mc as shown in SM Supplementary Figure S3. It is important to note though that while mc is a function only of the geometric properties of the gene (along with its intrinsic mechanical properties) the stochastic rates for RNAP initiation, mRNA degradation and relaxation have major effects on gene expression. In addition to setting time scales for the system these rates control characteristic times to frustration and relaxation, thus controlling how big a role mechanics and the burst size mc play in determining gene expression statistics. Thus the balance between frustration and relaxation are central to the quantitative understanding of transcription (Figures 3 and 7). This effect offers a possible explanation for the absence of transcriptional bursting and its role in gene expression for some systems (34,35).
In a previous work (34), we presented analytical calculations for the mean and variance of mRNA produced for a gene which had a hard cutoff to the number of transcripts mc which can be made without a relaxation event. It is therefore of interest to see how well that simple model can capture the consequences of bursting in this full mechanical treatment. Predictions for the mean mRNA levels and the associated Fano factor for this simple stochastic model are shown as solid curves for the corresponding parameter values in Figure 7. The mean expression level is set by the burst size mc as
(14) |
showing how the mechanical parameter mc controls the mean expression of a given gene. In general the expression for the Fano factor is complicated (34) however for the system examined in Figure 7 with λ = g the Fano factor and mean can be written as a function of
(15) |
(16) |
Changing the characteristic time to frustration mc/r or the relaxation time 1/g changes the effects of the mechanical properties of transcription; for mc/r < 1/g the noise of the system is driven away from the Poissonian limit (Fano factor of one). As shown in Figure 7 the mechanical properties of transcription constrain the mean expression level and the noise. Deviations between the analytical values and the simulated data can be attributed to a number of effects not considered in the simplified mathematical model such as RNAP occupancy exclusion. The mean expression rate in the fast initiation rate limit r → ∞ is set by the burst size mc as with a corresponding Fano factor of . This interplay offers an explanation for both universal as well as gene specific aspects of transcriptional noise. Incorporating additional important elements such as regulatory effects is not done here but was examined in our previous work (34) and can be added to future models. However, there is strong qualitative and semi-quantitative agreement with experimental results (26,34).
These results clearly demonstrate the effects of the mechanical properties of transcription on the statistical properties of gene expression for isolated genes interacting with a mechanical barrier. However, we have not specified the nature of this barrier thus far. In the following section we consider the effects of a multiple-gene system on a shared piece of DNA where neighboring genes serve as twist barriers for each other.
Interacting genes
In this section we will consider multiple genes acting in concert on a shared piece of DNA. In the same way that RNAPs on an isolated gene can interact, RNAPs on neighboring genes can also interact, mediated by DNA torsion. This will introduce a non-local correlation across genes. The ability for neighboring genes to influence each other has long been known (36) and a number of direct and indirect phenomena in gene expression have pointed to the location and orientation of genes as central determinants (37) of their function. Additionally, a previous theoretical study (38) has examined interaction between genes due to SC controlled initiation. Here we will be concerned with how SC changes the ability of RNAC to elongate. As before, the motion of each RNAC will be governed by the DNA twist equation.
We will first examine the behavior of two genes. In addition to the role of gene length and barrier distance (now intergenic spacing), the role of gene orientation (which direction the RNAC travels) plays a pivotal role in determining the behavior of the respective genes. Two genes which are convergently or divergently oriented relative to one another will cause positive or negative SC to accumulate in the intergenic region, each serving as a twist barrier to the other. The accumulation of SC causes RNAC stalling and transcriptional bursting for genes which are convergently or divergently oriented in the same manner as an isolated gene with a static barrier. Consequentially, gene expression statistics for such configurations are altered resulting in diminished expression levels as well generating correlations between genes as shown in Figure 8. This effect is not present for tandemly oriented genes, in agreement with recent observations in synthetic systems (37). The role of gene lengths and intergenic distances follows the results of isolated genes and is examined in the SM. Increasing the non-coding intergenic distances diminishes transcriptional bursting and allows for Poissonian gene expression.
The non-local interaction between neighboring genes through DNA torsion and the effects of gene arrangement and intergenic length on gene expression offer an alternative method to alter gene expression beyond the ability of promoters and enhancers to interact (36,39). Additionally, these results offer an interesting role for non-coding DNA as a space for SC to accumulate, acting as a mechanical buffer for highly transcribed genes. This previously unconsidered phenomena will be important for the construction of synthetic genes and may serve as an explanation for many existing paradoxes in the arrangement of non-coding DNA in the genomes of living systems.
The behavior of small systems with multiple genes such as the one shown in Figure 9A (composition details are in SM) follows from the behavior of two interacting genes. Many genes interacting on a shared region of DNA will affect each other in the same manner as we have previously outlined, where the leading or trailing RNAC serves as a twist barrier to the leading or trailing RNAC of the neighboring gene. Transcriptional bursting occurs for all the genes in a gene size and inter-gene length manner. Stochastic mRNA trajectories and statistics for the simple multiple gene system shown in Figure 9A are shown in the SM. Changes in gene expression by tuning intrinsic parameters follow in the same manner as for isolated or pairs of genes.
Super-coiling and DNA structure
Recent observations have shown that regions of actively transcribed DNA have significant altered structural and epigenetic properties as compared to inactive regions (40). These differences manifest themselves as chromatin domains of varying sizes in many organisms. The emergence of under and over-wound regions of inactive DNA flanked by torsionally neutral areas of active transcription as shown in Figure 9 is a plausible explanation for the formation of chromatin domains which have been observed in structural studies of DNA in many organisms (41) and which in fact have been directly connected to active transcription in recent studies (42,43). The relative lengths, positions and orientations of genes determine the regions of SC in DNA and will strongly influence the conformational structures realized by DNA in those regions. In a recent study in bacteria, it has been shown that the longest most strongly transcribed genes serve as DNA domain boundaries (43). Changing the location or lengths of these genes changed the boundary positions. Using the framework presented here a very similar behavior is observed where SCD domains move according to gene position, orientation and length (see Supplementary Figure S1).
This connection offers an intriguing link between gene expression and DNA structure which has only been partially explored. Previous studies comparing SC regions to domains have used methods which detect DNA under-twisting (44). These studies found partial agreement between SC regions and structural domains but this technique is not suited for observing fully plectonemic regions as should be expected in regions of significant SC. Additionally, increasing resolution in chromatin capture techniques have revealed previously missed small structural domains (45) and one must be careful to account for resolution limitations. Overlap between SC regions created through active transcription and domains is consistent with many known aspects of DNA structure and provides a compelling evolutionarily conserved mechanism for their formation in all organisms (45). This mechanism can exist alongside additional mechanisms for alternating chromatin structure in higher organism, such as loop extrusion (46,47). Additionally changing the location and size of SC domains may lead to changes in histone occupancy resulting in altered epigenetic patterns and chromatin structure and mechanics.
As stressed throughout this work, the level of SC between genes is set by the mechanical limits to RNAP. This offers an additional constraint to understanding the impact of transcription on DNA structure. Indeed some coarse polymer simulations have already shown the ability of SC to generate a number of observed properties of chromatin structure (48,49). However additional study is still needed to fully understand the role of transcription in determining chromatin structure.
DISCUSSION
The framework constructed here and the resulting phenomena offer a different perspective on the processes which determine and drive gene expression. Many results presented in this article make strong qualitative predictions which agree with experimental observations. Furthermore, the results of this article are able to explain the interaction of multiple genes on shared DNA as well as offer an explanation for how external non-molecular changes can influence gene expression. These results are connected to the role of DNA structure and mechanics in transcription. The non-local interaction and the resulting structural changes to DNA can connect disparate levels of cellular structure and function.
Examining the mechanical properties of transcription from this perspective can tie together many disparate phenomena in biology. The non-local interaction and the resulting structural changes to DNA which effect genes on the same piece of DNA generate a number of interesting phenomena. As shown in the paper this includes transcriptional bursting (Figure 3) and its origins as a geometric property of a gene and it’s surroundings (Figure 5). The geometric origin of bursting allows us to make predictions on the burst size mc. The relationship between burst size and gene expression statistics is readily observed and a plausible explanation for the observed generic relationship between gene expression levels and noise is given. The non-local interaction between genes leads to significant changes in gene expression, not due to the role of transcription factors, but do to the behavior of the surrounding genes. Additionally this phenomena offers an interesting role for non-coding regions of DNA as a mechanical buffer to active transcription of surrounding genes. Due to the strong ability of RNAP to introduce torsion into DNA, transcription introduces significant changes in the SC levels throughout DNA resulting in over and under winding in non-active areas and neutral SC in active regions of transcription. This phenomena offers a compelling mechanism for the formation of active and inactive domains observed in structural studies of DNA across all organisms.
Throughout the article a simple method for understanding the torsional response of DNA (equation 5) was utilized. As explained above, this formulation relies on a separation of time-scales between DNA mechanics and RNAC movement allowing for the use of an equilibrium formulation of DNA mechanics which assumes an instantaneous torsional response of DNA. While this method is fine for relatively short pieces of DNA, simulating long intergenic regions might require the use of more sophisticated models of DNA which include torsion transport and explicitly incorporate the interplay between twist and writhe. Introduction of DNA proteins such as histones could also be accomplished (changing the torsional response of DNA) though we do not believe the qualitative predictions made in the article will be changed. In fact it has been shown that the addition of histones can lead to buckling and plectoneme formation for smaller applied torques in chromatin than for naked DNA (22). Incorporating more sophisticated models of DNA into the framework used here can be done without any obvious barriers and is left for future work.
Additional processes beyond RNAP velocity are effected by SC. One obvious process is initiation, which relies on DNA unwinding to operate. Incorporating this phenomena into the model would amount to making the stochastic initiation rate SC dependent and some theoretical work has been completed examining it’s effect in gene expression (50). In the case of an isolated gene with only one forward barrier to DNA rotation (which was considered here) SC does not accumulate in the promoter region. Consequentially, the initiation rate in this case is not SC dependent allowing for an independent examination of the effect of the mechanics of elongation on transcription. However, for the case of multiple genes SC can accumulate in the promoter region which may lead to dynamic SC dependent initiation. However, for a highly transcribed gene with large bare initiation rate (as was considered here), initiation of new RNAP is largely controlled by the clearance of paused nascent RNACs following a relaxation event leading to a burst of initiation events. Thus, for highly transcribed genes with pausing, the dynamic SC initiation rate is overruled by the effects of exclusion by the paused nascent RNACs. Additionally, active reorganization of DNA due to transcription introduces a mechanism by which transcription increases access to enhancers and repressors. While not considered here these effects are interesting and potentially important for understanding transcription in living systems. Future work will can be done to introduce these processes and others into the model presented.
Finally, the theoretical discussion within this article has not centered on any particular organism. Though we have neglected a number of potentially important organism specific effects, we believe that the presented framework is capable of capturing the same phenomena in many organisms. As stated in the introduction, we have been concerned with the conceptual, broad characterization of the mechanical properties of transcription and their role in gene expression and DNA organization.
In conclusion it has become clear over the past several decades that chemical and physical processes play a central role in determining where and when a particular gene is active. In this work we have laid out some of the most basic features of the physical act of transcription for multiple RNAPs acting in a multi-gene system. Examining the physical side to transcription has uncovered a role for genome composition (gene orientation, size, and intergenic distance) beyond the organization of regulatory elements. Though many aspects of the model are simple, additional theoretical and experimental works can constrain and refine the model's precision and offer increasing levels of prediction. Future efforts to understand the precise mechanisms of cellular function will have to take the effects outlined here under consideration.
Supplementary Material
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
This work was supported by the National Science Foundation Center for Theoretical Biological Physics [NSF PHY-1427654]. Funding for open access charge: National Science Foundation Center for Theoretical Biological Physics [NSF PHY-1427654].
Conflict of interest statement. None declared.
REFERENCES
- 1. Sevier S., Levine H.. Mechanical properties of transcription. Phys. Rev. Lett. 2017; 118:doi:10.1103/PhysRevLett.118.268101 [DOI] [PubMed] [Google Scholar]
- 2. Lavelle C. Pack, unpack, bend, twist, pull, push: the physical side of gene expression. Curr. Opin. Genet. Dev. 2014; 25:74–84. [DOI] [PubMed] [Google Scholar]
- 3. Golding I., Paulsson J., Zawilski S.M., Cox E.C.. Real-time kinetics of gene activity in individual bacteria. Cell. 2005; 123:1025–1036. [DOI] [PubMed] [Google Scholar]
- 4. Suter D.M., Molina N., Gatfield D., Schneider K., Schibler U., Naef F.. Mammalian genes are transcribed with widely different bursting kinetics. Science. 2011; 332:472–474. [DOI] [PubMed] [Google Scholar]
- 5. Dar R.D., Razooky B.S., Singh a., Trimeloni T.V., McCollum J.M., Cox C.D., Simpson M.L., Weinberger L.S.. Transcriptional burst frequency and burst size are equally modulated across the human genome. Proc. Natl. Acad. Sci. U.S.A. 2012; 109:17454–17459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Raj A., Oudenaarden A.. Nature, nurture, or chance: Stochastic gene expression and its consequences. Cell. 2008; 135:216–226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Sanchez A., Golding I.. Genetic determinants and cellular constraints in noisy gene expression. Science. 2013; 342:1188–1193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Herbert K.M., La Porta A., Wong B.J., Mooney R.A., Neuman K.C., Landick R., Block S.M.. Sequence-resolved detection of pausing by single RNA polymerase molecules. Cell. 2006; 125:1083–1094. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Akhtar W., de Jong J., Pindyurin A.V., Pagie L., Meuleman W., de Ridder J., Berns A., Wessels L.F., van Lohuizen M., van Steensel B. et al. . Chromatin position effects assayed by thousands of reporters integrated in parallel. Cell. 2013; 154:914–927. [DOI] [PubMed] [Google Scholar]
- 10. Dekker J., Mirny L.. The 3D genome as moderator of chromosomal communication. Cell. 2016; 164:1110–1121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Liu L.F., Wang J.C.. Supercoiling of the DNA template during transcription. Proc. Natl. Acad. Sci. U.S.A. 1987; 84:7024–7027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Ma J., Wang M.D., Wang M.D.. DNA supercoiling during transcription. Biophys. Rev. 2016; 8:75–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Chong S., Chen C., Ge H., Xie S.X.. Mechanism of transcriptional bursting in bacteria. Cell. 2014; 158:314–326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Le T.B., Imakaev M.V., Mirny L.A., Laub M.T.. High-resolution mapping of the spatial organization of a bacterial chromosome. TL - 342. Science (New York, N.Y.). 2013; 342:731–734. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Wu H.Y., Shyy S., Wang J.C., Liu L.F.. Transcription generates positively and negatively supercoiled domains in the template. Cell. 1988; 53:433–440. [DOI] [PubMed] [Google Scholar]
- 16. Tsao Y.P., Wu H.Y., Liu L.F.. Transcription-driven supercoiling of DNA: direct biochemical evidence from in vitro studies. Cell. 1989; 56:111–118. [DOI] [PubMed] [Google Scholar]
- 17. Crut A., Koster D.A., Seidel R., Wiggins C.H., Dekker N.H.. Fast dynamics of supercoiled DNA revealed by single-molecule experiments. Proc. Natl. Acad. Sci. U.S.A. 2007; 104:11957–11962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. van Loenhout M.T.J., de Grunt M.V., Dekker C.. Dynamics of DNA Supercoils. Science. 2012; 338:94–97. [DOI] [PubMed] [Google Scholar]
- 19. Adelman K., La Porta A., Santangelo T.J., Lis J.T., Roberts J.W., Wang M.D.. Single molecule analysis of RNA polymerase elongation reveals uniform kinetic behavior. Proc. Natl. Acad. Sci. U.S.A. 2002; 99:13538–13543. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Ma Jie, Lu Bai, Wang M.D.. Transcription under torsion. Science (New York, N.Y.). 2013; 340:1580–1583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Marko J.F. Torque and dynamics of linking number relaxation in stretched supercoiled DNA. Phys. Rev. E - Stat. Nonlinear Soft Matter Phys. 2007; 76:1–13. [DOI] [PubMed] [Google Scholar]
- 22. Bancaud A., Silva N., Barbi M., Wagner G., Allemand J.-F., Mozziconacci J., Lavelle C., Croquette V., Victor J.-M., Prunell A. et al. . Structural plasticity of single chromatin fibers revealed by torsional manipulation. Nat. Struct. Mol. Biol. 2006; 13:444–450. [DOI] [PubMed] [Google Scholar]
- 23. Nam G.-M., Arya G.. Torsional behavior of chromatin is modulated by rotational phasing of nucleosomes. Nucleic Acids Res. 2014; 42:9691–9699. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Marko J.F. Biophysics of protein-DNA interactions and chromosome organization. Physica A: Stat. Mech. Applic. 2015; 418:126–153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Wang J.C. Cellular roles of DNA topoisomerases: a molecular perspective. Nat. Rev. Mol. Cell Biol. 2002; 3:430–440. [DOI] [PubMed] [Google Scholar]
- 26. Fujita K., Iwaki M., Yanagida T.. Transcriptional bursting is intrinsically caused by interplay between RNA polymerases on DNA. Nat. Commun. 2016; 7:13788. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Tantale K., Mueller F., Alja K.-P., Lesne A., Victor J.-M., Robert M., Capozi S., Chouaib R., Bäcker V., Julio M.-L. et al. . A single-molecule view of transcription reveals convoys of RNA polymerases and multi-scale bursting. Nat. Commun. 2016; 7:12248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. So L.H., Ghosh A., Zong C., Sepúlveda L.A., Segev R., Golding I.. General properties of transcriptional time series in Escherichia coli. Nat. Genet. 2011; 43:554–560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Cisse I.I., Izeddin I., Causse S.Z., Boudarene L., Senecal A., Muresan L., Claire D.-D., Hajj B., Dahan M., Darzacq X.. Real-time dynamics of RNA polymerase II clustering in live human cells. Science. 2013; 341:664–667. [DOI] [PubMed] [Google Scholar]
- 30. Tantale K., Mueller F., Kozulic-Pirher A., Lesne A., Victor J.-M., Robert M., Capozi S., Chouaib R., Bäcker V., Mateos-Langerak J. et al. . A single-molecule view of transcription reveals convoys of RNA polymerases and multi-scale bursting. Nat. Commun. 2016; 7:12248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Engler A., Sen S., Sweeney H., Discher D.. Matrix elasticity directs stem cell lineage specification. Cell. 2006; 126:677–689. [DOI] [PubMed] [Google Scholar]
- 32. Swift J., Ivanovska I.L., Buxboim A., Harada T., Dingal P.C., Pinter J., Pajerowski J.D., Spinler K.R., Shin J.W., Tewari M. et al. . Nuclear lamin-A scales with tissue stiffness and enhances matrix-directed differentiation. Science. 2013; 341:1240104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Flavahan W.A., Drier Y., Liau B.B., Gillespie S.M., Venteicher A.S., Anat O.S.-R., Suvà M.L., Bernstein B.E.. Insulator dysfunction and oncogene activation in IDH mutant gliomas. Nature. 2016; 529:110–114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Sevier S.A., Kessler D.A., Levine H.. Mechanical bounds to transcriptional noise. Proc. Natl. Acad. Sci. U.S.A. 2016; 113:13983–13988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Jones D.L., Brewster R.C., Phillips R.. Promoter architecture dictates cell-to-cell variability in gene expression. Science. 2014; 346:1533–1536. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Feuerborn A., Cook P.. Why the activity of a gene depends on its neighbors. Trends Genet. 2015; 31:483–490. [DOI] [PubMed] [Google Scholar]
- 37. Yeung E., Dy A., Martin K., Ng A., Vecchio D., Beck J., Collins J., Murray R.. Biophysical constraints arising from compositional context in synthetic gene networks. Cell Syst. 2017; 5:11–24. [DOI] [PubMed] [Google Scholar]
- 38. Brackley C.A., Johnson J., Bentivoglio A., Corless S., Gilbert N., Gonnella G., Marenduzzo D.. Stochastic model of supercoiling-dependent transcription. Phys. Rev. Lett. 2016; 117:018101. [DOI] [PubMed] [Google Scholar]
- 39. Fukaya T., Lim B., Levine M.. Enhancer control of transcriptional bursting. Cell. 2016; 166:358–368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Rao S., Huntley M.H., Durand N.C., Stamenova E.K., Bochkov I.D., Robinson J.T., Sanborn A.L., Machol I., Omer A.D., Lander E.S. et al. . A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014; 159:1665–1680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Rowley J.M., Corces V.G.. The three-dimensional genome: principles and roles of long-distance interactions. Curr. Opin. Cell Biol. 2016; 40:8–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Le T.B., Imakaev M.V., Mirny L.A., Laub M.T.. High-resolution mapping of the spatial organization of a bacterial chromosome. Science. 2013; 342:731–734. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Le T., Laub M.. Transcription rate and transcript length drive formation of chromosomal interaction domain boundaries. EMBO J. 2016; 35:1582–1595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Naughton C., Avlonitis N., Corless S., Prendergast J.G., Mati I.K., Eijk P.P., Cockroft S.L., Bradley M., Ylstra B., Gilbert N.. Transcription forms and remodels supercoiling domains unfolding large-scale chromatin structures. Nat. Struct. Mol. Biol. 2013; 20:387–395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Rowley M.J., Nichols M.H., Lyu X., Ando-Kuri M., Rivera I.S.M., Hermetz K., Wang P., Ruan Y., Corces V.G.. Evolutionarily Conserved Principles Predict 3D Chromatin Organization. Mol. Cell. 2017; 67:837–852. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Fudenberg G., Imakaev M., Lu C., Goloborodko A., Abdennur N., Mirny L.A.. Formation of chromosomal domains by loop extrusion. Cell Rep. 2016; 15:2038–2049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Sanborn A.L., Rao S.S., Huang S.-C.C., Durand N.C., Huntley M.H., Jewett A.I., Bochkov I.D., Chinnappan D., Cutkosky A., Li J. et al. . Chromatin extrusion explains key features of loop and domain formation in wild-type and engineered genomes. Proc. Natl. Acad. Sci. U.S.A. 2015; 112:E6456–E6465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Benedetti F., Racko D., Dorier J., Burnier Y., Stasiak A.. Transcription-induced supercoiling explains formation of self-interacting chromatin domains in s. pombe. Nucleic Acids Res. 2017; 45:9850–9859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Benedetti F., Dorier J., Burnier Y., Stasiak A.. Models that include supercoiling of topological domains reproduce several known features of interphase chromosomes. Nucleic Acids Res. 2014; 42:2848–2855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Brackley C.A., Johnson J., Bentivoglio A., Corless S., Gilbert N., Gonnella G., Marenduzzo D.. Stochastic model of supercoiling-dependent transcription. Phys. Rev. Lett. 2016; 117:1–5. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.