Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2020 Mar 25;16(3):e1007717. doi: 10.1371/journal.pcbi.1007717

An in-silico human cell model reveals the influence of spatial organization on RNA splicing

Zhaleh Ghaemi 1,2,*, Joseph R Peterson 1, Martin Gruebele 1,2,3, Zaida Luthey-Schulten 1,2,3
Editor: Attila Csikász-Nagy4
PMCID: PMC7094823  PMID: 32210422

Abstract

Spatial organization is a characteristic of all cells, achieved in eukaryotic cells by utilizing both membrane-bound and membrane-less organelles. One of the key processes in eukaryotes is RNA splicing, which readies mRNA for translation. This complex and highly dynamical chemical process involves assembly and disassembly of many molecules in multiple cellular compartments and their transport among compartments. Our goal is to model the effect of spatial organization of membrane-less organelles (specifically nuclear speckles) and of organelle heterogeneity on splicing particle biogenesis in mammalian cells. Based on multiple sources of complementary experimental data, we constructed a spatial model of a HeLa cell to capture intracellular crowding effects. We then developed chemical reaction networks to describe the formation of RNA splicing machinery complexes and splicing processes within nuclear speckles (specific type of non-membrane-bound organelles). We incorporated these networks into our spatially-resolved human cell model and performed stochastic simulations for up to 15 minutes of biological time, the longest thus far for a eukaryotic cell. We find that an increase (decrease) in the number of nuclear pore complexes increases (decreases) the number of assembled splicing particles; and that compartmentalization is critical for the yield of correctly-assembled particles. We also show that a slight increase of splicing particle localization into nuclear speckles leads to a disproportionate enhancement of mRNA splicing and a reduction in the noise of generated mRNA. Our model also predicts that the distance between genes and speckles has a considerable effect on the mRNA production rate, with genes located closer to speckles producing mRNA at higher levels, emphasizing the importance of genome organization around speckles. The HeLa cell model, including organelles and sub-compartments, provides a flexible foundation to study other cellular processes that are strongly modulated by spatiotemporal heterogeneity.

Author summary

The spliceosome is one of the most complex cellular machineries. It cuts and splices the RNA code in eukaryotic cells by dynamically assembling and disassembling. The components of spliceosome are formed in both the nucleus and the cytoplasm within the cell and primarily localized in nuclear membrane-less organelles. Therefore, a computational model of spliceosomal function must contain a spatial model of the entire cell. However, building such a model is a challenging task, mainly due to the lack of homogeneous experimental data and a suitable computational framework. Here, we overcome these challenges and present a spatially-resolved HeLa cell model, with nuclear, subnuclear, and extensive cytoplasmic structures. The three-dimensional model is supplemented by reaction-diffusion processes to shed light on the function of the spliceosome.

Introduction

Cells use spatial organization to mediate their complex biochemical reaction networks. Although membranes have long been recognized as means to confine organelle-specific biomolecules, non-membrane-bound organelles are increasingly found to play crucial roles in cellular function [1]. Such membraneless organelles can be formed as liquid-liquid phase-separated regions, and are therefore also known as “liquid droplets” [2]. Cells have numerous such liquid droplets that form either in the cytoplasm or in the nucleus [3, 4]. Some types of droplets are known to be involved only in a specific cellular process. As a prime example, nuclear speckles, or interchromatin granules, are droplets formed in the nucleus that are thought to be primarily involved in pre-mRNA splicing [5].

The RNA splicing process that led to eukaryotic cell evolution enables cell complexity without massively increasing the gene count or genome size. Instead, the structure of genes evolved such that the coding regions (exons) are interrupted by non-coding regions (introns) [6]. Coding regions must then be ligated to form functional transcripts. A single gene can encode for a variety of functional proteins by a process called alternative splicing [7]. The spliceosome is the cellular machinery that binds to the intron/exon sites, removes the introns and joins the exon ends. It is a multi-megadalton complex consisting of five protein-RNA small nuclear ribonucleoprotein complexes (Uridine-rich snRNPs): U1 snRNP, U2 snRNP, U4 snRNP, U5 snRNP and U6 snRNP [8]. The biogenesis of splicing particles occurs in multiple steps spanning both the nucleus and cytoplasm, transports between these two compartments and terminates in the nucleus [6]. The mature splicing particles then localize in nuclear speckles and assemble on the pre-messenger RNA (pre-mRNA) transcripts in a coordinated and step-wise fashion. Upon completion of the splicing reaction, they disassemble [9], thus resulting in a highly dynamic machinery.

Analogous to the influence of molecular-level heterogeneity, organelle heterogeneity can lead to different cellular phenotypic behaviors [10]. However, the effect of variations in the organelles involved in the spliceosomal particles assembly (e.g., the number of nuclear pore complexes and the size of the nucleus), is yet to be investigated. There is yet another motivation for studies in this realm. It is well-recognized that the particle-assembly in most species, although not yeast, occurs in multiple compartments (the nucleus and cytoplasm) and sub-compartments [6]. What is missing in this context is a quantitative rationale for the shuttling of the precursors of the splicing particles between different compartments. Thirdly, whereas the basic utility of nuclear speckles in pre-mRNA splicing is appreciated [11], the influence of spatial localization on splicing activity and mRNA production is not quantitatively understood. We anticipate that the localization of splicing components influences the efficiency of splicing.

To enable a comprehensive view of RNA splicing and splicing components formation, we developed a detailed spatial model of a human cell containing the most relevant organelles. As explained in the next sections, almost a hundred different biomolecules are involved in splicing processes. Some occur in relatively small numbers, necessitating a stochastic treatment of the processes leading up to splicing. In particular cell-to-cell heterogeneity in a population and its phenotypic behavior demand this approach.

Here, we construct an 18-μm spatially-resolved model of a HeLa cell from a library of experimental data such as cryo-electron tomography [12], mass spectrometry [13], fluorescence microscopy and live-cell imaging [11, 1416], and -omics data [17, 18]. We develop kinetic models to describe the reaction network of RNA splicing in accordance with the known biological events. The kinetic models were incorporated into our spatially-resolved eukaryotic cell model, complete with organelles, compartments and biomacromolecules. We then perform simulations using stochastic reaction-diffusion master equations (RDME) with our previously-developed cell simulation software, Lattice Microbes (LM) [19, 20] for up to 15 minutes of biological time. Interfacing our model with the LM [19, 20], uniquely enables us to study the kinetics of cellular activity on the relevant biological time scale up to hours. LM takes advantage of multiple-GPU processors, and therefore benchmarks much faster than similar softwares (e.g., Virtual Cell), as we have shown previously [21, 22]. Therefore, simulating 15 minutes of biological time on multiple-GPU processors, for nuclear processes and entire human cell, takes 20 minutes and 15 hours of walltime, respectively. The utilized GPU architecture which is also available to other research groups (through supercomputers and GPU cloud computing) and the details of the working principles of LM are explained in the Methods section.

The assumptions of constructing the spatial model of the human cell and the reaction schemes include: 1. Our study aims to investigate the RNA splicing process and the main organelles that are directly associated with this process including: the nucleus, nuclear speckles, Cajal bodies and parts of the cytoplasm (ER, mitochondria, Golgi apparatus). Therefore, our spatial model of the human cell mainly includes these organelles/ compartments. 2. Because the average protein half-life in human cells is ∼ 9 hours [23], a quasi-steady cell state is assumed over the 15 minutes of biological time that was simulated. Therefore, processes such as transcription and translation of the genes encoding the proteins involving in the RNA splicing process were not explicitly modeled. 3. We focused on building the groundwork for an adaptable platform for construction and simulation of spatial models of human cells. Hence, the cellular organelles/compartments that have not yet been included in the current version of the model can be readily added through the provided Python code. We assume that any required reaction or spatial components will be added by the researchers of our community.

Our simulations, featuring 15 minutes of biological time of the first spatially-resolved human cell model, explore how cellular organization affects the efficiency of spliceosomal particle formation and pre-mRNA splicing. We find that changes in the number of nuclear pore complexes affect the number of assembled splicing particles, an effect that remains consistent for different nuclear sizes; and quantitatively, the formation of correctly-assembled splicing particles in multiple compartments is more efficient. We also show that even a slight increase in the relative localization of splicing particles in nuclear speckles simultaneously enhances mRNA production and reduces noise in the generated mRNA. We are able to rationalize that the properties of nuclear speckles evolved while subject to physical constraints, such as their size and number. Finally, we predict that the organization of active genes around nuclear speckles affects mRNA production.

Results

Spatially-resolved model of a HeLa cell

We used a data-driven approach to construct a representative HeLa cell model that has not been available thus far. First, we gathered structural and -omics data from a variety of experimental studies [1118, 2427]. Then for organelles such as the endoplasmic reticulum (ER), we used protein composition percentages of HeLa cell organelles determined by mass spectrometry [13] to determine the fraction of the entire cell volume that is occupied by each organelle (see Methods for details). The mass percentages of organelles excluding water were approximated by volume percentages because of the fact that average proteins are composed of similar C/N/O/H atoms ratios with a density of 1.4g/cm3.

Fig 1A and 1B show the overall HeLa cell model and a detailed view of the nuclear region. Assuming experimental growth conditions resulting in spherically-shaped cells [26, 28], a volume of 3000 μm3 [24] leads to a cellular radius of 8.9 μm. The essential components of the cell for studying the RNA splicing processes are: the plasma membrane, the cytoplasm, the ER, mitochondria, the Golgi apparatus and the nucleus. The ER units were modeled by stochastic shapes using a cellular automata algorithm (see S1 Text and S1 Table for details and algorithm). ER units were distributed in the cytoplasm, spanning from the nuclear envelope to the plasma membrane, and were intertwined with other cytoplasmic organelles [29, 30]. The ER units made up ∼ 4.5% of the cell volume [13]. About 2000 rod-shaped mitochondria with dimensions of 0.9 μm × 0.5 μm were randomly placed throughout the cytoplasm, filling ∼ 11% of the total volume [13], additionally, a network-like mitochondria model was also constructed (see Methods section for details). A Golgi apparatus consisting of five stacked sheets, each with a thickness of 0.128 μm, was placed close to the nucleus [31].

Fig 1. A data-driven model for a 18-μm HeLa cell: A) the cytoplasmic components are: ER, mitochondria and Golgi (not included in the figure for clarity); and B) a nucleus containing nuclear pore complexes, Cajal bodies and nuclear speckles.

Fig 1

The cytosol, nuclear pore complexes, nuclear speckles, and Cajal bodies are directly involved in RNA splicing processes, whereas ER, mitochondria and Golgi apparatus provide excluded volume effects.

The nucleus, which plays a critical role in our model, had a radius in the range of 3.7-5.3 μm [2527]. It consists of nuclear pore complexes (NPCs) of 0.08 μm radii [12] at a density of 7 per μm2 [14], 20 spherically-shaped nuclear speckles with 0.35 μm radii [11] and 4 Cajal bodies (sites of maturation of splicing particles) with 0.5 μm radii [16]. Actively transcribing genes (black dots in Fig 1B) were were randomly distributed within 0.02 μm of the edge of each nuclear speckle [3234]. The nuclear components were chosen mainly among those that play a role in RNA splicing processes. Cellular components included in the in-silico model are listed in Table 1 along with their dimensions. The details of the construction of each organelle are provided in the Methods section.

Table 1. The cellular components of the constructed HeLa model.

Component Dimension (μm) Number Reference
HeLa cell R = 8.9 1 [24]
Nucleus R = 3.7, 4.2, 4.7, 5.3 1 [13, 2527]
Nuclear pore complexes R = 0.08 7/ μm2 [12, 14]
Mitochondria 0.9 × 0.5 2000 [15]
Nuclear speckles R = 0.35 20 [11]
Cajal bodies R = 0.5 4 [16]
ER 4.5% cell volume [13]

The kinetic model for spliceosome formation and action

We studied two processes: firstly, the formation of splicing particles U1snRNP and U2snRNP (U1 and U2 hereafter), which is a multi-compartmental process; secondly, spliceosomeal assembly, the splicing reaction itself and generation of mRNA transcripts. Together, these two series of events capture the whole process of splicing from machinery construction to functional transcript production. After the assembly of U1 and U2 in our model (the first process), the active genes are transcribed and pre-mRNA transcripts are spliced (the second process) according to the following reduced scheme for the spliceosome assembly:

  1. An active-28 Kb gene is transcribed, without explicitly accounting for transcriptional machinery, and pre-mRNA transcripts are produced

  2. U1 and U2 particles are formed and are present in the cell nucleus

  3. Tri.U (U4/U6.U5) particles present in the nucleus; because of the complexity in the formation of these complexes we assumed they are pre-formed in our model [35].

  4. The spliceosome assembles in a step-wise manner on pre-mRNA transcripts as described below

  5. The splicing reaction occurs (described below) and an mRNA transcript is produced

  6. The spliceosome disassembles after splicing, ready to assemble on another transcript

Below, we describe in detail the splicing particles’ formation and splicing assembly and reaction.

Formation of splicing particles

A splicing particle consists of a uridine-rich small nuclear RNA (U snRNA) that is bound to a heptamer ring of Smith proteins (Sm), and a variable number of particle-specific proteins. The formation of splicing particles happens in multiple steps and compartments including the nucleus and the cytoplasm. To understand the effects of geometry on the assembly process, we developed a kinetic model to describe these processes and studied them in our spatially-resolved human cell model.

Fig 2A shows the steps associated with the formation of splicing particles. The reactions together with their associated rates are summarized in Table 2. Although not required for the scenarios studied here, our methodology has the flexibility to incorporate rate constants that are concentration dependent [36, 37].

Fig 2.

Fig 2

A) The reaction scheme describing the formation of U1 and U2 splicing particles mapped on a cross-section of our in-silico HeLa cell [38, 39]. The four spherically-shaped regions in cyan color are Cajal bodies. B) Splicing reactions as implemented in our simulations. The reactions together with their corresponding rate constants are presented in Tables 2 and 3. Abbreviations are: Pol II (RNA polymerase II), Ex1 and 2 (Exon 1 and 2).

Table 2. Reactions describing U1 (u1snRNP) and U2 (U2snRNP) splicing particles formation together with their associated rates.

Abbreviations are: DNA(D), Gemin 5(G5), five already-assembled Sm proteins (Sm5), the remaining Sm proteins (Sm2), diffusion-limited (D.L.), model assumption (M), nucleus (N), NPC (P), Cajal bodies (J) and cytoplasm (C).

Reaction Rate Units Reference Compartment
In Nucleus
Dktrans1D+U1snRNAnuc 0.285 s−1 [62] N
Dktrans2D+U2snRNAnuc 0.224 s−1 [62] N
Nucleus to Cytoplasm
U1(2)snRNAnucknpcU1(2)snRNAcyt 2 × 104 s−1 M P
Cytoplasmic Assembly
U1(2)snRNAcyt+G5konGU1(2)snRNA·G5 1.02 × 108 M−1 s−1 D.L. C
U1(2)snRNA·G5koffGU1(2)snRNAcyt+G5 3.05 s−1 [63] C
U1snRNA·G5+Sm5kS1U1snRNA·Sm5+G5 5.9 × 107 M−1 s−1 D.L. C
U2snRNA·G5+Sm5kS2U2snRNA·Sm5+G5 1.18 × 107 M−1 s−1 D.L. C
U1(2)snRNA·Sm5+Sm2kon2U1(2)snRNA·Sm7 1.39 × 108 M−1 s−1 D.L. C
U1(2)snRNA·Sm7koff2U1(2)snRNA·Sm5+Sm2 2.78 s−1 [63] C
Cytoplasm to Nucleus
U1(2)snRNA·Sm7knpcU1(2)snRNA·Smnuc7 2 × 104 s−1 M P
Nuclear Maturation
U1snRNA·Smnuc7+U1protkonu1U1snRNP 1.22 × 107 M−1 s−1 [64] J-N
U1snRNPkoffu1U1snRNA·Smnuc7+U1prot 4.8 × 10−4 s−1 [64] J-N
U2snRNA·Smnuc7+U2protkonu2U2snRNP 0.24 × 107 M−1 s−1 [64] J-N
U2snRNPkoffu2U2snRNA·Smnuc7+U2prot 4.8 × 10−4 s−1 [64] J-N

Upon transcription, U1(2) snRNAs have to pass through nuclear pore complexes to reach the cytoplasm where, by a series of complex reactions, they bind to Sm proteins. Inspired by two studies [38, 39], we proposed the following mechanism for the cytoplasmic part of the process: the U1(2) snRNA transcript binds to Gemin 5 (G5) which is part of the survival of motor neurons (SMN)-Gemin complex that mediates the Sm proteins assembly on snRNAs. The complex formed by this process then binds to a ring of five previously-assembled Sm proteins (Sm5) through a process called RNP exchange suggested by Ref. [39]. This process facilitates the binding of Sm proteins to the snRNA transcript and the release of G5. In the last step, the remaining Sm proteins (Sm2) join the complex and the U1(2)snRNA.Sm7 complex is formed. After the completion of the binding of Sm proteins to snRNA, the complex again passes through the nuclear pores and makes its way into the nucleus. In the nucleus, the U1(2)snRNA.Smnuc7 complex localizes to the Cajal bodies [40] and binds to particle-specific proteins, U1(2)prot, and as a result the mature splicing particle is formed. The diffusion coefficients for the species involved in the splicing particle formation reactions were mainly adopted from various experimental sources listed in S2 Table, and the concentration of species are reported in Methods.

Assembly of the spliceosome and splicing reaction

The assembly process of the spliceosome machinery is entangled with a complex network of auxiliary and regulatory proteins that detect the splice site and vary the splice sites according to cellular cues by a process called alternative splicing [7]. To simplify this network, we assume that a particular splice site has been chosen according to the constitutive splicing process, and focus only on the assembly of the spliceosomal particles on that site and the subsequent splicing reaction.

Fig 2B depicts our model for the splicing reaction; and the details of the reactions and their associated rates are presented in Table 3. According to the conventional splicoeosome assembly model [41], the U1 particle binds to the 5′ end of the exon (“complex E”), followed by binding of the U2 particle to the associate 3′ end to form “complex A”. The tri.U (U4/U6.U5) then joins the complex forming “complex B”. Subsequently, U1 leaves the complex, yielding the catalytically active “complex B*”. The intron is then removed and the splicing particles are recycled for another round of assembly. The spliceosomal particle concentrations are reported in Methods and the diffusion coefficients are listed in S3 Table.

Table 3. Spliceosome assembly and splicing reaction.

Abbreviations are: DNA (D), U1snRNP (U1), U2snRNP (U2), U4/U6⋅ U5snRNP (tri⋅ U), model assumption (M), diffusion-limited (D.L.), nuclear speckles (S) and nucleus (N).

Reaction Rate Units Reference Compartment
DktransD+premRNA 4.7 × 10−3 s−1 [42] S-N
U1+premRNAkon1complexE 4.66 × 107 M−1 s−1 D.L. [16] S-N
complexEkoff1U1+premRNA 1.57 s−1 [65] S-N
complexE+U2kon2complexA 8.8 × 107 M−1 s−1 D.L. [16] S-N
complexAkoff2complexE+U2 0.062 s−1 [65] S-N
complexA+triUkon3complexB 4.66 × 107 M−1 s−1 D.L. [16] S-N
complexBkoff3complexA+triU 1.55 s−1 [65] S-N
complexBkactcomplexB+U1 6 × 104 M−1 s−1 M S-N
complexB*ksplmRNA+triU+U2 0.067 s−1 [50] S-N

Co-transcriptional splicing

Splicing is known to be overwhelmingly co-transcriptional; as transcription occurs, the spliceosome assembles on the transcribed pre-mRNA and splicing reactions begin. In our model, an average gene consisting of 8 introns with an intron length of 3.4 Kbase (plus 137 base for each exon) is considered [42]. After the transcription of the first exon-intron-exon piece, the splicing reaction proceeds as discussed above. Simultaneously, another intron-exon pair is transcribed, continuing the spliced transcript. The cycle repeats until the end of the gene is reached.

Organelle heterogeneity influences the formation of splicing particles

Cellular phenotypic variation arising from organelle heterogeneity is becoming a subject recognized as worthy of study [10]. We investigated how heterogeneity in NPC count and nuclear size affect the formation of splicing particles. All splicing particles, except U6, are complexes of uridine-rich small nuclear RNA bound to a heptameric ring of Sm proteins, along with specific proteins that bind to each splicing particles. Among the five splicing particles which are required for spliceosome function, we focus on the first two (U1 and U2) that start the spliceosome assembly process. As shown in Fig 2A and described above, these particles are formed in a multi-compartmental process [6]. Since splicing particles are assembled partly in the cytoplasm and partly in the nucleus, translocation through the NPCs is a crucial step. Live cell imaging has shown that the NPC count varies by as much as 10% [14], as does nuclear size [13, 2527]. We hypothesized that these variations could influence the formation of splicing particles. We tested this hypothesis by varying NPC count and nuclear size, and examining the effect on the number of particles formed after 30 seconds of biological time, which was long enough to provide us with sufficient statistics. Performing RDME simulations of the entire HeLa cell model is computationally expensive but worthwhile in this case; it has been suggested that explicit representation of obstacles can clarify the extent of anomalous diffusion [43, 44]. Additionally, previous simulations of bacterial cells with spatial heterogeneity using LM [45], showed that the crowding within the cell provides an excluded volume effect and leads to anomalous subdiffusive behavior. Similarly, in our simulations (see simulation details in Methods), the cytoplasmic organelles (the ER, mitochondria and Golgi apparatus) can represent an upper bound for in vivo crowding effects. This effect is confirmed by a 28% increase in the production of U1 for a cell model without cytoplasmic organelles with respect to the full model. Therefore, we performed the splicing particles formation simulations involving reactions within the cytoplasm, with the entire HeLa cell geometry shown in Fig 1.

Likewise nuclear pores have an important effect on transport. Fig 3 demonstrates that increasing (decreasing) the number of NPCs by 20%, at a constant nuclear size, results in an increase (decrease) in the number of mature U1 and U2 splicing particles. The computed effect is consistent across the tested nuclear radii with a range from 3.7 to 5.3 μm [13, 2527]. This suggests there are trafficking bottlenecks imposed by the nuclear pore count. We found that the number of splicing particles formed does not change significantly with nuclear size, as long as the number of NPCs scales with size. Longer diffusion times in a larger nucleus are compensated by shorter translocation times required when there are more NPCs. This trend is consistent with our observation that for a constant nuclear radius, decreasing the number of NPCs, results in a decrease in the number of formed splicing particles as shown in S1 Fig. We also studied the effects of mitochondrial morphology on the production of splicing particles, by generating a cell with network-like mitochondria as shown in S2 Text and S2 Fig (see Methods section for details). By performing simulations using this geometry, we observed that the morphology of the mitochondria does not significantly affect the production of splicing particles. The results are presented in S4 and S5 Tables.

Fig 3. Spliceosomal particle formation depends on NPC count.

Fig 3

Increase (pink) or decrease (blue) in the number NPCs by 20% results in a corresponding change in the number of U1 (A) and U2 (B) particles formed. The effect is consistent for different nuclear sizes. P-values for U1 results are: 1.8 × 10−2 (3.7 μm), 6.8 × 10−3 (4.7 μm), 4 × 10−3 (5.3 μm); and for U2 results are: 3.9 × 10−12 (3.7 μm), 6.4 × 10−6 (4.7 μm), 8.5 × 10−5 (5.3 μm). Error bars represent the standard deviations. For each condition, 20 simulation replicates for duration of 30 seconds were performed.

Human cells require about 105 splicing particles [46]. With our computed particle production rate of ≈ 13.7 s−1, it will take about 2 hours to generate the required abundance of splicing particles. This time scale is well within the lifetime of human cells, which further validates our mammalian cell model.

To obtain insight into the formation of splicing particles, we dissected the overall kinetics of the process in terms of discrete reactions occurring in each compartment, i.e., the nucleus and cytoplasm. These reactions include the transcription of snRNA and formation of (U1snRNAnuc), cytoplasmic production of U1snRNA · Sm7, and finally the assembly of mature U1. We determined the timescale for the formation of each of these three species within the first assembled U1 particle. The series of cytoplasmic reactions takes the longest to complete, irrespective of nuclear size, as shown in S3 Fig.

We examined the importance of multi-compartmentality by allowing all particle assembly steps to occur solely in the nucleus. As discussed above, in higher eukaryotes different components of the splicing particles join the assembly in different compartments [6]. This separation likely allows for higher quality control and prevents mixing of the partially-assembled particles with their substrates, thus preventing partially formed spliceosomes from deleteriously modifying pre-mRNAs. We postulated that assembly in the nucleus alone will result in snRNA binding to proteins in an incorrect order, or in incomplete assembly of the splicing particles. The series of uni-compartmental reactions can lead to the formation of misassembled splicing particle through reactions such as: U1snRNA · Sm5 + U1prot → RNA · Sm5 · U1prot reaction. The complex assembly of Sm-core on snRNA is followed by snRNA modification that triggers the nuclear import of the snRNA bound to Sm core [6]. Therefore, in the multi-compartmental assembly process, the misassembled splicing particle (RNA · Sm5 · U1prot complex) is not found in the nucleus.

As an outcome of simulating the nuclear assembly of the splicing particles, we found that although the system can make fully assembled splicing particles, it produces significantly more misassembled particles (771 ± 25) as compared to mature particles (248 ± 15), since the former are not required to go through the full assembly cycle (see Fig 2A). This simulation result demonstrates the critical need for the compartmentalization of the overall assembly of the splicing particles. Similar multi-compartmental processes have been observed for other cellular machines such as ribosomal subunits [6].

Nuclear speckles enhance effective splicing rate and control noise in mRNA

Nuclear speckles are self-organized liquid droplets that act as stores for splicing particles, and provide a mechanism to enhance other processes such as DNA repair and RNA modifications [5, 11]. Evidence suggests that a subset of splicing events occur within or at the periphery of these nuclear bodies [47] and speckles can have internal sub-organization [48]. Nuclear speckles, being liquid-liquid phase-separated regions, can promote certain biochemical reactions due to an enhanced concentration of the reactants [2]. To examine this phenomenon, we developed a reaction network to account for the effect of speckles on spliceosomal assembly and splicing (see Fig 2B). This network was included in stochastic RDME simulations containing speckles in the HeLa cell and we determined the resulting effect on mRNA production post-splicing and noise. The speckles were comprised of a concentrated store of splicing particles simulated as follows (see Methods section). We set the probability, Pn, for splicing particles to transit from the nucleus into the speckles higher than the probability, Ps, for the reverse transition. The higher the imbalance (Pn/Ps), the greater the degree of localization of splicing particles in the speckles (see Fig 4A). We compared mRNA production in cells with different degrees of splicing particle localization in the speckles with a control cell containing no speckles and splicing particles (U1) randomly distributed throughout the nucleus. Relative to the scenario containing no speckles (U1 fraction in speckles = 0 in Fig 4), a cell with about 10% of U1 located in speckles showed a large enhancement in the number of spliced mRNA transcripts from 0.25 to 60, which is effectively a ≈ 250-fold amplification (Fig 4B and 4C). Thus, even a slight increase in the localization of splicing particles in speckle, greatly enhances mRNA production. After the initial steep increase, the rate of mRNA production and enhancement, decrease and saturation is reached. This is because, as more particles are being localized in nuclear speckles, more pre-mRNA transcripts undergo splicing reactions, until almost all transcripts at a given time are bound to the splicing machinery (Fig 4B). This is verified by the reduction of average number of free transcripts from 53.3 to 0.2, for the cells with no speckles and with a localization of 85% U1 within speckles, respectively. Using green fluorescent protein labels, Rino et al. determined the ratio of splicing protein U2AF in speckles to that in the nucleus to be 1.27 ± 0.07 [49], which translates to 56% of U1 in speckles with respect to its total number. Strikingly, this experimentally determined ratio corresponds in our model to a ∼ 57% localization of splicing particles in speckles, which validates our model.

Fig 4.

Fig 4

Splicing efficiency increases in the presence of speckles in the cells: A) The higher the probability for the splicing particles to transition from the cell nucleus to the speckles, relative to the reverse transition, the higher the localization of splicing particles in speckles. Schematically, the randomly distributed splicing particles (yellow dots) in the cell nucleus (colored in purple), localize in nucleus speckles (blue shaded regions) as the probability imbalance increases (the splicing particles concentration is 1 nM). B) As the percentage of splicing particles located in speckles increases, the number of spliced mRNA also increases. C) This enhancement in mRNA production is highly sensitive to the localization of splicing particles in speckles; with only a 10% localization of splicing particles in speckles, the splicing reaction is enhanced ∼ 250-fold relative to the case with no speckles. D) Noise estimated as coefficient of variation (CV), decreases as a greater percentage of splicing particles are localized in speckles. Splicing particle concentration affects the functional advantage of speckles: E) Enhancement in mRNA production due to the presence of speckles, depends on the U1 splicing particle concentration; F) Effect of the U1 splicing particle concentration on the mRNA production noise. Note that for cells with 0.1 nM U1, below 25% splicing particle localization, the CV is not defined due to lack of mRNA production. For each condition, 20 simulation replicates were performed. For simulation details see Methods.

Nuclear speckles not only enhance splicing activity, but they also help limit the noise that splicing introduces into the whole gene expression process. To see this, we examined the effect of speckles on the noise associated with mRNA production, estimated in terms of the coefficient of variation (CV), which is the ratio of standard deviation of mRNA count to the average mRNA count. For CV calculations, we performed simulations for a fixed amount of time and the average and standard deviation were calculated from the value of mRNA at the end of the simulation time over 20 simulations. As the percentage of splicing particles in speckles increases (Fig 4D), the noise associated with spliced mRNA, decreases.

Speckle-enhanced splicing is concentration-dependent

The number of splicing particles required per pre-mRNA transcript is a function of many variables including the rate of transcription [42] and therefore this number may vary from one gene to another. We investigated how variation in the ratio of splicing particles to pre-mRNA transcripts affects overall mRNA transcript production and noise in a cell with nuclear speckles. Specifically, for 20 constitutively transcribing genes, we changed the number of particles available for pre-mRNA binding and splicing from 16 to 803 corresponding to a concentration range of 0.1-5 nM. The remainder of the total of 105 splicing particles [46] was bound to pre-mRNA transcripts actively in the splicing process. Fig 4E summarizes how the concentration of U1 splicing particles affects the ability of speckles to enhance splicing. For cells with 0.1 nM U1, the absence of speckles results in no generation of mRNA. In fact the no mRNA production was observed until the U1 localization reaches 25%. At 1 nM U1, mRNA production in a cell with speckles is ≈ 770-fold that for a cell with no speckles. On the contrary, at 5 nM, there is little enhancement. Thus, at lower concentrations of U1, speckles enhance splicing much more strongly. Consistently, as Fig 4F shows, the noise of mRNA production is also influenced. Because of the lack of mRNA production in control cells with no speckles, the CV for cells with 0.1 nM U1 is not defined. At 1 nM U1, the noise in the presence of speckles is ≈ 25-fold lower as compared to a cell with no speckles; whereas, at 5 nM, the noise is unaffected by whether speckles are present or not.

Speckle size and number have been fine-tuned by cells to optimize mRNA production

A physical constraint of phase separation is that the the concentration of splicing particles within speckles remains roughly constant. Thus we investigated how a cell controls the number and the size of nuclear speckles. We hypothesize that the experimentally observed anatomy of the speckles is optimized by the environment inside cells. To test this hypothesis, we assigned about 10% of the nuclear volume to speckles [18]. Keeping the total volume of speckles constant, we increased the number of speckles and reduced their sizes, as shown in Fig 5A. Increasing the number of speckles results in increasing their surface area (Fig 5A), which in turn enhances the pre-mRNA splicing, as shown in Fig 5B. This is because the higher surface area increases the probability of splicing particles to diffuse into the nuclear speckles resulting in increased localization. However, beyond ∼ 50 speckles the number of produced mRNA plateaus, due to the compensation of splicing particle localization by relatively smaller-sized nuclear speckles. Production of mRNA was maximized when there were between 20 and 50 speckles, which coincides directly with the experimentally determined values [11]. In addition, the size of the nuclear speckles corresponding to the maximum mRNA production falls between 1.4 to 1 μm, which is also compatible to the known nuclear speckles diameters of one to a few microns [11]. Therefore, our results suggest that it is plausible that the cells optimize the design parameters of speckles (the number and size) to maximize the mRNA production.

Fig 5. A) Increasing the number of nuclear speckles, results in an increase of the surface area (magenta curve) and decrease of the speckles diameter (blue curve); B) mRNA production increases as the number of speckles increases till about 50 speckles beyond which the production plateaus.

Fig 5

Error bars represent the standard deviations. For each condition, 20 simulation replicates were performed.

Gene distribution around speckles affect transcript splicing and mRNA production

It is known that genes are organized non-randomly around nuclear speckles [32, 33]. In a recent study, Chen et al. investigated the organization of whole genome using TSA-seq method [34]. They showed that the most highly expressed genes are located between ≈ 0.05 and 0.4 μm from the periphery of a speckle. It was also speculated that the genome movement of several hundreds of nanometers from nuclear periphery towards speckles could have functional significance. To test their hypothesis, we investigated the effects of active genes distribution around speckles’ peripheries. We varied the gene distance from 0.05 to 0.94 μm and observed the effect on the number of spliced mRNA transcripts found in cytoplasm, but without explicitly accounting for transcriptional machinery. As Fig 6 demonstrates, increasing the distance of the genes to speckle periphery from ≈ 0.05 to 0.1 μm sharply decreases the mRNA counts by a factor of 2, with no further significant decrease at larger distances. Thus, the speckle-gene proximity effect might be even more pronounced over a short distance range than Chen et al. were able to resolve.

Fig 6. Our model predicts that the mRNA production decreases by a factor of two, above 0.05 μm.

Fig 6

An schematics shows a nuclear speckle in red and the distance of active genes on chromatin (curved line in grey) from the speckle periphery (d). Error bars represent the standard deviations. For each condition, 20 simulation replicates were performed.

Since our model does not involve movement of speckles toward an active transcription site, the observed effect is mainly due to the diffusion of the transcripts in the nucleus before they become associated with the speckles. Therefore, our model makes the simple prediction that the gene distribution around speckles has an effect on mRNA splicing. It is plausible that this effect might be regulated by speckle movement towards transcriptionally active genes, consistent with the fluid nature of these nuclear bodies.

Discussion

Spatial organization is one of the key features of eukaryotic cells that brings order to complex biochemical reactions. We studied two aspects of this organization both in connection with RNA splicing (the process of removal of non-coding regions from pre-mRNA transcripts). We investigate, firstly, the effects of variation of cellular organelles on assembly of the building blocks of splicing; and secondly, mRNA production at nuclear speckles which are membrane-less organelles that have higher concentrations of reactants involved in splicing. We constructed a spatially-resolved mammalian cell (HeLa cell) model from a variety of experimental data. Using this model, we simulate the splicing event with a kinetic model we developed, as stochastic reaction-diffusion processes.

We show that number of nuclear pore complexes—assemblies of proteins embedded in the nuclear envelope that control the traffic between nucleus and cytoplasm—affects the splicing particles formation rate for different nuclear sizes, and the particles formation is more efficient when the precursors are produced in different compartments before maturation. The localization of splicing particles in nuclear speckles enhances the mRNA production and reduces its associated noise. We suggest a rationale for the number and size of speckles based on optimal resulting mRNA concentrations. By demonstrating the effects of the genes distribution around our speckles on mRNA counts, we propose that the movement of nuclear speckles toward active chromatin regions could be regulated by cells to control the transcripts production.

Limitations of the current spatially-resolved human cell model

While novel, our model approximates the underlying biophysics, and therefore has some limitations:

  1. There is a lack of experimental data for describing the rates of individual reactions within the splicing process. To overcome this challenge, we defined approximate lumped reactions (e.g., NPC transport and Sm proteins binding to snRNA [39]) and assigned rates based on either available experimental data [50] or simple models, such as diffusion-limited reactions. A potential alternative to solving this problem is detailed in the next paragraph.

  2. Our current model includes nucleus, nuclear speckles and Cajal bodies within it, and cytosolic organelles such as ER, Golgi apparatus, and mitochondria, that are near the nuclear periphery and contribute substantially to the excluded volume effects. However, it does not yet include compartments or structural features such as chromatin, the nucleolus, microtubules, actin filaments, endosomes, peroxisomes, and lysosomes. For the purposes of modeling RNA splicing process, none of these compartments (aside from chromatin) are thought to play a significant role. For a complete spatial model of the human cell, especially when studying processes that are involved with these organelles/compartments, including them will be required.

  3. For simplicity we considered a constitutively spliced gene in our studies. To account for complexity within human cells, alternative splicing should be also included which itself requires the inclusion of splice-sites selection. Splice-sites selection is affected by inter- and intracellular signaling pathways. [51].

Challenges of simulating the complete cell cycle of the entire human cell and strategies for overcoming them

There are many challenges to create a dynamical model for the entire human cell. Here, we provide three challenges that in our opinion are the most immediate ones:

  1. For simulating the complete life cycle of a eukaryotic cell two additional components will be required: a) inclusion of metabolism, and a more complete description of the genetic information processes (transcription, translation, and DNA replication). Although our code presents a platform for the inclusion of these processes, however, as demonstrated by Karr et al. [52], even for bacterial systems, accounting for these processes will add considerable complexity to the model –especially as they are distributed among different compartments in eukaryotic cells; b) addition of the remaining organelles such as microtubules, actin filaments and the nucleolus. We envision they can be constructed following a similar approach as outlined in detail in Methods. For representation of sub-compartments such as chromatin, rich sets of experimental data for instance from the 4D nucleome project [53] are becoming available. Such developments will enable modelers to improve the existing model and to capture chromatin dynamics [54, 55]. We hope that extension of the model will be a community process and provide other researchers with the tools to make it so.

  2. Modeling a comprehensive set of cellular processes requires an extensive amount of computational time, even when performing the simulations on multiple GPUs. Such studies can be accelerated by employing hybrid methodologies. As an example, parts of the cellular processes involving a high concentration of species (e.g., metabolites) can be simulated as ordinary differential equations (ODEs) that can be integrated with stochastic simulations such as chemical master equations (CMEs) for treating the processes like transcription. Recently, a hybrid CME-ODE method was implemented in the LM and can provide a 10-50-fold speedup in comparison with pure CME simulations [56].

  3. For describing an extensive set of biological reactions, especially in eukaryotic cells, the rates of individual reaction steps are often not yet available from experimental data. To overcome this barrier, two approaches can be taken: a) to estimate/infer the rates from computational simulations of each specific reaction using methods such as enhanced sampling techniques [57, 58] and quantum mechanical calculations [59]; however, these simulations are computationally expensive and time-consuming; and b) to introduce lumped reactions, based on available experimental data describing the biological processes and time-scale analysis [50] and c) Bayesian approaches using a multitude of experimental data elements to fit unknown rates.

Model extensibility

To build upon our model, we provided a documentation within our Python package which describes how to add/ remove reactions; modify the number, size and morphology of the organelles currently present in the model; and add new organelles. The code can be edited within the widely-used Jupyter notebooks. As discussed in the Introduction, to perform hour-long simulations of the entire human cell, using GPU acceleration is an inevitable architecture choice. Because our LM software excels on GPU computing, and such technologies are currently available to the majority of researchers, our platform is particularly primed for extensions to various cell types and longer simulations. The Python code for setting up a HeLa cell model is available at: https://eukaryoticcellbuilder.github.io/HeLa_Builder/.

Overall, our reaction-diffusion model of spliceosome assembly and function in a realistic mammalian cell environment that included some of the most relevant constituents, allowed us to meaningfully connect the cellular geometry to the underlying biological processes. At the same time, we expect the presented cell model to provide a versatile platform for studying processes with spatio-temporal elements beyond mRNA splicing. Having said that, our platform presents one of the many steps required to be able to seamlessly integrate various models and fully simulate the complexity of a eukaryotic cell across different length and time scales. Such an integration can eventually lead to the prediction of inter- and intra-cellular emergent phenomena.

Methods

Construction of a representative spatially-resolved HeLa cell model

Spliceosomal assembly and activity consists of multi-compartmental, reaction-diffusion processes, necessitating a spatial representation of the cellular geometry. HeLa cells are an optimal model system as they have been the subject of extensive investigations exploring cell geometry and cellular composition. Additionally, data from individual measurements of specific components (e.g., size, morphology, relative mass fraction) were used to inform the construction of a representative cell model [1118, 2427]. A constructive solid geometry (CSG) approach, wherein basic geometric objects are combined programmatically via set operations (e.g. unions, differences, intersections), was used to build the HeLa cell. Since LM (v 2.3) [19, 20] requires that each location within the space be defined as a single site-type the various CSG objects were stenciled onto the simulation lattice in “depth order” (also called the “Painter’s algorithm”). Overall, our model consists of 11 different site-types including: 1) extracellular space, 2) plasma membrane, 3) cytoplasm, 4) nuclear membrane, 5) nucleoplasm, 6) Cajal bodies, 7) nuclear speckles, 8) nuclear pore complexes, 9) mitochondria, 10) Golgi apparatus and 11) endoplasmic reticulum (ER). The overall simulation volume was constructed as a cubic box with 18.432 μm side-length. The space was discretized into a cubic lattice of points spaced 64 nm apart. HeLa cell volumes have been measured at 2600, 3000 and 4400–5000 μm3 [25, 60, 61]; we chose the mid- size cell as a template for our model. HeLa cells that are grown in suspension appear spherically-shaped, so we chose to design the overall cell architecture as a sphere with radius 8.9 μm. Nuclei have measured volumes of, 220 and 374 μm3 [26, 27]. Refs. [25] and [13] suggested nuclear volumes corresponding to 10% and 21.1% of the total cell volume. As we wanted to test the importance of nuclei size on splicing, multiple nuclear radii were investigated, including 3.7, 4.2, 4.7 and 5.3 μm corresponding to all the above-motioned volumes and volume-fractions. Plasma and nuclear membranes were implemented as a thin sheet of lattice points (128nm thick) separating the extracellular space, the cytoplasm and the nucleus. The Golgi apparatus was constructed as an intersection of a cone with several spherical shells of various radii placed successively from the edge of the nucleus into the cytoplasm. The apex of the cone was centered in the cell with the based positioned deep in the cytoplasm. In this way, the Golgi roughly approximates what is seen in experiments. Nuclear speckles and Cajal bodies were modeled as spheres placed randomly within the nucleus. Mitochondria were modeled as randomly oriented spherocylinders placed within the cytoplasm. A network-like mitochondria was also constructed by randomly placing 2.95 μm-long rods that can cross each other, while keeping the total volume of the mitochondria constant, as shown in S2 Fig. Nuclear pore complexes (NPCs) were embedded in the nuclear envelope. NPCs were constructed as a set of spheres of radii 0.08 μm to ensure connectivity from the nucleus to the cytoplasm. Sizes for these organelles can be found in Table 1. The ER was also constructed in a randomized fashion with the details and construction algorithm presented in Supplementary Information. Total counts of the organelles were based on either direct experimental quantification or based on relative volume fraction measured for the overall cell. The endosomes, lysosomes, actin-cytoskeleton, peroxisomes in the cytoplasm; nucleolus and chromatin have not been included into the present version of the model. According to Ref. [13], each of the cytoplasmic organelles contribute less than 1% of to the total cell composition, and therefore, were not modelled. The nuclear components were chosen mainly among those that play a role in RNA splicing processes. A representative HeLa cell geometry resulting from this procedure is shown in Fig 1. The associated code for setting up a HeLa cell model for LM [19, 20] is available at: https://eukaryoticcellbuilder.github.io/HeLa_Builder/.

Kinetic models

Implementation of nuclear speckles

Nuclear speckles were modelled as spherical regions in the nucleus with radii of 0.35 μm. Previously, the splicing particles localization in speckles has been implemented by assuming a higher affinity for splicing particles to bind to unknown binding partners in speckles with respect to binding to pre-mRNA transcripts in the nucleoplasm [49]. We imposed an imbalance on the transition probabilities for the splicing particles and pre-mRNA transcripts to move from the nuclear speckles to the nucleoplasm, and vice-versa. This approach, as shown in the Results section, will reproduce experimentally observed concentration ratio of splicing particles between the speckles and the nucleoplasm [54], additionally, the presence of dummy particles in the speckles are not required. Specifically, the probability of the splicing particles to move from the nucleoplasm regions to the speckles (Pn) was higher than the reverse direction (Ps). To examine the effect of the bias, the Pn/Ps values were varied. With increasing Pn values, more particles accumulate in the speckles and the nucleus becomes more diluted. Fig 7 shows the localization of splicing particles in speckles upon application of this bias in our model.

Fig 7. Formation of speckles in our simulations: A) splicing particles (U1 and U2 colored in yellow and orange, respectively) diffusing freely in the nucleus without speckle, B) Introduction of an imbalance on transition probabilities of splicing particles from the nucleus to speckles results in the localization of the splicing particles in the speckles shown as red-shaded regions.

Fig 7

RDME simulations methodology in LM

We use formalism known as the reaction-diffusion master equation (RDME), which describes the state of a stochastic system to be in a specified spatial and compositional “state”, as the underpinning for the simulations in this paper. For systems of our interests, this state comprises of the number of each interacting species along with their locations. In RDME simulations using LM, the geometry of the system is first discritized to a cubic grid with a lattice spacing λ. Additionally, a fundamental time step τ is specified. The time evolution of the probability of the system to be in specific state x, containing number of molecules for each of the Nsp species present at each subvolume νV, is determined by the rates of change due to reaction and diffusion, defined by operators R and D, respectively:

dP(x,t)dt=RP(x,t)+DP(x,t),dP(x,t)dt=νVr=1Nrxn[-ar(xν)P(xν,t)+ar(xν-Sr)P(xν-Sr,t)]+νVζ±i^,j^,k^α=1Nsp[-dναxναP(x,t)+dν+ζα(xν+ζα+1)P(x+1ν+ζα-1να,t)]. (1)

The first two terms of this equation define the probability flux within each subvolume due to reactions: ar(xν) is the reaction propensity for reaction r of Nrxn reactions and Sr is the N × R stoichiometric matrix describing the net change in molecules number when a reaction occurs. The last two terms describe the rate of the change of the probability due to molecules’ propensity to diffuse between neighboring subvolumes. dνα is the diffusive propensity for a molecule of species α to jump to a neighboring subvolume, which is related to the macroscopic diffusion coefficient by dνα=Dναλ2. The first part of the diffusion operator defines the probability flux out of the current state due to the diffusion of molecules from subvolume ν; and the second part of the diffusion operator, describes the diffusion of molecules into the current state from subvolumes, ν + ζ, where ζ defines the neighboring subvolume in ±x, ±y, ±z directions.

Using an approximation for sampling Eq 1, as implemented in LM, simulations of biologically relevant timescales (∼15 min) for the HeLa cell model on multiple GPU hardware became possible [19, 20].

RDME simulation details

Formation of splicing particles

In addition to structural features discussed above, the abundance of the proteins participating in the formation of splicing particles were derived from proteomics data of HeLa cell [17]. The concentration of cytoplasmic proteins of G5, Sm5 and Sm2 was 0.61 μM; the nuclear proteins of U1prot and U2prot were 0.89 μM and 0.56 μM, respectively. Moreover, the number of active snRNA genes have been determined to be 30 in haploid cells [66]. These abundances were used as the initial condition for the simulations. Proteins and RNA molecules were randomly distributed throughout their designated regions with locations sampled from a uniform distribution; for instance, cytoplasmic proteins (G5, Sm5 and Sm2) were distributed throughout the cytoplasm, at sites which were not occupied by mitochondria or ER. For each separate simulation replicate, the placement of proteins and active genes, together with cellular organelles such as: mitochondria, ER, nuclear speckles, NPCs and Cajal bodies, were randomized. The RDME simulations were performed for 30 s of biological time eight Tesla GPUs with a walltime of 30 minutes. The time step was 7.3 × 10−5s and the lattice spacing was 64 nm.

pre-mRNA splicing within nuclear speckles and nucleoplasm

For studying the RNA splicing, as discussed in the main text, we started with already assembled splicing particles (U1 to U6) and distributed them randomly in the nucleoplasm. 20 constituitively transcribing active genes were localized around 20 randomly placed nuclear speckles. These simulations were performed for 15 minutes of biological time on eight Tesla K80 GPUs with a walltime of 20 minutes. The simulation details were: time step of 3.3 × 10−3s, lattice spacing of 64 nm, nucleus size of 4.2 μm with 1515 NPCs.

Supporting information

S1 Text. Endoplasmic reticulum construction.

(TEX)

S2 Text. Effects of mitochondria network-like morphology on the splicing particles formation.

(TEX)

S1 Fig. Changes in the number of NPCs, correlates withe the number of formed U1 splicing particles.

(PNG)

S2 Fig. The HeLa cell with a network-like mitochondria.

See Methods section for construction details of the mitochondria.

(PNG)

S3 Fig. The time required to form the first splicing particle dissected by the set of discrete reactions occurring in the nucleus and cytoplasm.

Error bars represent the standard deviations. For each condition, 20 simulation replicates were performed.

(PNG)

S1 Table. Parameters used in the cellular automata program to create realistic ER.

(TEX)

S2 Table. Diffusion coefficients for species involved in splicing particles formation.

Abbreviations are: nucleus (N), NPC (P), Cajal bodies (J), nuclear speckles (S) and cytoplasm (C).

(TEX)

S3 Table. Diffusion coefficients of spliceosomal particles.

Abbreviations are: nucleus (N) and nuclear speckles (S).

(TEX)

S4 Table. The results of the simulations sets with a network-like morphology of the mitochondria as compared to the fragmented-mitochondria that were reported in the main text.

(TEX)

S5 Table. The p-values of data from various simulation sets.

(TEX)

Acknowledgments

Z.G. thanks Prof. Andrew Belmont and Dr. Joseph Dopie (MCB, U of Illinois) for many stimulating conversations, and Prof. Andrew Belmont, Prof. Prashant Jain (Chemistry, U of Illinois), Prof. K. Prasanth (MCB, U of Illinois), Dr. Marian Breuer (Chemistry, U of Illinois) and Dr. Pankaj Chaturvedi (MCB, U of Illinois) for critical reading of the manuscript and helpful comments. Z.G. especially thanks Mike Hallock (School of Chemical Sciences, U of Illinois) for support with LM. Supercomputer time was provided by XStream-XSEDE [grant TG-MCA03S027].

Data Availability

All relevant data are within the manuscript and its Supporting Information files. Additionally, the code we developed can be downloaded from Github through: https://eukaryoticcellbuilder.github.io/HeLa_Builder/.

Funding Statement

This work was supported by the NSF grants MCB-1244570 on the Evolution of Translation:From Molecules to Cells to Z.G. and Z.L-S. and NSF Center for the Physics of Living Cells grant PHY-1430124 to M.G. and Z.L-S.,the NSF Graduate Fellowship [grant DGE-1144245] to J.R.P., and the NIH P41-GM104601 to Z.L-S. Z.L-S held the Murchison-Mallory Chair and M.G. held the James R. Eiszner Chair while this work was carried out. Supercomputer time was provided by XStream-XSEDE [grant TG-MCA03S027]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Courchaine EM, Lu A, Neugebauer KM. Droplet organelles? The EMBO Journal. 2016;35(15):1603–1612. 10.15252/embj.201593517 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Hyman AA, Weber CA, Jülicher F. Liquid-Liquid Phase Separation in Biology. Annual Review of Cell and Developmental Biology. 2014;30(1):39–58. 10.1146/annurev-cellbio-100913-013325 [DOI] [PubMed] [Google Scholar]
  • 3. Zhu L, Brangwynne CP. Nuclear bodies: the emerging biophysics of nucleoplasmic phases. Current Opinion in Cell Biology. 2015;34:23–30. 10.1016/j.ceb.2015.04.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Uversky VN. Intrinsically disordered proteins in overcrowded milieu: Membrane-less organelles, phase separation, and intrinsic disorder. Current Opinion in Structural Biology. 2017;44:18–30. 10.1016/j.sbi.2016.10.015 [DOI] [PubMed] [Google Scholar]
  • 5. Galganski L, Urbanek MO, Krzyzosiak WJ. Nuclear speckles: Molecular organization, biological function and role in disease. Nucleic Acids Research. 2017;45(18):10350–10368. 10.1093/nar/gkx759 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Matera AG, Wang Z. A day in the life of the spliceosome. Nature reviews Molecular cell biology. 2014;15(2):108–21. 10.1038/nrm3742 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Kornblihtt AR, Ignacio E S, Alló M, Dujardin G, Petrillo E, Muñoz MJ. Alternative splicing: a pivotal step between eukaryotic transcription and translation. Nature Reviews Molecular Cell Biology. 2013;14:153–165. 10.1038/nrm3525 [DOI] [PubMed] [Google Scholar]
  • 8. Wahl MC, Will CL, Lührmann R. The Spliceosome: Design Principles of a Dynamic RNP Machine. Cell. 2009;136(4):701–718. 10.1016/j.cell.2009.02.009 [DOI] [PubMed] [Google Scholar]
  • 9. Will CL, Lührmann R. Spliceosome Structure and Function. Cold Spring Harbor Perspectives in Biology. 2011;3(7). 10.1101/cshperspect.a003707 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Chang AY, Marshall WF. Organelles—understanding noise and heterogeneity in cell biology at an intermediate scale. Journal of Cell Science. 2017;130(5):819–826. 10.1242/jcs.181024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Spector DL, Lamond AI. Nuclear Speckles. Cold Spring Harbor Perspectives in Biology. 2011;3(2). 10.1101/cshperspect.a000646 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Mahamid J, Pfeffer S, Schaffer M, Villa E, Danev R, Kuhn Cuellar L, et al. Visualizing the molecular sociology at the HeLa cell nuclear periphery. Science. 2016;351(6276):969–972. 10.1126/science.aad8857 [DOI] [PubMed] [Google Scholar]
  • 13. Itzhak DN, Tyanova S, Cox J, Borner GHH. Global, quantitative and dynamic mapping of protein subcellular localization. eLife. 2016;5(JUN2016):1–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Dultz E, Ellenberg J. Live imaging of single nuclear pores reveals unique assembly kinetics and mechanism in interphase. The Journal of Cell Biology. 2010;191(1):15–22. 10.1083/jcb.201007076 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Posakpny JW, England JM, Attardi G. MITOCHONDRIAL GROWTH AND DIVISION DURING THE CELL CYCLE IN HeLa CELLS. Journal of Cell Biology. 1977;74:468–491. 10.1083/jcb.74.2.468 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Klingauf M, Stanek D, Neugebauer KM, Matera AG. Enhancement of U4/U6 Small Nuclear Ribonucleoprotein Particle Association in Cajal Bodies Predicted by Mathematical Modeling. Molecular Biology of the Cell. 2006;17(12):4972–4981. 10.1091/mbc.E06-06-0513 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Nagaraj N, Wisniewski JR, Geiger T, Cox J, Kircher M, Kelso J, et al. Deep proteome and transcriptome mapping of a human cancer cell line. Molecular Systems Biology. 2011;7(548):1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Thul PJ, Åkesson L, Wiking M, Mahdessian D, Geladaki A, Ait Blal H, et al. A subcellular map of the human proteome. Science. 2017;356 (6340). 10.1126/science.aal3321 [DOI] [PubMed] [Google Scholar]
  • 19. Peterson JR, Hallock MJ, Cole JA, Luthey-Schulten Z. A Problem Solving Environment for Stochastic Biological Simulations. PyHPC 2013. 2013;. [Google Scholar]
  • 20. Hallock MJ, Stone JE, Roberts E, Fry C, Luthey-Schulten Z. Simulation of reaction diffusion processes over biologically relevant size and time scales using multi-GPU workstations. Parallel Computing. 2014;40(5):86–99. 10.1016/j.parco.2014.03.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Earnest TM, Cole JA, Luthey-Schulten Z. Simulating biological processes: stochastic physics from whole cells to colonies. Reports on Progress in Physics. 2018;81(5):052601 10.1088/1361-6633/aaae2c [DOI] [PubMed] [Google Scholar]
  • 22.Hallock MJ, Luthey-Schulten Z. Improving Reaction Kernel Performance in Lattice Microbes: Particle-Wise Propensities and Run-Time Generated Code. In: 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW); 2016. p. 428–434.
  • 23. Eden E, Geva-Zatorsky N, Issaeva I, Cohen A, Dekel E, Danon T, et al. Proteome Half-Life Dynamics in Living Human Cells. Science. 2011;331(6018):764LP–768. 10.1126/science.1199784 [DOI] [PubMed] [Google Scholar]
  • 24. Milo R, Phillips R. Cell Biology by the Numbers. Taylor & Francis Group; 2015. Available from: https://books.google.com/books?id=9NPRCgAAQBAJ. [Google Scholar]
  • 25. Moran U, Phillips R, Milo R. SnapShot: Key Numbers in Biology. Cell. 2010;141(7):1262–1262.e1. 10.1016/j.cell.2010.06.019 [DOI] [PubMed] [Google Scholar]
  • 26. Fujioka A, Terai K, Itoh RE, Aoki K, Nakamura T, Kuroda S, et al. Dynamics of the Ras/ERK MAPK Cascade as Monitored by Fluorescent Probes. Journal of Biological Chemistry. 2006;281(13):8917–8926. 10.1074/jbc.M509344200 [DOI] [PubMed] [Google Scholar]
  • 27. Maul G, Deaven L. Quantitative determination of nuclear pore complexes in cycling cells with differing DNA content. The Journal of Cell Biology. 1977;73(3):748–760. 10.1083/jcb.73.3.748 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Puck TT, Marcus PI, Cieciura SJ. CLONAL GROWTH OF MAMMALIAN CELLS IN VITRO. Journal of Experimental Medicine. 1956;103(2):273–284. 10.1084/jem.103.2.273 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. English AR, Voeltz GK. Endoplasmic Reticulum Structure and Interconnections with Other Organelles. Cold Spring Harbor Perspectives in Biology. 2013;5(4). 10.1101/cshperspect.a013227 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Friedman JR, Voeltz GK. The ER in 3D: a multifunctional dynamic membrane network. Trends in Cell Biology. 2011;21(12):709–717. 10.1016/j.tcb.2011.07.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Sin ATW, Harrison RE. Growth of the Mammalian Golgi Apparatus during Interphase. Molecular and Cellular Biology. 2016;36(18):2344–2359. 10.1128/MCB.00046-16 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Moen PT, Johnson CV, Byron M, Shopland LS, de la Serna IL, Imbalzano AN, et al. Repositioning of Muscle-specific Genes Relative to the Periphery of SC-35 Domains during Skeletal Myogenesis. Molecular Biology of the Cell. 2004;15(1):197–206. 10.1091/mbc.E03-06-0388 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Xing Y, Johnson CV, Moen PT, McNeil JA, Lawrence J. Nonrandom gene organization: structural arrangements of specific pre-mRNA transcription and splicing with SC-35 domains. The Journal of Cell Biology. 1995;131(6):1635–1647. 10.1083/jcb.131.6.1635 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Chen Y, Zhang Y, Wang Y, Zhang L, Brinkman EK, Adam SA, et al. Mapping 3D genome organization relative to nuclear compartments using TSA-Seq as a cytological ruler. The Journal of Cell Biology. 2018;217(11):4025–4048. 10.1083/jcb.201807108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Didychuk AL, Montemayor EJ, Brow DA, Butcher SE. Structural requirements for protein-catalyzed annealing of U4 and U6 RNAs during di-snRNP assembly. Nucleic Acids Research. 2016;44(3):1398–1410. 10.1093/nar/gkv1374 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Kamar RI, Banigan EJ, Erbas A, Giuntoli RD, Olvera de la Cruz M, Johnson RC, et al. Facilitated dissociation of transcription factors from single DNA binding sites. Proceedings of the National Academy of Sciences. 2017;114(16):E3251–E3257. 10.1073/pnas.1701884114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Chen TY, Cheng YS, Huang PS, Chen P. Facilitated Unbinding via Multivalency-Enabled Ternary Complexes: New Paradigm for Protein-DNA Interactions. Acc Chem Res. 2018;51:860–868. 10.1021/acs.accounts.7b00541 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Grimm C, Chari A, Pelz JP, Kuper J, Kisker C, Diederichs K, et al. Structural Basis of Assembly Chaperone- Mediated snRNP Formation. Molecular Cell. 2013;49(4):692–703. 10.1016/j.molcel.2012.12.009 [DOI] [PubMed] [Google Scholar]
  • 39. So BR, Wan L, Zhang Z, Li P, Babiash E, Duan J, et al. A U1 snRNP–specific assembly pathway reveals the SMN complex as a versatile hub for RNP exchange. Nature Structural & Molecular Biology. 2016;23(3):225–230. 10.1038/nsmb.3167 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Roithová A, Klimešová K, Pánek J, Will CL, Lührmann R, Staněk D, et al. The Sm-core mediates the retention of partially-assembled spliceosomal snRNPs in Cajal bodies until their full maturation. Nucleic Acids Research. 2018;46(7):3774–3790. 10.1093/nar/gky070 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Hoskins AA, Moore MJ. The spliceosome: a flexible, reversible macromolecular machine. Trends in Biochemical Sciences. 2012;37(5):179–188. 10.1016/j.tibs.2012.02.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Herzel L, Ottoz DSM, Alpert T, Neugebauer KM. Splicing and transcription touch base: Co-transcriptional spliceosome assembly and function. Nature Reviews Molecular Cell Biology. 2017;18(10):637–650. 10.1038/nrm.2017.63 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Anomalous diffusion due to binding: A Monte Carlo study. Biophysical Journal. 1996;70(3):1250–1262. 10.1016/S0006-3495(96)79682-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Bressloff PC, Newby JM. Stochastic models of intracellular transport. Rev Mod Phys. 2013;85:135–196. 10.1103/RevModPhys.85.135 [DOI] [Google Scholar]
  • 45. Roberts E, Magis A, Ortiz JO, Baumeister W, Luthey-Schulten Z. Noise Contributions in an Inducible Genetic Switch: A Whole-Cell Simulation Study. PLoS Computational Biology. 2011;7(3):e1002010 10.1371/journal.pcbi.1002010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Chen W, Moore MJ. Current Biology. 2015;25(5):R181–R183. [DOI] [PubMed] [Google Scholar]
  • 47. Girard C, Will CL, Peng J, Makarov EM, Kastner B, Lemm I, et al. Post-transcriptional spliceosomes are retained in nuclear speckles until splicing completion. Nature communications. 2012;3:994 10.1038/ncomms1998 [DOI] [PubMed] [Google Scholar]
  • 48. Fei J, Jadaliha M, Harmon TS, Li ITS, Hua B, Hao Q, et al. Quantitative analysis of multilayer organization of proteins and RNA in nuclear speckles at super resolution. Journal of Cell Science. 2017;130(24):4180–4192. 10.1242/jcs.206854 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Rino J, Carvalho T, Braga J, Desterro JMP, Lührmann R, Carmo-Fonseca M. A Stochastic View of Spliceosome Assembly and Recycling in the Nucleus. PLOS Computational Biology. 2007;3(10):1–13. 10.1371/journal.pcbi.0030201 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Alpert T, Herzel L, Neugebauer KM. Perfect timing: splicing and transcription rates in living cells. Wiley Interdisciplinary Reviews: RNA. 2017;8(2):1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Fu XD, Ares M. Context-dependent control of alternative splicing by RNA-binding proteins. Nature Reviews Genetics. 2014;15(10):689–701. 10.1038/nrg3778 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Karr JR, Sanghvi JC, Macklin DN, Gutschow MV, Jacobs JM, Bolival B, et al. A whole-cell computational model predicts phenotype from genotype. Cell. 2012;150(2):389–401. 10.1016/j.cell.2012.05.044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53. Dekker J, Belmont AS, Guttman M, Leshyk VO, Lis JT, Lomvardas S, et al. The 4D nucleome project. Nature. 2017;549:219–226. 10.1038/nature23884 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Rosa A, Di Stefano M, Micheletti C. Topological Constraints in Eukaryotic Genomes and How They Can Be Exploited to Improve Spatial Models of Chromosomes; 2019. Available from: https://www.frontiersin.org/article/10.3389/fmolb.2019.00127. [DOI] [PMC free article] [PubMed]
  • 55. Lin D, Bonora G, Yardımcı GG. Computational methods for analyzing and modeling genome structure and organization. WIREs Systems Biology and Medicine. 2019;11(1):e1435 10.1002/wsbm.1435 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Bianchi D, Peterson J, Earnest T, Hallock M, Luthey-Schulten Z. A Hybrid CME-ODE Method for Efficient Simulation of the Galactose Switch in Yeast. IET Systems Biology. 2018. 10.1049/iet-syb.2017.0070 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Kästner J. Umbrella sampling. WIREs Computational Molecular Science. 2011;1(6):932–942. 10.1002/wcms.66 [DOI] [Google Scholar]
  • 58. Laio A, Gervasio FL. Metadynamics: a method to simulate rare events and reconstruct the free energy in biophysics, chemistry and material science. Reports on Progress in Physics. 2008;71(12):126601 10.1088/0034-4885/71/12/126601 [DOI] [Google Scholar]
  • 59. Miller WH. Beyond transition-state theory: a rigorous quantum theory of chemical reaction rates. Accounts of Chemical Research. 1993;26(4):174–181. 10.1021/ar00028a007 [DOI] [Google Scholar]
  • 60. Zhao L, Kroenke CD, Song J, Piwnica-Worms D, Ackerman JJH, Neil JJ. Intracellular water specific MR of microbeadadherent cells: the HeLa cell intracellular water exchange lifetime. NMR in Biomedicine. 2008;21(2):159–164. 10.1002/nbm.1173 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Cohen LS, Studzinski GP. Correlation between cell enlargement and nucleic acid and protein content of HeLa cells in unbalanced growth produced by inhibitors of DNA synthesis. Journal of Cellular Physiology. 1967;69(3):331–339. 10.1002/jcp.1040690309 [DOI] [PubMed] [Google Scholar]
  • 62. Jawdekar GW, Henry RW. Transcriptional regulation of human small nuclear RNA genes. Biochimica et Biophysica Acta (BBA)—Gene Regulatory Mechanisms. 2008;1779(5):295–305. 10.1016/j.bbagrm.2008.04.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Lau Ck, Bachorik JL, Dreyfuss G. Gemin5-snRNA interaction reveals an RNA binding function for WD repeat domains. Nature structural & molecular biology. 2009;16(5):486–91. 10.1038/nsmb.1584 [DOI] [PubMed] [Google Scholar]
  • 64. Law MJ, Linde ME, Chambers EJ, Oubridge C, Katsamba PS, Nilsson L, et al. The role of positively charged amino acids and electrostatic interactions in the complex of U1A protein and U1 hairpin II RNA. Nucleic acids research. 2006;34(1):275–85. 10.1093/nar/gkj436 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Huranová M, Ivani I, Benda A, Poser I, Brody Y, Hof M, et al. The differential interaction of snRNPs with pre-mRNA reveals splicing kinetics in living cells. Journal of Cell Biology. 2010;191(1):75–86. 10.1083/jcb.201004030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Lund E, Dahlberg JE. True Genes for Human U1 Small Nuclear RNA. Journal of Biological Chemistry. 1984;259(3):2013–2021. [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007717.r001

Decision Letter 0

Jason M Haugh, Attila Csikász-Nagy

7 Oct 2019

Dear Dr Ghaemi,

Thank you very much for submitting your manuscript 'An In-Silico Human Cell Model Reveals the Influence of Spatial Organization on RNA Splicing' for review by PLOS Computational Biology. Your manuscript has been fully evaluated by the PLOS Computational Biology editorial team and in this case also by independent peer reviewers. The reviewers appreciated the attention to an important problem, but raised some substantial concerns about the manuscript as it currently stands. While your manuscript cannot be accepted in its present form, we are willing to consider a revised version in which the issues raised by the reviewers have been adequately addressed. We cannot, of course, promise publication at that time.

Specifically, in the revised version please clarify which features of the whole cell model are truly needed to reach the conclusions of the study, since as it seems, most of the claimed data integration into a whole cell model is not used. A further point is that it is stated the software"will be available". We need a more clear statement on this and a specific link where the software can be reached.

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

Your revisions should address the specific points made by each reviewer. Please return the revised version within the next 60 days. If you anticipate any delay in its return, we ask that you let us know the expected resubmission date by email at ploscompbiol@plos.org. Revised manuscripts received beyond 60 days may require evaluation and peer review similar to that applied to newly submitted manuscripts.

In addition, when you are ready to resubmit, please be prepared to provide the following:

(1) A detailed list of your responses to the review comments and the changes you have made in the manuscript. We require a file of this nature before your manuscript is passed back to the editors.

(2) A copy of your manuscript with the changes highlighted (encouraged). We encourage authors, if possible to show clearly where changes have been made to their manuscript e.g. by highlighting text.

(3) A striking still image to accompany your article (optional). If the image is judged to be suitable by the editors, it may be featured on our website and might be chosen as the issue image for that month. These square, high-quality images should be accompanied by a short caption. Please note as well that there should be no copyright restrictions on the use of the image, so that it can be published under the Open-Access license and be subject only to appropriate attribution.

Before you resubmit your manuscript, please consult our Submission Checklist to ensure your manuscript is formatted correctly for PLOS Computational Biology: http://www.ploscompbiol.org/static/checklist.action. Some key points to remember are:

- Figures uploaded separately as TIFF or EPS files (if you wish, your figures may remain in your main manuscript file in addition).

- Supporting Information uploaded as separate files, titled Dataset, Figure, Table, Text, Protocol, Audio, or Video.

- Funding information in the 'Financial Disclosure' box in the online system.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see here

We are sorry that we cannot be more positive about your manuscript at this stage, but if you have any concerns or questions, please do not hesitate to contact us.

Sincerely,

Attila Csikász-Nagy

Associate Editor

PLOS Computational Biology

Jason Haugh

Deputy Editor

PLOS Computational Biology

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: Whole-cell models that connect molecular biology with phenotype are needed to help researchers integrate disparate data, assemble a comprehensive understanding of biology, and enable experiments that are inaccessible to conventional experimental methods. Toward whole-cell models, Ghaemi et al. report a spatially-detailed detailed computational model of splicing in HeLa cells and its usage to elucidate several aspects of the role of spatial organization in splicing. The study is innovative for developing one of the first mechanistic models of splicing -- an essential aspect of the biochemical complexity of eukaryotic cells, for reporting some of the longest (15 minutes) spatially detailed simulations of entire eukaryotic cells, and for several novel quantitative analyses of splicing. These innovations are important, especially given the substantial challenges to building and simulating models of whole-cells. The methods are thorough and the manuscript is well-written. However, we have several minor comments about several aspects of the manuscript which could be clearer. In addition, we think that the manuscript would be strengthened by an expanded conclusion about the limitations of the reported model and the challenges going forward to building and simulating models that capture more chemical complexity, more intracellular processes, more molecules and spatial compartments, and longer lengths of time.

Minor comments

------------------------

- Because the reported model represents a portion of a HeLa cell and there are multiple complementary lines of whole-cell modeling research ongoing in the literature, we think would it be helpful for readers to clarify what is meant by a HeLa "whole-cell model" on pp 5. At the authors discretion, this could be described in an additional paragraph in the introduction or conclusion. In particular, it would be helpful/interesting to discuss what's missing from the model and what's needed to capture whole cells. For example, what is needed to capture splicing site selection, the combinatorial number of transcripts that could be created from each gene, how to handle the complexity of simulating the entire genome, etc.?

- (optional) In the interest of encouraging additional whole-cell modeling research, we think it would be helpful to comment on the extensibility of the model/methods at the end of the conclusions. While the reported model is an advance, realistically it would likely be difficult for most researchers to build upon the implementation, given lack of easy to use whole-cell modeling tools. It would be helpful to comment on how extensible the present model is and what technology is needed to make models like this easier to build and extend.

- Please clarify several methodological details:

- pp 5: "target volume fractions": Is this intended to mean the modeled volumes of the organelles?

- pp 5: "The [essential] components of the cell include: ...": Is this intended to mean essential for the splicing processes that will be investigated?

- pp 5: "... genes ... were placed around the speckles ...": Please clarify how they were placed.

- pp 7: "... assembles in a [stepwise] manner ..."

- pp 7: "... pre-mRNA is [converted] to an mRNA transcript ..."

- pp 8: "... splice site has been chosen ...": Which sites does the model choose?

- pp 10: "... production rate of ...": Does this account for the turnover of the particles?

- Lightening the background in Figure 2A might make the reactions easier to read.

- For consistency, should "Complex A" in Figure 2B show one rather than two U1 particles?

- The word choice and grammar could be clarified throughout the manuscript. Below are several examples:

- pp 3: "without massively increasing [the] gene count ..."

- pp 3: "The [order of] intronic removal ...". Is this intended to mean temporal order?

- pp 3: "... coding regions can be shuffled after the removal ...". Do you intend to mean "shuffle" as in exon scrambling? The statement also seems to confuse the order of events; the coding sequence is determined by splicing.

- pp 3: "... occurs in multiple steps ..., [transport between ] .., and terminates ...". We think transport should be plural

- pp 5: The inconsistent use of past and present tense in reference to the distribution of units makes it unclear whether the statements refer to the biology or to the model. Consistent tense would clarify that these statements refer to the model.

- pp 7: Correct comma placement in "... reach the cytoplasm where, by a series of complex reactions, they bind ..."

- pp 8 and others: The first double quote in each pair of quotes should point to the right (e.g., "complex E", "complex B")

- pp 15: Remove the space in "nano meter" and pluralize

Reviewer #2: The manuscript by Ghaemi et al describes two main things. First, is a claimed “whole cell” model of a HeLa (mammalian) cell. It would seem to be a whole cell model in the sense that it is spatially resolved, being parameterized in a careful and detailed way from a variety of image based and biochemical composition data. However, the number of biochemical processes considered seems to be quite few to support the claim of a whole cell model (a few RNA splicing reactions). Second, they incorporate in their model a few reactions that describe, mostly in terms of lumped kinetic processes, pre-mRNA splicing. They use spatial kinetic master equation approaches to simulate reaction diffusion mechanisms underlying spliceosome formation, coalescence into nuclear speckles, and production of mature mRNA from pre-mRNA. While splicing is surely a central and important cellular process, the specific questions and rationale for their modeling here was not laid out clearly. Most of the findings from analyzing the model incorporating these processes seem fairly obvious, such as increasing the number of nuclear pores allows more spliceosomes to be formed because of increased nucleocytoplasmic transport, or that when pre-mRNA are localized with the splicing factors, splicing rates increase significantly. Moreover, the authors do not seem to use most of the spatial model features incorporated (besides nucleus vs. cytoplasm and nuclear subcompartments such as speckles). Overall, there does not seem to be significant advances or new knowledge presented in the manuscript to warrant publication in its current form. Some more specific points are listed below in addition.

-It seems that this model is of a spherical cell, when most HeLa lines are adherent (some are floating).

-Mitochondria form large networks, and are usually not thousands of individual compartments (as is currently modeled).

-Liquid liquid phase separation is known to be caused by intrinsically disordered proteins and/or multi-valent interactions, but no such mechanisms were used to invoke nuclear speckles, which were claimed to be separate liquid phases.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Jonathan Karr and Bilal Shaikh

Reviewer #2: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007717.r003

Decision Letter 1

Jason M Haugh, Attila Csikász-Nagy

8 Jan 2020

Dear Dr Ghaemi,

Thank you very much for submitting your manuscript, 'An In-Silico Human Cell Model Reveals the Influence of Spatial Organization on RNA Splicing', to PLOS Computational Biology. As with all papers submitted to the journal, yours was fully evaluated by the PLOS Computational Biology editorial team, and in this case, by independent peer reviewers. The reviewers appreciated the attention to an important topic but identified some aspects of the manuscript that should be improved.

We would therefore like to ask you to modify the manuscript according to the review recommendations before we can consider your manuscript for acceptance. Your revisions should address the specific points made by each reviewer and we encourage you to respond to particular issues Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.raised.

In addition, when you are ready to resubmit, please be prepared to provide the following:

(1) A detailed list of your responses to the review comments and the changes you have made in the manuscript. We require a file of this nature before your manuscript is passed back to the editors.

(2) A copy of your manuscript with the changes highlighted (encouraged). We encourage authors, if possible to show clearly where changes have been made to their manuscript e.g. by highlighting text.

(3) A striking still image to accompany your article (optional). If the image is judged to be suitable by the editors, it may be featured on our website and might be chosen as the issue image for that month. These square, high-quality images should be accompanied by a short caption. Please note as well that there should be no copyright restrictions on the use of the image, so that it can be published under the Open-Access license and be subject only to appropriate attribution.

Before you resubmit your manuscript, please consult our Submission Checklist to ensure your manuscript is formatted correctly for PLOS Computational Biology: http://www.ploscompbiol.org/static/checklist.action. Some key points to remember are:

- Figures uploaded separately as TIFF or EPS files (if you wish, your figures may remain in your main manuscript file in addition).

- Supporting Information uploaded as separate files, titled 'Dataset', 'Figure', 'Table', 'Text', 'Protocol', 'Audio', or 'Video'.

- Funding information in the 'Financial Disclosure' box in the online system.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

We hope to receive your revised manuscript within the next 30 days. If you anticipate any delay in its return, we ask that you let us know the expected resubmission date by email at ploscompbiol@plos.org.

If you have any questions or concerns while you make these revisions, please let us know.

Sincerely,

Attila Csikász-Nagy

Associate Editor

PLOS Computational Biology

Jason Haugh

Deputy Editor

PLOS Computational Biology

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have addressed our major concerns. However, we feel that the manuscript would be strengthened by clarifying the purpose and goals of the study in the abstract, author summary, and introduction. In addition, although the HelaBuilder tutorial is easy to follow and the model is extensible *in principle*, in practice the procedural definition of the model and the lack of annotation within the model code will make it challenging for others to expand upon the model. In the interest of transparency, we think its important to acknowledge that much more needs to be done to make this model and other really extensible.

Minor comments

- knwon is mispelled on page 3

Reviewer #2: The authors have substantially revised the manuscript for clarity which has much improved it. The specific application to splicing remains somewhat lacking in this reviewer's opinion (mainly in terms of new biological knowledge). However, the bigger picture aspects of the manuscript, such as a spatially resolved mammalian cell model scaffold with available code for extension, and pushing the boundaries of simulation time for such a model, are appreciated and should benefit the literature and community.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Jonathan Karr

Reviewer #2: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007717.r005

Decision Letter 2

Jason M Haugh, Attila Csikász-Nagy

6 Feb 2020

Dear Dr. Ghaemi,

We are pleased to inform you that your manuscript 'An In-Silico Human Cell Model Reveals the Influence of Spatial Organization on RNA Splicing' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch within two working days with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Attila Csikász-Nagy

Associate Editor

PLOS Computational Biology

Jason Haugh

Deputy Editor

PLOS Computational Biology

***********************************************************

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1007717.r006

Acceptance letter

Jason M Haugh, Attila Csikász-Nagy

2 Mar 2020

PCOMPBIOL-D-19-01350R2

An In-Silico Human Cell Model Reveals the Influence of Spatial Organization on RNA Splicing

Dear Dr Ghaemi,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Laura Mallard

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Endoplasmic reticulum construction.

    (TEX)

    S2 Text. Effects of mitochondria network-like morphology on the splicing particles formation.

    (TEX)

    S1 Fig. Changes in the number of NPCs, correlates withe the number of formed U1 splicing particles.

    (PNG)

    S2 Fig. The HeLa cell with a network-like mitochondria.

    See Methods section for construction details of the mitochondria.

    (PNG)

    S3 Fig. The time required to form the first splicing particle dissected by the set of discrete reactions occurring in the nucleus and cytoplasm.

    Error bars represent the standard deviations. For each condition, 20 simulation replicates were performed.

    (PNG)

    S1 Table. Parameters used in the cellular automata program to create realistic ER.

    (TEX)

    S2 Table. Diffusion coefficients for species involved in splicing particles formation.

    Abbreviations are: nucleus (N), NPC (P), Cajal bodies (J), nuclear speckles (S) and cytoplasm (C).

    (TEX)

    S3 Table. Diffusion coefficients of spliceosomal particles.

    Abbreviations are: nucleus (N) and nuclear speckles (S).

    (TEX)

    S4 Table. The results of the simulations sets with a network-like morphology of the mitochondria as compared to the fragmented-mitochondria that were reported in the main text.

    (TEX)

    S5 Table. The p-values of data from various simulation sets.

    (TEX)

    Attachment

    Submitted filename: Ghaemi_response_letter.pdf

    Attachment

    Submitted filename: Ghaemi_Response2.pdf

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting Information files. Additionally, the code we developed can be downloaded from Github through: https://eukaryoticcellbuilder.github.io/HeLa_Builder/.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES