Abstract
Currently, a significant barrier to building predictive models of cellular self-assembly processes is that molecular models cannot capture minutes-long dynamics that couple distinct components with active processes, whereas reaction-diffusion models cannot capture structures of molecular assembly. Here, we introduce the nonequilibrium reaction-diffusion self-assembly simulator (NERDSS), which addresses this spatiotemporal resolution gap. NERDSS integrates efficient reaction-diffusion algorithms into generalized software that operates on user-defined molecules through diffusion, binding and orientation, unbinding, chemical transformations, and spatial localization. By connecting the fast processes of binding with the slow timescales of large-scale assembly, NERDSS integrates molecular resolution with reversible formation of ordered, multisubunit complexes. NERDSS encodes models using rule-based formatting languages to facilitate model portability, usability, and reproducibility. Applying NERDSS to steps in clathrin-mediated endocytosis, we design multicomponent systems that can form lattices in solution or on the membrane, and we predict how stochastic but localized dephosphorylation of membrane lipids can drive lattice disassembly. The NERDSS simulations reveal the spatial constraints on lattice growth and the role of membrane localization and cooperativity in nucleating assembly. By modeling viral lattice assembly and recapitulating oscillations in protein expression levels for a circadian clock model, we illustrate the adaptability of NERDSS. NERDSS simulates user-defined assembly models that were previously inaccessible to existing software tools, with broad applications to predicting self-assembly in vivo and designing high-yield assemblies in vitro.
Significance
A major roadblock in quantitative modeling of cell biology is the need to resolve fast processes over long timescales and individual proteins over large length scales. This is particularly evident in self-assembly processes such as clathrin-mediated endocytosis and viral assembly, which proceed over minutes, building from often weak and short-lived binding events. The NERDSS software presented here uses the reaction-diffusion model to simulate such self-assembly processes, with broad applications to biological processes as they occur in the nonequilibrium cell.
Introduction
Watching the dynamics of individual molecules in the cell as they function as part of a collective is now possible thanks to revolutions in live-cell microscopy. Despite technical advances in mapping dynamics onto nanoscale structure formation in the cell (1), resolution is still limited by events that are too fast and combine too many distinct components. Computer simulations offer the promise of reproducing dynamics using models designed from the underlying physics and mechanics, providing high-resolution spatial and temporal predictions and exquisite control over components and their interactions. Current tools for capturing cell-scale complexity are a growing companion to cell biology. For example, fields such as cell-signaling (2), developmental biology (3,4), and systems biology (5) have benefitted from a variety of spatial (6,7) and nonspatial tools (8,9) in which interactions and reactions are modeled as events parameterized by rate constants. However, many cell-scale processes involve self-assembly, which is a challenge for modeling because it spans similarly long length and timescales as biochemical signals while also depending fundamentally on molecular structural geometry. Although rate-based approaches have been shown to be highly insightful for modeling self-assembly, previous models lack explicit spatial resolution (8,10, 11, 12), apply only to small systems (13,14), or include potentials that prevent quantitative comparison to experiment (15). Here, we present the nonequilibrium reaction-diffusion self-assembly simulator (NERDSS), a higher-resolution rate-based software tool with the addition of user-specified, coarse molecular structure to enable the study of self-assembly at the cell scale.
NERDSS was developed to build self-assembly into the reaction-diffusion (RD) model because of the strength of RD in simulating relatively slow, nonequilibrium dynamics. The name of the software may sound somewhat redundant (particularly NERD). The “nonequilibrium” is included in the name to emphasize how self-assembly can be simulated here in generalized, nonequilibrium systems such as the cell. Additionally, although the RD model is commonly applied to studying systems not at thermodynamic equilibrium (16), it is nonetheless useful for quantifying the pathways to and composition of equilibrium for complex multicomponent systems. The standard computational approach for studying self-assembly, using coarse-grained molecular modeling (17, 18, 19), provides access to these pathways and equilibria with more physically detailed models, in which interactions emerge because of distance-dependent energy functions rather than the rate-controlled events of RD models. A coarse molecular model thus naturally can accommodate a range of structures or defects (17,19,20) and capture emergent cooperativity, strain, and localization of components (21,22). However, they are still relatively limited in the length and timescales of dynamics they can access, which does not extend to the cell scale. They are parameterized by potential energy functions, and converting these to the binding free energies (e.g., as done here (21,23)) or rate constants (24, 25, 26) for comparison to experiment is nontrivial. Lastly, chemical reactions require covalent bond breaking that is not typically accessible in molecular modeling, preventing systematic and transferable methods for involving enzymatic or ATP-driven reactions ubiquitous in cells. These models are unable to simulate in vivo cell signaling, cytoskeletal dynamics, or clathrin-mediated endocytosis, for example. Our NERDSS software addresses this substantial application gap, and although it lacks the detail of energy-function based models, it is able to uniquely preserve important features of molecular assembly.
NERDSS overcomes technical challenges with adding structure to RD and allowing large, reversible complexes to form in a user-friendly and widely applicable way. The major challenges for applying RD to self-assembly are expanding to multisite, volume-excluding species and capturing orientation-dependent interactions for arbitrary species. For continuum partial differential equation (PDE) approaches (6,7) to RD and lattice-based reaction-diffusion master equation (RDME) approaches (27, 28, 29), species have no individual coordinates but are well mixed in the infinitesimal or finite grid spacings, respectively, preventing the resolution of structural assemblies. Capturing assembly kinetics is possible in these methods for complexes with uniquely defined subunits, with recent RDME-based studies of ribosome formation (30) and essential steps in spliceosome assembly (31). Although single-particle RD is considerably more computationally expensive than PDE or RDME, it does track individual coordinates (32, 33, 34, 35, 36, 37, 38, 39), often with excluded volume, providing a starting point for multisite species and structure resolution. Recent methods have generated multisite species by linking them using spherically symmetric potentials (14,40), producing simulations that can capture effects of clustering (41) and polymer assembly (40). However, without orientation-dependent interactions, well-defined structural geometries such as lattices and spherical shells will not form. Brownian dynamics methods have been successfully used to capture orientation-dependent, rate-controlled association but are limited to small systems (42,43). Here, we use our recent algorithm for structure-resolved RD of rigid bodies, which includes and accounts for effects of rotational as well as translational diffusion on binding reactions (44). Instead of using potentials, orientational constraints are applied after binding events to maintain accurate solutions to the RD equations of motion and direct comparison to experimental rates. With previous application of this algorithm to clathrin lattice assembly in solution (44), NERDSS generalizes these routines for orienting molecules to apply here to arbitrary user-defined systems, including on the membrane, which can act as a potent driver of self-assembly via dimensionality reduction (45). This approach retains the flexibility and the adaptability of the method to new molecules and structures because binding is parameterized by rates and not customized interaction potentials.
A critical feature of generalized rate-based software for usability, model portability, and construction of conditional and cooperative interactions is the adoption of well-defined formatting languages for model encoding. Rule-based modeling (46), developed for nonspatial models, supports descriptions of species as multisite molecules in which each site can exist in distinct states. Interactions or binding rates can be coupled within species to capture effects of phosphorylation, for example, on subsequent binding events. Formatting languages such as BioNetGen Language (46,47) and Kappa (48,49) implement rule-based modeling, which precludes the need to define and track all possible species and avoids the issue of combinatorial complexity (50). The rule-based format is now available in efficient nonspatial methods such as RuleBender (51) and NFSim (8) and spatial methods such as Virtual Cell (52) and Smoldyn (34), facilitating model comparison and portability. Rule-based encoding translates quite naturally to NERDSS because of the spatial specification of each site. We validate our rule-based encoding here by simulating a model of a circadian clock (53) (and other simple models), compared with the same model simulated in Virtual Cell (6).
To illustrate the capabilities of NERDSS as a spatial self-assembly simulator, we apply it here to steps in clathrin-mediated endocytosis (CME) and virion assembly. These examples highlight several challenges and opportunities for modeling approaches, and both assembly processes have been studied using coarse molecular modeling simulations (19,54, 55, 56, 57). CME is an essential pathway used by all eukaryotes for transport across the plasma membrane. It is a stochastic assembly process that depends on membrane mechanics (1,58), enzymatic reactions (59, 60, 61), and the stoichiometry of dozens of distinct components (59,62, 63, 64), making it a complex system that can benefit from integrated physics-based simulations. Spatial simulations of clathrin, a 600 kDa trimeric protein, assembling in solution (44,54,56,65) and on the membrane (via recruitment by adaptor proteins) (19) have determined assembly pathways, energetic and structural determinants for assembly, and dependence on links to the membrane. The clathrin models developed here for NERDSS illustrate how rate-based models can be adapted and extended to additional components without requiring the substantial expertise of energy-function-based methods (19,66,67) and without sacrificing spatial resolution (45,68, 69, 70). To enable nonequilibrium simulations of immature viral lattice formation in vivo, we introduce here a model of the retroviral Gag protein, which assemble a spherical lattice (71), an essential component of the HIV infection and maturation cycle (72). Viral capsid assembly is widely studied using coarse molecular modeling approaches (see (17)), but applications to studying assembly of the precursor lattices that occur within the host cell are more limited (55). In this study, we focus on describing the technical features of the models necessary to produce realistic assemblies, leaving much of the biological implications for future work.
In summary, NERDSS offers a distinct tool that uses the RD model to simulate self-assembly at the cell scale, thus allowing for space- and time-dependent dynamics and kinetics that are immediately comparable to experiment by a broad user base. Below, we first describe the operation of the software and the features we introduced as necessary to move beyond existing tools (Fig. 1). We provide multiple forms of validations, including against fundamental reactions and nonassembly problems such as a circadian clock model, accessible to most RD software. We then present multiple self-assembly applications, with input files provided in the software repository. Models of clathrin lattice assembly are chosen to highlight distinct features of models that can be used to tune the dynamics and stability of the observed structures. We simulate clathrin assembly driven by membrane localization and show how introducing enzymes to the system to control the lipid populations on the membrane drives disassembly. These clathrin models thus capture relaxation from out-of-equilibrium starting states to an equilibrium steady state, with or without active energy consumption. We illustrate the model design process for the self-assembly of Gag retroviral protein monomers into an immature lattice in solution, in which monomers are titrated into solution and have limited lifetimes when unbound. Finally, we discuss the current limitations and most promising future advancements of NERDSS software for realistic cell-scale dynamics.
Methods
Implementation
NERDSS is written in ANSI/ISO standard C++11 and is available on Linux and macOS. The accompanying GUI is implemented in Java. The source code for both can be found at https://github.com/mjohn218/NERDSS and is provided under the GNU general public license. A user guide and simulation example systems can be found on the software page. Input files are formatted according to BioNetGen Language (47), such as are used in RuleBender (51) and the NFSim software (8) and is compatible with Virtual Cell (6). Additional features are necessary, however, because of the spatial and structural details used in NERDSS.
Reactions
NERDSS supports the three primary types of reactions: zeroth, first, and second order. The rates of these reactions, other than creation, can depend on the states and/or pre-existing interactions of any participant species’ sites.
Zeroth-order reactions
De novo particle creation reactions are treated as a Poisson process. No events occur if a uniform random number (URN) fulfills URN < exp(−λ), where λ = k0VΔt, k0 is the creation rate input in units of M/s, V is the simulation volume (converted from nm3 to units of M−1 by 0.602), and Δt is the timestep. Therefore, N events occur based on . Created particles are placed within the simulation volume with random coordinates. Zeroth-order reactions can be used to titrate species into the simulation volume.
First-order
First-order reactions are treated as Poisson processes, with a reaction probability per each site of p1(Δt) = 1 − exp(−λ), where λ = k1Δt, k1 is the microscopic rate in units of s−1, and Δt is the time step. The reaction rates can be specified either as microscopic rates or as macroscopic rates. Only one such event can occur per time step per molecule. Four types of first-order reactions are supported:
-
1)
Creation: create a copy of a particle B from another particle A, in which sites required such as (a) are indicated within the parentheses—examples are transcription or translation, e.g., A(a) → A(a) + B(b).
-
2)
Destruction: destroy a molecule and all other molecules in its complex. If the sites (a) are specified as unbound, it will only destroy unbound molecules, e.g., A(a) → NULL.
-
3)
State change: change a state of a site on a molecule, e.g., A(a∼p) → A(a∼u).
-
4)
Dissociation: remove an interaction between the sites of two particles, written as the conjugate back reaction of an association reaction. To retain detailed balance (32), the particles are left such that the dissociating sites are at their binding radius, e.g., A(a!1).B(b!1) → A(a) + B(b).
Second-order reactions
Bimolecular reactions are applied to specific sites between two proteins. Association events can be conditional on the state of the site, e.g., occurring only when the binding site is phosphorylated, or only if the protein is bound through another site as well.
For the free propagator reweighting (FPR) algorithm (32,44,73) used in NERDSS, reaction probabilities are invariant with respect to orientation. Therefore, the reacting particles are “snapped” into place in a predefined geometry to prevent arbitrary structures from forming, with sites always placed at the binding radius σ from one another. Thus, the rigid assemblies that form have strict structures they can form; they do not predict emergent structures. The geometry is defined by a set of vectors and five angles between and within each particle (Fig. 1). If angles and/or orientations are undefined, sites bind to a separation of σ at the orientation they were when the event occurred. Both molecules are translated and rotated into place based on their relative translational and rotational diffusion constants. Thus, smaller complexes will move more to orient, and complexes restricted to the membrane do not rotate out of their membrane-localized orientation. Detailed definitions of these vectors and angles can be found in the Supporting Materials and Methods.
Reaction rates can be defined either as macroscopic rates kon (i.e., rates measured experimentally and dependent on diffusion to contact and energetic barriers), in units of μM−1s−1, or as microscopic or intrinsic rates ka, in units of nm3/μs (converts to μM−1s−1 with multiplication by 0.602). All rates are inputted as three-dimensional (3D) values, where KD = kb/ka = koff/kon, where kb and koff are the corresponding unbinding rates in units s−1. For example, reaction rates in two dimensions (2D) use the 3D value divided by a length scale that by default is set to 2σ (units of nm) but that can be independently specified per reaction using an input parameter. A reaction is identified as 2D when it involves two species that have no diffusion in z (e.g., lipids) or between sites on two complexes that are localized to 2D (Dz = 0). Probabilities of each reaction are calculated with the FPR method as previously described and are parameterized by the intrinsic reaction rate ka, the net diffusion coefficient of the reactants Dtot, and the binding radius σ. Conversions between macroscopic rates and the rates needed to calculate the reaction probabilities for each reaction type are performed by the software (see Table S1). Self-binding between molecules such as clathrin with repeated identical sites requires distinctions between binding sites with the same versus distinct labels (see Supporting Materials and Methods). Two types of bimolecular reactions are supported:
-
1)
Association: form an interaction between two sites of two molecules, which, if the association is reversible, has a conjugate first-order dissociation reaction. The resulting complex is then treated as one unit for future propagation, e.g., A(a) + B(b) → A(a!1).B(b!1)
-
2)
State change: change the state of a site on a particle, facilitated by another particle, such as phosphorylation by a kinase. This is thus a composite reaction that combines a bimolecular reaction with an automatic first-order state change of one site, and both reactant species remain unbound. Hence, it is a simplification of Michaelis-Menten, in which reversible binding can trigger a chemical reaction and rapid dissociation, here assuming the chemical reaction is instantaneous; e.g., A(a) + B(b∼u) → A(a) + B(b∼p).
Coupling reactions
NERDSS allows the completion of one reaction (such as dissociation of a complex) to be coupled to another reaction occurring immediately. Thus, one can readily model enzymatic reactions and Michaelis-Menten kinetics. In particular, if an enzyme binds a substrate, the dissociation reaction can be directly coupled to a state change on the substrate, for example, from unphosphorylated to phosphorylated.
Intracomplex binding and defect formation
An important consequence of forming self-assemblies is that molecules can be capable of binding to one another through free sites when they are within the same complex. These intracomplex binding events involve two sites becoming a bound complex but are not truly bimolecular because they do not involve a search to find one another—they are already colocalized. They are thus comparable to conformational rearrangements and are dependent on time (and fundamentally on the details of the atomic interactions) and not on the system size or density. This type of unimolecular or first-order reaction cannot be treated with the same binding probabilities as are used for bimolecular events despite it involving a bond forming between two species. In particular, for the rigid species that we model here, they will have no motion relative to one another (Dtot = 0), and as noted previously (44), this results in binding probabilities of either one or zero, with the former violating detailed balance and the latter preventing the formation of important stabilizing bonds. Even with flexible molecules, in which intracomplex sites can undergo relative diffusion, constraints on mobility due to the bonded structure would, over most time steps, require adjusted reaction probabilities. We define the binding probability of these intracomplex or loop-closure events thus using a Poisson probability, similar to the first-order reactions described above. Then, we must specify a unimolecular rate, given a bimolecular rate constant. We define this rate such that the equilibrium between the bound and unbound states for a two-step process of loop closure, that is, a bimolecular event and a unimolecular loop closure, is the same as if the protein closed the loop in a single step, forming both bonds at once. The rate is given by
where C0 is the standard state concentration of 1 M. For loop closure, the user can thus specify one additional parameter, ΔGcoop. By default, ΔGcoop = 0, and loop-closure events are typically difficult to reverse, requiring two bond-breaking events to separate the loop back into two pieces. Positive values of this parameter render loops less stable and more reversible or dynamic (Fig. S3). We derive this expression in the Supporting Materials and Methods.
We allow these events to form when the sites are spatially proximate, with the user able to specify a maximum distance to allow these events to occur. The maximum distance is the binding radius, σ, multiplied by a factor, bindRadSameCom, which by default is set to 1.1, which includes only perfectly (with numerical precision errors allowed) aligned contacts. The primary reason to increase it is because for some self-assembly structures (such as the curved clathrin cage), the rigid structures cannot form perfect, defect-less structures (see Fig. S8). Allowing binding between legs that are close together mimics structural flexibility present in the biological molecules. A more sophisticated treatment in the future could couple defects to energetic penalties that impact reaction rates and allow relaxation within the rigid assembly as deviations relative to the user-defined orientation angles. Finally, the software prohibits binding between a pair of molecules if they are already bound through a distinct set of sites. Hence, intracomplex or loop-closure events must be mediated through at least a third molecule (e.g., four molecules in the case of clathrin lattices).
Evaluation of steric overlap
Once association events occur between two sites, their two parent complexes are rotated into place to generate the proper, predefined geometry of the two binding proteins. The exception is intracomplex binding, in which because the binding happens within a single complex, no rotation or movement occurs. For two separate complexes, however, this rotation into place can result in steric overlap between sites that are part of the complex but not part of the binding event. We evaluate overlap after association by measuring distances between all centers of mass (COMs) in the new complex and defining a minimum distance, overlapSepLimit, that will determine steric overlap. If any pair of proteins have COMs less than overlapSepLimit, which by default is 1 nm, the binding event is rejected, and the two complexes instead undergo a diffusion move during the time step. This is the simplest evaluation of steric overlap and could be improved, particularly in cases of defect-forming structures (Fig. S9). Here again, more elaborate schemes could determine whether the full space occupied by a molecule overlaps with other molecules, not just the COM.
A final steric overlap check is then performed between the newly bound complex and all other binding sites of complexes present in the simulation to ensure excluded volume is maintained. If the new complex produces steric clashes with other binding sites in the simulation volume, the binding event is rejected.
Excluded volume
Sites that react with one another have excluded volume that is determined by their binding radius, σ. Specifically, in 3D, a reactive site excludes all partners from entering within a sphere of radius σ, and in 2D, within a disk of radius σ. By default, sites otherwise do not exclude volume with each other but act as point particles, including the COM sites. Therefore, if one desires sites to exclude one another, they must be specified as undergoing a bimolecular reaction, and by setting the rate to zero, they will reflect off one another at σ without ever binding. We note that the addition of reactions inevitably slows down the simulations.
An important consequence of having excluded volume only between specified reactive pairs is that once a pair of sites is in the bound state, they no longer automatically exclude volume from other sites in the system. For proteins that are fully bound (and thus are temporarily nonreactive to bind), they can thus pass through other proteins, which is often physically undesirable. To maintain excluded volume, proteins can be given dummy sites that are present purely to exclude volume with dummy sites on other molecules, at all times. A future alternative will be to have an option for all bimolecular reactions that sites can continue reflecting off their reaction partners, whether they are in the free or bound states.
Boundary effects
By default, the simulation boundaries are reflecting. Rectangular and spherical geometries are implemented, with no restriction on user-defined size. If association events result in a very large lattice, it is possible for the structure to extend beyond the physical boundaries of the simulation volume. These moves are thus rejected. On a related note, if association events result in very large rotational reorientation of a complex, these moves can also be rejected because they result in an unphysical amount of displacement per time step. Binding to lipids or sites restricted to the surface can be defined using explicit lipids or a field-based implicit lipid model (74).
Diffusion constants of complexes
Once proteins bind to form a new complex, their translational and rotational diffusion constants are updated to reflect the larger hydrodynamic radius of the bound complex. Each protein molecule has a user-defined Dt and DR. Once a bound complex forms, the new transport coefficients are then defined based on all N components of the complex by simply assuming the radii sum:
and the same in y and z. For molecules restricted to the surface, Dt,z = 0 and all rotational diffusion except (if desired) DR,z are also zero. Correspondingly, any complex that contains these molecules will also have that diffusional component set at zero.
Data output
NERDSS produces restart files that ensure that when simulations are interrupted for any reason, they can be restarted exactly from where they left off, similar to molecular dynamics software. NERDSS also allows restarts to introduce new molecules or reactions into the system using input files formatted the same as the standard inputs. This allows systems to be prepared or equilibrated in one state before, for example, enzymes are introduced.
Simulation algorithm outline
-
1)
Initialization. Copies of rigid species are created from the provided molecule templates, given random coordinates, and checked for placement within the simulation volume and overlap with other particles. If two reactive partners are found to be overlapping, one of the particles is reinitialized with new random coordinates, and all particles are rechecked for overlap.
-
2)
Optimization. To optimize evaluation of the two-body (bimolecular) events, the simulation volume is split into N sub-boxes, in which the length of each sub-box edge is at least Rmax,sys = , where m loops over all binding reactions, σ is the binding radius, D the total diffusion coefficient of both reactants, and lim is the radius of the reactant protein i. Molecules are assigned to boxes based on the position of their COM. This allows for looping over all possible binding partners only in a molecule’s current and neighboring boxes, without excluding any partners with a nonzero reaction probability.
-
3)
Reactions. Within each time step, reaction probabilities are calculated in order from zeroth-, first-, and second-order reactions. Reactions are described in detail above. We note that if a molecule is created or involved in a reaction in any way, it cannot participate in any other reaction during the course of the time step, nor can it diffuse. Hence, only one site per molecule can react per step. Other diffusing molecules will continue to include these molecules in overlap checks to prevent overlap in subsequent steps.
-
4)
Propagation and overlap. After all reaction checks are completed, sites that were within Rmax of a reaction-partner during the time step are checked for overlap. Because a multicomponent complex only moves as a rigid unit, each complex with reactive sites within is looped over, along with a stored list of all reaction partners. If the newly displaced positions overlap (with displacements sampled from Gaussians for translational and rotational diffusion for both reaction partners), the displacements of complex i and its partner are resampled, and the overlap check restarts. Once a complex’s position has been updated, it cannot be moved again.
Simulation details and parameters
Simulation parameters are listed in Table 1, and values are included in figure legends. Transport coefficients are defined using the Einstein-Stokes equation, using each molecule’s hydrodynamic radius estimated from its approximate radius of gyration. Unless otherwise noted, σ = 1 nm for binding.
Table 1.
Name | Unit | Meaning |
---|---|---|
Δt | μs | time step |
kon | μM−1s−1 | macroscopic reaction rate |
koff | s−1 | macroscopic unbinding rate |
ka | nm3/μs | microscopic/intrinsic reaction rate |
kb | s−1 | microscopic/intrinsic unbinding rate |
KD | μM | dissociation constant |
Dt | nm2/μs | translational diffusion coefficient/constant |
DR | rad2/μs | rotational diffusion coefficient/constant |
Dtot | μnm2/μs | net diffusion coefficient of one reactant relative to another |
σ | nm | binding radius |
Rmax | nm | largest distance within which binding reactions can occur |
ΔGcoop | kBT | adjustment of the free energy during the loop-closure event |
f | loop cooperativity factor, f = exp(−ΔGcoop/kBT) | |
bindRadSameCom | Multiplier σ, sets max distance between sites to allow the loop-closure event to occur | |
overlapSepLimit | nm | Max distance between molecule COMs resulting in steric overlap |
We describe here the geometry of the molecules from the clathrin and Gag systems and the orientation angles.
Clathrin model
For the flat clathrin molecule, each clathrin trimer has a COM and three leg binding sites rotated 120° relative to one another, and each leg site is 10 nm from the center such that when dimerized, the centers are 21 nm apart, which is comparable to experimental lattices (75). When two clathrin molecules bind to each other, we suppose their leg sites are head to head and on a line with their centers, and, to generate a flat hexagon structure, all their centers and legs should be on the same plane. Thus, the five orientation angles are [θ1, θ2, φ1, φ2, ω] = [π, π, nan, nan, 0] (“nan” means no value constraint, and see Fig. 1 and Supporting Materials and Methods for the definition of orientation angles). For binding to adaptor proteins, each flat clathrin includes three more leg sites, which we choose to locate directly beneath the legs (π/4.656 angle between the long leg and short leg), as shown in Fig. 5 a. When the clathrin binds to a linear adaptor protein, the orientation should make the long legs parallel to the membrane, where the adaptor and lipid site are designed to be perpendicular to membrane surface. Therefore, the five orientation angles of adaptor to clathrin (adaptor is molecule 1) should be set as [π/2, 3.656π/4.656, nan, π, π]. For the puckered clathrin, the legs are tilted below the plane (5, 10, or 20°) and rotated 120° around the molecule normal, with lleg = 7.5 nm. The orientation angles of two bound puckered clathrin is the same as flat clathrin [π, π, nan, nan, 0]. The adaptor binding to the lipid is two linear molecules binding with angles [π/2, π/2, nan, nan, π].
Gag model
The Gag lattice will form a sphere of radius ∼50 nm because we design the diameter of each hexamer to be ∼8 nm and the angle between two hexamers is ∼9.2°, comparable to experimental lattices (76). Relative to the COM, the Gag has homodimerization sites at [0, 0.5 nm, 0]. The homodimerization angles are [θ1, θ2, φ1, φ2, ω] = [1.49, 1.49, 0, 0, 0]. The heterodimerization sites are set at [−2 nm, 0, 0] and [2 nm, 0, 0], such that the side of each hexamer is ∼4 nm (slightly longer because of the binding radius). The five angles are [2.62, 2.62, π, π, 0]. Our Gag monomer has been designed with binding sites for membrane lipids and RNA binding sites, so future work can effectively probe assembly kinetics as controlled by Gag binding partners and membrane localization.
Results
NERDSS enables self-assembly simulations
NERDSS expands beyond existing single-particle RD software by propagating rigid, user-defined, orientable components that assemble rigid multicomponent structures such as hexagonal clathrin lattices (Fig. 1 a). The algorithmic elements of NERDSS that are new for single-particle and spatial RD (Fig. 1 b) arise because of this propagation of rigid multisite molecules (44), which requires generalized routines to orient molecules and complexes once they bind, corresponding assessment of steric overlap and defect contacts, integration of rule-based control of each site, and generalized output for visualization and analysis (see Methods). We provide a GUI to facilitate the otherwise cumbersome construction of rigid molecules and bound states. The binding between molecules is independent of their orientation before the event (44), which is true of all single-particle methods, including those with interaction potentials, because of their spherical symmetry (14,40). NERDSS is efficient enough to simulate seconds to minutes of assembly on a single processer (Fig. S1), primarily because of algorithms that support relatively large time steps (32). Localization and assembly on membranes also can proceed efficiently and accurately by using a recent algorithm for implicit lipids, which avoids the need to propagate large populations of individual lipid binding sites (74). Although not yet optimized for parallel or GPU architectures (27,34,40), this is an active development. The significant advantages of NERDSS are the types of biological model systems it makes accessible and the accuracy of the single-particle algorithms used for propagating species, illustrated in the next section.
Validation of NERDSS kinetics and equilibria
We validate the NERDSS software first for several non-self-assembly models to verify fundamental reactions and the rule-based implementation of the RD model either against theory or other simulation methods. Zeroth- and first-order reactions reproduce the exact analytical solutions as expected, and they are not dependent on space (newly created particles are placed randomly in the volume) (Fig. 2 a). Bimolecular reactions for single-particles have been previously validated for the FPR algorithms in 3D (32), 2D (73), and transitions from 3D to 2D with flat and curved surfaces (74), but we perform additional verification here (Fig. 2 b). The FPR algorithms are derived to preserve detailed balance for reversible binding interactions, ensuring thermodynamic equilibrium is reached when systems are designed to (excluding kinetic traps). Binding to the surface can be mediated through explicit lipid sites, or we have implemented an implicit lipid model that is substantially more accurate than surface adsorption models at reproducing binding kinetics of the explicit lipid simulations (74). This model provides orders of magnitude speedups relative to explicit lipid simulations (74) and couples with all other reaction types (Fig. 2 c). Although NERDSS is compatible with arbitrarily curved surfaces (77), the user inputs currently accommodate only spherical geometries (Fig. 2 c) or rectangular volumes in which the flat membrane is on one surface. We propagate diffusion on the sphere by exactly sampling displacements on the spherical surface (Supporting Materials and Methods), and all distances between surface-bound particles are measured using geodesics.
To validate the self-assembly simulations, we follow previous work (44) by comparing our results against nonspatial rate-based simulations and by comparing our simulation results as we change the time step of integration (Fig. 2 d). Changing the time step is a valuable test for two reasons: first, as the time step decreases, the assumption that each multisite molecule is interacting with only one other site becomes more and more accurate, which improves adherence to solving only two-body problems per step (32). Second, the reaction probabilities depend nonlinearly on the time step, so if inaccuracies existed in diffusion or reactions, they show up quantifiably in the kinetics and equilibria. We performed the same validation for assembly in 2D (Fig. S4). For self-assembly, we must introduce a new loop-closure reaction type (Methods), and we show how the choice of this parameter, which controls the binding free energy of loop-closure events, impacts the equilibrium (Fig. S3). We cannot compare NERDSS self-assembly with other single-particle software because they do not capture orientation-dependent interactions and thus cannot form a structured lattice or cage. It is informative to compare NERDSS to NFSim (8), which lacks spatial resolution but simulates rate-based models that can form multiprotein assemblies via multisite species and rule-based modeling (Figs. S2 and S3). In weak-binding regimes, they agree exactly, but with stronger binding, the structure resolution in NERDSS drives spatially localized interactions and captures several elements of self-assembly not possible with NFSim, including lattice geometries and topologies and their growth patterns.
To test the rule-based implementation with a more complex nonassembly model, we simulated a previously developed minimal model of a circadian genetic oscillator (53) with NERDSS, as well as with PDEs and the stochastic simulation algorithm (via Virtual Cell software (6)), showing quantitative agreement across all models (Fig. 3). Our simulation recapitulates how the expression of two proteins, an activator protein (A) and repressor protein (R), can be coupled to produce robust oscillations of both proteins and their bound complex. For our NERDSS model, when messenger RNA and proteins are created, they are placed adjacent to the molecule creating them. All rates were accelerated by a factor of 3600 relative to the original rates because of computational costs, such that oscillations occurred over timescales of seconds rather than hours. This change made the binding events diffusion limited, meaning they could become sensitive to the spatial distribution of particles. However, because the species mix rapidly and the volume is not too large (Video S5), the kinetics were essentially insensitive to diffusion and the spatial dimension, indicating the resolution offered by NERDSS is not necessary for this model. We note the PDE and nonspatial simulations have significantly faster CPU time.
NERDSS tests clathrin cage assembly designed by four distinct models of clathrin-clathrin interactions
Clathrin, a 600 kDa trimeric protein, assembles into both flat and spherical lattices in vivo and in vitro (1,78,79), although in vivo this process only occurs on the plasma membrane, with the requirement of a host of accessory and adaptor proteins, because clathrin itself does not bind the membrane. Clathrin trimers are observed to assemble in solution in vitro either at low pH or in the presence of clathrin terminal-binding adaptor proteins (80,81). We model distinct assembly conditions by changing binding strengths, cooperativity, and binding geometries of clathrin-only solutions, contrasting how multiple observables vary with distinct model parameterizations at a fixed (1.3 μM) clathrin concentration. The baseline model has a KD for leg-leg binding of 100 μM, which is comparable to experimental estimates of trimer dimerization (82). Very little binding is observed in solution at this concentration (Fig. 4 a), which is consistent with in vitro studies of clathrin in solution at physiological pH (81). The same results are observed in nonspatial simulations (Fig. S3).
By simply strengthening the KD, the flat clathrin lattices start to assemble rapidly, to the point that a KD = 1 μM produces a similar equilibrium in bound legs as 0.2 μM (Figs. 2 d, 4 b, and S10). As we strengthen KD, average size of complexes transitions to a giant component at ∼10 to 1 μM. With the formation of closed loops in the clathrin lattice, we can introduce positive or negative cooperativity, due to the formation of two simultaneous bound interfaces within the closed loop (Methods and Supporting Materials and Methods). This can thus alter the final equilibrium for a given dimerization KD, favoring fewer bonds and more remodeling (Figs. S3 and S10), and will also shift the value of kon when nucleation of complexes progresses to complete lattices.
By instead introducing cooperativity in binding between clathrins such that the rate depends on whether a monomer is already bound to another clathrin, flat lattices assemble after a delay, followed by rapid growth (Fig. 4 c). This model thus has three on-rates for clathrin-clathrin binding, with strongest binding between clathrins that are already bound to others. This growth model results in a broader distribution of lattice sizes (Fig. 4 c; Video S1).
By taking the model of Fig. 4 b (KD = 0.2 μM) and altering the geometry of a clathrin monomer to tilt legs 10° below the plane (100° pucker), spherical cages form (Video S2). The kinetics of leg-leg binding remains very similar to Fig. 4 b, but the sizes of the complexes are smaller due to spatial constraints of closed spheres and steric overlap (Fig. 4 d). The extent of the tilt controls the radius of the sphere, with this 100° clathrin producing a sphere of R ∼45 nm containing ∼100 clathrin. A 95° clathrin, in contrast, produce a sphere of R ∼92 nm, estimated to contain ∼500 trimers, and a 110° degree clathrin produces a sphere of R ∼25 containing ∼20 trimers (Fig. S7). We note that the rigid clathrin trimer cannot form a perfect spherical lattice, and thus, defects appear even within a single closed hexagon. In the most puckered clathrin (110°), the geometry much better accommodates pentagons, not hexagons (Fig. S7). Biomolecules typically have the flexibility to bend and form contacts by supporting a distribution of structural geometries, and clathrin is known to form cages in solution and on the membrane at a range of curvatures (75). We thus allow sites within a complex to bind one another even when they are not at perfect contact (Methods). The choice of cutoff distances for binding within a complex and for identifying steric overlap (which is more difficult with defects) does impact the stability of the cages and whether they prevent unphysical overlap (Figs. S8 and S9); more sophisticated treatments in the future will improve the treatment of defect formation (Methods).
NERDSS probes lattice assembly driven by localization to the membrane
Increasing component concentrations is another natural mechanism to nucleate assembly. In Fig. 5, we show how localization to the 2D membrane surface can, on its own, nucleate clathrin lattices by increasing the effective concentration of clathrin relative to the 3D solution (Video S3). Clathrin does not bind the membrane directly but can be localized to the surface by an adaptor protein. The clathrin-clathrin interaction is set here to a weak value of 100 μM (Fig. 5 a; (82)), and thus, minimal binding occurs in solution. Once the clathrin have been localized to the surface, however, they can assemble into a lattice that is stabilized by relatively weak clathrin-clathrin contacts that are favorable because of the small search space available on the 2D surface (45). Although binding to many adaptor proteins is known to stabilize clathrin-clathrin interactions (80), we do not incorporate this cooperativity here to isolate the role of 2D localization in nucleating assembly.
Enzymes can drive lattice disassembly by removing links to the membrane
By removing links between clathrin and the membrane, we can drive the clathrin assembly back into solution where our clathrin lattice is no longer stable. Physiologically, this is achievable by changing the phosphorylation state of the lipid PI(4,5)P2, which is essential for localizing adaptor proteins, and thus clathrin, to the membrane (83). Thus, without altering the clathrin or the adaptor proteins directly, we can drive disassembly of the lattice. We include here 10 copies of a lipid phosphatase (e.g., synaptojanin), which converts PI(4,5)P2 to PI(4)P irreversibly. Critically, the adaptor protein cannot bind to PI(4)P, thus removing its link to the surface. Importantly, the phosphatase synaptojanin can be localized to sites of clathrin-coated structures in vivo through protein interactions with the adaptor protein AP-285, thus allowing it to act in 2D to rapidly dephosphorylate unbound PI(4,5)P2. Although in vivo, it has been shown to play an important role in dephosphorylating PI(4,5)P2 after fission from the membrane (59), we demonstrate here how it is capable of driving disassembly of plasma-membrane-bound clathrin-coated lattices, at least in this in vitro-type model.
We find the timescales of the lattice disassembly are sensitive to the binding and unbinding kinetics of all protein-protein and protein-lipid interactions (Fig. 5). In particular, although the phosphatases rapidly dephosphorylate unbound PI(4,5)P2, the adaptor proteins protect the subset of lipids they are bound to, and their links are cut on a timescale that is determined by unbinding of the adaptor-lipid interactions. Because clathrin can bind up to three adaptor proteins, its dissociation from the membrane then also depends on dissociation of all three adaptors from the surface. Although this model is not fully parameterized to reproduce experiment, it illustrates how localization, enzymatic activity, and binding kinetics can be tuned to control assembly and disassembly. Equilibrium constants used are within reasonable ranges of experimental measurements for endocytic proteins (collected here (45)).
Model building requires orientations for each pairwise binding reaction: application to viral assembly in solution
To illustrate the model design process, we develop a coarse model of the Gag monomer, a retroviral protein that dimerizes with itself and oligomerizes into a hexagonal lattice on the plasma membrane of infected cells to form the so-called immature virion (72). To form a lattice of hexamers from these approximately linear monomers, two distinct types of dimers are sufficient, a homodimer and heterodimer interaction, with relative orientations defined by five separate angles for each dimer (Fig. 6). The heterodimer forms between head-to-tail monomers, allowing polymer-like growth at a specific hexameric geometry. The lattice forms with sizes of hexameric subunits (∼8 nm in diameter) and a spherical curvature (R ∼50 nm) that would support an inclusion of ∼2500 Gags in the full spherical shell, as observed experimentally (76). The Gag-Gag homodimerization is stronger than other Gag-Gag contacts at an estimated 1–10 μM and appears to form first (84). The weaker Gag-Gag hexameric interactions are known to be autoinhibited before binding to either RNA, negatively charged analogs, or the membrane (72,84); however, here we treat all interactions as constitutive for simplicity. Our Gag monomer has been designed with binding sites for membrane lipids and RNA binding sites, so future work can effectively probe assembly kinetics as controlled by Gag binding partners and membrane localization.
To illustrate the functionality of NERDSS, we initialize the volume with zero Gag monomers. We then titrate them in using a zeroth-order reaction, for which they are placed randomly in the simulation volume. Thus, the concentration slowly increases, favoring slower nucleation and more efficient growth of the Gag into a small number of larger assemblies (Video S4). Monomeric Gag is also degraded, although assembled Gag is not. Thus, the long-time behavior consists of Gag as stable spheroids or as short-lived (∼1 s) monomers. As designed, the Gag is able to assemble into large spheroids entirely because of the two specified reactions (Fig. 6). The assembly kinetics depends on the titration speed, which can provide additional knobs to help optimize the model against experimental data.
Discussion
NERDSS has several advantageous features that make it a powerful and immediately useful tool for cell-scale simulations. First, NERDSS is transferrable between distinct systems thanks to its rate-based interaction framework, which avoids the time-consuming parameterization of energy functions often hard coded for specific assembly systems (19). Although protein geometries and orientations of bound complexes in NERDSS are system specific, we provide a GUI to facilitate user design of proteins and their bound states. Second, unlike existing spatial rate-based approaches (33), the built-in molecular structure of NERDSS not only enforces excluded volume of binding sites but evaluates whether steric overlap or volume restrictions would prevent the formation of unphysical assembly structures. Third, it uses the FPR algorithms (32,44,73) to efficiently propagate species while retaining accurate rates of association and dissociation, allowing for current simulations of timescales on the order of seconds to minutes. NERDSS converts carefully between microscopic rates used in FPR and macroscopic rates commonly defined from experiment, with extensive validation (44). Fourth, NERDSS uses a BioNetGen Language-style syntax (46), and models built in other software packages using similar syntax can be ported to NERDSS with minimal alterations, making it available for immediate use. In this combination of structural resolution, efficient propagation and accurate treatment of association rates, and user-friendly input file syntax, NERDSS is a unique tool for simulating equilibrium and nonequilibrium self-assembly and other cell-scale phenomena.
A current limitation of NERDSS is that all species must be rigid bodies, and thus, assembly defects that arise within complexes that cannot perfectly tile surfaces (such as rigid bodies on a sphere) are accommodated approximately by using a distance tolerance. Molecule flexibility (14) is a natural extension beyond rigid molecules that would improve defects and also enable better treatment of disordered regions and genomes. NERDSS also lacks any orientation dependence on binding between complexes before association, which is an approximation that can result in unphysical binding events, particularly between large multicomponent intermediates, as also discussed previously (44). Although we have implemented options to reject these moves, introducing orientation dependence to reaction probabilities (13,85) would be more physically realistic and could allow NERDSS to predict a range of structures rather than the ones predetermined by the defined geometries, although this would require more expense or approximate. NERDSS is designed to be extensible to not only users but to developers expanding functionality. The FPR algorithms support, for example, the implementation of more elaborate physical models that introduce electrostatic interactions between particles (13,32). A particularly exciting future development is integration of NERDSS with continuum membrane models (74) to allow more realistic simulations of vesicle and virion formation dynamics and coupling of assembly to mechanical force generation, which is a key component, for example, in cytoskeletal assembly models (86) and continuum membrane remodeling studies (87, 88, 89).
NERDSS is designed to be more user friendly and portable than previous versions of the FPR algorithms (44) with the addition of rule-based modeling, the formatting of input and output files, and the addition of the GUI, all available open source here: github.com/mjohn218/NERDSS. However, it does not match the usability of software such as Virtual Cell in terms of model setup and analysis, including defining cell geometries from experimental data (6,52). It also does not match the level of parallelization or optimization of software such as Lattice Microbes (27). These will be important future software developments to support a broad and diverse user base.
We have introduced self-assembly models here motivated by CME and virion formation in cells as a foundation to extend these models to additional components and experimental comparison. We show how assembly can be nucleated despite weak clathrin-clathrin interactions through cooperativity or through reduction of dimensionality to the membrane (3D to 2D) (45). With NERDSS, we predicted here how the dephosphorylation of essential plasma membrane lipids, which occurs at sites of vesicle budding (59), could de-stabilize clathrin-coated structures and drive their disassembly. Ultimately, NERDSS has encoded as a core feature the ability to resolve relatively fast processes over long timescales and individual proteins over long length scales, simulating otherwise intractable protein assembly dynamics. This provides immediate use in helping overcome the challenges of understanding or designing self-assembling structures in biology.
Author Contributions
M.J.V., Y.F., and M.E.J. wrote the software. Y.F. implemented the implicit lipid and spherical model to the software. O.N.Y. and S.L. designed and developed the GUI. M.J.V., Y.F., and M.E.J. ran simulations, collected results, and wrote the article.
Acknowledgments
We thank Danny Evans for setting up the Gag model, Ipsita Saha for providing feedback on Gag simulations, Nomongo Dorjsuren for formatting the output files from the GUI, and Dr. Sikao Guo for software updates.
M.E.J. gratefully acknowledges funding from a National Science Foundation CAREER Award #1753174 and a National Institutes of Health MIRA Award R35GM133644. We acknowledge use of the MARCC supercomputer at Johns Hopkins, and the XSEDE supercomputer Stampede2 through XRAC MCB150059.
Editor: Padmini Rangamani.
Footnotes
Matthew J. Varga and Yiben Fu contributed equally to this work.
Supporting Material can be found online at https://doi.org/10.1016/j.bpj.2020.05.002.
Supporting Material
References
- 1.Bucher D., Frey F., Boulant S. Clathrin-adaptor ratio and membrane tension regulate the flat-to-curved transition of the clathrin coat during endocytosis. Nat. Commun. 2018;9:1109. doi: 10.1038/s41467-018-03533-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Bartocci E., Lió P. Computational modeling, formal analysis, and tools for systems biology. PLoS Comput. Biol. 2016;12:e1004591. doi: 10.1371/journal.pcbi.1004591. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Lewis J. From signals to patterns: space, time, and mathematics in developmental biology. Science. 2008;322:399–403. doi: 10.1126/science.1166154. [DOI] [PubMed] [Google Scholar]
- 4.Jilkine A., Edelstein-Keshet L. A comparison of mathematical models for polarization of single eukaryotic cells in response to guided cues. PLoS Comput. Biol. 2011;7:e1001121. doi: 10.1371/journal.pcbi.1001121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gunawardena J. Models in biology: ‘accurate descriptions of our pathetic thinking’. BMC Biol. 2014;12:29. doi: 10.1186/1741-7007-12-29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Moraru I.I., Schaff J.C., Loew L.M. Virtual Cell modelling and simulation software environment. IET Syst. Biol. 2008;2:352–362. doi: 10.1049/iet-syb:20080102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Angermann B.R., Klauschen F., Meier-Schellersheim M. Computational modeling of cellular signaling processes embedded into dynamic spatial contexts. Nat. Methods. 2012;9:283–289. doi: 10.1038/nmeth.1861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Sneddon M.W., Faeder J.R., Emonet T. Efficient modeling, simulation and coarse-graining of biological complexity with NFsim. Nat. Methods. 2011;8:177–183. doi: 10.1038/nmeth.1546. [DOI] [PubMed] [Google Scholar]
- 9.Sanft K.R., Wu S., Petzold L.R. StochKit2: software for discrete stochastic simulation of biochemical systems with events. Bioinformatics. 2011;27:2457–2458. doi: 10.1093/bioinformatics/btr401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Sweeney B., Zhang T., Schwartz R. Exploring the parameter space of complex self-assembly through virus capsid models. Biophys. J. 2008;94:772–783. doi: 10.1529/biophysj.107.107284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Zlotnick A. To build a virus capsid. An equilibrium model of the self assembly of polyhedral protein complexes. J. Mol. Biol. 1994;241:59–67. doi: 10.1006/jmbi.1994.1473. [DOI] [PubMed] [Google Scholar]
- 12.Endres D., Zlotnick A. Model-based analysis of assembly kinetics for virus capsids or other spherical polymers. Biophys. J. 2002;83:1217–1230. doi: 10.1016/S0006-3495(02)75245-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Zhou H.X. Brownian dynamics study of the influences of electrostatic interaction and diffusion on protein-protein association kinetics. Biophys. J. 1993;64:1711–1726. doi: 10.1016/S0006-3495(93)81543-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Michalski P.J., Loew L.M. SpringSaLaD: a spatial, particle-based biochemical simulation platform with excluded volume. Biophys. J. 2016;110:523–529. doi: 10.1016/j.bpj.2015.12.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Gruenert G., Ibrahim B., Dittrich P. Rule-based spatial modeling with diffusing, geometrically constrained molecules. BMC Bioinformatics. 2010;11:307. doi: 10.1186/1471-2105-11-307. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Turing A.M. The chemical basis of morphogenesis. Philos Trans R Soc B. 1953;237:37–72. [Google Scholar]
- 17.Hagan M.F. Modeling viral capsid assembly. Adv. Chem. Phys. 2014;155:1–68. doi: 10.1002/9781118755815.ch01. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Grime J.M.A., Dama J.F., Voth G.A. Coarse-grained simulation reveals key features of HIV-1 capsid self-assembly. Nat. Commun. 2016;7:11568. doi: 10.1038/ncomms11568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Giani M., den Otter W.K., Briels W.J. Early stages of clathrin aggregation at a membrane in coarse-grained simulations. J. Chem. Phys. 2017;146:155102. doi: 10.1063/1.4979985. [DOI] [PubMed] [Google Scholar]
- 20.Hall D.M., Grason G.M. How geometric frustration shapes twisted fibres, inside and out: competing morphologies of chiral filament assembly. Interface Focus. 2017;7:20160140. doi: 10.1098/rsfs.2016.0140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Hagan M.F., Elrad O.M., Jack R.L. Mechanisms of kinetic trapping in self-assembly and phase transformation. J. Chem. Phys. 2011;135:104115. doi: 10.1063/1.3635775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Perlmutter J.D., Mohajerani F., Hagan M.F. Many-molecule encapsulation by an icosahedral shell. eLife. 2016;5:e14078. doi: 10.7554/eLife.14078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Wilber A.W., Doye J.P., Lewis A.C. Monodisperse self-assembly in a model with protein-like interactions. J. Chem. Phys. 2009;131:175102. doi: 10.1063/1.3243581. [DOI] [PubMed] [Google Scholar]
- 24.Saglam A.S., Chong L.T. Highly efficient computation of the basal kon using direct simulation of protein-protein association with flexible molecular models. J. Phys. Chem. B. 2016;120:117–122. doi: 10.1021/acs.jpcb.5b10747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Saglam A.S., Chong L.T. Protein-protein binding pathways and calculations of rate constants using fully-continuous, explicit-solvent simulations. Chem. Sci. (Camb.) 2018;10:2360–2372. doi: 10.1039/c8sc04811h. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Martinez M., Bruce N.J., Wade R.C. SDA 7: a modular and parallel implementation of the simulation of diffusional association software. J. Comput. Chem. 2015;36:1631–1645. doi: 10.1002/jcc.23971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Roberts E., Stone J.E., Luthey-Schulten Z. Lattice microbes: high-performance stochastic simulation method for the reaction-diffusion master equation. J. Comput. Chem. 2013;34:245–255. doi: 10.1002/jcc.23130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Drawert B., Hellander S., Petzold L. A framework for discrete stochastic simulation on 3D moving boundary domains. J. Chem. Phys. 2016;145:184113. doi: 10.1063/1.4967338. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Drawert B., Engblom S., Hellander A. URDME: a modular framework for stochastic simulation of reaction-transport processes in complex geometries. BMC Syst. Biol. 2012;6:76. doi: 10.1186/1752-0509-6-76. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Earnest T.M., Lai J., Luthey-Schulten Z. Toward a whole-cell model of ribosome biogenesis: kinetic modeling of SSU assembly. Biophys. J. 2015;109:1117–1135. doi: 10.1016/j.bpj.2015.07.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ghaemi Z., Peterson J.R., Luthey-Schulten Z. An in-silico human cell model reveals the influence of spatial organization on RNA splicing. PLoS Comput. Biol. 2020;16:e1007717. doi: 10.1371/journal.pcbi.1007717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Johnson M.E., Hummer G. Free-propagator reweighting integrator for single-particle dynamics in reaction-diffusion models of heterogeneous protein-protein interaction systems. Phys. Rev. X. 2014;4:031037. doi: 10.1103/PhysRevX.4.031037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.van Zon J.S., ten Wolde P.R. Simulating biochemical networks at the particle level and in time and space: Green’'s function reaction dynamics. Phys. Rev. Lett. 2005;94:128103. doi: 10.1103/PhysRevLett.94.128103. [DOI] [PubMed] [Google Scholar]
- 34.Andrews S.S. Smoldyn: particle-based simulation with rule-based modeling, improved molecular interaction and a library interface. Bioinformatics. 2017;33:710–717. doi: 10.1093/bioinformatics/btw700. [DOI] [PubMed] [Google Scholar]
- 35.Schoneberg J., Noé F. ReaDDy--a software for particle-based reaction-diffusion dynamics in crowded cellular environments. PLoS One. 2013;8:e74261. doi: 10.1371/journal.pone.0074261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Takahashi K., Tanase-Nicola S., ten Wolde P.R. Spatio-temporal correlations can drastically change the response of a MAPK pathway. Proc. Natl. Acad. Sci. USA. 2010;107:2473–2478. doi: 10.1073/pnas.0906885107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Gillespie D.T., Seitaridou E., Gillespie C.A. The small-voxel tracking algorithm for simulating chemical reactions among diffusing molecules. J. Chem. Phys. 2014;141:234115. doi: 10.1063/1.4903962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Arjunan S.N.V., Takahashi K. Multi-algorithm particle simulations with spatiocyte. Methods Mol. Biol. 2017;1611:219–236. doi: 10.1007/978-1-4939-7015-5_16. [DOI] [PubMed] [Google Scholar]
- 39.Sokolowski T.R., Paijmans J., Ten Wolde P.R. eGFRD in all dimensions. J. Chem. Phys. 2019;150:054108. doi: 10.1063/1.5064867. [DOI] [PubMed] [Google Scholar]
- 40.Hoffmann M., Fröhner C., Noé F. ReaDDy 2: fast and flexible software framework for interacting-particle reaction dynamics. PLoS Comput. Biol. 2019;15 doi: 10.1371/journal.pcbi.1006830. e1006830. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Chattaraj A., Youngstrom M., Loew L.M. The interplay of structural and cellular biophysics controls clustering of multivalent molecules. Biophys. J. 2019;116:560–572. doi: 10.1016/j.bpj.2019.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Antosiewicz J., McCammon J.A. Electrostatic and hydrodynamic orientational steering effects in enzyme-substrate association. Biophys. J. 1995;69:57–65. doi: 10.1016/S0006-3495(95)79874-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Vijaykumar A., Ouldridge T.E., Bolhuis P.G. Multiscale simulations of anisotropic particles combining molecular dynamics and Green’s function reaction dynamics. J. Chem. Phys. 2017;146:114106. doi: 10.1063/1.4977515. [DOI] [PubMed] [Google Scholar]
- 44.Johnson M.E. Modeling the self-assembly of protein complexes through a rigid-body rotational reaction-diffusion algorithm. J. Phys. Chem. B. 2018;122:11771–11783. doi: 10.1021/acs.jpcb.8b08339. [DOI] [PubMed] [Google Scholar]
- 45.Yogurtcu O.N., Johnson M.E. Cytosolic proteins can exploit membrane localization to trigger functional assembly. PLoS Comput. Biol. 2018;14 doi: 10.1371/journal.pcbi.1006031. e1006031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Blinov M.L., Faeder J.R., Hlavacek W.S. BioNetGen: software for rule-based modeling of signal transduction based on the interactions of molecular domains. Bioinformatics. 2004;20:3289–3291. doi: 10.1093/bioinformatics/bth378. [DOI] [PubMed] [Google Scholar]
- 47.Faeder J.R., Blinov M.L., Hlavacek W.S. Rule-based modeling of biochemical systems with BioNetGen. Methods Mol. Biol. 2009;500:113–167. doi: 10.1007/978-1-59745-525-1_5. [DOI] [PubMed] [Google Scholar]
- 48.Deeds E.J., Krivine J., Fontana W. Combinatorial complexity and compositional drift in protein interaction networks. PLoS One. 2012;7 doi: 10.1371/journal.pone.0032032. e32032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Boutillier P., Maasha M., Fontana W. The Kappa platform for rule-based modeling. Bioinformatics. 2018;34:i583–i592. doi: 10.1093/bioinformatics/bty272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Hlavacek W.S., Faeder J.R., Fontana W. Rules for modeling signal-transduction systems. Sci. STKE. 2006;2006:re6. doi: 10.1126/stke.3442006re6. [DOI] [PubMed] [Google Scholar]
- 51.Smith A.M., Xu W., Marai G.E. RuleBender: integrated modeling, simulation and visualization for rule-based intracellular biochemistry. BMC Bioinformatics. 2012;13(Suppl 8):S3. doi: 10.1186/1471-2105-13-S8-S3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Schaff J.C., Vasilescu D., Blinov M.L. Rule-based modeling with virtual cell. Bioinformatics. 2016;32:2880–2882. doi: 10.1093/bioinformatics/btw353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Vilar J.M., Kueh H.Y., Leibler S. Mechanisms of noise-resistance in genetic oscillators. Proc. Natl. Acad. Sci. USA. 2002;99:5988–5992. doi: 10.1073/pnas.092133899. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Ilie I.M., den Otter W.K., Briels W.J. Rotational Brownian dynamics simulations of clathrin cage formation. J. Chem. Phys. 2014;141:065101. doi: 10.1063/1.4891306. [DOI] [PubMed] [Google Scholar]
- 55.Ayton G.S., Voth G.A. Multiscale computer simulation of the immature HIV-1 virion. Biophys. J. 2010;99:2757–2765. doi: 10.1016/j.bpj.2010.08.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Schoen A.P., Cordella N., Heilshorn S.C. Dynamic remodelling of disordered protein aggregates is an alternative pathway to achieve robust self-assembly of nanostructures. Soft Matter. 2013;9:9137–9145. [Google Scholar]
- 57.VanDersarl J.J., Mehraeen S., Melosh N.A. Rheology and simulation of 2-dimensional clathrin protein network assembly. Soft Matter. 2014;10:6219–6227. doi: 10.1039/c4sm00025k. [DOI] [PubMed] [Google Scholar]
- 58.Antonny B., Burd C., Schmid S. Membrane fission by dynamin: what we know and what we need to know. EMBO J. 2016;35:2270–2284. doi: 10.15252/embj.201694613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.He K., Marsland R., III, Kirchhausen T. Dynamics of phosphoinositide conversion in clathrin-mediated endocytic traffic. Nature. 2017;552:410–414. doi: 10.1038/nature25146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Chang-Ileto B., Frere S.G., Di Paolo G. Synaptojanin 1-mediated PI(4,5)P2 hydrolysis is modulated by membrane curvature and facilitates membrane fission. Dev. Cell. 2011;20:206–218. doi: 10.1016/j.devcel.2010.12.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Schöneberg J., Lehmann M., Noé F. Lipid-mediated PX-BAR domain recruitment couples local membrane constriction to endocytic vesicle fission. Nat. Commun. 2017;8:15873. doi: 10.1038/ncomms15873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Schmid E.M., McMahon H.T. Integrating molecular and network biology to decode endocytosis. Nature. 2007;448:883–888. doi: 10.1038/nature06031. [DOI] [PubMed] [Google Scholar]
- 63.Traub L.M. Tickets to ride: selecting cargo for clathrin-regulated internalization. Nat. Rev. Mol. Cell Biol. 2009;10:583–596. doi: 10.1038/nrm2751. [DOI] [PubMed] [Google Scholar]
- 64.Sochacki K.A., Dickey A.M., Taraska J.W. Endocytic proteins are partitioned at the edge of the clathrin lattice in mammalian cells. Nat. Cell Biol. 2017;19:352–361. doi: 10.1038/ncb3498. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.den Otter W.K., Renes M.R., Briels W.J. Asymmetry as the key to clathrin cage assembly. Biophys. J. 2010;99:1231–1238. doi: 10.1016/j.bpj.2010.06.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Cordella N., Lampo T.J., Spakowitz A.J. Membrane indentation triggers clathrin lattice reorganization and fluidization. Soft Matter. 2015;11:439–448. doi: 10.1039/c4sm01650e. [DOI] [PubMed] [Google Scholar]
- 67.Ramanan V., Agrawal N.J., Radhakrishnan R. Systems biology and physical biology of clathrin-mediated endocytosis. Integr. Biol. 2011;3:803–815. doi: 10.1039/c1ib00036e. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Holland D.O., Johnson M.E. Stoichiometric balance of protein copy numbers is measurable and functionally significant in a protein-protein interaction network for yeast endocytosis. PLoS Comput. Biol. 2018;14 doi: 10.1371/journal.pcbi.1006022. e1006022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Banerjee A., Berezhkovskii A., Nossal R. Stochastic model of clathrin-coated pit assembly. Biophys. J. 2012;102:2725–2730. doi: 10.1016/j.bpj.2012.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Sorokin A., Heil K.F., Sorokina O. Rule-based modelling provides an extendable framework for comparing candidate mechanisms underpinning clathrin polymerisation. Sci. Rep. 2018;8:5658. doi: 10.1038/s41598-018-23829-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Wright E.R., Schooler J.B., Jensen G.J. Electron cryotomography of immature HIV-1 virions reveals the structure of the CA and SP1 Gag shells. EMBO J. 2007;26:2218–2226. doi: 10.1038/sj.emboj.7601664. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Sundquist W.I., Kräusslich H.G. HIV-1 assembly, budding, and maturation. Cold Spring Harb. Perspect. Med. 2012;2:a006924. doi: 10.1101/cshperspect.a006924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Yogurtcu O.N., Johnson M.E. Theory of bi-molecular association dynamics in 2D for accurate model and experimental parameterization of binding rates. J. Chem. Phys. 2015;143:084117. doi: 10.1063/1.4929390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Fu Y., Yogurtcu O.N., Johnson M.E. An implicit lipid model for efficient reaction-diffusion simulations of protein binding to surfaces of arbitrary topology. J. Chem. Phys. 2019;151:124115. doi: 10.1063/1.5120516. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Morris K.L., Jones J.R., Smith C.J. Cryo-EM of multiple cage architectures reveals a universal mode of clathrin self-assembly. Nat. Struct. Mol. Biol. 2019;26:890–898. doi: 10.1038/s41594-019-0292-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Briggs J.A., Riches J.D., Kräusslich H.G. Structure and assembly of immature HIV. Proc. Natl. Acad. Sci. USA. 2009;106:11090–11095. doi: 10.1073/pnas.0903535106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Feng F., Klug W.S. Finite element modeling of lipid bilayer membranes. J. Comput. Phys. 2006;220:394–408. [Google Scholar]
- 78.Kelly B.T., Graham S.C., Owen D.J. Clathrin adaptors. AP2 controls clathrin polymerization with a membrane-activated switch. Science. 2014;345:459–463. doi: 10.1126/science.1254836. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Saffarian S., Cocucci E., Kirchhausen T. Distinct dynamics of endocytic clathrin-coated pits and coated plaques. PLoS Biol. 2009;7 doi: 10.1371/journal.pbio.1000191. e1000191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Zaremba S., Keen J.H. Assembly polypeptides from coated vesicles mediate reassembly of unique clathrin coats. J. Cell Biol. 1983;97:1339–1347. doi: 10.1083/jcb.97.5.1339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Pearse B.M., Crowther R.A. Structure and assembly of coated vesicles. Annu. Rev. Biophys. Biophys. Chem. 1987;16:49–68. doi: 10.1146/annurev.bb.16.060187.000405. [DOI] [PubMed] [Google Scholar]
- 82.Wakeham D.E., Chen C.Y., Brodsky F.M. Clathrin self-assembly involves coordinated weak interactions favorable for cellular regulation. EMBO J. 2003;22:4980–4990. doi: 10.1093/emboj/cdg511. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.McMahon H.T., Boucrot E. Molecular mechanism and physiological functions of clathrin-mediated endocytosis. Nat. Rev. Mol. Cell Biol. 2011;12:517–533. doi: 10.1038/nrm3151. [DOI] [PubMed] [Google Scholar]
- 84.Datta S.A., Zhao Z., Rein A. Interactions between HIV-1 Gag molecules in solution: an inositol phosphate-mediated switch. J. Mol. Biol. 2007;365:799–811. doi: 10.1016/j.jmb.2006.10.072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Shoup D., Lipari G., Szabo A. Diffusion-controlled bimolecular reaction rates. The effect of rotational diffusion and orientation constraints. Biophys. J. 1981;36:697–714. doi: 10.1016/S0006-3495(81)84759-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Mund M., van der Beek J.A., Ries J. Systematic nanoscale analysis of endocytosis links efficient vesicle formation to patterned actin nucleation. Cell. 2018;174:884–896.e17. doi: 10.1016/j.cell.2018.06.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Rangamani P., Mandadap K.K., Oster G. Protein-induced membrane curvature alters local membrane tension. Biophys. J. 2014;107:751–762. doi: 10.1016/j.bpj.2014.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Wu Z., Su M., Liu J. Membrane shape-mediated wave propagation of cortical protein dynamics. Nat. Commun. 2018;9:136. doi: 10.1038/s41467-017-02469-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Akamatsu M., Vasan R., Drubin D.G. Principles of self-organization and load adaptation by the actin cytoskeleton during clathrin-mediated endocytosis. eLife. 2020;9 doi: 10.7554/eLife.49840. e49840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Humphrey W., Dalke A., Schulten K. VMD: visual molecular dynamics. J. Mol. Graph. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. 27–38. [DOI] [PubMed] [Google Scholar]
- 91.Stukowski A. Visualization and analysis of atomistic simulation data with OVITO-the open visualization tool. Model Simul. Mater. Sci. Eng. 2010;18:015012. Published online December 15, 2009. [Google Scholar]
- 92.Yoon Y., Lee P.J., Cho W. In situ quantitative imaging of cellular lipids using molecular sensors. Nat. Chem. 2011;3:868–874. doi: 10.1038/nchem.1163. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.