Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2022 Jan 18;119(4):e2116373119. doi: 10.1073/pnas.2116373119

The time complexity of self-assembly

Florian M Gartner a,b, Isabella R Graf a,b,1, Erwin Frey a,b,2
PMCID: PMC8795547  PMID: 35042812

Significance

An important limiting factor for self-assembly processes is the time it takes to assemble large structures with high yield. While equilibrium self-assembly systems slowly relax toward a state of minimal free energy, nonequilibrium systems offer various ways to control assembly processes and to optimize their time efficiency. We show that these different control scenarios can informatively be characterized by their time complexity, i.e., their scaling of the assembly time with the structure size, analogous to algorithms for computational problems. Especially for large structures, differences in the time complexity of the scenarios lead to strongly diverging time efficiencies. Most significantly, we show that by effectively regulating the supply of constituents, high resource and time efficiency can be achieved for self-assembly processes.

Keywords: nonequilibrium self-assembly, time efficiency, time complexity, self-assembly scenario, supply control

Abstract

Time efficiency of self-assembly is crucial for many biological processes. Moreover, with the advances of nanotechnology, time efficiency in artificial self-assembly becomes ever more important. While structural determinants and the final assembly yield are increasingly well understood, kinetic aspects concerning the time efficiency, however, remain much more elusive. In computer science, the concept of time complexity is used to characterize the efficiency of an algorithm and describes how the algorithm’s runtime depends on the size of the input data. Here we characterize the time complexity of nonequilibrium self-assembly processes by exploring how the time required to realize a certain, substantial yield of a given target structure scales with its size. We identify distinct classes of assembly scenarios, i.e., “algorithms” to accomplish this task, and show that they exhibit drastically different degrees of complexity. Our analysis enables us to identify optimal control strategies for nonequilibrium self-assembly processes. Furthermore, we suggest an efficient irreversible scheme for the artificial self-assembly of nanostructures, which complements the state-of-the-art approach using reversible binding reactions and requires no fine-tuning of binding energies.


Time efficiency of self-assembly plays an important role in biology. For example, virus assembly must be fast to produce many virus particles before the infected cell is eliminated by the host’s immune system (13). Moreover, as larger and ever more complex nanostructures are to be realized for technological or medical applications, time efficiency in artificial self-assembly becomes vital (4, 5). Designing self-assembly schemes that are fast and resource efficient is, however, challenging. The task amounts to finding strategies that avoid the formation of large numbers of incompatible and incomplete fragments of the desired target structure. Such kinetic traps (610) arise even when all building blocks have a high binding specificity and erroneous binding is negligible, and they become more prominent with increasing structure size. Consequently, assembly time increases with structure size.

But how exactly does the assembly time scale with the size of the target structure, and how does this scaling depend on the self-assembly scheme? What kinds of schemes optimize the assembly time? Answers to these questions will enable assembly strategies to be identified that are optimally suited for the production of large, functionally complex macromolecular structures via artificial self-assembly, a major goal in nanotechnology (4, 5, 1113). Here, we address these questions by studying the time complexity (as opposed to structural complexity) (1417) of four prototypical self-assembly scenarios, using scaling arguments and in silico modeling of the stochastic dynamics. Three of these scenarios have well-established realizations in biological and artificial self-assembly processes. The fourth strategy is a distinct idea conceptualized to achieve efficient self-assembly in a technological context by effectively regulating the supply of building blocks.

General Model and Self-Assembly Scenarios

To explore these questions in their simplest form, we consider an assembly process involving N identical copies of S different species of building blocks (monomers) and assume chemical reaction kinetics in a well-mixed fluid environment. By C=N/V we denote the concentration of monomers per species, where V is the reaction volume. As we expect the time efficiency of the assembly process to depend on the dimensionality of the structure, we investigate the assembly of linear polymers, two-dimensional sheets, and three-dimensional cubes of edge length L (volume S) (Fig. 1). We specify the system as a fully heterogeneous system with S distinct species because this case defines the most general self-assembly process that allows for the largest set of different assembly strategies to be applied. Our analysis shows, however, that for three of the four strategies we consider, the heterogeneity of the building blocks is indeed irrelevant in the limiting case of large particle numbers N and therefore our results hold independently of the heterogeneity of the structures. We assume that all binding reactions are specific and take place only between “neighboring” species as illustrated in Fig. 1. Erroneous binding between the constituents that would lead to malformed structures is thereby not taken into account. Following the assumptions of classical aggregation theory, we furthermore neglect interactions among oligomers (17, 18).

Fig. 1.

Fig. 1.

Schematic description of the model. N identical copies of S different species of monomers assemble into one- (1D), two- (2D), or three-dimensional (3D) heterogeneous structures of edge length L (only the 2D case is illustrated explicitly). A constant influx of monomers of species i takes place during the time interval [Ti,Ti+1α] with net influx rate Nα. Once added to the system (activated), monomers start to self-assemble. A monomer of a bulk species has two (1D), four (2D), or six (3D) possible binding partners as shown. In the 1D case, we assume periodic boundary conditions, i.e., species 1 and S can bind as well and the final structures form closed rings. In the higher-dimensional cases, we assume open boundaries, implying that the species located at the boundary have a reduced number of binding partners. Any two fitting monomers can dimerize with rate μ. Subsequent to dimerization, structures grow by attachment of single monomers with rate ν per binding site. Furthermore, monomers can detach from a cluster with rate δn=AenEB, where n is the number of bonds that need to be broken and EB the binding energy per bond. We set A=1018Cν, with C=N/V denoting the concentration of monomers per species. Our aim is to minimize the assembly time T90 when 90% of all resources are assembled into complete structures. To this end, we control particular elements of the assembly process (control parameters) and distinguish four scenarios that are defined through the respective control parameter(s). The other parameters are fixed from the following set of “default” values: Ti=0,α=,μ=ν,EB=(δn=0). Each scenario can be used to elude kinetic traps and achieve a high assembly yield but how much time do these different strategies require?

Specifically, we assume the following reaction kinetics: Any two compatible monomers can bind at rate μ, forming a dimer that serves as a nucleus for further growth by sequential addition of monomers at rate ν per binding site. Analyses of more complex reaction schemes including heterogeneous binding rates are discussed in SI Appendix and show that our conclusions are robust against model modifications. We mainly consider irreversible processes, in which structures can only grow. To assess the relevance of reversible binding, we also discuss a scenario in which individual monomers may detach from the edges of incomplete structures at a finite detachment rate δn that decreases exponentially with the number n of bonds that need to be broken: δn=AenEB (Arrhenius’ law). Here EB is the binding energy per contact (bond) in units of kBT and the constant A can typically be assumed to be large relative to the rate of reactions (19, 20). Note that we consider only the detachment of single monomer units. In the special case of one-dimensional structures, this implies that the structures grow and shrink only at the ends but do not break up in the middle. This assumption can be justified if some mechanism stabilizes linear structures in the middle (for example, if allosteric effects stabilize the interior bonds). Otherwise, fragmentation of one-dimensional structures would strongly reduce the time efficiency of their self-assembly and the result of our analysis below must be interpreted as an upper limit for the efficiency.

Once a structure contains all S species it is considered complete, and no further attachment or detachment processes occur (absorbing state). The yield of the assembly process is defined as the number of complete structures relative to their maximum possible number N.

In artificial self-assembly systems, the temporal supply of components can usually be controlled externally. This offers effective ways of regulating the assembly dynamics. To examine the potential of such supply-control strategies, we study two diametrically opposed cases. In the first case, all building blocks are supplied (activated) uniformly over a fixed time interval τ=1/α at a constant influx rate Nα. By controlling α one can regulate the concentrations of monomers and hence the effective dimerization rate. In the second case, the different species are added in a defined temporal sequence (Fig. 1), which allows one to favor specific assembly pathways by altering the order of the time points Ti at which a species i is added (supply order).

Besides the binding rate ν that fixes the timescale, we are left with four control parameters, EB,μ,α,{Ti}, which define different assembly scenarios (Fig. 1). In the reversible binding scenario, kinetic traps are avoided by “designing” monomers with an optimal binding energy EB and resulting detachment rates δn. This strategy is considered as the state of the art in DNA-brick–based self-assembly (2124) but it also plays an important role in biology, for instance for virus capsid assembly (25). In the dimerization scenario, the assembly process is controlled by the dimerization rate μ. A nucleation barrier μ/ν<1 can be implemented for example by allosteric effects or with the help of enzymes (assembly factors) and is known to play a central role in many instances of biological self-assembly (2629). In the activation scenario, the assembly efficiency is controlled by an overall influx rate α without discrimination between species. Such a control of the availability of active monomers has been suggested as a means to effectuate the self-assembly of some virus capsids (26), as well as other cellular macromolecular structures like the membrane attack complex (30). Finally, in the just-in-sequence (JIS) scenario, the monomers are supplied just in sequence with a favorably chosen assembly path by appropriate design of the supply order {Ti}. We therefore expect that these four key scenarios cover the underlying mechanisms of a large class of biologically and experimentally relevant self-assembly processes.

Time Complexity Analysis

For each of the four scenarios, we investigated the minimal time required to achieve a target yield of 90% (denoted as T90min). This requires us to identify the optimal value of the respective control parameter that maximizes the time efficiency. We are interested in the asymptotic dependence of T90min on the structure size S for S,N1. In particular, while we have shown previously (31) that for small copy numbers N, the activation scenario is strongly influenced by stochastic effects, we here assume N to be large enough so that stochastic effects can be considered irrelevant.

Maximal time efficiency can then be obtained by a proper choice of the relative frequency of nucleation and growth events: Initiation of new structures must be sufficiently retarded relative to the growth of existing structures to avoid kinetic traps (“slow nucleation principle”) (3133). The larger the target structure is, the smaller the ratio between the effective nucleation and growth rate has to be to achieve a yield of 90%. However, too small a nucleation rate severely limits the required assembly time on the other hand. The various scenarios (with the exception of the one-dimensional reversible-binding scenario, which constitutes a special case) represent different mechanisms to control the ratio between the nucleation and growth rate and therefore allow one to tune it to an optimal value.

However, the effectiveness with which the ratio is controlled, and thus the minimum assembly time that can be achieved, varies greatly between the different strategies. In all cases, we find numerically that both the optimal control parameter and the minimal assembly time exhibit power-law dependencies on the size S of the target structure (Fig. 2). The corresponding exponents are referred to as the control parameter exponent ϕ and the (time) complexity exponent θ, respectively. Both exponents are scenario specific and, moreover, depend on the dimensionality of the assembled structure, as is discussed in detail below for each scenario.

Fig. 2.

Fig. 2.

Time complexity. (A–C) The minimal assembly time T90min in the four scenarios in dependence of the size S of the target structure as obtained from stochastic simulations for different dimensionalities of the structures: (A) 1D, (B) 2D, and (C) 3D. The reactive timescale (Cν)1 defines the basic timescale in the system, which depends on the initial concentration C of monomers per species. Hence, the minimal assembly time is measured in units of (Cν)1. Each data point represents an average over several independent realizations of the stochastic simulation for the same (optimal) parameter value, determined by a parameter sweep (SI Appendix, section 1). We find power-law dependencies of the minimal assembly time on the size of the target structure. The corresponding time complexity exponents θsim resulting from the simulations are summarized in the tables in A–C together with their theoretic estimates θth (which we derive in SI Appendix, section 3). We indicate the scenarios as rev, reversible binding; act, activation; jis, just-in-sequence; and dim, dimerization.

To derive analytical estimates for the exponents, we use that the optimal ratio between nucleation and growth rate should approximately scale inversely with the structure size,

 number of nucleation events per time  number of attached monomers per time S1. [1]

A detailed mathematical evaluation of the scaling of the terms on the left-hand side with the system parameters can be found in SI Appendix, section 3. In the main text, we restrict ourselves to a discussion of the phenomenology of our numerical results and use heuristic scaling arguments.

Reversible-Binding Scenario

In the reversible-binding scenario, the time complexity strongly depends on the dimensionality of the structure. For one-dimensional structures, the rate of monomer detachment is the same for all unfinished structures. Hence, it is not possible to selectively disfavor nucleation of new structures relative to the growth of existing structures by varying the binding energy EB. In this respect, the one-dimensional reversible-binding scenario constitutes a special case among all scenarios, since it realizes a profoundly different self-assembly mechanism. We find that structures are initially formed in such an amount that the overall attachment and detachment processes of the monomers balance out and the concentration of monomers becomes stationary. Growth and shrinkage of a structure then become approximately equally likely and the cluster sizes evolve (approximately) diffusively, with diffusion constant given by D=νm+δ, where m denotes the stationary monomer concentration. Hence, varying the detachment rate δ allows one to maximize the diffusive flux. We show in SI Appendix, section 3 that the optimal detachment rate and the resulting effective diffusion constant scale like δoptDνμ(Cν)S2. This implies that the assembly time for one-dimensional structures scales like the diffusive timescale (to diffusively transverse a distance S) T90minS2/Dν2S4 with time complexity exponent θ = 4, which agrees very well with the results obtained from stochastic simulations (Fig. 2A).

In higher dimensions, large clusters are typically bound more tightly and hence become energetically favored over clusters of small size, as illustrated in Fig. 3A. This creates an effective nucleation barrier, which allows one to strongly enhance the time efficiency compared to the one-dimensional case. Essentially, the monomer concentration is thereby much larger than in the one-dimensional case, which enables nucleated structures to grow quickly. However, to guarantee both high resource efficiency (high yield) and time efficiency, the binding energy must be fine-tuned to within few percent of its optimal value (Fig. 3B). Larger binding energies imply a lower nucleation barrier and lead to kinetic trapping, whereas lower binding energies progressively reduce the effective nucleation rate. By fine-tuning of the binding energy EB, we obtain the time complexity exponents θ2D1.19 and θ3D0.75, respectively, for the two-dimensional (2D) and three-dimensional (3D) cases (Fig. 2 B and C). Both exponents can also be estimated analytically from Eq. 1 by deriving effective rates for nucleation and attachment reactions (SI Appendix, section 3 and tables in Fig. 2 B and C). Note that the binding energy EB is measured in units of kBT and the detachment rate relative to the reaction rate (Cν). Hence, the most feasible way to fine-tune the control parameter in experiments will be to adapt either the temperature or the monomer concentration C.

Fig. 3.

Fig. 3.

Reversible-binding scenario. (A) In the reversible-binding scenario (if δ2δ1), the cluster evolution typically proceeds via stable intermediate states (in which all constituents form two or more bonds), whereas unstable intermediates are short lived. Hence, nucleation is disfavored relative to growth because nucleation proceeds via two unstable intermediate states whereas attachment proceeds only via one. (B) Assembly time to achieve 50% yield (T50) and 90% yield (T90) plotted against the binding energy EB for two-dimensional target structures of size S = 100 (with preexponential factor A=1018Cν). To achieve high yield with maximal time efficiency, EB must be fine-tuned to a narrow range (here 1.4%) around its optimal value. In Inset, the optimal detachment rate δ1opt exhibits a power-law dependence on the structure size with exponent characterized by the dimensionality of the structure. The control parameter exponents ϕsim together with their theoretic estimates ϕth are summarized in the table.

Dimerization Scenario

We then analyzed the remaining irreversible assembly scenarios, setting δn=0. In the dimerization scenario, decreasing the dimerization rate μ disfavors initiation of new structures relative to the growth of existing structures. Fig. 4A shows the corresponding transition from zero to perfect final yield, with μ90 indicating the rate at which a final yield of 90% is achieved. We find that the optimal rate μopt that minimizes the time required to achieve 90% yield is only slightly lower than μ90 and, for linear structures, scales as μoptνS2 (Fig. 4 A, Inset). This dependence of μopt on S for linear structures can be explained as follows: According to Eq. 1, when increasing the structure size S, the ratio between nucleation (= dimerization) and growth rate must be reduced to allow the structures to grow to the larger size. However, to achieve the desired scaling, μopt must scale quadratically with 1/S, because the number of dimerization events per time increases with the number of possible dimerization partners (S) leading to an additional factor of 1/S.

Fig. 4.

Fig. 4.

Dimerization and activation scenario. (A and B) The final yield in dependence of the dimerization rate (A) and the activation or influx rate (B) for different sizes (symbols) and dimensionality (color shading) of the target structure. Data points represent averages over at least 20 independent realizations. Upon decreasing either the dimerization or the activation rate, perfect final yield is achieved. For the leftmost transition we indicate the optimal parameter value μopt or αopt that minimizes the time to achieve a yield of 90%. Insets show the dependence of the optimal parameter value on the structure size for different dimensionality. The corresponding control parameter exponents ϕsim are summarized in the tables together with their theoretic estimates ϕth (main text).

Since dimerization is the rate-limiting step, we expect that the assembly time will predominantly be determined by the total dimerization rate T90min(CμoptS)1(Cν)1S. This estimate correctly predicts the time complexity exponent θ = 1 for linear structures (Fig. 2A). For target structures of higher dimension, the effective growth rate of clusters is increased compared to the one-dimensional case because structures grow radially. This allows for a simple possibility to relate the exponents for target structures of higher dimension to the one-dimensional case by rescaling the binding rate ν: Note that the number of possible binding partners of a globular structure with s particles is proportional to its surface area and thus scales approximately as s(d1)/d, where d is the dimensionality of the structure. Thus, defining an effective average binding rate νSνS(d1)/d for a target structure size S allows one to map higher-dimensional growth processes to an effective one-dimensional process along the radial coordinate. Replacing ννS therefore translates the scaling laws for linear objects into approximate scaling laws for higher-dimensional structures. This scaling idea for the dimerization scenario accurately yields the control parameter exponents for higher-dimensional structures (table in Fig. 4A) and only slightly overestimates the time complexity exponents in higher dimensions (Fig. 2 B and C). These deviations may be attributed to the subleading contribution of the growth process to the total assembly time, which becomes more pronounced in higher dimensions. The dimerization scenario is the most time-efficient scenario because reducing the dimerization rate allows one to specifically control the effective nucleation speed without simultaneously affecting the attachment rate. In contrast, changing the binding energy in the reversible-binding scenario at the same time reduces the effective attachment speed and therefore renders this strategy less efficient.

Activation Scenario

In the activation scenario, nucleation is inhibited by controlling the concentration of available (or active) monomers. Decreasing the influx rate α reduces the momentary concentration of active monomers and therefore reduces the effective dimerization rate. As in the dimerization scenario, this leads to a transition from zero to perfect final yield (Fig. 4B). The transition is not strictly monotonic but exhibits some small-scale peaks, whose origin is not entirely clear. From the stochastic simulations and scaling analysis (SI Appendix, section 3) we infer for linear structures an optimal influx rate scaling as αoptνμ(Cν)S3 (Fig. 4 B, Inset). The dependence of αopt on S can be explained similarly to that for the dimerization scenario: When increasing S, according to Eq. 1, the ratio between effective nucleation and growth rate must be reduced, while the increase of the total nucleation rate with increasing number of species must be balanced. Together, this accounts for a factor of 1/S2 in αopt, analogous to the dimerization scenario. Additionally, however, increasing S would enhance the total influx of particles and thus the momentary concentration of monomers. This again would increase the nucleation rate and needs to be balanced, thus explaining the third factor of 1/S. The control parameter exponents ϕ for higher-dimensional structures can again be derived with our rescaling argument, ννS(d1)/d, and are found to be only slightly larger than those obtained from simulations (table in Fig. 4B). As the monomers are activated over a time span 1/α, the time complexity exponents are the reciprocals of the parameter exponents (Fig. 2).

We expect that the exponents for the activation scenario remain the same if other forms of monomer input are considered. For example, monomers could (reversibly or irreversibly) switch between an assembly inactive and active state. In SI Appendix, section 4, we simulate the reversible case explicitly, assuming that the switch is fast so that active and inactive monomers are at equilibrium. This case might indeed be relevant in virus capsid assembly (26) and it exhibits the same scaling as the constant influx scenario (SI Appendix, Fig. S5C). Controlling the switching rate between particle configurations (for example with light) (34) could also be a feasible way to implement the activation scenario experimentally.

Just-in-Sequence Scenario

In the irreversible assembly scenarios discussed so far, all species are made available simultaneously. Consequently, excess nucleation of structures can only be suppressed by using a low dimerization or activation rate. In contrast, the JIS scenario favors specific assembly paths by regulating the order in which building blocks are supplied. The species supplied first in this temporal sequence define the nuclei for subsequent growth. Formation of other competing nuclei (dimers) during the assembly process is suppressed by the sequential delivery of building blocks, which ensures that mutual binding partners are supplied successively. Binding of newly added monomers to existing structures is therefore more likely than formation of new dimers. The frequency of competing nucleation events can be controlled by adjusting the interval ΔT between the equidistant time points Ti at which subsequent “batches” of monomers are provided. Longer time intervals increase the yield at the cost of a lower time efficiency.

To minimize the total number of batches, we chose an “onion” supply protocol, which allows structures to grow radially from the inside out, like the skins of an onion (Fig. 5C). Furthermore, the time efficiency can be enhanced by using increasing, nonstoichiometric concentrations for the monomers in successive batches (Fig. 5B). Nonstoichiometric concentrations in a properly chosen ratio (SI Appendix, section 1, Just-in-sequence scenario) reduce competition for resources between growing structures (Fig. 5A) and thereby greatly enhance the time efficiency, as well as robustness to extrinsic noise in the particle numbers supplied, especially for higher-dimensional structures (Fig. 5D and E). Therefore, nonstoichiometric concentrations are the key to successful implementation of the JIS strategy for higher-dimensional structures. Since we assume equidistant time intervals ΔT between subsequent batches, the total assembly time is the product of ΔT (S) and the total number of batches (LS1/d), yielding the complexity exponents θ=1+1/d, as shown in Fig. 2. To demonstrate the broad experimental applicability of the JIS supply strategy with a concrete example, we discuss in detail in SI Appendix, section 5 how the JIS strategy could efficiently be used to assemble artificial T = 1 capsids. Artificial capsids have important potential technological and medical applications (3537) and the simulations show that the JIS strategy might indeed be a feasible and efficient way to assemble these structures.

Fig. 5.

Fig. 5.

JIS scenario. (A) In the JIS scenario, the different species are added sequentially; here, for illustration, they are in a linear sequence (T1<T2<T3<). Along the regular assembly paths, A1D (1D) or A2D (2D), additional dimers B can form, competing for resources with the regular structures and thereby disrupting their growth. While for one-dimensional structures a disruption event prevents a structure A1D from further growth, in higher dimensions both defective structures A2D and B continue to grow, thereby increasing competition for resources. (B) Competition for resources can be alleviated by enhancing the amount of resources with each assembly step (nonstoichiometric concentrations; SI Appendix, section 1). For example, providing the first species in concentration 0.9N and increasing linearly up to 1.1N for the last species strongly enhances assembly efficiency (D) and robustness (E). (C) Parallel supply protocol illustrated for a 2D structure of size S = 25 causing the structures to grow radially in an “onion-skin”–like fashion. Roman numbers indicate the order in which species are supplied. Species with identical numbers (“onion skins”) are supplied simultaneously in “batches.” (D) When using nonstoichiometric concentrations, high yield can be achieved with a shorter time span ΔT between subsequent batches, exhibiting a smaller control parameter exponent (Inset) compared to the case of stoichiometric concentrations. Simulations were performed for 3D structures with N=104 to 105. (E) External noise in the concentrations jeopardizes the yield when stoichiometric concentrations are used, whereas nonstoichiometric concentrations are much more robust. Here, for each species we assumed a coefficient of variation CV=0.1% with average particle numbers as in D.

Discussion

Fig. 2 shows the dependence of the minimal assembly time on target structure size, together with the resulting time complexity exponents for the different scenarios and dimensionalities. All exponents decrease with increasing dimensionality of the target structure and can even change their relative order. For the dimerization, activation, and reversible-binding scenario, one can show that the analysis is independent of the heterogeneity of the building blocks (SI Appendix, section 2). Remarkably, the exponents are furthermore robust to various modifications of the model such as heterogeneous binding rates, modified boundary conditions, or altered definitions of the assembly time (SI Appendix, section 4). Similarly, advanced protocols like annealing or different forms of monomer input in the activation scenario leave the exponents invariant. This invariance shows that the time complexity analysis yields a reliable and robust characterization of self-assembly processes. Furthermore, the invariance of the parameter exponents allows for an optimal control strategy to be identified in dependence of the size of the target structure in each of the four scenarios.

The dimerization scenario turns out to be the most time-efficient scenario in all dimensions. Controlling the dimerization rate is efficient as it allows one to initiate just as many structures as are needed, followed by a rapid growth phase if all particles are readily available. For linear structures, the supply-control strategies rank second and third, with coordinated supply in the JIS scenario being more efficient than uncoordinated supply in the activation scenario. Reversible binding is the least efficient approach to assembling large linear structures, but it is efficient for the assembly of higher-dimensional structures and then becomes competitive with the JIS scenario, slightly outperforming it for large structure sizes.

The reason why reversible binding is inefficient for one-dimensional structures is that for linear objects—in contrast to higher-dimensional objects—nucleation cannot be slowed down relative to growth by increasing the detachment rate. This strong dependence of the efficiency on the dimensionality implies that, generally, the morphology of the assembled structures plays an important role for the reversible-binding scenario. For example, assembling quasi-linear objects with two (or more) layers of subunits instead of a one-layered linear object might significantly increase the assembly efficiency. Identifying and designing those morphologies that are particularly favorable and assemble efficiently could therefore be an interesting direction for future research.

In conclusion, our time-complexity analysis of self-assembly describes lower bounds for the required assembly time as a function of the target structure size. Furthermore, it provides a robust description of how the parameters of the system must be controlled to achieve optimal time and resource efficiency. The analysis enables us to compare the efficiency of different self-assembly scenarios. In computer science, the complexity of a computational problem is defined as the complexity of the fastest algorithm available to solve it (38). Among the assembly scenarios discussed here, limiting the dimerization rate defines the fastest assembly process and might thereby determine the time complexity of self-assembly (of course, we cannot exclude the possibility of even faster assembly strategies). Experimentally, however, controlling the dimerization rate is difficult, as it effectively requires building blocks that exhibit allosteric binding effects. So far, experiments have typically resorted to rendering binding reactions reversible (2123, 25, 39, 40). Our analysis shows that this common approach is time efficient for the assembly of higher-dimensional structures. However, to be truly competitive, fairly precise tuning of bond strengths, temperature, and the concentration is required. Our analysis suggests that a supply-control strategy like the JIS scenario is a promising alternative that offers similar or better time efficiency using irreversible self-assembly. As a significant advantage, this strategy does not rely on sophisticated properties of the building blocks (like allosteric effects or fine-tuned bond strengths) but only on temporal supply control and hence on parameters that might be more amenable to regulation and adaptation in experiments: In its simplest implementation, the different species could just be added manually to the system in the designated temporal sequence.

Compared to the current state-of-the-art approach via reversible reactions, irreversible assembly schemes might thus provide a complementary and more versatile strategy for assembling complex structures, requiring control over relative concentrations, rather than fine-tuning of the molecular details. Importantly, the idea underlying the JIS scenario entails a rather specific design principle for efficient irreversible assembly protocols of complex nanostructures (“batches without mutual binding partners”); we demonstrate in SI Appendix, section 5 how this principle is applied exemplarily for the assembly of artificial T = 1 capsids. This design principle thereby provides a clear path toward the experimental realization of the JIS scenario, suggesting that the strategy will be broadly applicable to the assembly of artificial structures.

An interesting question for future research concerns the prospects for spatiotemporal supply control, i.e., controlling not only the time interval but also the site at which monomers are injected into a spatial system, for further enhancement of the time efficiency. Moreover, it would be interesting to consider the time complexity of assembly schemes like hierarchical self-assembly (4144), which include polymer–polymer interactions, or assembly schemes in which interactions among the particles allow for multiple self-assembly states. Finally, other potentially important aspects of self-assembly include susceptibility to errors in the case of reduced binding specificities or defective particles, as well as robustness to stochastic effects for small copy numbers. If particle numbers are large and nonspecific bonds are sufficiently weak and reversible, we expect that these factors will not considerably affect the assembly dynamics. Otherwise, it might be instructive to test how the different assembly scenarios are influenced by these factors and compare the robustness of the various strategies in this respect.

Materials and Methods

This paper is accompanied by a detailed SI Appendix file, which discusses the numerical and analytical methods that were used to simulate the four scenarios and to determine their time complexity exponents. Specifically, SI Appendix, section 1 shows the details of the numerical simulation and, in particular, explains how the concentrations for the various species in the just-in-sequence scenario were determined. SI Appendix, section 2 analyzes the master equation and shows mathematically that the heterogeneity (distinguishability) of the building blocks is irrelevant for the dynamics in the limit of large particle numbers. SI Appendix, section 3 is dedicated to the mathematical scaling analysis and explains how the analytic estimates for the time complexity and control parameter exponents are derived. Furthermore, SI Appendix, section 4 demonstrates the robustness of the time complexity exponents to various modifications of the model and variations in the parameters. Finally, SI Appendix, section 5 illustrates how the just-in-sequence supply strategy can be used in practice for the concrete example of artificial T = 1 capsid assembly and thereby demonstrates the broad applicability of the just-in-sequence scenario.

Supplementary Material

Supplementary File

Acknowledgments

We thank Patrick Wilke and Philipp Geiger for stimulating discussions. This research was supported by the German Excellence Initiative via the program “NanoSystems Initiative Munich” and was funded by the Deutsche Forschungsgemeinschaft (DFG) (German Research Foundation) under Germany’s Excellence Strategy, EXC-2094–390783311, as well as under Project ID 364653263–TRR 235. F.M.G. and I.R.G. were supported by a DFG fellowship through the Graduate School of Quantitative Biosciences Munich.

Footnotes

The authors declare no competing interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2116373119/-/DCSupplemental.

Data Availability

C++ code for simulations and data have been deposited in GitHub (https://github.com/FloGat88/Self_Assembly.git) (45).

References

  • 1.Zlotnick A., Theoretical aspects of virus capsid assembly. J. Mol. Recognit. 18, 479–490 (2005). [DOI] [PubMed] [Google Scholar]
  • 2.Perlmutter J. D., Hagan M. F., Mechanisms of virus assembly. Annu. Rev. Phys. Chem. 66, 217–239 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Hagan M. F., Modeling viral capsid assembly. Adv. Chem. Phys. 155, 1–68 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Stewart J. M., Franco E., Self-assembly of large RNA structures: Learning from DNA nanotechnology. DNA RNA Nanotechnol. 2, 23–35 (2015). [Google Scholar]
  • 5.Chidchob P., Sleiman H. F., Recent advances in DNA nanotechnology. Curr. Opin. Chem. Biol. 46, 63–70 (2018). [DOI] [PubMed] [Google Scholar]
  • 6.Deeds E. J., Bachman J. A., Fontana W., Optimizing ring assembly reveals the strength of weak interactions. Proc. Natl. Acad. Sci. U.S.A. 109, 2348–2353 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Whitelam S., Jack R. L., The statistical mechanics of dynamic pathways to self-assembly. Annu. Rev. Phys. Chem. 66, 143–163 (2015). [DOI] [PubMed] [Google Scholar]
  • 8.Endres D., Zlotnick A., Model-based analysis of assembly kinetics for virus capsids or other spherical polymers. Biophys. J. 83, 1217–1230 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Jacobs W. M., Frenkel D., Self-assembly of structures with addressable complexity. J. Am. Chem. Soc. 138, 2457–2467 (2016). [DOI] [PubMed] [Google Scholar]
  • 10.Hagan M. F., Elrad O. M., Jack R. L., Mechanisms of kinetic trapping in self-assembly and phase transformation. J. Chem. Phys. 135, 104115 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Ramezani H., Dietz H., Building machines with DNA molecules. Nat. Rev. Genet. 21, 5–26 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Seeman N. C., Sleiman H. F., DNA nanotechnology. Nat. Rev. Mater. 3, 1–23 (2017). [Google Scholar]
  • 13.Cademartiri L., Bishop K. J., Programmable self-assembly. Nat. Mater. 14, 2–9 (2015). [DOI] [PubMed] [Google Scholar]
  • 14.Ahnert S. E., Johnston I. G., Fink T. M., Doye J. P., Louis A. A., Self-assembly, modularity, and physical complexity. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 82, 026117 (2010). [DOI] [PubMed] [Google Scholar]
  • 15.Soloveichik D., Winfree E., Complexity of self-assembled shapes. SIAM J. Comput. 36, 1544–1569 (2007). [Google Scholar]
  • 16.Evans C. G., Winfree E., Physical principles for DNA tile self-assembly. Chem. Soc. Rev. 46, 3808–3829 (2017). [DOI] [PubMed] [Google Scholar]
  • 17.Chen C., Kao C. C., Dragnea B., Self-assembly of brome mosaic virus capsids: Insights from shorter time-scale experiments. J. Phys. Chem. A 112, 9405–9412 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Ben-Shaul A., Gelbart W. M., “Statistical thermodynamics of a mphiphile self-assembly: Structure and phase transitions in micellar solutions” in Micelles, Membranes, Microemulsions, and Monolayers, Gelbart W. M., Ben-Shaul A., Roux D., Eds. (Springer, New York, NY, 1994), pp. 1–104. [Google Scholar]
  • 19.Schnier P. D., Klassen J. S., Strittmatter E. F., Williams E. R., Activation energies for dissociation of double strand oligonucleotide anions: Evidence for Watson-Crick base pairing in vacuo. J. Am. Chem. Soc. 120, 9605–9613 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Bielec K., et al., Kinetics and equilibrium constants of oligonucleotides at low concentrations. Hybridization and melting study. Phys. Chem. Chem. Phys. 21, 10798–10807 (2019). [DOI] [PubMed] [Google Scholar]
  • 21.Whitesides G. M., Grzybowski B., Self-assembly at all scales. Science 295, 2418–2421 (2002). [DOI] [PubMed] [Google Scholar]
  • 22.Zhang S., Fabrication of novel biomaterials through molecular self-assembly. Nat. Biotechnol. 21, 1171–1178 (2003). [DOI] [PubMed] [Google Scholar]
  • 23.Hong F., Zhang F., Liu Y., Yan H., DNA origami: Scaffolds for creating higher order structures. Chem. Rev. 117, 12584–12640 (2017). [DOI] [PubMed] [Google Scholar]
  • 24.Freeman R., et al., Reversible self-assembly of superstructured networks. Science 362, 808–813 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Rapaport D. C., Role of reversibility in viral capsid growth: A paradigm for self-assembly. Phys. Rev. Lett. 101, 186101 (2008). [DOI] [PubMed] [Google Scholar]
  • 26.Lazaro G. R., Hagan M. F., Allosteric control of icosahedral capsid assembly. J. Phys. Chem. B 120, 6306–6318 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Marsh J. A., Teichmann S. A., Structure, dynamics, assembly, and evolution of protein complexes. Annu. Rev. Biochem. 84, 551–575 (2015). [DOI] [PubMed] [Google Scholar]
  • 28.Zlotnick A., Mukhopadhyay S., Virus assembly, allostery and antivirals. Trends Microbiol. 19, 14–23 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Caspar D. L., Movement and self-control in protein assemblies. Quasi-equivalence revisited. Biophys. J. 32, 103–138 (1980). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Leung C., et al., Stepwise visualization of membrane pore formation by suilysin, a bacterial cholesterol-dependent cytolysin. eLife 3, e04247 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Gartner F. M., Graf I. R., Wilke P., Geiger P. M., Frey E., Stochastic yield catastrophes and robustness in self-assembly. eLife 9, 1–37 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Morozov A. Y., Bruinsma R. F., Rudnick J., Assembly of viruses and the pseudo-law of mass action. J. Chem. Phys. 131, 155101 (2009). [DOI] [PubMed] [Google Scholar]
  • 33.Murugan A., Zou J., Brenner M. P., Undesired usage and the robust self-assembly of heterogeneous structures. Nat. Commun. 6, 6203 (2015). [DOI] [PubMed] [Google Scholar]
  • 34.Gil A. A., et al., Optogenetic control of protein binding using light-switchable nanobodies. Nat. Commun. 11, 4044 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Heddle J. G., Chakraborti S., Iwasaki K., Natural and artificial protein cages: Design, structure and therapeutic applications. Curr. Opin. Struct. Biol. 43, 148–155 (2017). [DOI] [PubMed] [Google Scholar]
  • 36.Schwarz B., Uchida M., Douglas T., Biomedical and catalytic opportunities of virus-like particles in nanotechnology. Adv. Virus Res. 97, 1–60 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Sigl C., et al., Programmable icosahedral shell system for virus trapping. Nat. Mater. 20, 1281–1289 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Sipser M., Introduction to the Theory of Computation (Thomson Learning, Boston, MA, 2006). [Google Scholar]
  • 39.Whitesides G. M., Boncheva M., Beyond molecules: Self-assembly of mesoscopic and macroscopic components. Proc. Natl. Acad. Sci. U.S.A. 99, 4769–4774 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Zlotnick A., Are weak protein-protein interactions the general rule in capsid assembly? Virology 315, 269–274 (2003). [DOI] [PubMed] [Google Scholar]
  • 41.Haxton T. K., Whitelam S., Do hierarchical structures assemble best via hierarchical pathways? Soft Matter 9, 6851 (2013). [Google Scholar]
  • 42.Whitelam S., Hierarchical assembly may be a way to make large information-rich structures. Soft Matter 11, 8225–8235 (2015). [DOI] [PubMed] [Google Scholar]
  • 43.Pfeifer W., Saccà B., From nano to macro through hierarchical self-assembly: The DNA paradigm. ChemBioChem 17, 1063–1080 (2016). [DOI] [PubMed] [Google Scholar]
  • 44.Baschek J. E., Klein H. C. R., Schwarz U. S., Stochastic dynamics of virus capsid formation: Direct versus hierarchical self-assembly. BMC Biophys. 5, 22 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Gartner F. M., Stochastic simulation of self-assembly scenarios (C++ code). GitHub. https://github.com/FloGat88/Self_Assembly.git. Deposited 24 September 2021.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File

Data Availability Statement

C++ code for simulations and data have been deposited in GitHub (https://github.com/FloGat88/Self_Assembly.git) (45).


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES