Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Jul 26.
Published in final edited form as: J Phys Chem B. 2012 Mar 21;116(29):8355–8362. doi: 10.1021/jp210407e

Nanosystem Self-Assembly Pathways Discovered via All-Atom Multiscale Analysis

Stephen D Pankavich 1,*, Peter J Ortoleva 2
PMCID: PMC3406252  NIHMSID: NIHMS365481  PMID: 22372746

Abstract

We consider the self-assembly of composite structures from a group of nanocomponents, each consisting of particles within an N-atom system. Self-assembly pathways and rates for nanocomposites are derived via a multiscale analysis of the classical Liouville equation. From a reduced statistical framework, rigorous stochastic equations for population levels of beginning, intermediate, and final aggregates are also derived. It is shown that the definition of an assembly type is a self-consistency criterion that must strike a balance between precision and the need for population levels to be slowly varying relative to the timescale of atomic motion. The deductive multiscale approach is complemented by a qualitative notion of multicomponent association and the ensemble of exact atomic-level configurations consistent with them. In processes such as viral self-assembly from proteins and RNA or DNA, there are many possible intermediates so that it is usually difficult to predict the most efficient assembly pathway. However, in the current study rates of assembly of each possible intermediate can be predicted. This avoids the need, as in a phenomenological approach, for recalibration with each new application. The method accounts for the feedback across scales in space and time that is fundamental to nanosystem self-assembly. The theory has applications to bionanostructures, geomaterials, engineered composites, and nanocapsule therapeutic delivery systems.

Keywords: self-assembly, virus assembly, bionanosystems, nanomaterials, multiscale analysis

Introduction

The self-assembly of composite structures from nanocomponents is central to many natural and engineered processes. Viruses self-assemble from macromolecules and their associated complexes1. Smart materials, molecular machines, ribosomes, and molecular electronics are other examples of self-assembling nanostructured materials24, as are geological materials such as opal. Hence, understanding self-assembly pathways and the factors that control them is of great interest in the pure and applied sciences. In the present study, a theory of self-assembly is derived from the laws of molecular physics. Our motivation is to both attain a deeper understanding of the pathways involved and to arrive at a calibration-free theory. In short, we seek an understanding of the interplay of processes across multiple scales in space and time that underlies self-assembly.

Modeling the self-assembly of composites from nanocomponents presents numerous conceptual and practical challenges. Each nanocomponent consists of many highly fluctuating atoms, and hence interactions among components are not well defined. This is in contrast to interatomic forces that have been formulated and well calibrated with experimental data and quantum computations that provide unique values of atomic forces given a corresponding configuration. Nanocomponents have a highly fluctuating internal state, which underlies their entropics and average energetics. Furthermore, the internal structure and fluctuations of one nanocomponent can be strongly affected by the presence of others. Thus, fluctuations and frictional/dissipative effects play a key role in the self-assembly and stability of nanocomposites. Similarly, the position, orientation, and nanoscale internal structure of the components affect assembly in a complex fashion as they modify statistical properties of fluctuations within the composite and its microenvironment. Conditions in the microenvironment (i.e., pH, salinity, selected ion concentrations, and temperature) alter the internal state and the evolving nature of interactions between nanocomponents. From this, we conclude that a predictive calibration-free theory of self-assembly should be based on an underlying all-atom description for which all forces are well-understood.

Kinetic theories of self-assembly have been presented for many of the aforementioned systems59. As for chemically reacting systems, phenomenological kinetic theories require extensive experimental data and prior knowledge of steps along the self-assembly pathway. This severely limits the predictive power of such approaches. The well-known variety of assembly pathways (e.g. A + BAB, AB + CABC versus the alternative permutations of A, B and C) illustrate the factorial multiplicity that could emerge when assembly involves many components. While phenomenological kinetic studies have furthered the understanding of self-assembly in well-characterized systems, the detailed kinetics of the processes in most systems remains unclear, especially considering the highly dissipative and stochastic nature of the Brownian dynamics of the nanocomponents involved. When so much experimental data is needed, the question arises as to the meaning of “prediction”. To arrive at a complete theory, all intermediate structures along the assembly pathway must be identified and rate data for individual assembly steps must be obtained. Given this we conclude that a theory of self-assembly should have the following characteristics:

  1. the underlying description is all-atom in detail to arrive at a calibration-free model

  2. the internal atomic structure of the nanocomponents adjusts dynamically as the many-component system changes so that inter-component forces must be co-evolved with the many-component state

  3. the population levels of initial, intermediate, and final assemblies are co-evolved with the probability density of atomistic configurations to capture the multiscale character of nanocomposite self-assembly

  4. some intermediate assemblies may be rare; thus population fluctuations can be significant, requiring that the theory must be stochastic in nature10 and allow for multiple outcomes or particular assembly pathways beginning with the same initial population levels of individual nanocomponents

  5. criteria are provided to determine the completeness of the description and impose restrictions on characteristics of the nanocomponents involved for the self-consistency of the theory.

The objective of this study, then, is to develop a theory of self-assembly with these characteristics.

Since self-assembly is ultimately directed by the laws of molecular physics, an all-atom description is the natural starting point for deriving the kinetic theory. Given the broad range of characteristic times in self-assembly (e.g., 10−14 seconds for atomic collisions/vibrations versus 10−3 seconds or longer to form complete assemblies), we adopt a multiscale formalism. In this framework, order parameters (i.e., population levels of long-lived assemblies) are defined in terms of the 6N atomic positions and momenta, Γ, and describe overall properties of the system. Further, they are shown via Newton’s equations to evolve on a timescale much greater than that of individual atomic vibration. Since the order parameters depend on Γ, they do not evolve independently of atomic variables. Hence, the order parameter evolution alone does not provide a complete theory of population dynamics.

One technique that has been proposed to address this incompleteness dilemma is the projection operator method of Mori11 and Zwanzig12. However, this method presents practical and conceptual difficulties. Projection operators involve the integration of a subset of atomic variables from the N -atom probability density ρ, and the method arrives at a formal equation satisfied by a reduced density for the variables of interest (population levels here). This formal equation involves expressions (memory kernels) for which the time-dependence is generated by an evolution operator Inline graphic− where is a P projection operator, Inline graphic is the Liouville operator, and t is time. This evolution operator acts on functions of Γ. Unfortunately, evolution of this type is difficult to simulate as no software like molecular dynamics exists for carrying out the projection operator-mediated evolution. In fact, if the associated memory kernels decay on a timescale that is longer than that of atomistic changes, then they are at least as intensive to compute as solving the original Liouville equation itself, or more realistically using MD for the entire computation. Progress can be made when there is a timescale separation, but this raises the question as to why the projection operators should have been used in the first place. For instance, why not use a multiscale perturbation scheme directly to utilize the timescale separation?

There is an extensive literature on the use of multiscale techniques to analyze the Liouville equation1323. The present work is an extension of that approach, notably the all-atom multiscale (AMA) method2433. Frictional effects in the dynamics of many nanoparticle systems undergoing aggregation have been considered using a multiscale approach3436. Such effects are also included in the work presented here but via a distinct formulation involving population levels as collective variables.

The AMA method adopts a direct perturbative strategy2433. In AMA the Liouville equation, originally cast in 6N dimensions, is mapped to a higher-dimensional product space using a multiscale ansatz on the N-atom probability density. Thus, ρ is hypothesized to have the dependence ρ(Γ, Φ, t) for a set of Na order parameters Φ, and hence is a function in (6N + Na)-dimensional state space. With the chain rule and the Γ-dependence of Φ, one obtains a reformulated Liouville equation for ρ as an explicit function of both Γ and Φ. This unfolded Liouville equation is solved as a power series in a timescale ratio ε. The parameter ε is a ratio of characteristic lengths, times, or masses and naturally emerges due to the slow variations of the order parameters relative to the timescale of atomic collisions. Upon defining the reduced probability density Ψ(Φ, t) = ∫ *δ(ΦΦ*) ρ (Γ*, Φ*, t) an exact conservation law results from the structure of ρ. Then, using the multiscale solution of the Liouville equation, a closed equation of Fokker-Planck or Smoluchowski type for the state of the Na order parameters Φ is obtained, and the need for projection operators is avoided.

Coarse-grained models have also been proposed but are built entirely upon equations for a reduced set of variables (i.e., < 6N). It is incorrect to term these approaches “multiscale” unless they track both the atomistic and the course-grained descriptive variables. Thus, we term such approaches “decoupled coarse-graining” when they do not co-evolve dynamics at all scales. The objective of decoupled approaches is to reduce the number of variables (e.g., to describe clusters of atoms) using heuristic arguments and calibration with experimental data to fit the number of degrees of freedom and computational time. Unfortunately, such methods do not account for the key interscale feedback suggested in Fig. 1. In reality, the order parameters mediate the ensemble of atomic-scale configurations while the latter determine the free-energy gradients and diffusivities that drive order parameter dynamics. In the decoupled approach, effective interaction between clusters of atoms3738 or between an atomically described zone and a continuum3940 are calibrated prior to simulation. Thus, the feedback of Fig. 1 is ignored. In contrast, our multiscale approach co-evolves the probability density of atomistic configurations along with order parameters. Decoupled approaches assume the lumped elements evolve via a Langevin equation. However, elements are usually too small to justify the timescale separation required for this approach. AMA displays that this requires the ratio of the mass of a typical atom to that of a lumped element is small. But if the elements are large, it is necessary to account for the internal atomistic fluctuations and associated dissipative nature of collisions of larger elements. Thus, in the presence of a distinct characteristic scaling AMA provides a precise multiscale description of the system.

Fig. 1.

Fig. 1

Population levels characterize the overall organization of a self-assembling system, which affects the probability of the atomistic configurations. This mediates thermal-average forces driving population dynamics, and diffusion factors mediating the rate of change and coupling among the population levels. This interscale feedback loop is central to understanding nanosystem self-assembly. The proposed multiscale approach captures this feedback by co-evolving population levels with the thermal-average forces and diffusion factors, thereby providing the essence of an algorithm for the numerical simulation of self-assembly.

A self-consistency check is also provided by the all-atom analysis. In the stochastic equation of order parameter dynamics, diffusion factors appear. These factors are related to correlation functions of order parameter velocities. The correlation functions can be readily computed via MD when the correlation time is short relative to the characteristic time of order parameter evolution. If key order parameters are missing from the analysis, these correlation functions can have long-time tails. In this case, unacceptably large or even divergent diffusivities may arise. This difficulty can then be remedied by including more order parameters in the description.

A challenge for the theory of self-assembly is that there is no clear notion of species (i.e., assembly types). Each nanocomponent in an assembly could be in any of a number of nanometer and atomic scale configurations, and similarly for the architecture of an assembly of these components. Thus, an assembly type (species) is best conceived of as an association of a given set of atoms with a broad spectrum of possible configurations. However, as the definition of the common structural feature shared by the ensemble becomes more narrowly defined, the smaller the number of configurations in the ensemble and the more highly and rapidly fluctuating the population will be. For example, if all interatomic distances in the defined assembly can only vary by 0.01% from a reference configuration, then only a very few assemblies in the system will reside within the ensemble. For a multiscale analysis to hold, and for the conventional notions of chemical kinetics as the gradual evolution of population levels to be valid, one must define an assembly type somewhat loosely. In that case the associated ensemble represents a large enough collection of N -atom configurations so that under appropriate conditions it changes slowly in time. Thus, vagueness in defining the assembly types becomes a part of the solution to, and not a difficulty of, the self-assembly problem. Since AMA co-evolves order parameters with the probability distribution for the atomistic configurations (notably of each of the assemblies), it is well suited for modeling self-assembly.

In the present study, we combine the above notions into a quantitative kinetic theory of self-assembly wherein population levels are the order parameters. Since the formulation is based on AMA, recalibration with each new application is unnecessary if an accurate interatomic force field and complete set of order parameters are used. Thus, the danger of over-calibration is avoided.

Methods

Multiscale Analysis

The equation for stochastic dimer count (χ) dynamics is obtained using an all-atom multiscale approach2433. First, the Liouville equation and integration by parts imply an exact conservation law for the evolution of Ψ:

Ψt=-εχdΓδ(χ-χ)ρJ. (1)

The ansatz that the N -atom probability density ρ takes the form ρ(Γ, χ, t) and thus depends on Γ directly and, via χ, indirectly is used. This dual dependence can be constructed when ε is sufficiently small. The derivation presented below differs from that of other approaches because

  1. there is no need for tedious bookkeeping to conserve the number of degrees of freedom (6N) despite the introduction of counter variables in ρ,

  2. difficulties from integration over atomic coordinates and memory kernels in projection operator methods41 are avoided,

  3. special ensembles constrained to fixed values of counters arise naturally27, and

  4. use of the conservation law (1) with multiscale perturbation theory avoids the need to reconstruct ρ to second order in the perturbation expansion in ε to obtain corrections to them30.

We begin the derivation by defining a new timescale structure for the system, separated in factors of ε and denoted by tn = εnt. Using the chain rule, the Liouville equation becomes

n=0εnρ/tn=(L0+εL1)ρ, (2)
L0=-i=1N[pimiri+Fipi] (3)
L1=-Jχ. (4)

In this equation, Γ derivatives in Inline graphic are at constant values of χ and vice versa for Inline graphic. The analysis proceeds by constructing solutions of this multiscale Liouville equation as an expansion in ε, i.e., ρ=n=0ρnεn. The Liouville equation is solved to O(ε) and then (1) yields an equation for Ψ to O(ε2).

To lowest order we seek quasi-equilibrium states, i.e., for which ρ0 is independent of the fastest time scale t0. This is consistent with our expectation that the self-assembly of nanocomponents into composite structures takes place on timescales much longer than 10−14 seconds. In this case, the Liouville equation at O(ε0) implies Inline graphicρ0 = 0, while the O(ε) equation is derived to be (t0-L0)ρ1=-(t1-L1)ρ0. Since the lowest order solution is quasi-equilibrium in character, it is determined via entropy maximization27,28. For an isothermal closed system, this implies

ρ0=e-βHQ(χ,β)W(χ,t_)ρ^W, (5)
Q=dΓδ(χ-χ)e-βHe-βF (6)

where H is the total energy. The δ-function in the expression for Q arises naturally when one starts from a quantum formalism and maintains a careful count of states in the expression for the entropy. Here, F is the χ, β-dependent free energy, and from its definition, Ψ is W to lowest order in ε.

To O(ε) the Liouville equation admits the formal solution ρ1=eL0t0A1-0t0dt0eL0(t0-t0)(t1-L1)ρ0. Here, A1(Γ, χ, t), is ρ1 at t0 = 0 and t = {t1, t2, ···}. With the expressions for Inline graphic and ρ0, and the change of variables s=t0-t0, this implies ρ1=eL0t0A1-ρ^t0Wt1-ρ^-t00dse-L0sJ([χ-βf]W) where 〈f〉 = −∂F/∂χ is the thermal-average force driving χ dynamics. The Gibbs hypothesis and the self-consistency condition that ρ1 must be well-behaved as t0 → ∞ together imply ∂W/∂t1 must be zero since 〈J〉, the ρ̂–weighted average of J, is zero. With this,

ρ1=eL0t0A1-ρ^-t00dse-L0sJ([χ-βf]W), (7)

completing the O(ε) analysis of the Liouville equation.

For arbitrary choice of A1, the above analysis does not necessarily lead to a closed equation for Ψ, nor does Ψ necessarily evolve slowly (i.e., is independent of t0), violating our earlier assumption. However, if A1 lies in the nullspace of Inline graphic, then a closed equation for the slow evolution of Ψ emerges. In particular, we take the first-order initial data to be zero, i.e., the system starts in the quasi-equilibrium state ρ̂W. With this, for later times (t0 > 0) Ψ remains slowly varying suggesting self-consistency of the development. It follows that Ψ = W up to O(ε2). From equations (5) and (7), the Ψ-conservation equation (1) implies Ψt=ε2χ[-t00dtJe-L0tJ(χ-βf)Ψ]. To reconstruct the full time dependence of Ψ, we express all time variables in terms of τ = t2, notably t0 = τ/ε2. Taking the limit ε → 0 of the above equation for ∂Ψ/∂t yields equations (13) and (14) of the next section. This is the Smoluchowski equation of stochastic dimerization. The diffusion coefficient depends on χ, while the χ-dependence of 〈f〉 drives the dimerization. At equilibrium, Ψ is proportional to eβF as expected; thus eβF solves (13) and (14) when Ψ is time-independent.

Langevin/Smoluchowski Equivalence

We now postulate a Langevin model, given by equation (19) of the Results and Discussion section, and show that it is equivalent to the Smoluchowski equation in a Monte Carlo sense. Let Ψ(χ, t) be the joint probability density for the set of scaled counter variables χ. We assume the set of random forces ξ changes on a more rapid timescale than χ. A time interval Δt is postulated to exist such that the system experiences a representative sample of variations in ξ (t) during Δt. Additionally, it is assumed that the processes ξ are Markov, so that Ψ can be advanced from t to t+ Δt solely knowing Ψ at t and the probability for a transition in χ during Δt. The evolution of Ψ is determined by the statistics of all ξ timecourses between t and t + Δt. Let T[ξ, t, Δt] be the transitional probability as a functional of a scenario of ξ during the time period t to t + Δt. Then, Ψ evolves via

Ψ(χ_,t+Δt)=dLχSξ_T[ξ_,t,Δt]δ(χ_-χ_[ξ_;t,Δt,χ_])Ψ(χ_,t). (8)

Here, χ̃ is the solution of (19) at time t + Δt, given the state was χ′ at t. The functional integral in (8) is over all scenarios of ξ between t and t + Δt, and the identity Sξ_T=1 follows from normalization of T.

Integration of (19) yields χ_=χ_+[α__f_χ_Δt+tt+Δtdsξ_(s)]. Next, we insert this result for χ̃ into (8), formally expand the δ-functions in the changes χ̃χ′ that vanish as Δt → 0 using δ(χ_-χ_)k=021k!kδ(χ_-χ_)χ_k(χ_-χ_)k. After this we perform the averaging implied by the T-weighted functional integral in (8), denoted by 〈 〉T. As with the reaction flux it represents, we assume the average random force vanishes, i.e., 〈 ξT.= 0. If the process generating ξ (t) is stationary, then 〈 ξ(t)ξ (t′)〉T only depends on tt′, and not on t and t′ independently, i.e, ξ_(t)ξ_(t)Tφ__(t-t). Neglecting terms of order higher than Δt, the statistical properties of ξ and integration of (19) imply χ_-χ_T=-α__f_Δt and (χ_-χ_)2T=0Δtdt2-t2Δt-tdsφ__(s). For essentially all t′ in (0, Δt), the upper limit of the s integral is much greater than the correlation time. Since φ__(t) is negligible except for |t| ≪ Δt, one finds (χ_-χ_)2TA__Δt Where A__-0dsφ__(s).

Under the postulate that Ψ changes sparingly over Δt, we find Ψ(χ_,t+Δt)Ψ(χ_,t)+ΨtΔt. Substituting these results into (8) and neglecting terms of order higher than Δt yields the equation ΨtΔt=χ_{χ_-χ_TΨ}+122χ_2{(χ_-χ_)2TΨ}. The computed values for 〈(χ′− χ̃)kT for k=1,2 from above imply the Smoluchowski equation:

Ψt=χ_[α__(12α__-1A__χ_-f_)Ψ]. (9)

At long times the closed, isothermal system reaches equilibrium. Hence Ψ(χ_,t)~texp(-βF)ZΨeq(χ_) where Z = ∫ dL χeβF As this must be a time-independent solution of the Smoluchowski equation, it follows that α__=βA__2. Comparison of (13) and (14) with (9) finally implies βD__=α__.

Results and Discussion

Dimerization: An Illustrative Example

We begin by formulating the self-assembly problem for the construction of dimers and generalize in a later section. Consider dimerization of members in a collection of M nanocomponents. For simplicity, assume all components contain the same number and types of atoms with the same interatomic bonding connectivity. Let R(={R⃑1, ··· R⃑M}) be a set of nanocomponent centers-of-mass (CMs). The notion of a dimer, even for an isolated pair of nanocomponents, is not precise. For example, how close together must the components be to constitute a dimer? Is a pair of components categorized as a dimer when a third may be colliding with them, or is this a trimer? For elongate nanocomponents, is the attachment architecture end-to-end, parallel, or perpendicular? Such ambiguities are an inherent part of the speciation problem. In fact “species” are never well-defined in a dense system, but rather are an abstraction, valid in a precise way only in extreme cases, such as a low temperature rarefied gas wherein bound dimeric quantum states can be defined. However, the impossibility of a precise structural definition of a dimer does not preclude a quantitative, operational definition of a dimeric ensemble.

Given the free energy profile of Fig. 2, it might be said that components k and k′ form a dimer if the distance between their CMs is less than a critical value Rc. Such a profile represents an ensemble of atomistic configurations and not a given relative orientation or internal configuration of the two monomers. Adopting a critical CM-CM distance definition will be unsatisfactory in some cases since it does not rule out trimers and other assemblies. For components with a single small attachment patch, the three-body nature of the interaction makes trimers unlikely at low density. Thus, if two components are joined via their attachment patches, a third is sterically inhibited from attachment with them if the patch is small relative to component size. Hence, there can be simple criteria for defining a dimer when the many-body nature of the free energy-derived forces between components eliminates assemblies that are not consistent with the geometry of the components and attachment regions. The free-energy landscape may render a simple definition of a dimer inadequate. Thus, the best one can do to construct a theory of dimerization is to introduce a notion of dimeric associations, i.e., of the dynamics of an ensemble of detailed configurations with dimeric association defined in a semi-quantitative fashion. Regardless of the definition adopted, it must contain some ambiguity or there will never be an appreciable population. In fact, when one poses a classical chemical model (e.g., 2X ⇌ X2) it is assumed there is a dimeric association usually without being concerned with configuration detail. Thus, we develop the present approach in the same spirit.

Fig. 2.

Fig. 2

The pseudo-pair free energy profile shows a well indicating the tendency to form a bound state with center-to-center distance at the minimum. For dimerization, the free energy has three- body character, such that a third component is repelled, or at least not attracted, to a dimer.

With these cautionary notes, let a given monomer pair be classified as a dimeric association if the CMs of its two components are within the critical distance Rc. For this definition to suffice, it is assumed that collisions between components are inelastic. This ensures that dimers are stable for a time period that is long compared to the timescale of atomic collisions3133 and which naturally follows from the formalism developed.

Any approach must have a built-in self-consistency criterion to determine if a more precise definition of a dimer is required. Let a dimer population counter X2 be defined via X2=k<kθ(Rkk) where RKK = |R⃑kR⃑k and θ(R) is one when R < Rc and zero otherwise. Thus, a state of dimeric association is the ensemble of all configurations of the N-atom system consistent with a given value of X2. With these definitions, multiscale analysis can be used to derive an equation for the stochastic evolution of this ensemble.

To establish the validity of a multiscale theory of dimerization, X2 must be shown to evolve on a timescale much longer than the 10−14 seconds characteristic of atomic vibrations. Newton’s equations imply

dX2dt=1mck<kδ(Rkk-Rc)(Rk-Rk)Rkk·(Pk-Pk)J, (10)

where δ is the Dirac delta function, mc is the total mass of one of the identical components, and P⃑k is the total momentum of component k. Let ε−2 be the number, M, of components in the system. We now show that under certain conditions a kinetic theory for X2 emerges as ε → 0. In the limit of large population levels, the dimer counter X2 is less than M/2 for the single-attachment patch system at low density (i.e., <Rc-3). Hence, it is convenient to introduce the intensive variable χ via χ = 2X2, 0 < χ <1 which represents the proportion of dimers that form in the system. Newton’s equations then imply

dχdt=εJ,J=2εJ, (11)

for defined in (10). There are O(ε−4) terms on the RHS of (10). However, there are only O(ε−2) terms for which the δ-term can be zero. The fluctuating signs of these terms due to the momentum factors P⃑k under near-equilibrium conditions imply that, in the net, is O(ε−1). Hence, (11) implies χ evolves slowly, i.e., on a timescale of O(ε). With these hypotheses and estimates, χ is therefore a slow variable. Since J is O(ε0), similar arguments show that it is not slow, i.e., dJ/dt = f where f is O(ε0). This suggests that an equation for the stochastic dynamics of χ is of the Smoluchowski, or friction-dominated, type29.

Unfortunately, we see that equation (10) is not closed with respect to χ. To derive such an equation, a multiscale analysis is conducted beginning with the Liouville equation for the N-atom probability density ρ(Γ, t), namely ∂ ρ/∂t = Inline graphicρ where

L=-i=1Npimiri+Fipi (12)

is the Liouville operator and Γ = {p⃑1, r⃑1, ··· p⃑N, r⃑N} with atomic positions and momenta denoted by r⃑i, p⃑i; i = 1, …, N. The reduced probability density Ψ is defined via Ψ (χ, t) = ∫ *δ( χχ*)ρ where χ* is the value of the scaled dimer counter evaluated at the integration variables Γ*.

We proceed as in our analysis of multiscale theory to other phenomena2433 by utilizing the multiscale ansatz that ρ depends on Γ both directly and, via χ, indirectly; using the chain rule to derive an equation for ρ in its 6N+Na dimensional representation; expanding ρ in terms of ε; constructing the lowest order distribution ρ0 via entropy maximization; and involving the Gibbs-hypothesized equivalence of long-time and ensemble averages to ensure that the ε expansion is well-behaved at long times. As a result a Smoluchowski equation for Ψ emerges:

Ψτ=χ[D(χ-βf)Ψ] (13)
D=-0dsJ(0)J(s). (14)

Here the thermal-average force 〈f〉 is the χ-derivative of the free energy F, the bracket 〈···〉 indicates an average with both canonical and δ(χχ*) weights, τ is a scaled time variable, J(s) represents the variable J evolved to time s, and eβF is theχ-dependent partition function

e-βF=dΓδ(χ-χ)e-βH. (15)

Thus, a rigorous equation for the stochastic dynamics of the dimer count, i.e. an analogue of the chemical master equation27, emerges from the N-atom Liouville equation. As shown earlier in Methods, the use of the Gibbs hypothesis and all-atom multiscale analysis allows one to avoid the need for projection operators to extract Ψ from the fluctuating, atomistic state. The timescale separation implied by the smallness of ε yields this closed equation for Ψ, a feature not provided by Newton’s equations directly. Since Ψ depends upon ρ and ρ can be reconstructed from Ψ, our approach co-evolves the atomistic probability density with Ψ, i.e., is a fully coupled multiscale procedure capturing the feedback of Fig. 1.

The above results may appear to ignore the possible existence of multiple, long-lived dimeric isomers. In that case, the theory must be augmented to include these assemblies. This is critical when the various isomers evolve at distinct rates as it implies that one may have ignored slow variables other than χ. Such an error would be signaled by the existence of long-time tails in 〈J(s)J(0)〉 and the anomalous (divergent) behavior of D. Hence, computation of D provides a self-consistence check on our theory that guides the discovery of self-assembly pathways. Evaluation of the correlation function yielding D can be constructed using an MD package. This is not straightforward for an operator that requires memory kernels involving non-Newtonian evolution. In summary, we provide a procedure for predicting self-assembly kinetics that is calibration-free, as it is derived from the basic laws of molecular physics.

Generalizations for Complex Assemblies

The self-consistency criterion provided by our methodology facilitates the development of a theory for self-assembly in complex systems involving aggregates of multiple nanocomponents into any of a number of alternative architectures (e.g., normal versus abnormal structures of viruses or ribosomes). If key intermediate structures, and hence assembly pathways are omitted, unphysically slow assembly kinetics will be predicted, and similarly for the creation of incorrect populations of otherwise unlikely final assemblies. The construction of counters for a g -type assembly (e.g., g = 5 for a pentamer of given architecture and range of atomic configurations) enables the development of a multiscale approach to more complex self-assembling structures. An increase in the number of components in an assembly may allow for many possible isomeric architectures. The number of different assembly intermediates depends on the geometry and interactions of the nanocomponents; the size, orientation, and positioning of attachment patches; and the possible multiple internal conformations of individual nanocomponents. This isomerization complexity could even arise for dimers as noted in the previous section. The number of possible subassembly isomers could increase exponentially with the number of monomers M so that a theoretical framework for evaluating possible intermediate assemblies and pathways is of great interest. We illustrate below how criteria can be developed to ensure a more complete set of assembly intermediates, thereby leading to the discovery of key self assembly pathways. Through refinements of the algorithms for population counters, one may discriminate among a number of intermediate associations and eliminate anomalous behaviors in the associated matrix of diffusion factors.

To illustrate our pathway refinement strategy, consider a trimeric association. For a trimer, components may be arranged in a triangular versus a linear form. This can arise if each component has two sufficiently small and appropriately positioned attachment patches. Patch-to-patch interactions must have a range less than the monomer size if unbounded (i.e., supply-limited) aggregation is to occur. A possible counter for trimers constructed from identical components is X3=k>k>kθ(Rkk)θ(Rkk)θ(Rkk), where θ and Rkk are defined as for X2. For components with attachment patches positioned so as to inhibit thermally stable linear arrangements and allow only triangular ones, X3 will not count linear associations since they are not present in appreciable numbers. However, care must be taken not to count dimers that are actually part of a trimer or other more complex association.

For the general aggregate of type g, let Xg be a counter defined in analogy to that for dimers and trimers, but with additional constraints discriminating among various isomers and avoiding counting substructures of a larger structure. For example, trimer substructures of a four-cluster would not be included in X3. A promising direction for developing counters for more general assemblies is as follows. Let Xg> be a counter for all aggregates with g or more monomers. Then, a counter Xg for g-mers that does not incorrectly count substructures is given by Xg=Xg>-Xg+1>. This addresses the problem that while it is relatively easy to construct Xg>, constructing Xg directly is more difficult. Within the definition of Xg one can also add isomeric restrictions (e.g., a linear g-mer versus a globular one) via θ-factors on structural variables.

The joint probability Ψ(X, t) for X = {X1, X2, ···, XL} in a system supporting L types of assemblies is central to a stochastic theory of self-assembly. In the present framework, it is defined via Ψ (X, t) = ∫*Δ(XX*)ρ, where X_={X1,X2,,XL} is the set of counters evaluated at the all-atom state Γ* over which integration is taken and Δ is the L-fold product of delta functions. As in the derivation of (13) and (14), the Liouville equation implies

Ψt=-_J_ (16)
Jg=dΓΔ(X_-X_)i=1NpimiXgriρ. (17)

One might term Jg the “reaction flux” for creation of g-type assemblies. This result does not constitute a complete stochastic chemical kinetic theory of self-assembly unless it can be shown that J can be expressed as an explicit functional of Ψ alone, and not of the full N-atom density. As with the dimerization problem, this requires that the set of counters X be slowly varying in time and do not couple strongly to other slow variables. Counters can also be introduced to address spatially non-uniform systems by keeping track of X for each of a number of spatial compartments or by introducing order parameter field variables42,43.

To complete the derivation of an equation of stochastic population dynamics, the scaling arguments used for dimerization are introduced. Let the index g (=1,2 ···, L) label assembly types consisting of g monomers (i.e., isomeric multiplicity ignored). For a system with M monomer units, Xg can be at most M/g(≡ ε2g)−1. In analogy with the dimer case, one may define the scaled counter χg via χg= ε2 gXg. Proceeding in this way, all developments set forth for dimerization follow, and we arrive at the Smoluchowski equation

Ψt=χ_[D__(χ_-βf_)Ψ] (18)

that involves analogous diffusion coefficients D__. For extremely complex systems arising when L is large, say O(M), the development must be re-evaluated.

Langevin Simulation

Since it is not practical to solve the Smoluchowski equation directly, Langevin equations constitute a viable way to simulate stochastic processes. Using methods of stochastic calculus27 one may derive the following Langevin equations that are Monte Carlo-equivalent to our Smoluchowski equation:

-dχ_dt+α__f_+ξ_=0_ (19)

for the set of L scaled population counters X = {χ1, …, χL}. The matrix α__ is associated with the diffusion factors, 〈f〉 is the thermal-average force, and ξ is a random force. The development ultimately implies α__=βD__, and constrains the statistics of ξ by α__=β2-0dsξ_(0)ξ_(s)β2A__. Thus there is a constraint on random forces via the integrated random force correlation functions, i.e., the classic fluctuation-dissipation relation.

The Langevin formulation can be used to conduct practical simulations of self-assembly. As D__, 〈f〉, and the properties of the random noise ξ are determined by our statistical mechanical formulas, this constitutes a calibration-free kinetic theory of self-assembly. In broad-strike, the procedure is to construct the diffusion factors and thermal-average forces for given instantaneous population levels. These are then used to perform a Langevin time step. The atomistic state must be re-established at the new population levels so that the diffusion factors and thermal-average forces can be re-computed via molecular dynamics and the cycle is repeated. If the critical distance for interaction of the nanocomponents and the density of components are small, then the calculations of the thermal-average forces and diffusion factors can be carried out on relatively small all-atom subsystems, yielding a type of domain decomposition algorithm that can be readily parallelized. Thus, the interscale feedback of Fig. 1 implies a multiscale computational algorithm for simulating the self-assembly of nanocomponents into composite structures (e.g., see Fig. 3).

Fig. 3.

Fig. 3

(a) An example of order parameter time courses and (b) velocity auto-correlation functions for the OPs. Results show the absence of a long-time tail, and hence the lack of coupling to other slow variables not included in the set of OPs.

Summary and Conclusions

Smoluchowski equations for the stochastic dynamics of the population levels of intermediates along the pathway to self-assembly were derived. A necessary condition for the self-consistency of the theory is that counters for the number of each type of intermediate assembly must be sufficiently vague to ensure a statistically significant population level, but precise enough to discriminate among conformers to ensure that key pathways are not missed. A novel multiscale technique enables the derivation of Langevin-type stochastic population equations without the need to address memory kernels that arise in the projection operator method and are difficult to simulate practically. Necessary conditions for the validity of our stochastic population equations include slow evolution of the population counters relative to the timescale of atomic collisions/vibrations and the ability of counters to discriminate among key conformers. The counters must form a complete set - they cannot couple to slow variables omitted from the model. Violation of this completeness is indicated by the presence of long-time tails in the counter rate autocorrelation functions, i.e., the divergence of D__. As the theory holds the promise of being parameter-free (i.e., the rate coefficients and assembly pathways are constructed via first-principles molecular physics), our methodology is predictive and quantitative. Thus it could serve as the basis of a computer-aided self-assembly process design strategy. Similar comments hold for understanding natural processes such as viral self-assembly. To fully realize the benefits of our approach, a multiscale algorithm for Langevin simulations can be implemented, as briefly described in the previous section.

Acknowledgments

The first author appreciates support from the United States Naval Academy under a NARC grant and the National Science Foundation under grant number DMS-098413. The second author was supported in part by the National Science Foundation (Collaborative Research in Chemistry program), National Institute of Health (NIBIB), Department Of Energy (Office of Basic Science), METAcyt, and the Indiana University College of Arts and Sciences through the Center for Cell and Virus Theory.

Contributor Information

Stephen D. Pankavich, Department of Mathematics, United States Naval Academy, Annapolis, MD 21402.

Peter J. Ortoleva, Center for Cell and Virus Theory, Department of Chemistry, Indiana University, Bloomington, IN 47405

Literature Cited

  • 1.Whitesides G, Boncheva M. Proc Nat Acad Sci. 2002;99:4769–4774. doi: 10.1073/pnas.082065899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Gracias D, Tien J, Breen TL, Hsu C, Whitesides GM. Science. 2000;289:1170–1172. doi: 10.1126/science.289.5482.1170. [DOI] [PubMed] [Google Scholar]
  • 3.Alberts B, Bray D, Lewis J, Raff M, Roberts K, Watson JD. Molecular Biology of the Cell. Garland; New York: 1994. [Google Scholar]
  • 4.Whitesides G, Grzybowski B. Science. 2002;295:2418–2421. doi: 10.1126/science.1070821. [DOI] [PubMed] [Google Scholar]
  • 5.van der Schoot P, Zandi R. Phys Bio. 2007;4:296–304. doi: 10.1088/1478-3975/4/4/006. [DOI] [PubMed] [Google Scholar]
  • 6.Zlotnick A, Stray SJ. Trends in Biotechnology. 2003;21:536–542. doi: 10.1016/j.tibtech.2003.09.012. [DOI] [PubMed] [Google Scholar]
  • 7.Zlotnick A, Aldrich R, Johnson J, Ceres P, Young M. Virology. 2000;277:450–456. doi: 10.1006/viro.2000.0619. [DOI] [PubMed] [Google Scholar]
  • 8.Zlotnick A, Johnson J, Wingfield P, Stahl S, Endres D. Biochemistry. 1999;38:14644–14652. doi: 10.1021/bi991611a. [DOI] [PubMed] [Google Scholar]
  • 9.Zlotnick A. Journal of Molecular Recognition. 2005;18:479–490. doi: 10.1002/jmr.754. [DOI] [PubMed] [Google Scholar]
  • 10.McQuarrie DA. Statistical Mechanics. Harper Collins; NY: 1976. [Google Scholar]
  • 11.Mori H. Progress in Theoretical Physics. 1965;34:423–455. [Google Scholar]
  • 12.Zwanzig R. In: Systems Far From Equilibrium. Garrido L, editor. Interscience; New York: 1980. pp. 198–221. [Google Scholar]
  • 13.Chandrasekhar S. Astrophys J. 1943;97:255–262. [Google Scholar]
  • 14.Bose S, Ortoleva P. J Chem Phys. 1979;70:3041–3056. [Google Scholar]
  • 15.Bose S, Ortoleva P. Physics Letters A. 1979;69:367–369. [Google Scholar]
  • 16.Bose S, Bose S, Ortoleva P. J Chem Phys. 1980;72:4258–4263. [Google Scholar]
  • 17.Bose S, Medina-Noyola M, Ortoleva P. J Chem Phys. 1981;75:1762–1771. [Google Scholar]
  • 18.Deutch JM, Oppenheim I. Faraday Discuss Chem Soc. 1987;83:1–20. [Google Scholar]
  • 19.Ortoleva P. Nonlinear Chemical Waves. John Wiley and Sons; New York: 1992. [Google Scholar]
  • 20.Shea JE, Oppenheim I. J Phys Chem. 1996;100:19035–19042. [Google Scholar]
  • 21.Peters MH. J Chem Phys. 1998;110:528–538. [Google Scholar]
  • 22.Peters MH. J Stat Phys. 1999;94:557–586. [Google Scholar]
  • 23.Coffey WT, Kalmykov Y, Waldron J. The Langevin Equation With Applications to Stochastic Problems in Physics, Chemistry and Electrical Engineering. World Scientific Publishing; River Edge, NJ: 2004. [Google Scholar]
  • 24.Ortoleva P. J Phys Chem. 2005;109:21258–21266. doi: 10.1021/jp051381b. [DOI] [PubMed] [Google Scholar]
  • 25.Miao Y, Ortoleva P. J Chem Phys. 2006;125:044901. doi: 10.1063/1.2218838. [DOI] [PubMed] [Google Scholar]
  • 26.Miao Y, Ortoleva P. J Chem Phys. 2006;125:214901. doi: 10.1063/1.2400858. [DOI] [PubMed] [Google Scholar]
  • 27.Pankavich S, Miao Y, Ortoleva J, Shreif Z, Ortoleva P. J Chem Phys. 2008;128:234908. doi: 10.1063/1.2931572. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Pankavich S, Shreif Z, Ortoleva P. Physica A. 2008;387:4053–4069. [Google Scholar]
  • 29.Shreif Z, Ortoleva P. J Stat Phys. 2008;130:669–685. [Google Scholar]
  • 30.Shreif Z, Pankavich S, Ortoleva P. Phys Rev E. 2009;80:031703. doi: 10.1103/PhysRevE.80.031703. [DOI] [PubMed] [Google Scholar]
  • 31.Shreif Z, Ortoleva P. Physica A. 2009;388:593–600. [Google Scholar]
  • 32.Pankavich S, Shreif Z, Miao Y, Ortoleva P. J Chem Physics. 2009;130:194115. doi: 10.1063/1.3134683. [DOI] [PubMed] [Google Scholar]
  • 33.Pankavich S, Ortoleva P. J Math Phys. 2010;51:063303. doi: 10.1063/1.3420578. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Shea JE, Oppenheim I. Physica A. 1997;247:417–443. [Google Scholar]
  • 35.Morgado WAM, Oppenheim I. Phys Rev E. 1997;55:1940–1945. [Google Scholar]
  • 36.Morgado WAM, Oppenheim I. Physica A. 1997;246:547–562. [Google Scholar]
  • 37.Harries D, May S, Gelbart W, Ben-Shaul A. Biophys J. 1998;75:159. doi: 10.1016/S0006-3495(98)77503-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Arkhipov A, Freddolino P, Schulten K. Structure. 2006;14:1767–1777. doi: 10.1016/j.str.2006.10.003. [DOI] [PubMed] [Google Scholar]
  • 39.Chang R, Ayton G, Voth G. J Chem Phys. 2005;122:244716. doi: 10.1063/1.1931651. [DOI] [PubMed] [Google Scholar]
  • 40.Ayton G, Voth G. Journal of Structural Biology. 2007;157:570–578. doi: 10.1016/j.jsb.2006.10.020. [DOI] [PubMed] [Google Scholar]
  • 41.Zwanzig R. Nonequilibrium Statistical Mechanics. Oxford University Press; United Kingdom: 2001. [Google Scholar]
  • 42.Shreif Z, Adhangale P, Cheluvaraja S, Perera R, Kuhn R, Ortoleva P. Lecture Notes in Computational Science and Engineering. Vol. 15. Springer; Netherlands: 2008. pp. 363–380. [Google Scholar]
  • 43.DelleDonne M, Ortoleva P. J Stat Phys. 1979;20:473–486. [Google Scholar]

RESOURCES