Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2014 Feb 15.
Published in final edited form as: Physica A. 2013 Feb 15;392(4):628–638. doi: 10.1016/j.physa.2012.10.005

Variational methods for time-dependent classical many-particle systems

Yuriy V Sereda 1, Peter J Ortoleva 1,*
PMCID: PMC3580877  NIHMSID: NIHMS414870  PMID: 23459064

Abstract

A variational method for the classical Liouville equation is introduced that facilitates the development of theories for non-equilibrium classical systems. The method is based on the introduction of a complex-valued auxiliary quantity Ψ that is related to the classical position-momentum probability density ρ via ρ = Ψ*Ψ. A functional of Ψ is developed whose extrema imply that ρ satisfies the Liouville equation. Multiscale methods are used to develop trial functions to be optimized by the variational principle. The present variational principle with multiscale trial functions can capture both the microscopic and the coarse-grained descriptions, thereby yielding theories that account for the two way exchange of information across multiple scales in space and time. Equations of the Smoluchowski form for the coarse-grained state probability density are obtained. Constraints on the initial state of the N-particle probability density for which the aforementioned equation is closed and conserves probability are presented. The methodology has applicability to a wide range of systems including macromolecular assemblies, ionic liquids, and nanoparticles.

Keywords: variational principle, multiscale analysis, non-equilibrium systems, N-particle probability density, Liouville equation, coarse-grained variables

1. Introduction

Variational principles have been used in many areas of science and engineering [1]. Here, a classical nonequilibrium variational method is introduced to facilitate the analysis of the Liouville equation (LE). Its use is demonstrated in deriving stochastic equations for coarse-grained (CG) variables characterizing long-scale dynamics. A methodology based on multiscale perturbation analysis is provided for constructing trial functions to be optimized via the variational principle.

Variational approaches to quantum and classical systems include the following. The entropy maximum principle is used to construct the equilibrium probability [2]. For the stationary states of quantum systems, the energy-minimum principle is used [3, 4]. The least-action principle is used for time-dependent pure-state quantum systems [5]. For nonequilibrium reacting systems described by a master equation, an evolution criterion has been discussed [6]. Variational approaches based on the maximum entropy production hypothesis have been presented [710]. For example, the Onsager-Machlup approach [10, 11] is based on a phenomenological maximum entropy production principle; it appears to be appropriate for systems which are close to equilibrium.

A variational method for the LE was developed in the 1970s by Gross; it was cast in terms of the Laplace transform of the N-particle density ρ [12]. The functional considered is related to correlation functions of interest. The stationary values of the constructed functionals implied the LE exactly for ρ when no restrictions on the trial function are imposed. The trial functions used were taken in successive approximations as the sum of one-body, then two-body, etc., terms [13]. It was noted that there is a multiplicity of alternative variational functionals whose extrema satisfy the LE [12]. While this multiplicity of functionals raises the question of the physical motivation for choosing one over another, it suggests that this may enable one to gain experience in choosing functionals that yield results that are optimal for addressing a given problem, e.g., for constructing correlation functions of a given type. In the present formulation, a functional is introduced that involves a complex-valued quantity Ψ related to the N-particle density ρ via ρ = Ψ*Ψ. The complex valuedness of the auxiliary function Ψ in effect introduces two N-particle densities, and provides a self-consistent way to construct both of them. In the Gross formulation, a second function in addition to the N-particle density is needed to formulate the variational principle. This function was chosen to be a specific modification of ρ (namely, a momentum reversed density). Here, the second function is simply the complex conjugate of the first, and the variational principle implies equations for both.

The Martin, Siggia, Rose method [1416] starts with a coarse-grained statistical description that follows from renormalization group arguments. The present approach starts with a variational principle built on the original (not renormalized) LE. Trial functions are then used that are inspired by multiscale perturbation theory [17]. Such trial functions will be shown to account for the coupling of processes across scales in space and time that underlies the dissipative phenomena of interest in nonequilibrium systems.

The formulation proposed here is suggested by analogy with the quantum least-action principle. The quantum principle is based on a functional whose extrema correspond to the solutions of the time-dependent Schrödinger equation [5]. The quantum action takes the form of an expectation value of iħ∂ / ∂tH, for Hamiltonian H. Here, a similar quadratic functional is constructed for the classical LE. To achieve this, a wave function-like quantity Ψ is introduced. It is an essential feature of the present variational method that this auxiliary function Ψ is complex-valued. In effect, ReΨ is found to serve as a test function [18] for ImΨ, and conversely. The use of this variational principle in deriving stochastic equations of CG dynamics is discussed based on multiscale perturbation theory-inspired trial functions.

Multiscale perturbation methods [19, 20] yield Smoluchowski or other equations for the stochastic dynamics of the probability density of a set of coarse-grained variables [2023], the latter describing the larger-scale features of a classical N-particle system. These methods account for the coupling of processes across multiple scales in space and time. They follow a long tradition of multiple time and length scale approaches to many-particle quantum [24] and classical [19, 21, 22, 25, 26] systems. Multiscale perturbation approaches for classical systems have been validated via, e.g., simulation of viral capsids [27, 28], RNA [20], and virus-like particles [26, 29]. A necessary condition for their validity is that correlation functions of the CG momenta do not display long-time tails [20]. The variational method presented here provides a framework to extend/utilize perturbation theory in a manner similar to that of other variational approaches.

A variety of CG variables have been used as a starting point of a multiscale perturbation approach [20, 3035]. It is hoped that reexamination of these results by using them to construct trial functions for the present variational method will facilitate the development of novel stochastic equations for CG dynamics. All methods based on the identification of slow modes must address the requirement for the existence of a timescale gap between the slow and fast modes of an N-particle system. One may proceed by assuming the existence of this gap between the fastest modes and the quasi-continuum of slower ones [36], and then justifying it a posteriori. The latter justification is achieved via comparison with experimental or computer molecular dynamics results, or through the identification of self-consistency criteria. Examples of the latter are as follows.

  • CG momenta correlation functions should not have long-time tails since the latter indicate coupling to slow modes not included in the set considered.

  • Emergence of large differences between the all-atom configuration generated coherently through the CG variables versus that from MD computations may be used to identify additional modes to be included [20]. In developing a multiscale approach to classical nanosystems, it was shown that certain CG variables could be introduced in a manner that enables the automated detection of key missing CG variables, and provides an explicit algorithm for relating these otherwise hidden variables to the underlying N-particle description [20].

  • Straightforward use of multiscale perturbation theory may lead to non-conserving equations for a reduced probability density. Certainly, one does not expect that conserving equations should follow for an arbitrarily chosen set of slow variables given any initial statistical state. This is addressed in the context of the present variational principle by identifying initial statistical states which lead to a conserving equation for the reduced probability of the CG state.

These considerations can also be applied to the projection operator methods [19, 21, 25]. For example, if key CG variables are missing then memory functions have long time tails.

The variational principle introduced here has the following features (those indicated with * are unique to our approach).

  • *

    A quadratic functional of a complex-valued quantity is introduced in analogy with the quantum least-action principle and has symmetry properties that greatly facilitate variational calculations.

  • Extrema of the proposed functional satisfy the LE.

  • *

    A remote time factor is introduced to ensure that the operator ∂ / ∂t is skew Hermitian.

  • *

    Multiscale perturbation theory is used to inspire trial functions.

  • *

    Intuitive arguments starting with multiscale mean-field trial functions provide a way to develop novel kinetic theories of nonequilibrium classical systems. These properties will be explored via several illustrative examples.

Closure has been a long standing issue, e.g., for the BBGKY hierarchy [37]. A method for discovering a closed equation for the probability of the CG state is presented. It follows from a detailed examination of the initial statistical state’s influence on the dynamics of the N-particle probability density. A variational principle is used to generate an approximation for the N-particle density.

The proposed variational method is formulated and demonstrated via simple Vlasov approximation, which describes the inertia-dominated, frictionless dynamics of N identical particles (Section 2). A multiscale perturbation scheme for generating trial functions is presented (Section 3). A Smoluchowski equation for a set of coarse-grained variables is derived via the variational principle, and conditions on the initial N-particle statistical state for its validity are obtained (Section 4). The presentation is concluded with a prospective on the use of the variational approach for deriving kinetic equations for nonequilibrium classical systems via the use of perturbation theory-inspired trial functions (Section 5).

2. The variational principle: definitions, properties, and simple illustration

2.1. Formulation

The LE for a classical N -particle system ∂ρ / ∂t = ℒρ yields the evolution of the position-momentum density ρ, and the Liouville operator ℒ is given by

==1N[pm·r+F·p]. (1)

Analysis of this equation is challenging due to the complexity of the structure of ρ since it subsumes a range of phenomena from continuous processes like diffusion to discontinuous ones like shock waves. The objective here is to develop a variational approach that facilitates the analysis of the LE based on approximations inspired by intuition and perturbation methods.

The first step in the development is to introduce a complex-valued auxiliary quantity Ψ related to ρ via

ρ=Ψ*Ψ. (2)

Introduction of ψ enables the construction of a quadratic functional C*, Ψ) which has a similar form to the action of quantum theory [5]. By design, the structure of this functional is chosen such that its extrema occur at Ψ that imply the classical LE for ρ, and which has convenient symmetry properties.

Consider the functional C defined such that

C=idtdΓeηt2Ψ*[/t]Ψ. (3)

Here, Γ is a point in 6N-dimensional particle position-momentum space, η is a positive infinitesimal constant, the range of t goes from the remote past to the remote future, while the range of Γ is determined by constraints on the system (e.g., a closed volume). The factor e−ηt2 insures that after integration by parts of a ∂ / ∂t term, the boundary values at t → ±∞ are zero; this remote-time repression factor ensures that ∂ / ∂t is skew Hermitian. With this, C is found to be real and nonnegative.

Finding extrema of C with respect to the complex function Ψ is equivalent to doing so simultaneously for the real and imaginary parts of Ψ. This is also equivalent to minimizing C with respect to Ψ and Ψ* themselves, as pointed out in the 1930s in the context of the quantum least-action principle [12, 38]. Such symmetry is built in the variational principle discussed below. With this, one may compute the functional derivative δC / δΨ* to arrive at an equation for Ψ, and similarly for taking the functional derivative with respect to Ψ to obtain an equation for Ψ*. With this, the variational equation δC / δΨ* = 0 implies the LE for Ψ,

Ψ/t=Ψ. (4)

Recalling that ρ = Ψ*Ψ, the dynamical equation for Ψ (and a similar one for Ψ*) imply that ρ satisfies the LE. This is an advantage over other approaches which start with the renormalized coarse-grained equations [1416], and not with the full LE. The complex auxiliary function formulation has advantages over other approaches [12, 13] where a second probability density is introduced that is arbitrarily taken to be a time-reversed solution of the LE. In the present formulation no such assumption is needed,– a factor that has particular advantages when constructing and using trial functions for optimization (as in Section 4).

2.2. Necessity of complex-valuedness of the auxiliary function

The above development does not yet clarify the importance of introducing Ψ as a complex-valued function. To do so, consider finding the extrema of C. If the auxiliary function Ψ is real, the variational equation δC / δΨ (Γ, t) = 0 based on (3) yields no information, as follows. If Ψ equals the real-valued function A, it follows that

C=i2dtdΓeηt2[/t]A2. (5)

The fact that ℒ only involves first-order derivatives with respect to Γ, and that A vanishes at the boundaries in Γ-space, imply the integrated ℒA2 term of (5) is zero. Since ∫ dΓA2 can be shown to be constant, it follows that dΓtA2=0. Thus, seeking the extrema of C when Ψ is real yields no information. However, since Ψ is complex-valued, the extrema of C are nontrivial, as follows.

Let Ψ = A + iB for independent real functions A(Γ, t) and B(Γ, t). With this, C can be thought of as a functional of A and B. Letting Ω = ∂ / ∂t − ℒ, one obtains

C(A,B)=dtdΓeηt2[i(AΩA+BΩB)BΩA+AΩB]. (6)

The AΩA and BΩB terms yield no information when seeking extrema of C with respect to A or B, as can be shown using arguments similar to the case when Ψ is real. The −BΩA term is equal to the AΩB term, as can be shown using integration by parts and the fact that AB vanishes at the boundaries of the Γ domain. With this, one finds that δC / δA = 0 implies that B obeys the LE. Similarly, setting δC / δB to zero implies that A satisfies the LE. Thus, when Ψ is complex-valued the variational principle yields nontrivial results. A similar conclusion can be derived using a polar representation (see Appendix).

2.3. Relation to the Gross formulation

Consider a reformulation of the functional C into Fourier representation. The Fourier transform Ψ̅ is related to Ψ via

Ψ(Γ,t)=12πdωeiωtΨ̅(Γ,ω). (7)

With this, C (3) can be written as a functional of Ψ̅:

C=i(2π)2dωdω'dtdΓeηt2ei(ωω')tΨ̅*(Γ,ω')[iω]Ψ̅(Γ,ω). (8)

For infinitesimal η

12πdteηt2ei(ωω')t=δ(ωω'), (9)

and hence

C=i2πdωdΓΨ̅*[iω]Ψ̅. (10)

This functional looks similar to that of Gross except that here (1) the objective is to construct both the ReΨ̅ and ImΨ̅ via a unified approach, i.e., finding the extrema of C with respect to Re Ψ̅ and ImΨ̅ rather than focusing on the N-particle density and its momentum-reversed image, and (2) the integration over ω is not considered in the Gross formulation and the present representation is in Fourier and not Laplace transform. However, the results of Sections 3 and 4 suggest that it is convenient to adopt a time-domain formulation when using multiscale perturbation methods to generate trial functions. Finally, the time-domain ρ is related to Ψ via the product rule (ρ = Ψ*Ψ), while for the Fourier transform ρ̅ ≠ Ψ̅*Ψ̅ but is related to Ψ̅ via a convolution. Again, this suggests the convenience of working in the time-domain. These advantages are illustrated more specifically in Sections 3 and 4.

To further explore the relation of the present approach to that of Gross, first let and be the set of particle positions and momenta, respectively. The method of Gross is equivalent to the present one when ImΨ(, ;t) = ReΨ(,−;−t). If this ReΨ is a solution of the LE then so is ImΨ. In contrast, the present formulation is less restricted, i.e., the only criterion is that ReΨ and ImΨ may not be simply related via constant of proportionality. The assumption that ImΨ is the momentum-time reversed ReΨ is restrictive when, e.g., developing a trial function for optimization by the present variational method.

2.4. Simple mean-field trial function: the Vlasov equation

To illustrate that the variational procedure of Section 2.1 allows one to recover a familiar result, consider a mean-field approximation for a collection of N identical particles interacting with pairwise forces. Stating the classic mean-field approximation in the present notation, Ψ(Γ, t) is postulated to be a product of single-particle functions ψ(r⃑, p⃑,t). In this case, the trial function, denoted T, is given by

T==1Nψ(r,p,t). (11)

With this, the functional C (3) takes the form

C=idtdΓeηt2=1Nψ*(r,p,t)[/t]'=1Nψ(r',p',t). (12)

A dynamical equation for ψ is then obtained from the extremum condition

δC/δψ*(r,p,t)=0. (13)

The result is

ψt=pm·ψrF·ψp. (14)

The mean-field force F⃑ is given by

F(r,t)=(N1)ϑrd3,r'd3p'|ψ(r',p',t)|2u(r,r'), (15)

where u (r⃑, r⃑') is the pair potential and

ϑ=d3rd3p|ψ(r,p,t)|2. (16)

The norm ϑ can be shown to be independent of time for finite systems assuming that ψ = 0 at the boundaries of r⃑, p⃑ space.

The kinetic equation (14), rewritten for the single-particle density |ψ|2, is the Vlasov equation from plasma theory [39]. The only damping is of the Landau type and is, therefore, not appropriate for liquids.

3. Multiscale perturbation theory-inspired trial functions for dissipative phenomena

The mean-field trial function used in Section 2.4 generated the Vlasov equation upon use of the variational principle. However, the Vlasov equation does not provide a realistic account of dissipative phenomena in a condensed system. Here, multiscale perturbation theory is used to generate trial functions which are rich enough to capture dissipative effects that dominate most phenomena in classical liquids. A further advantage of using trial functions inspired by multiscale perturbation theory is that they incorporate a degree of physical intuition into a formulation via the specific choice of CG variables used.

There is often a smallness parameter ε that naturally emerges when computing the rate of change of CG variables. Typically, ε is a dimensionless ratio of characteristic masses, lengths, or force constants. For simplicity, it is assumed that there is a regular progression of times tn = εnt that characterize all the relevant timescales displayed by a system of interest. These timescales are O (εn) (n = 0,1, ⋯). Consider a set Φ of CG variables which, by construction, evolve on timescales much greater than that for individual particle collisions/vibrations [20, 40]. With this, the multiscale ansatz is made

Ψ=Ψ(Γ,Φ;t0,t¯;ε), (17)

where = {t1,t2,⋯} is the set of long time variables. Using the LE for Ψ (4), the ansatz (17), and the chain rule, one obtains

n=0εnΨ/tn=(0+ε1+)Ψ. (18)

The operators ℒn follow from ℒ and the Γ -dependence of the NCG CG variables Φ. Specifically, ℒ0 has the same form as ℒ but the Γ -derivatives are at constant Φ, while the ℒn (n > 0) involve Φ-derivatives at constant Γ. This framework is not a violation of the number of degrees of freedom; rather, it simply reflects the ansatz (17) on the multiple dependencies of Ψ on Γ [20, 41]. The resulting formulation allows for the construction of these dependencies via a perturbation scheme valid when ε is small.

To be more specific, consider the set Φ of CG variables introduced earlier [41]. Assuming all these CG variables evolve on about the same timescale, then the form of the multiscale LE (18) holds with

1=Π·Φ, (19)

where the notation

A·Bk=1NCGAkBk (20)

Note that ℒ0 and ℒ1 operate on the space of functions that depend on both Γ and Φ, and k labels the CG variables. The CG variable velocity Πk is given by

Πk=Φk. (21)

Next, the smallness of ε is used to construct Ψ as a perturbation expansion

Ψ=Ψ0+εΨ1+. (22)

Placing this expansion in (18), and collecting terms to each order in ε, equations for the coefficients Ψn of the expansion are obtained.

As with the analysis of the N -particle probability density ρ as earlier [20, 27, 3235, 4143], one may construct Ψ0 under conditions where it evolves slowly (i.e., is independent of t0). Thus, to lowest order,

0Ψ0=0. (23)

This equation admits the solution

Ψ0=Ψ̂(Γ,Φ)ϒ(Φ,t¯). (24)

Since Ψ̂*Ψ̂ is the conditional probability for Γ given Φ (in analogy with earlier studies [20, 27, 3235, 4143]), Ψ̂ is determined via the entropy maximum principle. For isothermal conditions, this implies

Ψ̂=eβH/2/Q1/2,Q=dΓ̃Δ(ΦΦ̃(Γ))eβĤ. (25)

Here, ˜ indicates evaluation at Γ̃ over which integration is taken. The factor Δ is a product of Gaussian-like functions (one for each of the NCG CG variables in the set Φ) that favors states Γ̃ for which the CG variables have values near Φ. Since Ψ̂*Ψ̂, and not Ψ̂, is a probability, the entropy maximization principle was used to construct Ψ̂*Ψ̂, and not Ψ̂ itself.

Next, one collects terms to first order in ε in (18) to find

[/t00]Ψ1=1Ψ0. (26)

Removal of secular behavior [44] in Ψ1 implies ∂ϒ / ∂t = 0 ; this results in the absence of a ∂Ψ0 / ∂t1 term in (26). Collecting the above results yields

Ψ1=e0t0Ψ1o(Γ,Φ,)+Ψ̂t00dt'0e0t'0k=1NCGΠk{β2fkϒϒΦk}. (27)

Here fk is the thermal-average force:

fk(Φ)=F/Φk. (28)

The Helmholtz free energy F is related to the partition function Q via

Q(Φ)=eβF. (29)

The value of Ψ1 at t0 = 0 is denoted Ψ1o.

The integral term in Ψ1 contains the timecourse of a fluctuating microscopic variable, i.e., the momentum Πk (21) conjugate to the CG variable Φk. The exchange of information between particle-like and CG variables contained in Ψ1 is the source of dissipative effects. Thus, including them in a trial function for condensed systems should overcome some of the shortcomings of the Vlasov equation (Section 2.4) as developed in the next section.

4. Equations for stochastic CG dynamics derived from the variational principle

4.1. General considerations

The variational principle of Section 2 opens the way to use the multiscale perturbation theory of Section 3 to generate trial functions and then optimize them. This allows one to go beyond perturbation methods by constructing trial functions such as those involving mean-field approximations. In this section this theme is developed by taking the O(ε) results from Section 3 as the trial function to be optimized. First consider some general issues.

Let T be a trial function. Then the functional (3) takes the form

C(T,T*)=idtdΓeηt2T*[/t]T=idtdΓeηt2T[/t]T*. (30)

The second result is implied by the first upon using integration by parts and the fact that the positive infinitesimal η removes the remote time boundary terms. These expressions for C show that δC / δT* and δC / δT are related via

δC(T,T*)/δT=δC(T*,T)/δT*. (31)

This symmetry rule enables one to avoid computing derivatives with respect to both T and T*, thereby reducing the number of calculations. This is a consequence of the present complex formulation of the variational principle. This result is used to facilitate obtaining the kinetic equation for a CG probability density in Section 4.5.

Note that when T is an exact solution of the LE, C(T,T*) = 0. In this way, the closeness of C to zero provides an assessment of the quality of a variational solution.

4.2. Multiscale trial functions

Consider trial functions with the multiscale dependence T (Γ,Φ;t0,t2;ε) as suggested by the O(ε) multiscale development of Section 3. This is to suggest that T depends on Γ both directly and, via a set of CG variables Φ, indirectly. Using the chain rule and introducing an expression for the CG variables as functions of Γ (denoted Φ̃ (Γ)), one has ℒT = ℒ0T+εℒ1T, where ℒ0 and ℒ1, as in Section 3, act on quantities which are functions of both Γ and Φ. With the above, the action C for multiscale trial functions may be written in the convenient form

C(T,T*)=idtdt0dt2dΓdΦeηt2Δ(ΦΦ̃(Γ))δ(t0t)δ(t2ε2t)T*[/t0+ε2/t20ε1]T, (32)

where Δ is defined above, after Eq. (25); δ(tn − εnt) indicates the Dirac delta function. In the above, the dependence of T and T* on Γ,Φ,t0,t2 and ε is implied.

Take the trial function T to be given by the series (22) truncated to a given order in ε. If the truncation is to O(ε), then the CG factors are ϒ and ϒ* (see (22), (24), and (27)). With this, one can use the variational principle to find a pair of equations to determine ϒ and ϒ*, i.e., δC / δϒ = 0, δC / δϒ = 0. These two equations obey a symmetry rule analogous to (31).

For the choice T = Ψ0 of (24) one has ℒ0T = 0 as in (23). This implies that

C(ϒ,ϒ*)=iεdtdt0dt2dΓdΦeηt2Δ(ΦΦ̃(Γ))δ(t0t)δ(t2ε2t)Ψ̂ϒ*1(Ψ̂ϒ). (33)

The following facts can be used to simplify the expression for C(T,T*): (1) Ψ̂ is even in the particle momenta; (2) the CG momenta Πk (see (19) and (21)) are odd in the individual particle momenta; and (3) ϒ is independent of the particle momenta. As in Section 3, ∂Ψ0 / ∂t = 0 ; thus, for the choice T = Ψ0 from (26) it follows that C = 0 and there is no useful information for optimizing ϒ with this choice of trial function. Hence, a higher-order trial function is now considered.

Next, take the trial function to be the series (22) truncated to O(ε) :

T=Ψ0+εΨ1. (34)

The ϒ and ϒ* dependence of T is embedded in Ψ1o or explicitly provided as in (27). Since Ψ1 obeys (26), it is seen that [∂ / ∂t0 − ℒ01 − ℒ1Ψ0 term does not contribute to C. Assuming Ψ1o/t1=0 (which is found to be self-consistent a posteriori), for the trial function (34) C takes the form

C(ϒ,ϒ*)=iε2dtdt0dt2dΓdΦeηt2Δ(ΦΦ̃(Γ))δ(t0t)δ(t2ε2t)Ψ0*{Ψ0/t21Ψ1}. (35)

4.3. Reduced probability density

The main objective of the remainder of this section is to develop an approximate equation for the reduced probability W(Φ,t). By definition,

W(Φ,t)=dΓΔ(ΦΦ̃(Γ))ρ. (36)

To lowest order in the multiscale perturbation scheme, ρ → Ψ2ϒ*ϒ, since Ψ̂ can be taken to be real-valued. With this, W → ϒ*ϒ as ε → 0. Hence, to lowest order, W is a function of t2 and thus

W/t2=ϒ*ϒ/t2+ϒϒ*/t2. (37)

With this and an approximate equation for ϒ, one can construct a kinetic equation for W, as follows.

4.4. Variational calculus for multiscale trial functions

To facilitate the use of multiscale perturbation theory-inspired trial functions to construct a kinetic equation for W, consider the following device. Define a quantity U which is a typical term encountered when constructing C during optimization of trial functions derived from multiscale perturbation theory:

U(ϒ,ϒ*)=dtdt0dt2dΓdΦeηt2Δ(ΦΦ̃)δ(t0t)δ(t2ε2t)ϒ*X(Γ,Φ,t0,t2;ϒ), (38)

where X is a function of Γ, Φ, t0 and t2, and is a functional of ϒ(Φ,t2). The goal is to compute the functional derivative of U with respect to ϒ*(Φ,t2). First carry out the straightforward integration with respect to t, leaving only t0 and t2 time integrations. The remote time factor e−ηt2 requires special attention since it will be most convenient to express the integrand of (38) only in terms of t2. To this end, rewrite it as e−ηt22 / ε4. Since η is smaller than the square of the longest inverse time, and this time is O−2), then it is convenient to write η = ε4η'. With this,

U(ϒ,ϒ*)=dt2dΦϒ*eη't22Ũ(Φ,t2;ϒ),Ũ(Φ,t2;ϒ)=dt0dΓΔ(ΦΦ̃)δ(t2ε2t0)X(Γ,Φ,t0,t2;ϒ). (39)

Thereby, t0 and Γ integrations can be folded into the surviving Φ, t2 integrations so that the remaining functional dependence on ϒ* becomes transparent, as follows. Since X in (38) is independent of ϒ*, then the functional derivative of U with respect ϒ* follows by inspection:

δU/δϒ*=eη't22Ũ(Φ,t2;ϒ). (40)

4.5. Kinetic equations

Using the trial function (34) and the technique of Section 4.4, a kinetic equation for ϒ is implied by the extrema of C, as follows from setting the functional derivative of C with respect to ϒ* to zero. In light of Section 4.2 and the expression for action (35), the variational problem can be cast in terms of the analysis of Section 4.4 with X of (38) given by

X(Γ,Φ,t0,t2;ϒ)=iε2Ψ̂*{Ψ0/t21Ψ1}, (41)

with Ψ0 and Ψ1 as in (24) and (27), and the operator ℒ1 as in (19). Since ε «1, the lower limit of the integration in (27) can be set to −∞ and the kinetic equation for ϒ becomes

ϒt2=k,k'Dkk'[Φkβ2fk][Φk'β2fk']ϒδZδϒ*, (42)
Z(ϒ)=kdΦdt2eη't22ϒ*dΓΔ(ΦΦ̃)Ψ̂*ΠkΨ1o(Γ,Φ;ϒ)Φk. (43)

In obtaining (42) the following was used:

Ψ̂Φk=β2fkΨ̂. (44)

The thermal-average force fk is given by (28). The diffusivities Dkk' are related to correlation functions of the momenta conjugate to Φ via

Dkk'=0dt'0dΓΔ(ΦΦ̃)Ψ̂2Πke0t'0Πk'=0dt'0Πke0t'0Πk'. (45)

The notation 〈⋯〉 implies a Ψ̂2 -weighted, Φ -constrained phase space average:

=dΓΔ(ΦΦ̃)Ψ̂2(). (46)

The initial term Ψ1o is folded into the Z term. In the above, it was assumed Ψ1o is independent of ϒ* and lies in the null space of ℒ0, as is shown below.

Using the symmetry rule (31), one may obtain a dynamical equation for ϒ* as the complex conjugate of (42). Inserting the equations for ϒ and ϒ* in (37) yields

Wt2=ϒZϒϒ*Zϒ*k,k'(Φk(β2fk'{Dkk'+Dk'k}W)+ϒ2(Dkk'ϒ*)ΦkΦk'Dkk'ϒ*2ϒΦkΦk'), (47)

a kinetic equation for the CG probability density W. Defining G such that,

Ψ1o=Ψ̂G(Φ;ϒ), (48)

yields

Z=kdΦdt2eη't22ϒ*ΠkGΦk. (49)

4.6. Closure of the kinetic equation for W : identifying initial statistical states leading to Smoluchowski behavior

For a closed system, the N-particle position-momentum density ρ maintains normalization, as can be verified via integration of the LE over all Γ. This does not guarantee that conservation is maintained for the reduced density W of (36). It is shown here that a closed, conserving equation for W does exist under appropriate restrictions on the initial statistical state. Earlier, a conserving and closed equation of the Smoluchowski form was derived for stochastic CG dynamics starting with the LE and using a multiscale perturbation method [20, 32, 33, 41]. Here, we investigate the emergence of the Smoluchowski equation for W that follows from a particular class of initial data Ψ1o.

Consider a condition on the initial statistical state for which W (36) displays Smoluchowski behavior. In the multiscale framework, this state (denoted ρo) is given by

ρo(Γ)=ρ(Γ,Φ̃(Γ);tn=0 with n=0,1,;ε). (50)

To O(ε), the initial state of the associated auxiliary function is given by

Ψo=Ψ̂ϒ(Φ(Γ),t2=0)+εΨ10(Γ,Φ(Γ),t2=0). (51)

With this, the initial statistical state question reduces to a consideration of Ψ1o for which W satisfies a Smoluchowski equation.

The rate equation for W (47) can be reduced to Smoluchowski form for a specific choice of the initial term Ψ1o. In the present notation, the Smoluchowski equation has the form

Wt2=k,k'Φk(Dkk'[Φk'βfk']W). (52)

From the comparison of (47) and (52) one obtains the following condition on the initial term

ϒδZδϒϒ*δZδϒ*=k,k'(Φk([Φk'β2fk'](Dkk'W)+ϒDkk'ϒ*Φk')+Φk'(β2fkDkk'Wϒ*Dkk'ϒΦk)). (53)

Since the LHS of (53) involves an integral over Γ, there is a rich class of initial statistical states Ψ1o satisfying this restriction. The above development opens the way to understanding the relationship between the initial statistical state and the closure of a reduced description. Non-closure of the kinetic equation for W arises due to the incompleteness of a chosen set of CG variables. The existence of slow variables not included in the description would manifest itself in long-time behaviors of the correlation functions underlying the diffusion coefficients (45).

That this result was also obtained using multiscale perturbation theory [45], provides a validation for the variational approach. The above demonstrates how the variational approach can yield closed conserving equations for the probability of the CG state.

5. Conclusions

A variational approach for time-dependent classical many-particle systems is presented and shown to imply the LE. This variational approach can be used to optimize trial functions generated, e.g., via intuitive arguments or perturbation theory. The key to the variational approach is an auxiliary function Ψ that depends on the 6N particle positions and momenta. The approach is based on a quadratic functional in Ψ and has a form similar to that of the quantum least-action principle. It is only when Ψ is complex-valued that the variational principle yields non-trivial results, i.e., enables the derivation of approximations for nonequilibrium many-particle systems.

There are many functionals whose extrema satisfy the LE. That presented here is expressed in terms of Ψ and its complex conjugate. It has a number of interesting properties. First, it yields known results such as the Vlasov and Smoluchowski equations when appropriate trial functions are used. The question remains as to whether the form (12) is, in some sense, more fundamental than others, or does it simply have technical advantages in deriving approximate solutions to the LE? Its analogy with the quantum least-action principle does not necessarily indicate that it is fundamental. However, both the LE and the Schrödinger equation are linear so that there may be merit in casting variational approaches in terms of bilinear functionals. To accomplish this, we introduce Ψ for which |Ψ|2 is the N -particle position-momentum probability density ρ, further continuing the analogy with the quantum least-action principle. The multiscale character of many-particle systems suggests that ρ, and hence the auxiliary function Ψ, has multiscale structure. It was shown in Sections 3 and 4 that the multiscale perturbation method facilitates the development of trial functions to be optimized by the variational principle. A method to facilitate the use of multiscale trial functions was presented.

Results show that the variational principle can lead to an equation for the reduced probability of CG variables that is not conservative, unless restrictions are placed on the initial statistical state of the system. One of the origins of this non-conservation is the existence of additional CG variables not included in the model, but which couple to those which are included. The question of probability conservation is shown to be related to the initial statistical state of the system. For example, if the system was initialized with a shock wave, then inertial behavior would be important and, therefore a non-inertial Smoluchowski dynamics would be inappropriate). A specific example of these restrictions is given which leads to the conserving Smoluchowski equation for a set of CG variables.

Introduction of the specific variational functional, integration with multiscale perturbation theory, and mean-field or other intuitive arguments make the present variational approach of great potential value for deriving approximate solutions to nonequilibrium problems in many-particle classical systems.

If Ψ is an exact solution to the LE, then the variational functional is zero. Thus, the value of the variational functional provides an assessment of the quality of an approximation based on an optimized trial function. This measure can be used via a sequence of trial functions to determine if there is a sense of convergence, e.g., for trial functions based on multiscale perturbation theory of various orders in the perturbation parameter.

The trial functions proposed here are based on multiscale perturbation theory. Such trial functions lead naturally to a mean-field argument at the CG, and not at the single-particle, scale. Coarse-grained mean-field approximations account for realistic damping, and not that of Landau [46]. This is seen for the calculations of Section 3 via the appearance of the diffusion factors in the expression for Ψ1 (27) provided in (45) (Section 4.5).

As an example, consider a mean-field approximation for the coarse-grained factor ϒ of Sections 3 and 4. In particular, take the CG variables Φ to be the scaled positions of each atom in a liquid argon droplet. This scaled position variable is defined so that it undergoes a unit displacement as the atom traverses the million-atom droplet. Thus, the dependence of ϒ on scaled atom position describes the long space-time dynamics of the droplet. On these long scales, each particle has ample time, and traverses sufficiently long distances, to interact with many others. This suggests that, to good approximation, ϒ can be written as a product of factors, one for each atom. Unlike for the N-particle density directly, it is arguable that each particle interacts with an average environment, the “mean field”. Thus, based on this mean-field coarse-grained picture one may use the variational approach to derive an equation for the density of a viscous many-atom droplet. It is concluded here that the variational approach with multiscale trial functions holds great promise for arriving at novel equations describing the kinetics of nonequilibrium condensed classical systems.

Highlights.

  • A novel variational method for non-equilibrium classical systems is proposed.

  • A functional is introduced whose extrema satisfy the Liouville equation.

  • Trial functions are developed using multiscale methods.

  • A general method for deriving coarse-grained kinetic equations is developed.

  • Conditions on initial statistical state rendering Smoluchowski behavior are found.

Acknowledgments

This project was supported in part by the National Science Foundation (Computational and Theoretical Chemistry Program and Collaborative Research in Chemistry Program), National Institutes of Health (NIBIB), the METACyt project, and Indiana University College of Arts and Sciences through the Center for Cell and Virus Theory.

Appendix

Consider the effect of complex-valuedness of Ψ in the polar representation, i.e., Ψ = Rexp(iΘ) for Γ, t -dependent factors R and Θ. Using the same arguments as in Section 2.2, one obtains

C(R,Θ)=idtdΓeηt2Rexp(iΘ)ΩRexp(iΘ)=dtdΓeηt2R2ΩΘ. (A.1)

Thus, δC / δR = 0 leads to the LE for the phase.

Using integration by parts and the fact that R vanishes at the boundaries of Γ space, (A.1) can be rewritten in the form

C(R,Θ)=dtdΓeηt2[t(R2Θ)ΘΩR2]. (A.2)

The first term in (A.2) does not contribute as can be seen after its integration over t by parts. Thus, δC / δΘ = 0 leads to the LE for R.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Berdichevsky VL. Variational Principles of Continuum Mechanics. Springer; 2009. [Google Scholar]
  • 2.Jaynes ET. Information Theory and Statistical Mechanics. Physical Review. 1957;106:620, 630. [Google Scholar]
  • 3.Raab A. On the Dirac-Frenkel/McLachlan variational principle. Chemical Physics Letters. 2000;319:674–678. [Google Scholar]
  • 4.Deumens E, Öhrn Y. Complete Electron Nuclear Dynamics. J. Phys. Chem. A. 2001;105:2660–2667. doi: 10.1021/jp500532d. [DOI] [PubMed] [Google Scholar]
  • 5.Feynman RP. Space-Time Approach to Non-Relativistic Quantum Mechanics. Reviews of Modern Physics. 1948;20:367–387. [Google Scholar]
  • 6.Nicolis G. Equality governing nonequilibrium fluctuations and its information theory and thermodynamic interpretations. Physical Review E. 2009;79 doi: 10.1103/PhysRevE.79.011106. 011106. [DOI] [PubMed] [Google Scholar]
  • 7.Martyushev LM, Seleznev VD. Maximum entropy production principle in physics, chemistry and biology. Physics Reports. 2006;426:1–45. [Google Scholar]
  • 8.Kleidon A. Life, hierarchy, and the thermodynamic machinery of planet Earth. Physics of Life Reviews. 2010;7:424–460. doi: 10.1016/j.plrev.2010.10.002. [DOI] [PubMed] [Google Scholar]
  • 9.Nicolis C, Nicolis G. Stability, complexity and the maximum dissipation conjecture. Quarterly Journal of the Royal Meteorological Society. 2010;136:1161–1169. [Google Scholar]
  • 10.Onsager L, Machlup S. Fluctuations and Irreversible Processes. Physical Review. 1953;91:1505–1512. [Google Scholar]
  • 11.Adib AB. Stochastic Actions for Diffusive Dynamics: Reweighting, Sampling, and Minimization. The Journal of Physical Chemistry B. 2007;112:5910–5916. doi: 10.1021/jp0751458. [DOI] [PubMed] [Google Scholar]
  • 12.Gross EP. Approximate solutions of the Liouville equation. II. Stationary variational principles. Journal of Statistical Physics. 1973;9:275–295. [Google Scholar]
  • 13.Gross EP. Approximate solutions of the Liouville equation I. A truncation scheme for distribution functions. Annals of Physics. 1972;69:42–55. [Google Scholar]
  • 14.Martin PC, Siggia ED, Rose HA. Statistical Dynamics of Classical Systems. Physical Review A. 1973;8:423–437. [Google Scholar]
  • 15.Jensen RV. Functional integral approach to classical statistical dynamics. Journal of Statistical Physics. 1981;25:183–210. [Google Scholar]
  • 16.Andersen HC. Functional and graphical methods for classical statistical dynamics. I. A formulation of the Martin-Siggia-Rose method. Journal of Mathematical Physics. 2000;41:1979–2020. [Google Scholar]
  • 17.Kevorkian J, Cole JD. Multiple scale and singular perturbation methods. Springer; 1996. [Google Scholar]
  • 18.Fox C. An introduction to the calculus of variations. London: Oxford University Press; 1950. [Google Scholar]
  • 19.Deutch JM, Oppenheim I. The Concept of Brownian Motion in Modern Statistical Mechanics. Faraday Discuss. Chem. Soc, London. 1987:1–20. [Google Scholar]
  • 20.Singharoy A, Cheluvaraja S, Ortoleva PJ. Order parameters for macromolecules: Application to multiscale simulation. J. Chem. Phys. 2011;134 doi: 10.1063/1.3524532. 044104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Shea JE, Oppenheim I. Fokker-Planck equation and Langevin equation for one Brownian particle in a nonequilibrium bath. Journal of Physical Chemistry. 1996;100:19035–19042. [Google Scholar]
  • 22.Shea JE, Oppenheim I. Fokker–Planck and non-linear hydrodynamic equations of an inelastic system of several Brownian particles in a non-equilibrium bath. Physica A. 1998;250:265–294. [Google Scholar]
  • 23.Peters MH. Fokker-Planck equation and the grand molecular friction tensor for coupled rotational and translational motions of structured Brownian particles near structured surfaces. Journal of Chemical Physics. 1999;110:528–538. [Google Scholar]
  • 24.Cukier RI, Deutch JM. Spin Relaxation: The Multiple-Time-Scale Point of View. J. Chem. Phys. 1969;50:36–41. [Google Scholar]
  • 25.Shea JE, Oppenheim I. Fokker-Planck equation and non-linear hydrodynamic equations of a system of several Brownian particles in a non-equilibrium bath. Physica A. 1997;247:417–443. [Google Scholar]
  • 26.Joshi H, Singharoy AB, Sereda YV, Cheluvaraja SC, Ortoleva PJ. Multiscale simulation of microbe structure and dynamics. Prog. Biophys. Mol. Biol. 2011;107:200–217. doi: 10.1016/j.pbiomolbio.2011.07.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Miao Y, Ortoleva PJ. Molecular Dynamics/Order Parameter eXtrapolation (MD/OPX) for Bionanosystem Simulations. J. Comput. Chem. 2009;30:423–437. doi: 10.1002/jcc.21071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Miao Y, Johnson JE, Ortoleva PJ. All-Atom Multiscale Simulation of Cowpea Chlorotic Mottle Virus Capsid Swelling. J. Phys. Chem. B. 2010;114:11181–11195. doi: 10.1021/jp102314e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Joshi H, Cheluvaraja S, Somogyi E, Brown DR, Ortoleva PJ. A molecular dynamics study of loop fluctuation in human papillomavirus type 16 virus-like particles: a possible indicator of immunogenicity. Vaccine. 2011;29:9423–9430. doi: 10.1016/j.vaccine.2011.10.039. [DOI] [PubMed] [Google Scholar]
  • 30.Shreif Z, Pankavich S, Ortoleva PJ. Liquid-crystal transitions: A first-principles multiscale approach. Phys. Rev. E. 2009;80 doi: 10.1103/PhysRevE.80.031703. 031703. [DOI] [PubMed] [Google Scholar]
  • 31.Shreif Z, Ortoleva PJ. Multiscale Approach to Nanocapsule Design. Technical Proceedings of the 2008 NSTI Nanotechnology Conference and Trade Show. 2008;3:741–744. [Google Scholar]
  • 32.Cheluvaraja S, Ortoleva P. Thermal nanostructure: An Order Parameter/Multiscale Ensemble Approach. J. Chem. Phys. 2010;132 doi: 10.1063/1.3316793. 075102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Ortoleva PJ. Nanoparticle dynamics: A multiscale analysis of the Liouville equation. J. Phys. Chem. B. 2005;109:21258–21266. doi: 10.1021/jp051381b. [DOI] [PubMed] [Google Scholar]
  • 34.Shreif Z, Ortoleva P. Curvilinear All-Atom Multiscale (CAM) Theory of Macromolecular Dynamics. J. Stat. Phys. 2008;130:669–685. [Google Scholar]
  • 35.Shreif Z, Ortoleva P. Scaling Behavior of Quantum Nanosystems: Emergence of Quasiparticles, Collective Modes, and Mixed Exchange Symmetry States. J. Chem. Phys. 2011;134 doi: 10.1063/1.3560450. 104106. [DOI] [PubMed] [Google Scholar]
  • 36.Pankavich S, Ortoleva P. Multiscaling for systems with a broad continuum of characteristic lengths and times: Structural transitions in nanocomposites. J. Math. Phys. 2010;51 doi: 10.1063/1.3420578. 063303–063316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Dobbertin R. On Functional Relations between Reduced Distribution Functions and Entropy Production by Non-Hamiltonian Perturbations. Physica Scripta. 1976;14:85–88. [Google Scholar]
  • 38.Frenkel J. Wave Mechanics; Advanced General Theory. Oxford University Press; 1934. [Google Scholar]
  • 39.Vlasov AA. The vibrational properties of an electron gas. Soviet Physics Uspekhi. 1968;10:721–723. [Google Scholar]
  • 40.Singharoy A, Sereda YV, Ortoleva PJ. Hierarchical Order Parameters for Macromolecular Assembly Simulations. 1. Construction and Dynamical Properties of Order Parameters. J. Chem. Theor. Comput. 2012;8:1379–1392. doi: 10.1021/ct200574x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Pankavich S, Miao Y, Ortoleva J, Shreif Z, Ortoleva PJ. Stochastic dynamics of bionanosystems: Multiscale analysis and specialized ensembles. J. Chem. Phys. 2008;128 doi: 10.1063/1.2931572. 234908–234920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Pankavich S, Shreif Z, Miao Y, Ortoleva PJ. Self-assembly of nanocomponents into composite structures: Derivation and simulation of Langevin equations. J. Chem. Phys. 2009;130 doi: 10.1063/1.3134683. 194115–194124. [DOI] [PubMed] [Google Scholar]
  • 43.Shreif Z, Ortoleva PJ. Multiscale derivation of an augmented Smoluchowski equation. Physica A. 2009;388:593–600. [Google Scholar]
  • 44.Nayfeh AH, Balachandran B. Applied nonlinear dynamics: analytical, computational, and experimental methods. John Wiley and Sons; 1995. [Google Scholar]
  • 45.Pankavich S, Shreif Z, Ortoleva PJ. Multiscaling for Classical Nanosystems: Derivation of Smoluchowski and Fokker-Planck Equations. Physica A. 2008;387:4053–4069. [Google Scholar]
  • 46.Zwanzig R. Approximate Eigenfunctions of the Liouville Operator in Classical Many-Body Systems. Physical Review. 1966;144:170–177. [Google Scholar]

RESOURCES