Abstract
We present a rigorous thermodynamic treatment of irreversible binary aggregation. We construct the Smoluchowski ensemble as the set of discrete finite distributions that are reached in fixed number of merging events and define a probability measure on this ensemble, such that the mean distribution in the mean-field approximation is governed by the Smoluchowski equation. In the scaling limit this ensemble gives rise to a set of relationships identical to those of familiar statistical thermodynamics. The central element of the thermodynamic treatment is the selection functional, a functional of feasible distributions that connects the probability of distribution to the details of the aggregation model. We obtain scaling expressions for general kernels and closed-form results for the special case of the constant, sum and product kernel. We study the stability of the most probable distribution, provide criteria for the sol-gel transition and obtain the distribution in the post-gel region by simple thermodynamic arguments.
Keywords: statistical thermodynamics, irreversible aggregation, Smoluchowski equation, gelation, phase transitions
1. Introduction
Aggregation is the process of forming structures through the merging of clusters. This generic process is encountered in a large variety of systems, from polymerization and colloidal aggregation to the clustering of social groups and the merging of galaxies. The mathematical foundations of aggregation were set by Smoluchowski [1], whose particular interest was in Brownian coagulation. The aggregation equation, more commonly known as Smoluchowski equation, is a rate equation on a distribution of clusters whose size (mass) changes by binary aggregation events. For a discrete population of clusters with integer masses in multiples of a unit mass (“monomer”) it takes the form [1],
| (1) |
where is the number concentration of clusters with mass k and the aggregation kernel, a rate constant for the merging of masses i and j. A large body of literature has focused on the theory of the Smoluchowski equation, the existence of analytic solution and the scaling limit [2]. Of particular interest is gelling, a condition that arises under the product kernel ; it refers to the formation of a giant structure, as in polymer gels, and is manifested by the failure of the Smoluchowski equation to conserve mass. This process is commonly described as a phase transition, suggesting the possibility that statistical thermodynamics, a theory developed for equilibrium states of interacting particles, may perhaps be applicable in this clearly irreversible process.
Studies of Smoluchowski aggregation broadly fall in one of two categories, kinetic and stochastic. The kinetic approach is based on Equation (1) and its solution. Stable solutions conserve mass; gelling is identified as the point where mass conservation breaks down [3,4]. Post-gel solutions require additional assumptions as to how the gel and the dispersed phase interact [5]. The stochastic approach views clusters as entities that merge with probabilities proportional to the aggregation kernel. It was first formulated by Marcus [6] for a discrete finite population, and its formal mathematical treatment was developed by Lushnikov, who obtained solutions for certain special cases, including gelation [7,8,9,10,11]. In Lushnikov’s method all feasible distributions are given a probability, whose evolution in time is tracked via a generating functional. The approach is explicitly probabilistic and views the Smoluchowski equation as the mean-field approximation of the underlying stochastic process [12]. A different approach within the probabilistic realm makes use of combinatorial methods. This treatment originated with Stockmayer [13] and was further explored by Spouge [14,15,16,17]. The combinatorial approach considers the number of ways to build a particular distribution of clusters and assigns probabilities in proportion to that combinatorial weight. The ensemble of distributions is then reduced to the most probable distribution, which is identified by maximizing the combinatorial weight. This approach has two appealing advantages. It deals with a time-free ensemble in which time appears implicitly via the mean cluster mass. More importantly, it brings the problem closer to the viewpoint of statistical mechanics and the notion that an ensemble may be represented in the scaling limit by its most probable element. Stockmayer recognized this connection and his treatment of gelation is replete with references to the theory of phase transitions [13]. The analogy between aggregation and thermodynamics was not formalized, however. Stockmayer obtained the gel point by mathematical, not thermodynamic methods, and arrived at a post-gel solution that is not consistent with the kinetics of aggregation [5].
We have previously shown that gelation can be indeed treated as a formal phase transition and have presented solutions for the product kernel in the pre- and post-gel regions [18] based on our earlier work on the cluster ensemble [19,20]. Here we generalize the methodology to formulate a rigorous thermodynamic theory of Smoluchowski aggregation. We begin with a finite population that starts from a well defined state and construct the set of all possible distributions that can be reached in a fixed number of elementary transitions. The probability of distribution in this ensemble is governed by the kinetics of the elementary processes that act on the population. In the thermodynamic limit the most probable distribution is overwhelmingly more probable than all others and is governed by a set of mathematical relationships that we recognize as thermodynamics. The work is organized as follows. In Section 2 we define the Smoluchowski ensemble of distributions and their probabilities. In Section 3 we formulate the probability of distribution in terms of a special functional W that introduces the partition function and the Shannon entropy of distribution. In Section 4 we treat the scaling limit and derive the thermodynamic relationships of the Smoluchowski ensemble. In Section 5 we obtain solutions of the Gibbs form for the classical kernels, constant, sum and product. We analyze the stability and phase behavior of the ensemble in Section 6 and treat the sol-gel process as a phase transition. In Section 7 we express the results in the continuous domain and finally offer concluding remarks in Section 8.
2. The Smoluchowski Ensemble
We consider a population of clusters composed of units (monomers). In binary aggregation two clusters merge to form a new cluster that conserves mass, via the schematic reaction
| (2) |
The merging of a pair constitutes an elementary stochastic event, whose probability depends on the aggregation kernel . At the initial state the population consists of single members (monomers). This distribution constitutes generation . The next generation is constructed by implementing every possible aggregation event in the distribution of generation . The set of distributions formed in this manner constitutes the microcanonical ensemble of generation . We continue recursively to form the ensemble of distributions in generation g by implementing all possible aggregation events, one at a time, in all distributions of the parent ensemble. We represent a distribution of clusters by the vector , where is the number of clusters with i members. All distributions in generation g satisfy the conditions
| (3) |
The first condition expresses the fact each elementary event decreases the number of clusters by 1, according to the stoichiometry of binary merging; the second condition expresses the fact that the number of members is conserved. Conversely, any distribution that satisfies the conditions in Equation (3) is a member of the ensemble of generation g because it can be formed in g steps from M monomers. We view the two equations in Equation (3) as the constraints that define the ensemble of feasible distributions. We call this ensemble microcanonical to indicate that it is conditioned by two extensive constraints that fix the mean cluster mass in all distributions of the ensemble.
The evolution of the ensemble may be represented in the form of a layered graph (Figure 1), whose vertices represent distributions and edges represent elementary transitions according to Equation (2). Edges are directed from parent in generation to offspring in generation g. Layers are organized by generation and contain all distributions in a generation. The graph begins in generation with a distribution of all monomers and ends when all units have joined the same cluster. Stochastic aggregation is a random walk on this graph. A trajectory is a possible sequence of connected edges from top to bottom. Our goal is to establish the probability of distribution in generation , in terms the aggregation kernel for any M.
Figure 1.
The aggregation graph for . Each layer contains all feasible distributions in that generation.
2.1. Kinetics
When cluster masses and j, in distribution of generation , merge to form a cluster of mass i, the parent distribution is transformed to offspring distribution via the transition
| (4) |
This transition is represented by an edge in the graph of Figure 1. Its rate is proportional to the number of ways to choose the reactants and the proportionality factor is the aggregation kernel:
| (5) |
The total rate by which parent produces offspring is
| (6) |
where is the number of clusters and is the mean kernel in parent distribution :
| (7) |
In physical terms the aggregation kernel is the rate constant for the reaction between masses i and j. Its mathematical form may be constructed on the basis of a kinetic model for the particular problem. It is beyond the scope of this work to review the numerous kernels that have been proposed in the literature. We mention a selected few that are important for their physical, mathematical and historical significance, and summarize them in Table 1.
Table 1.
Selected aggregation kernels.
| Brownian coagulation | |
| Constant kernel | |
| Flory/Stockmayer kernel | |
| Product kernel | |
| Sum kernel |
The Brownian coagulation kernel was derived by Smoluchowski [1] to describe the kinetics of diffusion limited aggregation in colloidal systems. The constant kernel was adopted by Smoluchowski [1] as an approximation for the Brownian kernel, a simplification that allows analytic results. This kernel is obtained by setting in the Brownian kernel. The Flory/Stockmayer kernel [13,21] is a model for polymerization of chains composed of monomers with f functional groups. Assuming no cycles, a polymer with i monomers contains unreacted functional groups that are available to react. The Flory/Stockmayer kernel is the product of the unreacted functional groups in the two chains that merge. This kernel leads to gelation [13]. The product kernel is the limiting form of the Flory/Stockmayer kernel when the number of functional groups approaches infinity. It also leads to gelation, and being a simpler kernel than the Flory/Stockmayer, it serves as the standard model to study gelation. The sum kernel is proportional to the number of units in each cluster. This kernel may be viewed as the limiting form of a Flory/Stockmayer type kernel with two kinds functional groups [15], but its significance is primarily mathematical as one of a handful of kernels that lead to analytic solutions.
We discuss the constant, sum and product kernel in detail in Section 5. For now we leave the kernel general and unspecified. We only place the minimum conditions, , which are required from elementary physical considerations; additionally, we adopt the normalization .
2.2. Probabilities
We assign a probability to each distribution within generation g and formulate its propagation in proportion to the probability of the parent and the rate of the parent-offspring transition:
| (8) |
Here is a distribution in generation g, is its parent of in generation via the reaction , is the rate of the reaction, and is the mean reaction rate in parent generation :
| (9) |
In both Equations (8) and (9) the summations are over all distributions in generation . Expressing the transition rate in terms of the aggregation kernel we obtain
| (10) |
with
| (11) |
We may confirm that as defined in Equation (8) satisfies normalization over all distributions in the same generation. Beginning with at the initial state, Equation (8) uniquely determines the probabilities of all distributions in all future generations.
2.3. Smoluchowski Equation
The mean number of clusters with mass k in generation g is
| (12) |
with the summation going over all distributions in the same generation. We will derive the evolution of the mean distribution from parent generation to generation g. The probability of distribution is given by Equation (8) and is expressed as a summation over its parents . By the stoichiometry of the transition in Equation (2), the parent and offspring distributions satisfy
| (13) |
Combining this relationship with (12) and (8) the result is (see Supplementary Material)
| (14) |
The left-hand side is the change in the mean number of k-mers between generations; the right-hand side is the ensemble average of the production and depletion of k-mers within all distributions of the parent ensemble. Define the mean time increment from parent to offspring generation as
| (15) |
then Equation (14) reads
| (16) |
In the mean-field approximation we reduce the ensemble into a single distribution, . This resolves the ensemble averages trivially and leads to the governing equation for :
| (17) |
This is the Smoluchowski equation for binary aggregation, the discrete finite equivalent of Equation (1). The mean field approximation, which is invoked to obtain (17), implies that a single distribution is representative of the entire ensemble. In Section 4 and Section 6 we examine the conditions under which this is true.
3. Thermodynamic Formalism
3.1. Partition Function and Selection Functional
We now formulate the probability of distribution in terms of a special functional, . It is through this formulation that we will make contact with statistical thermodynamics. We begin by writing the probability in generation g in the form
| (18) |
where is the multinomial coefficient of vector ,
| (19) |
is the number of clusters in all distributions of generation g, is a functional of distribution , to be determined, and is the partition function. By the normalization condition on the partition function satisfies
| (20) |
with the summation over all distributions in generation .
3.2. Shannon Entropy
The multinomial coefficient represents the combinatorial multiplicity of distribution , namely, the number of ways to order the clusters in the distribution, if clusters with the same number of units are treated as indistinguishable. In the Stirling approximation, , the log of the multinomial coefficient is
| (21) |
It is a concave functional of with functional derivatives
| (22) |
It is also homogeneous in with degree 1 and satisfies the Euler condition
| (23) |
Setting and applying H to vector we obtain
| (24) |
In this form H reverts to the familiar entropy functional, historically associated with Boltzmann, Gibbs and Shannon. We will call it Shannon functional and avoid the generic term “entropy,” whose meaning across disciplines varies. For our purposes the Shannon functional is defined as
| (25) |
and may be applied to any vector with non-negative elements regardless of normalization.
3.3. The Selection Functional
Functional biases the statistical weight of distribution relative to its combinatorial multiplicity. We call it selection functional because it effectively selects distributions relative to each other. The functional derivative of is
| (26) |
and defines the cluster function , a property cluster mass i in distribution . The cluster function depends not only on i but also on the distribution on which this factor is evaluated. In the special case that is linear functional of the functional derivative is a function of i alone and is the same in all distributions. This special condition is associated with Gibbs distributions, which are discussed in Section 5.
If for all distributions, then the probability of distribution is proportional to its combinatorial multiplicity . If this special condition is met we will call the ensemble unbiased. The partition function of the unbiased ensemble can be easily determined by a combinatorial argument: it is equal to number of ways to assign M objects into N groups and is given by [22]
| (27) |
Accordingly, the probability of distribution in this special case is
| (28) |
In a population undergoing transformations, for example aggregation, fragmentation and so forth, the selection functional is determined by the kinetic details of the mechanisms that produce these transformations; in the case of aggregation it is determined by the aggregation kernel . The question arises whether the unbiased ensemble is a possible solution of the Smoluchowski ensemble under some kernel. The answer is yes, and is given in Section 5.
3.4. Propagation Equations
At the initial state all clusters are monomers and the distribution is . We set and since we also have . We insert Equation (18) into Equation (8) and express the summation over parents of as a summation over all pairs that produce mass i in distribution . The result is (see Supplementary Material)
| (29) |
Here N is the number of clusters in distribution of generation , is the parent distribution via the transition and is the mean kernel in the parent generation . The left-hand side of Equation (29) depends solely on M and N whereas the second term on the right-hand side contains functionals of distribution . This term must be the same for all distributions in the same generation in order to produce a result that is a function of M and N alone. From Equation (18) it is clear that W and may be defined within a proportionality constant ; as long as this constant is common for all distributions in a generation it has no effect on probabilities and may be chosen arbitrarily. We choose it to satisfy the following criterion: if for all distributions, we require this constant to be 1. The choice that satisfies this condition is to set the double summation in Equation (29) to 1. Equation (29) now splits into two separate recursions, one for the partition function,
| (30) |
and one for the selection functional,
| (31) |
The recursion for the partition function is readily solved to produce the partition function in generation :
| (32) |
Accordingly, the partition function is equal to the unbiased partition function times the product of all mean kernels from generation 0 up to the parent generation . We write the recursion for the selection functional in the form
| (33) |
The result gives the selection functional of the offspring as a linear combination of selection functionals of all its parents. In principle this can be solved recursively for any distribution in any generation. For certain special cases the recursion can be solved in closed form. These are discussed in Section 5.
4. Scaling Limit
4.1. Most Probable Distribution
We define the scaling limit by the condition at fixed . The expectation is that in this limit the intensive mean distribution must converge to a limiting distribution that is independent of M and N and depends only on :
| (34) |
We further anticipate that the probability of distribution becomes infinitely sharp around a single distribution, , such that is not merely the most probable distribution, it is overwhelmingly more probable than any other distribution in the ensemble. This further implies that the mean distribution and most probable distribution converge to each other:
| (35) |
This convergence is an implicit requirement for the validity of the Smoluchowski equation: the mean-field approximation is exact if a single distribution is representative of the entire ensemble. This is possible only if peaks very sharply about the most probable distribution. When a single term dominates the summation that defines the partition function in Equation (20), the log of the sum converges to the log of the maximum term,
| (36) |
with . As a further consequence of the intensive convergence in (34) we have the Euler relationship for :
| (37) |
where is the functional derivative of ,
| (38) |
Equation (37) expresses the fact that is homogeneous functional of the MPD. This condition follows from Equation (36) and the homogeneity properties of and .
The most probable distribution (MPD) maximizes the probability in Equation (18) among all distributions that satisfy the constraints in Equation (3). By Lagrange maximization we obtain the MPD in the form
| (39) |
and q and are parameters related to the Lagrange multipliers. We insert the MPD into Equation (36) to obtain
| (40) |
This fundamental equation relates the partition function to the primary variables of the ensemble: the macroscopic variables that define the ensemble, and the Lagrange multipliers that appear in the MPD. The convergence of to intensive limit implies that and q are intensive, that is, they are functions of but not of M or N individually. This further implies that Equation (40) is homogeneous function of M and N with degree 1 and thus must satisfy Euler’s theorem:
| (41) |
Direct comparison with Equation (40) leads to:
| (42) |
| (43) |
Thus the Lagrange multipliers that appear in the MPD are the partial derivatives of the partition function. Differentiation of Equation (40) with respect to all variables that appear on the right-hand side gives
| (44) |
This is the Gibbs-Duhem equation associated with the Euler equation for in Equation (40). It may be written as
| (45) |
In this form its expresses the relationship between , q and .
The MPD maximizes the log of the microcanonical weight, and its maximum is . Therefore we have the inequality:
| (46) |
It is satisfied by all distributions in the ensemble with the equal sign only for . This is the fundamental variational principle of the ensemble: it defines the MPD and generates all relationships of this section.
4.2. Thermodynamics
We recognize the equations of the previous section as those of familiar statistical thermodynamics. Equation (39) is the generalized canonical distribution, a member of the exponential family, whose parameters and q are related to the microcanonical partition function via Equations (40), (42) and (43). We define the extensive form of the canonical partition function via the Legendre transformation of :
| (47) |
and thus we recognize as the intensive form of the canonical partition function.
The variational condition that produces the set of thermodynamic relationships is the inequality in Equation (46), which defines the MPD as the distribution that maximizes the microcanonical weight. Expressing and in terms of the Euler relationships (21) and (37), respectively, this inequality takes the form
| (48) |
where . The inequality is satisfied by all distributions with mean and the equality applies only to . With it reduces the second law: the log of the microcanonical partition function is equal to the Shannon entropy of the most probable distribution, and this is larger than the entropy of any other distribution with the same mean.
Table 2 summarizes these relationships. They are consequences of the maximization of the probability in Equation (18) and are independent of the details of aggregation. These details enter only through Equations (32) and (33), which express the partition function and the selection functional in terms of the aggregation kernel.
Table 2.
Summary of thermodynamic relationships.
5. Gibbs Distributions
A special type of functional is of the form
| (49) |
whose log is linear in
| (50) |
with functional derivative . Here is a function of i alone and does not depend on . If the selection functional is given by Equation (49) the probability of distribution in Equation (18) takes the form
| (51) |
Probability distributions of this type are called Gibbs distributions [23] and are frequently encountered in stochastic processes [24]. Several important results can be obtained in analytic form. In particular, the mean distribution is [22]:
| (52) |
The result is exact for all , .
We apply this selection functional of Equation (49) to the transition that converts parent distribution to offspring . By the stoichiometry of the transition we have
| (53) |
Inserting into Equation (31) we obtain
| (54) |
One possible solution that satisfies this equation for all distributions is
| (55) |
This is not the only possible solution for W in Equation (31) and may or may not be acceptable; if it is, we have obtained a Gibbs distribution and the kernel is a Gibbs kernel.
We have identified three kernels for which Equation (55) is the correct solution. These are the constant kernel,
| (56) |
the sum kernel
| (57) |
and their linear combinations. The product kernel is a quasi-Gibbs kernel and is discussed in Section 5.3.
Here we provide detailed solutions for the constant and sum kernels. We will not discuss the linear combination in part because the results are more involved but mainly because this kernel reverts to the sum kernel when cluster masses are large thus it does not contribute to our understanding of aggregation beyond what we learn by studying the constant and sum kernels separately.
5.1. Constant Kernel
With Equation (55) gives for all i. Accordingly, the ensemble is unbiased and its partition function is given by Equation (27):
| (58) |
The mean distribution follows from Equation (52) and is given by
| (59) |
To obtain the most probable distribution we calculate the parameters and q from Equations (42) and (43) along with (58). The differentiations may be done by first replacing the factorials in the partition function with the Stirling expression. Alternatively we may obtain these parameters by the discrete difference form of these derivatives and apply the asymptotic conditions . The latter method is simpler:
| (60) |
| (61) |
We obtain the MPD from (39) with :
| (62) |
For large this goes over to the exponential distribution
| (63) |
which is the well known result for the constant kernel. Here x stands for the continuous cluster mass.
5.2. Sum Kernel
The ensemble average of the sum kernel is
| (64) |
We obtain the partition function from Equation (32). The result is
| (65) |
The factors that satisfy Equation (55) are
| (66) |
and the mean distribution follows from (52),
| (67) |
This is an exact result for all , . The parameters and q are obtained similarly to those for the constant kernel:
| (68) |
| (69) |
Combining with Equation (39) we obtain the MPD in the form
| (70) |
with . We use the Stirling formula for the factorial the MPD in the continuous limit takes the form
| (71) |
Figure 2 shows the MPD for and the mean distribution from Equation (67) at fixed for various values of M and N. In the scaling limit the mean distribution converges to the MPD.
Figure 2.
Approach to scaling limit for the sum kernel at . The most probable distribution (MPD) is calculated from Equation (70) and the mean distribution from Equation (67) with , .
5.3. Quasi-Gibbs Kernels—The Product Kernel
We are able to obtain closed-form expressions for the partition function of the constant and sum kernels and heir linear combinations because they all satisfy the condition
| (72) |
for all . This states that the mean kernel is the same in all distributions of the ensemble, therefore also equal to the ensemble average kernel. In this case the calculation of the ensemble average kernel is trivial, as it does not require knowledge of the probabilities . The constant kernel, sum kernel and their linear combinations are the only kernels that satisfy (72) in the strictest sense, that is, for all that satisfy the two constraints in (3). We refer to Equation (72) as the Gibbs condition because it generates Gibbs distributions. We may relax the requirement that all distributions obey the Gibbs condition with the milder requirement that it be obeyed by most distributions. This is the case of the produce kernel. The product kernel is defined
| (73) |
and its mean within distribution is
| (74) |
Here and are the normalized first moment and second moments of , respectively. In the limit , , this scales as
| (75) |
in most distributions except those that contain clusters of the order M. (The largest cluster size in the ensemble is and for it is of the order M.) According to Equation (75) the product kernel is a quasi-Gibbs kernel: it satisfies the Gibbs condition in Equation (72) asymptotically in most but not all feasible distributions. We proceed to obtain the Gibbs distribution of the product kernel and test its validity.
Inserting (75) into (32) we obtain the partition function:
| (76) |
We complete the solution by evaluating from Equation (55),
| (77) |
The mean distribution is obtained by inserting these results into Equation (52):
| (78) |
Unlike Equations (62) and (67) this result is not exact. This can be demonstrated numerically by the fact this distribution is not normalized to unity and its mean is not for finite M, N; its approaches proper normalization in the asymptotic limit. This failure arises from the fact that Equation (52) requires a Gibbs probability distribution that strictly applies to all distributions of the ensemble.
We obtain and from Equations (42) and (43):
| (79) |
| (80) |
Using the MPD is
| (81) |
and in the continuous limit
| (82) |
These results are summarized in Table 3 along with those for the constant and sum kernels.
Table 3.
Summary of Constant, Sum and Product Kernel; in all cases .
| Constant Kernel | Sum Kernel | Product Kernel | |
|---|---|---|---|
| 1 | |||
| q | |||
| 1 | |||
| MPD |
Valid only for .
The relationship between the mean and the most probable distribution of the product kernel is shown in Figure 3 for two values of the mean cluster, and . At the mean distribution calculated from Equation (78) is not exact but its moments asymptotically approach the correct values as M and N are increased at fixed . At the behavior is different. A peak develops at the long tail of the distribution. It is pushed to ever larger sizes but never vanishes. In this region the mean distribution from Equation (78) is not correct: its mean does not converge to when M and N are increased, but to a value smaller than , which implies that mass conservation is not satisfied. This breakdown is manifestation of gelation, the emergence of an infinite cluster that is not captured by the mean field theory. The precise nature of the gel phase is discussed in the next section.
Figure 3.
Approach to the scaling limit for the product kernel with (a) and (b) . The MPD is calculated from Equation (71) and the mean distribution (dashed lines) from Equation (78). The distributions for are not stable.
6. Phase Behavior
6.1. Stability
The fundamental inequality of the ensemble is Equation (46) that defines the most probable distribution. This condition implies that the microcanonical functional is concave and this in turn implies that is a concave function of M and N and requires (see Supplementary Material)
| (83) |
These equivalent conditions guarantee the existence of the MPD in the form of Equation (39). In thermodynamic language they ensure that the MPD represents a stable state. The parameters and q of the constant, sum and product kernel are plotted in Figure 4a,b, respectively, as a function of the progress variable . According to Equation (83) stability requires to be decreasing function of and q increasing function of . The constant kernel is stable at all : decreases and increases monotonically over the entire range of . The sum kernel is also stable at all but reaches the limit of stability at or . This kernel is borderline-stable: it is stable for all finite times and reaches instability at . The product kernel is stable up to beyond which point both and violate the stability criteria.
Figure 4.
Phase diagram of power-law kernels: In the shaded region the system is stable and is represented by its MPD. The unshaded region is unstable and the system is split into two phases, a sol phase and a gel phase, each represented by its own MPD. (a,b) provide equivalent criteria of stability.
To survey the stability landscape of aggregation we employ the power-law kernel,
| (84) |
with arbitrary exponent . This is a homogeneous kernel with degree . It reverts to the product kernel with and to the constant kernel with . We treat this as a quasi-Gibbs kernel by analogy to the product kernel. We take the ensemble average power-law kernel to scale as
| (85) |
and obtain the parameters and q as
| (86) |
With and these revert, as expected, to the results for the constant and product kernels, respectively. Interestingly, with we obtain the parameters for the sum kernel. This behavior turns the power-law kernel into a useful tool, a homogeneous kernel that reproduces the correct values of the constant, sum and product kernels, and which may be used to interpolate (and cautiously extrapolate) to other homogeneous kernels by varying the exponent .
The stability limit in power-law aggregation is reached at
| (87) |
Accordingly, the MPD is stable in and unstable in . The phase diagram is shown in Figure 4a,b with the stable region indicated by the shaded area. For the system is stable at all from 0 to 1. For the limit of stability appears at , which is reached in infinite time. In practice the system is stable at all finite times. For the stability limit is reached within finite time at the point where the mean size reaches the critical value
| (88) |
For (product kernel) the limit of stability is reached at . We see from Figure 4 that both and q reach the limit of stability simultaneously.
6.2. Phase Splitting—The Sol-Gel Transition
When the system crosses into the unstable region its state is no longer represented by the MPD but by a mixture of two phases, each with its own MPD. What are these phases? To answer this question we first observe that the elements of the ensemble are fundamentally discrete distributions; the apparent continuity in the scaling limit is a mathematical artifact, a great convenience, but not a fundamental quality of the ensemble. To understand therefore the nature of the gel phase we must consider the discrete finite system. Given a distribution of M particles partitioned into N clusters, the maximum cluster mass possible is and is found in a single distribution of the ensemble, in which one cluster contains units and the remaining clusters contain one unit mass each. The region is special: it is either empty, or it contains a single cluster. It cannot contain more than one cluster because there is not enough mass to have two clusters that are both larger than . In the event that it does contain a cluster, its mass is of the order of , and in the asymptotic limit, of the order M. This means that the mass in the region is of the same order as that in . A cluster in represents a giant component, a single element of the population that carries a finite fraction of the total mass contained in the distribution.
The set of distributions that do not contain a giant cluster constitute the sol phase; sol distributions satisfy the scaling form of the mean kernel in Equation (85) and the Gibbs condition in Equation (72). Distributions that contain a cluster in the gel region violate the Gibbs condition and will be treated as a mixture of a sol phase and a gel phase (). Given an individual distribution , a certain fraction of mass is contained in the sol region with the rest in the gel region. The ensemble averages of these fractions define, respectively, the sol fraction, , and gel fraction, , in the ensemble:
| (89) |
with . If is such that in the scaling limit , the ensemble consists of a single phase, the sol, and is represented by the MPD in Equation (39). If the ensemble is represented by a mixture of the two phases. We will determine their distributions and construct the tie line between the two phases.
We suppose that the state at consists of a sol phase with , , and a gel phase with . The evolution of the sol phase is governed by Equation (30), which we now write as
| (90) |
This must be satisfied by the sol phase at all times. In the pre-gel region the state is a single phase, sol, with and parameters from Equation (86). In the post-gel region it is a mixture of two phases: a sol phase with mass and number of clusters ; and gel phase with mass found in a single cluster (). The sol phase is determined from Equation (90) with and its -q parameters are given by Equation (86) with . The mass of the gel phase is then obtained from the conservation conditions . These steps are summarized below.
Pre-Gel Region
The system consists of a sol phase and its MPD is
| (91) |
with
| (92) |
Post-Gel Region
The system consists of a sol phase with mass fraction and a gel phase with fraction .
- Obtain by solving
with from Equation (86) and .(93) - Obtain and from
(94) -
Obtain the gel fraction from mass balance:
(95) The mean size of the gel cluster is , where M is the total mass in the system. In the scaling limit the gel fraction is 1 and the size of the gel cluster is ∞.
The gel fraction and the mean cluster size for the product kernel are shown in Figure 5 as a function of . The gel fraction is zero up until the gel point () and increases according to Equation (95) once in the post-gel region. The mean cluster size increases in the pre-gel region but decreases in the post-gel region, as clusters in the sol are lost by reaction with the gel. At () all mass is found in the gel phase except for a single sol particle with unit mass. This is the infinite dilution limit of the sol phase, to borrow the terminology of solution thermodynamics.
Figure 5.
(a) Gel fraction and (b) mean sol cluster size as a function of the progress variable . Past the gel point the mean size in the sol retraces its pre-gel history back to its initial size . The dashed lines are Monte Carlo (MC) simulations with particles.
The evolution of past the gel point retraces its pre-gel history. This is a consequence of Equation (93), which resolves the sol phase in the two-phase region. The symmetry of about in the case of the product kernel produces a correspondingly symmetric evolution of , as shown in Figure 5b. The dashed lines are Monte Carlo simulations with particles and are shown for comparison (the simulations are discussed in the next section). The deviation from theory near the gel point is due to finite size effects. In these simulations a relatively small number of particles was used to permit the collection of a large number of realizations within reasonable computational time. These simulations are discussed next.
6.3. Monte Carlo Simulations
We demonstrate the theory with simulations performed by the constant-V Monte Carlo method [25]. The method tracks a sample of clusters that undergo binary aggregation with probability proportional to the transition rate given in Equation (5). At each step the simulation box contains a sample of N clusters, with N decreasing from M to 1 as clusters merge. A pair of clusters are chosen at random and is combined into a single cluster according the following criterion: draw a random number rnd in the interval and accept the merging of the clusters if
| (96) |
where is the aggregation kernel between the chosen clusters and is the maximum aggregation kernel in the simulation box. If the criterion is satisfied the event is accepted and the reactant particles are deleted and replaced by a cluster with their combined mass. If the event is rejected, a new pair is chosen and the process is repeated. The simulation begins with M monomers and continues until a single cluster is formed. This amounts to a random walk along the edges of the graph in Figure 1 that spans its entire range from to . A trajectory from the top to the bottom of the graph consists of a sequence of M sampled distributions, one from each generation. By averaging trajectories we obtain the mean distribution in each generation, which may then be compared to the mean distribution predicted by the theory.
Figure 6 shows the evolution of the mean distribution obtained by MC simulation with the product kernel using . Up until the gel point is reached the state is a single sol phase. It is characterized by a population of clusters whose tail decays fast enough that its moments are finite. Above the gel point a gel peak emerges. It becomes more pronounced and moves to larger sizes as aggregation progresses. Past the gel point the sol distribution contracts and retraces its steps back to the monomeric state as increases. For example, the sol distribution at is identical to that at except that it carries less mass. In the Smoluchowski literature this is known as the Flory solution to gelation [5]. A competing solution by Stockmayer [13] predicts that the intensive distribution of the sol phase remains constant past the gel point except for the fact that its mass gradually decreases as it is transferred to the gel. As it turns out, Stockmayer solution implicitly assumes that is strictly a Gibbs distribution. In this case the sol-gel tie line is obtained by equating the temperatures of the two phases and the sol distribution is indeed found to be constant throughout the post gel region [20]. An analysis of the Stockmayer solution is beyond the scope of this work but a commentary is given in Reference [22].
Figure 6.
Monte Carlo snapshots of the mean distribution of the product kernel with particles (open circles are MC results, solid lines are calculated from theory). The gel phase emerges at and moves towards ever larger sizes (arrows mark the theoretical predictions). The distribution of the sol grows in the pre-gel region range but contracts once past the post-gel point ().
7. Continuous Limit
We define the continuous limit by the conditions
Thus in addition to the scaling limit we require the mean cluster size to be much larger than the unit mass, such that the cluster mass may be treated as a continuous variable, which we denote as x. Equations (63), (71) and (82) refer to this limit. We present the corresponding expressions for the partition function and the selection functional.
In the continuous domain all intensive properties of the ensemble are functions of the mean cluster size . Thus we write , , , and express the partition function in intensive form . The MPD is
| (97) |
and satisfies the normalizations
| (98) |
The log of the cluster function is the functional derivative of the selection functional at the MPD:
| (99) |
and the notation indicates this function of x will generally depend on as well since the functional derivative of non linear functionals depend on the function on which the derivative is evaluated. Since the microcanonical probability peaks sharply about the MPD (we are assuming a stable single-phase state) all ensemble averages revert to averages over the continuous MPD. The ensemble average kernel is then equal to the mean kernel within the MPD
| (100) |
The log of the intensive partition function, , satisfies
| (101) |
with
| (102) |
These are the intensive forms of Equations (40) and (42), respectively. The partition function of aggregation is obtained from Equation (32) by expressing the summation over and an integral over :
| (103) |
The parameter is obtained from Equation (102) and from (101):
| (104) |
| (105) |
From Equation (54) we obtain
| (106) |
which expresses a condition on .
Equations (97), (103)–(106) provide an equivalent mathematical description of Smoluchowski aggregation in the continuous limit. These are accompanied by the variational condition
| (107) |
which is the continuous form of Equation (48) and is satisfied by all distributions with mean . The equality defines the solution to the Smoluchowski process, the MPD, .
As a demonstration we apply these results to the constant kernel. The right-hand side of Equation (106) is zero and we obtain at all times. It follows that . The parameters and q are
| (108) |
and the MPD becomes
| (109) |
This is the well-known solution of the constant kernel in the continuous domain. The partition function of the constant kernel is
| (110) |
and the inequality in Equation (107) becomes
| (111) |
whose right-hand side is the Shannon entropy of distribution . For fixed it is maximized by the exponential distribution, whose entropy is : the inequality is indeed satisfied.
8. Summary
With the results obtained here we have made contact with several previous works in the literature. The mean distribution for the constant kernel in Equation (59) was given by Hendriks [26], who also obtained a recursion for the partition function similar to that in Equation (30). The combinatorial treatment of Hendriks has in fact several common elements to ours but is limited to the constant and sum kernels and lacks the thermodynamic element of this work. The recursion for the cluster weights in Equation (55) has appeared in various treatments of aggregation, both deterministic [2,27] and stochastic [15,17,26]. The mean distributions in the continuous limit for the mean and sum kernels and for the product kernel in the pre-gel region are well known results in the literature [2]. The instability of power-law kernels has been discussed by Ziff et al. [3] in the context of the Smoluchowski equation. These connections to prior literature serve to validate the theory presented here and demonstrate that the thermodynamic treatment provides a unified theory of aggregation that brings previously disconnected results under a single formalism, the Smoluchowski ensemble.
The Smoluchowski ensemble is a probability space of distributions that are feasible under the rules of binary aggregation. The structure of this space, that is, the connectivity of the graph in Figure 1, is solely determined by the condition that aggregation is a binary event; the probability measure over this space is determined by the rate expression prescribed by the aggregation model. In Smoluchowski aggregation the rate is directly proportional to the number of clusters that appear on the reactant side of the aggregation reaction and on the aggregation kernel. In the scaling limit the probability of distribution is sharply peaked around a single distribution of the ensemble, its most probable distribution (MPD). In this limit all ensemble averages reduce to averages of the MPD, a distribution that alone suffices to generate all properties of the ensemble. The Smoluchowski coagulation equation is the time evolution of the most probable distribution in the asymptotic limit.
The step that turns the Smoluchowski ensemble into a thermodynamic ensemble is Equation (18). It expresses the probability of distribution in terms of two special functionals, the multinomial coefficient and the selection functional . This formulation introduces the partition function as the central property of the ensemble to which al other properties are connected. The thermodynamic calculus, summarized by the equations in Table 2, is a mathematical consequence of the variational condition that defines the most probable distribution in Equation (39) as the solution to the constrained maximization of the probability in Equation (18). The constraints are given by Equations (3) that fix the zeroth and first order moments of the distribution. These constraints define a microcanonical ensemble of distributions with fixed mean .
The MPD obtained by the method of constrained maximization is stable, provided that the partition function is concave in its independent variables. In extensive terms, must be concave in ; in intensive terms, must be concave in . The two conditions are equivalent and define the stability criterion of the MPD. As in regular thermodynamics, when the stability criterion is violated the system experiences phase splitting and exists a mixture of two phases—mathematically, as a linear combination of two independent MPD’s. In aggregation these phases are the sol phase, which is represented by the MPD in Equation (39) and the gel phase (giant component), which in the scaling limit is represented by a delta function at ∞. The splitting into a sol and gel phase is treated by the theory in a natural and rigorous manner.
Notably, entropy in this treatment plays no special role. The Shannon entropy of distribution is the log of the multinomial coefficient. In the scaling limit, entropy is a component of the partition function through Equation (36),
| (112) |
where is the Shannon functional evaluated at the MPD. In the special case of the constant kernel . In this case the partition function reduces to the Shannon entropy of the MPD,
| (113) |
and the variational condition reads,
| (114) |
In this form we have recovered the inequality of the second law as stated in statistical thermodynamics: the entropy of the equilibrium distribution (MPD) is at maximum with respect to all feasible distributions, namely, all distributions with the same mean. As is well known this distribution is exponential. The constant kernel is special. With the probability of distribution is proportional to ; accordingly, all ordered sequences of N clusters with total mass M are equally probable. The ordered sequence of cluster masses in this case is analogous to microstate in statistical mechanics and the condition analogous to the postulate if equal a priori probabilities. In the general case the Shannon entropy and the log of the microcanonical partition function are not the same. The fundamental functional that is maximized is the microcanonical weight , whose log is
| (115) |
The selection functional incorporates the effect of the aggregation kernel and in this sense it the point of contact between thermodynamics and the mathematical model of the stochastic process that gives rise to the probability space of interest. In Smoluchowski aggregation the model is defined by the transition rate in Equation (10) and the corresponding governing equation for W is Equation (33).
The thermodynamic formalism developed here is not limited to aggregation. Two alternative derivations that make no reference to stochastic process that gives rise to the probability space have been given in References [20,28]. As long as is a homogeneous functional with degree 1, the thermodynamic relationships follow as a direct consequence of the maximization of the microcanonical probability in Equation (18) under the constraints in Equation (3). The details of aggregation enter through Equations (32) and (33) that give the partition function and selection functional in terms of the aggregation kernel. The approach may be generalized to other processes including growth by monomer addition and breakup. These will be treated elsewhere.
Supplementary Materials
The following are available online at https://www.mdpi.com/1099-4300/22/10/1181/s1.
Funding
This research received no external funding.
Conflicts of Interest
The author declares no conflict of interest.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Smoluchowski M. Versuch einer mathematischen Theorie der Koagulationkinetic kolloider Loesungen. Z. Phys. Chem. 1917;92:129–168. [Google Scholar]
- 2.Leyvraz F. Scaling theory and exactly solved models in the kinetics of irreversible aggregation. Phys. Rep. 2003;383:95–212. doi: 10.1016/S0370-1573(03)00241-2. [DOI] [Google Scholar]
- 3.Ziff R.M., Hendriks E.M., Ernst M.H. Critical Properties for Gelation: A Kinetic Approach. Phys. Rev. Lett. 1982;49:593–595. doi: 10.1103/PhysRevLett.49.593. [DOI] [Google Scholar]
- 4.Hendriks E.M., Ernst M.H., Ziff R.M. Coagulation Equations with Gelation. J. Stat. Phys. 1983;31:519–563. doi: 10.1007/BF01019497. [DOI] [Google Scholar]
- 5.Ziff R.M., Stell G. Kinetics of polymer gelation. J. Chem. Phys. 1980;73:3492–3499. doi: 10.1063/1.440502. [DOI] [Google Scholar]
- 6.Marcus A. Stochastic Coalescence. Technometrics. 1968;10:133–143. doi: 10.1080/00401706.1968.10490541. [DOI] [Google Scholar]
- 7.Lushnikov A.A. Coagulation in finite systems. J. Colloid Interface Sci. 1978;65:276–285. doi: 10.1016/0021-9797(78)90158-3. [DOI] [Google Scholar]
- 8.Lushnikov A.A. Exact kinetics of a coagulating system with the kernel K = 1. J. Phys. A Math. Theor. 2011;44:335001. doi: 10.1088/1751-8113/44/33/335001. [DOI] [Google Scholar]
- 9.Lushnikov A.A. Exact kinetics of the sol-gel transition. Phys. Rev. E. 2005;71:046129. doi: 10.1103/PhysRevE.71.046129. [DOI] [PubMed] [Google Scholar]
- 10.Lushnikov A.A. Exact particle mass spectrum in a gelling system. J. Phys. A Math. Gen. 2005;38:L35. doi: 10.1088/0305-4470/38/3/L01. [DOI] [Google Scholar]
- 11.Lushnikov A.A. From Sol to Gel Exactly. Phys. Rev. Lett. 2004;93:198302. doi: 10.1103/PhysRevLett.93.198302. [DOI] [PubMed] [Google Scholar]
- 12.Aldous D.J. Deterministic and stochastic models for coalescence (aggregation and coagulation): A review of the mean-field theory for probabilists. Bernoulli. 1999;5:3–48. doi: 10.2307/3318611. [DOI] [Google Scholar]
- 13.Stockmayer W.H. Theory of Molecular Size Distribution and Gel Formation in Branched-Chain Polymers. J. Chem. Phys. 1943;11:45–55. doi: 10.1063/1.1723803. [DOI] [Google Scholar]
- 14.Spouge J.L. Analytic solutions to Smoluchowski’s coagulation equation: A combinatorial interpretation. J. Phys. Math. Gen. 1985;18:3063. doi: 10.1088/0305-4470/18/15/028. [DOI] [Google Scholar]
- 15.Spouge J.L. Equilibrium polymer size distributions. Macromolecules. 1983;16:121–127. doi: 10.1021/ma00235a024. [DOI] [Google Scholar]
- 16.Hendriks E.M., Spouge J.L., Eibl M., Schreckenberg M. Exact solutions for random coagulation processes. Z. Phys. B Condens. Matter. 1985;58:219–227. doi: 10.1007/BF01309254. [DOI] [Google Scholar]
- 17.Spouge J.L. The size distribution for the AgRBf-g Model of polymerization. J. Stat. Phys. 1983;31:363–378. doi: 10.1007/BF01011587. [DOI] [Google Scholar]
- 18.Matsoukas T. Statistical Thermodynamics of Irreversible Aggregation: The Sol-Gel Transition. Sci. Rep. 2015;5:8855. doi: 10.1038/srep08855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Matsoukas T. Abrupt percolation in small equilibrated networks. Phys. Rev. E. 2015;91:052105. doi: 10.1103/PhysRevE.91.052105. [DOI] [PubMed] [Google Scholar]
- 20.Matsoukas T. Statistical thermodynamics of clustered populations. Phys. Rev. E. 2014;90:022113. doi: 10.1103/PhysRevE.90.022113. [DOI] [PubMed] [Google Scholar]
- 21.Flory P.J. Molecular Size Distribution in Three Dimensional Polymers. I. Gelation. J. Am. Chem. Soc. 1941;63:3083–3090. doi: 10.1021/ja01856a061. [DOI] [Google Scholar]
- 22.Matsoukas T. Generalized Statistical Thermodynamics: Thermodynamics of Probability Distributions and Stochastic Processes. Springer International Publishing; Berlin/Heidelberg, Germany: 2019. [DOI] [Google Scholar]
- 23.Berestycki N., Pitman J. Gibbs Distributions for Random Partitions Generated by a Fragmentation Process. J. Stat. Phys. 2007;127:381–418. doi: 10.1007/s10955-006-9261-1. [DOI] [Google Scholar]
- 24.Kelly F.P. Reversibility and Stochastic Networks. Cambridge University Press; Cambridge, UK: 2011. (Reprint of the 1979 edition by Wiley) [Google Scholar]
- 25.Smith M., Matsoukas T. Constant-number Monte Carlo simulation of population balances. Chem. Eng. Sci. 1998;53:1777–1786. doi: 10.1016/S0009-2509(98)00045-1. [DOI] [Google Scholar]
- 26.Hendriks E.M. Cluster size distributions in equilibrium. Z. Phys. B Condens. Matter. 1984;57:307–314. doi: 10.1007/BF01470421. [DOI] [Google Scholar]
- 27.Lushnikov A.A. Evolution of coagulating systems. J. Colloid Interface Sci. 1973;45:549–556. doi: 10.1016/0021-9797(73)90171-9. [DOI] [Google Scholar]
- 28.Matsoukas T. Thermodynamics Beyond Molecules: Statistical Thermodynamics of Probability Distributions. Entropy. 2019;21:890. doi: 10.3390/e21090890. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






