Abstract
In computational materials science, predicting the yield strain of crosslinked polymers remains a challenging task. A common approach is to identify yield as the first critical point of stress-strain curves simulated by molecular dynamics (MD). However, in such cases the underlying data can be excessively noisy, making it difficult to extract meaningful results. In this work, we propose an alternate method for identifying yield on the basis of deformation-recovery simulations. Notably, the corresponding raw data (i.e. residual strains) produce a sharper signal for yield via a transition in their global behavior. We analyze this transition by non-linear regression of computational data to a hyperbolic model. As part of this analysis, we also propose uncertainty quantification techniques for assessing when and to what extent the simulated data is informative of yield. Moreover, we show how the method directly tests for yield via the onset of permanent deformation and discuss recent experimental results, which compare favorably with our predictions.
Keywords: Yield strain, Molecular Dynamics, Uncertainty Quantification
1. Introduction
In computational materials science, estimating the ultimate mechanical properties of crosslinked polymers remains a challenging task [1, 2]. From an industrial, materials-design perspective, one of the key problems amounts to an inherent competition between throughput and model fidelity. That is, on the one hand, the microscopic events that determine bulk properties involve many-body interactions over long distances; thus large, fully atomistic (or even quantum) simulations are needed to capture the relevant physics [3–5]. On the other hand, such computations become prohibitively expensive long before the system size approaches the thermodynamic limit. As a compromise, it is therefore becoming common for modelers to use atomistic molecular dynamics (MD) simulations of modest-sized systems1 in the hopes that the corresponding predictions will nevertheless approximate bulk properties [1, 3, 6–8].
While this trend is furthering the development of simulations as an industrial engineering tool [2], it has also increasingly brought scientists into contact with the limitations of MD. In the case of yield-strain ϵy, such limitations typically manifest in the form of noise that complicates data analysis [1, 6, 8–10]. In more detail, modelers often identify yield as the first local maximum of the von-Mises stress-strain curve σ(ϵ) [11–14]. However, finite-size simulations and limited time averaging allow for the motion of individual monomers and torsions to manifest as kinks in the resulting datasets, introducing artificial extrema (cf. Fig. 1 and Refs. [6, 8, 10, 15]). Moreover, the spectrum of relaxation times for crosslinked polymers is poorly sampled by MD, and hence it is not even obvious that simulated stress-strain curves necessarily capture yield processes. As a result, modelers require improved simulation techniques and uncertainty quantification (UQ) analyses to better assess when and to what extent their predictions are informative of yield [2, 7, 16].
In this work, our goal is to address these problems by proposing a modified procedure for estimating ϵy on the basis of deformation-recovery simulations. In place of stress-strain data, we analyze simulated residual strains ϵr as a function of the applied engineering strain ϵ. With respect to such data, yield is identified as the onset of permanent deformation, i.e. nonzero values of ϵr. As we show, this approach provides a more precise estimation of ϵy via a sharp transition in the behavior of the underlying raw data. Importantly, this transition can be analyzed in terms of known techniques based on hyperbola fits, which are straightforward to implement and admit simple UQ techniques. Moreover, corresponding experimental results also indicate that the value of yield extracted from such a procedure is independent of the time over which the system is allowed to relax; cf. Refs. 12, 17. Given that MD is often limited to simulating times on the order of a few nanoseconds, the aforementioned experimental results suggest that our approach may be unaffected by this timescale constraint.
A key motivation for our procedure arises from the observation that analyses based on σ(ϵ) may lead to unfounded estimates of ϵy and/or unacceptable levels of uncertainty in its value. A major contribution to this uncertainty arises from the fact that the traditional, critical point estimates of ϵy depend on a local property of the stress-strain curve,2 so that without additional outside information, most of the simulated data does not inform the value of yield. In such circumstances, the uncertainty δ in ϵy is roughly related to the noise ς via the approximation , which can be quite large for typical simulations (cf. Fig. 1).3 In order to overcome this issue, our analysis extracts ϵy from the global behavior of ϵr(ϵ), which is characterized in terms of hyperbola asymptotes. Importantly, this approach sharpens estimates of ϵy by using all of the available datapoints to stabilize the fitting process. Moreover, the fit can itself be analytically interrogated to assess when a given dataset is too noisy to determine ϵy.
A second motivation for our procedure stems from the desire to formulate an objective, reproducible, and automated workflow for estimating yield. In the context of analyses based on stress-strain relations, this task remains challenging, given that there is no known, global functional form for σ(ϵ) describing crosslinked polymers in the nonlinear regime [18, 19]. As a result, the associated methods for estimating ϵy necessarily vary among modelers and introduce uncertainty via subjective interpretations of the data. In contrast, our method relies on the generic observation that ϵr = 0 and ϵr ∝ ϵ −ϵy when the applied deformations are below and above yield, respectively. As this behavior is universal across a range of material systems, our corresponding data analysis can be applied in a uniform and reproducible manner with little-to-no input from the modeler.
Despite these stated aims, however, we emphasize that the scope of our work is limited to an analysis of yield (and its uncertainty) within the context of a single simulation. This has important implications insofar as a single, small simulation may not capture all sources of uncertainty [7]. In the case of the glass transition temperature Tg, Ref. [7] addressed this problem via a weighted-mean average designed to account for the uncertainty between multiple simulated datasets. However, this estimate relied on a prerequisite convergence analysis to help verify that it was indeed appropriate to average datasets in the first place. Given that it is not clear how to extend this convergence test to the case of yield, we refrain from invoking a similar between-simulation uncertainty analysis. Here our goal is simply to propose and justify a modified simulation procedure for estimating ϵy, and we leave a final assessment of the total simulated uncertainties for future work.
Along related lines, we do not exhaustively validate our method against experimental results. Recent work has demonstrated that strain-recovery can be used to estimate yield of glassy polymers in laboratory settings [20], and we show how our method makes qualitatively similar predictions. However, a more quantitative comparison requires extensive estimation of model-form uncertainties. Given that verification (or estimation of uncertainties within the computational realm) remains an open problem, we postpone rigorous validation until the former task is more fully addressed. Such tasks comprise ongoing studies.
The rest of this paper is organized as follows. In Sec. 2, we discuss the simulation procedure needed to generate residual-strain data. In Sec. 3, we discuss our method for extracting ϵy and the statistical analyses underlying the associated UQ. Section 4 further discusses our method in the context of stress-strain curves and experimental data, and appendices provide key mathematical ideas underlying the simulation protocol.
2. Simulation procedure
2.1. Procedure Overview
Our simulation protocol is implemented via custom Materials Studio™ scripts and consists of three main steps: (i) crosslinking and annealing; (ii) iterative straining; and (iii) strain relaxation.4 As the first of these steps has been documented elsewhere [7], we only review key points. The second and third steps are discussed in more detail below.
Starting with an unreacted unit cell of monomers, we crosslink the system by iterating between nearest-neighbor bonding steps and short relaxation simulations that eliminate high-energy configurations. After the monomers are sufficiently crosslinked (between 60% and 95%), the system is heated to 800 K and then annealed to 300 K in 10 K increments using the Parrinello barostat [21] and Andersen thermostat [22]. A convergence criterion on the density adaptively determines the number of timesteps at each temperature. The criterion first allows the density to equilibrate to its new temperature until fluctuations in its running average fall below some threshold. Then, this running average is discarded, and a new running average density is computed until its fluctuations fall below a second, stricter threshold. After this criterion is met, the temperature is decreased and the procedure repeated; cf. Appendix A. Once the system has reached 300 K, the running average cell dimensions of the last averaging step are taken to be the dimensions of the relaxed, unstrained system.
Given an annealed structure, we next apply a small, volume conserving strain Δϵ (i.e. Poisson’s ratio of 1/2) to the system by compressing in the direction of the longest basis vector.5 We apply a tensile stress in two other orthogonal directions. Because the annealed structures are generally skew, this involves both lengthening the basis vectors and changing their relative angles; see Appendix B for details of the calculations. Next, we equilibrate the system via an NVT simulation that adaptively chooses the number of timesteps according to a convergence criterion analogous to the one described previously, but in this case monitoring the stress. As above, each iteration uses a second, more stringent convergence criterion to compute a final, average stress for use in subsequent analysis. In addition, the final structure obtained here is saved for later use to evaluate the residual strain as discussed below. In order to generate a full stress-strain curve, we increment the applied strain and iterate the above steps until enough datapoints (ϵi, σi) have been collected, typically 51 in total (i.e. corresponding to one zero-strain structure and 50 strain increments).
After this process has completed, we allow each of the N saved structures to relax using an NPT simulation. As above, the number of timesteps is adaptively chosen until fluctuations in the running average system dimensions fall below a threshold, and the final average dimensions are computed using a second, more stringent criterion. Denoting the length of the sides of the relaxed and original unit cells as ℓi and Li, we define the residual strain as
(1) |
Our definition is consistent with those used in Refs. [17, 20, 23], with the exception that we consider strain in all three directions as opposed to just the loading direction axis. That is, they defined ϵr = (L1 − ℓ1)/L1, where i = 1 corresponds to the axis of the applied load. Both definitions show identical bilinear behaviors as discussed below. However, we empirically find that the definition in terms of Eq. (1) provides a slightly larger signal-to-noise ratio.
2.2. Systems considered in this work
In this work, we consider two chemistries representative of materials found in aerospace applications. The first of these, which we denote 33MY, is a one-to-one mixture of: 3,3-diaminodiphenyl sulfone (33DDS), a four-functional amine; and tetraglycidyl methylene dianiline (MY720), a four-functional epoxy. The second system, which we denote 44BA, is a one-to-two mixture of: 4,4-diaminodiphenyl sulfone (44DDS); and digycidyl ether of Bisphenyl A (BisA), a two-functional epoxy. The second chemistry was chosen because Ref. [20] recently performed experiments on this systems using techniques analogous to our strain-recovery simulations.
All of our simulated results were generated from unit cells with roughly 4000 to 5000 atoms. While this is small relative to state-of-the-art calculations [1, 6, 8, 11], our emphasis is on generating data that is representative of what might be encountered in a high-throughput environment. Sizes above roughly 10,000 atoms can take weeks to months to simulate on high-end machines, which may be prohibitive from a cost perspective (cf. also Ref. [7]).6 Moreover, our goal is not to generate the most well-behaved data, but rather to show that data acquired at moderate computational cost can provide value subject to careful and robust analysis. To that end, we chose chemistries and system sizes to demonstrate a range of possible behaviors encountered in practice.
3. Data analysis
3.1. Observations on residual strain data
Figure 2, shows a representative example of a residual strains computed from the system that generated Fig. 1. Before proposing our data analysis routine, several comments are in order.
First, ϵr(ϵ) has bilinear behavior that admits a simple physical interpretation. When the applied strain is below yield, the system is able to fully recover to its original dimensions; thus the corresponding ϵr are close to zero and constant. When the applied strain is greater than yield, only the pre-yield deformation is recoverable, leading to an approximately linear growth of ϵr with ϵ. Thus, in principle (but not practice), ϵy can be identified as the value of applied strain at which the linear growth first appears.
In practical simulations, such transitions are not sharp; we attribute this to several effects. For one, strain-relaxation in polymers occurs via two distinct mechanism, namely elastic and viscoelastic recovery [24]. The former occurs instantaneously and is associated with deformation modes that are confined to local energy minima, such as small changes in bond lengths. Viscoelastic effects, on the other hand, are typically associated with transitions between energy minima, such as the torsional rearrangement between discrete states, collective sliding of chains, and so forth. As the barriers between such states may be significant, the associated relaxation modes occur over physical timescales that may be unaccessible to MD. Therefore, it is reasonable to expect that simulated ϵr(ϵ) will smoothly transition from 0 to a post-yield, linear behavior over some finite interval of applied strains.7
Thermal noise and finite-size effects also complicate our interpretation of the data. In particular, at small applied strains one expects that ϵr = 0. However, fluctuating system dimensions conspire with the absolute values in Eq. (1) to return small, non-zero ϵr. Beyond yield, the data in Fig. 2 also shows a marked increase in fluctuations about linear behavior. In the context of stress-strain curves, Sundararaghavan and Kumar [15] provided convincing evidence that such large fluctuations are due to a finite number of torsional rearrangements and related transitions that become activated post-yield. By virtue of the small system size, such rearrangements produced noticeable perturbations in their data. Given the similarity to systems that we consider in this work, we anticipate that the increase in our post-yield noise arises for similar reasons. Below we pay careful attention to ensure that our estimates of ϵy are not overly affected by such effects.
Along related lines, we also note that in several regards, Fig. 2 is visually consistent with its underlying stress-strain curve in Fig. 1. In particular, the transition region in the former corresponds roughly to a plausible location for the maximum of σ(ϵ). Moreover, our general observations about the trends in noise apply equally to both. By eye, however, the plausible region for ϵy is much smaller in Fig. 2 than in Fig. 1. In the following section, we propose analyses to make such observations more precise.
3.2. Hyperbola fits of residual strain
As noted in the previous section, under ideal circumstances all of the applied strain up to yield is recoverable, while post-yield strain is not. Mathematically this observation can be stated by letting ϵr(ϵ) be a function of the form
(2) |
where Θ(x) is the Heaviside step function, and a and b are constants; cf. the green asymptotes in Fig. 2. We emphasize that this function is at best an approximation for ϵr. For one, there is (to the best of our knowledge) no general theory that predicts such a bilinear behavior for the residual strain, despite ample experimental evidence [12, 17, 18, 20, 23]. Moreover, we physically expect that a = 0, but as has been noted practical simulations often return small, non-zero values of ϵr when ϵ → 0. Nonetheless, the data shown in Fig. 2 (along with subsequent figures) demonstrates that Eq. (2) is a good approximation to the behavior of ϵr(ϵ).
In order to account for the smoothing effects of finite-time and -size simulations, we fit ϵr to a hyperbola as follows. Specifically, define
(3) |
where a, b, c, and ϵy are to be determined. Note that when c → −∞, the hyperbola given by Eq. (3) tends in the limit to the double line formula given by Eq. (2). Finite c allows for a smooth transition between the two linear asymptotes. Equation (3) automatically enforces the requirement that below yield, the residual strain approaches a constant (ideally a ≈ 0). Given a set of parameters ϕ = (a, b, c, ϵy) fit to a particular dataset, we equate the hyperbola center with yield ϵy.
In order to actually determine ϕ, we perform a weighted, nonlinear least squares fit of the simulated data to Eq. (3) via
(4) |
where ϵj and ϵr,j are the applied and residual strain values returned at the D discrete simulation points, and ςj is weighting factor associated with the amplitude of the noise in the simulations. This latter quantity is particularly important in order to prevent the hyperbola from overfitting the increasingly noisy, high-strain data; see Fig. 2. As we show below, failure to do so can lead to unphysical values of yield.
In order to estimate ςj, we use an iterative approach wherein we first assume ςj = 1. Solving the least-squares problem associated with Eq. (4) provides an initial set of parameters corresponding to a hyperbola . Defining the squared residuals , we fit these to a power law of the form
(5) |
The non-constant variance model (5) is determined by maximizing the likelihood
(6) |
as a function of q. Taking ς2(ϵ) = R(q, ϵ), we then compute a final estimate of the hyperbola parameters ϕLS by solving Eq. (4) with the updated weighting factor evaluated at the ϵj.
Figure 2 shows the results of this method for the residual strain experiments corresponding to the computational stress-strain shown in Fig. 1. Figure 3 shows the noise-model fit according to Eq. (5). As a result of the iterative update for ς, the hyperbola fits the small-strain data well while exhibiting some flexibility in its interpolation beyond ϵy. This is reasonable, given that the fluctuations in the former are small. Figure 4 also shows the residuals scaled by R(q, ϵ). By eye, these appear uniform in scale and uncorrelated. We take this as evidence that the uncorrelated noise model with strain-dependent variance (5) is sufficient to describe the associated fluctuations in ϵr.
As an aside, we note that residuals to the hyperbola fit are not explained by the error bars in the residual strain data, which are on the order of 0.01 % (cf. Appendix A). In light of the discussion of the previous section, we attribute this discrepancy to the fact that small simulations undersample the (random) network configurations of the crosslinked polymers. See Ref. [15], as well as Ref. [7] for a more in-depth discussion in the context of the glass-transition temperature.
3.3. Rejection criterion for datasets
Considered as a tool for uncertainty quantification, Eq. (3) is useful for assessing when a given dataset is informative of yield. In particular, the parameter c can be used to quantify the extent to which the data approaches asymptotic regimes of the hyperbola. Specifically, Ref. [7] showed that for a user-defined convergence threshold P (where 0 < P < 1), the formula
(7) |
defines an interval [ϵy − Δϵ, ϵy + Δϵ] outside of which the ϵr are 100 × P % converged to their nearest asymptote. Thus, given a value of P, we can use Eq. (7) to identify those datasets for which the residual strain approaches a constant value when the applied strain is small. Any simulation for which ϵy−Δϵ < 0 can be held out for further inspection on the grounds that the data does not sample any deformation states that entirely recover.
In Fig. 2, we illustrate an analysis according to Eq. (7) applied to the data underlying Fig. 1 with P = 0.99. The strictness of the convergence criterion is reflected in the fact that the residual strains essentially overlap the hyperbola asymptotes up to the region outlined in black, which was later identified as the confidence interval for ϵy.
3.4. Uncertainty quantification of ϵy through noise sampling
As Fig. 2 illustrates, noise in the simulated data leads to uncertainty with regards to the exact location of ϵy. By eye, it appears that ϵ ≈ 0.08 is a plausible lower bound, although such an assessment is largely subjective. In this section, we propose a statistical method for estimating a confidence interval containing ϵy on the basis of repeated noise sampling. Physically, our goal is to quantify the extent to which finite-size and -time fluctuations in the data inhibit our ability to compute a best-fit hyperbola.
Given and R(q, ϵ) from Sec. 3.2, we note that the latter amounts to a noise model for the residuals of ϵr. Letting denote a normal random variable with zero mean and variance x, we therefore construct synthetic datasets
(8) |
by using random number generators to realize the . Fitting these synthetic datasets to hyperbolas via Eq. (4) [with ] generates new estimates of ϵy. Given that this process is inexpensive, we realize or more such estimates and compute confidence intervals on the basis of the resulting distribution.
Figures 5 and 6 shows the results of this procedure applied to the simulation results that generated Figs. 1 – 3. In Fig. 6, we take the confidence intervals for yield (dash-dot black lines) to be the minimum and maximum values of ϵy returned by the noise sampling algorithm. For the time being, we avoid reporting a mean and standard deviation since it is unclear whether the initial departure from ϵr ≈ 0 is due to the onset of yield or an inability to sample long-relaxation time viscoelastic modes. Nonetheless, Figs. 2 and 6 show clearly that the residual strain method can lead to a marked decrease in the uncertainty in yield.
4. Discussion and conclusions
4.1. Behaviors seen in data
Amine-cured epoxies are a rich class of materials, and in silico they exhibit a variety of behaviors that affect the data analysis. In this section, we discuss such considerations in more detail. Relevant conclusions are summarized within each subsection.
4.1.1. Overfitting and non-weighted least squares estimates
In Sec. 3.2, we suggested that a non-weighted least squares estimate of the parameters ϕ0 can lead to a hyperbola that overfits the residual strain data. Figure 7 shows an example of this situation. The dashed, purple hyperbola corresponds to a non-weighted least squares fit with ς2 = 1. At large applied strains, the fit follows the data significantly better than its weighted counterpart. However, examination of the inset shows that the non-weighted least squares estimate of yield is unphysically high near a value of ϵy = 0.19. We anticipate that this is due to overfitting the high-strain data, thus necessitating the iterative analysis by which we determine R(q, ϵ). The weighted-least squares hyperbola (solid pink) allows for more flexibility in fitting the high-strain data, and leads to correspondingly more reasonable estimate of ϵy.
4.1.2. Comparison to methods based on σ(ϵ)
Up to this point, we have suggested that the residual strain method provides estimates of ϵy that are both consistent with and sharper than their σ(ϵ) counterparts. This is not always be true. Figure 8 illustrates a case for which the peak of the stress-strain data appears well resolved and localized around ϵ = 0.1, but outside the confidence interval determined from an analysis of 50,000 synthetic datasets. Moreover, a plausible eyeball estimate of the uncertainty in ϵy extracted from σ(ϵ) is δ = ±0.01, so that there is little overlap with the residual strain estimate. Such a discrepancy could arise from a random fluctuation in σ(ϵ) that creates an artificial maximum.8 However, this observation does not necessarily imply that our residual strain method provides a more accurate estimate of ϵy in this case. Notably, the right-most bound from the latter method falls far into the post-yield region relative to the σ(ϵ) data. Thus, this example highlights the potential need for estimates that combine information from multiple sources in order to overcome their individual limitations.
4.2. Comparison with experiments
While our main goal is to propose and verify a new procedure for estimating ϵy from MD simulations, our approach would nonetheless be problematic if it showed little correspondence with real experiments. To this end, we briefly discuss past results that support the usefulness of our method.
In 1997, Quinson et al. showed that deformation-relaxation experiments can be used to quantify the rate-dependence of relaxation modes in linear polymers such as polystyrene [12, 17]. As a byproduct of this work, they generated plots of residual strain data as a function of the applied strain. Interestingly, they observed that yield (or the onset of plastic deformation) occurred at the first value of for which the material exhibited a non-zero residual strain; cf. also Refs. [12, 17, 18, 23]. It is noteworthy that in all of their results, ϵr is approximately a linear function of the applied strain beyond yield. Critically, this observation holds irrespective of either the deformation rate, temperature, or relaxation time. Given the inherent length and time-scale limitations of MD, these observations are therefore encouraging, since they suggest that our simulated estimates of ϵy are independent of the time over which the system is allowed to relax.
Reference [20] recently reproduced some of these results via strain-relaxation experiments on 44BA. Certain details of their analysis differ from our approach. Notably, they computed residual strain relative to the direction of major strain only; moreover, their compression tests were not necessarily volume conserving. Nonetheless, key elements of their results are replicated in our simulations. In particular, their residual-strain data exhibits the same bilinear character evident in, e.g. our Fig. 2 and Refs. [12, 17, 18, 23]. Moreover, they identified the onset of permanent deformation with the appearance of a non-zero slope in their ϵr. Such observations are promising in that they suggest that the simulations capture the same phenomena as experiments.
More direct comparison with the results in Ref. [20] is complicated by several factors. In particular, the authors did not measure the strain perpendicular to the direction of compression; thus it is not possible to compute the Poisson ratio of the deformation. Moreover, our procedure for creating crosslinked structures has free parameters controlling properties such as the final crosslink percentage, etc. Thus, while it is likely that our method can be calibrated to be comparable with experimental measurements, such an analysis cannot be performed with the currently available information.
Finally, we note that while the simulated systems are all amine-cured epoxies, the experimental work that motivated our method comes from studies of systems such as poly(methylmethacrylate), polystyrene, and polycarbonate; cf. Refs. 12, 17, 18, 23. Given this, we anticipate that the bilinear behavior seen in our residual-strain data is likely to be a generic feature of diverse polymer systems. Thus, our method of estimating yield should also apply in such cases.
4.3. Limitations and open problems
As suggested in Sec. 1 and Ref. [7], a key feature of small-scale molecular dynamics simulations is the inability for any one realization to sample all of the possible structures that a crosslinked network can form. In Ref. [7], the authors showed how this undersampling can lead to significant uncertainties that are not captured in the analysis of a single simulation. They addressed this problem by proposing a weighted-mean statistic to estimate the unaccounted for ”dark” uncertainties by comparing results from multiple independent simulations. In general, we expect that a similar undersampling of the crosslinked network gives rise to varying estimates of yield between simulations. But while it is desirable to extend the dark-uncertainty analysis to the case at hand, certain problems arise.
In particular, such an analysis requires that the individual datasets be converged as a function of system size so that the hyperbola analysis does not bias predictions via nonlinear transformations of the noise. Reference [7] addressed this problem through a pooling analysis that detected bias by systematically averaging datasets to mimic the effect of reducing noise. Importantly, however, this analysis was facilitated by the fact that Tg is extracted from scalar relationships (i.e. density-temperature curves); thus it was straightforward to formulate the notion of an average density. In the case of yield-strain, the corresponding raw data is composed of tensors (which are subsequently massaged into scalars). Given that the underlying unit cells are non-orthogonal, it is not straightforward to formulate what we mean by average stress and strain tensors. That is, the appropriate method of averaging raw data may depend on the relative orientations of the unit cells, which requires additional modeling or assumptions beyond what we propose.
Acknowledgments
The authors thank Anthony Kearsley, Stephen Christensen, and Matthew Jackson for helpful discussions during preparation of this manuscript. The authors also thank The Boeing Company for computational resources provided in support of this project. P.P. was partially funded by the Institute for Mathematics and its Applications (IMA) as a postdoctoral fellow during this work. The IMA is a National Science Foundation Math Institute funded under award DMS-0931945.
This work is a contribution of the National Institute of Standards and Technology and is not subject to copyright in the United States.
Appendix A. Convergence criterion
The general method for assessing convergence of our simulated quantities works as follows. First, let ρ denote an arbitrary quantity such as density, stress, or length. Denote an interval of times between time tj and tj+1 as Ij := [tj, tj+1], and partition this interval into Nj equal timesteps tj,k. We use Ij to represent a single simulation whose duration is equal to the length of the interval; tj,k (1 ≤ k ≤ Nj) denote the discrete simulation timesteps within that interval.
For both the equilibration and ρ-averaging algorithms, we first run a single simulation and compute the running average
(A.1) |
where ρ(t1,k′) is the quantity output by the simulation at the k′th step in the 1st interval. Next, we compute the “unnormalized variance”
(A.2) |
for j = 1. If V1 is less than some user-defined threshold Vtarget, then the algorithm stops. If, on the other hand, V1 > Vtarget, we run additional simulations, computing
(A.3) |
until Vj < Vtarget. Note that we calculate the running average across the sum of intervals Σj I j, whereas we only compute the variance Vj for individual intervals I j.
In general, the task of picking suitable interval lengths I and convergence criteria Vtarget is non-trivial. Degenerate cases illustrate potential problems. If I is so small as to encompass only a few timesteps, the system will not evolve from its initial state, and the variances Vj will be artificially small. Consequently, the algorithm will stop prematurely. In the other extreme, picking I to be on the scale of nanoseconds (which is long for MD simulations) can lead one to do more dynamical averaging than may be necessary. Picking Vtarget too large or small leads to similar problems. For condensed polymer systems with atoms or fewer, we find that setting I = 20 ps is a reasonable compromise for all temperatures between 100 K and 800 K, provided that our timesteps are 1 fs. This yields Nj = 20, 000 for all j. For the density equilibration and averaging steps, we set Vtarget = 10−3 g2/cm6 and Vtarget = 10−7 g2/cm6, respectively. For stress-equilibration and averaging steps, we set and , where σf was the final running average stress in the I j under consideration. For the cell-dimension equilibration and averaging steps, we set and , where ϵf was the final running average residual strain in the I j under consideration. In particular, this implies that error bars on the computed stresses are on the order of or 3%. In all of the figures, this corresponds to roughly the thickness of the lines used to draw the symbols. Likewise, error bars on the residual strains are on the order or 0.01 %, which are not visible on the scale of the plots.
We caution that this stopping algorithm can severely undersample metastable energy minima of a single system, especially when the characteristic time to leave the mimima is on the order of or greater than the simulation lengths I j. We attempt to compensate for this problem by generating multiple realizations of the same chemistry in order to more fully explore its possible configurations.
It is also important to note that our convergence criterion does not guarantee that the system statistics are representative of realistic equilibrium distributions. Reference [25] provides an analysis for assessing the extent to which this latter criterion has been satisfied on MD timescales, and this approach can likely be incorporated into our simulation procedure. However, we do not explicitly quantify uncertainties due to lack of such convergence because MD cannot reach the timescales necessary to verify that the aforementioned analysis is applicable to crosslinked polymers in the glassy regime.
Appendix B. Strain control for non-orthogonal unit cells
Our MD simulations model non-orthogonal, periodic unit cells with shapes that vary between realizations. In order to compare simulation datasets with different underlying structures, it is therefore useful to strain all of the systems in the same way, e.g. relative to an orthogonal coordinate system. In practice, this entails changing both the length of and angles between the non-orthogonal basis vectors.
To achieve this, we first pick an orthogonal coordinate frame in which to strain the system. Without loss of generality, we assume that the longest basis vector of the non-orthogonal system coincides with the z-axis of its orthogonal counterpart. In the orthogonal basis, we impose a volume conserving strain
(B.1) |
where λ > 0 and Δ > 0 are small. Letting denote the identity matrix, volume conservation implies that , where | ★ | denotes the determinant of ★. Consequently,
(B.2) |
so that picking the strain increment associated with compression determines expansion in the other two orthogonal directions.
To compute the associated changes in the non-orthogonal unit cell, first let A, B, and C denote its three unit vectors, recall that C is parallel to the z-axis, and assume that B is in the y-z plane. The angles between these unit vectors are defined via the inner products
(B.3) |
(B.4) |
(B.5) |
It is convenient to encapsulate this information by writing A, B, and C in terms of the orthogonal coordinate system; viz.
(B.6) |
(B.7) |
(B.8) |
where , , and are the orthogonal unit vectors, and
(B.9) |
(B.10) |
These last identities are computed via the aid of Eq. (B.5) and the fact that |A| = 1 by definition.
Now, straining the system according to Eq. (B.1) amounts to computing new basis vectors , , and via the relation
(B.11) |
The non-orthogonal unit vectors become
(B.12) |
(B.13) |
(B.14) |
The lengths of the strained, non-orthogonal basis vectors can be easily calculated from the above. Moreover, we can define strained angles by inverting the definitions
(B.15) |
(B.16) |
(B.17) |
Footnotes
i.e. systems that model to atoms over tens of nanoseconds.
That is, a (non-global) critical point is only associated with the behavior (i.e. zero slope) of a function f(x) at a single point, namely .
This approximation is derived as follows. First, let σ′ and σ″ denote the first and second derivatives of stress. Next note that by definition, σ′(ϵy) = 0 and σ″(ϵy) < 0. Therefore, the approximate value of Δϵ = δ required to see a change −ς in the stress is given by inverting −ς = (1/2)δ2σ″ (ϵy), i.e. .
Certain commercial equipment, instruments, or materials are identified in this paper in order to specify the experimental procedure adequately. Such identification is not intended to imply recommendation or endorsement by the National Institute of Standards and Technology, nor is it intended to imply that the materials or equipment identified are necessarily the best available for the purpose.
As an aside, we emphasize that this method does not assess a possible anisotropic response of the material to different loading conditions. Here, our method is chosen solely to ensure that none of the system dimensions become smaller than the electrostatic interaction cutoff distance.
From the perspective of noise-reduction, another consideration also comes into play in our choice of system size. In statistical mechanics, fluctuations in intensive quantities such as the pressure tend to decay as , where N is the number of particles. Thus, a 10,000 atom simulation would only reduce noise in a stress-strain curve such as Fig. 1 by a factor of about 1.5, which is not likely to be worth the extra computational cost. Roughly 4 × 105 atoms would be required to reduce the noise in Fig. 1 by one order of magnitude. See Refs. 10, 11 for stress-strain curves generated from 70,000 atom and 1:5 × 106 atom simulations.
The convergence criterion in Appendix A is designed to alleviate this problem. However, the underlying calculation considers running averages computed on finite (and typically small) blocks of time, often on the order of 10 ps. As a result, relaxation modes with significantly longer characteristic times are not adequately sampled by the simulations.
It is also worth noting that this “peaking” behavior is not due to the chemistry, which is different from the 33MY systems used to generate the other datasets. This example was deliberately chosen to be problematic and represents a natural variation in data associated with systems of this size. We have found that 44BA datasets frequently look like Fig. 1.
References
- [1].Li C, Strachan A, Molecular scale simulations on thermoset polymers: A review, Journal of Polymer Science Part B: Polymer Physics 53 (2) (2015) 103–122. doi: 10.1002/polb.23489. [DOI] [Google Scholar]
- [2].Goldbeck G, The economic impact of molecular modeling, Goldbeck Consulting Ltd; (2012) 1–45. [Google Scholar]
- [3].Buchholz J, Paul W, Varnik F, Binder K, Cooling rate dependence of the glass transition temperature of polymer melts: Molecular dynamics study, The Journal of Chemical Physics 117 (15). [Google Scholar]
- [4].Berne B, Ciccotti G, Coker D, italiana di fisica S, Classical and Quantum Dynamics in Condensed Phased Simulations: Proceedings of the International School of Physics ”Computer Simulation of Rare Events and Dynamics of Classical and Quantum Condensed-Phased Systems” : Lerici, Villa Marigola, 7 July-18 July 1997, Euroconference on ”Technical Advances in Particle-Based Computational Material Sciences”, World Scientific, 1998. [Google Scholar]
- [5].Deuflhard P, Computational molecular dynamics: challenges, methods, ideas : proceedings of the 2nd International Symposium on Algorithms for Macromolecular Modelling, Berlin, May 21–24, 1997, Lecture notes in computational science and engineering, Springer, 1999. [Google Scholar]
- [6].Li C, Strachan A, Molecular dynamics predictions of thermal and mechanical properties of thermoset polymer epon862/detda, Polymer 52 (13) (2011) 2920–2928. doi: 10.1016/j.polymer.2011.04.041. [DOI] [Google Scholar]
- [7].Patrone PN, Dienstfrey A, Browning AR, Tucker S, Christensen S, Uncertainty quantification in molecular dynamics studies of the glass transition temperature, Polymer (2016) – doi: 10.1016/j.polymer.2016.01.074. [DOI] [Google Scholar]
- [8].Li C, Strachan A, Evolution of network topology of bifunctional epoxy thermosets during cure and its relationship to thermo-mechanical properties: A molecular dynamics study, Polymer 75 (2015) 151–160. doi: 10.1016/j.polymer.2015.08.037. [DOI] [Google Scholar]
- [9].Hossain D, Tschopp M, Ward D, Bouvard J, Wang P, Horstemeyer M, Molecular dynamics simulations of deformation mechanisms of amorphous polyethylene, Polymer 51 (25) (2010) 6071–6083. doi: 10.1016/j.polymer.2010.10.009. [DOI] [Google Scholar]
- [10].Li C, Coons E, Strachan A, Material property prediction of thermoset polymers by molecular dynamics simulations, Acta Mechanica 225 (4) (2014) 1187–1196. doi: 10.1007/s00707-013-1064-2. URL 10.1007/s00707-013-1064-2 [DOI] [Google Scholar]
- [11].Jaramillo E, Wilson N, Christensen S, Gosse J, Strachan A, Energy-based yield criterion for pmma from large-scale molecular dynamics simulations, Phys. Rev. B 85 (2012) 024114. doi: 10.1103/PhysRevB.85.024114. URL http://link.aps.org/doi/10.1103/PhysRevB.85.024114 [DOI] [Google Scholar]
- [12].QUINSON R, PEREZ J, RINK M, PAVAN A, Yield criteria for amorphous glassy polymers, Journal of Materials Science 32 (5) (1997) 1371–1379. doi: 10.1023/A:1018525127466. [DOI] [Google Scholar]
- [13].Hart-Smith L, Application of the strain invariant failure theory (sift) to metals and fiberpolymer composites, Philosophical Magazine 90 (31–32) (2010) 4263–4331. doi: 10.1080/14786435.2010.510450. [DOI] [Google Scholar]
- [14].Gosse JH, Christensen S, Strain invariant failure criteria for polymers in composite materials, Proceedings of the 19th AIAA Applied Aerodynamics Conference. [Google Scholar]
- [15].Sundararaghavan V, Kumar A, Molecular dynamics simulations of compressive yielding in cross-linked epoxies in the context of argon theory, International Journal of Plasticity 47 (2013) 111–125. doi: 10.1016/j.ijplas.2013.01.004. [DOI] [Google Scholar]
- [16].Dienstfrey A, Phelan FR, Christensen S, Strachan A, Santosa F, Boisvert R, Uncertainty quantification in materials modeling, JOM 66 (7) (2014) 1342–1344. doi: 10.1007/s11837-014-1049-1. [DOI] [Google Scholar]
- [17].David L, Quinson R, Gauthier C, Perez J, The role of anelasticity in high stress mechanical response and physical properties of glassy polymers, Polymer Engineering & Science 37 (10) (1997) 1633–1640. doi: 10.1002/pen.11811. URL 10.1002/pen.11811 [DOI] [Google Scholar]
- [18].Perez J, Physics and Mechanics of Amorphous Polymers, Taylor & Francis, 1998. [Google Scholar]
- [19].Stachurski Z, Deformation mechanisms and yield strength in amorphous polymers, Progress in Polymer Science 22 (3) (1997) 407–474. doi: 10.1016/S0079-6700(96)00024-X. [DOI] [Google Scholar]
- [20].Heinz S, Tu J, Jackson M, Wiggins J, Digital image correlation analysis of strain recovery in glassy polymer network isomers, Polymer 82 (2016) 87–92. doi: 10.1016/j.polymer.2015.11.026. [DOI] [Google Scholar]
- [21].Parrinello M, Rahman A, Polymorphic transitions in single crystals: A new molecular dynamics method, Journal of Applied Physics 52 (12). [Google Scholar]
- [22].Andersen HC, Molecular dynamics simulations at constant pressure and/or temperature, The Journal of Chemical Physics 72 (4). [Google Scholar]
- [23].Pegoretti A, Pandini S, Ricco T, Strain recovery of post-yield compressed semicrystalline poly(butylene terephthalate), Polymer 47 (16) (2006) 5862–5870. doi: 10.1016/j.polymer.2006.06.020. [DOI] [Google Scholar]
- [24].Ward I, Sweeney J, An Introduction to the Mechanical Properties of Solid Polymers, Wiley, 2004. [Google Scholar]
- [25].Shirts MR, Simple quantitative tests to validate sampling from thermodynamic ensembles, Journal of Chemical Theory and Computation 9 (2) (2013) 909–926, pMID: 26588735. doi: 10.1021/ct300688p. [DOI] [PubMed] [Google Scholar]