Abstract
Molecular simulations intended to compute equilibrium properties are often initiated from configurations that are highly atypical of equilibrium samples, a practice which can generate a distinct initial transient in mechanical observables computed from the simulation trajectory. Traditional practice in simulation data analysis recommends this initial portion be discarded to equilibration, but no simple, general, and automated procedure for this process exists. Here, we suggest a conceptually simple automated procedure that does not make strict assumptions about the distribution of the observable of interest, in which the equilibration time is chosen to maximize the number of effectively uncorrelated samples in the production timespan used to compute equilibrium averages. We present a simple Python reference implementation of this procedure, and demonstrate its utility on typical molecular simulation data.
Keywords: molecular dynamics (MD), Metropolis-Hastings, Monte Carlo (MC), Markov chain Monte Carlo (MCMC), equilibration, burn-in, timeseries analysis, statistical inefficiency, integrated autocorrelation time
INTRODUCTION
Molecular simulations use Markov chain Monte Carlo (MCMC) techniques [1] to sample configurations x from an equilibrium distribution π(x), either exactly (using Monte Carlo methods such as Metropolis-Hastings) or approximately (using molecular dynamics integrators without Metropolization) [2].
Due to the sensitivity of the equilibrium probability density π(x) to small perturbations in configuration x and the difficulty of producing sufficiently good guesses of typical equilibrium configurations x ~ π(x), these molecular simulations are often started from highly atypical initial conditions. For example, simulations of biopolymers might be initiated from a fully extended conformation unrepresentative of behavior in solution, or from a geometry derived from a fit to diffraction data collected from a cryocooled crystal; solvated systems may be prepared by periodically replicating a small solvent box equilibrated under different conditions, yielding atypical densities and solvent structure; liquid mixtures or lipid bilayers may be constructed using methods that fulfill spatial constraints (e.g. PackMol [3]) but create locally atypical geometries, requiring long simulation times to relax to typical configurations.
As a result, traditional practice in molecular simulation has recommended that some initial portion of the trajectory be discarded to equilibration (also called burn-in¹ in the MCMC literature [4]). While discarding initial samples is strictly unnecessary for the time average of quantities of interest to eventually converge to the desired expectations [5], it often spares the practitioner the impractically long run times that would otherwise be needed for finite-length simulations to overcome the bias induced by atypical initial starting conditions. It is worth noting that a similar procedure is not universally recommended by statisticians when sampling from posterior distributions in statistical inference [4]; the differences in complexity between the probability densities typically encountered in statistics and in molecular simulation may explain the difference in historical practice.
As a motivating example, consider the computation of the average density of liquid argon under a given set of reduced temperature and pressure conditions shown in Figure 1. To initiate the simulation, an initial dense liquid geometry at reduced density ρ* ≡ ρσ³ = 0.960 was prepared and subjected to local energy minimization. The upper panel of Figure 1 depicts the average relaxation behavior of simulations initiated from the same configuration with different random initial velocities and integrator random number seeds (see Simulation Details). The average of 500 realizations of this process shows a characteristic relaxation away from the initial density toward the equilibrium density (Figure 1, upper panel, black line). As a result, the expectation of the running average of the density deviates significantly from the true expectation (Figure 1, lower panel, dashed line). This effect leads to significantly biased estimates of the expectation unless simulations are long enough to eliminate starting-point-dependent bias, which requires a surprisingly long ~2000 τ in this example. Note that this bias is present even in the average of many realizations because the same atypical starting condition is used for every realization of this simulation process.
FIG. 1. Illustration of the motivation for discarding data to equilibration.
To illustrate the bias in expectations induced by relaxation away from initial conditions, 500 replicates of a simulation of liquid argon were initiated from the same energy-minimized initial configuration constructed with initial reduced density ρ* ≡ ρσ3 = 0.960 but different random number seeds for stochastic integration. Top: The average of the reduced density (black line) over the replicates relaxes to the region of typical equilibrium densities over the first ~ 100 τ of simulation time, where τ is a natural time unit (see Simulation Details). Bottom: If the average density is estimated by a cumulative average from the beginning of the simulation (red dotted line), the estimate will be heavily biased by the atypical starting density even beyond 1000 τ. Discarding even a small amount of initial data—in this case 500 initial samples—results in a cumulative average estimate that converges to the true average (black dashed line) much more rapidly. Shaded regions denote 95% confidence intervals.
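The effect in Figure 1 can be reproduced qualitatively with purely synthetic data. The following minimal sketch (illustrative only: the relaxation timescale, amplitudes, and noise level are arbitrary and are not the argon parameters) contrasts a cumulative average computed from the full timeseries with one computed after discarding an initial segment:

    import numpy as np

    rng = np.random.default_rng(1)
    T = 2000
    t = np.arange(T)

    # Synthetic stand-in for a relaxing observable: exponential decay from an
    # atypical initial value toward an equilibrium value of 0.80, plus noise.
    # (Relaxation time, amplitudes, and noise level are arbitrary.)
    a_t = 0.80 + 0.16 * np.exp(-t / 100.0) + 0.01 * rng.standard_normal(T)

    # Cumulative average using all data versus discarding the first t0 samples.
    cumavg_all = np.cumsum(a_t) / np.arange(1, T + 1)
    t0 = 500
    cumavg_cut = np.cumsum(a_t[t0:]) / np.arange(1, T - t0 + 1)

    print(f"final running average, no discard : {cumavg_all[-1]:.4f}")
    print(f"final running average, t0 = {t0}  : {cumavg_cut[-1]:.4f}")
    print("equilibrium value used to generate the data: 0.8000")

The estimate that discards the initial transient lands much closer to the generating equilibrium value, mirroring the behavior of the lower panel of Figure 1.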
To develop an automatic approach to eliminating this bias, we take motivation from the concept of reverse cumulative averaging from Yang et al. [6], in which the trajectory statistics over the production region of the trajectory are examined for different choices of the end of the discarded equilibration region to determine the optimal production region to use for computing expectations and other statistical properties. We begin by first formalizing our objectives mathematically.
Consider T successively sampled configurations xt from a molecular simulation, with t = 1, … , T, initiated from x0. We presume we are interested in computing the expectation
(1)   \langle A \rangle \equiv \int dx \, \pi(x) \, A(x)
of a mechanical property of interest A(x). For convenience, we will refer to the timeseries at ≡ A(xt), with t ∈ [1, T]. The estimator constructed from the entire dataset is given by
(2)   \hat{A}_{[1,T]} \equiv \frac{1}{T} \sum_{t=1}^{T} a_t
While Â_[1,T] → 〈A〉 in the limit of an infinitely long simulation², the bias in Â_[1,T] may be significant in a simulation of finite length T.
By discarding samples t < t0 to equilibration, we hope to exclude the initial transient from our sample average, and provide a less biased estimate of 〈A〉,
(3)   \hat{A}_{[t_0,T]} \equiv \frac{1}{T - t_0 + 1} \sum_{t=t_0}^{T} a_t
We can quantify the overall error in the estimator Â_[t0,T], computed as a sample average over trajectories initiated from x0 that excludes samples with t < t0, by the expected squared error δ²Â_[t0,T],
(4)   \delta^2 \hat{A}_{[t_0,T]} \equiv E_{x_0}\left[ \left( \hat{A}_{[t_0,T]} - \langle A \rangle \right)^2 \right]
where E_x0[·] denotes the expectation over independent realizations of the specific simulation process initiated from configuration x0, but with different velocities and random number seeds.
We can rewrite the expected error by separating it into two components
(5)   \delta^2 \hat{A}_{[t_0,T]} = \underbrace{E_{x_0}\left[ \left( \hat{A}_{[t_0,T]} - E_{x_0}\left[\hat{A}_{[t_0,T]}\right] \right)^2 \right]}_{\text{variance}} + \underbrace{\left( E_{x_0}\left[\hat{A}_{[t_0,T]}\right] - \langle A \rangle \right)^2}_{\text{bias}^2}
The first term denotes the variance in the estimator Â_[t0,T],
(6)   \mathrm{var}\left(\hat{A}_{[t_0,T]}\right) \equiv E_{x_0}\left[ \left( \hat{A}_{[t_0,T]} - E_{x_0}\left[\hat{A}_{[t_0,T]}\right] \right)^2 \right]
while the second term denotes the contribution from the squared bias,
(7)   \mathrm{bias}^2\left(\hat{A}_{[t_0,T]}\right) \equiv \left( E_{x_0}\left[\hat{A}_{[t_0,T]}\right] - \langle A \rangle \right)^2
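As a worked illustration of Eqs. 5–7, the sketch below (a hypothetical helper, assuming a set of replicate estimates and a reference value for the true expectation are available) decomposes the expected squared error into its variance and squared-bias contributions:

    import numpy as np

    def error_decomposition(estimates, true_value):
        """Decompose the expected squared error of replicate estimates of <A>
        into variance (Eq. 6) and squared bias (Eq. 7), which sum to Eq. 5."""
        estimates = np.asarray(estimates, dtype=float)
        mean_estimate = estimates.mean()
        variance = np.mean((estimates - mean_estimate) ** 2)
        bias_squared = (mean_estimate - true_value) ** 2
        return variance, bias_squared, variance + bias_squared

    # Toy usage with made-up replicate estimates and a made-up true value.
    var, bias2, total = error_decomposition([0.812, 0.806, 0.797, 0.815], true_value=0.800)
    print(f"variance = {var:.2e}, bias^2 = {bias2:.2e}, expected squared error = {total:.2e}")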
BIAS-VARIANCE TRADEOFF
With increasing equilibration time t0, bias is reduced, but the variance—the contribution to error due to random variation from having a finite number of uncorrelated samples—will increase because less data is included in the estimate. This can be seen in the bottom panel of Figure 2, where the shaded region (95% confidence interval of the mean) increases in width with increasing equilibration time t0.
FIG. 2. Statistical inefficiency, number of uncorrelated samples, and bias for different equilibration times.
Trajectories of length T = 2000 τ for the argon system described in Figure 1 were analyzed as a function of equilibration time choice t0. Averages over all 500 replicate simulations (all starting from the same initial conditions) are shown as dark lines, with shaded regions showing the standard deviation of estimates among replicates. Top: The statistical inefficiency g as a function of equilibration time choice t0 is initially very large, but diminishes rapidly after the system has relaxed to equilibrium. Middle: The number of effectively uncorrelated samples Neff = (T − t0 + 1)/g shows a maximum at t0 ~ 100 τ (red vertical lines), suggesting the system has equilibrated by this time. Bottom: The cumulative average density 〈ρ*〉 computed over the span [t0, T] shows that the bias (deviation from the true estimate, shown as red dashed lines) is minimized for choices of t0 ≥ 100 τ. The standard deviation among replicates (shaded region) grows with t0 because fewer data are included in the estimate. The choice of optimal t0 that maximizes Neff (red vertical line) strikes a good balance between bias and variance. The true estimate (red dashed lines) is computed by averaging over the range [5000, 10000] τ over all 500 replicates.
To examine the tradeoff between bias and variance explicitly, Figure 3 plots the bias and variance (here, shown as the standard deviation over replicates—the square root of the variance—which is an indication of the true standard error of a single simulation) contributions against each other as a function of t0 (denoted by color) as computed from statistics over all 500 replicates. At t0 = 0, the bias is large but variance is minimized. With increasing t0, bias is eventually eliminated but then variance rapidly grows as fewer uncorrelated samples are included in the estimate. There is a clear optimal choice at t0 ~ 100 τ that minimizes variance while also effectively eliminating bias (where τ is a natural time unit—see Simulation Details).
FIG. 3. Bias-variance tradeoff for fixed equilibration time versus automatic equilibration time selection.
Trajectories of length T = 2000 τ for the argon system described in Figure 1 were analyzed as a function of equilibration time choice t0, with colors denoting the value of t0 (in units of τ) corresponding to each plotted point. Using 500 replicate simulations, the average bias (average deviation from the true expectation) and standard deviation (random variation from replicate to replicate) were computed as a function of a prespecified fixed equilibration time t0, with colors running from violet (0 τ) to red (1800 τ). As is readily discerned, the bias for small t0 is initially large, but is minimized for larger t0. By contrast, the standard error (a measure of variance, estimated here by the standard deviation among replicates) grows as t0 grows above a certain critical time (here, ~100 τ). If the t0 that maximizes Neff is instead chosen individually for each trajectory based on that trajectory’s estimate of the statistical inefficiency g_[t0,T], the resulting bias-variance tradeoff (black triangle) does an excellent job of minimizing bias and variance simultaneously, comparable to what is possible for a choice of equilibration time t0 based on knowledge of the true bias and variance among many replicate estimates.
SELECTING THE EQUILIBRATION TIME
Is there a simple approach to choosing an optimal equilibration time t0 that provides a significantly improved estimate Â_[t0,T], even when we do not have access to multiple realizations? At worst, we hope that such a procedure would at least give some improvement over the naive estimate Â_[1,T], such that δ²Â_[t0,T] ≤ δ²Â_[1,T]; at best, we hope that we can achieve a reasonable bias-variance tradeoff close to the optimal point identified in Figure 3 that minimizes bias without greatly increasing variance. We remark that, for cases in which the simulation is not long enough to reach equilibrium, no choice of t0 will eliminate bias completely; the best we can hope for is to minimize this bias.
While automated methods for selecting the equilibration time t0 have been proposed, these approaches have shortcomings that have greatly limited their use. The reverse cumulative averaging (RCA) method proposed by Yang et al. [6], for example, uses a statistical test for normality to determine the point before which the observable timeseries deviates from normality when examining the timeseries in reverse. While this concept may be reasonable for experimental data, where measurements often represent the sum of many random variables such that the central limit theorem’s guarantee of asymptotic normality ensures the distribution of the observable will be approximately normal, there is no such guarantee that instantaneous measurements of a simulation property of interest will be normally distributed. In fact, many properties will be decidedly non-normal. For a biomolecule such as a protein, for example, the radius of gyration, end-to-end distance, and torsion angles sampled during a simulation will all be highly non-normal. Instead, we require a method that makes no assumptions about the nature of the distribution of the property under study.
AUTOCORRELATION ANALYSIS
The set of successively sampled configurations {xt} and their corresponding observables {at} compose a correlated timeseries of observations. To estimate the statistical error or uncertainty in a stationary timeseries free of bias, we must be able to quantify the effective number of uncorrelated samples present in the dataset. This is usually accomplished through computation of the statistical inefficiency g, which quantifies the number of correlated timeseries samples needed to produce a single effectively uncorrelated sample of the observable of interest. While these concepts are well-established for the analysis of both Monte Carlo and molecular dynamics simulations [7–10], we review them here for the sake of clarity.
For a given equilibration time choice t0, the statistical uncertainty in our estimator Â_[t0,T] can be written as
(8)   \mathrm{var}\left(\hat{A}_{[t_0,T]}\right) = \frac{1}{T_{t_0}^2} \sum_{t,t'=t_0}^{T} E_{x_0}\left[ \delta a_t \, \delta a_{t'} \right] = \frac{1}{T_{t_0}^2} \left[ \sum_{t=t_0}^{T} E_{x_0}\left[ \delta a_t^2 \right] + 2 \sum_{t=t_0}^{T-1} \sum_{t'=t+1}^{T} E_{x_0}\left[ \delta a_t \, \delta a_{t'} \right] \right]
where T_t0 ≡ T − t0 + 1 is the number of correlated samples in the timeseries {a_t : t = t0, …, T}, and δa_t ≡ a_t − E_x0[a_t] denotes the deviation of a_t from its mean. In the last step, we have split the double sum into two separate sums: a term capturing the variance in the observations a_t, and a remaining term capturing the correlation between observations.
If t0 is sufficiently large for the initial bias to be eliminated, the remaining timeseries will obey the properties of both stationarity and time-reversibility, allowing us to write
(9)   \mathrm{var}\left(\hat{A}_{[t_0,T]}\right) = \frac{\sigma_{t_0}^2 \, g_{t_0}}{T_{t_0}}
where the variance σ² and statistical inefficiency g (in units of the sampling interval τ) are given by

(10)   \sigma_{t_0}^2 \equiv \langle \delta a_t^2 \rangle = \langle a_t^2 \rangle - \langle a_t \rangle^2

(11)   g_{t_0} \equiv 1 + 2 \tau_{\mathrm{ac},t_0}

with the integrated autocorrelation time τac given by

(12)   \tau_{\mathrm{ac},t_0} \equiv \sum_{t=1}^{T_{t_0}-1} \left( 1 - \frac{t}{T_{t_0}} \right) C_t
with the discrete-time normalized fluctuation autocorrelation function Ct defined as
(13)   C_t \equiv \frac{\langle \delta a_n \, \delta a_{n+t} \rangle}{\langle \delta a_n^2 \rangle} = \frac{\langle a_n a_{n+t} \rangle - \langle a_n \rangle^2}{\langle a_n^2 \rangle - \langle a_n \rangle^2}
In practice, it is difficult to estimate Ct for t ~ T , due to growth in the statistical error, so common estimators of g make use of several additional properties of Ct to provide useful estimates (see Practical Computation of Statistical Inefficiencies).
The t0 subscript on the variance σ², the integrated autocorrelation time τac, and the statistical inefficiency g indicates that these quantities are estimated over only the production portion of the timeseries, {a_t : t = t0, …, T}. Since we assumed that the bias was eliminated by judicious choice of the equilibration time t0, this estimate of the statistical error will be poor for choices of t0 that are too small.
THE ESSENTIAL IDEA
Suppose we choose some arbitrary time t0 and discard all samples t ∈ [0, t0) to equilibration, keeping [t0, T] as the dataset to analyze. How much data remains? We can determine this by computing the statistical inefficiency g_t0 for the interval [t0, T], and computing the effective number of uncorrelated samples Neff(t0) ≡ (T − t0 + 1)/g_t0. If we start at t0 ≡ T and move t0 to earlier and earlier points in time, we expect that the effective number of uncorrelated samples Neff(t0) will continue to grow until we start to include the highly atypical initial data. At that point, the integrated autocorrelation time τac (and hence the statistical inefficiency g) will greatly increase (a phenomenon observed earlier, e.g. Figure 2 of [6]). As a result, the effective number of samples Neff will start to plummet.
Figure 2 demonstrates this behavior for the liquid argon system described above, using averages of the statistical inefficiency g_t0 and Neff(t0) computed over 500 independent replicate trajectories. At short t0, the average statistical inefficiency g (Figure 2, top panel) is large due to the contribution from slow relaxation from atypical initial conditions, while at long t0 the statistical inefficiency estimate is much smaller and nearly constant over a large span of time origins. As a result, the average effective number of uncorrelated samples Neff (Figure 2, middle panel) has a peak at t0 ~ 100 τ (Figure 2, vertical red lines). The effect on bias in the estimated average reduced density 〈ρ*〉 (Figure 2, bottom panel) is striking—the bias is essentially eliminated for the choice of equilibration time t0 that maximizes the number of uncorrelated samples Neff.
This suggests an alluringly simple algorithm for identifying the optimal equilibration time—pick the t0 which maximizes the number of uncorrelated samples Neff in the timeseries for the quantity of interest A(x):
(14)   t_0^{\mathrm{opt}} \equiv \underset{t_0}{\arg\max} \; N_{\mathrm{eff}}(t_0) = \underset{t_0}{\arg\max} \; \frac{T - t_0 + 1}{g_{t_0}}
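A direct, if brute-force, sketch of Eq. 14 follows; it assumes the pymbar 3.x function timeseries.statisticalInefficiency for estimating g over each candidate production region, and the helper name detect_equilibration_by_Neff is ours rather than part of any package:

    import numpy as np
    from pymbar import timeseries  # pymbar 3.x naming assumed

    def detect_equilibration_by_Neff(a_t, step=1):
        """Return (t0, g, Neff) maximizing Neff(t0) = (T - t0) / g_[t0,T] (Eq. 14).

        a_t  : 1-D array containing the observable timeseries A(x_t).
        step : stride over candidate t0 values (coarser = cheaper).
        Note: with 0-based indexing, the production region a_t[t0:] has T - t0 samples.
        """
        a_t = np.asarray(a_t, dtype=float)
        T = len(a_t)
        best_t0, best_g, best_Neff = 0, 1.0, 0.0
        for t0 in range(0, T - 10, step):  # require a handful of production samples
            g = timeseries.statisticalInefficiency(a_t[t0:])
            Neff = (T - t0) / g
            if Neff > best_Neff:
                best_t0, best_g, best_Neff = t0, g, Neff
        return best_t0, best_g, best_Neff

Scanning every candidate t0 is O(T) statistical inefficiency evaluations; in practice a coarse stride (step > 1) is usually sufficient, since Neff(t0) varies slowly once the initial transient has been excluded.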
Bias-variance tradeoff
How will the simple strategy of selecting the equilibration time t0 using Eq. 14 work for cases where we do not know the statistical inefficiency g as a function of the equilibration time t0 precisely? When all that is available is a single simulation, our best estimate of g_t0 is derived from that simulation alone over the span [t0, T]—will this degrade the quality of our estimate of equilibration time? Empirically, this does not appear to be the case: the black triangle in Figure 3 shows the bias and variance contributions to the error for estimates computed over the 500 replicates, where t0 is determined individually for each realization by selecting the value that maximizes Neff. Despite not having knowledge of multiple realizations, this strategy effectively achieves a near-optimal balance, minimizing bias without appreciably increasing variance.
Overall RMS error
How well does this strategy perform in reducing the overall error, compared with using the full-timeseries estimate Â_[1,T]? Figure 4 compares the expected standard error (the square root of the expected squared error of Eq. 4) as a function of a fixed initial equilibration time t0 (black line, with shaded region denoting the 95% confidence interval) with the strategy of selecting t0 to maximize Neff for each realization (red line, with shaded region denoting the 95% confidence interval). While the minimum error for the fixed-t0 strategy (0.00152±0.00005) is achieved at ~100 τ—a fact that could only be determined from knowledge of multiple realizations—the simple strategy of selecting t0 using Eq. 14 achieves a minimum error of 0.00173±0.00005, only 11% worse (compared with errors of 0.00441±0.00007, or 290% worse, should no data have been discarded).
FIG. 4. RMS error for fixed equilibration time versus automatic equilibration time selection.
Trajectories of length T = 2000 τ for the argon system described in Figure 1 were analyzed as a function of fixed equilibration time choice t0. Using 500 replicate simulations, the root-mean-squared (RMS) error (Eq. 4) was computed (black line) along with its 95% confidence interval (gray shading). The RMS error is minimized for fixed equilibration time choices in the range 100–200 τ. If the t0 that maximizes Neff is instead chosen individually for each trajectory based on that trajectory's estimated statistical inefficiency g_[t0,T] using Eq. 14, the resulting RMS error (red line, 95% confidence interval shown as red shading) is quite close to the minimum RMS error achieved from any particular fixed choice of equilibration time t0, suggesting that this simple automated approach to selecting t0 achieves close to optimal performance.
DISCUSSION
The scheme described here—in which the equilibration time t0 is computed using Eq. 14 as the choice that maximizes the number of uncorrelated samples in the production region [t0, T]—is both conceptually and computationally straightforward. It provides an approach to determining the optimal amount of initial data to discard to equilibration in order to minimize variance while also minimizing initial bias, and does this without employing statistical tests that require generally unsatisfiable assumptions of normality of the observable of interest. All that is needed is to save the timeseries of the observable A(x) of interest—there is no need to store full configurations xt—and post-process this dataset with a simple analysis procedure, for which we have provided a convenient Python reference implementation (see Simulation Details). As we have seen, this scheme empirically appears to select a practical compromise between bias and variance even when the statistical inefficiency g is estimated directly from the trajectory using Eq. 11.
To show that this approach is indeed general, we repeated the analysis illustrated above in Figs. 1–4 for a different choice of observable A(x) for the same liquid argon system—in this case, the reduced potential energy³ u*(x) ≡ βU(x). The results of this analysis are collected in Fig. 5. As can readily be seen, the reduced potential behaves in essentially the same way as the reduced density, and the simple scheme for automated determination of the equilibration time t0 from Eq. 14 performs just as well.
FIG. 5. Corresponding analysis for reduced potential energy of liquid argon system.
The analyses of Figs. 1–4 were repeated for the reduced potential energy u*(x) ≡ βU(x) of the liquid argon system. As with the analysis of the reduced density, the simple automated determination of the equilibration time t0 from Eq. 14 works equivalently well for the reduced potential. Shaded regions denote 95% confidence intervals.
A word of caution is necessary. One can certainly envision pathological scenarios where this algorithm for selecting an optimal equilibration time will break down. In cases where the simulation is not long enough to reach equilibrium—let alone collect many uncorrelated samples from it—no choice of equilibration time will bestow upon the experimenter the ability to produce an unbiased estimate of the true expectation. Similarly, in cases where insufficient data is available for the statistical inefficiency to be estimated well, this algorithm is expected to perform poorly. However, in these cases, the data itself should be suspect if the trajectory is not at least an order of magnitude longer than the minimum estimated autocorrelation time.
SIMULATION DETAILS
All molecular dynamics simulations described here were performed with OpenMM 6.3 [12] (available at openmm.org) using the Python API. All scripts used to retrieve the software versions, run the simulations, analyze the data, and generate the figures—along with the simulation data itself—are available on GitHub⁴.
To model liquid argon, the LennardJonesFluid model system in the openmmtools package⁵ was used with parameters appropriate for liquid argon (σ = 3.4 Å, ϵ = 0.238 kcal/mol). All results are reported in reduced (dimensionless) units. Initial dense liquid geometries were generated via a Sobol' subrandom sequence [13], as generated by the subrandom_particle_positions method in openmmtools. A cubic switching function was employed, with the potential gently switched to zero over r ∈ [σ, 3σ], and a long-range isotropic dispersion correction accounting for this switching behavior was used to include the neglected contributions. Simulations were performed using a periodic box of N = 500 atoms at fixed reduced temperature and reduced pressure p* ≡ pσ³/ϵ = 1.266 using a Langevin integrator [14] with timestep Δt = 0.01 τ and collision rate ν = τ⁻¹, where τ is the characteristic oscillation timescale of the Lennard-Jones pair potential about its minimum at r0 = 2^(1/6) σ [15]. All times are reported in multiples of this characteristic timescale τ. A molecular-scaling Metropolis Monte Carlo barostat was used, with Gaussian simulation volume change proposal moves attempted every τ (100 timesteps) and an adaptive algorithm that adjusts the proposal width during the initial part of the simulation [12]. Densities were recorded every τ (100 timesteps). The true expectation 〈ρ*〉 was estimated from the sample average over all 500 realizations over [5000, 10000] τ.
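For orientation, a heavily simplified and hypothetical sketch of this kind of setup is shown below. The openmmtools.testsystems.LennardJonesFluid constructor arguments and unit choices are assumptions that may differ between versions, and the standard isotropic MonteCarloBarostat is substituted for the molecular-scaling Gaussian-proposal barostat described above, so this is not a faithful reproduction of the protocol used here:

    # Simplified sketch only; constructor arguments and unit choices are assumptions.
    from openmmtools import testsystems
    from simtk import openmm, unit  # OpenMM 6.3-era import path; newer versions use `import openmm`

    # Lennard-Jones fluid with argon-like parameters (N, sigma, epsilon as in the text).
    sigma = 3.4 * unit.angstrom
    fluid = testsystems.LennardJonesFluid(nparticles=500,
                                          sigma=sigma,
                                          epsilon=0.238 * unit.kilocalories_per_mole)
    system, positions = fluid.system, fluid.positions

    # Placeholder absolute temperature and pressure; the paper works in reduced units.
    temperature = 120.0 * unit.kelvin
    pressure = 100.0 * unit.atmospheres

    # NOTE: standard isotropic barostat, not the molecular-scaling Gaussian-proposal
    # barostat described in the text; volume moves attempted every 100 steps.
    system.addForce(openmm.MonteCarloBarostat(pressure, temperature, 100))

    integrator = openmm.LangevinIntegrator(temperature, 1.0 / unit.picosecond,
                                           2.0 * unit.femtoseconds)
    context = openmm.Context(system, integrator)
    context.setPositions(positions)
    openmm.LocalEnergyMinimizer.minimize(context)

    densities = []
    for _ in range(2000):
        integrator.step(100)  # one recording interval
        volume = context.getState().getPeriodicBoxVolume()
        rho_star = 500 * (sigma ** 3).value_in_unit(unit.nanometer ** 3) \
                   / volume.value_in_unit(unit.nanometer ** 3)  # rho* = N sigma^3 / V
        densities.append(rho_star)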
The automated equilibration detection scheme is also available in the timeseries module of the pymbar package as detectEquilibration(), and can be accessed using the following code:
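A minimal usage sketch (pymbar 3.x naming; observable.dat is a placeholder filename) is:

    import numpy as np
    from pymbar import timeseries

    # Load the timeseries of the observable A(x_t); 'observable.dat' is a placeholder.
    A_t = np.loadtxt('observable.dat')

    # detectEquilibration returns the equilibration time t0, the statistical
    # inefficiency g of the production region [t0, T], and Neff_max = (T - t0)/g.
    t0, g, Neff_max = timeseries.detectEquilibration(A_t)

    A_prod = A_t[t0:]                                          # discard samples before t0
    indices = timeseries.subsampleCorrelatedData(A_prod, g=g)  # indices of uncorrelated samples
    A_uncorrelated = A_prod[indices]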
PRACTICAL COMPUTATION OF STATISTICAL INEFFICIENCIES
The robust computation of the statistical inefficiency g (defined by Eq. 11) for a finite timeseries at, t = 0, … , T deserves some comment. There are, in fact, a variety of schemes for estimating g described in the literature, and their behaviors for finite datasets may differ, leading to different estimates of the equilibration time t0 using the algorithm of Eq. 14.
The main issue is that a straightforward approach to estimating the statistical inefficiency using Eqs. 12–13 in which the expectations are simply replaced with sample estimates causes the statistical error in the estimated correlation function Ct to grow with t in a manner that allows this error to quickly overwhelm the sum of Eq. 12. As a result, a number of alternative schemes—generally based on controlling the error in the estimated Ct or truncating the sum of Eq. 12 when the error grows too large—have been proposed.
For stationary, irreducible, reversible Markov chains, Geyer observed that the function Γ_k ≡ γ_{2k} + γ_{2k+1} of the unnormalized fluctuation autocorrelation function γ_t ≡ 〈a_i a_{i+t}〉 − 〈a_i〉² has a number of pleasant properties (Theorem 3.1 of [16]): it is strictly positive, strictly decreasing, and strictly convex. Some or all of these properties can be exploited to define a family of estimators called initial sequence methods (see Section 3.3 of [16] and Section 1.10.2 of [4]), of which the initial convex sequence (ICS) estimator is generally agreed to be optimal, if somewhat more complex to implement.⁶
All computations in this manuscript used the fast multiscale method described in Section 5.2 of [10], which we found performed equivalently well to the Geyer estimators (data not shown). This method is related to a multiscale variant of the initial positive sequence (IPS) method of Geyer [17], where contributions are accumulated at increasingly longer lag times and the sum of Eq. 12 is truncated when the terms become negative. We have found this method to be both fast and to provide useful estimates of the statistical inefficiency, but it may not perform well for all problems.
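For concreteness, the sketch below implements a simplified estimator in this spirit (accumulating the sum of Eq. 12 term by term and truncating when the estimated Ct first becomes non-positive); it is a stand-in for, not a reproduction of, the exact multiscale method of Ref. [10]:

    import numpy as np

    def statistical_inefficiency_truncated(a_t):
        """Estimate g = 1 + 2*tau_ac (Eqs. 11-13), truncating the sum over C_t
        when the estimated correlation function first becomes non-positive.
        A simplified stand-in for the initial-positive-sequence / multiscale
        estimators discussed in the text."""
        a_t = np.asarray(a_t, dtype=float)
        T = len(a_t)
        da = a_t - a_t.mean()
        var = np.mean(da * da)
        if var == 0.0:
            return 1.0
        g = 1.0
        for t in range(1, T - 1):
            C_t = np.mean(da[:T - t] * da[t:]) / var  # Eq. 13 (sample estimate)
            if C_t <= 0.0:
                break                                 # truncate the sum
            g += 2.0 * C_t * (1.0 - t / T)            # Eq. 12 with the (1 - t/T) weight
        return max(g, 1.0)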
ACKNOWLEDGMENTS
We are grateful to William C. Swope (IBM Almaden Research Center) for his illuminating introduction to the use of autocorrelation analysis for the characterization of statistical error, as well as Michael R. Shirts (University of Virginia), David L. Mobley (University of California, Irvine), Michael K. Gilson (University of California, San Diego), Kyle A. Beauchamp (MSKCC), and Robert C. McGibbon (Stanford University) for valuable discussions on this topic, and Joshua L. Adelman (University of Pittsburgh) for helpful feedback and encouragement. We are grateful to Michael K. Gilson (University of California, San Diego), Wei Yang (Florida State University), Sabine Reißer (SISSA, Italy), and the anonymous referees for critical feedback on the manuscript itself. JDC acknowledges a Louis V. Gerstner Young Investigator Award, NIH core grant P30-CA008748, and the Sloan Kettering Institute for funding during the course of this work.
Footnotes
1. The term burn-in comes from the field of electronics, in which a short “burn-in” period is used to ensure that a device is free of faulty components—which often fail quickly—and is operating normally [4].
2. We note that this equality only holds for simulation schemes that sample from the true equilibrium density π(x), such as Metropolis-Hastings Monte Carlo or Metropolized dynamical integration schemes such as hybrid Monte Carlo (HMC). Molecular dynamics simulations utilizing finite-timestep integration without Metropolization will produce averages that may deviate from the true expectation 〈A〉 [2].
3. Note that the reduced potential [11] for the isothermal-isobaric ensemble is generally defined as u*(x) = β[U(x) + pV(x)] to include the pressure-volume term βpV(x), but in order to demonstrate the performance of this analysis on an observable distinct from the density—which depends on V(x)—we omit the βpV(x) term in the present analysis.
4. All Python scripts necessary to reproduce this work—along with data plotted in the published version—are available at: http://github.com/choderalab/automatic-equilibration-detection
5. Available at http://github.com/choderalab/openmmtools
6. Implementations of these methods are provided with the code distributed with this manuscript.
References
- [1]. Liu JS. Monte Carlo Strategies in Scientific Computing. 2nd ed. Springer-Verlag, New York, 2002.
- [2]. Sivak D, Chodera J, Crooks G. Phys. Rev. X. 2013;3:011007.
- [3]. Martínez L, Andrade R, Birgin EG, Martínez JM. J. Comput. Chem. 2009;30:2157. doi: 10.1002/jcc.21224.
- [4]. Brooks S, Gelman A, Jones GL, Meng X-L. Handbook of Markov Chain Monte Carlo. CRC Press; 2011. Chap. 1.
- [5]. Geyer C. Burn-in is unnecessary. http://users.stat.umn.edu/~geyer/mcmc/burn.html.
- [6]. Yang W, Bitetti-Putzer R, Karplus M. J. Chem. Phys. 2004;120:2618. doi: 10.1063/1.1638996.
- [7]. Müller-Krumbhaar H, Binder K. J. Stat. Phys. 1973;8:1.
- [8]. Swope WC, Andersen HC, Berens PH, Wilson KR. J. Chem. Phys. 1982;76:637.
- [9]. Janke W. In: Quantum Simulations of Complex Many-Body Systems: From Theory to Algorithms. Grotendorst J, Marx D, Muramatsu A, editors. Vol. 10. John von Neumann Institute for Computing; 2002. pp. 423–445.
- [10]. Chodera JD, Swope WC, Pitera JW, Seok C, Dill KA. J. Chem. Theor. Comput. 2007;3:26. doi: 10.1021/ct0502864.
- [11]. Shirts MR, Chodera JD. J. Chem. Phys. 2008. In press.
- [12]. Eastman P, Friedrichs M, Chodera JD, Radmer R, Bruns C, Ku J, Beauchamp K, Lane TJ, Wang L-P, Shukla D, Tye T, Houston M, Stich T, Klein C. J. Chem. Theor. Comput. 2012;9:461. doi: 10.1021/ct300857j.
- [13]. Sobol IM. USSR Comput. Maths. Math. Phys. 1967;7:86.
- [14]. Sivak DA, Chodera JD, Crooks GE. J. Phys. Chem. B. 2014;118:6466. doi: 10.1021/jp411770f.
- [15]. Veytsman B, Kotelyanskii M. Lennard-Jones potential revisited. http://borisv.lk.net/matsc597c-1997/simulations/Lecture5/node3.html.
- [16]. Geyer CJ. Stat. Sci. 1992;7:473.
- [17]. Geyer CJ, Thompson EA. J. Royal Stat. Soc. B. 1992;54:657.