Abstract
Abstract
In this work, we elaborate on two recently discovered invariance principles, according to which transport coefficients are, to a large extent, independent of the microscopic definition of the densities and currents of the conserved quantities being transported (energy, momentum, mass, charge). The first such principle, gauge invariance, allows one to define a quantum adiabatic energy current from density-functional theory, from which the heat conductivity can be uniquely defined and computed using equilibrium ab initio molecular dynamics. When combined with a novel topological definition of atomic oxidation states, gauge invariance also sheds new light onto the mechanisms of charge transport in ionic conductors. The second principle, convective invariance, allows one to extend the analysis to multi-component systems. These invariance principles can be combined with new spectral analysis methods for the current time series to be fed into the Green–Kubo formula to obtain accurate estimates of transport coefficients from relatively short molecular dynamics simulations.
Graphic abstract
Introduction
Transport coefficients are archetypal examples of off-equilibrium properties, which describe in fact entropy production and the approach to equilibrium in extended systems, thus giving a quantitative meaning and a conceptual framework to the very notion of irreversibility and the arrow of time. While non-equilibrium statistical mechanics is still a very active and largely unsettled field of research, the relaxation of small off-equilibrium fluctuations and the response of systems to small perturbations have been given a rigorous theoretical foundation back in the fifties by the Green–Kubo (GK) theory of linear response [1–4]. Among other achievements, this theory provides a rigorous and elegant way to cast the computation of transport coefficients into the evaluation of equilibrium time correlation functions of suitably defined fluxes, thus making it accessible to equilibrium molecular dynamics (EMD) simulations. This feat notwithstanding, the several conceptual subtleties underlying the linear-response theory of transport coefficients are often dodged or disguised with a clumsy notation that makes it difficult to fully appreciate their scope and impact on the design and use of computer simulation methodologies. Also due to this predicament, a number of misconceptions have affected the otherwise mature and fecund field of computer simulation of transport in condensed matter, thus unduly limiting its scope. Foremost among these misconceptions is that the intrinsic indeterminacy of any local representations of an extensive quantity, such as the density or atomic break-up of the energy (or charge and mass, for that matter), would undermine the uniqueness of the transport coefficients that are derived from them. While similar reservations should apply to classical and quantum ab initio simulations alike, they have in fact impacted mainly the latter, to the extent that until recently it had been believed that the GK theory of heat transport could not be combined with quantum simulation methods based on electronic-structure theory. This quandary has been recently overcome, mainly thanks to the introduction of a so-called gauge invariance principle of transport coefficients [5, 6], which basically states that, under well-defined conditions, the value of a transport coefficient is largely independent of the detailed form of the conserved densities and fluxes from which they are derived through the GK formulae. In full generality, gauge invariance implies that the value of a transport coefficients is unchanged if the flux from which it is calculated is modified by adding to it a vector process whose power spectrum vanishes at (such a process is conveniently dubbed non-diffusive). Further difficulties may arise in the case of multi-component systems, where the interaction among different fluxes make the transport of one conserved quantity (such as, e.g., energy), depend on the dynamics of the other hydrodynamical variables (such as, e.g., the number of different molecular species), thus muddling the definition of heat conductivity in these systems. This difficulty is solved by defining, e.g., the thermal conductivity as the ratio between the energy flux and the temperature gradient, when all the other conserved fluxes vanish. In this case, a further invariance principle, dubbed convective invariance [7], states that the thermal conductivity results to be unchanged if the definition of the energy flux is altered by adding to it an arbitrary linear combination of the mass fluxes of the molecular species constituting the system. While the value of the transport coefficients enjoys the invariance properties mentioned above, the statistical properties of the flux time series from which they are derived do depend on the microscopic representation of the conserved densities and current densities, thus substantially affecting the statistical error of the transport coefficient being computed. This dependence opens the way to optimizing this representation, so as to minimize the resulting statistical errors. This freedom can be exploited in conjunction with the recently introduced cepstral analysis of the current spectra [8] to substantially reduce the length of the EMD simulations needed to evaluate a transport coefficient to a given target accuracy.
In this paper, we review the concepts of gauge and convective invariance in the classical theory of transport in condensed matter, with emphasis on their application to the computation of transport coefficients from the GK theory of linear response and EMD, as well as some concepts and tools for the spectral analysis of the current time series, which can be used in conjunction with these invariance principles to substantially reduce the statistical errors affecting the EMD estimate of various conductivities. In Sect. 2, we briefly lay down the GK linear-response theory of transport, aiming at establishing some general terminology and notations. In Sect. 3, we introduce the concept of gauge invariance, whereas convective invariance is discussed in Sect. 4. In Sect. 5, we discuss a newly introduced spectral method (dubbed cepstral analysis) to evaluate and systematically reduce the statistical error affecting the estimate of transport coefficients from EMD. In Sect. 6, we specialize our discussion to ab initio heat transport and report a general expression for the microscopic heat flux suitable to density functional theory. In Sect. 7, we show how gauge invariance can be combined with concepts from topology to reveal some unexpected features of charge transport in ionic conductors. Section 8 finally contains our conclusions.
Theory
Transport theory is essentially a dynamical theory of hydrodynamical variables, i.e. of the long-wavelength components of the densities of conserved, extensive, variables, which in short we refer to as conserved charges and densities. In a simple fluid, for instance, the conserved charges are the energy, the three components of total momentum, and the total number of molecules of each chemical species. The corresponding transport coefficients are the thermal conductivity, the viscosity, and the various diffusivities. Let be one such conserved charge, and the corresponding density: . Here and in the following we adopt the convention that a hat, as in , indicates an implicit dependence on the system’s phase-space variables, : , where is the set of all the atomic coordinates and momenta. When neither a hat nor the phase-space argument are present, A indicates the expectation of over some suitably defined phase-space distribution, (most frequently the canonical or micro-canonical equilibrium distributions): . When it will be necessary to distinguish between the expectation of a quantity, A, and its phase-space representation, , the latter will be referred to as a phase-space sample of the former. Sometimes, we will need to indicate explicitly the implicit time dependence of a phase-space variable (no pun intended). When this is done, we will mean the phase-space variable , as a function of time and of the initial condition, , of a phase-space trajectory, , as determined by Hamilton’s equations of motion.
Locality implies that for any conserved density, , a (conserved) current density, , can be defined, such that the two of them satisfy the continuity equation, , where indicates the time derivative of q. A current density, , satisfying the continuity equation with a conserved density, , will be said to be conjugate to q. Strictly speaking, the continuity equation holds for the expectations of conserved densities and current densities, as well as for their phase-space samples, . For the sake of unburdening the notation as much as possible, we will sometimes overlook the distinction between phase-space samples and their expectations.
The Fourier transform of the continuity equation reads:
1 |
where is the Fourier transform of q, and similarly for and any other function of . Equation (1) shows that the smaller the wave-vector, , i.e. the longer the wavelength, the slower the dynamics of conserved densities and fluxes. This means that, at sufficiently long wavelength, conserved densities and current densities are adiabatically decoupled from the (zillions of) other atomically fast degrees of freedom. Also, translational invariance implies that conserved densities at different wavevectors do not interact with each other. As a consequence, the dynamics of hydrodynamic variables is determined by a handful of equations that couple them with each other at fixed wavevector. When the intensive variables conjugate to the conserved quantities depend on position sufficiently slowly, the system can be thought of as locally in thermal equilibrium and conserved currents can then be connected to the thermodynamical affinities (i.e. to the gradients of the intensive variables) through the so-called constitutive equations [9]. By combining the constitutive equations with the continuity equations of all the conserved densities and currents, the Navier–Stokes equation of classical hydrodynamics can be derived [9].
Let us consider a system described by a Hamiltonian and subject to a time-dependent external perturbation, , where is a set of conserved densities. The perturbed Hamiltonian reads:
2 |
Here the s are to be treated as strengths of the perturbation in linear-response theory. As such, they are not phase-space functions, and they dependence on time only explicitly. To first order in the vs, the expected value of the conserved current densities conjugate to the qs, , can be obtained from the GK theory of linear response [3, 4] as [10]:
3 |
where is the Boltzmann’s constant, T the system’s temperature, and the correlation function, , is defined for a pair of general phase-space variables, and , as the equilibrium expectation over the initial conditions of a molecular trajectory, , of the time-lagged product of the values of the variables:
4 |
The dependence of the correlation function in Eq. (4) on the time difference is due to time-translation invariance ensuing from the equilibrium condition. By the same token, space-translation invariance makes the correlation function in Eq. (3) only depend on , turning the integral in into a convolution. Using the continuity equation to replace the time derivative of the density with the divergence of the conjugate current density, Eq. (3) can be cast into a linear relation between the Fourier transforms of the longitudinal component of the current density and the forces acting on the system:
5 |
where indicates the Fourier transform of the longitudinal component of a generic vector field , is the longitudinal susceptibility of the current densities, and is the force field associated with the perturbing potential. For the sake of streamlining the notation and without much loss of generality, we will restrict ourselves to longitudinal perturbations and response currents, and drop the “” suffix from currents and forces. If the external perturbation is independent of time, in the long-wavelength limit Eq. (5) results in the Onsager relation between particle fluxes and applied forces [11, 12]:
6 |
where the index denotes for short the combination of the indices i, for the conserved charge, and , for the Cartesian component; the fluxes and forces are the macroscopic averages of the current densities and force fields, respectively; is the system’s volume, and
7 |
is the matrix of Onsager’s transport coefficients [11, 12]. In the case of a charged fluid, for instance, the steady state charge flux, , induced by a stationary electric field, , is given by , where the static electrical conductivity is .
We can reformulate the GK expression of Onsager’s coefficients in another equivalent representation, the so-called Helfand–Einstein (HE) formula, which will be expedient in the following and is also better behaved statistically, based on the identity:
8 |
valid for any even function, Let be a stationary stochastic process representing a conserved flux, so that only depends upon . By applying the identity above, we obtain
9 |
This is called the HE formula, which gives a transport coefficient of some conserved charge, as the ratio of the mean-square dipole, , displaced by the conserved flux, , in a time t and time itself. This argument was first exploited by Einstein in his celebrated paper on Brownian motion [13] to establish the relation between diffusivity and velocity auto-correlation functions, and later extended by Helfand to general transport phenomena [14]. A comparison between the numerical performance of the GK and the HE formulas is displayed in Fig. 1 in the case of charge transport in a molten salt. The better stability of the HE integral is evident: not only does the HE integral converge faster than the GK one, but the variance on the first, even though growing linearly with time, is much smaller than the one on the second.
To understand the better statistical behaviour of the HE representation of transport coefficients with respect to the GK one, we leverage the relation between conductivities and the spectral properties of the conserved fluxes, which will also be instrumental in our subsequent considerations on data analysis in Sect. 5. Let us define
10 |
where
11 |
One has
12 |
where
13 |
Using the Parseval–Plancherel identity [15], one gets
14 |
where and . The two functions
15 |
where is the cardinal sine function, are displayed in Fig. 2: as , they both tend to , where indicates the Dirac delta function. One sees that the statistical accuracy of the HE estimate of the transport coefficients is higher than GK’s. Using the findings of Sect. 5, in Appendix A we demonstrate that, in the large-T limit, one has indeed the following relation between the statistical uncertainties: .
Gauge invariance
General theory
The gauge-invariance principle of transport coefficients is the condition by which transport coefficients are largely insensitive to the specific definition of the fluxes. In fact, from a microscopic standpoint, any two conserved densities, and , whose integrals over a volume differ by a quantity that scales as the volume boundary, should be considered equivalent in the large volume limit, . For instance, two equivalent densities may differ by the divergence of a (bounded) vector field :
16 |
In this sense, and can be thought of as different gauges of the same scalar field. Since is also conserved, for any given gauge of the conserved density, , a conserved current density can be defined, , to satisfy the continuity equation, Eq. (1). By combining Eqs. (16) and (1) we see that conserved current densities, as well as macroscopic fluxes, transform under a gauge transformation as
17 |
where , the time dependence being implicit via the phase-space point , e.g. , and where the dot indicates the Poisson brackets with the unperturbed Hamiltonian. We conclude that the macroscopic energy fluxes in two different energy gauges differ by the total time derivative of a bounded phase-space vector function. Nonetheless, both definitions must lead to the same transport coefficient. To prove such a gauge-invariance principle, let us consider a generic transport process with conserved flux represented by the stochastic stationary process , and define the (generalised, conserved) charge displacement per unit volume . According to the HE formulation we can express the transport coefficient of the process, , as
18 |
where the factor c is specific to the transport process considered. The addition of a time-bounded term to , resulting in the new displacement produces a transport coefficient which coincides with . In fact, by direct calculation, we have
19 |
Some considerations on boundary conditions
To conclude this section, we spend a few words to examine the role of boundary conditions (BC) in practical molecular dynamics simulations of a finite sample of the system. First of all, the simulation box must be larger than the relevant correlation/diffusion lengths of the system, for equilibrium properties to be independent of specific BC adopted in the simulation. In EMD simulations, periodic BC (PBC) are preferred since (1) they minimize size effects, and (2) the limit commutes with thermodynamic limit, where while the density is kept fixed. This commutation no longer holds in open BC (OBC) at thermodynamic equilibrium, where the asymptotic time limit, , must be taken only after the thermodynamic limit is performed.
Differently stated, PBC have to be preferred with respect to OBC, since they are the only ones which can sustain a steady-state flux [16]. Nonetheless, this poses some issues in the definitions of the fluxes, since the textbook definition relying on the first moment of the time-derivative of the (periodic) charge density ,
20 |
cannot be employed since is ill-defined in extended, PBC-closed systems. Strictly speaking, Eq. (20) is ill-defined in PBC for the very same reason why macroscopic polarisation is so in insulators [17]. The formal meaning of this equation is that it should be considered as the leading order of a Taylor series of the Fourier transform of the time derivative of the conserved density in powers of its argument: . Computer simulations performed for systems of any finite size, L, give access to the Fourier components of conserved (current) densities at finite wave-vectors whose minimum magnitude is . Let be the (spatial Fourier transform of the) current-current correlation function evaluated at , which can be easily evaluated from a MD simulation. Unfortunately, cannot be directly used to estimate, not even by extrapolation, the values of the transport coefficients, because the corresponding GK integral vanishes for any finite system size. In practice, the flux to be used to evaluate transport coefficients from the GK formula is obtained from Eq. (20) by formal manipulations that make the flux boundary insensitive. The time correlation function , computed for a system of size L, is well-defined as well, and has the property that, for any given t,
21 |
The tricky thing is the convergence of the limit in Eq. (21) is not uniform and, while , is the non-vanishing GK integral yielding the transport coefficient we are after. Therefore, definitions for the fluxes suitable to PBC must be properly designed not only from the speculative standpoint, but also to run meaningful simulations. In what follows, we shall largely use these PBC-based definitions together with the gauge invariance principle to draw general conclusions on transport properties which do not depend on the system size, and hold in the thermodynamic limit.
Convective invariance
In a multicomponent system, the (relevant) conserved charges are the energy and the particles number (or, equivalently, the masses) of each atomic species. Since the total-mass flux (i.e. the total linear momentum) is itself a constant of motion, for a K-species system the number of independent conserved fluxes is equal to K (energy, plus partial masses) [18]. Further constraints may reduce the number of relevant conserved fluxes. For instance, energy flux becomes the only relevant conserved flux in solids or in one-component molecular liquids, as long as the molecules do not dissociate. In true multicomponent systems (molten salts, solutions, etc.), neither the mass fluxes of the single atomic species are constant of motion nor their integral is a bound quantity. This fact must be taken into account in practical simulations of heat transport since the thermal conductivity relates, by definition, a gradient of temperature to the induced energy flux in the absence of convection, i.e. when the non-equilibrium average of the macroscopic mass flux vanishes.
To make things simpler but also more quantitative, let us consider a two-component system, like, e.g. a molten salt. The conserved fluxes are the energy flux and the mass flux of one of the species . The following system of phenomenological equations holds:
22 |
where is the difference between the chemical potentials of the two species [19], and where , being the mass flux of species l, the last step following from the conservation of linear momentum. When we set the mass flux ,
23 |
which can be substituted in the first equation of the system to finally find
24 |
25 |
where we employed the symmetric property . In the light of GK theory, the thermal conductivity is obtained by removing from the GK integral of the energy flux a term which represents the contributions of convection/mass diffusion to heat flow [20].
It is straightforward to verify that a change in the definition of the microscopic energy flux by any multiple of the mass flux,
26 |
does not affect , even if such change does change each of the Onsager coefficients in Eq. (25). We dub this peculiar property the “convective invariance” principle. This can be easily extended to more than two species, , thanks to standard linear algebra techniques. In such case, becomes
27 |
where is the square matrix of Onsager coefficients of the mass fluxes. The convective-invariance principle reads
28 |
Any linear combination of the mass fluxes can be added to the energy flux without affecting the thermal conductivity. This statement has a direct, crucial consequence concerning ab initio calculations: the heat conductivity cannot in fact depend on whether atomic cores contribute to the definition of the atomic energy, as they would in an all-electron calculation, or not, as they would when using pseudo-potentials. In the latter case, the energy of isolated atoms would depend on the specific form of pseudo-potential adopted, which is to a large extent arbitrary, while the heat conductivity in all cases should not. Thanks to convective invariance, shifting the zero of energy of each species by a quantity would result in a change which does not affect , just like physical intuition would suggest. Finally, from a more practical way, convective invariance also avoids the calculation of partial enthalpies to dispose of the spurious self-energy effects, a rather tedious and cumbersome task [21–23].
Cepstral analysis
Wiener–Khinchin theorem
Cepstral analysis is a powerful spectral method introduced in the ’60s for the analysis of time series, mainly in the field of speech recognition and sound engineering [24]. To deploy its power to extract the transport coefficient from the time series of the relevant conserved fluxes, we must shift to a Fourier-space representation of stochastic processes, as allowed by the Wiener–Khinchin theorem [25, 26]. The latter states that the expectation of the squared modulus of the Fourier transform of a stationary process is the Fourier transform of its time correlation function. We can thus apply this result to the case where the stochastic process is a conserved flux, [the Cartesian indices have been omitted for clarity], and generalize Eq. (9) to the finite-frequency regime as follows:
29 |
More generally, when several fluxes interact with each other, one can define the cross-spectrum of the conserved fluxes as the Fourier transform of the cross time-correlation functions:
30 |
Onsager’s coefficients can be thus expressed as
31 |
As we shall see, one can leverage on the Wiener–Khinchin theorem and the gauge invariance principles to obtain good estimates (i.e. within 10% accuracy) of the transport coefficients with relatively short EMD simulations (i.e. 10–100 ps). In practice, this result is based on a particular spectral method named cepstral analysis of time series, which we describe below.
Periodograms and power spectra
Let us focus on one specific flux. In MD simulation, we shall have it as a (discrete time) sample, here denoted with the calligraphic font, of the flux process:
32 |
Here is the sampling period, in general a multiple of the timestep of the simulation , so that is the total length of the simulation. The discrete Fourier transform of the flux time series is defined by
33 |
for . The sample power spectrum (i.e. the periodogram) is defined as
34 |
and, for large N, it is an unbiased estimator of the power spectrum of the process, as defined in Eq. (29), evaluated at
35 |
namely
36 |
Since , we have
37 |
which, in the continuous limit, amounts to saying that the power spectrum is an even function of frequency: . Thanks to this last point, periodograms are usually reported for and their Fourier transforms are evaluated as discrete cosine transforms. The space autocorrelations of conserved currents are usually short-ranged. Therefore, in the thermodynamic limit, the corresponding fluxes can be considered sums of (almost) independent identically distributed (iid) stochastic variables: according to the central-limit theorem their equilibrium distribution is Gaussian. Generalizing this argument allows us to conclude that any conserved-flux process is Gaussian as well. The flux time series is in fact a multivariate stochastic variable that, in the thermodynamic limit, results from the sum of (almost) independent variables, thus tending to a multivariate normal deviate. In particular,
for or , ;
for , and are independent and both .
Here indicates a normal deviate with mean and variance . In conclusion, in the large-N (i.e. long-time) limit the periodogram of the time series reads:
38 |
the being independent random variables, where is the chi-square distribution with degrees of freedom and is the number of independent samples of the current (for instance, when the 3 equivalent Cartesian components of the flux are considered). In particular, and .
Equation (38) shows that is an unbiased estimator of the zero-frequency value of the power spectrum, i.e. that , and, through Eq. (31), of the Onsager coefficient we are after. However, this estimator is not consistent, i.e. its variance does not vanish in the large-N limit: The multiplicative nature of the statistical noise makes it difficult to disentangle it from the signal. A way to solve the problem is to apply the logarithm to Eq. (38) to turn the multiplicative noise into an additive one by defining the log-periodogram, , as
39 |
The quantities are iid stochastic variables whose statistics is well known: their mean and variance are simply expressed in terms of the digamma and trigamma functions, and , respectively [27]. Furthermore, whenever the number of significant (inverse) Fourier components of is much smaller than the length of the time series, applying a low-pass filter to Eq. (39) would result in a reduction of the power of the noise, without affecting the signal. To exploit this idea, we define the cepstrum of the time series as the inverse Fourier transform of its sample log-spectrum [28]:
40 |
A generalized central-limit theorem for Fourier transforms of stationary time series ensures that, in the large-N limit, these coefficients are a set of independent (almost) identically distributed zero-mean normal deviates [29, 30]. It also follows that:
41 |
Figure 3 confirms the typical behaviour of cepstral coefficients calculated from the low-frequency region of the periodogram. We see that only a few coefficients are in fact substantially different from zero, within statistical uncertainty.
Let us indicate by the smallest integer such that for . By limiting the Fourier transform of the sample cepstrum, Eq. (40), to coefficients, we obtain an efficient estimator of the zero-frequency component of the log-spectrum, , whose expectation and variance are
42 |
43 |
We thus see that the logarithm of the Onsager coefficient we are after can be estimated from the cepstral coefficients of the flux time series through Eqs. (42-43), and that the resulting estimator is always a normal deviate whose variance depends on the specific system only through the number of these coefficients, . Notice that the absolute error on the logarithm of the conductivity directly and nicely yields the relative error on the conductivity itself. The efficacy of this approach obviously depends on our ability to estimate the number of coefficients necessary to keep the bias introduced by the truncation to a value smaller than the statistical error, while maintaining the magnitude of the latter at a prescribed acceptable level. In Ref. [8], it has been proposed to estimate using the Akaike’s information criterion [31], even if other more advanced model selection approaches may be more effective [32]. A plot of vs is shown in Fig. 4. We immediately realize that returned by the AIC (vertical dashed line) is indeed capable to find the correct value for the log-spectrum—within statistical error—of lowest variance. Furthermore, thanks to the convective invariance principle described in Sect. 4, the cepstral analysis can be extended to multicomponent systems [7]. This multivariate cepstral method has been recently applied to calculate the ab initio thermal conductivity of water at planetary conditions from trajectories as short as a few tens of picoseconds [33]. It has been also shown that the multivariate cepstral analysis is able to substantially reduce the statistical error affecting the estimate of thermal conductivity even for one-component systems: it can in fact decorrelate the finite-frequency power spectrum of non-diffusive fluxes (like the mass flux, or the adiabatic electronic flux in ab initio simulations) from the heat flux power spectrum, which will have its total power considerably reduced, and whose low-frequency portion will be easier to analyse [7]. The statistical tools for time-series analysis presented in this Section have been implemented in the open-source code SporTran, which is freely downloadable from the GitHub repository https://github.com/lorisercole/sportran [34].
Ab initio heat transport in insulators
Until only a few years ago, it was thought that ab initio MD simulations were in general unsuitable to a GK theory of thermal transport, since the continuous electronic density makes any decomposition of the energy flux into local, atomic contributions fully arbitrary [35]. This consideration clashes with a reductionist picture whereby a fundamental description of the microscopic interactions should be in principle suitable to describe heat transport in the very general GK theory, as well. Once again, this apparent inconsistency stands on the misconception that the definition of microscopic fluxes must be unique, and it is thus solved thanks to the gauge-invariance principle. Apart from correcting such a misconception, the gauge-invariance principle proves to be also a rigorous mathematical tool to derive a well-defined expression (out of the infinitely many possibilities!) for the energy flux directly from DFT, with no ad hoc approximation tailored on the considered physical system [36]. The starting point is the standard DFT definition of the total energy in terms of the Kohn–Sham (KS) eigenvalues , eigenfunctions , and density [37]:
44 |
where is the electron charge, is a local exchange-correlation (XC) energy per particle defined by the relation , the latter being the total XC energy of the system, and is the XC potential [in this Section and in Sect. 7 we drop the hat on top of classical observables to avoid confusion with quantum operators]. From this we can write a DFT energy density as [38]
45 |
where
46 |
47 |
48 |
is the instantaneous self-consistent Kohn–Sham Hamiltonian, and is the Hartree potential. An explicit expression for the DFT energy flux,
49 |
obtained by computing the first moment of the time derivative of the energy density in Eqs. (45–48), results in a number of terms, some of which are either infinite or ill-defined in PBC, since the position operator is not periodic. Leveraging on gauge invariance and on a careful breakup and refactoring of the various harmful terms, as explained in Refs. [5, 39], we can recast Eq. (49) in a form suitable to PBC, whose final expression is
50 |
where
51 |
52 |
53 |
54 |
55 |
Here is the bare, possibly non-local, (pseudo-)potential acting on the electrons and
56 |
are the projections over the empty-state manifold of the action of the position operator over the vth occupied orbital, and of its adiabatic time derivative [40–42], respectively, and being the projector operators over the occupied- and empty-states manifolds, respectively. Both these functions are well defined in PBC and can be computed, explicitly or implicitly, using standard density-functional perturbation theory [42]. These rather complicated formulae have been implemented in the open-source code QEHeat of the Quantum ESPRESSO project which has been publicly released [43].
Ab initio charge transport in ionic conductors
Ab initio charge transport in ionic conductors
While in metals the current is carried by delocalised conduction electrons, in electronic insulators the electrons are bound to follow adiabatically the ionic motion and no charge transport can occur if the ion positions are frozen. Nonetheless, when ions are allowed to move, like in ionic liquids, charge can be displaced. Daily life examples range from simple salted water, to liquid electrolytes employed in Li-ion batteries, or to the molten salts (NaCl, KCl, etc.) used as heat exchangers in power plants.1 Due to their large electronic bandgap, ionic liquids are in general transparent to visible light and possess a negligible fraction of “free”, conduction electrons. When the quantum nature of the electrons is considered, a question arises about a proper definition for the charge flux, , to employ in the GK expression for the electrical conductivity, :
57 |
Any static partition of the instantaneous total charge to express into atomic contributions is in fact totally arbitrary.
The solution comes from the modern theory of polarisation (MTP), which provides a definition for the polarisation valid for extended systems [45–47]. Just like the electronic Hamiltonian and ground state, depends on time through the nuclear coordinates only: we can thus apply the chain rule and write as a sum of atomic contributions
58 |
where is the atomic velocity of the ith atom and is a time-dependent tensor whose entries, , are the derivative of the cell polarisation along the Cartesian direction with respect to the atomic displacement of atom i along . The dynamical charges are called “Born effective-charge tensors”, and can be computed in perturbation theory [43], or by finite force differences when introducing a finite electric field in accordance to the MTP [48]. Born charge tensors are in general strongly dependent on the atomic positions, and they display large fluctuations along a AIMD trajectory, even for strongly ionic systems [49]. For these reasons, the charge flux defined in Eq. (58) is in general fluctuating and does not vanish identically. Therefore, there is no apparent reason for the GK integral in Eq. (57) to be exactly zero. How come, then, that the electrical conductivity of, say, pure water is zero? And why, instead, under the same microscopic formalism, does a salted water solution have a non-vanishing ? Another strange “coincidence” reported in the literature [50] is that by replacing in Eq. (58) the time-dependent Born charge tensors with the predefined constant integer charges—the oxidation numbers of the atoms—to define a new flux
59 |
leads to the same GK integral, Eq. (57), that is obtained via the alternative definition in Eq. (58).2
It seems, therefore, that two main aspects must be understood to answer the fundamental questions raised above: first, the presence of integer numbers suggests some underlying quantisation in the charge transport process; second, the insensitivity of to the choice of one of the two different definition hints at a manifestation of the gauge invariance principle of transport coefficients, described in Sect. 3. We analyse in detail these two aspects in what follows.
Quantisation of charge transport
In 1983, David Thouless showed that for a quantum system in PBC, with non-degenerate ground state evolving adiabatically under a cyclic, slowly varying Hamiltonian , the total displaced dipole, is quantised, i.e. the time integral of the current over one cycle is equal to a triplet of integers, multiplied by the side L of the cell, which we shall assume cubic for simplicity [51]:
60 |
with . In AIMD simulations, atomic trajectories are identified by paths in the nuclear configuration space (NCS). The electronic Hamiltonian, , depends parametrically upon time through the nuclear positions, at time t, , with . A path in NCS whose end points are one the the periodic image of the other can be thus considered as a cycle for the electronic Hamiltonian. Therefore, we can thus employ Thouless’ theorem and show, under very general hypothesis, that the electric dipole displaced along is itself partitioned into atomic contributions:
61 |
where is the triplet containing the number of cells spanned, along , by atom i in x, y and z directions, and are integer constants which only depend on the species S(i) of atom i, and are shown to coincide with the oxidation numbers suggested by chemical intuition [49]. This provides a quantum-mechanical definition of oxidation numbers where their integerness arises naturally (and not just as an approximation of some real charge), and which identifies them as intrinsically dynamical quantities [49, 52].
Gauge invariance of electrical conductivity
We are now ready to combine the theory of quantisation of charge transport and the gauge-invariance principle of transport coefficients to answer the questions raised at beginning of this Section. Let us consider a physical path in the NCS from an initial configuration I to the configuration F, as a result, e.g. of an AIMD simulation. Then we elongate the path fictitiously up to the point which is the replica (periodic image) of the initial point I sharing with the point F the same cell of the nuclear configuration space, as depicted by the dashed line in Fig. 5.
Due to the additivity of integrals, the electric dipole displaced along the physical path IF reads
62 |
Since the open path entirely belongs to one cell, is a bounded quantity. Therefore, thanks to gauge-invariance, to evaluate we only need to consider
63 |
since
64 |
By the same token, the electric dipole displaced from I to F by to the flux defined in Eq. (59) is
65 |
Therefore, we can conclude that
66 |
which, by comparison with Eq. (64), proves the equivalence of the electrical conductivities obtained via Eqs. (58) and (59). This is shown in Fig. 6 for an ab initio MD simulation of molten KCl.
As explained in detail in Ref. [49], this result is grounded on the hypothesis, which we dubbed strong adiabaticity, that any closed paths in the NCS can be shrunk to a point without closing the electronic gap: this implies—see Eq. (61)—that charge transport can occur only through a net displacement of the ions, as it is typical of stoichiometric ionic conductors. The breach of strong adiabaticity may instead dictate a non-trivial charge-transport regime where charge may be adiabatically transported even in the absence of a net ionic displacement. As shown in Ref. [53] this non-trivial behaviour is intertwined with the presence, in non-stoichiometric electrolytes, of dissolved yet localised electrons, whose displacement is to a large extent uncorrelated to that of the ions. By the same token discussed in Sect. 3.2, we remark that all the conclusions drawn in this Section are independent of the (macroscopic) size L of the system. In fact, even though all the derivation of Eq. (66) is done at finite size, the use of charge fluxes which are well defined in PBC ensures that the GK integrals of their correlation functions are well defined and non-vanishing identically, for any L.
Conclusions
To conclude, we believe that we managed to show how the invariance principles of transport coefficients can be employed, within the general Green–Kubo theory of linear response, to demystify some common misconceptions based on the groundless assumption that the microscopic conserved fluxes must be uniquely defined. We have further deployed the power of invariance principles in the construction of numerical tools for the statistical analysis of the time series of the fluxes, produced via equilibrium molecular dynamics simulations. These tools provide accurate values for the transport coefficients from the relatively short trajectories which are accessible to ab initio MD simulations. We also showed how to design an ab initio heat flux suitable to extended systems in PBC, from which the ab initio thermal conductivity can be computed without any further system- or phase-specific assumptions. Finally, we have applied the gauge invariance principle to the ab initio charge transport in ionic systems. We showed that each ion can be associated with a well-defined integer and time-independent charge, and the exact ab initio electrical conductivity can be obtained by replacing, with these integer charges, the time-dependent Born tensors, which enter the definition of the charge flux and whose calculation is a computationally expensive and quite abstruse task. In this way, we recover the classical (Faraday’s) idea of atomic contributions to charge transport [54] and provided a theoretically sound definition to the concept of oxidation states in ionic liquid insulators. We are confident that the results here exposed will be a valid aid for both theorists and practitioners aiming at deeper insights and more efficient implementations about the calculation of transport coefficients from molecular dynamics simulations.
Acknowledgements
We thank Paolo Pegolo for fruitful discussions and a thorough reading of the manuscript.
Appendix A: Variance on GK and HE formulas
To evaluate the statistical error affecting the GK or HE expressions of the transport coefficients, Eq. (14), we consider it as the expectation of the estimator:
A1 |
where , is the total length of the time series of the flux, N the number of its terms, the sampling period, and are the set of independent stochastic variables introduced in Eq. (38). As the variables are independent and identically distributed, one has
A2 |
where we employed Eq. (10) and where the factor 2 in the first step accounts for the full correlation between and . This behaviour is shown in Fig. 7, which displays the theoretical estimate, Eq. (A2), for the variance on the GK integral, as well as the empirical variances for GK (blue) and HE (orange) integrals of the charge flux autocorrelation function, obtained via standard block analysis from the simulation of molten KCl already discussed in Sect. 2.
Notice that, to obtain the variance on the mean value , the variance of the process, which is independent of the number B of trajectories (or blocks) of length employed, must be further divided by B [56].
Funding
Open Access funding provided by EPFL Lausanne. This work was partially funded by the European Union through the MaX Centre of Excellence for Supercomputing applications (Project No. 824143), by the Italian MIUR/MUR through the PRIN 2017 FERMAT grant and by the Swiss National Science Foundation (SNSF), through Project No. 200021-182057. FG acknowledges funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie Action IF-EF-ST, grant agreement no. 101018557 (TRANQUIL).
Data Availability Statement
The manuscript has associated data in a data repository. [Authors’ comment: The data that support the plots and relevant results within this paper are available on the Materials Cloud platform [55] at [Materials Cloud Archive 2021.112(2021), 10.24435/materialscloud:rp-cd.]
Author contribution statement
The authors contributed equally to all parts of this work.
Footnotes
Pure water itself displays conductive behaviour in its exotic phases at high temperatures/pressures, where HO molecules are (fully or partially) dissociated. This has implications in both charge [57] and heat transport [33].
It must be also remarked the the use of other definitions for the atomic charges, like Bader charges, lead to a wrong electrical conductivity [50].
References
- 1.Green MS. J. Chem. Phys. 1952;20:1281. doi: 10.1063/1.1700722. [DOI] [Google Scholar]
- 2.Green MS. J. Chem. Phys. 1954;22:398. doi: 10.1063/1.1740082. [DOI] [Google Scholar]
- 3.Kubo R. J. Phys. Soc. Jpn. 1957;12:570. doi: 10.1143/JPSJ.12.570. [DOI] [Google Scholar]
- 4.Kubo R, Yokota M, Nakajima S. J. Phys. Soc. Jpn. 1957;12:1203. doi: 10.1143/JPSJ.12.1203. [DOI] [Google Scholar]
- 5.Marcolongo A, Umari P, Baroni S. Nat. Phys. 2016;12:80. doi: 10.1038/nphys3509. [DOI] [Google Scholar]
- 6.Ercole L, Marcolongo A, Umari P, Baroni S. J. Low Temp. Phys. 2016;185:79. doi: 10.1007/s10909-016-1617-6. [DOI] [Google Scholar]
- 7.Bertossa R, Grasselli F, Ercole L, Baroni S. Phys. Rev. Lett. 2019;122:255901. doi: 10.1103/PhysRevLett.122.255901. [DOI] [PubMed] [Google Scholar]
- 8.Ercole L, Marcolongo A, Baroni S. Sci. Rep. 2017;7:15835. doi: 10.1038/s41598-017-15843-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Kadanoff LP, Martin PC. Ann. Phys. 1963;24:419. doi: 10.1016/0003-4916(63)90078-2. [DOI] [Google Scholar]
- 10.S. Baroni, R. Bertossa, L. Ercole, F. Grasselli, A. Marcolongo, Heat Transport in Insulators from Ab Initio Green-Kubo theory, in Handbook of Materials Modeling: Applications: Current and Emerging Materials, editor edited by editor W. Andreoni and editor S. Yip (Springer International Publishing, address Cham, 2018) pp. 1–36, edition 2nd ed. 10.1007/978-3-319-50257-1_12-1,arXiv: 1802.08006 [cond-mat.stat-mech]
- 11.L. Onsager, Phys. Rev. 37, 405 (1931). 10.1103/PhysRev.37.405
- 12.L. Onsager, Phys. Rev. 38, 2265 (1931). 10.1103/PhysRev.38.2265
- 13.Einstein A. Ann. Phys. (Berl.) 1905;322:549. doi: 10.1002/andp.19053220806. [DOI] [Google Scholar]
- 14.Helfand E. Phys. Rev. 1960;119:1. doi: 10.1103/PhysRev.119.1. [DOI] [Google Scholar]
- 15.E.W. Weisstein,Parseval’s Relation, From MathWorld—a Wolfram Web Resource (2020). https://mathworld.wolfram.com/ParsevalsRelation.html
- 16.R. Resta, The insulating state of matter: A geometrical theory, in The Physics of Correlated Insulators, Metals, and Superconductors. Modeling and Simulation, Vol. 7, ed. by E Pavarini, E. Koch, R. Scalettar, and R. M. Martin (Verlag des Forschungszentrum Jülich, 2017) p. 3.5. https://www.cond-mat.de/events/correl17/manuscripts/resta.pdf
- 17.R. Resta, EPL (Europhys. Lett.) 22, 133 (1993). 10.1209/0295-5075/22/2/010
- 18.D. Forster, Hydrodynamic fluctuations, broken symmetry, and correlation functions (Benjamin, 1975). 10.1201/9780429493683
- 19.P. Sindzingre, M.J. Gillan, J. Phys. Condens. Matter 2, 7033 (1990). http://stacks.iop.org/0953-8984/2/i=33/a=014
- 20.Galamba N, de Nieto Castro CA, Ely JF. J. Chem. Phys. 2007;126:204511. doi: 10.1063/1.2734965. [DOI] [PubMed] [Google Scholar]
- 21.Debenedetti PG. J. Chem. Phys. 1987;86:7126. doi: 10.1063/1.452362. [DOI] [Google Scholar]
- 22.R. Vogelsang, C. Hoheisel, Phys. Rev. A. 35, 3487 (1987). 10.1103/PhysRevA.35.3487
- 23.P. Sindzingre, C. Massobrio, G. Ciccotti, D. Frenkel, Chem. Phys. 129, 213 (1989). 10.1016/0301-0104(89)80007-2
- 24.B.P. Bogert, J.R. Healy, J.W. Tukey, The quefrency alanysis [sic!] of time series for echoes; cepstrum, pseudo-autocovariance, cross-cepstrum and saphe cracking, in Proceedings of the Symposium on Time Series Analysis (John Wiley & Sons, Inc., 1963) pp. 209–243
- 25.Wiener N. Acta Math. 1930;55:117. doi: 10.1007/BF02546511. [DOI] [Google Scholar]
- 26.Khintchine A. Math. Ann. 1934;109:604. doi: 10.1007/BF01449156. [DOI] [Google Scholar]
- 27.E.W. Weisstein, Polygamma Functions, From MathWorld—a Wolfram Web Resource (2020). https://mathworld.wolfram.com/PolygammaFunction.html
- 28.Childers DG, Skinner DP, Kemerait RC. Proc. IEEE. 1977;65:1428. doi: 10.1109/PROC.1977.10747. [DOI] [Google Scholar]
- 29.T.W. Anderson, The Statistical Analysis of Time Series (Wiley-Interscience, Hoboken, 1994)
- 30.Peligrad M, Wu WB. Ann. Prob. 2010;38:2009. doi: 10.1214/10-AOP530. [DOI] [Google Scholar]
- 31.H. Akaike, Information theory and an extension of the maximum likelihood principle, in 2nd International Symposium on Information Theory pp. 267–281 (edited by B. N. Petrov and F. Csáki, 1972)
- 32.G. Claeskens, N.L. Hjort, Model Selection and Model Averaging (Cambridge University Press, Cambridge, 2008). 10.1017/CBO9780511790485
- 33.F. Grasselli, L. Stixrude, S. Baroni, Nat. Commun. 11, 1 (2020). 10.1038/s41467-020-17275-5 [DOI] [PMC free article] [PubMed]
- 34.L. Ercole, R. Bertossa, S. Bisacchi, S. Baroni, SporTran: a code to estimate transport coefficients from the cepstral analysis of a multi-variate current stationary time series. https://github.com/lorisercole/sportran (2017–2021)
- 35.Stackhouse S, Stixrude L, Karki BB. Phys. Rev. Lett. 2010;104:208501. doi: 10.1103/PhysRevLett.104.208501. [DOI] [PubMed] [Google Scholar]
- 36.Carbogno C, Ramprasad R, Scheffler M. Phys. Rev. Lett. 2017;118:175901. doi: 10.1103/PhysRevLett.118.175901. [DOI] [PubMed] [Google Scholar]
- 37.R.M. Martin, Electronic Structure: Basic Theory and Practical Methods (Cambridge University Press, Cambridge, 2008). 10.1017/CBO9780511805769
- 38.Chetty N, Martin R. Phys. Rev. B. 1992;45:6074. doi: 10.1103/PhysRevB.45.6074. [DOI] [PubMed] [Google Scholar]
- 39.A. Marcolongo, Theory and ab initio simulation of atomic heat transport, SISSA PhD thesis, Trieste (2014). http://hdl.handle.net/20.500.11767/3897
- 40.P. Giannozzi, S. Baroni, N. Bonini, M. Calandra, R. Car, C. Cavazzoni, D. Ceresoli, G.L. Chiarotti, M. Cococcioni, I. Dabo et al., J. Phys. Condens. Matter 21, 395502 (2009). 10.1088/0953-8984/21/39/395502 [DOI] [PubMed]
- 41.P. Giannozzi, O. Andreussi, T. Brumme, O. Bunau, M.B. Nardelli, M. Calandra, R. Car, C. Cavazzoni, D. Ceresoli, M. Cococcioni et al., J. Phys. Condens. Matter 29, 465901 (2017). 10.1088/1361-648X/aa8f79 [DOI] [PubMed]
- 42.Giannozzi, O. Baseggio, P. Bonfá, D. Brunato, R. Car, I. Carnimeo, C. Cavazzoni, S. De Gironcoli, P. Delugas, F. Ferrari Ruffino, et al., J. Chem. Phys. 152, 154105 (2020) 10.1063/5.0005082 [DOI] [PubMed]
- 43.Baroni S, de Gironcoli S, Dal Corso A, Giannozzi P. Rev. Mod. Phys. 2001;73:515. doi: 10.1103/RevModPhys.73.515. [DOI] [Google Scholar]
- 44.A. Marcolongo, R. Bertossa, D. Tisi, S. Baroni, Computer Physics Communications, 10.1016/j.cpc.2021.108090, in press, and arXiv: 2104.06383 (2021)
- 45.R. Resta, Phys. Rev. Lett. 80, 1800 (1998). 10.1103/PhysRevLett.80.1800
- 46.R. Resta, D. Vanderbilt, Theory of polarization: a modern approach, in Physics of Ferroelectrics: A Modern Perspective (Springer Berlin Heidelberg, address Berlin, Heidelberg, 2007). 10.1007/978-3-540-34591-6_2
- 47.D. Vanderbilt, Berry Phases in Electronic Structure Theory: Electric Polarization, Orbital Magnetization and Topological Insulators (Cambridge University Press, Cambridge, 2018). 10.1017/9781316662205
- 48.P. Umari, A. Pasquarello, Phys. Rev. Lett. 89, 157602 (2002). 10.1103/PhysRevLett.89.157602 [DOI] [PubMed]
- 49.F. Grasselli, S. Baroni, Nature Phys. 15, 967 (2019). 10.1038/s41567-019-0562-0
- 50.M. French, S. Hamel, R. Redmer, Phys. Rev. Lett. 107, 185901 (2011). 10.1103/PhysRevLett.107.185901 [DOI] [PubMed]
- 51.Thouless DJ. Phys. Rev. B. 1983;27:6083. doi: 10.1103/PhysRevB.27.6083. [DOI] [Google Scholar]
- 52.Jiang L, Levchenko SV, Rappe AM. Phys. Rev. Lett. 2012;108:166403. doi: 10.1103/PhysRevLett.108.166403. [DOI] [PubMed] [Google Scholar]
- 53.Pegolo P, Grasselli F, Baroni S. Phys. Rev. X. 2020;10:041031. doi: 10.1103/PhysRevX.10.041031. [DOI] [Google Scholar]
- 54.R. Resta, arXiv: 2104.06026 (2021)
- 55.L. Talirz, S. Kumbhar, E. Passaro, A.V. Yakutovich, V. Granata, F. Gargiulo, M. Borelli, M. Uhrin, S.P. Huber, S. Zoupanos et al., Sci. Data 7, 1 (2020). 10.1038/s41597-020-00637-5 [DOI] [PMC free article] [PubMed]
- 56.R.E. Jones, K.K. Mandadapu, J. Chem. Phys. 136, 154102 (2012). 10.1063/1.3700344 [DOI] [PubMed]
- 57.V. Rozsa, D. Pan, F. Giberti, G. Galli, Proc. Natl. Acad. Sci. 115, 6952 (2018). 10.1073/pnas.1800123115 [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The manuscript has associated data in a data repository. [Authors’ comment: The data that support the plots and relevant results within this paper are available on the Materials Cloud platform [55] at [Materials Cloud Archive 2021.112(2021), 10.24435/materialscloud:rp-cd.]