Skip to main content
Springer logoLink to Springer
. 2015 May 24;36(9):2328–2341. doi: 10.1007/s10765-015-1890-7

Thermodynamic Limits of Spatial Resolution in Active Thermography

Peter Burgholzer 1,
PMCID: PMC4643119  PMID: 26594081

Abstract

Thermal waves are caused by pure diffusion: their amplitude is decreased by more than a factor of 500 within a propagation distance of one wavelength. The diffusion equation, which describes the temperature as a function of space and time, is linear. For every linear equation the superposition principle is valid, which is known as Huygens principle for optical or mechanical wave fields. This limits the spatial resolution, like the Abbe diffraction limit in optics. The resolution is the minimal size of a structure which can be detected at a certain depth. If an embedded structure at a certain depth in a sample is suddenly heated, e.g., by eddy current or absorbed light, an image of the structure can be reconstructed from the measured temperature at the sample surface. To get the resolution the image reconstruction can be considered as the time reversal of the thermal wave. This inverse problem is ill-conditioned and therefore regularization methods have to be used taking additional assumptions like smoothness of the solutions into account. In the present work for the first time, methods of non-equilibrium statistical physics are used to solve this inverse problem without the need of such additional assumptions and without the necessity to choose a regularization parameter. For reconstructing such an embedded structure by thermal waves the resolution turns out to be proportional to the depth and inversely proportional to the natural logarithm of the signal-to-noise ratio. This result could be derived from the diffusion equation by using a delta-source at a certain depth and setting the entropy production caused by thermal diffusion equal to the information loss. No specific model about the stochastic process of the fluctuations and about the distribution densities around the mean values was necessary to get this result.

Keywords: Diffusion, Entropy production, Information loss , Kullback–Leibler divergence, Chernoff–Stein Lemma, Stochastic thermodynamics

Introduction

In active thermography, subsurface-embedded structures are detected by heating the sample surface or the structures and measuring the time-dependent surface temperature. The temperature evolution as a function of space and time is determined by the heat diffusion equation, and its solution can be described as a composition of plain thermal waves with different frequencies and wavenumbers [1]. The heat diffusion equation is a macroscopic mean-value-equation in the sense that in a microscopic picture the temperature is proportional to the mean value of the kinetic energy of the molecules. Stochastic processes describe the statistical distributions of the fluctuations around those mean values [2].

The detection and location of the embedded structures from the measured temperature evolution at the surface are an inverse problem. In the microscopic picture, the movement of the molecules is invertible, as the kinetic equations are invariant to time reversal. The macroscopic mean-value-equation, for thermography the heat diffusion equation, is not invariant to time reversal any more. Recent results from non-equilibrium thermodynamics show that for macroscopic samples the mean entropy production is equal to the information loss by using the macroscopic mean-value-equation instead of the microscopic description (see Sect. 2 and ‘Appendix’). This loss of information for the macroscopic description is the physical reason that the inverse problem gets ill-posed and that subsurface structures cannot be detected any more if they are lying too deep under the surface.

Thermal waves are caused by pure diffusion: their amplitude is decreased by a factor of e-2π1/535 within a propagation distance of one wavelength [1]. The diffusion equation, which describes the temperature as a function of space and time (“diffusion-wave fields” [3]), is linear. For every linear equation, the superposition principle is valid, which is known as Huygens principle for optical or mechanical wave fields. This allows a composition of the solution of plane waves having different wavenumbers and similar to the wave equation also for the diffusion equation, components with a higher wavenumber are attenuated more than those with a lower wavenumber. This limits the spatial resolution, like the Abbe diffraction limit in optics. The resolution is the minimal size of a structure which can be detected at a certain depth. If an embedded structure at a certain depth in a sample is suddenly heated, e.g., by eddy current or absorbed light, an image of the structure can be reconstructed from the measured temperature at the sample surface. One possibility for image reconstruction is a time reversal of the thermal waves. This inverse problem is ill-conditioned, and therefore regularization methods have to be used taking additional assumptions like smoothness of the solutions into account. In the present work for the first time, methods of non-equilibrium statistical physics are used to solve this inverse problem without the need of such additional assumptions and without the necessity to choose a regularization parameter.

Reconstructing the samples interior structure from the measured signals at the samples surface is a prominent example of an inverse problem. An inverse problem is a general framework that is used to convert observed measurements into information about a physical object or system. Inverse problems are typically ill-posed, as opposed to the well-posed problems more typical when modeling physical situations where the model parameters or material properties are known. Of the three conditions for a well-posed problem suggested by Hadamard [4] (existence, uniqueness, and stability of the solution or solutions) the condition of stability is most often violated [5]. Even if a problem is well-posed, it may still be ill-conditioned, meaning that initially small errors can grow exponentially. An ill-conditioned problem is indicated by a large condition number. It needs to be re-formulated for numerical treatment. Typically, this involves including additional assumptions, such as smoothness of solution [6]. This process is known as regularization, like truncated singular value decomposition (SVD) or Tikhonov regularization [7]. The choice of an adequate regularization parameter, which describes the trade-off between the original ill-conditioned problem and the additional assumptions (e.g., smoothness), is critical and has to be evaluated for every individual problem [6]. In this paper, we propose that by using the entropy production we get a physical background for choosing the regularization parameter for thermographic imaging and no additional assumption is necessary.

To simplify the following calculations, two assumptions have been made. First, heat diffusion is assumed to take place only in one dimension. Physically this happens in layered structures, when the lateral dimension of these structures is big compared to their thickness. Second, the heat source is a thin layer which is instantaneous heated in a thick sample. Mathematically, this can be approximated by a Dirac delta function in infinite space in one dimension. For real thermography, different boundary conditions on the sample surface have to be used. E.g., Mandelis has solved the heat diffusion equation in his book “Diffusion-Wave Fields” [3] in one, two, or three dimensions for different boundary conditions. The method demonstrated here to get a thermodynamic resolution limit for one-dimensional heat diffusion in infinite space can be generalized to such boundary conditions and more dimensions, as discussed in Sect. 4.

To get the spatial resolution usually a fine structure which is small compared to the resolution is imaged. Then the resolution is equal to the size of the imaged structure. For bigger structures, the convolution of the structure with the blurring from image resolution gives the size of the imaged structure. Using the Dirac delta function as the initial temperature distribution ensures that the size of the imaged structure is just the resolution. Of course the evaluated resolution is valid for any initial temperature profile as the heat diffusion equation is a linear one. For the excitation in active thermography often a pulse is used, e.g., a light or eddy current pulse. The pulse duration is chosen to be short compared to the time needed for thermal diffusion along the samples structures. Mathematically, this gives again a Dirac delta function, but in time. Signals for longer pulses can be calculated by a convolution integral using the temporal evolution of the generation pulse. Generalizations for other generation patterns, like a sinusoidal excitation for lock-in thermography, are discussed in Sect. 4.

In the present work it should be shown that a general thermodynamic limit of the spatial resolution can be derived from a very recent result from stochastic thermodynamics, which is summarized in Sect. 2 and in the ‘Appendix’: if a macroscopic system is kicked out of the equilibrium by a short but not necessarily small perturbation, such as a short laser pulse, for the following dissipative process back to equilibrium the information loss about the kick magnitude is equal to the mean entropy production. In active thermography, the kick magnitude is no single number but the vector of the temperature in space just after the excitation pulse (=kick) or it can be the magnitude of all the Fourier components of the temperature wave, as used in Sect. 3.

The information loss can be quantified by the Kullback–Leibler divergence (e.g., [8], also called relative entropy). The Kullback–Leibler divergence D(f||g) is used in information theory for testing the hypothesis that the two distributions with density f and g are different [8], and is defined as

D(f||g):=lnfxgxfxdx, 1

where ln is the natural logarithm. The Chernoff–Stein Lemma states that if n data from g are given, the probability of guessing incorrectly that the probability distribution for describing the data is f is bounded by the type II error ε=(exp(-D(f||g)))n, for n large [8]. In that sense D(f||g) can describe some “distance” between the distribution densities f and g.

The inverse problem of estimating the kick magnitude from a measurement of an intermediate state, a certain time after the kick is ill-conditioned. Just after the kick its magnitude can be estimated very well. A long time after the kick, the state has nearly evolved back to equilibrium and all the information about the kick magnitude is lost. The information content at an intermediate state a time t after the kick with a distribution density pt in comparison to the equilibrium distribution peq is D(pt||peq). For macroscopic systems, D is equal to the entropy production till time t Sect. 2. It has its maximum just after the kick, when no entropy has been produced yet. Then it decreases monotonously in time and gets zero in the limit of infinite time. But already at some earlier cut-off time tcut, all the information about the kick magnitude is lost: according to the Chernoff–Stein Lemma for a fixed error ε, the distribution pt at time t cannot be distinguished from the equilibrium distribution peq if D(pt||peq) gets smaller than ln1/ε/n.

In active thermography, it is assumed that the information about the spatial pattern from the interior structure to the surface of the sample is transferred by heat diffusion. The “thermal wave” can be represented as a superposition of wave trains having a certain wavenumber or frequency in Fourier k-space or ω-space, respectively [9]. Instead of a cut-off time for the whole signal, the decrease of the Kullback–Leibler divergence in Fourier space gives a criterion for a cut-off wavenumber or a cut-off frequency as an upper limit, where all the information about the Fourier-component is lost because it cannot be distinguished from equilibrium according to Chernoff–Stein Lemma. In the past we have modeled heat diffusion by a Gauss–Markov process in Fourier space and found a principle limit for the spatial resolution [9]. Using the information loss and entropy production for a kicked process, it will be shown in Sect. 2 that the spatial resolution depends just on the macroscopic mean-value-equations and is independent of the actual stochastic process, as long as the macroscopic equations describe the mean heat flow and therefore also the mean entropy production as the mean heat flow divided by the temperature. General limits of spatial resolution are derived in Sect. 3 from the diffusion equation by using cut-off wavenumbers or frequencies from stochastic thermodynamics.

Information Loss and Entropy Production for Kicked Processes

In active pulse thermography, embedded structures are detected by heating the sample surface or the structures with a short excitation pulse, which “kicks” the sample out of its equilibrium. The kick is a sudden temperature rise due to, e.g., optical absorption or electromagnetic induction at the sample structures. For the reconstruction very often the temperature distribution just after the kick is the “kick magnitude” which should be reconstructed, like in photothermal depth profiling of the first kind [10]. But the present theory is valid also for other applications, e.g., determining the thickness of sample sheets of opaque materials, like metals. When lightening the surface with a short light pulse the heat is absorbed in a very thin surface layer. The absorbed heat diffuses into the material, is “reflected” at the back plane (or at voids in the sample), and measured as a temperature change of the surface. For the thermodynamic resolution limit for such a case, the depth a as given in (13) is the doubled thickness of the sheet as the thermal wave travels back and forth.

To derive the connection between information loss and entropy production, a generalization of the second law and of Landauers principle for states arbitrarily far from equilibrium given recently by Hasegawa et al. [11, 12] and by Esposito and Van den Broeck in [13] is used. The main idea to deal with a non-equilibrium state pt is to perform a sudden transition from the known Hamiltonian H with equilibrium state peq to a new one H, such that the original non-equilibrium state becomes a canonical equilibrium with respect to H. The average amount of irreversible work for this quench turns out to be the Kullback–Leibler divergence D(pt||peq) times kBT, where kB is the Boltzmann constant and T is the temperature of the system. The mean irreversible work for this quench is used either to change the entropy ΔS of the kicked system itself or to heat the surrounding by Hpt-Heq. For macroscopic samples it is shown in the ‘Appendix’ that the change in the system entropy ΔS can be neglected compared to (Hpt-Heq)/T as fluctuations in macroscopic systems are small compared to the changes in the mean value. Therefore, the information kBD(pt||peq) about the non-equilibrium state any time t after the kick can be approximated by the mean entropy production described by the diffusion equation.

In information , kBD(pt||peq) can be identified as the amount of information that needs to be processed to switch from the known equilibrium distribution peq (no kick) to the distribution pt a time t after the kick [8]. If the logarithm to the base 2 is taken instead of the natural logarithm in (1), D(pt||peq) measures the average number of bits needed to describe the kick magnitude, if a coding scheme is used based on the given distribution peq rather than the “true” distribution pt [8]. The information theoretical interpretation of D(pt||peq) according to the Chernoff–Stein Lemma states that pt cannot be distinguished from peq at the cut-off time tcut if D(ptcut||peq) becomes smaller than ln1/ε/n (see Sect.  1). In the present work the error level ε is chosen in a way that at the cut-off time the signal amplitude becomes less than the thermodynamic fluctuations (=noise level). In the next section this will be applied in the Fourier space to determine a cut-off wavenumber or a cut-off frequency.

Cut-Off Wavenumber and Frequency in Fourier Space

In this section, two different inverse problems for heat diffusion are presented. For the first inverse problem, the initial temperature profile Tx,0 at a time t=0 should be reconstructed from the measured temperature T(x,tm) at one specific time tm>0. For the second inverse problem, the temperature Txm,t is measured at a specific point xm, usually at the surface of the sample, for all times t>0 and from that data, the temperature T0,t for the point x=0 is reconstructed. The second inverse problem is more adequate for thermography, where the temperature can be measured only on the surface. Nevertheless, the first inverse problem is very instructive and therefore it is also presented here.

In active thermography, the macroscopic mean-value-equation for the temperature Tx,t as a function of time t and space x after the kick, which is a short heating pulse, is the diffusion equation,

2x2-1αtTx,t=0 2

with α is the thermal diffusivity. For simplicity x has only one dimension, but the same procedure can be used for a more dimensional space. The bilateral Fourier transform over space and its inverse is

T^k,t=-Tx,texpikxdxTx,t=12π-T^k,texp-ikxdk, 3

where i=-1 and k=2π/λ is the wavenumber, with the wavelength λ. The wavelength quantifies the spatial resolution of Tx,t if the wavenumber in the integral of the Fourier transform is not taken till infinity but only to a limited wavenumber. This limitation of the wavenumber is caused by a lower cut-off-time for higher wavenumbers, where the Kullback–Leibler divergence D gets too small, so that the wave trains with a higher wavenumber cannot be distinguished from equilibrium.

For the first inverse problem we could show in [7] that for adiabatic boundary conditions, where the sample is thermally isolated, the eigenfunctions are cosine-functions and in k-space (Fourier transform over space), the temperature evolution in time is a simple multiplication:

T^k,t=T^k,0exp-k2αt. 4

The eigenvalues exp-k2αt decrease with higher wavenumbers k “exponentially,” and for the inverse problem the multiplication with exp+k2αt as a huge number makes the reconstruction unstable for higher wavenumbers. Therefore, regularization was used, either truncated singular value decomposition (SVD) or Thikonov regularization. Results for both regularization methods were compared in [9]. The choice of an adequate regularization parameter, that is the cut-off value for the SVD or the trade-off parameter between the original ill-conditioned problem and the smoothness of the solution as an additional assumption for the Thikonov regularization, is critical. Li Voti et al. have described regularization by truncated SVD and genetic algorithms for photothermal depth profiling [10], where the influence of the number of used singular vectors on the reconstructed heat source profile is described. The L-curve method was used to find the optimum value of used singular vectors. Several groups, e.g., the group of Salazar [14], have investigated the reconstruction of thermal conducting depth-profiles from thermography data, e.g., by Thikonov regularization. This so-called photothermal inverse problem of the second kind is for small variations of the sample thermal properties as a function of depth mathematically the same problem as reconstructing the heat source profile [10]. In [7] the L-curve method was used to find the regularization parameter for Thikonov regularization, and in [9] a certain stochastic process, the Ornstein–Uhlenbeck process, was used to derive the cut-off wavenumber for the SVD. In both cases, the cut-off wavenumber kcut gave the same result as we derive in Eq. 7, but now without choosing a regularization parameter or using a specific stochastic process.

As initial condition at a time t=0, the delta function Tx,0=δx is taken, to be sure to get the imaging resolution and not a convolution with the initial structure (see Sect. 1), which results in a constant Fourier transform T^k,t=0=1. All wavenumbers till infinity are present, which gives the best spatial resolution. λ going to zero means that even peaks which are only separated by a very small distance can be still reconstructed as two separate peaks. After a certain time t one gets from thermodynamics the mean entropy in k-space (e.g., [2, 9]) proportional to T^k,t2, which shows an exponential decay in time with exp-2k2αt, see Eq. 4;

ΔSkt=12kBSNR2exp-2k2αt 5

with the signal-to-noise ratio SNR. The wavenumber kcut is determined by using the Chernoff–Stein Lemma,

ΔSkcuttkBDkcut(pt||peq)kB1nln1ε 6

If error ε is set to 1/e and n=1 for one measured temperature one gets that the cut-off wavenumber kcut is just the wavenumber for which at a time t the signal T^kcut,t gets less than the noise level (Fig. 1). Then

kcut=lnSNRαt 7

The reconstructed signal Trx,t is the Fourier transform of a rectangular function, which gives a sinc-function (see Fig. 1),

Trx,t=12π-kcutkcutexp-ikxdk=1πsinkcuttxx. 8

The resolution δrt is the “width” of the reconstructed signal and is taken as the distance between the zero points of the reconstructed signal Trx,t – to be on the save side – which is the wavelength corresponding to the wavenumber kcut,

δrt=2πkcutt=2παtlnSNR. 9

This is the same result as derived in [9] where we assumed that the thermal diffusion in k-space is an Ornstein–Uhlenbeck process, but in the derivation above the knowledge of the specific stochastic process is not necessary. The resolution given in Eq. 9 depends only on the diffusion equation (Eq. 2) as the equation for the mean value of the stochastic process.

Fig. 1.

Fig. 1

Temperature in k-space: T^k,t shows an exponential decay in time with exp(-k2αt). At kcut it goes below the noise level. For the reconstruction only the wavenumbers less than kcut are taken

Usually in thermography the temperature is not measured on the whole sample x at a certain time t, but at a certain x—which is usually the sample surface—the temperature is measured at several times t. Instead of a Fourier transform (Eq. 3) to k-space, a bilateral Fourier transform in ω-space is performed:

T~x,ω=-Tx,texp-iωtdtTx,t=12π-T~x,ωexpiωtdω. 10

Now for a certain depth a from the Fourier transform of the diffusion equation, which is the Helmholtz equation for T~x,ω, a cut-off frequency ωcut is determined by using the Chernoff–Stein Lemma analog to Eq. 6:

ωcut=2αlnSNRa2 11

This is consistent to a damping with the thermal diffusion length μ=2α/ωcut, where the signal is damped by a factor of exp-a/μ to the noise level during propagation along the length a (e.g., [1] or [3]).

The reconstructed signal Tra at x=0 is

Tra=1π0ωcut1ωαcosaω2α+π4exp-aω2αdω=2πsinkaaaexp-kaawithka=ωcut2α=lnSNR/a 12

Like in Eq. 9 the resolution δra is taken as the distance between the zero points of the reconstructed signal Tra, which is the wavelength corresponding to the wavenumber k(a),

δra=2πka=2πalnSNR, 13

which is proportional to the depth a and independent from the thermal diffusivity α. The resolution is proportional to the thermal diffusion length at the cut-off frequency.

This is not only the limit for the depth resolution, but also for the lateral resolution as can be estimated by using the formula for the Abbe diffraction limit δAbbe=λ/2sinθ. The minimal wavelength λ=2π2α/ωcut of the thermal wave as a certain point at the surface is now a function of the angle θ, as the path d at a certain angle θ is stretched to a=d/cosθ (Fig. 2). This reduces the cut-off frequency ωcut by a factor of cosθ2, which gives for the Abbe limit,

δAbbeθ=λ2sinθ=πaln(SNR)sinθ=πdln(SNR)sinθcosθ. 14

The term sinθcosθ is maximal for a value of θ=45, where its value is 1/2:

δAbbeθ=45=2πdlnSNR 15

This expression for the lateral resolution is the same as given in (13) for the depth resolution. The detection circle at the surface has a radius of d. At a higher distance from the center (θ>45), the resolution is cut because of diffusion. Compared to optics, the influence of the minimal wavelength varying with θ on the Abbe resolution limit is not fully clear. This might result in a better resolution as estimated in (15); see also the discussion in Sect. 4.

Fig. 2.

Fig. 2

Thermal waves which do not go directly to the surface but at an angle θ have a longer path a and therefore their minimal wavelength which can be detected on the surface is increased by a factor 1/cosθ

Discussion, Conclusions and Outlook

Equations 13 and 15 are the main result of this work. For thermographic depth profiling, the axial and the lateral resolution are proportional to the depth and inversely proportional to the natural logarithm of the signal-to-noise ratio. The resolution does not depend on the thermal diffusivity α. This result could be derived from the diffusion equation by using a delta-source at a certain depth and setting the entropy production caused by thermal diffusion equal to the information loss. The delta-source in space as a point source is used to be sure to have a smaller structure than the blurring from image resolution. The derived resolution is valid for any initial temperature profile as the heat diffusion equation is a linear one. No specific model about the stochastic process of the fluctuations and about the distribution densities around the mean values was necessary to get this result. In earlier work [9] we derived Eq. 9 by assuming that the thermal diffusion in k-space is a special Gauss–Markov process (Ornstein–Uhlenbeck process). The same result in k-space was derived in [7] by choosing for the Thikonov regularization the regularization parameter equal to the inverse signal-to-noise ratio, which was justified by the “L-curve” method. In the present publication, e.g., the cut-off wavenumber kcut in Eq. 7 was derived by using a recently gained thermodynamic result: that the mean entropy production is equal to the information loss. In comparison to previous publications on photothermal depth profiling, e.g., by Li Voti [10], the groups of Salazar [14], or Majaron et al. [15], where the L-curve or the Morozov discrepancy principle was used to determine the regularization parameters, we could derive a principle limit from thermodynamics and no additional assumption for regularization or of a specific model for the stochastic process was necessary.

This result is consistent with work about interference of thermal waves [16]. The wavenumber for a thermal wave with frequency ω is ω/2α, which is the inverse thermal diffusion length (e.g., [1]). Using the frequency ωcut from Eq. 11 the wavelength corresponding to this wavenumber is just the resolution in Eq. 13. Also experimental results indicate the linear relation between depth and resolution, e.g., in [15]. In the derivation of Eq. 15, using the formula for the optical Abbe diffraction limit, the minimal wavelength varies with the aperture-angle. This is different to optics, where the whole aperture has the same wavelength. Therefore, the resolution given in Eq. 15 might be an upper limit. This has to be verified by additional two- or three-dimensional models and simulations.

For future work instead of a short kick also other excitation patterns can be considered to evaluate the resolution limits. Busse used a phase angle measurement with a sinusoidal excitation in lock-in thermography to get a better resolution in depth [17]. Sreekumar and Mandelis proposed a chirped excitation pattern like in radar technology to get a better spatial resolution at a certain depth [18]. Using the mean entropy production it should be possible to give also thermodynamic resolution limits for those excitation patterns and compare them to resolution limits for single short pulse excitation. Different and more realistic boundary conditions for the surface should be implemented (e.g., third kind in [3]) to be comparable to experimental results. First results for such a third kind boundary condition are given in [7], where we have heated a metal foil embedded in epoxy resin by a short eddy current pulse. Instead of the k-space, where the eigenfunctions are, e.g., cosine-functions, new base functions fn were used (mathematically the same functions as in quantum physics for the finite potential well, e.g., Griffiths [19]).

Acknowledgments

This work has been supported by the Austrian Science Fund (FWF), Project Number S10503-N20, by the Christian Doppler Research Association, the Federal Ministry of Economy, Family and Youth, the European Regional Development Fund (EFRE) in the framework of the EU-program Regio 13, and the federal state of Upper Austria.

Appendix

To show the equality of information loss and entropy production for the kicked process we follow the derivation of Kawai et al. [20] and Gomez-Marin et al. [21] in spirit to that of the Jarzynski [22] and Crooks [23, 24] equalities, but instead of varying a control parameter λ from an initial value to a final value along a given protocol as in [20] we assume to start from a canonical equilibrium state at a temperature T and “kick” it at time t=0. We consider a Hamiltonian Hx. x is a point in phase space, where x=q,p represents the set of position and momentum coordinates. Before the kick the equilibrium probability distribution to observe the state x is given by a Boltzmann distribution peqx=exp-βHx/Z. Z is the normalization factor (partition function) and β:=1/(kBT). The distribution density just after the short kick is pkickx=peqx-x0. The applied work W for a kick x0 is Hx+x0-Hx for a phase point x at t=0. With

eβW=e-βHxe-βHx+x0=peqxpeqx+x0 16

one gets by averaging the logarithm ln of Eq. 16 and substituting x=x+x0,

βWeq=lnpeqxpeqx+x0peqxdx=lnpeqx-x0peqxpeqx-x0dx 17

or multiplied by kB and using the definition of the Kullback–Leibler divergence Eq. 1,

Informationloss=kBD(pkick||peq)=WeqT=1THkick-Heq=meanentropyproduction 18

In this equation the distribution density pkick is a shifted equilibrium density corresponding to a Hamiltonian Hx-x0. The partition function Z is the same for pkick and peq, therefore ΔF is zero. More applied work W in Eq. 18 means that the distribution pkick just after the kick is “more distant” from the equilibrium distribution peq. The equilibrium is the state where all the information about the kick is lost and all the applied work has been dissipated.

A certain time t>0 after the kick only a part of the applied work has been dissipated and not all the information about the kick magnitude has been lost. To describe a state at a time t, which is usually not an equilibrium state, it is not necessary to know pt for all microscopic variables, but x as a set of reduced variables which captures the information on the work is sufficient. In Fig. 3 the forward process is illustrated and the distributions are sketched. We propose that Eq. 18 can be written in a time-dependent form for all the intermediate non-equilibrium states after the kick with a distribution density pt in a good approximation.

Fig. 3.

Fig. 3

Illustration of the forward process: a system in equilibrium state peq with mean value at x=0 is kicked at a time t=0 with magnitude x0 to a state pkick far from equilibrium, followed by a dissipative process back to equilibrium. x is a set of reduced variables which captures the information on the work. The arrows connecting pkick at time t=0,pt at t>0, and peq at t indicate the tube of trajectories, which is “thin” for macroscopic systems as deviations from the mean values xt are small

Using the definition of the Kullback–Leibler divergence D and peqx=exp-βHx/Z, one gets

Information loss=kBΔD:=kB(D(pkick||peq)-D(pt||peq))=ΔS+1THkick-Hpt=mean entropy production. 19

The entropy production is the total entropy change ΔS minus the entropy flow, which is the negative of the dissipated heat Hkick-Hpt divided by the temperature. The total entropy change ΔSSt-Skick is the difference in the Shannon entropy of pt and pkick:

St:=-kBptxlnptxdx. 20

As pkick is only a “shifted” equilibrium distribution peq, the two distributions have the same entropy: Skick=Seq. Equation 19 can be also deduced directly from Eq. 12 given by Esposito and Van den Broeck in [13] by taking a time-invariant equilibrium distribution. Then Wirr=0, which is the sum of the entropy production ΔiS=ΔS-Q/T and the information ΔI=-kBΔD. Q is the heat coming from the heat bath, in our case the negative of the dissipated heat Hkick-Hpt.

Equation 19 describes that the information ΔI about the kick magnitude decreases and the information loss increases during the evolution of time and is equal to the mean dissipated work divided by the temperature plus the entropy change ΔS. After a long time t the distribution pt converges to the equilibrium distribution peq and from Eq. 19 one gets Eq. 18 using ΔS=Seq-Skick=0. This is also true in the linear regime near equilibrium as the shape of the distribution pt and therefore St does not change and is equal to Seq. But also far from equilibrium in a good approximation for all the intermediate states ΔS1/THkick-Hpt, as for a macroscopic system fluctuations are small compared to the mean value (see Fig. 3 showing a “thin” tube of trajectories). Then the distribution pt has nearly no overlap with peq and one gets

D(pt||peq)=ptlnptdx-ptlnpeqdx-ptlnpeqdx=βHpt+lnZ. 21

The entropy term ptlnpt in Eq. 21 can be neglected because for all regions in the phase space where pt is different from zero and which contribute to the integral, peq is nearly zero and lnpt can be neglected compared to lnpeq. The same approximation in (21) is valid for pt=pkick and therefore the total entropy change ΔS in Eq. 19 can be neglected:

kB(D(pkick||peq)-D(pt||peq))1THkick-Hpt. 22

Subtracting Eq. 22 from Eq. 18 one gets for the information It about the kick magnitude:

It=kBD(pt||peq)1THpt-Heq1T(Hxt-Hx=0), 23

It can be identified as the amount of information that needs to be processed to switch from the known equilibrium distribution peq (no kick) to the distribution pt a time t after the kick [8]. All the information about the kick can be approximated by the mean work, which has not been dissipated yet, divided by the temperature. This is the entropy production according to heat diffusion. After a long time all the energy has been dissipated and no information about the kick magnitude is available. The second approximation in Eq. 23 uses that for a macroscopic system the fluctuations are small and the mean of the Hamiltonian is approximately the Hamiltonian of the mean value xt (Fig. 3).

References

  • 1.Rosencwaig A. Non-destructive evaluation. In: Mandelis A, editor. Progress in Photothermal and Photoacoustic Science and Technology. New York: Elsevier; 1992. [Google Scholar]
  • 2.de Groot SR, Mazur P. Non-Equilibrium Thermodynamics. New York: Dover Publications; 1984. [Google Scholar]
  • 3.Mandelis A. Diffusion-Wave Fields: Mathematical Methods and Green Functions. New York: Springer; 2001. [Google Scholar]
  • 4.J. Hadamard, Sur les problèmes aux dérivés partielles et leur signification physique. Princet. Univ. Bull. 13, 49 (1902)
  • 5.Groetsch CW. Inverse Problems in the Mathematical Sciences. Braunschweig: Vieweg; 1993. [Google Scholar]
  • 6.Hansen PC. Rank-Deficient and Discrete Ill-Posed Problems: Numerical Aspects of Linear Inversion. Philadelphia: SIAM; 1987. [Google Scholar]
  • 7.P. Burgholzer, F. Camacho-Gonzales, D. Sponseiler, G. Mayer, G. Hendorfer, Information changes and time reversal for diffusion-related periodic fields, in Biomedical Optics: Photons Plus Ultrasound: Imaging and Sensing, SPIE (2009), pp. 7177–76
  • 8.Cover TM, Thomas JA. Elements of Information Theory. 2. New York: Wiley; 2006. [Google Scholar]
  • 9.Burgholzer P, Hendorfer G. Int. J. Thermophys. 2013;34:1617. doi: 10.1007/s10765-013-1513-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Li Voti R, Sibilia C, Bertolotti M. Int. J. Thermophys. 2005;26:1833. doi: 10.1007/s10765-005-8599-y. [DOI] [Google Scholar]
  • 11.Hasegawa HH, Ishikawa J, Takara K, Driebe DJ. Phys. Lett. A. 2010;374:1001. doi: 10.1016/j.physleta.2009.12.042. [DOI] [Google Scholar]
  • 12.Takara K, Hasegawa HH, Driebe DJ. Phy. Lett. A. 2010;375:88. doi: 10.1016/j.physleta.2010.11.002. [DOI] [Google Scholar]
  • 13.Esposito M, Van den Broeck C. Europhys. Lett. 2011;95:40004. doi: 10.1209/0295-5075/95/40004. [DOI] [Google Scholar]
  • 14.Apinaniz E, Mendioroz A, Salazar A, Celorrio R. J. Appl. Phys. 2010;108:064905. doi: 10.1063/1.3475498. [DOI] [Google Scholar]
  • 15.Milanič M, Serša I, Majaron B. Phys. Med. Biol. 2009;54:2829. doi: 10.1088/0031-9155/54/9/016. [DOI] [PubMed] [Google Scholar]
  • 16.Gleiter A, Mayr G. Infrared Phys. Technol. 2010;53:288. doi: 10.1016/j.infrared.2010.04.001. [DOI] [Google Scholar]
  • 17.Busse G. Appl. Phys. Lett. 1979;35:759. doi: 10.1063/1.90960. [DOI] [Google Scholar]
  • 18.Sreekumar K, Mandelis A. Nat. Photonics. 2014;8:635. doi: 10.1038/nphoton.2014.111. [DOI] [Google Scholar]
  • 19.D.J. Griffiths, Introduction to Quantum Mechanics (Prentice Hall, Upper Saddle River, NJ, 2005)
  • 20.Kawai R, Parrondo JMR, Van den Broeck C. Phys. Rev. Lett. 2007;98:080602. doi: 10.1103/PhysRevLett.98.080602. [DOI] [PubMed] [Google Scholar]
  • 21.Gomez-Marin A, Parrondo JMR, Van den Broeck C. Europhys. Lett. 2008;82:50002. doi: 10.1209/0295-5075/82/50002. [DOI] [Google Scholar]
  • 22.Jarzynski C. Phys. Rev. Lett. 1997;78:2690. doi: 10.1103/PhysRevLett.78.2690. [DOI] [Google Scholar]
  • 23.Crooks GE. J. Stat. Phys. 1998;90:1481. doi: 10.1023/A:1023208217925. [DOI] [Google Scholar]
  • 24.Crooks GE. Phys. Rev. E. 1999;60:2721. doi: 10.1103/PhysRevE.60.2721. [DOI] [PubMed] [Google Scholar]

Articles from International Journal of Thermophysics are provided here courtesy of Springer

RESOURCES