Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Apr 1.
Published in final edited form as: Stat Probab Lett. 2015 Apr 1;99:149–155. doi: 10.1016/j.spl.2014.12.028

Central limit theorems under special relativity

Ian W McKeague 1
PMCID: PMC4365789  NIHMSID: NIHMS670250  PMID: 25798020

Abstract

Several relativistic extensions of the Maxwell–Boltzmann distribution have been proposed, but they do not explain observed lognormal tail-behavior in the flux distribution of various astrophysical sources. Motivated by this question, extensions of classical central limit theorems are developed under the conditions of special relativity. The results are related to CLTs on locally compact Lie groups developed by Wehn, Stroock and Varadhan, but in this special case the asymptotic distribution has an explicit form that is readily seen to exhibit lognormal tail behavior.

Keywords: heavy-tailed distributions, random walks on locally compact Lie groups, κ-deformations, Maxwell, Boltzmann, Ornstein, Uhlenbeck

1. Introduction

In special relativity, the momenta of particles on parallel trajectories combine according to the κ-addition rule

xκy=x1+κ2y2+y1+κ2x2, (1.1)

where 0 < κ < 1 is a dimensionless parameter representing the reciprocal of the speed of light in the ambient space (with all variables expressed in dimensionless units). The classical Galilean addition law for momenta appears in the limit as κ → 0.

The purpose of this note is to derive extensions of classica lcentral limit theorems under κ-addition, and to relate the resulting (non-Gaussian) limit distributions to an existing parametric family, namely hyperbolic sine transformations of normal distributions. It is also shown that such distributions arise as the limit distribution of a certain type of relativistic Ornstein–Uhlenbeck process.

Background on the κ-addition rule for momenta in special relativity, and corresponding operations for velocity and energy, can be found in Kaniadakis (2002, 2006, 2009a,b, 2013), along with a comprehensive survey of the literature on various other κ-deformations, and references to applications. These include a formal extension of the Maxwell–Boltzmann distribution (used to model the energy of large systems of particles that only interact through elastic collision and are at thermal equilibrium), constructed by replacing the standard exponential by the κ-deformed exponential expκ(x) = exp (sinh−1(κx)/κ), and providing power-law tail behavior with exponent 1/κ.

General formulations of central limit theorems for random walks on locally compact Lie groups, of which the real line under κ-addition is a simple example, were first established by Wehn (1962) and Stroock and Varadhan (1973). These results require indirect moment conditions on the random elements after they are logarithmically mapped into the associated Lie algebra (tangent space at the identity), and the limit distribution is described in terms of the infinitessimal generator of a semi-group of probability measures on the Lie group. Despite their great generality, such results fall short of being able to resolve whether classical CLTs extend in full to κ-addition under only standard conditions. Explicit CLTs are currently only available for random walks on certain special Lie groups, e.g., for the special orthogonal group SO(3), which is of interest in the field of directional statistics. In particular, Qiu et al. (2014) recently established an explicit CLT for symmetric random elements of SO(3), with an isotropic Gaussian distribution as the limit.

We establish complete relativistic extensions of the classical CLTs under standard conditions. These results are established by showing that “relativistic warping” is asymptotically equivalent to taking the hyperbolic sine of a row-sum in a transformed array, and that the standard CLT conditions on the original array suffice for applying the Lindeberg–Feller theorem to this transformed array. The proofs are elementary, avoiding the need for any background on Lie groups or infinitesimal generators. The limiting distributions have an explicit form because the exponential map from the associated Lie algebra into the Lie group in this special case takes an especially tractable form.

Our main results are presented in Section 2. Background on observed lognormal tail-behavior in astrophysical sources, and a relativistic version of the Ornstein–Uhlenbeck process that might help explain such observations, are discussed in Section 3.

2. Relativistic CLTs

In this section we first develop CLTs that apply to the relativistic averaging of momenta, and then consider extensions to the relativistic averaging of velocities and energies, along with a functional CLT.

2.1. Momenta

Our first result is a full extension of the classical CLT to κ-sums, only requiring that the iid summands have finite second moment.

Theorem 1

Let {Xi} be a sequence of iid zero-mean r.v.s with variance σ2 < ∞, and let Xni=Xi/n. Then

Xn1κXn2κκXnnd1κsinh(κZ), (2.1)

where ZN(0,σ2).

Proof

The Lie group ( ,κ) is group isomorphic to its Lie algebra (ℝ, +), with the logarithmic map xsinh1(κx) providing the isomorphism. That is,

sinh1[κ(xyκ)]=sinh1(κx)+sinh1(κy), (2.2)

which can be directly checked (without recourse to the theory of Lie groups) using the expression sinh1(x)=log(x+1+x2). This provides the following decomposition of the κ-sum:

Xn1κXn2κκXnn=1κsinh(κTn+μn), (2.3)

where Tn=1κi=1n(YniEYni),Yni=sinh1(κXi/n), and μn=nEYn1.

Since nYn1κX1 a.s. and |nYn1|κ|X1|, by dominated convergence we have μn/n=nEYn1κEX1, and

Var(Tn)=1κ2i=1nVar(Yni)=1κ2{E(nYn1)2[E(nYn1)]2}Var(X1)=σ2.

Also, if t is a continuity point of the distribution of κ|X1EX1|, then by dominated convergence

nE([Yn1EYn1]21{n|Yn1EYn1|>t})=E[nYn1μn/n]2E([nYn1μn/n]21{|nYn1μn/n|t})κ2E([X1EX1]21{κ|X1EX1|>t}).

Thus, noting that the last term above tends to zero as t, it follows that

i=1nE([Yn1EYn1]21{|Yn1EYn1|>ε})=nE([Yn1EYn1]21{n|Yn1EYn1|>εn})0

for every ε > 0, so the Lindeberg condition holds and TndN(0,σ2) by the Lindeberg–Feller theorem.

The first derivative of sinh−1(x), namely 1/1+x2, is bounded between 1 − |x|α and 1 for any 0 < α ≤ 2. By taking α = 1 when |x| ≥ 1 and α = 2 when |x| < 1, then integrating, we obtain | sinh−1(x) − x| ≤ x2 min(|x|, 1). Using this inequality and the condition EX1 = 0 (not used up to now), we have

|μn|=|nE[Yn1κX1/n]|nE|sinh1(κX1/n)κX1/n|κ2E[X12min(κ|X1|/n,1)],

which tends to zero by dominated convergence. The result then follows by (2.3) and the continuous mapping theorem. □

Example 1

Let the Xi be iid copies of a discrete r.v. X having distribution formed by standardizing (to have zero-mean and unit variance) the probability mass function

pk=Ck3(logk)(log logk)2,k=3,4,

where C is a normalizing constant. Note that EX2 < ∞ and Theorem 1 is applicable, but E[X2 log(1 + |X|)] = , so it is a borderline case. Also, X is not symmetric and the bias μn in (2.3) does not vanish. As proved in Theorem 1, μn 0, but its rate of convergence to zero is very slow, as seen in Figure 1, indicating that the distribution of the κ-sum may be substantially skewed, even for large samples. Curiously, the bias does not tend monotonically to zero, but initially becomes more severe (for n ≤ 64).

Figure 1.

Figure 1

The behavior of the bias μn as a function of log2 n in Example 1 (red solid line) and Example 2 (green dashed line), when κ = σ = 1.

In contrast, μn tends very rapidly to zero in the following example, see Figure 1.

Example 2

Let X=2 with probability 1/3, and 1/2 with probability 2/3. This X is not symmetric but has finite moments of all orders (as well as zero-mean and unit variance).

Note that the asymmetry in Example 1, rather than the borderline moment condition, is the prime cause of the slow rate of convergence of the bias to zero. For CLTs on general Lie groups, a suitable symmetry assumption can have the convenient effect of removing bias. For example, in connection with the explicit CLT on the (compact) Lie group SO(3) mentioned in the Introduction, the main result of Qiu et al. (2014) assumes that the angular distribution of the random rotation is symmetric on (−π, π]; the possibility of an extension to the asymmetric case was not discussed.

Our second result extends the classical Lindeberg–Feller theorem and contains Theorem 1 as a special case, although the proof is less revealing in the sense that the bias is no longer made explicit.

Theorem 2

If {Xni, i = 1, …, n} is a triangular array of independent zero-mean r.v.s such that i=1nE[Xni21{|Xni|>ε}]0 for all ɛ > 0, and i=1nVar(Xni)σ2<, then (2.1) holds.

Proof

Consider the following alternative decomposition of the κ-sum as

Xn1κXn2κκXnn=1κsinh(κSn+Rn),

where Sn=i=1nXni and Rn=i=1n{sinh1(κXni)κXni}. By the Lindeberg–Feller theorem, SndZN(0,σ2). The result then follows using the continuous mapping theorem and Slutsky’s lemma if the remainder Rnp 0. By the same bound on | sinh−1(x)–x| used to handle the bias term μn in the proof of Theorem 1,

E|Rn|κ2i=1nE[Xni2min(κ|Xni|,1)]=κ2(An+Bn),

where for any given ε > 0,

An=i=1nE[Xni2min(κ|Xni|,1)1{|Xni|>ε}]i=1nE[Xni21{|Xni|>ε}]0

by the Lindeberg condition, and

Bn=i=1nE[Xni2min(κ|Xni|,1)1{|Xni|ε}]κεi=1nEXni2

if ε < 1/κ. Thus lim sup Bn ≤ κ ε σ2, and since ε can be arbitrarily small, we also have Bn 0, completing the proof. □

Our approach has involved showing that a Lindeberg–Feller type result holds for the transformed array sinh−1(κXni) under standard conditions on the underlying array {Xni}. In contrast, for the Lie group CLTs mentioned in the Introduction, moment conditions are needed on the transformed array (logarithmically mapped into the Lie algebra); in our special setting, those conditions are a consequence of the simpler conditions on the underlying array.

We conclude this section by noting that a full extension of the classical strong law of large numbers also holds. If Xi are iid with finite mean and Xni = Xi/n, then

Xn1κXn2κκXnn1κsinh(κEX1)a.s. (2.4)

The proof uses a similar argument to what we have already seen, except the relevant remainder term is handled using the inequality | sinh−1(x) − x| ≤ |x| min(|x|, 1).

2.2. Arcsinh-normal distributions

We refer to a r.v. of the form a sinh X, where X is normally distributed and a is a constant, as arcsinh-normal, in parallel with the term lognormal. Figure 2 shows the pdf of 1κsinh(κZ+μ), where Z ~ N (0, 1) for various choices of κ and μ. The distribution is close to normal for small κ, but has lognormal tail behavior; the lognormal tails become especially apparent as μ increases.

Figure 2.

Figure 2

The pdf of 1κsinh(κZ+μ) compared with the pdf of Z ~ N(0, 1) (dashed), for μ = 0 (blue), μ = 1 (green) and μ = 2 (red), for κ = .5 (left panel) and κ = .9 (right panel).

Arcsinh-normal distributions form a subclass of the translation system SU introduced by Johnson (1949). Various classical characterization results for normal distributions can be translated immediately into arcsinh-normal versions. For example, if X and Y are independent and their κ-sum is arcsinh-normal, then both X and Y must be arcsinh-normal; this is a consequence of (2.2) and Cramér’s theorem. Likelihood-based methods of inference for such distributions are available (Jones and Pewsey 2009), but, as far as we know, their appearance as limit laws in special relativity has not previously been noted.

2.3. Velocity and energy

Under special relativity, the velocity u(q) and energy ℰ (q) of a particle with momentum q are given (in dimensionless units) by

u(q)=q1+κ2q2,E(q)=1κ21+κ2q2, (2.5)

and the corresponding κ-sum rules are

u1κvu2=u1+u21+κ2u1u2

and

E1eκE2=κ2E1E2+1κ2(κ4E121)(κ4E221),

respectively, see Kaniadakis (2006). The previous CLTs for momenta can be translated into CLTs for velocity and energy as follows.

Let Uni = u(Xni) and ℰni = ℰ(Xni), where {Xni, i = 1, …, n} is a triangular array of the form considered earlier. Using the identity (cf. Kaniadakis 2006)

Un1vκUn2vκvκUnn=u(Xn1κXn2κκXnn),

applying the above theorems (and noting that u(·) is continuous) we have

Un1vκUn2vκvκUnndsinh(κZ)κ1+sinh2(κZ)=1κtanh(κZ)

and similarly

En1eκEn2eκeκEnnd1κ21+sinh2(κZ)=1κ2cosh(κZ).

The pdfs of these limiting distributions are illustrated in Figs. 3 and 4.

Figure 3.

Figure 3

The pdf of the limiting distribution for velocity: κ = .33 (blue), κ = .5 (purple), κ = .7 (green), κ = .9 (red) and κ = .99 (violet). The vertical dashed lines indicate the lower and upper bounds (±1/κ) on the velocity.

Figure 4.

Figure 4

The pdf of the limiting distribution for energy, with the same color coding as in Figure 3. The vertical dashed lines indicate the lower bounds (1/κ2) on the energy.

2.4. Relativistic invariance principle

Next we discuss a κ-sum version of the functional CLT (invariance principle) of Donsker (1951). As in Theorem 1, let {Xi} be a sequence of iid zero-mean r.v.s with variance σ2 < ∞, and set Xn,i=Xi/n. Then define the relativistic random walk for momentum as the process

Bn(t)=Xn,1κXn,2κκXn,ntfor0<t1,

where ⌈·⌉ is the ceiling function, and set Bn(0) = 0. Viewing Bn as a random element in the space of bounded functions on [0, 1] endowed with the uniform norm, under the conditions of Theorem 1 we have

Bn(t)dB(t)=1κsinh(κW(t)),

where W (t) is a Wiener process with infinitesimal variance σ2. This follows from the classical Donsker theorem using a similar expansion to (2.3) and noting that the drift term μn, now a function of t, converges uniformly to zero.

3. Relativistic Maxwell–Boltzmann distributions

The first relativistic extension of the Maxwell–Boltzmann distribution was due to Jüttner (1911), but in recent years various authors have questioned whether Maxwell–Boltzmann–Jüttner theory is adequate to explain the flux distributions of relativistic sources, especially those that expand into an infinite surrounding space (e.g., cosmic rays, quasars, gamma ray bursts, and X-rays from black hole objects), rather than being confined to a closed vessel. Indeed, it has been observed that lognormal distributions may provide better approximations in such systems, see, e.g., Ioka and Nakamura (2002); Gaskell (2004); Gandhi (2009); Kunjaya et al. (2011); Gladders et al. (2013).

Multiplicative processes have typically been used to explain lognormal distributions. Yet existing approaches (e.g., based on self-organized criticality) used to model the X-ray flux of black hole objects rely on additive processes, and produce power-law tail behavior rather than lognormal (see Kunjaya et al. (2011), who develop a more sophisticated multiplicative model to address this issue). By extending classical central limit theory to allow the type of addition that is relevant to special relativity, and that is relevant to an open system, our results may provide an alternative explanation for the lognormal behavior that is a common feature of relativistic particle systems.

Dunkel et al. (2007) showed that the Jüttner (and modified-Jüttner) distribution can be obtained from the maximum entropy principle under the constraint that the average (relativistic) energy at equilibrium is fixed; this is done by defining the entropy with respect to the Haar measure on the relevant state space (a locally compact group, on which the Haar measure is preserved under the group operation). The Haar measure for the real line under κ-addition is dq/1+κ2q2, and the arcsinh-normal distribution maximizes entropy with respect to this Haar measure when a constraint is placed on the second moment of rapidity: φ = sinh−1(κq) = tanh−1(κu), where q is the momentum and u ∈ (−1/κ, 1/κ) is the velocity. The addition rule for rapidities is the usual addition (for parallel trajectories), so this result can be obtained directly from the entropy-maximizing property of the normal distribution. However, it is not natural from the physical point of view to place a constraint on the second moment of the rapidity, in contrast to a constraint on the expected energy say (used to derive Jüttner type distributions).

A physically more compelling approach to deriving a relativistic Maxwell–Boltzmann distribution is to construct a relativistic Langevin equation driven by a Wiener process, and determine its limiting distribution. Many types of relativistic Ornstein–Uhlenbeck (OU) processes directly driven by a standard Wiener process have been proposed, and shown to converge to either Jüttner or modified-Jüttner stationary distributions, see Dunkel and Hänggi (2009) and Angst (2011) for references to this extensive literature.

We now describe a different type of relativistic OU process and show that it has an arcsinh-normal stationary distribution. This involves using the relativistic Wiener process discussed in the previous section to drive a κ-deformed version of the Langevin equation. Since the exponential map [x] ≡ sinh(κx)/κ is a field isomorphism from (ℝ, +,·) to its κ-deformed version, we have

[x]κ[y]=[xy]and[x]κ[y]=[x+y],

for κ-multiplication and κ-addition, respectively. Let Xt be an OU process on the real line satisfying the stochastic differential equation dXt = αXt dt + dWt, where α < 0 and Wt is a standard Wiener process. It is then easily seen that the transformed process Yt = [Xt] satisfies the following κ-deformed Langevin equation driven by the relativistic Wiener process Bt = [Wt]:

dkYt=βκYtκdktκdkBt,

where β = [α], and we have used the κ-differential dkYtYt+dtκYt as defined in Kaniadakis (2013). Formally, this means that Yt satisfies the stochastic integral equation

Yt=Y0κ0tβκYs1+κ2s2dsκBt,t0, (3.1)

where the Haar measure for the real line under κ-addition now plays a role because Yt is being treated as a stochastic process on this Lie group. The stationary distribution of Xt is normal, so the stationary distribution of Yt is arcsinh-normal. In related work, OU processes on Lie groups that are driven by a Wiener process on the associated Lie algebra have been studied by Baudoin et al. (2008), who showed the existence of “natural” OU processes with stationary distribution induced by the exponential map applied to the Wiener process at time 1.

To conclude, we have provided in (3.1) a natural and explicit construction of a relativistic OU process having a stationary distribution with lognormal tails, which may help explain the observed flux distributions of various astrophysical sources. Extensions of classical CLTs and OU processes to other Lie groups on the real line, such as the Lie group associated with Tsallis entropy (see Tempesta (2011) for background) can also be developed using our approach. This will be the topic of a future paper.

Acknowledgments

This research was partially supported by NSF Grant DMS-1307838 and NIH Grant R01GM095722-01. The author thanks Charles Hailey, Bruce Levin and Georg Hahn for helpful comments on early versions of the paper.

References

  1. Angst J. Trends to equilibrium for a class of relativistic diffusions. Journal of Mathematical Physics. 2011;52(11):113703. [Google Scholar]
  2. Baudoin F, Hairer M, Teichmann J. Ornstein–Uhlenbeck processes on Lie groups. Journal of Functional Analysis. 2008;255(4):877–890. [Google Scholar]
  3. Donsker MD. An invariance principle for certain probability limit theorems. Mem Amer Math Soc. 1951;6:1–10. [Google Scholar]
  4. Dunkel J, Hänggi P. Relativistic Brownian motion. Physics Reports. 2009;471(1):1–73. [Google Scholar]
  5. Dunkel J, Talkner P, Hänggi P. Relative entropy, Haar measures and relativistic canonical velocity distributions. New Journal of Physics. 2007;9:144. [Google Scholar]
  6. Gandhi P. The flux-dependent rms variability of X-ray binaries in the optical. The Astrophysical Journal Letters. 2009;697(2):L167. [Google Scholar]
  7. Gaskell CM. Lognormal X-ray flux variations in an extreme narrow-line Seyfert 1 galaxy. The Astrophysical Journal Letters. 2004;612(1):L21. [Google Scholar]
  8. Gladders MD, Oemler A, Dressler A, Poggianti B, Vulcani B, Abramson L. The IMACS Cluster Building Survey IV. The log-normal star formation history of galaxies. The Astrophysical Journal. 2013;770(1):64. [Google Scholar]
  9. Ioka K, Nakamura T. A possible origin of lognormal distributions in gamma-ray bursts. The Astrophysical Journal Letters. 2002;570(1):L21. [Google Scholar]
  10. Johnson NL. Systems of frequency curves generated by methods of translation. Biometrika. 1949;36:149–176. [PubMed] [Google Scholar]
  11. Jones MC, Pewsey A. Sinh-arcsinh distributions. Biometrika. 2009;96(4):761–780. [Google Scholar]
  12. Jüttner F. Das Maxwellsche gesetz der geschwindigkeitsverteilung in der relativthe-orie. Annalen der Physik. 1911;339(5):856–882. [Google Scholar]
  13. Kaniadakis G. Statistical mechanics in the context of special relativity. Phys Rev E. 2002;66:056125. doi: 10.1103/PhysRevE.66.056125. [DOI] [PubMed] [Google Scholar]
  14. Kaniadakis G. Towards a relativistic statistical theory. Physica A: Statistical Mechanics and its Applications. 2006;365(1):17–23. [Google Scholar]
  15. Kaniadakis G. Maximum entropy principle and power-law tailed distributions. The European Physical Journal B. 2009a;70(1):3–13. [Google Scholar]
  16. Kaniadakis G. Relativistic entropy and related boltzmann kinetics. The European Physical Journal A. 2009b;40(3):275–287. [Google Scholar]
  17. Kaniadakis G. Theoretical foundations and mathematical formalism of the power-law tailed statistical distributions. Entropy. 2013;15(10):3983–4010. [Google Scholar]
  18. Kunjaya C, Mahasena P, Vierdayanti K, Herlie S. Can self-organized critical accretion disks generate a log-normal emission variability in AGN? Astrophysics and Space Science. 2011;336(2):455–460. [Google Scholar]
  19. Qiu Y, Nordman DJ, Vardeman SB. A wrapped trivariate normal distribution and Bayes inference for 3-D rotations. Stat Sin. 2014;24(2):897–917. [Google Scholar]
  20. Stroock DW, Varadhan SRS. Limit theorems for random walks on Lie groups. Sankhyā Ser A. 1973;35(3):277–294. [Google Scholar]
  21. Tempesta P. Group entropies, correlation laws, and zeta functions. Phys Rev E. 2011;84:021121. doi: 10.1103/PhysRevE.84.021121. [DOI] [PubMed] [Google Scholar]
  22. Wehn D. Probabilities on Lie groups. Proc Nat Acad Sci USA. 1962;48:791–795. doi: 10.1073/pnas.48.5.791. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES