Skip to main content
Springer logoLink to Springer
. 2021 Feb 15;383(1):223–279. doi: 10.1007/s00220-021-03988-1

Convergence Rates for the Quantum Central Limit Theorem

Simon Becker 1,, Nilanjana Datta 1, Ludovico Lami 2,3, Cambyse Rouzé 4
PMCID: PMC8550765  PMID: 34720122

Abstract

Various quantum analogues of the central limit theorem, which is one of the cornerstones of probability theory, are known in the literature. One such analogue, due to Cushen and Hudson, is of particular relevance for quantum optics. It implies that the state in any single output arm of an n-splitter, which is fed with n copies of a centred state ρ with finite second moments, converges to the Gaussian state with the same first and second moments as ρ. Here we exploit the phase space formalism to carry out a refined analysis of the rate of convergence in this quantum central limit theorem. For instance, we prove that the convergence takes place at a rate On-1/2 in the Hilbert–Schmidt norm whenever the third moments of ρ are finite. Trace norm or relative entropy bounds can be obtained by leveraging the energy boundedness of the state. Via analytical and numerical examples we show that our results are tight in many respects. An extension of our proof techniques to the non-i.i.d. setting is used to analyse a new model of a lossy optical fibre, where a given m-mode state enters a cascade of n beam splitters of equal transmissivities λ1/n fed with an arbitrary (but fixed) environment state. Assuming that the latter has finite third moments, and ignoring unitaries, we show that the effective channel converges in diamond norm to a simple thermal attenuator, with a rate O(n-12(m+1)). This allows us to establish bounds on the classical and quantum capacities of the cascade channel. Along the way, we derive several results that may be of independent interest. For example, we prove that any quantum characteristic function χρ is uniformly bounded by some ηρ<1 outside of any neighbourhood of the origin; also, ηρ can be made to depend only on the energy of the state ρ.

Introduction

The Central Limit Theorem (CLT) is one of the cornerstones of probability theory. This theorem and its various extensions have found numerous applications in diverse fields including mathematics, physics, information theory, economics and psychology. Any limit theorem becomes more valuable if it is accompanied by estimates for rates of convergence. The Berry–Esseen theorem (see e.g. [1]), which gives the rate of convergence of the distribution of the scaled sum of independent and identically distributed (i.i.d.) random variables to a normal distribution, thus provides an important refinement of the CLT.

The first results on quantum analogues of the CLT were obtained in the early 1970s by Cushen and Hudson [2], and Hepp and Lieb [3, 4]. The approach of [3] was generalised by Giri and von Waldenfels [5] a few years later. These papers were followed by numerous other quantum versions of the CLT in the context of quantum statistical mechanics [614], quantum field theory [1517], von Neumann algebras [18, 19], free probability [20], noncommutative stochastic processes [21] and quantum information theory [2224]. For a more detailed list of papers on noncommutative or quantum central limit theorems (QCLT), see for example [19, 25] and references therein. A partially quantitative central limit theorem for unsharp measurements has been obtained in [26].

An important pair of non-commuting observables is the pair (xp) of canonically conjugate operators, which obey Heisenberg’s canonical commutation relations (CCR) [x,p]=iI, where I denotes the identity operator.1 These observables could be, for example, the position and momentum operators of a quantum particle, or the so-called position and momentum quadratures of a single-mode bosonic field, described in the quantum mechanical picture by the Hilbert space Inline graphic – the space of square integrable functions on R. The corresponding annihilation and creation operators are constructed as Inline graphic and Inline graphic. When expressed in terms of a,a, the CCR take the form [a,a]=I.

Quantum states are represented by density operators, i.e. positive semi-definite trace class operators with unit trace. A state ρ of a continuous variable quantum system is uniquely identified by its characteristic function, defined for all zC by Inline graphic. The special class of Gaussian states comprises all quantum states whose characteristic function is the (classical) characteristic function of a normal random variable on C.2 Exactly as in the classical case, a quantum Gaussian state is uniquely defined by its mean and covariance matrix.

Cushen and Hudson [2] proved a quantum CLT for a sequence of pairs of such canonically conjugate operators {(xn,pn):n=1,2,}, with each pair acting on a distinct copy of the Hilbert space Inline graphic. More precisely, they showed that sequences that are stochastically independent and identically distributed, and have finite covariance matrix and zero mean with respect to a quantum state ρ (given by a density operator on Inline graphic), are such that their scaled sums converge in distribution to a normal limit distribution [2, Theorem 1].

Their result admits a physical interpretation in terms of a passive quantum optical element known as the n-splitter. This can be thought of as the unitary operator Un-split that acts on n annihilation operators of n independent optical modes as Un-splitajUn-split=kFjkak, where Inline graphic is the discrete Fourier transform matrix. Passivity here means that Un-split commutes with the canonical Hamiltonian of the field, i.e. Un-split,jajaj=0. When n identical copies of a state ρ are combined by means of an n-splitter, and all but the first output modes are traced away, the resulting output state is called the n -fold quantum convolution of ρ, and denoted by ρn. This nomenclature is justified by the fact that the characteristic function χρσ of two states ρ and σ is equal to the product of the characteristic functions of ρ and σ, a relation analogous to that satisfied by characteristic functions of convolutions of classical random variables. Observe state ρn can also be obtained as the output of a cascade of n-1 beam splitters with suitably tuned transmissivities λj=j/(j+1) for j=1,2,n-1 (see Fig. 1a).

Fig. 1.

Fig. 1

The n-fold convolution ρn of a state ρ can be realised by mixing n copies of it either: a in an n-splitter; or b in a cascade of beam splitters with suitably tuned transmissivities

Cushen and Hudson’s result is that if ρ is a centred state (i.e. with zero mean) and has finite second moments, its convolutions ρn converge to the Gaussian state ρG with the same first and second moments as ρ in the limit n (Theorem 3). In [2, Theorem 1], the convergence is with respect to the weak topology of the Banach space of trace class operators, which translates to pointwise convergence of the corresponding characteristic functions, by a quantum analogue of Levy’s lemma that is also proven in [2]. This in turn implies that the convergence actually is with respect to the strong topology, i.e. in trace norm (see [27], or [28, Lemma 4]).

In this paper, we focus on the framework proposed by Cushen and Hudson, and provide a refinement of their result by deriving estimates for the associated rates of convergence. We consider a quantum system composed of m modes of the electromagnectic field, each modelled by an independent quantum harmonic oscillator, so that the corresponding Hilbert space becomes Inline graphic. The main contribution of this paper consists of estimates on rate of convergence of ρn to the ‘Gaussification’ ρG of ρ, obtained under suitable assumptions on ρ – typically, the finiteness of higher-order moments. In analogy with the classical case, we refer to our Theorems 6 and 7 as quantum Berry–Esseen theorems. Our estimates are given in the form of bounds on the Schatten p-norms (for p=1 and 2) of the difference (ρn-ρG) in the limit of large n, as well as bounds on the relative entropy of ρn with respect to ρG in the same limit.

We also show that the assumption of finiteness of the second moments cannot be removed from the Cushen–Hudson theorem. Namely, we construct a simple example of a single-mode quantum state σ such that Tr[σ(aa)1-δ] is finite for all δ>0 (and infinite for δ=0), yet σn does not converge to any quantum state as n.

As an application, we propose and study a new model of optical fibre, represented as a cascade of n beam splitters, each with transmissivity λ1/n and fed with a fixed environment state ρ, which is assumed to have bounded energy and thermal Gaussification. Such a model may be relevant to the mathematical modelisation of a channel running across an integrated optical circuit [29, 30]. We are able to show that for n the cascade channel converges in diamond norm, up to irrelevant symplectic unitaries, to a thermal attenuator channel with transmissivity λ and the same photon number as that of the environment state ρ. Furthermore, an extension of our results to the non-i.i.d. setting allows us to bound the rate of convergence in terms of the diamond norm distance. Finally, combining existing continuity bounds on entropies and energy-constrained channel capacities [31, 32], obtained by Winter [33, 34] and Shirokov [35, 36], with the known formulae expressing or estimating energy-constrained classical [37, 38] and quantum [3945] capacities of thermal attenuator channels, we derive bounds on the same capacities for the cascade channel.

Finally, along the way we derive several novel results concerning quantum characteristic functions, which we believe to be of independent interest. First, we prove the simple yet remarkable fact that convolving any two quantum states (i.e. mixing them in a 50 : 50 beam splitter) always results in a state with non-negative Wigner function (Lemma 16). This allows us to interpret the quantum central limit theorem as a result on classical random variables, in turn enabling us to transfer techniques from classical probability theory to the quantum setting. Secondly, we derive new decay bounds on the behaviour of the quantum characteristic function both at the origin and at infinity. For instance, we prove that for any m-mode quantum state ρ and for any ε>0 there exists a constant η=η(ρ,ε)<1 such that |χρ(z)|η(ρ,ε) for all zCm with |z|ε (Proposition 14). Moreover, we show that such a constant can be made to depend only on the second moments of the state, assuming they are finite (Proposition 15). As an explicit example, consider a single-mode state ρ with mean energy E. We then prove that |χρ(z)|1-cE2 for all z with |z|CE, where cC are universal constants. Note that any such bound must depend on the energy, as one can construct a sequence of highly squeezed Gaussian states for which the modulus of the characteristic function approaches one at any designated point in phase space (Example 2).

Layout of the paper: In Sect. 2 we introduce the notation and definitions used in the paper. In Sect. 3 we recall the Cushen and Hudson quantum central limit theorem. Our main results are presented in Sect. 4. The rest of the paper is devoted to the proofs of these results. We start with the novel properties of quantum characteristic functions (Sect. 5), which lie at the heart of our approach. Then, in Sect. 6 we prove our quantum Berry–Esseen theorems. Section 7 is devoted to the discussion of the optimality and sharpness of our results. In Sect. 8 we apply our quantitative non-i.i.d. extension of the Cushen–Hudson theorem to an optical fibre subject to non-Gaussian environment noise. The paper contains a technical appendix (Appendix A) that makes the connection between moments and the regularity of the quantum characteristic function and shows that our definition of moments induces a canonical family of interpolation spaces.

Notation and Definitions

In this section, we fix the basic notations used in the paper, and introduce the necessary definitions.

Mathematical notation

Let Inline graphic denote a separable Hilbert space, and let Inline graphic denote the set of bounded linear operators acting on Inline graphic. Let Inline graphic denote the set of quantum states of a system with Hilbert space Inline graphic, that is the set of density operators ρ (positive semi-definite, i.e. ρ0, trace class operators3 with unit trace) acting on Inline graphic. We denote by Inline graphic the Schatten p-norm, defined as Xp=Tr|X|p1/p. The Schatten p -class Inline graphic is the Banach subspace of Inline graphic formed by all bounded linear operators whose Schatten p-norm is finite. We shall hereafter refer to Inline graphic as the set of trace class operators, to the corresponding norm ·1 as the trace norm, and to the induced distance (e.g. between quantum states) as the trace distance. The case p=2 is also special, as the norm ·2 coincides with the Hilbert–Schmidt norm.

Let AB be positive semi-definite operators defined on some domains Inline graphic. According to [46, Definition 10.15], we write that AB if and only if DomA1/2DomB1/2 and Inline graphic for all |ψ>DomA1/2. Now, let A be a positive semi-definite operator, and let ρ be a quantum state with spectral decomposition Inline graphic. We define the expected value of A on ρ as

graphic file with name 220_2021_3988_Equ1_HTML.gif 1

with the convention that Tr[ρA]=+ if the above series diverges or if there exists an index i such that pi>0 and |ei>DomA1/2. To extend this definition to a generic densely defined self-adjoint operator X on Inline graphic, it is useful to consider its decomposition X=X+-X- into positive and negative part [46, Example 7.1]. We will say that X has finite expected value on ρ if |ei>Dom(X+1/2)Dom(X-1/2) for all i such that pi>0, and moreover the two series ipiX±1/2|ei>2 both converge. In this case, we call

graphic file with name 220_2021_3988_Equ2_HTML.gif 2

the expected value of X on ρ. Clearly, given two operators AB0, we have that Tr[ρA]Tr[ρB].

For two real sequences an(λ)n,bn(λ)n that depend on some parameter λ, we write an(λ)=Oλbn(λ) if there exists a constant cλ>0 that only depends on λ such that |an(λ)|cλ|bn(λ)| holds in the limit n. We also write an(λ)=Oλbn(λ) if for every NN we have that an(λ)=Oλbn(λ)N.

For an n-linear tensor A:×i=1nCmCk, we write Inline graphic if the vector we apply the tensor to is the same in every component. For functions f, we sometimes abuse the notation by denoting the norm of this function as f(z) instead of Inline graphic We denote with the entry-wise complex conjugation, with the standard transposition of vectors, and with the combination of the two.

For partial derivatives with respect to complex variables z,z we write z and z. Consider an m-dimensional multi-index α=(α1,α2,,αm) with |α|=α1+α2++αm. Then Inline graphic and analogously for z. The total derivatives of order k of a function f:CmC we denote by Inline graphic We then recall the definition of the Fréchet derivative for functions f:CmC such that Inline graphic and therefore

Dkf(z)v(1),..,v(k)=|α|+|β|=kzαzβf(z)=1|α|vjα()()=|α|+1kvjβ(-|α|)(), 3

with Inline graphic. Let C0(Cm) denote the space of continuous functions f:CmC that tend to zero as |z|, where for zCm we set

graphic file with name 220_2021_3988_Equ4_HTML.gif 4

We write Cc(Cm) to denote the space of smooth and compactly supported functions on Cm. For some open set ΩCm with closure Ω¯, a function f:Ω¯C, and a non-negative integer kN0, we denote by Ck(Ω¯) the space of functions for which the norm

graphic file with name 220_2021_3988_Equ5_HTML.gif 5

is finite. Here, α,βN0m are multi-indices. When k0 is not an integer, we define instead

graphic file with name 220_2021_3988_Equ6_HTML.gif 6

This extension allows us to consider the normed spaces Ck(Ω¯) for all k0. Typically, we will deal with the case where Ω is bounded, so that Ck(Ω¯) is in fact a Banach space. Finally, L2(Ω) will denote the space of equivalence classes of measurable functions f:ΩC whose L2 norm Inline graphic is finite.

Definitions

Quantum information with continuous variables

In this paper, we focus on continuous variable quantum systems. The Hilbert space of a set of m harmonic oscillators, in this context called ‘modes’, is the space Inline graphic of square-integrable functions on Rm. Let xj,pj be the canonical position and momentum operators on the jth mode. The m annihilation and creation operators, denoted by Inline graphic and Inline graphic (j=1,,m), satisfy the commutation relations

[aj,ak]=0,[aj,ak]=δjkI, 7

where I is the identity on Inline graphic. An m-mode quantum state ρ is said to be centred if

graphic file with name 220_2021_3988_Equ8_HTML.gif 8

i.e. if all expected values of the canonical operators on ρ, defined according to (2), vanish. For an m-tuple of non-negative integers n=(n1,,nm)N0m, the corresponding Fock state is defined by Inline graphic, where Inline graphic denotes the (multi-mode) vacuum state. In what follows, we often consider m=1.

The (von Neumann) entropy of a quantum state ρ is defined as

graphic file with name 220_2021_3988_Equ9_HTML.gif 9

which is well defined although possibly infinite.4 The relative entropy between two states ρ and σ is usually written as follows [47]

graphic file with name 220_2021_3988_Equ10_HTML.gif 10

Again, the above expression is well defined and possibly infinite [48].5

For two Hilbert spaces Inline graphic, a quantum channel Inline graphic is a completely positive, trace-preserving linear map. For a linear map Inline graphic, we define its diamond norm as

graphic file with name 220_2021_3988_Equ11_HTML.gif 11

where the supremum is over all non-zero trace class operators X on Inline graphic.

Consider a quantum system with Hilbert space Inline graphic, governed by a Hamiltonian H, which is taken to be a positive (possibly unbounded) operator on Inline graphic. The energy of a state Inline graphic is the quantity Tr[ρH]R+{+} defined as in (1).

Given two Hilbert spaces Inline graphic and Inline graphic, a Hamiltonian H on Inline graphic, and some energy bound Inline graphic, the corresponding energy-constrained classical capacity of a channel Inline graphic is given by [31, 4952]

graphic file with name 220_2021_3988_Equ12_HTML.gif 12

where it is understood that the Hamiltonian H(n) on Inline graphic is given by Inline graphic, where Hj acts on the jth tensor factor, and tensor products with the identity operator are omitted for notational simplicity. With the same notation, one can also define the energy-constrained quantum capacity of N, given by [32, 34, 5355]

graphic file with name 220_2021_3988_Equ13_HTML.gif 13

where Inline graphic is the partial trace over the entirely arbitrary ancillary Hilbert space Inline graphic. In this paper we are interested in the simple case Inline graphic and Inline graphic, so that there is a natural choice for H, namely, the canonical Hamiltonian

graphic file with name 220_2021_3988_Equ14_HTML.gif 14

of m modes. In this case, we will omit the subscripts and simply write the energy-constrained capacities as CN,E and QN,E.

Phase space formalism

We define the displacement operator D(z) associated with a complex vector zCm as

D(z)=expj(zjaj-zjaj). 15

Thus, D(z) is a unitary operator and satisfies D(z)=D(-z) and

D(z)D(w)=D(z+w)e12(zw-zw), 16

valid for all z,wCm.

Let Hquad=j,kXjkajak+Yjkajak+Yjkajak, where X=X is an m×m Hermitian matrix, and Y=Y is an m×m complex symmetric matrix. The unitaries e-iHquad generated by such Hamiltonians, and products thereof,6 are called symplectic unitaries, because they induce a symplectic linear transformation at the phase space level (zR,zI)R2m, where Inline graphic and Inline graphic [56, 57]. A symplectic unitary is called passive if it commutes with the number operator jajaj, which happens whenever the generating Hamiltonian Hquad satisfies Y=0. A passive symplectic unitary V acts on annihilation operators as VajV=kUjkak, where U is an m×m unitary matrix.

For trace class operators Inline graphic, the quantum characteristic function Inline graphic is given by

graphic file with name 220_2021_3988_Equ17_HTML.gif 17

Conversely, the operator T can be reconstructed from χT via the weakly defined identity

T=d2mzπmχT(z)D(-z). 18

Observe that the adjoint T of T satisfies χT(z)=χT(-z) for all zCm, so that T is self-adjoint if and only if χT(-z)χT(z). The characteristic function χT of a trace class operator T is bounded and uniformly continuous [58, § 5.4]. If T is positive semi-definite (e.g. if T is a density operator), then maxα|χT(α)|=χT(0)=Tr[T].

We write |ψf> to denote the pure state corresponding to the wave function fL2(Rm), so that the corresponding rank-one state Inline graphic has the following characteristic function:

χψf(z)=e-izIzRdmxf(x)f(x-2zR)e2izIx, 19

where as usual z=zR+izI.

The Fourier transform of the characteristic function is known as the Wigner function. For a trace class operator T, the Wigner function is given by [59, Eq. (4.5.12) and (4.5.19)]

graphic file with name 220_2021_3988_Equ20_HTML.gif 20
graphic file with name 220_2021_3988_Equ21_HTML.gif 21

Observe that WT(z)=WT(z), so that T is self-adjoint if and only if WT(z)R for all zCm. From (21) it is not difficult to see that |WT(z)|2mπmT1, where T1=Tr|T| reduces to 1 when T is a density operator. By taking the Fourier transform of (19), one can show that

Wψf(z)=2πdmxf(x+2zR)f(-x+2zR)e22izIx. 22

Moreover, the energy of any density matrix, ρ, can be obtained as a phase space integral

d2mzz2Wρ(z)=TrρHm+m2I 23

The displacement operator D(z) induces a translation or displacement of the Wigner function as follows, hence the nomenclature:

χD(z)ρD(z)(u)=ezu-zuχρ(u),WD(z)ρD(z)(u)=Wρ(u-z). 24

The map TχT, defined for trace class operators T in (17), extends uniquely to an isomorphism between the space of Hilbert–Schmidt operators and that of square-integrable functions L2(Cm). In fact, the quantum Plancherel theorem guarantees that this is also an isometry, namely

Tr[ST]=d2mzπmχS(z)χT(z)=πmd2mzWS(z)WT(z) 25

and therefore

ρ-σ22=d2mzπmχρ(z)-χσ(z)2=πmd2mzWρ(z)-Wσ(z)2. 26

Henceforth, we refer to (26) as the quantum Plancherel identity.

Gaussian states on Inline graphic are the density operators Inline graphic such that Wρ(z) is a Gaussian probability distribution on the real space (zR,zI)R2m and are uniquely defined by their first and second moments. A particularly simple example of a single-mode Gaussian state is a thermal state with mean photon number N[0,), given by

graphic file with name 220_2021_3988_Equ27_HTML.gif 27

The thermal state is the maximiser of the entropy among all states with a fixed maximum average energy:

graphic file with name 220_2021_3988_Equ28_HTML.gif 28

for all N0, where the function g is defined by

graphic file with name 220_2021_3988_Equ29_HTML.gif 29

The characteristic function and Wigner function of the thermal state evaluate to [59, Eq. (4.4.21) and (4.5.31)]

χτN(z)=e-(2N+1)|z|2/2,WτN(z)=2π(2N+1)e-2|z|2/(2N+1), 30

respectively, so that τN is easily seen to be a centred Gaussian state.

Moments

Definition 1

(Standard Moments). An m-mode quantum state ρ is said to have finite standard moments of order up to k, for some k[0,), if

graphic file with name 220_2021_3988_Equ31_HTML.gif 31

where Hm is the canonical Hamiltonian (14), and the above trace is defined as in (1).

Remark

The above condition is fairly easy to check once the matrix representation of ρ in the Fock basis is given. Namely, resorting to (1) and exchanging the order of summation for infinite series with non-negative terms, we see that (31) is equivalent to

Mk(ρ)=nN0m(m+|n|)k/2<n|ρ|n><, 32

where as usual |n|=jnj.

Given k>0 and mN, we can also define, by analogy with classical harmonic analysis, the m -mode bosonic Sobolev space of order k as follows

graphic file with name 220_2021_3988_Equ171_HTML.gif

where as usual Inline graphic. Here, we set

graphic file with name 220_2021_3988_Equ172_HTML.gif

with the canonical Hamiltonian on m modes being defined by (14). For density operators ρ it holds, using monotone convergence and cyclicity of the trace, that

graphic file with name 220_2021_3988_Equ173_HTML.gif

where Inline graphic is the indicator function of the interval [0, E].

It is well known that the characteristic function of any classical random variable with finite moments of order up to k (with k being a positive integer) is continuously differentiable k times everywhere. We can draw inspiration from this fact to devise an alternative way to introduce moments, relying on the regularity of the quantum characteristic function, in the quantum setting as well. We refer to moments defined in this manner as phase space moments.

Definition 2

( Phase space moments). An m-mode quantum state ρ is said to have finite phase space moments of order up to k, for some k[0,), if

graphic file with name 220_2021_3988_Equ33_HTML.gif 33

for some ε>0, where Inline graphic is the Euclidean ball of radius ε centred in 0, and the norm on the space CkB(0,ε) is defined by (5) and (6).

In complete analogy with the classical case, finiteness of standard moments implies local differentiability of the characteristic function, and hence finiteness of phase space moments. See Theorem 9 of Sect. 4.

However, the converse is not true in general. This is not surprising, as the same phenomenon is observed for classical random variables. In fact, a famous example by Zygmund [60] shows the existence of classical random variables with continuously differentiable characteristic function whose first absolute moments do not exist. We can swiftly carry over his example to the quantum realm, e.g. by considering a particular displaced vacuum state Inline graphic. One can show that its characteristic function is χρ(z)=e-|z|2n=2cos(2nzI)n2logn, which turns out to be continuously differentiable everywhere [60]. However,

Tr[ρ|x|]cn=21n2logn22n=+,

which implies that ρ has no finite first-order moments (see Lemma 24).

In spite of the above counterexample, we show in Theorem 28 that at least if k is an even integer, then the existence of kth order phase space moment implies the existence of the kth order standard moment. Again, this is in total analogy with the classical case [61, Theorem 1.8.16].

Remark. Due to the above, for even k, we simply use the word moment in the statements of our theorems, instead of differentiating between standard moments and phase space moments.

Quantum convolution

A beam splitter with transmissivity λ[0,1] acting on two sets of m modes is a particular type of a passive symplectic unitary, which we express as7

graphic file with name 220_2021_3988_Equ34_HTML.gif 34

where aj and bj (j=1,,m) are the creation operators of the first and second sets of modes, respectively. Its action on annihilation operators can be represented as follows

UλajUλ=λaj-1-λbj,UλbjUλ=1-λaj+λbjj{1,..,m}. 35

Accordingly, displacement operators are transformed by

UλD(z)D(w)Uλ=Dλz+1-λwD-1-λz+λw. 36

The beam splitter unitary can be used to define the following (λ-dependent) quantum convolution: for two m-mode quantum states ρ,σ and λ[0,1], their (λ-dependent) quantum convolution is given by the state ρλσ which is defined according to [62] as

graphic file with name 220_2021_3988_Equ37_HTML.gif 37

In terms of characteristic functions, this definition corresponds to

χρλσ(z)=χρλzχσ1-λz. 38

It is not difficult to verify that for all symplectic unitaries V and all λ[0,1], the beam splitter unitary Uλ of (34) satisfies VV,Uλ=0. In particular,

V(ρλσ)V=(VρV)λ(VσV) 39

for any state σ. Also, using (35) it can be shown that the mean photon number of a quantum convolution is just the convex combination of those of the input states, i.e.

Tr(ρλσ)Hm=λTrρHm+(1-λ)TrσHm, 40

where the canonical Hamiltonian is defined by (14).

For all m-mode quantum states σ and all λ[0,1], we can use the corresponding convolution to define a quantum channel Inline graphic, whose action is given by

graphic file with name 220_2021_3988_Equ41_HTML.gif 41

When σ=τN is a thermal state (with mean photon number N), the channel Inline graphic is called a thermal attenuator channel. Its action, obtained by combining (38) and (30), is given by

graphic file with name 220_2021_3988_Equ42_HTML.gif 42

For the thermal attenuator channel, the energy-constrained classical capacity (defined in (12)) can be shown to reduce to can be shown to be given by [37, 38]

CEN,λ,E=gλE+(1-λ)N-g(1-λ)N, 43

where g is given by (29).

In what follows, we will be interested in the symmetric quantum convolutions ρ1ρn, iteratively defined for a positive integer n and states ρ1,,ρn, by the relations Inline graphic and

graphic file with name 220_2021_3988_Equ44_HTML.gif 44

We will also use the shorthand

graphic file with name 220_2021_3988_Equ45_HTML.gif 45

In terms of characteristic and Wigner functions, we can also write

χρ1ρn(z)=χρ1z/nχρnz/n, 46
Wρ1ρn(z)=nmWρ1Wρnnz. 47

Here, denotes convolution, which is defined for n functions f1,,fn:CmR by

graphic file with name 220_2021_3988_Equ48_HTML.gif 48

Equation (46) shows that the quantum characteristic function of the symmetric quantum convolution satisfies the same scaling property as a sum of classical i.i.d. (independent and identically distributed) random variables. The important special case ρiρ of (46) for all i{i,2,,n}, on which we will focus most of our efforts, reads

χρn(z)=χρz/nn. 49

Iterating (39), using (44), shows that

VρnV=(VρV)n 50

holds for all symplectic unitaries V.

Cushen and Hudson’s Quantum Central Limit Theorem

In [2], Cushen and Hudson proved the following quantum mechanical analogue of the central limit theorem, which is the starting point of our study.

Theorem 3

[2, Theorem 1] . Let Inline graphic be a centred m-mode quantum state with finite second moments. Then the sequence (ρn)nN converges weakly to the Gaussian state ρG of same first and second moments as ρ:

graphic file with name 220_2021_3988_Equ51_HTML.gif 51

where Inline graphic is the set of bounded operators on Inline graphic.

Remark

The state ρG is commonly called the Gaussification of ρ.

In fact, the proof of Theorem 3 relies on the equivalence between weak convergence of states and pointwise convergence of their characteristic functions. More precisely, the following holds:

Lemma 4

([27, Lemma 4.3] and [28, Lemma 4]). Let (ρn)nN be a sequence of density operators on Inline graphic. The following are equivalent:

  • (ρn)nN converges to a density operator in the weak operator topology, namely, it holds that Inline graphic for all Inline graphic;

  • (ρn)nN converges in trace distance to a trace class operator;

  • the sequence (χρn)nN of characteristic functions converges pointwise to a function that is continuous at 0.

Together, the above lemma and Theorem 3 allow us to conclude the following seemingly stronger convergence:

Theorem 5

Under the assumptions of Theorem 3, we have that

limnρn-ρG1=0. 52

Main Results

The main objective of this paper is to refine Theorem 5 of the previous section in the following directions:

  • First, in the case in which the state ρ satisfies the conditions of the Cushen–Husdon theorem, we provide quantitative bounds on the rate at which the sequence of states (ρn)nN converges to ρG, under the assumption of finiteness of certain phase space moments of ρ. We also show how finiteness of phase space moments is implied by finiteness of the corresponding standard moments, the latter having the advantage of being a more easily verifiable condition. Moreover, we show that finiteness of even integer phase space moments implies finiteness of even integer standard moments (Sect. 4.1).

  • Secondly, we provide an example to show that the assumption that the second moments be finite in the Cushen–Hudson theorem cannot be weakened (Sect. 4.2).

  • Thirdly, we extend our results to the non-i.i.d. setting, i.e. we consider a scaling in the quantum convolution different from (44). This allows us to analyse the propagation of states through cascades of beam splitters with varying transmissivities (Sect. 4.3).

  • Finally, we provide a precise asymptotic analysis of the behaviour of quantum characteristic functions at zero and at infinity (Sect. 4.4).

Quantitative bounds in the QCLT

In this section, we state our results on rates of convergence in the Cushen–Hudson quantum central limit theorem. We call them quantum Berry–Esseen theorems, as is customary in the literature. Our first theorem provides convergence rates On-1/2 in the quantum central limit theorem under a fourth-order moment condition. The rate of convergence is boosted to On-1 if the third derivative of the characteristic function at zero vanishes:

Theorem 6

(Quantum Berry–Esseen theorem; High regularity). Let ρ be a centred m-mode quantum state with finite fourth-order phase space moments. Then, the convergence in the quantum central limit theorem in Hilbert–Schmidt norm satisfies

ρn-ρG2=OM4n-1/2.

Here, M4=M4(ρ,ε) is the moment defined in (33), and ε>0 is sufficiently small. Moreover, if D3χρ(0)=0 then the convergence is at least with rate OM4n-1.

The proof of Theorem 6 is provided in Sect.  6. In the next Theorem, we weaken the assumption on the moments of the state ρ, which leads to a slower rate of convergence.

Theorem 7

(Quantum Berry–Esseen theorem; Low regularity). Let ρ be a centred m-mode quantum state with finite (2+α)-order phase space moments, where α(0,1]. The convergence in the quantum central limit theorem in Hilbert–Schmidt norm is given by

ρn-ρG2=OM2+αn-α/2.

Here, M2+α=M2+α(ρ,ε) is the phase space moment defined in (33), and ε>0 is sufficiently small.

The proof of Theorem 7 is provided in Sect.  6. The variable α allows us to obtain a convergence rate under the assumption of finiteness of phase space moments of order all the way down to 2 (excluded), which is the assumption required in the Cushen–Hudson QCLT. The above results can further be used to find convergence rates in other, statistically more relevant, distance measures:

Corollary 8

(Convergence in trace distance and relative entropy). Assume that an m-mode quantum state ρ has finite third-order phase space moments. Then,

ρn-ρG1=OM3n-12(m+1),DρnρG=OM3n-12(m+1),

where M3=M3(ρ,ε) is defined in (33), and ε>0 is sufficiently small. The above rates are replaced by OM2+αn-α/(2m+2) when ρ only satisfies the conditions of Theorem 7.

The proof of this Corollary is given in Sect. 6.

Remark

(Condition on the existence of moments). The error bounds in Theorems 6 and 7 are stated in terms of assumptions on the phase space moments Mk given by (33), of the state. It is possible to bound the phase space moments Mk directly in terms of the standard moments Mk defined in (31). This is stated in the following Theorem, whose proof is given in Appendices A–C

Theorem 9

Let k[0,), m a positive integer, and ε>0 be given. Then every m-mode quantum state with finite standard moments of order up to k also has finite phase space moments of the same order. More precisely, there is a constant ck,m(ε)< such that

Mk(ρ,ε)=χρCkB(0,ε)ck,m(ε)Mk(ρ).

Conversely, if the characteristic function is 2k times totally differentiable at z=0 for some integer k, then the 2kth standard moment is finite as well.

The importance of Theorem 9 for us comes from the fact that most of our proofs rest upon local differentiability properties of the characteristic function. While mathematically useful, such properties have no direct physical meaning and may be hard to verify in practice. Instead, the condition of finiteness of higher-order standard moments, as given in Definition 1, bears a straightforward physical meaning, related to the properties of the photon number distribution of the state, and is often easier to verify.

The key to proving Theorem 9 for fractional k lies in an interpolation argument. To state it precisely, we briefly recall some basic facts about real interpolation theory (see [63] for more details): given two Banach spaces Inline graphic and Inline graphic, and a parameter 0θ1, define the K -function as follows:

graphic file with name 220_2021_3988_Equ53_HTML.gif 53

and derive from this the function Φθ(K(X))=supt>0t-θK(t,X). The real interpolation spaces, parametrised by θ(0,1), are then defined as

graphic file with name 220_2021_3988_Equ174_HTML.gif

Now, given two couples of Banach spaces Inline graphic and Inline graphic, and a map Inline graphic such that Inline graphic and Inline graphic are bounded, the map Inline graphic is bounded and:

graphic file with name 220_2021_3988_Equ175_HTML.gif

We want to apply this to the map ρχρ.

The following interpolation result for density operators then holds:

Proposition 10

Let k1k00 be real numbers. The m -mode bosonic Sobolev spaces Inline graphic and Inline graphic form a compatible couple such that for any m-mode quantum state ρ and θ(0,1) the real interpolation norm satisfies

graphic file with name 220_2021_3988_Equ176_HTML.gif

The proof of Proposition 10 is stated in Appendix B.

Optimality of convergence rates and necessity of finite second moments in the QCLT

The results stated in the previous section lead naturally to the following questions:

(i) Can the assumption of finiteness of second moments in the Cushen–Hudson theorem be weakened?

(ii) Are the convergence rates of Theorems 6 and 7 and Corollary 8 optimal?

We start by answering the first question in the negative: there exists a state with finite moments of all orders 2(1-δ) (for δ>0) for which neither Theorem 3 nor Theorem 5 holds.

Proposition 11

Consider the one-mode state Inline graphic with wave function

graphic file with name 220_2021_3988_Equ54_HTML.gif 54

Then: (a) ψf is centred; (b) M2(1-δ)(ψf)=<ψf|(aa)1-δ|ψf>< for all δ>0; yet (c) the sequence Inline graphic does not converge to any quantum state. Hence, the assumption of finiteness of second moments in the Cushen–Hudson QCLT (Theorems 3 and 5) cannot be weakened.

The proof of the above proposition is given in Sect.  7.

We now come to the second question (ii) regarding tightness of the estimates in Theorems 6 and 7 and Corollary 8. In Sect. 7 below, we study several explicit examples and provide convincing numerical evidence that our estimates are indeed tight, at least as far as the Hilbert–Schmidt convergence rates are concerned. Our findings are summarised as follows.

  • We start by looking at the pure state |ψ>=(|0>+|3>)/2, with density matrix Inline graphic and thermal Gaussification ψG=τ3/2. Our findings indicate that ψn-ψG2cn-1/2, in the sense that the ratio between the two sides tends to 1 as n, for some absolute constant c (Example 5 and Fig. 4). Hence, the O(n-1/2) convergence rate of Theorem 7 is attained.

  • Next, we focus on the second estimate of Theorem 6, and show that it is also tight. Namely, we compute the differences ψn-ψGζ for the simple case of a single-photon state Inline graphic and for ζ=1,2, and find numerical evidence that again ψn-ψGζcn-1 for some absolute constant c (Example 4 and Fig. 4). This shows that the O(n-1) convergence rate stated in Theorem 6, under the assumption that D3χρ(0)=0, is also attained.

Fig. 4.

Fig. 4

This plot shows the expressions cnαρn-ρ for a constant c>0 such that limncnαρn-ρ=1. The left figure shows that the O(1/n) convergence rate is sharp (Theorem 7) by using the state from Example 5. The right figure shows that we can obtain a rate O(1/n) if D3χρ(0)=0 (Theorem 6) by using the state from Example 4. In both figures we write ρn for ρn

Applications to capacity of cascades of beam splitters with non-Gaussian environment

We now discuss applications of our results to the study of channels that arise naturally in the analysis of lossy optical fibres. We model a physical fibre of overall transmissivity λ as a cascade of n beam splitters, in each of which the signal state ω is mixed via an elementary beam splitter of transmissivity λ1/n with a fixed state ρ, modelling the environmental noise (Fig. 2). Each step corresponds to the action of the channel Inline graphic (cf. the definition (41)), so that the whole cascade can be represented by the n-fold composition Inline graphic. Note that this is in general a non-Gaussian channel, albeit it is Gaussian dilatable [28, 64]. We are interested in the asymptotic expression of the output state Inline graphic as the number n tends to infinity, as a function of the input state ω. In other words, we want to study the asymptotic channel Inline graphic.

Fig. 2.

Fig. 2

An input state ω enters an optical fibre modelled by a cascade of n beam splitters with equal transmissivities λ1/n and environment states ρ

At this point, it should not come as a surprise that such a channel exists and coincides with NρG,λ.

Before we see why, let us justify why the above model may be relevant to applications. The recently flourishing field of integrated quantum photonics sets as its goal that of implementing universal quantum computation on miniaturised optical chips [29, 30, 65, 66]. A quantum channel that runs across such a circuit is susceptible to noise generated by other active elements of the same circuit, e.g. single-photon sources. While we expect such noise to be far from thermal, it may become so in the limit n of many interactions. In a regime where n is finite, albeit large, our setting will thus be the appropriate one. The forthcoming Corollary 13 allows us to study the classical and quantum capacity of the effective channel in such a regime.

Let us note in passing that the cascade architecture we are investigating now, in spite of some apparent resemblance, is different from that depicted in Fig. 1b. While we regard the former as more operationally motivated, the latter is mathematically convenient, as the transmissivities are tuned in such a way as to yield the symmetric convolution ρn at the output.

Theorem 12

(Approximation of thermal attenuators channels by cascades of beam splitters). Let ρ be a centred m-mode quantum state with finite third-order phase space moments M3, cf. (33), and denote by ρG its Gaussification. Then,

graphic file with name 220_2021_3988_Equ177_HTML.gif

where · stands for the diamond norm (11).

One can further make use of the recently derived continuity bounds under input energy constraints [3336] in order to find bounds on capacities of the cascade channel Inline graphic in the physically relevant case where the Gaussification ρG of ρ is a thermal state.8

Corollary 13

Consider a single-mode quantum state ρ with finite third-order phase space moments M3 (cf. (33)) and thermal Gaussification ρG=τN as in (27). Then, for λ[0,1], mean photon number Inline graphic, and some input energy E>0, the energy-constrained classical and quantum capacity of the cascade channel Inline graphic relative to the canonical Hamiltonian aa satisfy

graphic file with name 220_2021_3988_Equ55_HTML.gif 55

and

graphic file with name 220_2021_3988_Equ56_HTML.gif 56

where Inline graphic (as in (29)), and Q(EN,λ,E) is the quantum capacity of the thermal attenuator.9

The remainder terms are such that

Δc(n;N,M3,λ,E)C(M3)n-1/4logn,Δq(n;N,M3,λ,E)C(M3)n-1/8logn. 57

for some constant C=C(M3) and all sufficiently large nn0λE+(1-λ)N,M3.

The proofs of Theorem 12 and Corollary 13 are postponed to Sect. 8.

New results on quantum characteristic functions

In this subsection we state our refined asymptotic analysis of the decay of quantum characteristic functions that we employ in the proofs of our main theorems. For arbitrary quantum states, we have the following asymptotic result on the quantum characteristic function at infinity. It states that the quantum characteristic function can, in absolute value, only attain the value one at zero and decays to zero at infinity. Both these properties do not hold for general classical random variables, see Sect.  5.2.

Proposition 14

The quantum characteristic function of an m-mode quantum state ρ is a continuous function that is arbitrarily small in absolute value outside of a sufficiently large compact set, i.e. χρ belongs to the Banach space C0(Cm) of asymptotically vanishing functions. Moreover, for any ε>0 we have

maxzCm\B(0,ε)χρ(z)<1, 58

where Inline graphic denotes a Euclidean ball of radius ε centred at the origin.

The proof of Proposition 14 is given in Sect.  5.2. Interestingly, we can obtain a much more refined asymptotic on the decay of quantum characteristic functions if we assume that the state has finite second order moments.

Proposition 15

Let ρ be an m-mode state with finite average energy Inline graphic, where we have explicitly accounted for the non-zero energy of the vacuum state. Then, for all zCm and all δ[0,1] it holds that

χρ(z)1-(1-δ)3δ2m-1(2m+1)!!26·24m·E2m-1min|z|2,π2δ4E.

The proof of Proposition 15 is given in Sect.  5.2.

New Results on Quantum Characteristic Functions: Proofs

Quantum characteristic functions constitute a central tool in our approach. Therefore, the first step in our path towards the quantum Berry–Esseen theorems is to prove the results stated in Sect.  4.4. The structure of this section is as follows:

  • Quantum–classical correspondence: We derive a quantum–classical correspondence of the central limit theorems by showing that the quantum convolution of two arbitrary density operators naturally induces a classical random variable (Sect.  5.1).

  • Decay bounds: We derive new decay estimates and asymptotic properties of the quantum characteristic function at infinity (Sect. 5.2).

Quantum–Classical Correspondence

In this section we show that the quantum convolution ρσ of any two states ρ and σ has a non-negative Wigner function. While the mathematics behind this is known (see e.g. [67, Proposition (1.99)], [2, Proposition 5], and [68, Eq. (8)]), we believe that its physical implications have not been appreciated to the extent they deserve.

Lemma 16

Let ρ and σ be arbitrary m-mode quantum states. Then the Wigner function of their convolution ρσ defined by (37), with λ=1/2, is given by

Wρσ(z)=2mπmTrρD(2z)JσJD(2z), 59

where Inline graphic is the unitary and self-adjoint operator that implements a phase space inversion (in the sense of Eq. (61) below). In particular,

Wρσ(z)0zCm. 60

Proof

We start by verifying that J actually corresponds to a phase space inversion, in the sense that

WJρJ(z)=Wρ(-z) 61

for all m-mode quantum states ρ and all zCm. This follows from the easily verified fact that JajJ=-aj for all j, which also implies that JD(z)J=D(-z). In fact, using (21) we find that

WJρJ(z)=2mπmTrD(-z)JρJD(z)J=2mπmTrJD(z)ρD(-z)=Wρ(-z).

We now compute

graphic file with name 220_2021_3988_Equ178_HTML.gif

In 1, we use the convolution property for the Wigner function in (47),where in 2 we just write out the convolution of several functions as in (48). In 3 we then first flip phase space variables according to (61) and use the displacement operator in 4 to translate them by 2z, cf(24). Finally, in 5 we use the quantum Plancherel identity (25) to transform the integral over Wigner functions in a trace over density operators.

(24)

The above equalities are labelled by the equation numbers corresponding to the identities that justify them.

Remark

It is not difficult to see that λ=1/2 is the only special value for which Lemma 16 can hold, i.e. such that Wρλσ(z)0 for all m-mode states ρ,σ and for all zCm. To see why, consider the case where m=1 and ρ,σ are the first two Fock states. The action of the beam splitter unitary on the annihilation operators, as expressed by (35), leads to the identity Inline graphic. Using the expression for the Wigner function of Fock states [59, Eq. (4.5.31)], we see that

W|0><0|λ|1><1|(z)=Wλ|0><0|+(1-λ)|1><1|(z)=2πe-2|z|2λ-(1-λ)1-4|z|2.

Hence, W|0><0|λ|1><1|(0)<0 as soon as 0λ<1/2. For 1/2<λ1, we arrive at the same conclusion by looking at the state Inline graphic, obtained by sending λ1-λ.

We proceed by showing how the above result bridges the gap between classical and quantum central limit theorems. We now fix an m-mode quantum state ρ, and notice that ρ2n=(ρρ)n. Consider the probability density function Inline graphic, where positivity holds by (60). Let X be a random variable with density fX. The mean and covariance matrix of X coincide with those of ρρ, which are in turn the same as those of ρ. Hence, at the level of Gaussifications, fG=WρG. We write for an i.i.d. family of random variables Xi with law fX

graphic file with name 220_2021_3988_Equ179_HTML.gif

where 1 follows from (47) and 2 follows from the change of variables unu. This implies by applying the classical and quantum Plancherel identities (26) that

ρ2n-ρG22=π-mχρ2n-χρGL2(R2m)2=πmWρ2n-WρGL2(R2m)2=πmf(X1++Xn)/n-fGL2(R2m)2 62

which shows that the QCLT is equivalent to a certain CLT for classical i.i.d. random variables. The problem with this approach is that the right classical tool to use here would be an estimate on the rate of convergence of (X1++Xn)/n to the normal variable XG with respect to the L2 norm. However, it is known that convergence fails to hold in general, and even under some finiteness of moments assumption there does not seem to be a readily available result in the literature, that is powerful enough to be successfully employed here. Therefore, we do not pursue this route further here.

Decay estimates on the quantum characteristic function

Before studying the rate of convergence in the quantum central limit theorem, we show that quantum characteristic functions have the so-called strict non-lattice property. To motivate this property, we start by recalling some basic properties of characteristic functions from classical probability theory.

The characteristic function χXcl of a classical random variable X always attains the value one at zero. However, it can also attain the value one, in absolute value, at any other point. The random variables that exhibit this latter behaviour are precisely those that are lattice-distributed;10 see also [69, Section 3.5]. Examples include the Dirac, Bernoulli, geometric and Poisson distributions.

Knowing that χXcl(t)<1 for all values t0 however does not imply that lim suptχXcl(t)<1. This latter condition is known as the strict non-lattice property of a random variable. An example of a non-lattice distributed random variable which does not satisfy the strict non-lattice property is as follows.

Example 1

([69, Section 3.5]). Consider an enumeration of the positive rationals q1,q2,Q+ with qii and a non-lattice random variable X defined by

P(X=qn)=P(X=-qn)=2-(n+1).

The random variable X is then given by

χXcl(t)=E(eitX)=n=1(eitqn+e-itqn)P(X=qn)

which simplifies to

χXcl(t)=i=1costqi2i.

Let qi=piri where piZ and riN0, by considering times tn=2πi=1nri for arbitrarily large n, one has lim suptχXcl(t)=1.

We now show the surprising fact that quantum characteristic functions do not exhibit this somewhat pathological behaviour. Instead, for any quantum state ρ it holds that lim sup|z|χρ(z)=0, as the proof of Proposition 14 below shows.

Proof of Proposition 14

Thanks to the spectral theorem and by the dominated convergence theorem, it suffices to prove that lim|z|χψf(z)=0 for all wave function fL2(Rm), where Inline graphic, and |ψf> is the pure state with wave function f. We rephrase this as the requirement that χψf belongs to the Banach space C0Cm, where the norm on C0Cm is the supremum norm.

We consider smooth compactly supported functions f first. For such functions, the claim follows by combining (i) Eq. (19); (ii) the fact that f is normalised, i.e. dmx|f(x)|2=1; and (iii) the Riemann–Lebesgue lemma. For general fL2(Rm), the result then follows by a density argument: for an arbitrary fL2(Rm) there is a sequence of smooth and compactly supported functions fnCc(Rm) converging to fL2(Rm), so that

graphic file with name 220_2021_3988_Equ180_HTML.gif

Since C0(Cm) is a Banach space and χψfnC0(Cm), this implies that also the limit χψfC0(Cm). Thus, to complete the proof of (58) it suffices to show that for every ε>0 and any zCm\B(0,ε) one has that χψf(z)<1. If this were not the case, then |ψf> would be an eigenvector of the displacement operator D(z). This is well known to be impossible, see e.g. [28, Lemma 10].

For a given state ρ and some fixed ε>0, Proposition 14 tells us that there exists a constant η(ρ,ε)<1 such that maxzCm\B(0,ε)χρ(z)η(ρ,ε) (cf. (58)). However, the problem of characterising the quantity η(ρ,ε) in terms of some physically meaningful property of the state ρ remains. To this end, a natural candidate turns out to be the energy of the state. To see why this is the case, consider the following simple example.

Example 2

(Squeezed states). For every zCm and every δ(0,1) there is a (Gaussian) state ρG of mean photon number TrρGHmt28ln11-δ-14 such that χρG(z)1-δ.

To see that this is the case, up to the application of passive symplectic unitaries, it suffices to consider the case z=(t,0,,0), where t>0. Consider the ‘squeezed’ Gaussian state [7072] defined by the characteristic function

graphic file with name 220_2021_3988_Equ181_HTML.gif

where we set Inline graphic. The mean photon number of ρG is well known to be given by TrρGHm=14η+1η-1214η-14, where we used the fact that η1.

The above example shows that any estimate on η(ρ,ε) can be reasonably expected to depend on the energy. We now show that our preliminary work on the quantum–classical correspondence allows us to derive a general upper estimate for |χρ(z)| at any designated point zCm in terms of the energy of the state ρ. For this purpose, we draw upon some important mathematical results from the well-developed theory of classical characteristic functions. Proposition 15, whose proof we present now, implies e.g. that for a one-mode state ρ, we can take η(ρ,ε)=1-cEminε2,CE, where E is the energy of ρ, and cC are universal constants.

Proof of Proposition 15

Denoting as usual with |z| the Euclidean norm (4) of zCm, we write the following chain of inequalities.

graphic file with name 220_2021_3988_Equ182_HTML.gif

Here, 1 is an application of the quantum convolution rule (cf. the n=2 case of (49)). In 2 we introduced the classical random vector X(ρρ) taking values in Cm, with probability distribution given by the Wigner function Wρρ, which is everywhere non-negative by Lemma 16. The inequality in 3, which is the non-trivial one, follows from [61, Corollary 2.7.2]: we set Inline graphic, with the latter estimate coming from (21), and α=2, so that

graphic file with name 220_2021_3988_Equ183_HTML.gif

also, we substituted m2m, because our phase space Cm has real dimension 2m; finally, we used the well-known formula Γ(m+1/2)=π2-m(2m-1)!!, where (·)!! is the bi-factorial. Lastly, the inequality in 4 is just an application of the elementary estimate 1-x1-x2 for 0x<1.

Remark

In [61, Section 2.7], several other estimates for χXcl(t) are derived. While we decided to stick to the simplest one, as it is already very instructive, it is possible to substantially improve over it, e.g. by resorting to non-isotropic estimates (cf. for instance [61, Theorem 2.7.14]). Notably, our quantum–classical correspondence allows us to translate all of these inequalities to the quantum setting, up to an irrelevant factor of 1/2 in the associated constants (see step 4 in the above proof). We do not pursue this approach further, though we want to stress that it immediately leads to a plethora of further results.

Quantitative Bounds in the QCLT: Proofs

In this section, we provide proofs of the convergence rates in our quantum Berry–Esseen theorems. We also provide proofs of some of the statements in Sect. 4.3 on the convergence rate for cascades of beam splitters converging to thermal attenuator channels.

Outline of this section:. To fix ideas, we give a high-level outline of our proofs:

  • Williamson form: We apply a suitable symplectic unitary to the state, so as to make the Hessian of its characteristic function diagonal and larger than the identity. Subsequently, we use the quantum Plancherel identity to express the difference of the convolved state and its Gaussification in Hilbert–Schmidt norm as a difference of quantum characteristic functions in L2 norm (Sect.  6.1).

  • Local-tail decomposition: We then split the integral of the L2 norm of the difference of the quantum characteristic functions of the convolved state and the Gaussification of the original state into a regime around zero (Lemma 17), in which we can control the behaviour of the quantum characteristic function by its Taylor expansion, and a tail-regime in which we estimate the difference using Proposition 14. The error in the Taylor expansion is controlled by the phase space moments of the state, cf. Lemma 18.

  • Hilbert–Schmidt convergence: We implement the above ideas to prove Theorems 6 and 7, and Proposition 22 (Sect.  6.2).

  • Trace norm and entropic convergence: We then use the preservation of the boundedness of the second moment under quantum convolutions to obtain a quantitative estimate of convergence in trace distance, employing Markov’s inequality and the Gentle Measurement Lemma [73], and in relative entropy, using entropic continuity bounds [33] (Sect.  6.3).

  • Convergence rates for cascades of beam splitters: In the final subsection, we prove the results claimed in Sect.  4.3, namely convergence rates for cascades of beam splitters converging to thermal attenuator channels (Sect.  8).

Preliminary steps

Williamson form

Let ρ be a centred m-mode quantum state with finite second moments, as in the Cushen–Hudson theorem. It is known that one can find a symplectic unitary V and numbers ν1,,νm1 such that

graphic file with name 220_2021_3988_Equ63_HTML.gif 63

satisfies

χρ(z)=1-12jνj|zj|2+o|z|2(z0). 64

With a slight abuse of terminology, we will call ρ the Williamson form of ρ [74]. Bringing a state to its Williamson form allows us to assume that (i) the smallest eigenvalue of its covariance matrix is at least one. Also, (ii) the transformation in (63) does not change the first moments of the state, so that if ρ is centred then ρ remains centred. Finally, (iii) the same unitary V brings not only ρ but also its Gaussification ρG to their Williamson forms simultaneously, so that

χρG(z)=exp-12jνj|zj|2,WρG(z)=2πmexp-2jνj|zj|2. 65

holds as well. Thanks to the covariance of the quantum convolution with respect to symplectic unitaries (50), we see that

ρn-ρG2=Vρn-ρGV2=(ρ)n-ρG2.

Combining this with the quantum Plancherel identity (26) yields

ρn-ρG22=d2mzπmχ(ρ)n(z)-χρG(z)2 66
=πmd2mzW(ρ)n(z)-WρG(z)2. 67

In short, when estimating any unitarily invariant distance of ρn from its limit ρG, we can assume without loss of generality that all states are in their Williamson forms. When the Hilbert–Schmidt norm is employed, we can compute the distance as an L2 norm at the level of characteristic functions, or equivalently at that of Wigner functions.

Local-tail decomposition

We continue with an important technical lemma that reduces the convergence in the quantum central limit theorem to the behaviour of the quantum characteristic function around zero.

Lemma 17

Let ρ be an m-mode quantum state with finite second-order phase space moment. Without loss of generality, we assume that ρ is centred and in Williamson form, and that its Gaussification ρG has characteristic function as in (65). Then for every ε>0 we have that

ρn-ρG22=1πm|z|nεd2mzχρznn-e-12jνj|zj|22+On- 68

as n. If ρ has also finite third-order phase space moments, then

ρn-ρG2m(m+1)(m+2)6D3χρ(0)n+O(n-)+1πm/2|z|nεd2mzχρznn-e-12jνj|zj|21+16nD3χρ(0)z×321/2, 69

where the Fréchet derivative of χρ is defined by (3).

Proof

The first identity (68) follows along the lines of the second one (69) and so we focus on verifying the latter. Using the quantum Plancherel identity (26) and the relation (46), we apply the triangle inequality and split the integration domain into two disjoint sets such that

πm2ρn-ρG2=d2mzχρznn-e-12jνj|zj|221/2|z|nεd2mzχρznn-e-12jνj|zj|21+16nD3χρ(0)z×321/2+|z|>nεd2mzχρznn-e-12jνj|zj|21+16nD3χρ(0)z×321/2+1nd2mze-12jνj|zj|216D3χρ(0)z×321/2. 70

The last term on the rightmost side of (70) can be estimated explicitly using spherical coordinates. Namely, combining the fact that the coefficients appearing in the Williamson form satisfy νj1 with the bound D3χρ(0)z×3D3χρ(0)|z|3, we obtain that

d2mze-12jνj|zj|216D3χρ(0)(z×3)2vol(S2m-1)D3χρ(0)2360dre-r2r2m+5=Γm+3vol(S2m-1)72D3χρ(0)2=πm36m(m+1)(m+2)D3χρ(0)2,

where we used that 0dre-r2r2m+5=Γ(m+3)2, and recalled the expression volSN-1=2πN/2Γ(N/2) for the volume of the (N-1)-sphere. Furthermore, the second-to-last term in (70) can be shown to be exponentially small. In fact,

graphic file with name 220_2021_3988_Equ184_HTML.gif

where in 1 we use that (a+b)22(a2+b2), in 2 we use that

|(fg)(x)|2dxsupx|f(x)|2|(g)(x)|2dx,

and in (3) we changed variables in the first integral to u:=zn. Finally, in 4, we used that the L2 norm of the characteristic function is at most one and switched to spherical coordinates to compute the second integral. In 5, instead, we estimated e-r2<e-ε22ne-r22 for r>nε. Note that the first addend goes to zero faster than any inverse power of n for n by Proposition 14. The second decays exponentially, essentially because the integral is bounded in n (in fact, it tends to 0 as n). This concludes the proof.

The first term on the right-hand side of (69) features an explicit dependence on n, while the second decays faster than any inverse power of n. Therefore, all that is left to do is to estimate the third term, which can be done by looking at the behaviour of the characteristic function in a neighbourhood of the origin. The first step in this direction, rather unsurprisingly, involves a Taylor expansion of χρ around 0. In the subsequent lemma we record various important estimates of this sort, which will play a key role in the proofs of our quantum Berry–Esseen theorems.

Lemma 18

For ε>0 and k[0,), let ρ be an m-mode state with finite phase space moments of order up to k (namely, with the notation of Definition 2, assume that Mk(ρ,ε)<). Then for all zCm with |z|nε it holds that

graphic file with name 220_2021_3988_Equ71_HTML.gif 71

In particular, if ρ is centred and in Williamson form,

χρzn-1m(2m+1)2M2(ρ,ε)|z|2n, 72
χρzn-1+12njνj|zj|2m(2m+1)2M2+α(ρ,ε)|z|2+αn1+α2, 73
χρzn-1+12njνj|zj|2m(m+1)(2m+1)9M3(ρ,ε)|z|3n3/2, 74
χρzn-1+12njνj|zj|2-16n3/2D3χρ(0)z×3m(m+1)(2m+1)(2m+3)144M4(ρ,ε)|z|4n2, 75

depending on what phase space moments are finite. In (73), we assumed that α(0,1).

The estimate in (71) follows immediately from using Hölder continuity of the derivative.

Proofs of convergence rates in Hilbert–Schmidt distance

We start with the proof of Theorem 6 assuming fourth-order moments.

Proof of Theorem 6

By the discussion in Sect. 6.1.1, we can assume that ρ is in Williamson form, namely, that its characteristic function satisfies (64), with ν1,,νm1. Since M2(ρ,ε) is monotonically non-decreasing in ε, for any fixed μ(0,2) we can chose ε>0 small enough so that for any Inline graphic it holds that

m(2m+1)2M2(ρ,ε)|z|2nm(2m+1)2ε2M2(ρ,ε)μ2. 76

Looking at (72), this implies that 21-χρznμ. Now, for xC with |x|<2 define the function

graphic file with name 220_2021_3988_Equ77_HTML.gif 77

Substituting x=21-χρzn, we then have that

logχρzn+1-χρzn=-1-χρzn2a21-χρznm2(2m+1)24M2(ρ,ε)2a(μ)|z|4n2, 78

where to deduce the last inequality we observed that |x|μ implies that |a(x)|a(μ). Then, thanks to (78) and (74), an application of the triangle inequality yields

logχρznn+12jνj|zj|2nlogχρzn+1-χρzn+nχρzn-1+12jνj|zj|2m2(2m+1)24M2(ρ,ε)2a(μ)|z|4n+m(m+1)(2m+1)9M3(ρ,ε)|z|3nC1|z|3n, 79

where for fixed m the constant C1 depends only on M3 (remember that M2M3 by construction). Using again (78) but now in conjunction with (75), by a swift application of the triangle inequality we see that

graphic file with name 220_2021_3988_Equ80_HTML.gif 80

where for fixed m the constant C2 depends only on M4 (remember that M2M4 by construction). We now estimate

graphic file with name 220_2021_3988_Equ81_HTML.gif 81

Here, 1 follows simply by the triangle inequality. In 2, we (i) observed that eu-(1+u)|u|2e|u|; (ii) operated the substitution u=logχρznn+12jνj|zj|2; (iii) noted that Rxx2ex is a monotonically increasing function; and (iv) used the fact – proved in (79) – that |u|C1|z|3n. Finally, in 3 we remembered that |z|nε and assumed that ε>0 is small enough so that εC114. Now, since ν1,,νm1, we can rephrase the above estimate as

χρznn-e-12jνj|zj|21+16nD3χρ(0)z×31ne-14|z|2C12|z|6+C2|z|4. 82

Upon integration, (82) naturally yields an upper bound for the second term on the right-hand side of (69). We obtain that

graphic file with name 220_2021_3988_Equ83_HTML.gif 83

The justification of the above steps goes as follows: in 4 we switched to spherical coordinates; in 5 we performed the change of variables Inline graphic; in 6 we computed the gamma integrals, also remembering that volS2m-1=2πm(m-1)!; finally, the constant C3 introduced in 7 depends – for fixed m – only on M4 (note that M3M4 by construction). The proof of the first claim is completed once one inserts (83) into (69). In particular, if D3χρ(0)=0 we see that the convergence rate is OM4n-1. This proves also the second claim.

We continue with the proof of the low-regularity QCLT that assumes finiteness of phase space moments of order up to 2+α, for some α(0,1].

Proof of Theorem 7

We just deal with the case where α(0,1). As above, we start by fixing μ(0,2) and choosing a sufficiently small ε>0 so that for any Inline graphic the inequality (76) holds. By a similar estimate as in (79), but now leveraging (73) instead of (74), we have that for any zB0,nε

logχρznn+12jνj|zj|2nlogχρzn+1-χρzn+nχρzn-1+12jνj|zj|2m2(2m+1)24M2(ρ,ε)2a(μ)|z|4n+m(2m+1)2M2+α(ρ,ε)|z|2+αnα/2C4|z|2+αnα/2, 84

where the constant C4 introduced in the last line depends only on M2+α (note that M2M2+α).

graphic file with name 220_2021_3988_Equ185_HTML.gif

Here, in 1 we used the elementary estimate eu-1|u|e|u|, together with the observation that the function Rxxex is monotonically increasing. In 2 we used the fact that |z|nε, and chose ε>0 sufficiently small so that εαC414. Combining the above estimate with the fact that ν1,,νm1 yields

χρznn-e-12jνj|zj|2C4|z|2+αnα/2e-14|z|2, 85

which upon integration in turn leads to

graphic file with name 220_2021_3988_Equ86_HTML.gif 86

Here, in 3 we switched to spherical coordinates; in 4 we operated the change of variables Inline graphic and computed the gamma integrals; the constant introduced in 5 depends, for fixed α, only on M2+α. Inserting (86) into the right-hand side of (69) completes the proof.

Convergence in trace distance and relative entropy

In this section, we further use the assumption of finiteness of the second moments of the state in order to find convergence rates in trace distance and in relative entropy.

Proof of Corollary 8

The hypothesis implies in particular that ρ has finite phase space moments of the second order. By Theorem 28, this amounts to saying that ρ has also finite standard moments of the second order, that is, that TrρHmE<. Iterating (40) and passing to the limit, we see that in fact

TrρnHm=TrρGHm=TrρHmE.

Now, for any E>0, denote by PE the projection onto the finite dimensional subspace generated by the eigenvectors of the canonical Hamiltonian Hm of eigenvalue less than E. Then, by Markov’s inequality, for any ε>0,

TrρnPE/ε,TrρGPE/ε1-ε.

From the so-called ‘gentle measurement lemma’ [73, Lemma 9], we have that

ρn-PE/ερnPE/ε1,ρG-PE/ερGPE/ε12ε.

Then,

ρn-ρG1ρn-PE/ερnPE/ε1+PE/ερn-ρGPE/ε1+PE/ερGPE/ε-ρG14ε+PE/ερn-ρGPE/ε14ε+PE/ε2PE/ερn-ρGPE/ε24ε+(E/ε)m/2ρn-ρG2

The result follows after optimising over ε>0. In particular, if ρn-ρG2=On-α, we find that ρn-ρG1=On-αm+1.

We now turn to the proof of the convergence in relative entropy. Observe that, since ρn and ρG share the same first and second moments, TrρnlogρG=TrρGlogρG and thus DρnρG=SρG-Sρn. The result follows directly from [33, Lemma 18].

Optimality of Convergence Rates and Necessity of Finite Second Moments in the QCLT: Proofs

In this section we discuss the optimality of our results in two different directions:

  • First, we provide examples of states ρ that do not have finite second moments and for which ρn does not converge to any quantum state. This shows the necessity of the assumptions on finite second moments in the Cushen–Hudson Theorem (Sect. 7.1).

  • Secondly, we provide examples of explicit states which saturate our convergence rates in Theorems 6 and 7 (Sect. 7.2).

Failure of convergence for states with unbounded energy

We now show that the assumption of finiteness of second moments in Theorems 3 and 5 cannot be weakened, e.g. by replacing it with finiteness of some lower-order moments. Some examples of states with undefined moments that do not satisfy Theorems 3 and 5 can be obtained by drawing inspiration from probability theory. For instance, remembering that a classical Cauchy-distributed random variable does not satisfy the central limit theorem, we construct the following example.

Example 3

(Cauchy-based wave function). Consider the pure state |ψf> with wave function Inline graphic. The characteristic function of this state can be computed thanks to (19), which in this case evaluates to

χ|ψf><ψf|(z)=2e-|zI|2+izR2+izR.

The absolute value of this characteristic function is illustrated in Fig.  3.

Fig. 3.

Fig. 3

Example of the modulus of a quantum characteristic function, taken from Example 3, with heavy tails in a single direction

We then find the pointwise limit limnχ|ψf><ψf|z/nn=δz,0 which again is not continuous at 0 and hence is not the characteristic function of any quantum state.

The main drawback of the above state is that it does not have even first order moments. We can fix this by considering a slightly more sophisticated example. To proceed further, we first need to recall a well-known integral representation of fractional matrix powers.

Lemma 19

([46, Proposition 5.16]). For all r(0,1), all positive (possibly unbounded) operators A, and all |ψ>DomA1/2, we have that

Ar/2|ψ>2=sin(πr)π0tr-1<ψ|AtI+A|ψ>dt, 87

where all functions of A are defined by means of its spectral decomposition.

Proof of Proposition 11

The state is clearly centred, for instance because the wave function is symmetric under inversion x-x. We proceed to prove claim (b). Note that, since x2+p2=I+2aaI, 2aa=x2+p2+I2(x2+p2), where Inline graphic is the momentum operator. We now apply the operator inequality (A+B)rAr+Br, which can be shown to hold for all r[0,1] and all positive (possibly unbounded) self-adjoint operators AB. To prove this explicitly in the non-trivial case where r(0,1), we apply (87) to A+B. For a generic |ψ>DomAr/2DomBr/2, we obtain that

(A+B)r/2|ψ>2=sin(πr)π0tr-1<ψ|A+BtI+A+B|ψ>dt=sin(πr)π0tr-1<ψ|AtI+A+B+BtI+A+B|ψ>dtsin(πr)π0tr-1<ψ|AtI+A|ψ>+<ψ|BtI+B|ψ>dt=Ar/2|ψ>2+Br/2|ψ>2,

where the inequality in the above derivation follows e.g. from [46, Corollary 10.13]. Now, setting A=x2, B=p2 and r=1-δ, we obtain that

(aa)1-δ|x|2(1-δ)+|p|2(1-δ)|x|2(1-δ)+1+p2.

Computing the expectation value on |ψf> yields

<ψf|(aa)1-δ|ψf><ψf||x|2(1-δ)+1+p2|ψf>=1πΓ32-δΓ(δ)+1+710,

where the last step is by explicit computation. This proves (b). We now move on to (c). For this we evaluate the characteristic function of the convolution Inline graphic on the purely imaginary line. For tR, using (19) we obtain that

χ|ψf><ψf|(it)=-+dx|f(x)|2e2itx=2|t|K12|t|,

were K1 is a modified Bessel function of the second kind, and the last equality follows from (54) and [75, Eq. (9.6.25)]. Therefore, for any fixed t>0 it holds that

limnχ|ψf><ψf|n(it)=limnχ|ψf><ψf|itnn=limn1+c+logt-12lognt2n+On-3/2n=0,

where we have used the expansion in [75, Eq. (9.6.53)] (see also [75, Eq. (6.3.2) and (9.6.7)]). Since χ|ψf><ψf|n(0)=1 for all n because |ψf><ψf|n is a valid quantum state, the sequence of functions χ|ψf><ψf|n does not possess a continuous limit. Hence, it cannot converge to the characteristic function of any quantum state. This proves (c).

Optimality of the convergence rates

The following two examples show that the bounds stated in Theorems 6 and 7 are indeed saturated. Both examples consist of states constructed using the Fock basis. The construction of examples saturating the bounds in Theorems 6 and 7 is motivated by the following Proposition.

Proposition 20

Let ρ be a one-mode density operator satisfying the assumptions of Theorem 6 and also <i|ρ|j>=0 for |i-j|1,3. Then the state ρn converges at least with rate On-1 to its Gaussification

ρn-ρG2=On-1.

In particular, every density operator satisfying the assumptions of Theorem 6 that is diagonal in the Fock basis achieves a O(n-1) rate.

Proof of Proposition 20

By Theorem 6 it suffices to show that D3χρ(0)=0 under the assumptions of the Proposition. We start by recalling that any density operator ρ has an expansion into the Fock basis such that

graphic file with name 220_2021_3988_Equ88_HTML.gif 88

Hence, we find for the characteristic function that

χρ(z)=i,j=0<i|ρ|j>χ|i><j|(z). 89

Using a finite-rank approximation of the density operator ρ, it suffices then by Theorem 9 to analyse the component-wise derivatives in (89). The functions χ|i><j| are explicitly given by [59, Eq. (4.4.46) and (4.4.47)]

χ|i><j|(z)=i!j!(-z)j-ie-|z|22Ljj-i|z|2ifij,j!i!(z)i-je-|z|22Lii-j|z|2ifi>j. 90

Here, Inline graphic are the associated Laguerre polynomials. By assumption, it suffices to consider the case where |i-j| is even or |i-j| is odd and at least 5. We find that by writing the characteristic function in the form Inline graphic for some suitable function Hji, as in (90), that for the different possible third derivatives, we have

z3χ|i><j|(0)=-3zHji(0)+z3Hji(0),z3χ|i><j|(0)=-3zHji(0)+z3Hji(0),z2zχ|i><j|(0)=-zHji(0)+z2zHji(0),z2zχ|i><j|(0)=-zHji(0)+z2zHji(0).

Therefore, the only possible non-zero contribution to the third derivative of the quantum characteristic function χρ at zero could be due to terms that contain either one or three derivatives of functions Hji evaluated at zero.

If |i-j|4 then z and z appear in (90) with a joint power of at least 4; thus, this term’s contribution necessarily has to vanish. It suffices therefore to consider the case where |i-j|=2. If Hji is only differentiated once, then it is clear that this derivative has to vanish at zero, since z,z appear with a joint power of at least two.

If Hji is differentiated three times, then the term |z|2 causes the derivative to vanish at zero unless this term is differentiated precisely two times. This, however, implies that the Laguerre polynomial is differentiated exactly once. However, by the chain rule any first order derivative of the term Lj|j-i|(|z|2) vanishes at the origin. This concludes the proof.

The following example shows that the O(n-1) convergence rate stated in Proposition 20, under the assumption that D3χρ(0)=0, is in fact attained.

Example 4

(O(n-1)-rate). By Proposition 20 we can take Inline graphic to obtain a convergence rate of at least O(n-1) in the QCLT. That the O(n-1) rate is actually attained is illustrated in the right figure in Fig. 4. The O(n-1) rate is saturated both in Hilbert–Schmidt and trace norm.

The following example shows that the O(n-1/2) convergence rate of Theorem 7 is attained.

Example 5

(O(n-1/2)-rate). Consider the state11

ρ=|0>+|3>2<0|+<3|2.

Its characteristic function is explicitly given by (90)

χρ(z)=112e-|z|2212-18|z|2+6z3-(z)3+9|z|4-|z|6

Now, since <0|ρ|3>0 we see that the condition of Proposition 20 does not hold. One verifies directly that χρ(z)=1-2|z|2+o|z|2, so that ρ is already in Williamson form (cf. (64)). Letting Φ(z)=e-2|z|2, we then find that χρ-ΦL2(R2) converges with rate n-1/2, see Fig. 4.

The following example shows that the O(n-α/2) convergence rate of Theorem 7 is attained at least for α=1/2.

Example 6

Consider the probability density function p on R given by

graphic file with name 220_2021_3988_Equ186_HTML.gif

Its Fourier transform reads

graphic file with name 220_2021_3988_Equ91_HTML.gif 91

where Kν(z) is again the modified Bessel function of the second kind, and (91) follows from [75, Eq. (9.6.25)]. Define the single-mode quantum state

graphic file with name 220_2021_3988_Equ187_HTML.gif

where

graphic file with name 220_2021_3988_Equ188_HTML.gif

is a so-called coherent state [7679]. The characteristic function of ρ can be easily computed as

χρ(z)=-+dtp(t)e-|z|2/2-2itzI=p^(2zI)e-|z|2/2=2|zI|5/4Γ(5/4)K5/4(2|zI|)e-|z|2/2,

which leads us to

χρn(z)=χρz/nn=2n|zI|5n/4Γ(5/4)nn5n/8K5/42|zI|nne-|z|2/2.

On the other hand, a little thought confirms that ρ has vanishing first moments and second moments given by Tr[ρx2]=9/2 and Tr[ρp2]=1/2. Its Gaussification then reads

graphic file with name 220_2021_3988_Equ189_HTML.gif

We also observe that: (a) ρ has finite standard moments of order up to 5/2-δ, for all δ>0; but (b) it has no well-defined phase space moments (nor standard moments) of order 5/2.

To prove claim (a), start by setting Inline graphic. Assuming that δ1/2 so that β1, for all tR we have that

graphic file with name 220_2021_3988_Equ190_HTML.gif

where 1 is just the definition of coherent state, 2 comes from the concavity of the function xxβ-1 and from the fact that qn=t2ne-t2n! is a probability distribution over N, and finally in 3 we used the formula n=0xnn!(n+1)=(1+x)ex. From the above calculation we now deduce that

Trρaaβ-+dtp(t)1+t2β=Γ(7/4)πΓ(5/4)-+dt(1+t2)(1+δ)/2<,

as claimed.

To prove claim (b), it suffices to use [75, Eq. (9.6.10) and (9.6.11)] in order to write zνKν(z)=A(z)+z2νln(z)B(z), with ν>0, AB analytic functions, and B(0)0. Setting ν=5/4 shows that the phase space moment of ρ of order 5/2, as constructed in Definition 2, is not well defined, formally M5/2(ρ,ε)=+ for all ε>0.

We now present numerical evidence hinting at the fact that ρn-ρG2=O(n-1/4) for our choice of ρ. Note that

ρn-ρG22=d2zπe-4|zI|2-|z|2/2-2n|zI|5n/4Γ(5/4)nn5n/8K5/42|zI|nne-|z|2/22=2π0+due-9u2/2-2nu5n/4Γ(5/4)nn5n/8K5/42unne-u2/22. 92

The above integral can be evaluated numerically to a high degree of precision. Plotting the function -lnρn-ρG2 against lnn shows that ρn-ρG2 decays as O(n-1/4), cf. Figure 5. By what we have learnt above, Theorem 7 predicts a convergence at least as fast as O(n-1/4+δ) for every fixed δ>0, and is therefore tight at least for α=1/2.

Fig. 5.

Fig. 5

This plot shows the expression n-1/4ρn-ρG, with ρn and ρG as in (92). In particular, this figure shows that the O(1/n1/4) convergence rate is sharp (Theorem 7) for α=1/2

Cascade of Beam Splitters: Proofs

In this section, we prove the results claimed in Sect.  4.3, namely convergence rates for cascades of beam splitters converging to thermal attenuator channels.

Generalities of the cascade channels

In order to study the convergence of the cascade channel, we start by proving the following elementary equivalence.

Lemma 21

For an m-mode quantum state ρ, some λ[0,1], and a positive integer n, consider the cascade channel Inline graphic (cf. (41)). One has that

graphic file with name 220_2021_3988_Equ93_HTML.gif 93

where the effective environment state ρ(λ,n) is defined via its characteristic function

graphic file with name 220_2021_3988_Equ94_HTML.gif 94

Proof

We proceed by induction. The case n=1 follows from (38). Let us assume that the claim holds for n-1, so that

graphic file with name 220_2021_3988_Equ191_HTML.gif

By setting μ=λ(n-1)/n we see that

graphic file with name 220_2021_3988_Equ192_HTML.gif

Since Inline graphic, composition with the nth copy of the channel Inline graphic yields

graphic file with name 220_2021_3988_Equ193_HTML.gif

which proves (93) and (94). Finally, one can also verify by induction that

ρ(λ,n)=ρλn-1n,n-1η(λ,n)ρ, 95

where Inline graphic, so that ρ(λ,n) is a legitimate quantum state for all λ[0,1] and all n.

On the effective environment state

Thanks to Lemma 21, the study of the cascade channel Inline graphic boils down to that of the iteratively convolved state ρ(λ,n) of (94). Since such a convolution is not symmetric (cf.  (49)), to proceed further we need to extend our quantum Berry–Esseen results to a non-i.i.d. scenario. Note that the classical central limit theorem has indeed been extended to sequences of independent, non-identically distributed random variables [1, 80], and even to sequences of correlated random variables [81]. Rates of convergence for the former case can be found for instance in [82] (see e.g. Theorem 13.3 of [82]).

Proposition 22

Let ρ be a centred m-mode quantum state with finite second-order phase space moments. Then the sequence of quantum states ρ(λ,n) defined via (94) converges to the Gaussification ρG of ρ in trace norm. Moreover, if ρ has finite third-order phase space moments then

ρ(λ,n)-ρG2=Oλ,M3n-1/2, 96
ρ(λ,n)-ρG1=Oλ,M3n-1/(2m+2). 97

Here, M3=M3(ρ,ε) is defined by (33), and ε>0 is sufficiently small.

Proof

The argument is a variation of that used to prove Theorem 7 in Sect. 6.2. First of all, reasoning as in Sect. 6.1.1, we can assume without loss of generality that ρ is in its Williamson form. To simplify the notation, we introduce the re-scaled vectors Inline graphic, where {1,,n}. Then clearly χρ(λ,n)(z)==1nχρw. Note that wlog1/λn(1-λ)|z|; substituting zw into (72) and (74), we see that whenever |z|n(1-λ)log1/λε it holds that

χρw-1m(2m+1)2M2(ρ,ε)log1λ1-λ|z|2n, 98
χρw-1+121-λ1/n1-λλ-1njνj|zj|2m(m+1)(2m+1)9M3(ρ,ε)log1λ1-λ3/2|z|3n3/2. 99

We start by choosing ε>0 small enough so that (76) holds for some μ(0,2). We can now mimic the calculations in (79), obtaining

graphic file with name 220_2021_3988_Equ100_HTML.gif 100

Here, in 1 we observed that =1n1-λ1/n1-λλ-1n=1 and applied the triangle inequality. To deduce 2, instead, we proceeded as for (78). Namely, on the first addend we used the identity log1-x2+x2=-x24a(x) satisfied by the function a(x) defined by (77), we set x=21-χρw, we noted that |x|μ implies that |a(x)|a(μ), and lastly we employed (98). The second addend, instead, has been estimated thanks to (99). Finally, for fixed m the constant introduced in 3 depends only on M3 and λ (again, M2M3 by construction).

Proceeding as usual, we continue to estimate

graphic file with name 220_2021_3988_Equ194_HTML.gif

Note that in 4 we applied the elementary inequality eu-1|u|e|u|, observed that Rxxex is a monotonically increasing function, and leveraged the bound in (100). In 5, instead, we wrote C6|z|3nC61-λlog1λε|z|214|z|2, where the last estimate holds provided that ε>0 is small enough.

Remembering that ν1,,νm1, we can massage the above relation so as to get

χρ(λ,n)(z)-e-12jνj|zj|2C6|z|3ne-14|z|2. 101

Now, we can repeat the steps that led to (68). We obtain that

graphic file with name 220_2021_3988_Equ102_HTML.gif 102

The justification of the above steps is as follows. The estimate in 6 is just an application of the triangle inequality. In 7 we used (101) and the elementary fact that |u+v|22|u|2+2|v|2 on the second addend. As for 8, we: (i) performed the integral and introduced a constant C7 that depends on m only on the first addend; (ii) decomposed χρ(λ,n)(z)=χρ(w1)·=2nχρ(w) on the second; and (iii) used the fact that e-jνj|zj|2e-|z|2<e-ε22ne-12|z|2 in the prescribed range on the third. Finally, in 9 we noted that if |z|>nε then eventually in n

|w|=1-λ1/n1-λλ-12n|z|>12λlog1λε

for all {1,,n}; moreover, we used the fact that d2mu|χρ(u)|21 to evaluate the integral in the second addend.

Since the second term in the rightmost side of (102) decays faster than any inverse power of n as n thanks to Proposition 14, the proof of (96) is complete. Lastly, (97) follows similarly to Corollary 8.

Approximating cascade channels

With this convergence at hand, we provide a quantitative bound on the approximation of thermal attenuator channels by cascades of beam splitters (with possibly non-Gaussian environment states). Recall that, to an environment state ρ one can associate an attenuator channel Inline graphic of transmissivity 0λ1. The following simple lemma is crucial to convert the above state approximation result (Proposition 22) into a statement about approximations of attenuator channels.

Lemma 23

Given any two m-mode quantum states ρ1 and ρ2, and some λ[0,1], the corresponding channels defined as in (41) satisfy

Nρ1,λ-Nρ2,λρ1-ρ21. 103

Proof

Let R be any reference system, and let Inline graphic be a state on the bipartite system AR. Then

(Nρ1,λ-Nρ2,λ)idR(ω)1=TrR[Uλ(ωρ1)Uλ]-TrR[Uλ(ωρ2)Uλ]1Uλ(ωρ1-ωρ2)Uλ1=ρ1-ρ21,

where the inequality stems from the monotonicity of trace distance under quantum channels.

With this lemma at hand, we are ready to prove Theorem 12.

Proof of Theorem 12

Recall from Lemma 21 that Inline graphic, where ρ(λ,n) is the state with characteristic function given by (94). Applying Lemma 23 and Proposition 22, we have that

graphic file with name 220_2021_3988_Equ195_HTML.gif

concluding the proof.

Proof of Corollary 13

We now move on to Corollary 13. Let us start by proving the statement on quantum capacities, namely (56) and (57). Our aim is to apply [34, Theorem 9] to the two channels Inline graphic and NρG,λ=EN,λ, for the special case m=1 (cf. (42)). We set

graphic file with name 220_2021_3988_Equ196_HTML.gif

where the energy-constrained diamond norm is defined with respect to the canonical Hamiltonian, namely the number operator H=aa (see [36, Eq. (2)] and [34, Eq. (2)]). Note that εn=OM3n-1/4 by Theorem 12.

The input–output energy relations can be easily determined for both channels thanks to (95) and (40), which together show that Trρ(λ,n)aa=Trρaa=N=TrτNaa. One obtains that

graphic file with name 220_2021_3988_Equ104_HTML.gif 104

This means that we can set α=λ and E0=(1-λ)N, and hence E~=λE+(1-λ)N, in [34, Theorem 9]. We obtain that

graphic file with name 220_2021_3988_Equ197_HTML.gif

Here, in step 1 we applied [34, Theorem 9] together with the formula S(τN)=g(N) (see (28) and (29)); the inequality in 2 holds eventually in n for some universal constant c57+24loge, as can be seen by combining the bounds g(x)log(x+1)+loge (tight for large x) and g(x)-2xlogx (valid for sufficiently small x); finally, in 3 we used the fact that εnC(M3)n-1/4 eventually in n by the already proven Theorem 12, together with the observation that x-xlogx is an increasing function for sufficiently small x>0.

To complete the first part of the proof we need to estimate the classical capacity of Inline graphic in terms of that of the thermal attenuator EN,λ of (42), in turn given by (43). Although we could use the estimates in [34], we prefer to resort to the tighter ones provided in [36]. We obtain that

graphic file with name 220_2021_3988_Equ198_HTML.gif

The inequality in 4 is an application of [36, Proposition 6]. To see why, let us re-write the result of Shirokov [36, Proposition 6] for one-mode channels and with respect to the canonical Hamiltonian as

C(N1,E)-CN2,E2ϵ2t+rϵ(t)(log(E+1)+1-log(ϵt))+2gϵrϵ(t)+4h2(ϵt).

Here, Ni (i=1,2) are two quantum channels with 12N1-N2Eϵ, we picked E such that supρ:Tr[ρaa]ETrNi(ρ)aaE, the function rϵ is defined by Inline graphic, and Inline graphic is the binary entropy. Setting Inline graphic, N2=EN,λ, we see that E=λE+(1-λ)N (cf. (104) and [36, Eq. (21)]); choosing t=1/2 and hence rε(t)r1(t)=r1(1/2)=5/2 yields the above relation 4, as claimed. The inequality in 5 holds for all sufficiently large n and for some absolute constant c15. Finally, 6 is analogous to 3 above.

Remark

Let us stress that the threshold in n above which the inequalities in the above proof hold true depends on both λE+(1-λ)N and M3 (which dictates the rate of convergence of εn0). Although this is a minor point from the point of view of the mathematical derivation, it may be important for applications.

Remark

An analytical formula for the quantum capacity of the thermal attenuator that appears in Corollary 13 is currently not known. The best lower bound to date reads [45, Eq. (9)]

QEN,λ,Emax0x1x{gλEx+(1-λ)N-g12Dλ,N(x)+(1-λ)Ex-N-1-g12Dλ,N(x)-(1-λ)Ex-N-1}, 105

where

graphic file with name 220_2021_3988_Equ106_HTML.gif 106

The best upper bound to date, instead, can be obtained by combining the results of [40, Eq. (23)–(25)] (see also [41, Section 8]) with those of [44, Theorem 9] and [43, Theorem 46], in turn derived by refining a technique introduced in [42]. We look at the case where λN+1/2N+1, because below that value of λ the channel EN,λ becomes 2-extendable [83] (that is, anti-degradable [8486]) and therefore QEN,λ,E=0.

graphic file with name 220_2021_3988_Equ107_HTML.gif 107
graphic file with name 220_2021_3988_Equ108_HTML.gif 108
graphic file with name 220_2021_3988_Equ109_HTML.gif 109

Acknowledgements

ND would like to thank M. Jabbour for helpful discussions. LL acknowledges financial support from the European Research Council under the Starting Grant GQCOP (grant no. 637352) and from Universität Ulm; he is also grateful to V. Giovannetti, A. Holevo and K. Sabapathy for discussions on Lemma 16, and to M.B. Plenio and M. Wilde for sharing their thoughts on our model of optical fibre. SB thanks G. Baverez for interesting discussions on stable laws and gratefully acknowledges support by the EPSRC grant EP/L016516/1 for the University of Cambridge CDT, the CCA. CR acknowledges financial support from the TUM university Foundation Fellowship and by the DFG cluster of excellence 2111 (Munich Center for Quantum Science and Technology).

Appendix A: Standard Moments Versus Phase Space Moments: The Integer Case

In this appendix we prove that a state with finite standard moments of order up to k also has finite phase space moments of order up to k, i.e. Theorem 10. More precisely, we show that its characteristic function is differentiable k times, and that there are constants ck,m(ε)< such that the standard moments and phase space moments, defined by (31) and (33), respectively, satisfy Mk(ρ,ε)ck,m(ε)Mk(ρ) for all m-mode quantum states ρ. We start with the following lemma.

Lemma 24

For all positive integers m and real numbers k[0,), there is a universal constant dk,m>0 such that

jajajk/2dk,mj|xj|k+|pj|k, A1

where Inline graphic and Inline graphic are the position and momentum quadratures of the jth mode.

Proof

First of all, it suffices to consider the one-mode case. Indeed, assume that (aa)k/2dk(|x|k+|p|k) for some dk>0. Then, leveraging the fact that the operators ajaj commute with each other, and employing standard inequalities between p-norms, we deduce that

jajajk/2mmin{k/2-1,0}j(ajaj)k/2mmin{k/2-1,0}dkj|xj|k+|pj|k.

Therefore, from now on we look at the one-mode case only. The vector space Inline graphic of states with a finite expansion in the Fock basis is a core for both aak/2, as well as |x|k and |p|k. Thus, it suffices for us to prove the inequality (A1) on states in Vm.

It is enough to show that (aa)k/2dk|x|k for some constants dk>0, as the other inequality (aa)k/2dk|p|k is obtained by performing a phase space rotation of an angle π/2, i.e. by conjugating both sides by the unitary operator eiπ2aa.

We now prove that the inequality (aa)k/2dk|x|k holds for some dk>0 on all vectors in Vm. Write k=2rh, where r(0,1] and Inline graphic. Since the function AAr is well known to be operator monotone [46, Proposition 10.14], it suffices to show that (aa)hdhx2h for all non-negative integers hN0. To this end, let us take advantage of our restriction to states with a finite expansion in the Fock basis. Defining ΠN as the projector onto the span of the first N+1 Fock states (from 0 to N), we have to show that

graphic file with name 220_2021_3988_Equ199_HTML.gif

where the inequality now involves only matrices. Thanks to Gershgorin’s circle theorem [87, 88], in order to show that AN is positive semi-definite, it suffices to prove that AN is diagonally dominant, i.e. that for all N and 0nN the inequality

<n|AN|n>-n=0,,N,nn|<n|AN|n>|0 A2

holds true. Writing down the left-hand side yields

graphic file with name 220_2021_3988_Equ200_HTML.gif

Here, in 1 we extended the sum over n to all values that yield a non-vanishing result, i.e. those that satisfy |n-n|2h. In 2 we used the canonical commutation relations (7) to expand

x2h=2-h(a+a)2h=,0,+2hfh,,(aa)a+fh,,(aa)(a).

In 3 we applied standard estimates for factorials: for example, when nnn+2h we used the fact that nn!n!n(n)(n-n)/2(n)+(n-n)/2(n)2h(n+2h)2h; moreover, we defined Inline graphic. Finally, 4 follows by choosing e.g. dh-1=(2h+1)hFh. Since (A2) holds for all n, we conclude that AN0 for all N, which completes the proof.

Remark

The inequality in Lemma 24 depends critically on the special properties of the canonical operators. In fact, there is no universal constant dk>0 that makes the general relation (A+B)kdkAk+Bk true for all positive matrices A,B0. To see why this is the case, it suffices to consider two pure states Inline graphic and Inline graphic. Setting Inline graphic, it can be shown that the minimal eigenvalue of (A+B)k is λk, while that of Ak+Bk=A+B is clearly λ. By Weyl’s principle, the conjectured matrix inequality would imply that λkdkλ for all λ[0,1], absurd.

Proposition 25

Let k0 and m1 be integers; also, let ε>0 be given. Then, there is a constant ck,m(ε)< such that every m-mode quantum state ρ with finite k-moments Mk(ρ), as defined by (31), also satisfies

Mk(ρ,ε)=χρCkB(0,ε)ck,m(ε)Mk(ρ).

In particular, according to (33), ρ has finite phase space moments of order up to k.

Proof

Let ρ be an m-mode quantum state. We start by considering the modified state Inline graphic that is obtained by convolving it with the (multi-mode) vacuum state according to the rule (37) (for λ=1/2). A first important observation is that the moments of σ and ρ are related. Namely,

Mk(σ)Mk(ρ)k[0,). A3

To see why, we pick a multi-index nN0m and evaluate the nth diagonal entries of σ with respect to the Fock basis. We obtain that

graphic file with name 220_2021_3988_Equ201_HTML.gif

Here, in 1 we introduced the dephasing operator in the Fock basis, whose action is given by Inline graphic. In 2 we observed that Δ(ωδ)=Δ(ω)δ for all m-mode quantum states ω whenever δ=Δ(δ) is already diagonal in the Fock basis. To show this, first exploit linearity and factorisation of Δ to reduce to the one-mode case. Then, use the representation Δ(X)=02πdφ2πeiφaaXe-iφaa, valid for bounded X and where the integrals are as usual weakly converging, and remember that eiφaa+bb=eiφaaeiφbb is a function of the total Hamiltonian and thus commutes with the action of the beam splitter. The identity in 3 follows from the formula

graphic file with name 220_2021_3988_Equ202_HTML.gif

for the convolution of a Fock state with the vacuum. Here, ,N0m are multi-indices, ordered entry-wise, and Inline graphic. The above expression can be obtained easily e.g. by first reducing to the one-mode case, and then by induction on , employing the relations (35). Computing the kth moment of σ then yields

graphic file with name 220_2021_3988_Equ203_HTML.gif

Here, 4 and 7 follow from the representation in (32); in 5 we rearranged a double series of non-negative terms, and in 6 we observed that for a given N0m the coefficients Inline graphic form a probability distribution over the set of multi-indices nN0m with n. This proves that the kth moments of σ are upper bounded by those of ρ.

The state σ is also useful because its characteristic function is a close relative of that of ρ. Namely, according to (38) we have that χσ(z)=χρ(z/2)e-z2/4, and hence

χρCkB(0,ε)gk,m(ε)χσCkB(0,ε) A4

for some constants gk,m(ε). Thus, it suffices to find a suitable upper estimate for the norm χσCkB(0,ε). By Lemma 16, the Fourier transform of χσ, i.e. the Wigner function Wσ of σ, is everywhere non-negative. Hence, χσ can be seen as the characteristic function of a classical random variable Z over Cm, with probability density function Wσ. If we show that Z has finite absolute moments of order k, then thanks to [61, Theorem 1.8.15] we deduce that χσ is k-fold differentiable everywhere, and since

zαzβχσ(z)=d2mu(j(-uj)βj(uj)αj)Wσ(u)ezu-zud2muu|α|+|β|Wσ(u) A5

for all multi-indices α,βN0m, we in fact have that

graphic file with name 220_2021_3988_Equ115_HTML.gif A6

Therefore, we now look at the quantity Lk(σ). For a vector u=uR+iuICm, with uR,uIRm, we observe that

uk=j(uRj2+uIj2)k/2(2m)maxk2-1,0j|uRj|k+|uIj|k.

Thus,

graphic file with name 220_2021_3988_Equ204_HTML.gif

In the above derivation, the identity in 8 can be verified by first reducing to the case of a pure σ, which can be done by linearity and by multiple applications of Tonelli’s theorem, and by subsequently remembering that for a pure state |ψf> with wave function fL2(Rm) it holds e.g. that dmuIW|ψf><ψf|(u)=2f(2uR)2. The inequality in 9 is just an application of Lemma 24. Finally, in 10 we introduced a suitable constant ck,m1.

Combining the above estimate with (A4), (A6), and (A3), we deduce that

graphic file with name 220_2021_3988_Equ205_HTML.gif

which concludes the proof.

Appendix B: Standard Moments Versus Phase Space Moments: The Fractional Case

In the last section, we showed that the kth phase space moment was controlled by the kth standard moment in the case of an integer constant k.

Here, we show that this fact still holds when k is a positive real number by an interpolation argument. In principle, we could conclude this fact from the setting of Proposition 25, using that for Lp(w0) spaces with weight function w0 and Lp(w1) spaces with weight function w1, the real interpolation spaces [63, Theorem 5.4.1] satisfy

Lp(w0),Lp(w1)θ=Lp(wθ)

where wθ:=w01-θw1θ. This would allow us to extend the estimate in (A5) to fractional powers as well. However, we want to establish the stronger result that shows that the moments themselves naturally induce an interpolating family of normed spaces. That is, we show the following:

Proposition 26

Let ρ be an m-mode quantum state and k0. If Trρjajajk/2<, then χρCk(B(0,ε))< for some ε>0. Moreover,

χρCk(B(0,ε))CεTrρjajajk/2,

for some constant Cε>0.

We have seen in Appendix 8.3 that the map ρχρ is bounded from Inline graphic to Ck(B(0,ε)) for any k integer. Since the spaces Ck(B(0,ε)) form an interpolation family, meaning that for any k0,k1N0 with k1:=k0+1, C(1-θ)k0+θk1(B(0,ε))=(Ck0(B(0,ε)),Ck1(B(0,ε)))θ, we have from the previously mentioned interpolation method that

graphic file with name 220_2021_3988_Equ116_HTML.gif B1

for some positive constant Cε that comes from the bounds derived in Sect. 8.3 for k0 and k1. It only remains to prove that the interpolated norms Inline graphic can further be bounded above by ρW(1-θ)k0+θk1,1. First, we recall a useful technical lemma [89, Lemma 3.4].

Lemma 27

Let T=T11T21T21T22 be a positive semi-definite trace class operator such that T11:CdCd, then

T21112T111+T221.

Proof of Proposition 26

We provide the proof only for m=1, since the general case follows similarly. First, observe that

Trρ(aa)k/2=n=0<n|ρ|n>(n+1)k/2. B2

First, we restrict attention to states ρ that are orthogonal in the Fock basis. We then write ΠE for the spectral projection onto the Fock states of energy at most E, that is Inline graphic. Next, we fix two parameters 0k0k1 and introduce the quantity γn:=(n+1)(k1-k0)/2, fix a parameter t>0, define N0(t)N such that

nN0(t):γnt-1andγN0(t)+1t-1,

the two operators

X0(t):=(I-ΠN0(t))ρ(I-ΠN0(t))0andX1(t):=ΠN0(t)ρΠN0(t)0

and ρρdiag(t):=X0(t)+X1(t). Using these two operators we start estimating

graphic file with name 220_2021_3988_Equ206_HTML.gif

where αn=δn>N0(t) and βn=δnN0(t) with Kronecker delta δ. Thus, we obtain for the norm Inline graphic the upper bound

graphic file with name 220_2021_3988_Equ207_HTML.gif

We now recall that for γnt-1 we have αn=0 and βn=1 such that

t-θαn+tγnβn=t-θtγn=t1-θγn1-θγnθγnθ.

For γn>t-1 we have αn=1 and βn=0 such that

t-θαn+tγnβn=t-θγnθ.

Thus, in either case, we have the estimate

graphic file with name 220_2021_3988_Equ208_HTML.gif

This shows that for arbitrary density operators

graphic file with name 220_2021_3988_Equ209_HTML.gif

To extend the bound to a density operator ρ that is not diagonal in the Fock basis, and not only for the diagonal ρdiag(t), we partition ρ as

ρ=X0(t)ρ21(t)ρ21(t)X1(t)

and a self-adjoint diagonal operator S(k)(t):=diagS1(k)(t),S2(k)(t) where S1(k)(t):=ΠN0(t)(aa)k/4ΠN0(t) and S2(k)(t):=(I-ΠN0(t))(aa)k/4(I-ΠN0(t)). This implies that

T(k):=S(k)ρS(k)=S1(k)(t)X0(t)S1(k)(t)S1(k)(t)ρ21(t)S2(k)(t)S1(k)(t)ρ21(t)S2(k)(t)S2(k)(t)ρ22(t)S2(k)(t).

Let then S1(k)(t):=ΠN0(t)(aa)k/4ΠN0(t) and S2(k)(t):=(I-ΠN0(t))(aa)k/4(I-ΠN0(t)). The previous Lemma 27 then shows that

S1(k)(t)ρ21S2(k)(t)112S1(k)(t)ρ11S1(k)(t)1+S2(k)(t)ρ22S2(k)(t)1.

From here, we examine three cases separately:

  • Case 1: T11(k1)1T22(k1)1. In this case, we find from choosing X0:=ρ21 and X1:=0 in (53)
    graphic file with name 220_2021_3988_Equ210_HTML.gif
  • Case 2: T22(k0)1T11(k0)1 In this case, we find from choosing X0:=0 and X1:=ρ21 in (53)
    graphic file with name 220_2021_3988_Equ211_HTML.gif
  • Case 3: T22(k0)1T11(k0)1 and T22(k1)1T11(k1)1 In this case, we find from choosing X0=ρ21/2 and X1=ρ21/2 in (53) that
    graphic file with name 220_2021_3988_Equ118_HTML.gif B3

Hence, we have altogether that

graphic file with name 220_2021_3988_Equ212_HTML.gif

which implies that

graphic file with name 220_2021_3988_Equ213_HTML.gif

The result follows from the interpolation bound (B1).

Appendix C: Standard Moments Versus Phase Space Moments: A Partial Converse

We now show that at least for even integers k, the existence of kth order phase space moments implies the existence of standard moments of the same order.

Theorem 28

Let ρ be an m-mode quantum state such that its characteristic function χρ is 2k times totally differentiable at z=0 for some integer k, then the 2kth standard moment is finite as well.

Proof

For simplicity, we restrict attention to m=1. Let Inline graphic and Inline graphic be two Hamiltonians, and consider the spectral decomposition of the density operator Inline graphic. Then, there exist unique probability measures μei such that

<ei|f(Hlin±)|ei>=σ(Hlin±)f(λ)dμei(λ)for allfbounded measurable.

We then define the new probability measure Inline graphic such that

Trρf(Hlin±)=σ(Hlin±)f(λ)dμρ(λ)for allfbounded measurable.

We now proceed with an induction argument. Start by noting that for k=0 the result holds. For k1, define the auxiliary function φ:RC as

graphic file with name 220_2021_3988_Equ214_HTML.gif

which is by assumption 2k times differentiable at zero and let u(t)=Rφ(t). Then, u is also 2k times differentiable at zero. Since φ2k(0) exists, for t(-ε,ε), with sufficiently small ε>0, the function tφ(2k-1)(t) exists and is continuous.

We record that Taylor’s formula implies that for t(-ε,ε)

u(t)-i=0k-1u(2i)(0)t2i(2i)!|t|2k-1(2k-1)!supθ(0,1]u(2k-1)(θt),

where odd derivatives vanish at zero, since u is even.

We then define a positive continuous function fk:R[0,) with fk(0)=1 and for t0 as

graphic file with name 220_2021_3988_Equ215_HTML.gif

From Taylor’s formula above we obtain the following estimate for t sufficiently small

graphic file with name 220_2021_3988_Equ216_HTML.gif

Then, we have from Fatou’s lemma

Trρ(Hlin±)2k=Trρfk(0)(Hlin±)2klim inft0σ(Hlin±)fk(tλ)λ2kdμρ(λ)=lim inft0gk(t)=2k|u2k(0)|<.

Using integration by parts and standard estimates only, it is straightfroward to verify that the finiteness of both TrρHlin±2k implies the finiteness of Trρ(aa)k.

Footnotes

1

Throughout this paper we set ħ=1.

2

The characteristic function of a complex-valued random variable X is defined by Inline graphic.

3

That is, operators Inline graphic for which Inline graphic.

4

One way to define it is via the infinite sum S(ρ)=i(-pilogpi), where Inline graphic is the spectral decomposition of ρ. Since all terms of this sum are non-negative, the sum itself can be assigned a well-defined value, possibly +.

5

To define it one considers the infinite sum Inline graphic, where Inline graphic and Inline graphic are the spectral decompositions of ρ and σ, respectively. As detailed in [48], the convexity of xxlogx implies that all terms of this sum are non-negative, which makes the expression well defined.

6

While not all products of unitaries of the form e-iHquad can be written as a single exponential, two such factors always suffice. See [56, p.37], combined with [56, Propositions 2.12, 2.18, and 2.19] and with the observation that the exponential Lie map of the unitary group is surjective.

7

Tensor products are omitted here.

8

This amounts to assuming that ρ can be brought to its so-called Williamson form (see (63) of Sect.  6) by a passive symplectic unitary only.

9

An analytical formula for this quantity is currently not known. We report the best lower [39, 45] and upper [4044] bounds known to date in (105)–(106) and (107)–(109), respectively. These results can be used together with (56) to find bounds on Inline graphic.

10

These are discrete random variables with probability distributions supported on a lattice.

11

We use states |0> and |3> rather than |0> and |1> because the latter choice does not lead to a centred state.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Simon Becker, Email: simon.becker@damtp.cam.ac.uk.

Nilanjana Datta, Email: n.datta@damtp.cam.ac.uk.

Ludovico Lami, Email: ludovico.lami@gmail.com.

Cambyse Rouzé, Email: rouzecambyse@gmail.com.

References

  • 1.Feller, W.: An Introduction to Probability Theory and Its Applications, vol. II, 2nd edn. Wiley, New York (1971)
  • 2.Cushen CD, Hudson RL. A quantum-mechanical Central Limit theorem. J. Appl. Probab. 1971;8(3):454–469. [Google Scholar]
  • 3.Hepp K, Lieb EH. Phase-transitions in reservoir-driven open systems with applications to lasers and superconductors. Helv. Phys. Acta. 1974;46(5):573–603. [Google Scholar]
  • 4.Hepp K, Lieb EH. On the superradiant phase transition for molecules in a quantized radiation field: the Dicke Maser model. Ann. Phys. 1973;76(2):360–404. [Google Scholar]
  • 5.Giri N, von Waldenfels W. An algebraic version of the Central Limit theorem. Z. Wahrscheinlichkeitstheorie verw. Gebiete. 1978;42(2):129–134. [Google Scholar]
  • 6.Goderis D, Vets P. Central limit theorem for mixing quantum systems and the CCR-algebra of fluctuations. Commun. Math. Phys. 1989;122(2):249–265. [Google Scholar]
  • 7.Goderis D, Verbeure A, Vets P. About the mathematical theory of quantum fluctuations. Leuven, Belgium: Leuven University Press; 1989. [Google Scholar]
  • 8.Matsui T. Bosonic Central Limit Theorem for the one-dimensional XY model. Rev. Math. Phys. 2002;14(07n08):675–700. [Google Scholar]
  • 9.Cramer M, Eisert J. A quantum Central Limit Theorem for non-equilibrium systems: exact local relaxation of correlated states. New J. Phys. 2010;12(5):055020. [Google Scholar]
  • 10.Goderis D, Verbeure A, Vets P. About the exactness of the linear response theory. Commun. Math. Phys. 1991;136(2):265–283. [Google Scholar]
  • 11.Jakšić V, Pautrat Y, Pillet C-A. Central Limit Theorem for locally interacting Fermi gas. Commun. Math. Phys. 2009;285(1):175–217. [Google Scholar]
  • 12.Arous GB, Kirkpatrick K, Schlein B. A central limit theorem in many-body quantum dynamics. Commun. Math. Phys. 2013;321(2):371–417. [Google Scholar]
  • 13.Brandão, F.G.S.L., Cramer, M.: Equivalence of statistical mechanical ensembles for non-critical quantum systems (2015)
  • 14.Brandão FGSL, Gour G. Reversible framework for quantum resource theories. Phys. Rev. Lett. 2015;115:070503. doi: 10.1103/PhysRevLett.115.070503. [DOI] [PubMed] [Google Scholar]
  • 15.Dereziński J. Boson free fields as a limit of fields of a more general type. Rep. Math. Phys. 1985;21(3):405–417. [Google Scholar]
  • 16.Streater RF. Entropy and the Central Limit Theorem in quantum mechanics. J. Phys. A. 1987;20(13):4321–4330. [Google Scholar]
  • 17.Michoel T, Nachtergaele B. Central Limit Theorems for the large-spin asymptotics of quantum spins. Probab. Theory Relat. Fields. 2004;130(4):493–517. [Google Scholar]
  • 18.Goderis D, Verbeure A, Vets P. Non-commutative central limits. Probab. Theory Related Fields. 1989;82(4):527–544. [Google Scholar]
  • 19.Jakšić V, Pautrat Y, Pillet C-A. A quantum central limit theorem for sums of independent identically distributed random variables. J. Math. Phys. 2010;51(1):015208. [Google Scholar]
  • 20.Voiculescu DV, Dykema KJ, Nica A. Free Random Variables, CRM Monograph Series. Providence: American Mathematical Society; 1992. [Google Scholar]
  • 21.Accardi L, Lu YG. Quantum central limit theorems for weakly dependent maps. II. Acta Math. Hungar. 1994;63(3):249–282. [Google Scholar]
  • 22.Hayashi M. Quantum Information: An Introduction. Berlin Heidelberg: Springer; 2006. [Google Scholar]
  • 23.Hayashi M. Quantum estimation and the Quantum Central Limit theorem. Am. Math. Soci. Transl. Ser. 2009;2(227):95–123. [Google Scholar]
  • 24.Campbell ET, Genoni MG, Eisert J. Continuous-variable entanglement distillation and noncommutative central limit theorems. Phys. Rev. A. 2013;87:042330. [Google Scholar]
  • 25.Lenczewski R. Quantum Central Limit Theorems. Boston: Springer; 1995. pp. 299–314. [Google Scholar]
  • 26.Dimi A, Daki B. On the central limit theorem for unsharp quantum random variables. New J. Phys. 2018;20:063051. [Google Scholar]
  • 27.Davies EB. Quantum stochastic processes. Commun. Math. Phys. 1969;15(4):277–304. [Google Scholar]
  • 28.Lami L, Sabapathy KK, Winter A. All phase-space linear bosonic channels are approximately Gaussian dilatable. New J. Phys. 2018;20(11):113012. [Google Scholar]
  • 29.Carolan J, Harrold C, Sparrow C, Martín-López E, Russell NJ, Silverstone JW, Shadbolt PJ, Matsuda N, Oguma M, Itoh M, Marshall GD, Thompson MG, Matthews JCF, Hashimoto T, O’Brien JL, Laing A. Universal linear optics. Science. 2015;349(6249):711–716. doi: 10.1126/science.aab3642. [DOI] [PubMed] [Google Scholar]
  • 30.Rohde PP, Dowling JP. The on-ramp to the all-optical quantum information processing highway. Science. 2015;349(6249):696. doi: 10.1126/science.aac7250. [DOI] [PubMed] [Google Scholar]
  • 31.Holevo AS, Shirokov ME. Continuous ensembles and the capacity of infinite-dimensional quantum channels. Theory Probab. Its Appl. 2006;50(1):86–98. [Google Scholar]
  • 32.Wilde MM, Qi H. Energy-constrained private and quantum capacities of quantum channels. IEEE Trans. Inf. Theory. 2018;64(12):7802–7827. [Google Scholar]
  • 33.Winter A. Tight uniform continuity bounds for quantum entropies: conditional entropy, relative entropy distance and energy constraints. Commun. Math. Phys. 2016;347(1):291–313. [Google Scholar]
  • 34.Winter, A.: Energy-constrained diamond norm with applications to the uniform continuity of continuous variable channel capacities (2017). arXiv:1712.10267
  • 35.Shirokov ME. Tight uniform continuity bounds for the quantum conditional mutual information, for the Holevo quantity, and for capacities of quantum channels. J. Math. Phys. 2017;58(10):102202. [Google Scholar]
  • 36.Shirokov ME. On the energy-constrained diamond norm and its application in quantum information theory. Prob. Inform. Transm. 2018;54(1):20–33. [Google Scholar]
  • 37.Giovannetti V, García-Patrón R, Cerf NJ, Holevo AS. Ultimate classical communication rates of quantum optical channels. Nat. Photonics. 2014;8(10):796–800. [Google Scholar]
  • 38.Giovannetti V, Holevo AS, García-Patrón R. A solution of Gaussian optimizer conjecture for quantum channels. Commun. Math. Phys. 2015;334(3):1553–1571. [Google Scholar]
  • 39.Holevo AS, Werner RF. Evaluating capacities of bosonic Gaussian channels. Phys. Rev. A. 2001;63:032312. [Google Scholar]
  • 40.Pirandola S, Laurenza R, Ottaviani C, Banchi L. Fundamental limits of repeaterless quantum communications. Nat. Commun. 2017;8(1):15043. doi: 10.1038/ncomms15043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Wilde MM, Tomamichel M, Berta M. Converse bounds for private communication over quantum channels. IEEE Trans. Inf. Theory. 2017;63(3):1792–1817. [Google Scholar]
  • 42.Rosati M, Mari A, Giovannetti V. Narrow bounds for the quantum capacity of thermal attenuators. Nat. Commun. 2018;9(1):4339. doi: 10.1038/s41467-018-06848-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Sharma K, Wilde MM, Adhikari S, Takeoka M. Bounding the energy-constrained quantum and private capacities of phase-insensitive bosonic Gaussian channels. New J. Phys. 2018;20(6):063025. [Google Scholar]
  • 44.Noh K, Albert VV, Jiang L. Quantum capacity bounds of Gaussian thermal loss channels and achievable rates with Gottesman–Kitaev–Preskill codes. IEEE Trans. Inf. Theory. 2019;65(4):2563–2582. [Google Scholar]
  • 45.Noh K, Pirandola S, Jiang L. Enhanced energy-constrained quantum communication over bosonic Gaussian channels. Nat. Commun. 2020;11(1):457. doi: 10.1038/s41467-020-14329-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Schmuedgen K. Unbounded Self-adjoint Operators on Hilbert Space. Dordrecht: Springer; 2012. [Google Scholar]
  • 47.Umegaki H. Conditional expectation in an operator algebra. IV. Entropy and information. Kodai Math. Sem. Rep. 1962;14(2):59–85. [Google Scholar]
  • 48.Lindblad G. Entropy, information and quantum measurements. Commun. Math. Phys. 1973;33(4):305–322. [Google Scholar]
  • 49.Holevo, A.S.: On capacity of a quantum communications channel. Probl. Pered. Inform. 15(4):3–11 (1979). (English translation: Probl. Inf. Transm. 15(4):247–253)
  • 50.Hausladen P, Jozsa R, Schumacher B, Westmoreland M, Wootters WK. Classical information capacity of a quantum channel. Phys. Rev. A. 1996;54:1869–1876. doi: 10.1103/physreva.54.1869. [DOI] [PubMed] [Google Scholar]
  • 51.Schumacher B, Westmoreland MD. Sending classical information via noisy quantum channels. Phys. Rev. A. 1997;56:131–138. [Google Scholar]
  • 52.Holevo AS. The capacity of the quantum channel with general signal states. IEEE Trans. Inf. Theory. 1998;44(1):269–273. [Google Scholar]
  • 53.Lloyd S. Capacity of the noisy quantum channel. Phys. Rev. A. 1997;55:1613–1622. [Google Scholar]
  • 54.Shor, P.: Lecture Notes. MSRI Workshop on Quantum Computation (2002)
  • 55.Devetak I. The private classical capacity and quantum capacity of a quantum channel. IEEE Trans. Inf. Theory. 2005;51(1):44–55. [Google Scholar]
  • 56.de Gosson MA. Symplectic Geometry and Quantum Mechanics. Operator Theory: Advances and Applications. Basel: Birkhäuser; 2006. [Google Scholar]
  • 57.Serafini A. Quantum Continuous Variables: A Primer of Theoretical Methods. Taylor & Francis Group: CRC Press; 2017. [Google Scholar]
  • 58.Holevo, A.S.: Probabilistic and Statistical Aspects of Quantum Theory. Publications of the Scuola Normale Superiore, Scuola Normale Superiore (2011)
  • 59.Barnett S, Radmore PM. Methods in Theoretical Quantum Optics Oxford Series in Optical and Imaging Sciences. Oxford: Clarendon Press; 2002. [Google Scholar]
  • 60.Zygmund A. A remark on characteristic functions. Ann. Math. Stat. 1947;18(2):272–276. [Google Scholar]
  • 61.Ushakov NG. Selected Topics in Characteristic Functions. Modern Probability and Statistics. Berlin: de Gruyter; 2011. [Google Scholar]
  • 62.König R, Smith G. The entropy power inequality for quantum systems. IEEE Trans. Inf. Theory. 2014;60(3):1536–1548. [Google Scholar]
  • 63.Bergh J, Löfström J. Interpolation Spaces: An Introduction. Berlin: Springer; 2012. [Google Scholar]
  • 64.Sabapathy KK, Winter A. Non-Gaussian operations on bosonic modes of light: photon-added Gaussian channels. Phys. Rev. A. 2017;95:062309. [Google Scholar]
  • 65.O’Brien JL, Furusawa A, Vučković J. Photonic quantum technologies. Nat. Photonics. 2009;3(12):687–695. [Google Scholar]
  • 66.Politi A, Matthews JCF, Thompson MG, O’Brien JL. Integrated quantum photonics. IEEE J. Sel. Top. Quantum Electron. 2009;15(6):1673–1684. [Google Scholar]
  • 67.Folland GB. Harmonic Analysis in Phase Space. Princeton: Princeton University Press; 1989. [Google Scholar]
  • 68.Jagannathan R, Simon R, Sudarshan ECG, Vasudevan R. Dynamical maps and nonnegative phase-space distribution functions in quantum mechanics. Phys. Lett. A. 1987;120(4):161–164. [Google Scholar]
  • 69.Durrett R. Probability: Theory and Examples. Cambridge Series in Statistical and Probabilistic Mathematics. 5. Cambridge: Cambridge University Press; 2019. [Google Scholar]
  • 70.Kennard EH. Zur Quantenmechanik einfacher Bewegungstypen. Z. Phys. 1927;44(4):326–352. [Google Scholar]
  • 71.Stoler D. Equivalence classes of minimum uncertainty packets. Phys. Rev. D. 1970;1:3217–3219. [Google Scholar]
  • 72.Yuen HP. Two-photon coherent states of the radiation field. Phys. Rev. A. 1976;13:2226–2243. [Google Scholar]
  • 73.Winter A. Coding theorem and strong converse for quantum channels. IEEE Trans. Inf. Theory. 1999;45(7):2481–2485. [Google Scholar]
  • 74.Williamson J. On the algebraic problem concerning the normal forms of linear dynamical systems. Am. J. Math. 1936;58(1):141–163. [Google Scholar]
  • 75.Abramowitz M, Stegun IA. Handbook of Mathematical Functions: With Formulas, Graphs, and Mathematical Tables. Applied mathematics series. Mineola: Dover Publications; 1965. [Google Scholar]
  • 76.Schrödinger E. Der stetige Übergang von der Mikro- zur Makromechanik. Naturwissenschaften. 1926;14(28):664–666. [Google Scholar]
  • 77.Klauder JR. The action option and a Feynman quantization of spinor fields in terms of ordinary c-numbers. Ann. Phys. (N.Y) 1960;11(2):123–168. [Google Scholar]
  • 78.Glauber RJ. Coherent and incoherent states of the radiation field. Phys. Rev. 1963;131:2766–2788. [Google Scholar]
  • 79.Sudarshan ECG. Equivalence of semiclassical and quantum mechanical descriptions of statistical light beams. Phys. Rev. Lett. 1963;10:277–279. [Google Scholar]
  • 80.Lindeberg JW. Eine neue Herleitung des Exponentialgesetzes in der Wahrscheinlichkeitsrechnung. Math. Z. 1922;15(1):211–225. [Google Scholar]
  • 81.Bryc W. A remark on the connection between the large deviation principle and the central limit theorem. Statist. Probab. Lett. 1993;18(4):253–256. [Google Scholar]
  • 82.Bhattacharya RN, Rao RR. Normal approximation and asymptotic expansions. Philadelphia: SIAM; 1986. [Google Scholar]
  • 83.Lami L, Khatri S, Adesso G, Wilde MM. Extendibility of bosonic Gaussian states. Phys. Rev. Lett. 2019;123:050501. doi: 10.1103/PhysRevLett.123.050501. [DOI] [PubMed] [Google Scholar]
  • 84.Devetak I, Shor PW. The capacity of a quantum channel for simultaneous transmission of classical and quantum information. Commun. Math. Phys. 2005;256(2):287–303. [Google Scholar]
  • 85.Caruso F, Giovannetti V, Holevo AS. One-mode bosonic Gaussian channels: a full weak-degradability classification. New J. Phys. 2006;8(12):310. [Google Scholar]
  • 86.Wolf MM, Pérez-García D, Giedke G. Quantum capacities of bosonic channels. Phys. Rev. Lett. 2007;98:130501. doi: 10.1103/PhysRevLett.98.130501. [DOI] [PubMed] [Google Scholar]
  • 87.Geršgorin S. Über die Abgrenzung der Eigenwerte einer Matrix. Izv. Akad. Nauk. S.S.S.R. 1931;7:749–754. [Google Scholar]
  • 88.Varga RS. Geršgorin and his circles. Berlin: Springer; 2010. [Google Scholar]
  • 89.Bourin J-C, Lee E-Y. Unitary orbits of Hermitian operators with convex or concave functions. Bull. Lond. Math. Soc. 2012;44(6):1085–1102. [Google Scholar]

Articles from Communications in Mathematical Physics are provided here courtesy of Springer

RESOURCES