Skip to main content
Proceedings. Mathematical, Physical, and Engineering Sciences logoLink to Proceedings. Mathematical, Physical, and Engineering Sciences
. 2017 Mar 15;473(2199):20160721. doi: 10.1098/rspa.2016.0721

Device-independent tests of quantum channels

Michele Dall’Arno 1,, Sarah Brandsen 1, Francesco Buscemi 2
PMCID: PMC5378235  PMID: 28413337

Abstract

We develop a device-independent framework for testing quantum channels. That is, we falsify a hypothesis about a quantum channel based only on an observed set of input–output correlations. Formally, the problem consists of characterizing the set of input–output correlations compatible with any arbitrary given quantum channel. For binary (i.e. two input symbols, two output symbols) correlations, we show that extremal correlations are always achieved by orthogonal encodings and measurements, irrespective of whether or not the channel preserves commutativity. We further provide a full, closed-form characterization of the sets of binary correlations in the case of: (i) any dihedrally covariant qubit channel (such as any Pauli and amplitude-damping channels) and (ii) any universally-covariant commutativity-preserving channel in an arbitrary dimension (such as any erasure, depolarizing, universal cloning and universal transposition channels).

Keywords: device independent quantum information theory, device independent tests, quantum channel, time-like correlations

1. Introduction

Any physical experiment is based upon the observation of correlations among events at various points in space and time, along with some assumptions about the underlying physics. Naturally, in order to be operational any such assumption must have been tested as a hypothesis in a previous experiment. Ultimately, to break an otherwise circular argument, experiments involving no further assumptions are required—that is, device-independent tests.

Formally, a hypothesis consists of a circuit,1 which is usually assumed to have a global causal structure (following special relativity), and its components, which are usually assumed to be governed by classical or quantum theories and thus representable by channels.

Denoting a hypothesis (circuit) by X, the set of correlations compatible with X is denoted by S(X). Then, hypothesis X is falsified, along with any other hypothesis Y such that S(Y)S(X), as soon as the observed correlation does not belong to S(X) (This inclusion relation induces an ordering among channels which is reminiscent of that introduced by Shannon [5] among classical channels). Therefore, from the theoretical viewpoint, the problem of falsifying a hypothesis X can be recast2 as that of characterizing the set S(X) of compatible correlations.

Since (discrete, memoryless) classical channels are by definition input–output correlations (conditional probabilities), the characterization of S(X) is trivial in classical theory as it is a polytope easily related to the correlation defining the channel. By contrast, the problem is far from trivial in quantum theory: due to the existence of superpositions of states and effects, the set S(X) can be strictly convex.

In this work, we address the problem of device-independent tests of quantum channels, in particular, the characterization of the set Smn(X) of m-inputs/n-outputs correlations pj|i obtainable through an arbitrary given channel X, upon the input of an arbitrary preparation {ρi}i=0m1 and the measurement of an arbitrary positive-operator valued measure (POVM) {πj}j=0n1, that is

1. 1.1

The analogous problems of device-independent tests of quantum states and measurements have been recently addressed in [12,13], respectively.

An alternative formulation for the problem considered here can be given in terms of a ‘game’ involving two parties: an experimenter, claiming to be able to prepare quantum states, feed them through some quantum channel X, and then perform measurements on the output, and a skeptical theoretician, willing to trust observed correlations only. If the experimenter produces some correlations lying outsides of Smn(X), then the theoretician must conclude that the actual channel X is not worse than X at producing correlations, but this is not sufficient to support the experimenter’s claim. Indeed, in order to convince the theoretician, the experimenter must produce the entire set Smn(X): in fact, it is sufficient to produce a set of correlations whose convex hull contains Smn(X). Then, the theoretician must conclude that whatever channel the experimenter actually has is at least as good as X at producing correlations, and the experimenter’s claim is accepted.

It is hence clear that the problem of device-independent tests of quantum channels induces a preordering relation among quantum channels: XY if and only if Smn(X)Smn(Y). (The order also depends upon m and n, but for compactness we drop the indexes whenever they are clear from the context). In order to characterize such preorder, for any given channel X, we need to (i) provide the experimenter with all the states and measurements generating the extremal correlations of Smn(X) and (ii) provide the theoretician with a full closed-form characterization of the set Smn(X) of compatible correlations.

As a preliminary result, we find that the sets Smn(X) coincide for any d-dimensional unitary and dephasing channels, for any d, m and n (this is an immediate consequence of a remarkable result by Frenkel & Weiner [14]). Upon considering only the binary case m=n=2, our first result is to show that any correlation on the boundary of S22(X) is achieved by a pair of commuting pure states—irrespective of whether X is a commutativity-preserving channel. Then, we derive the complete closed-form characterization of S22(X) for: (i) any given dihedrally covariant qubit channel, including any Pauli and amplitude-damping channels; and (ii) any given universally covariant commutativity-preserving channel, including any erasure, depolarizing, universal 1→2 cloning [15], and universal transposition [16] channels.

Upon specifying X as the d-dimensional identity channel Id, one recovers device-independent dimension tests analogous to those discussed in [1720], in which case the aforementioned ordering induced by the inclusion Smn(Id0)Smn(Id1)d0d1 is of course total. However, the completeness of our characterization of S22(X) implies that our framework detects all correlations incompatible with the given hypothesis, unlike the studies [1721] where the set of correlations is tested only along an arbitrarily chosen direction.

Let us provide a preview of some consequences of our results:

  • — any Pauli channel Pλ:ρλ0ρ+k=13λkσkρσk is compatible with p if and only if
    |p1|1p1|2|1|p1|1p2|2|maxk[1,3]|2(λ0+λk)1|;
  • — any amplitude-damping channel Aλ:ρA0ρA0+A1ρA1 with A0=|00|+λ|11| and A1=1λ|01| is compatible with p if and only if
    (p1|2p2|1p1|1p2|2)2λ;
  • — any d-dimensional erasure channel Ed:ρλρ(1λ)Tr[ρ]ϕ for some pure state ϕ is compatible with p if and only if
    |p1|1p1|2|λ;
  • — any d-dimensional depolarizing channel Ddλ:ρλρ+(1λ)Tr[ρ]𝟙/d is compatible with p if and only if
    |p1|1p1|2|λ
    and
    |p1|1p1|2|1|p1|1p2|2|dλ22λ+dλ;
  • — the d-dimensional universal optimal 1→2 cloning [15] channel Cd is compatible with p if and only if
    |p1|1p1|2|dd+1;
  • — any d-dimensional universal optimal transposition [16] channel Td is compatible with p if and only if
    |p1|1p1|2|1d+1
    and
    |p1|1p1|2|1|p1|1p2|2|13.

This paper is structured as follows. We will introduce our framework and discuss the case of unitary and trace class channels in §2. For the binary case, introduced in §3, we will solve the problem for any qubit dihedrally-covariant channel in §4, and for any arbitrary-dimensional universally covariant commutativity-preserving channel in §5. In §6, we will provide a natural geometrical interpretation of our results and in §7, we will summarize our results and present further outlooks.

2. General results

We will make use of standard definitions and results in quantum information theory [22]. Since Smn(X) is convex for any n and m, the hyperplane separation theorem [23,24] states that pSmn(X) if and only if there exists an m×n real matrix w such that

pTwW(X,w)>0, 2.1

where pTw:=i,jpj|iwi,j and

W(X,w):=maxqSmn(X)wTq. 2.2

We call w a channel witness and W(X,w) its threshold value for channel X.

Although equation (2.1) generally allows one to detect some conditional probability distributions p not belonging to Smn(X) for any arbitrarily fixed witness w, here our aim is to detect any such p. Direct application of equation (2.1) is impractical, as one would need to consider all of the infinitely many witnesses w. Note, however, that equation (2.1) can be rewritten through negation by stating that pSmn(X) if and only if for any m×n witness w one has

pTwW(X,w)0.

We then have our first preliminary result.

Lemma 2.1 —

A channel X:L(H)L(K) is compatible with conditional probability distribution p if and only if

maxw[pTwW(X,w)]0. 2.3

Let us start by considering an arbitrary d-dimensional unitary channel Ud:ρUρU, for some unitary UL(H) with dimH=d. If dm, the maximization in equation (2.2) is trivial, since the input labels i∈[1,m] can all be encoded on orthogonal states, so that any m×n conditional probability distribution q can in fact be obtained. However, if d<m, the evaluation of the witness threshold W(Ud,w) for any witness w is far from obvious. The solution immediately follows from a recent, remarkable result by Frenkel & Weiner [14]. It turns out that W(Ud,w) is attained on extremal conditional probability distributions q compatible with the exchange of a classical d-level system, namely, those q where qj|i=0 or 1 for any i and j, and such that qj|i≠0 for at most d different values of j. Frenkel and Weiner’s result hence guarantees that the threshold W(Ud,w) can be provided in closed form since, for any m and n, the number of such extremal classical conditional probabilities is finite, i.e. the set Smn(Ud) is a polytope. Any probability p lying outside Smn(Ud) can thus be detected by testing the violation of equation (2.3) for a finite number of witnesses w, corresponding to the faces of the polytope. Moreover, the set Smn(Ud) of distributions compatible with any d-dimensional unitary channel Ud coincides with the set Smn(Fdλ)jj of distributions compatible with any d-dimensional dephasing channel Fdλ:ρλρ+(1λ)kk|ρ|k|kk|.

At the opposite end of the unitary channels, there sit trace-class channels T:ρσ for some arbitrary but fixed state σ. In this case, no information about i (the input label) can be communicated. Of course, the set Smn(T) of correlations achievable through any trace-class channel T does not depend on the particular choice of σ: a trace-class channel simply means that no communication is available. For any trace-class channel T and any witness w, it immediately follows that the threshold W(T,w) is achieved by conditional probabilities q such that qj|i=1 for a single value of j, and therefore is given by W(T,w)=maxjiwi,j. As a consequence, the set Smn(T) is a polytope with n vertices, and any probability p lying outside Smn(T) can be detected by testing the violation of equation (2.3) for a finite number of witnesses w.

3. Binary conditional probability distribution

In the remainder of this work, we will consider the case where p is a binary input–output conditional probability distribution (i.e. m=n=2).

First, we show that it suffices to consider diagonal or anti-diagonal witnesses with positive entries summing up to one. Indeed, for any witness w, the witness w′:=α(w+β), where α>0 and β is such that βi,j is independent of j, leaves equation (2.3) invariant for any conditional probability distribution p and channel X, since wp=α(pTw+iβi,1).

By taking βi,j=minkwi,k for any i and j, the witness w′ is diagonal, anti-diagonal, or has a single non-null column. We first consider the latter case. Clearly, the maximum in equation (2.2) is attained when p is a vertex of the polytope S22(T) of probabilities compatible with any trace-type channel T, and therefore equation (2.3) is always verified. Then we consider the case of diagonal and anti-diagonal witnesses. By taking α1=i|wi,1wi,2| one recovers the normalization condition i,jwi,j=1, thus proving the statement.

Therefore, upon denoting with w±(ω) the diagonal and anti-diagonal witnesses given by

w+(ω):=(1+ω2001ω2)andw(ω):=(01+ω21ω20),

where ω∈[−1,1], one has the following preliminary result.

Lemma 3.1 —

The maximum in equation (2.3) is attained for a diagonal or anti-diagonal witness, namely

maxw(pTwW(X,w))=maxω[1,1](pTw±(ω)W(X,w±(ω))).

Any extremal distribution q in equation (2.2) can be represented by states ρ0 and ρ1 and a POVM {π0,π1} such that qj|i=Tr[X(ρi)πj]. Since w±(ω) is diagonal or anti-diagonal, equation (2.2) represents the maximum probability of success in the discrimination of states {ρ0,ρ1} with prior probabilities given by the non-null entries of w, in the presence of noise X, namely

W(X,w±(ω))=12maxρ0,ρ1{π0,π1}[(1+ω)Tr[X(ρ0)π0]+(1ω)Tr[X(ρ1)π1]].

It is a well-known fact [25] that the solution of the optimization problem over POVMs is given as a function of the Helstrom matrix defined as

Hω(ρ0,ρ1):=1+ω2ρ01ω2ρ1,

as follows

W(X,w±(ω))=12maxρ0,ρ1[1+X(Hω(ρ0,ρ1))1], 3.1

where ∥⋅∥1 denotes the operator 1-norm.

It is easy to see that without loss of generality one can take ρ0 and ρ1 such that [ρ0,ρ1]=0. Indeed, let {|k〉} be a basis of eigenvectors of the Helstrom matrix Hω(ρ0,ρ1). The complete dephasing channel Fd0 on the basis {|k〉} is such that

Hω(ρ0,ρ1)=Fd0(Hω(ρ0,ρ1))=Hω(σ0,σ1),

where σi:=Fd0(ρi) and therefore [σ0,σ1]=0. By applying channel X we have the following identity

X(Hω(ρ0,ρ1))=X(Hω(σ0,σ1)).

Therefore, the encoding {σi} performs as well as the encoding {ρi}, and thus without loss of generality we can take the supremum in equation (3.1) over commuting encodings only.

Moreover, one can see that without loss of generality one can take σi to be orthogonal pure states. Indeed, let σi=kμk|i|kk| be a spectral decomposition of σi. Owing to the convexity of the trace norm, we have

X(Hω(σ0,σ1))1=k,lμk|0μl|1X(Hω(|kk|,|ll|))1k,lμk|0μl|1X(Hω(|kk|,|ll|))1maxk,lX(Hω(|kk|,|ll|))1.

Then we have the following preliminary result.

Lemma 3.2 —

The maximum in equation (2.2) is given by an orthonormal pure encoding, namely

W(X,w±(ω)):=max|ϕ0,|ϕ1ϕ1|ϕ0=012[1+X(Hω(ϕ0,ϕ1))1]

and by an orthogonal POVM such that π0 is the projector on the positive part of Hω(ϕ0,ϕ1) and π1=𝟙π0.

Here, for any pure state |ϕ〉 we denote with ϕ:=|ϕ〉〈ϕ| the corresponding projector.

4. Dihedrally covariant qubit channel

Let us start with the case where X:L(H)L(K) is a qubit channel, i.e. dimH=dimK=2. Since Pauli matrices span the space of qubit Hermitian operators, any qubit state ρ can be parametrized in terms of Pauli matrices, i.e.

ρ=12(𝟙+σTx),|x|21, 4.1

where σ=(σx,σy,σz)T and x are the vectors of Pauli matrices and their real coefficients, respectively. Analogously, any qubit channel X can be parametrized in terms of Pauli matrices, i.e.

X(ρ)=12(𝟙+σT(Ax+b)),

where Ai,j=12Tr[σiX(σj)] and bi=12Tr[σiX(𝟙)].

With such a parametrization X(Hω(ϕ0,ϕ1)) assumes a very simple form given by

X(Hω(ϕ0,ϕ1))=12[ω𝟙+(Ax+ωb)Tσ],

whose eigenvalues are 12(ω±|Ax+ωb|2). Thus, the witness threshold W(X,w±(ω)) in equation (2.2) can be readily computed by means of lemma 3.2 as

W(X,w±(ω))=12[1+max(|ω|,maxx|x|21|Ax+ωb|2)].

Note that this expression is the maximum between two strategies. The first one is given by the trivial POVM and thus corresponds to trivial guessing. The second one can be further simplified by means of the following substitutions. Let A=VDU be a polar decomposition of matrix A with U and V unitaries and D diagonal and positive-semidefinite with eigenvalues d (accordingly c:=−V b). By unitary invariance of the 2-norm one has

maxx|x|21|Ax+ωb|2=maxx|x|21|Dxωc|2.

By defining y:=Dx one has

maxx|x|21|Dxωc|2=maxy,z|D1y+(𝟙D1D)z|21|yωc|2,

where (⋅)−1 denotes the Moore–Penrose pseudoinverse. By explicit computation it follows that [D1]T(𝟙D1D)=0, and therefore vectors D−1y and (𝟙D1D)z are orthogonal. Then for any optimal (y,z) one has that (y,0) is also optimal, since |D1y+(𝟙D1D)z|2|D1y|2. Therefore, we have

W(X,w±(ω))=12[1+max(ω,Δ(ω))], 4.2

where

Δ(ω):=maxy|D1y|21|yωc|2. 4.3

The maximum in equation (4.3) is a quadratically constrained quadratic optimization problem, which is known to be NP-hard in general. However, Δ(ω) has a simple geometrical interpretation: it is the maximum Euclidean distance of vector ωc and ellipsoid |D−1y|2≤1. This interpretation suggests symmetries under which the optimization problem becomes feasible. In particular, we take vector c to be parallel to one of the axes of the ellipsoid |D−1y|2≤1, namely c1=c2=0 (up to irrelevant permutations of the computational basis).

This configuration corresponds to a D2-covariant channel X, where D2 is the dihedral group of the symmetries of a line segment, consisting of two reflections and a π-rotation. This configuration is depicted in figure 1. In particular, a qubit channel X is D2-covariant if and only if there exist unitary representations UkR3×3 and VkR3×3 of D2 such that

AUkx+b=Vk(Ax+b). 4.4

Up to unitaries, the most general unitary representation of D2 in R3×3 is given by

W1=σz1,W2=σz1andW3=𝟙1,

where W1 and W2 are reflections and W3 is a π-rotation. We take Uk:=UWkU and V k:=V WkV . Then by explicit computation we have

AUkx+b=VkAx+b,

where we used the fact that [D,Wk]=0 for any k. Therefore, D2 covariance expressed by equation (4.4) is equivalent to the requirement Wkc=c, namely c1=c2=0.

Figure 1.

Figure 1.

Bloch-sphere representation of: (a,b) dihedrally covariant channels X mapping the sphere into an ellipsoid (a) centred in the Bloch sphere (e.g. any Pauli channel Pλ), or (b) translated by a vector c which is parallel to one of the axis of the ellipsoid (e.g. any amplitude damping channel Aλ); (c) non-dihedrally covariant channel X, as the ellipsoid is translated by a vector c which is not parallel to any of the axis of the ellipsoid. (Online version in colour.)

Under the assumption of D2-covariance, we take without loss of generality d2d1 and c3≥0. If also c3=0, we further take without loss of generality d3d2. Then, as formally proved in the appendix, the maximum Euclidean distance Δ(ω) in equation (4.3) can be explicitly computed, leading to the following result.

Lemma 4.1 —

The witness threshold W(X,w±(ω)) of any qubit D2-covariant channel X is given by equation (4.2) where

Δ(ω)={d21+c32ω2d22d32,if |ω|<d22d32d3c3,d3+c3|ω|, otherwise.

The optimal encoding is given by equation (4.1) with x=D−1y and

y={(0,±d21c32d32ω2(d32d22)2,c3d32ωd32d22)Tif |ω|d22d32d3c3(0,0,±d3)T otherwise.

Using lemmas 4.1 and 2.1, equation (2.3) becomes the maximum over ω of the minimum of two functions. The maximum is attained either in the maxima 0, ±ω1, or ±1 of the two functions over the domain [−1,1], where

ω1:=(d22d32)(p1|1p2|2)c3c32d22(d22d32)(p1|1p2|2)2,

(the limit should be considered if c3=0), or in their intersection ±ω2 given by

ω2:={d22(d22d32)d22d32d22c32,if (d22d32)>d22c3,d31c3, otherwise.

We can then state our first main result, formally proved in the appendix, namely a complete and closed-form characterization of the set S22(X) of conditional probability distributions compatible with any qubit D2-covariant channel X.

Theorem 4.2 —

Any given binary conditional probability distribution p is compatible with any given qubit D2-covariant channel X if and only if

maxωΩ(pTw±(ω)W(X,w±(ω)))0, 4.5

where Ω:={0,±ω1ω2,±1}∩[−1,1].

As applications of theorem 4.2, let us explicitly characterize the sets of binary conditional probability distributions compatible with two relevant examples of qubit D2-covariant channels: the Pauli and amplitude-damping channels.

Any Pauli channel can be written as Pλ:ρλ0ρ+k=13λkσkρσk, where σ=(σx,σy,σz) are the Pauli matrices. One has that c3=0 and d3=maxk[1,3]|2(λ0+λk)1|d2, thus ω1= and ω2=d3 and the maximum in equation (4.5) is attained for ωω2. Thus, upon applying theorem 4.2, one has the following result.

Corollary 4.3 —

Any given binary conditional probability distribution p is compatible with the Pauli channel Pλ if and only if

|p1|1p1|2|1|p1|1p2|2|maxk[1,3]|2(λ0+λk)1|.

Any amplitude-damping channel can be written as Aλ(ρ)=k=01AkρAk, where A0=|00|+λ|11| and A1=1λ|01|. As shown in the appendix, one has that c3=1−λ and d3=λ, d2=d1=λ, and thus the maximum in equation (4.5) is attained for ωω1 or ω=±1. Thus, upon applying theorem 4.2, one has the following result, formally proved in the appendix.

Corollary 4.4 —

Any given binary conditional probability distribution p is compatible with the amplitude-damping channel Aλ if and only if

(p1|2p2|1p1|1p2|2)2λ.

5. Universally-covariant commutativity-preserving channels

Let us now move to the arbitrary dimensional case. We trade generality regarding the dimension for generality regarding the symmetry of the channel, and assume universal covariance. A channel X:L(H)L(K) is universally covariant if and only if there exist unitary representations UgL(H) and VgL(K) of the special unitary group SU(d) with d:=dimH, such that for every state ρL(H) one has

X(UgρUg)=VgX(ρ)Vg. 5.1

From universal covariance it immediately follows that any orthonormal pure encoding attains the witness threshold W(X,w±(ω)) in equation (3.1). Indeed, for any orthonormal pure states {ϕi} let U be the unitary such that ϕi=U|i〉〈i|U. Then one has

X(Hω(ϕ0,ϕ1))1=X(Hω(U|00|U,U|11|U))1=VX(Hω(|00|,|11|))V1=X(Hω(|00|,|11|))1,

where the second equality follows from equation (5.1), and the third from the invariance of trace distance under unitary transformations. Then we have the following result.

Lemma 5.1 —

The witness threshold W(X,w±(ω)) of any universally covariant channel X is given by

W(X,w±(ω))=12[1+X(Hω(|00|,|11|))1]. 5.2

The optimal encoding is given by any pair of orthonormal pure states.

Equation (5.2) has a simple dependence on w in the case when channel X is commutativity preserving, i.e. [X(ρ0),X(ρ1)]=0 whenever [ρ0,ρ1]=0. Notice that it suffices to check commutativity preservation for pure states, indeed a channel X is commutativity preserving if and only if [X(ϕ0),X(ϕ1)]=0 whenever 〈ϕ1|ϕ0〉=0. Necessity is trivial, and sufficiency follows by assuming [ρ0,ρ1]=0, and considering a simultaneous spectral decompositions of ρ0=kμkϕk and ρ1:=jνjϕj. Then one has

[X(ρ0),X(ρ1)]=k,lμkνl[X(ϕk),X(ϕl)]=0,

where the last inequality follows from the fact that 〈ϕl|ϕk〉=δk,l. For a universally covariant channel X, it immediately follows from equation (5.1) that it suffices to check commutativity preservation for an arbitrary pair of orthogonal pure states.

In this case X(|00|) and X(|11|) admit a common basis of eigenvectors {|k〉}, and thus a spectral decomposition of the Helstrom matrix X(Hω(|00|,|11|)) is given by

X(Hω(|00|,|11|))=k(αkω+βk)|kk|,

where αk and βk are the half-sum and half-difference of the kth eigenvectors of X(|00|) and X(|11|), respectively. Therefore equation (5.2) becomes

W(X,w±(ω))=12(1+k|αkω+βk|).

Then, the optimization problem in equation (2.3) becomes piece-wise linear, thus the maximum is attained on the intersections of the piece-wise components given by γk:=βk/αk when such values belongs to the domain [−1,1], or on its extrema. We can then provide our second main result, namely a complete closed-form characterization of the set S22(X) of conditional probability distributions compatible with any arbitrary-dimensional universally-covariant commutativity-preserving channel X.

Theorem 5.2 —

Any given binary conditional probability distribution p is compatible with any given arbitrary-dimensional universally-covariant commutativity-preserving channel X if and only if

|p1|1p1|2|k|βk|,|p1|1p1|2|X(Hγk(|00|,|11|))1γk|p1|1p2|2|,

for any k such that γk∈[−1,1].

As applications of theorem 5.2, let us explicitly compute the binary conditional probability distributions compatible with any erasure, depolarizing, universal optimal 1→2 cloning and universal optimal transposition channels. As discussed before, commutativity preservation can be immediately verified for all of these channels by checking that [X(|00|),X(|11|)]=0.

Any erasure channel can be written as Edλ:ρλρ(1λ)ϕ, where ϕ is some pure state. One can compute that α=(λ/2,λ/2,0×d−2,1−λ) and β=(λ/2,−λ/2,0×d−1), thus upon applying theorem 5.2 one has the following corollary.

Corollary 5.3 —

Any given binary conditional probability distribution p is compatible with the erasure channel Edλ if and only if

|p1|1p1|2|λ.

Any depolarizing channel can be written as Ddλ:ρλρ+(1λ)(𝟙/d). One can compute that α=(λ/2+(1−λ)/d×2,(1−λ)/d×d−2) and β=(−λ/2,λ/2,0×d−2), thus upon applying theorem 5.2 one has the following corollary.

Corollary 5.4 —

Any given binary conditional probability distribution p is compatible with the depolarizing channel Ddλ if and only if

|p1|1p1|2|λ

and

|p1|1p1|2|1|p1|1p2|2|dλ22λ+dλ.

The universal optimal 1→2 cloning channel can be written as Cdλ:ρ(2/(d+1))PS(ρ𝟙)PS. By explicit computation one has

Cd(|ii|)=12(d+1)k(|k,i+|i,k)(k,i|+i,k|),

and therefore [Cd(|00|),Cd(|11|)]=0, thus the universal optimal 1→2 cloning Cd is a commutativity preserving channel. One can compute that α=(1/(d+1)×3,1/2(d+1)×2(d−2)) and β=(−1/(d+1),1/(d+1),0,−1/2(d+1)×d−2,1/2(d+1)×d−2), thus upon applying theorem 5.2 one has the following corollary.

Corollary 5.5 —

Any given binary conditional probability distribution p is compatible with the universal optimal 1→2 cloning channel Cd if and only if

|p1|1p1|2|dd+1.

The universal transposition channel can be written as Td:ρ(1/(d+1))(ρT+𝟙). One can compute that α=(3/2(d+1)×2,1/(d+1)×d−2) and β=(1/2(d+1),−1/2(d+1),0×d−2), thus upon applying theorem 5.2 one has the following corollary.

Corollary 5.6 —

Any given binary conditional probability distribution p is compatible with the universal transposition channel Td if and only if

|p1|1p1|2|1d+1

and

|p1|1p1|2|1|p1|1p2|2|13.

The results of corollaries 4.3, 4.4, 5.35.6 are summarized in table 1.

Table 1.

Complete closed-form characterization of the set S22(X) of binary conditional probability distributions compatible with channel X, for X given by the Pauli channel Pλ, the amplitude damping channel Aλ, the erasure channel Edλ, the depolarizing channel Ddλ, the universal 1→2 cloning channel Cd and the universal transposer Td, as given by corollaries 4.3, 4.4, 5.35.6, respectively.

graphic file with name rspa20160721-i1.jpg

6. Cartesian representation

In this section, we provide a geometrical interpretation of our results. Binary conditional probability distributions are represented by 2×2 real matrices, so they can be regarded as vectors in R4. However, due to the normalization constraint jpj|i=1 for any i, they all lie in a bidimensional affine subspace. A natural Cartesian parametrization of such a subspace is given by

pj|i=p(x,y)=12[(1111)+x(1111)+y(1111)] 6.1

and binary conditional probability distributions form the square |x±y|≤1, whose four vertices are the right-stochastic matrices with all entries equal to 0 or 1.

As it is clear from equation (6.1):

  • — a permutation of the states {ρ0,ρ1} corresponds to the transformation (x,y)→(x,−y);

  • — a permutation of the effects {π0,π1} corresponds to the transformation (x,y)→(−x,−y);

  • — a permutation of the states {ρ0,ρ1} and effects {π0,π1} corresponds to the transformation (x,y)→(−x,y).

Therefore, for any channel X, the set S22(X) of binary conditional probability distributions compatible with X is symmetric for reflections around the x or y axes (i.e. it is D2-covariant).

As a consequence of our previous results, the sets S22(Ud) and S22(Fdλ) of conditional probability distributions compatible with any unitary and dephasing channels Ud and Fdλ coincide with the square |x±y|≤1, for any d and any λ. The set S22(T) of conditional probability distributions compatible with any trace-class channel T coincide with the segment x∈[−1,1], y=0.

With the parametrization in equation (6.1), the sets of binary conditional probability distributions compatible with any Pauli, amplitude-damping, erasure, depolarizing, universal 1→2 cloning and universal transposition channels as given by corollaries 4.3, 4.4, 5.35.6, respectively, are given in table 2 and depicted in figure 2.

Table 2.

Cartesian parametrization of the set S22(X) of binary conditional probability distributions compatible with channel X, for X given by the Pauli channel Pλ, the amplitude damping channel Aλ, the erasure channel Edλ, the depolarizing channel Ddλ, the universal 1→2 cloning channel Cd and the universal transposer Td.

graphic file with name rspa20160721-i2.jpg

Figure 2.

Figure 2.

Cartesian representation of the space of binary conditional probability distributions p. The outer white square denotes the polytope of all binary conditional probability distributions. The inner yellow region denotes the sets S22(X) of conditional probability distributions compatible with: (a) the erasure channel X=Edλ (for Δy=λ) and the universal optimal 1→2 cloning channel X=Cd (for Δy=d/(d+1)); (b) the Pauli channel X=Pλ (for Δy=maxk[1,3]|2(λ0+λk)1|); (c) the depolarizing channel X=Ddλ (for Δx=((d−2)/d)(1−λ) and Δy=λ) and the universal optimal transposition channel Td (for Δx=(d−2)/(d+1) and Δy=1/(d+1)); (d) the amplitude-damping channel Aλ (for Δy1=λ and Δy2=λ). (Online version in colour.)

7. Conclusion and outlook

In this work, we developed a device-independent framework for testing quantum channels. The problem was framed as a game involving an experimenter, claiming to be able to produce some quantum channel, and a theoretician, willing to trust observed correlations only. The optimal strategy consists of (i) all the input states and measurements generating the extremal correlations that the experimenter needs to produce and (ii) a full closed-form characterization of the correlations compatible with the claim, that the theoretician needs to compare with the observed correlations. For binary correlations, we explicitly derived the optimal strategy for the cases where the claimed channel is a dihedrally-covariant qubit channel, such as any Pauli and amplitude-damping channels, or an arbitrary-dimensional universally-covariant commutativity-preserving channel, such as any erasure, depolarizing, universal cloning and universal transposition channels.

Natural generalization of our results include relaxing the restriction of binary correlations, that is m=n=2, and extending the characterization of Smn(X) to other classes of channels. An interesting generalization would consist of letting the POVM {πy} depend upon an input not known during the preparation of {ρx}, as is the case in quantum random access codes. Moreover, the set-up in equation (1.1) could be modified to allow for entanglement alongside X, or many parallel or sequential uses of channel X.

We conclude by remarking that our results are particularly suitable for experimental implementation. For any channel X an experimenter claims to be able to produce, our framework only requires them to prepare orthogonal pure input states and perform orthogonal measurements in order to fully characterize S22(X) and thus device-independently test X.

Acknowledgements

We are grateful to Alessandro Bisio, Antonio Acín, Giacomo Mauro D’Ariano and Vlatko Vedral for valuable discussions and suggestions.

Appendix A. Proofs

In this section, we prove those results reported in the previous sections for which the proof, being lengthy and not particularly insightful, has only been outlined. The numbering of statements follows that of the previous sections.

Lemma A.1. —

The witness threshold W(X,w±(ω)) of any qubit D2-covariant channel X is given by equation (4.2) where

Δ(ω)={d21+c32ω2d22d32, if |ω|<d22d32d3c3,d3+c3|ω|,otherwise.

Proof. —

Under the assumption of D2-covariance, take without loss of generality c=(0,0,c3)T. Then without loss of generality we take d2d1 and c3≥0. If c3=0 without loss of generality we also take d3d2.

First notice that y*, which attains the maximum in equation (4.2), lies in the yz plane. Indeed, any ellipse obtained as the intersection of the ellipsoid |D−1y|2≤1 and a plane containing the z-axis is, up to a z rotation, a subset of the ellipse obtained as the intersection of the ellipsoid |D−1y|2≤1 and the yz plane.

The generic vector on the boundary of the yz ellipse can be parametrized as

y=(0,±d21z2d32,z)T,

with z∈[−d3,d3] and thus the maximum Euclidean distance in equation (4.3) is given by

Δ(ω)=maxz[d3,d3]d22(1z2d32)+(zωc3)2. A 1

By explicit computation one has

dΔ(ω)dz=[d22(1z2d32)+(zωc3)2]1/2[(1d22d32)zc3ω],

which is zero for z=c3d32ω/(d32d22) and

d2Δ(ω)dz2|z=z=[d32d22(1+c32ω2d22d32)]1(d32d22),

namely z* attains the maximum in equation (A.1) whenever d2d3. Therefore, the maximum is attained by z=z* iff −d3<z*≤d3, namely when |ω|<(d22d32)/d3c3, and by zd3 otherwise. By replacing z* and ±d3 in equation (A.1) the statement follows. ▪

Theorem A.2. —

Any given binary conditional probability distribution p is compatible with any given qubit D2-covariant channel X if and only if

maxωΩ(pw±(ω)W(X,w±(ω)))0,

where Ω:={0,±ω1ω2,±1}∩[−1,1].

Proof. —

The function f±(ω):=pTw±(ω)W(X,ω) is the minimum of continuous functions g±(ω):=pTw±(ω)12(1+|ω|) and h±(ω):=pTw±(ω)12(1+Δ(ω)). Therefore, maxω[1,1]f±(x) is attained by those values of ω maximizing g±(ω) or h±(ω), or in the intersections of g±(ω) and h±(ω).

The function g±(ω) is piece-wise linear and attains its maximum on [−1,1] in 0. The function h±(ω) is quasi-concave continuous with a continuous derivative. Indeed

2dh±(ω)dω={±(p1|1p2|2)d2c32ω(d22d32)(d22d32+c32ω2),if |ω|<d22d32d3c3,±(p1|1p2|2)sgn(ω)c3,otherwise

is continuous and

2d2h±(ω)dω2={d2c32(d22d32)2[(d22d32)(d22d32+c32ω2)]3/2,if |ω|<d22d32d3c3,0,if |ω|>d22d32d3c3

is non-positive. Therefore, h±(ω) attains its maximum on [−1,1] in 0, ±1, or in the zero ±ω1 of its first derivative.

Owing to the piece-wise linearity of g±(ω) and the quasi-concavity of h±(ω), since g±(0)≥h±(0) and g±(±1)≤h±(±1) one has that g±(ω) and h±(ω) intersect in exactly two points ±ω2∈[−1,1], thus the statement follows. ▪

Corollary A.3. —

Any given binary conditional probability distribution p is compatible with the amplitude-damping channel Aλ if and only if

(p1|2p2|1p1|1p2|2)2λ.

Proof. —

One has c3=1−λ, d2=λ and d3=λ, thus

ω1=λ(1λ)((1λ)(p1|1p2|2)2)(p1|1p2|2)

and ω2=1. By explicit computation, the conditions ω1R and |ω1|≤1 are equivalent to (p1|1p2|2)2<1−λ and (p1|1p2|2)2≤(1−λ)2, respectively, thus ω1∈[−1,1] is equivalent to (p1|1p2|2)2≤(1−λ)2 for any λ>0.

By explicit computation, the maximum in equation (4.5) is attained at ωω1 and ω=±1 whenever |p1|1p2|2|≤1−λ and |p1|1p2|2|>1−λ, respectively. Thus equation (4.5) becomes

|p1|1p1|2|λ[1(p1|1p2|2)21λ]0,

whenever |p1|1p2|2|≤1−λ, which, by solving in λ, becomes λλλ+ whenever λ≤1−|p1|1p2|2|, where λ±=(p1|1p2|2±p1|2p2|1)2. By explicit computation 1−|p1|1p2|2|≤λ+, so the statement follows. ▪

Footnotes

1

The role of the circuit within any hypothesis is to describe the space–time structure of the experiment, usually assumed to obey special relativity. Thus, while circuits corresponding to space-like correlations are constrained by the no-signalling principle, those corresponding to time-like correlations are only constrained by the strictly weaker no-signalling-from-the-future principle [1], (M Ozawa 2006, private communication). As a consequence, the hypotheses falsifiable in a time-like test are inherently more specific than those falsifiable in a space-like test: for instance, while a Bell-test [24] can rule out classical theory altogether, a classical model always exists supporting any given time-like correlation.

2

As a comparison we note that while our approach is top-down, i.e. it aims at characterizing the set of correlations compatible with a given hypothesis, in self-testing [611] the approach is bottom-up, i.e. it aims at characterizing the set of hypotheses compatible with a given correlation.

Authors' contributions

All authors contributed equally to the original ideas, analytical derivations and final writing of this manuscript, and gave final approval for publication.

Competing interests

The authors declare no competing interests.

Funding

M.D.A. acknowledges support from the Singapore Ministry of Education Academic Research Fund Tier 3 (grant no. MOE2012-T3-1-009). F.B. acknowledges support from the JSPS KAKENHI, no. 26247016.

References


Articles from Proceedings. Mathematical, Physical, and Engineering Sciences are provided here courtesy of The Royal Society

RESOURCES