Thermodynamic Implementations of Quantum Processes

Philippe Faist; Mario Berta; Fernando G S L Brandao

doi:10.1007/s00220-021-04107-w

2021 May 28;384(3):1709–1750. doi: 10.1007/s00220-021-04107-w


PMCID: PMC8550554 PMID: 34776522

Abstract

Recent progress in understanding the thermodynamics of small-scale systems has enabled the characterization of the thermodynamic requirements of implementing quantum processes for fixed input states. Here, we extend these results to construct optimal universal implementations of a given process, that is, implementations that are accurate for any possible input state even after many independent and identically distributed (i.i.d.) repetitions of the process. We find that the optimal work cost rate of such an implementation is given by the thermodynamic capacity of the process, which is a single-letter and additive quantity defined as the maximal difference in relative entropy to the thermal state between the input and the output of the channel. Beyond being a thermodynamic analogue of the reverse Shannon theorem for quantum channels, our results introduce a new notion of quantum typicality and present a thermodynamic application of convex-split methods.

Introduction

In the information-theoretic approach to thermodynamics, a careful analysis of the resources required to perform thermodynamic tasks has made it possible to describe consistently and systematically the thermodynamic behaviour of quantum systems at the nano-scale [1]. In particular, thermodynamics can be phrased as a resource theory [2–4]. In a resource theory, one specifies which operations can be carried out at no cost (the free operations) and then one studies how many external resources (e.g., thermodynamic work) one needs to provide to carry out operations that are not free. Two established resource theories for quantum thermodynamics are thermal operations [2, 3] and Gibbs-preserving maps [5, 6]. In the former, the free operations consist of energy-conserving interactions of the system with a heat bath, while in the latter, the free operations are all quantum operations that preserve the thermal state. It is reasonable to assume that thermal operations can be realized in an idealized setting, making them a good choice of framework for constructing explicit protocols, whereas Gibbs-preserving maps encompass a broader class of operations, allowing us to derive stronger fundamental limits.

The resource theory approach to thermodynamics has revealed close connections with measures of information known from quantum information theory [7, 8]. Namely, single-shot thermodynamic and information-theoretic tasks are both quantified by relevant entropy measures [9–11]. Consequently, tools from quantum Shannon theory can be used to characterize tasks in thermodynamics, for instance to derive second-order asymptotics of the work cost of state transformations [12]. Recently, the focus has shifted to understanding the resource costs of quantum processes, rather than state transformations [13–16]. The information measure associated with quantum processes is the quantum capacity, along with its many variants [17]. A natural question arises: What is the thermodynamic analogue of the quantum capacity?

Here, we ask how much work is required to implement a given quantum process, with the requirement that the implementation is accurate for any possible input state. In the single-instance regime, we find that the answer is a variation of the results obtained in Ref. [16]. However, in the regime where we consider many independent and identically distributed (i.i.d.) copies of the process, important differences arise due to typicality. We find that the optimal work cost of such an implementation in the i.i.d. regime is given by the thermodynamic capacity, defined as the maximal difference between the input and output free energy of the process over all possible input states. The fact that no implementation can perform better than the thermodynamic capacity follows fairly straightforwardly from the results of Ref. [16]. The technically challenging part of the present paper is to show that there exist protocols that achieve this limit.

We provide three different constructions of such protocols, each valid in different settings. In the first construction, we make the simplifying assumption that the Hamiltonian of the system is trivial, as in Ref. [13]. We then show that simple properties of one-shot entropy measures, coupled with the post-selection technique [18], provide an existence proof of the required implementation. The implementation is given in terms of thermal operations. In our second construction, we develop novel quantum typicality tools which we use along with the post-selection technique to explicitly construct an implementation in terms of Gibbs-preserving maps for any i.i.d. process and for any system Hamiltonian. In our third construction, we assume that the i.i.d. process is time-covariant, i.e., commutes with the time evolution. We then use recent results on the convex-split lemma and position-based decoding [19] to construct an implementation of a time-covariant i.i.d. process with thermal operations.

Our results imply that the thermodynamic resource theory of channels becomes reversible in the i.i.d. limit [20]. Namely, invoking the results in Ref. [21], we see that the work rate that is required to implement a given i.i.d. process is the same as what can be extracted if the i.i.d. process is provided to us as a black box. This provides a thermodynamic analogue of the reverse Shannon theorem from quantum information theory. This theorem states that the quantum mutual information of the channel uniquely characterizes the resources required to simulate the channel with noiseless channel uses and shared entanglement, as well as to distil a noiseless channel from many uses of the channel and shared entanglement [22, 23]. Indeed, our proof techniques are inspired by Refs. [22, 24–26].

The remainder of this paper is structured as follows. Section 2 gives the necessary preliminaries and fixes some notation. Section 3 introduces two resource theories for thermodynamics, thermal operations and Gibbs-preserving maps. In Sect. 4 we introduce the thermodynamic capacity and present some elementary properties. In Sect. 5, we provide our first construction for a trivial Hamiltonian. In Sect. 6 we provide our second construction, which is valid in the general setting and provides an implementation in terms of Gibbs-preserving maps. Section 7 provides our third construction, valid for time-covariant i.i.d. processes, and built with thermal operations. Our conclusions are presented in Sect. 8. Various more technical proof details are deferred to “Appendices A–F”.

Preliminaries

Quantum states, quantum processes, and distance measures

Each quantum system considered lives in a finite-dimensional Hilbert space. A quantum state is a positive semi-definite operator $ρ$ satisfying $tr [ρ] = 1$. A sub-normalized quantum state is a positive semi-definite operator $ρ$ satisfying $tr [ρ] ⩽ 1$. To each system S is associated a standard basis, usually denoted by $\{{| k ⟩}_{S}\}$. For any two systems $A, A^{'}$, we denote by $A ≃ A^{'}$ the fact that they are isometric. In that case, we consider a representation in which the isometry maps the standard basis onto the standard basis, i.e., ${id}_{A \to A^{'}} ({| k ⟩ ⟨ k |}_{A}) = {| k ⟩ ⟨ k |}_{A^{'}}$ for all k, where ${id}_{A \to A^{'}}$ denotes the identity process. For any two systems $A ≃ A^{'}$, we define the non-normalized maximally entangled reference ket ${| Φ ⟩}_{A : A^{'}} = \sum_{k} {| k ⟩}_{A} \otimes {| k ⟩}_{A^{'}}$. Matrix inequalities are with respect to the positive semi-definite cone: $A ⩽ B$ signifies that $B - A$ is positive semi-definite. A completely positive map $E_{X \to X^{'}}$ is a linear mapping that maps Hermitian operators on X to Hermitian operators on $X^{'}$ and that satisfies $E_{X \to X^{'}} (Φ_{X : R_{X}}) ⩾ 0$, where $R_{X} ≃ X$. The adjoint $E_{X \leftarrow X^{'}}^{†}$ of a completely positive map $E_{X \to X^{'}}$ is the unique completely positive map $X^{'} \to X$ that satisfies $tr [E (Y) Z] = tr [Y E^{†} (Z)]$ for all operators Y, Z. A completely positive map $E_{X \to X^{'}}$ is trace-preserving if $E^{†} (1_{X^{'}}) = 1_{X}$ and trace non-increasing if $E^{†} (1_{X^{'}}) ⩽ 1_{X}$.

Proximity of quantum states can be measured by the fidelity $F (ρ, σ) = ‖ \sqrt{ρ} \sqrt{σ} ‖_{1}$ , where the one-norm of an operator is defined as ${‖ A ‖}_{1} = tr [\sqrt{A^{†} A}]$ . The fidelity is extended to sub-normalized states $ρ, σ$ as the generalized fidelity, $\bar{F} (ρ, σ) = ‖ \sqrt{ρ} \sqrt{σ} ‖_{1} + \sqrt{(1 - tr [ρ]) (1 - tr [σ])}$ , noting that $F (\cdot, \cdot) = \bar{F} (\cdot, \cdot)$ whenever at least one of the states is normalized. An associated metric can be defined for any sub-normalized states as $P (ρ, σ) = \sqrt{1 - {\bar{F}}^{2} (ρ, σ)}$ , called the purified distance [10, 11, 27], or root infidelity, and is closely related to the Bures distance and the quantum angle [28]. The proximity of two sub-normalized quantum states $ρ, σ$ may also be measured in the trace distance $D (ρ, σ) = \frac{1}{2} {‖ ρ - σ ‖}_{1}$ . We note that the one-norm of a Hermitian operator A can be expressed as

\begin{matrix} {‖ A ‖}_{1} = max_{{‖ Z ‖}_{\infty} ⩽ 1} tr [Z A] = min_{\begin{matrix} Δ_{\pm} ⩾ 0 \\ A = Δ_{+} - Δ_{-} \end{matrix}} tr [Δ_{+}] + tr [Δ_{-}], \end{matrix}

where the first optimization ranges over Hermitian operators Z and the second ranges over positive semi-definite operators $Δ_{\pm}$. For any two states $ρ, σ$ (one can even be sub-normalized), the purified distance and the trace distance are related via

\begin{matrix} D (ρ, σ) ⩽ P (ρ, σ) ⩽ \sqrt{2 D (ρ, σ)} . \end{matrix}
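The definitions above can be checked numerically. The following is a minimal sketch, assuming NumPy is available; the helper names (`psd_sqrt`, `one_norm`, and so on) and the example states are illustrative choices and do not come from the paper. It evaluates the trace distance and the purified distance for a pair of qubit states and verifies the two-sided bound stated above.

```python
import numpy as np

def psd_sqrt(a):
    """Square root of a positive semi-definite matrix via its eigendecomposition."""
    vals, vecs = np.linalg.eigh(a)
    vals = np.clip(vals, 0.0, None)  # remove tiny negative rounding errors
    return (vecs * np.sqrt(vals)) @ vecs.conj().T

def one_norm(a):
    """One-norm (trace norm): the sum of the singular values of a."""
    return float(np.sum(np.linalg.svd(a, compute_uv=False)))

def generalized_fidelity(rho, sigma):
    """Generalized fidelity for (possibly sub-normalized) states."""
    overlap = one_norm(psd_sqrt(rho) @ psd_sqrt(sigma))
    deficit = (1.0 - np.trace(rho).real) * (1.0 - np.trace(sigma).real)
    return overlap + np.sqrt(max(deficit, 0.0))

def purified_distance(rho, sigma):
    f = generalized_fidelity(rho, sigma)
    return float(np.sqrt(max(1.0 - f**2, 0.0)))

def trace_distance(rho, sigma):
    return 0.5 * one_norm(rho - sigma)

rho = np.array([[0.8, 0.2], [0.2, 0.2]])  # hypothetical qubit state
sigma = np.eye(2) / 2                     # maximally mixed qubit state
D = trace_distance(rho, sigma)
P = purified_distance(rho, sigma)
print(D, P, np.sqrt(2 * D))               # expect D <= P <= sqrt(2 D)
```

Computing the one-norm from singular values avoids forming $\sqrt{A^{†} A}$ explicitly, which is a convenience of this sketch rather than anything prescribed by the text.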

Similarly, we may define a distance measure for channels: For two completely positive, trace non-increasing maps $T_{X \to X^{'}}$ and $T_{X \to X^{'}}^{'}$ , the diamond norm distance is defined as

\begin{matrix} \frac{1}{2} {∥T_{X \to X^{'}} - T_{X \to X^{'}}^{'}∥}_{⋄} = max_{σ_{XR}} D (T_{X \to X^{'}} (σ_{XR}), T_{X \to X^{'}}^{'} (σ_{XR})), \end{matrix}

where the optimization ranges over all bipartite quantum states over X and a reference system $R ≃ X$ . The optimization may be restricted to pure states without loss of generality.

Entropy measures

The von Neumann entropy of a quantum state $ρ$ is $H (ρ) = - tr [ρ ln ρ]$ . In this work, all entropies are defined in units of nats, using the natural logarithm $ln (\cdot)$ , instead of units of (qu)bits. A number of nats is equal to $ln (2)$ times the corresponding number of qubits. The conditional von Neumann entropy of a bipartite state $ρ_{AB}$ is given by

\begin{matrix} H {(A | B)}_{ρ} = H {(A B)}_{ρ} - H {(B)}_{ρ} = H (ρ_{AB}) - H (ρ_{B}) . \end{matrix}

The quantum relative entropy is defined as

\begin{matrix} D (ρ ‖ σ) = tr [ρ (ln ρ - ln σ)], \end{matrix}

where $ρ$ is a quantum state and where $σ$ is any positive semi-definite operator whose support contains the support of $ρ$ .
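As an illustration of these definitions, here is a short sketch, assuming NumPy and using hypothetical helper names, that evaluates the von Neumann entropy and the quantum relative entropy in nats. It also checks the identity $D (ρ ‖ 1) = - H (ρ)$, which follows directly from the definition because $ln 1 = 0$.

```python
import numpy as np

def von_neumann_entropy(rho):
    """H(rho) = -tr[rho ln rho] in nats, with the convention 0 ln 0 = 0."""
    vals = np.linalg.eigvalsh(rho)
    vals = vals[vals > 1e-12]
    return float(-np.sum(vals * np.log(vals)))

def relative_entropy(rho, sigma):
    """D(rho || sigma) in nats; this sketch assumes sigma is positive definite."""
    s_vals, s_vecs = np.linalg.eigh(sigma)
    log_sigma = (s_vecs * np.log(s_vals)) @ s_vecs.conj().T
    # tr[rho ln rho] = -H(rho)
    return float(-von_neumann_entropy(rho) - np.trace(rho @ log_sigma).real)

rho = np.array([[0.75, 0.0], [0.0, 0.25]])  # hypothetical qubit state
h = von_neumann_entropy(rho)
d_identity = relative_entropy(rho, np.eye(2))     # equals -H(rho)
d_mixed = relative_entropy(rho, np.eye(2) / 2)    # equals ln 2 - H(rho)
print(h, d_identity, d_mixed)
```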

Schur–Weyl duality

Consider a Hilbert space $H_{A}$ and $n \in N$ . The group $GL (d_{A}) \times S_{n}$ acts naturally on $H_{A}^{\otimes n}$ , where $X \in GL (d_{A})$ acts as $X^{\otimes n}$ and where the permutation group permutes the tensor factors. We follow closely the notation of Refs. [24, 25]. Schur–Weyl tells us that the full Hilbert space decomposes as

\begin{matrix} H_{A}^{\otimes n} ≃ ⨁_{λ} V_{λ} = ⨁_{λ} Q_{λ} \otimes P_{λ}, \end{matrix}

where $λ \in Young (n, d)$ are Young diagrams with n boxes and (at most) d rows, and where $Q_{λ}$ , $P_{λ}$ are irreducible representations of $GL (d_{A})$ and $S_{n}$ , respectively. The number of Young diagrams in the decomposition above is at most $poly (n)$ , if $d_{A}$ is kept constant. We write $poly (n) = O (poly (n))$ in big O notation for terms whose absolute value is upper bounded by some polynomial $n^{c}$ for $c \in N$ in the asymptotic limit $n \to \infty$ .

We denote by $Π_{A^{n}}^{λ}$ the projector in $H_{A}^{\otimes n}$ onto the term labelled by $λ$ in the decomposition above. We denote by $q_{λ} (X)$ a representing matrix of $X \in GL (d_{A})$ in the irreducible representation labelled by $λ$ ; the operator $q_{λ} (X)$ lives in $Q_{λ}$ . We furthermore introduce the following notation, for any $Y \in Q_{λ} \otimes P_{λ}$ ,

\begin{matrix} {[Y]}_{λ} = 1_{(Q_{λ} \otimes P_{λ}) \to A^{n}} Y 1_{(Q_{λ} \otimes P_{λ}) \leftarrow A^{n}}^{†}, \end{matrix}

which represents the canonical embedding of an operator Y on $Q_{λ} \otimes P_{λ}$ into the space $H_{A}^{\otimes n}$ , i.e., mapping Y onto the corresponding block in (6). In particular,

\begin{matrix} Π_{A^{n}}^{λ} {[Y]}_{λ} Π_{A^{n}}^{λ} = {[Y]}_{λ} . \end{matrix}

Any operator $X_{A^{n}}$ acting on the n copies which commutes with all the permutations admits a decomposition of the form

\begin{matrix} X_{A^{n}} = \sum_{λ} {[X_{λ} \otimes 1_{P_{λ}}]}_{λ} \end{matrix}

for some set of operators $X_{λ} \in Q_{λ}$ . In particular, $[X_{A^{n}}, Π_{A^{n}}^{λ}] = 0$ . We can make this more precise for i.i.d. states. For any $X \in GL (d_{A})$ , we have that

\begin{matrix} [Π_{A^{n}}^{λ}, X^{\otimes n}] = 0 \end{matrix}

\begin{matrix} X^{\otimes n} = \sum_{λ} {[q_{λ} (X) \otimes 1_{P_{λ}}]}_{λ} . \end{matrix}

For a given $λ \in Young (n, d)$ , it is often useful to consider the corresponding normalized probability distribution $λ / n = {(λ_{i} / n)}_{i}$ . The entropy of this distribution is given by

\begin{matrix} \bar{H} (λ) = H (λ / n) = - \sum_{i} \frac{λ_{i}}{n} ln \frac{λ_{i}}{n}, \end{matrix}

where $λ_{i}$ is the number of boxes in the i-th row of the diagram.
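To make the objects in this subsection concrete, the sketch below enumerates the Young diagrams with n boxes and at most d rows and evaluates $\bar{H} (λ)$ for each. It uses only the Python standard library; the function names are hypothetical. The count of diagrams is bounded by ${(n + 1)}^{d}$, which illustrates the $poly (n)$ statement above for fixed d.

```python
import math

def young_diagrams(n, d):
    """Yield all partitions of n into at most d non-increasing rows (zero-padded)."""
    def rec(remaining, rows_left, max_part):
        if rows_left == 0:
            if remaining == 0:
                yield ()
            return
        for part in range(min(remaining, max_part), -1, -1):
            for rest in rec(remaining - part, rows_left - 1, part):
                yield (part,) + rest
    yield from rec(n, d, n)

def diagram_entropy(lam):
    """Entropy (in nats) of the normalized distribution lambda / n."""
    n = sum(lam)
    return -sum((l / n) * math.log(l / n) for l in lam if l > 0)

n, d = 6, 2
diagrams = list(young_diagrams(n, d))
for lam in diagrams:
    print(lam, diagram_entropy(lam))
```

For n = 6 and d = 2 there are four diagrams, ranging from the single-row diagram (entropy 0) to the balanced diagram (3, 3) with entropy $ln 2$.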

If we have n copies of a bipartite system $H_{A} \otimes H_{B}$ , then we may Schur–Weyl decompose $H_{A}^{\otimes n}$ , $H_{B}^{\otimes n}$ and ${(H_{A} \otimes H_{B})}^{\otimes n}$ under the respective actions of $GL (d_{A}) \times S_{n}$ , $GL (d_{B}) \times S_{n}$ and $GL (d_{A} d_{B}) \times S_{n}$ . A useful property we will need here is that the projectors onto the respective Schur–Weyl blocks commute between these decompositions.

Lemma 2.1

Consider two spaces $H_{A}, H_{B}$ and let $Π_{A^{n} B^{n}}^{λ}$ and $Π_{A^{n}}^{λ^{'}}$ be the projectors onto Schur–Weyl blocks of $H_{AB}^{\otimes n}$ and $H_{A}^{\otimes n}$, respectively, with $λ \in Young (n, d_{A} d_{B})$ and $λ^{'} \in Young (n, d_{A})$. Then, we have

\begin{matrix} [Π_{A^{n} B^{n}}^{λ}, Π_{A^{n}}^{λ^{'}} \otimes 1_{B^{n}}] = 0 . \end{matrix}

Proof

$Π_{A^{n}}^{λ^{'}} \otimes 1_{B^{n}}$ is invariant under the action of $S_{n}$ permuting the copies of $A \otimes B$ , and so it admits a decomposition of the form (9) and commutes with $Π_{A^{n} B^{n}}^{λ}$ . $□$

The following is another lemma about how much overlap Schur–Weyl blocks have on a bipartite system versus on one of the two systems. This lemma forms the basis of our universal typical subspace.

Lemma 2.2

Consider $n \in N$ copies of a bipartite system $H_{A} \otimes H_{B}$. Then, for any $λ \in Young (n, d_{A} d_{B})$ and $λ^{'} \in Young (n, d_{B})$, we have

\begin{matrix} Π_{B^{n}}^{λ^{'}} {tr}_{A^{n}} [Π_{A^{n} B^{n}}^{λ}] Π_{B^{n}}^{λ^{'}} ⩽ poly (n) e^{n (\bar{H} (λ) - \bar{H} (λ^{'}))} Π_{B^{n}}^{λ^{'}} \end{matrix}

noting that $[1_{A^{n}} \otimes Π_{B^{n}}^{λ^{'}}, Π_{A^{n} B^{n}}^{λ}] = 0$ .

The proof is provided in “Appendix A”.

Estimating entropy

Measuring the Young diagram $λ$, that is, performing the projective measurement with operators $\{Π_{A^{n}}^{λ}\}_{λ}$, yields a good estimation of the spectrum of a state $ρ_{A}$ when given $ρ_{A}^{\otimes n}$ [25]. An estimate for the entropy of $ρ$ is thus obtained by calculating the entropy $H (λ / n)$ corresponding to the probability distribution $λ / n$.

Proposition 2.1

(Spectrum and entropy estimation [22, 24, 25]). Consider $n \in N$ copies of a system $H_{A}$. Then, the family of projectors $\{Π_{A^{n}}^{λ}\}_{λ}$ given by Schur–Weyl duality forms a POVM obeying the following property: For any $δ > 0$, there exists an $η > 0$ such that for any state $ρ_{A}$, we have

\begin{matrix} tr [(\sum_{λ : \bar{H} (λ) \in [H (ρ) \pm δ]} Π_{A^{n}}^{λ}) ρ_{A}^{\otimes n}] ⩾ 1 - poly (n) exp (- n η) . \end{matrix}

The proof is provided in “Appendix A”.

Estimating energy

Proposition 2.2

Consider any observable $H_{A}$ on $H_{A}$ and write $Γ_{A} = e^{- H_{A}}$ . Then, the set of projectors $\{R_{A^{n}}^{k}\}$ onto the eigenspaces of $Γ_{A}^{\otimes n}$ forms a POVM satisfying the following properties:

(i) There are at most $poly (n)$ POVM elements, with the label k running over a set $k \in K_{n} (H_{A}) \subset R$;
(ii) We have $[R_{A^{n}}^{k}, Γ_{A}^{\otimes n}] = 0$ and $e^{- n k} R_{A^{n}}^{k} = R_{A^{n}}^{k} Γ_{A}^{\otimes n}$;
(iii) For any $δ > 0$ and for any state $ρ_{A}$,
$\begin{matrix} tr [R_{A^{n}}^{\approx_{δ} tr [ρ_{A} H_{A}]} ρ_{A}^{\otimes n}] ⩾ 1 - 2 e^{- n η} \end{matrix}$ (16)
with $η = δ^{2} / (2 ‖ H_{A} ‖_{\infty}^{2})$, and where for any $h \in R$ we define
$\begin{matrix} R_{A^{n}}^{\approx_{δ} h} = \sum_{k \in K_{n} (H_{A}) : | k - h | ⩽ δ} R_{A^{n}}^{k} . \end{matrix}$ (17)
(iv) For any $h \in R$, we have
$\begin{matrix} e^{- n (h + δ)} R_{A^{n}}^{\approx_{δ} h} ⩽ R_{A^{n}}^{\approx_{δ} h} Γ_{A}^{\otimes n} ⩽ e^{- n (h - δ)} R_{A^{n}}^{\approx_{δ} h} . \end{matrix}$ (18)

The proof is provided in “Appendix A”.
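Two of these properties can be illustrated with a small standard-library sketch for an observable that is diagonal in the standard basis. It makes simplifying assumptions that are not in the text: the state commutes with $H_{A}$, so that property (iii) reduces to a classical concentration statement about the average energy per copy. All function names and numerical values are hypothetical.

```python
import math
from itertools import combinations_with_replacement

def energy_labels(energies, n):
    """Distinct average energies k that label the eigenspaces of Gamma^{(x)n}."""
    return sorted({round(sum(c) / n, 12)
                   for c in combinations_with_replacement(energies, n)})

def typical_weight(p_excited, gap, n, delta):
    """Weight of rho^{(x)n} on average energies within delta of tr[rho H],
    for a diagonal qubit state with H = diag(0, gap)."""
    h = p_excited * gap                      # mean energy per copy
    total = 0.0
    for j in range(n + 1):                   # j = number of excited copies
        k = j * gap / n
        if abs(k - h) <= delta + 1e-12:      # small tolerance for rounding
            total += math.comb(n, j) * p_excited**j * (1 - p_excited)**(n - j)
    return total

# Property (i): the number of labels grows polynomially in n for fixed dimension.
labels = energy_labels([0.0, 1.0, 2.5], n=20)
print(len(labels), "labels; at most", math.comb(22, 2), "energy types")

# Property (iii): compare the exact weight with the stated lower bound.
n, delta, gap = 200, 0.1, 1.0
weight = typical_weight(p_excited=0.3, gap=gap, n=n, delta=delta)
bound = 1 - 2 * math.exp(-n * delta**2 / (2 * gap**2))
print(weight, ">=", bound)
```

In this example the exact weight is far above the bound, which is expected since the bound is of Hoeffding type and holds uniformly over all states.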

Post-selection technique

The post-selection technique is useful for bounding the diamond-norm distance between a candidate smoothed channel and a target ideal i.i.d. channel.

Theorem 2.1

(Post-selection technique [18]). Let $X, X^{'}$ be quantum systems, $E_{X \to X^{'}}$ be a completely positive, trace-preserving map, and $T_{X^{n} \to X^{' n}}$ be a completely positive, trace non-increasing map. Furthermore, let $\bar{R} ≃ X$ ,

\begin{matrix} ζ_{X^{n}} = {tr}_{{\bar{R}}^{n}} [\int d ϕ_{X \bar{R}} {| ϕ ⟩ ⟨ ϕ |}_{X \bar{R}}^{\otimes n}] = \int d σ_{X} σ_{X}^{\otimes n}, \end{matrix}

where $d ϕ_{X \bar{R}}$ denotes the Haar-induced measure on the pure states on $X \otimes \bar{R}$ , and $d σ_{X}$ its induced measure on X after partial trace, and let ${| ζ ⟩}_{X^{n} R}$ be a purification of $ζ_{X^{n}}$ . Then, we have

\begin{matrix} \frac{1}{2} {‖ T - E^{\otimes n} ‖}_{⋄} ⩽ poly (n) D (T (ζ_{X^{n} R}), E^{\otimes n} (ζ_{X^{n} R})) . \end{matrix}

Moreover, for all $n \in N$ there exists a set $\{{| ϕ_{i} ⟩}_{X \bar{R}}\}$ of at most $poly (n)$ states, and a probability distribution $\{p_{i}\}$, providing a purification of $ζ_{X^{n}}$ as

\begin{matrix} {| ζ ⟩}_{X^{n} {\bar{R}}^{n} R^{'}} = \sum_{i} \sqrt{p_{i}} | ϕ_{i} ⟩_{X \bar{R}}^{\otimes n} \otimes {| i ⟩}_{R^{'}} \end{matrix}

with a register $R^{'}$ of size $poly (n)$ .

The first part of the theorem is [18, Eq. (4)] and the second part is, e.g., found as [23, Cor. D.6]. The following proposition shows that a given channel is close to an i.i.d. channel, if it behaves as expected on all i.i.d. states with exponentially good accuracy.

Proposition 2.3

For three systems $X, X^{'}, E$ , let $V_{X \to X^{'} E}$ be an isometry and $W_{X^{n} \to X^{' n} E^{n}}$ be an isometry which commutes with the permutations of the n systems. Furthermore, assume that there exists $η > 0$ independent of n such that for all pure states ${| σ ⟩ ⟨ σ |}_{X R_{X}}$ with a reference system $R_{X} ≃ X$ , we have

\begin{matrix} Re \{{⟨ σ |}_{X R_{X}}^{\otimes n} {(V_{X \leftarrow X^{'} E}^{†})}^{\otimes n} W_{X^{n} \to X^{' n} E^{n}} {| σ ⟩}_{X R_{X}}^{\otimes n}\} ⩾ 1 - poly (n) exp (- n η) . \end{matrix}

For $E_{X \to X^{'}} (\cdot) = {tr}_{E} [V_{X \to X^{'} E} (\cdot) V^{†}]$ and $T_{X^{n} \to X^{' n}} (\cdot) = {tr}_{E^{n}} [W_{X^{n} \to X^{' n} E^{n}} (\cdot) W^{†}]$ we then have

\begin{matrix} \frac{1}{2} ‖ T_{X^{n} \to X^{' n}} - E_{X \to X^{'}}^{\otimes n} ‖_{⋄} ⩽ poly (n) exp (- n η / 2) . \end{matrix}

The proof is provided in “Appendix A”.

Resource Theory of Thermodynamics

Gibbs-preserving maps

We consider the framework of Ref. [16], where a positive semi-definite operator $Γ_{S} ⩾ 0$ is associated with each system S considered. A trace non-increasing, completely positive map $Φ_{A \to B}$ is allowed for free if it satisfies $Φ_{A \to B} (Γ_{A}) ⩽ Γ_{B}$. In the case of a system S with Hamiltonian $H_{S}$, and in the presence of a single heat bath at inverse temperature $β$, the relevant thermodynamic framework is given by setting $Γ_{S} = e^{- β H_{S}}$. In the remainder of this paper, when using the present framework, it is convenient to work with the $Γ$ operators on an abstract level. The results then also apply to situations where several different thermodynamic baths are considered, or in more general settings where a specific operator needs to be conserved by the spontaneous evolution of the system [16].

The resources required to enable non-free operations are counted using an explicit system that provides these resources, such as an information battery. An information battery is a large register W whose associated operator $Γ_{W}$ is simply $Γ_{W} = 1_{W}$ (i.e., $H_{W} = 0$ ). The information battery is required to be in a state of the special form $τ_{W}^{m} = P_{W}^{m} / tr [P_{W}^{m}]$ where $P_{W}^{m}$ is a projector of rank $e^{m}$ . That is, $τ_{W}^{m}$ has uniform eigenvalues over a given rank $e^{m}$ . We denote the charge or resource value of a battery state $τ_{W}^{m}$ by $w (τ_{W}^{m}) = ln (d) - m$ , where d is the dimension of the information battery. The value $w (τ)$ measures the amount of purity present in the state $τ$ , which is the basic resource required to implement maps that are not already Gibbs-preserving maps. We choose to measure $w (τ)$ in units of number of pure nats, equal to $ln (2)$ times a number of pure qubits. A Gibbs-preserving map that acts jointly on a system and an information battery, and which maps the input battery state $τ$ to the output battery state $τ^{'}$ , is deemed to consume an amount of work $w = w (τ) - w (τ^{'})$ .

The resources can be counted in terms of thermodynamic work in units of energy if we are given a heat bath at temperature T. Recall that a pure qubit can be converted to $k T ln (2)$ work using a Szilárd engine, where k is Boltzmann’s constant [29]. By counting purity in nats instead of qubits, we get rid of the $ln (2)$ factor: A number $λ$ of pure nats can be converted into $λ k T$ thermodynamic work using a Szilárd-type engine. We count work exclusively in equivalent of pure nats, for simplicity, as opposed to units of energy. The two are directly related by a factor $β^{- 1} = k T$. Furthermore, this eliminates the factor $β$ from otherwise essentially information-theoretic expressions, and our theorems thus directly apply to cases where $Γ_{X}, Γ_{X^{'}}$ are any abstract positive semi-definite operators which are not necessarily defined via a Hamiltonian.
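The bookkeeping for the information battery is simple enough to spell out in a few lines. The sketch below uses only the standard library; the function names, the battery dimension and the temperature of 300 K are illustrative assumptions. It evaluates the charge $w (τ_{W}^{m}) = ln (d) - m$, the work consumed between two battery states, and the conversion from pure nats to energy via the factor $k T$.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K

def battery_charge(dim, m):
    """w(tau^m) = ln(d) - m for a battery state that is uniform over rank e^m."""
    return math.log(dim) - m

def work_consumed(dim, m_in, m_out):
    """Work in pure nats: w(tau_in) - w(tau_out)."""
    return battery_charge(dim, m_in) - battery_charge(dim, m_out)

def nats_to_joules(nats, temperature):
    """Convert pure nats to energy using the factor k T."""
    return nats * K_B * temperature

# Hypothetical example: a two-qubit battery (d = 4) goes from a pure state
# (rank 1, m = 0) to a state of rank 2 (m = ln 2), consuming one pure qubit.
w = work_consumed(dim=4, m_in=0.0, m_out=math.log(2))
print(w, "nats =", nats_to_joules(w, temperature=300.0), "J at 300 K")
```

The work consumed equals $m^{'} - m$ and does not depend on the battery dimension, which is why the dimension can be chosen freely as long as it accommodates both ranks.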

Let $Φ_{X W \to X^{'} W}$ be a Gibbs-preserving map acting on an information battery W, and let $τ_{W}^{m}$, $τ_{W}^{m^{'}}$ be two information battery states. An implementation running the operation $Φ_{X W \to X^{'} W}$ with the given input and output battery states is tasked to (a) make available the input battery state, (b) apply the operation $Φ_{X W \to X^{'} W}$, and (c) check that the output battery state is appropriate (e.g., for possible future re-use). For the verification in Point (c) it is sufficient to measure the two-outcome POVM $\{P_{W}^{m^{'}}, 1 - P_{W}^{m^{'}}\}$; as long as the first outcome is observed, it is always possible to bring the state to $τ_{W}^{m^{'}}$ by applying a completely thermalizing operation on the support of $P_{W}^{m^{'}}$ (here, this is a completely randomizing or completely symmetrizing operation). In the constructions presented in the present paper, we allow this verification measurement to fail with a small fixed probability $ϵ > 0$.

A convenient mathematical object to characterize what the operation does on the system is the following. The effective work process $T_{X \to X^{'}}$ associated with $Φ_{X W \to X^{'} W}$ and $(τ_{W}^{m}, τ_{W}^{m^{'}})$ is the trace non-increasing map defined as

\begin{matrix} T_{X \to X^{'}} (\cdot) & = {tr}_{W} [P_{W}^{m^{'}} Φ_{X W \to X^{'} W} ((\cdot) \otimes τ_{W}^{m})] . \end{matrix}

The question of implementing a process $E$ becomes the issue of finding a Gibbs-preserving map along with battery states such that the associated effective work process is close to $E$ . Specifically, if $‖ T_{X \to X^{'}} - E_{X \to X^{'}} ‖_{⋄} ⩽ ϵ$ , then we can assert that the failure probability in Point (c) above is bounded by $ϵ$ for all possible inputs on X; the operation therefore implements $E_{X \to X^{'}}$ accurately with high success probability.

A useful characterization of which processes can be implemented using an information battery is given by the following proposition.

Proposition 3.1

([16, Proposition I]). Let $Γ_{X}, Γ_{X^{'}} ⩾ 0$, $T_{X \to X^{'}}$ be a completely positive, trace non-increasing map, and $w \in R$. Then, the following are equivalent:

(i) We have $T_{X \to X^{'}} (Γ_{X}) ⩽ e^{w} Γ_{X^{'}}$;
(ii) For all $δ > 0$ there exists an information battery W and two battery states $τ_{W}, τ_{W}^{'}$ such that $w (τ_{W}) - w (τ_{W}^{'}) ⩽ w + δ$, and there exists a Gibbs-preserving map $Φ_{X W \to X^{'} W}$ with $T_{X \to X^{'}}$ the effective work process associated with $Φ_{X W \to X^{'} W}$ and $(τ_{W}, τ_{W}^{'})$.

Therefore, to show that one can implement $E_{X \to X^{'}}$ with Gibbs-preserving maps while expending work w, it suffices to exhibit a map $T_{X \to X^{'}}$ that is $ϵ$ -close to $E_{X \to X^{'}}$ in diamond distance and that satisfies $T_{X \to X^{'}} (Γ_{X}) ⩽ e^{w} Γ_{X^{'}}$ . From the proof in [16] we know in Point (ii) above that W, $τ_{W} \equiv τ_{W}^{m}$ and $τ_{W}^{'} \equiv τ_{W}^{m^{'}}$ can be chosen freely as long as $m^{'} - m = w (τ_{W}) - w (τ_{W}^{'}) ⩾ w$ and that the corresponding Gibbs-preserving map is given by

\begin{matrix} Φ_{X W \to X^{'} W} (\cdot) & = T_{X \to X^{'}} [{tr}_{W} (P_{W}^{m} (\cdot))] \otimes τ_{W}^{m^{'}} . \end{matrix}
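Condition (i) of Proposition 3.1 is easy to test numerically for a given map. As a sketch, assuming NumPy, a positive definite $Γ_{X^{'}}$ and a map specified by Kraus operators (the helper names are hypothetical), the smallest w satisfying $T (Γ_{X}) ⩽ e^{w} Γ_{X^{'}}$ is the logarithm of the largest eigenvalue of $Γ_{X^{'}}^{- 1 / 2} T (Γ_{X}) Γ_{X^{'}}^{- 1 / 2}$.

```python
import numpy as np

def apply_map(kraus, x):
    """Apply a completely positive map given by its Kraus operators."""
    return sum(k @ x @ k.conj().T for k in kraus)

def min_work_nats(kraus, gamma_in, gamma_out):
    """Smallest w with T(Gamma_X) <= e^w Gamma_X'; assumes gamma_out > 0."""
    vals, vecs = np.linalg.eigh(gamma_out)
    inv_sqrt = (vecs / np.sqrt(vals)) @ vecs.conj().T
    m = inv_sqrt @ apply_map(kraus, gamma_in) @ inv_sqrt
    return float(np.log(np.linalg.eigvalsh(m).max()))

# Hypothetical example: the qubit reset map sigma -> |0><0| tr[sigma].
reset_kraus = [np.array([[1.0, 0.0], [0.0, 0.0]]),   # |0><0|
               np.array([[0.0, 1.0], [0.0, 0.0]])]   # |0><1|

w_trivial = min_work_nats(reset_kraus, np.eye(2), np.eye(2))
gamma = np.diag([1.0, np.exp(-1.0)])                 # beta * (energy gap) = 1
w_thermal = min_work_nats(reset_kraus, gamma, gamma)
print(w_trivial, w_thermal)
```

For a trivial Hamiltonian the reset map costs $ln 2$ pure nats, and for the thermal example it costs $ln tr [Γ]$, since the reset target is the level with $Γ$-eigenvalue 1.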

In Ref. [16], the resource cost w of implementing a process $E_{X \to X^{'}}$ (any completely positive, trace-preserving map) up to an accuracy $ϵ ⩾ 0$ in terms of proximity of the process matrix given a fixed input state $σ_{X}$ , counted in pure nats, was shown to be given by the coherent relative entropy

\begin{matrix} w = - {\hat{D}}_{X \to X^{'}}^{ϵ} (E_{X \to X^{'}} (σ_{X R_{X}}) ‖ Γ_{X}, Γ_{X^{'}}) = ln min_{\begin{matrix} T (Γ_{X}) ⩽ α Γ_{X^{'}} \\ P (T (σ_{X R_{X}}), E (σ_{X R_{X}})) ⩽ ϵ \end{matrix}} α, \end{matrix}

where $σ_{X R_{X}}$ is the purification of $σ_{X}$ on a system $R_{X} ≃ X$ given by ${| σ ⟩}_{X R_{X}} = σ_{X}^{1 / 2} {| Φ ⟩}_{X : R_{X}}$, and where the optimization ranges over completely positive, trace non-increasing maps $T_{X \to X^{'}}$. The coherent relative entropy enjoys a collection of properties in relation to the conditional min- and max-entropy, and to the min- and max-relative entropy. It satisfies the following asymptotic equipartition property: For a completely positive, trace-preserving map $E_{X \to X^{'}}$ and quantum state $σ_{X}$ we have for $0 < ϵ < 1$ that

\begin{matrix} lim_{n \to \infty} \frac{1}{n} {\hat{D}}_{X^{n} \to X^{' n}}^{ϵ} (E_{X \to X^{'}}^{\otimes n} (σ_{XR}^{\otimes n}) ‖ Γ_{X}^{\otimes n}, Γ_{X^{'}}^{\otimes n}) = D (σ_{X} ‖ Γ_{X}) - D (E (σ_{X}) ‖ Γ_{X^{'}}) . \end{matrix}

Thermal operations

The framework of Gibbs-sub-preserving maps is technically convenient, but it is unclear whether any Gibbs-sub-preserving operation can be implemented at no work cost using other frameworks. This includes, for example, thermal operations, which might be considered more operational.

Here, we consider the alternative framework of thermal operations [2, 3, 8]. Each system S of interest has an associated Hamiltonian $H_{S}$ and is not interacting with the other systems. For a given fixed inverse temperature $β$ , we allow the following operations to be carried out for free:

(i) Apply any unitary operation that commutes with the total Hamiltonian;
(ii) Bring in any ancillary system in its Gibbs state at inverse temperature $β$; and
(iii) Discard any system.

The most general transformation a system S can undergo under this set of rules is a thermal operation. A thermal operation is any process that can be implemented using an additional system B with any Hamiltonian $H_{B}$ and with any unitary $U_{SB}$ satisfying $[U_{SB}, H_{S} + H_{B}] = 0$, resulting in the completely positive, trace-preserving map

\begin{matrix} Φ_{S} (\cdot) = {tr}_{B} [U_{SB} ((\cdot) \otimes γ_{B}) U_{SB}^{†}], \end{matrix}

where $γ_{B} = e^{- β H_{B}} / tr [e^{- β H_{B}}]$ is the Gibbs state of the bath system B. Observe that any concatenation of thermal operations is again a thermal operation.
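A minimal concrete instance of this definition is sketched below, assuming NumPy. It is an illustrative example rather than a construction from the paper: the bath B is a single qubit with the same Hamiltonian as the system qubit S, and the energy-conserving unitary is the swap. The numerical values of $β$ and the energy gap are hypothetical. The resulting thermal operation replaces any input state by the Gibbs state.

```python
import numpy as np

beta, gap = 1.0, 0.7                        # hypothetical parameters
h = np.diag([0.0, gap])                     # same Hamiltonian on S and on B
gibbs = np.diag(np.exp(-beta * np.diag(h)))
gibbs = gibbs / np.trace(gibbs)             # Gibbs state of the bath qubit

swap = np.array([[1, 0, 0, 0],
                 [0, 0, 1, 0],
                 [0, 1, 0, 0],
                 [0, 0, 0, 1]], dtype=float)
h_total = np.kron(h, np.eye(2)) + np.kron(np.eye(2), h)

def thermal_operation(rho):
    """tr_B[U (rho (x) gamma_B) U^dagger] with U the swap unitary."""
    joint = swap @ np.kron(rho, gibbs) @ swap.T
    # Partial trace over B: reshape to indices (s, b, s', b') and sum over b = b'.
    return np.einsum('ibjb->ij', joint.reshape(2, 2, 2, 2))

commutator = swap @ h_total - h_total @ swap
rho = np.array([[0.5, 0.5], [0.5, 0.5]])    # the pure state |+><+|
print(np.allclose(commutator, 0.0), thermal_operation(rho))
```

The output state is diagonal even though the input carries coherence between energy levels, in line with the time-covariance of thermal operations.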

Clearly, any thermal operation $Φ_{S}$ leaves the thermal state $γ_{S} = e^{- β H_{S}} / tr [e^{- β H_{S}}]$ on S invariant. Hence, any lower bound on the work cost of an implementation derived in the framework of Gibbs-preserving maps also applies to thermal operations. We use the same definitions of work and the effective work process for thermal operations as we defined for Gibbs-preserving maps earlier: an information battery is used to account for work, and the effective work process associated with a thermal operation $Φ_{X W \to X W}$ with respect to battery states $(τ_{W}^{m}, τ_{W}^{m^{'}})$ is also defined by (24).

When considering only states that commute with the Hamiltonian, a powerful tool to characterize possible state transformations is the notion of thermomajorization [8]. In the fully quantum regime, there is in contrast no known simple mathematical characterization of the work required to implement a quantum process with thermal operations. In fact, because thermal operations are time-covariant, it is impossible to implement processes that are not time-covariant, even if the latter might admit an implementation with a Gibbs-preserving map [6].

We will later use a primitive that transforms a thermal state into a pure energy eigenstate. The next statement follows directly from [8, Eq. (8) and Suppl. Note 4].

Proposition 3.2

Let $γ_{X} = e^{- β H_{X}} / tr [e^{- β H_{X}}]$ be the thermal state on a system X with Hamiltonian $H_{X}$ , and let ${| E ⟩}_{X}$ be a pure energy eigenstate of $H_{X}$ . There exists a thermal operation $Φ_{XW}$ on an information battery with battery states $(τ_{W}, τ_{W}^{'})$ such that $Φ_{XW} (γ_{X} \otimes τ_{W}) = {| E ⟩ ⟨ E |}_{X} \otimes τ_{W}^{'}$ and such that $w (τ_{W}) - w (τ_{W}^{'})$ can be chosen arbitrarily close to $β E + ln tr [e^{- β H_{X}}]$ .
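The work value in Proposition 3.2 can be evaluated directly. The following standard-library sketch uses a hypothetical function name and example spectrum. It computes $β E + ln tr [e^{- β H_{X}}]$ and checks that this equals $- ln p_{E}$, where $p_{E}$ is the Gibbs probability of the target level, an identity that follows from $p_{E} = e^{- β E} / tr [e^{- β H_{X}}]$.

```python
import math

def preparation_cost_nats(energies, target_index, beta):
    """beta * E + ln Z for preparing the energy eigenstate with the given index."""
    z = sum(math.exp(-beta * e) for e in energies)
    return beta * energies[target_index] + math.log(z)

energies = [0.0, 0.5, 2.0]   # hypothetical spectrum of H_X
beta = 1.3                   # hypothetical inverse temperature

for i, e in enumerate(energies):
    z = sum(math.exp(-beta * x) for x in energies)
    p = math.exp(-beta * e) / z
    print(i, preparation_cost_nats(energies, i, beta), -math.log(p))
```

For a trivial Hamiltonian on a d-dimensional system the cost reduces to $ln d$ for every target level.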

Thermodynamic Capacity

Definition

Let $X, X^{'}$ be quantum systems, $E_{X \to X^{'}}$ be a quantum process, and $ϵ > 0$. We seek a free thermodynamic operation (either a thermal operation or a Gibbs-preserving map) $Φ_{X^{n} W \to X^{' n} W}$ that acts on $X^{\otimes n}$ and a battery W, with output on $X^{' \otimes n}$ and W, as well as information battery states $τ_{W}^{(i)}$ and $τ_{W}^{(f)}$, such that:

(i) The effective work process $T_{X^{n} \to X^{' n}}$ of $Φ_{X^{n} W \to X^{' n} W}$ with respect to $(τ_{W}^{(i)}, τ_{W}^{(f)})$ is $ϵ$-close in diamond distance to $E_{X \to X^{'}}^{\otimes n}$;
(ii) The work consumption per copy w is as small as possible, where w is given by
$\begin{matrix} w = \frac{1}{n} [w (τ_{W}^{(i)}) - w (τ_{W}^{(f)})] . \end{matrix}$ (29)

Our main result is a collection of three independent constructions of such implementations in different regimes, using either Gibbs-preserving maps or thermal operations. In each case, the amount of work consumed per copy is given by a quantity which we call the thermodynamic capacity of the process, and which turns out to be the minimal work cost an implementation satisfying the above conditions can achieve. The thermodynamic capacity of a completely positive, trace-preserving map $E_{X \to X^{'}}$ relative to operators $Γ_{X}, Γ_{X^{'}} > 0$ is defined as

\begin{matrix} T (E) = sup_{σ_{X}} \{D (E_{X \to X^{'}} (σ_{X}) ‖ Γ_{X^{'}}) - D (σ_{X} ‖ Γ_{X})\} . \end{matrix} (30)

In a fully thermodynamic context where $Γ_{X} = e^{- β H_{X}}$ and $Γ_{X^{'}} = e^{- β H_{X^{'}}^{'}}$ , one can choose to express the thermodynamic capacity in units of energy rather than in nats, in which case a pre-factor $β^{- 1}$ may be included in the definition above such that the thermodynamic capacity is a difference of free energies

\begin{matrix} T (E) = sup_{σ} \{F_{H^{'}} (E (σ)) - F_{H} (σ)\} \quad \text{with} \quad F_{H} (ρ) = β^{- 1} D (ρ ‖ e^{- β H}) . \end{matrix}

Construction for trivial Hamiltonians First, in Sect. 5 we consider the special case where $Γ_{X} = 1_{X}$ and $Γ_{X^{'}} = 1_{X^{'}}$ corresponding to trivial Hamiltonians and show that simple considerations based on properties of known entropy measures guarantee the existence of a universal implementation of $E^{\otimes n}$ with either thermal operations or Gibbs-preserving maps.

Construction using Gibbs-preserving maps Second, in Sect. 6 we consider the case of general $Γ_{X}, Γ_{X^{'}}$ and we construct a universal implementation of $E_{X \to X^{'}}^{\otimes n}$ with Gibbs-preserving maps, based on new typicality considerations.

Construction using thermal operations Third, for arbitrary Hamiltonians we construct in Sect. 7 a universal implementation of $E_{X \to X^{'}}^{\otimes n}$ with thermal operations, assuming that $E$ is time-covariant, i.e., that it commutes with the time evolution operation.

Properties

Computing the thermodynamic capacity is a convex optimization problem. Namely, the objective function of the optimization in (30) can be written as

\begin{matrix} D (E_{X \to X^{'}} (σ_{X}) ‖ Γ_{X^{'}}) - D (σ_{X} ‖ Γ_{X}) \\ = - H (E_{X \to X^{'}} (σ_{X})) + H (σ_{X}) - tr [E_{X \to X^{'}} (σ_{X}) ln Γ_{X^{'}}] + tr [σ_{X} ln Γ_{X}] \\ = H {(E | X^{'})}_{ρ} - tr [E_{X \to X^{'}} (σ_{X}) ln Γ_{X^{'}}] + tr [σ_{X} ln Γ_{X}], \end{matrix}

where we defined the state $ρ_{E X^{'}} = V_{X \to X^{'} E} σ_{X} V^{†}$ using a Stinespring dilation $V_{X \to X^{'} E}$ of $E_{X \to X^{'}}$ into an environment system E, satisfying $E_{X \to X^{'}} (\cdot) = {tr}_{E} [V (\cdot) V^{†}]$. The conditional entropy is concave in the quantum state, because $H {(E | X^{'})}_{ρ} = - D (ρ_{E X^{'}} ‖ 1_{E} \otimes ρ_{X^{'}})$ and the quantum relative entropy is jointly convex. The other terms in (32) are linear. Hence, the objective function is concave in $σ_{X}$, and the optimization (30) is a convex optimization problem that can be solved efficiently for small system sizes [30]. Indeed, we have successfully computed the thermodynamic capacity of simple example quantum channels acting on a few qubits with Python code, using the QuTiP framework [31, 32] and the CVXOPT optimization software [33] (see also [34] for a direct algorithm).
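To make the objective function of (30) concrete, here is a minimal NumPy sketch. It is not the QuTiP/CVXOPT code mentioned above; it only evaluates the objective and samples random input states, which yields a lower bound on the capacity rather than a certified optimum. The helper names, the qubit parameters and the choice of a fully thermalizing channel $E (σ) = tr [σ] γ$ are assumptions made for illustration. For this channel the objective equals $- D (σ ‖ γ) ⩽ 0$, so the supremum is zero and is attained at the thermal state.

```python
import numpy as np

def matrix_log(positive_operator):
    """Logarithm of a positive-definite Hermitian matrix via its eigendecomposition."""
    eigenvalues, eigenvectors = np.linalg.eigh(positive_operator)
    return (eigenvectors * np.log(eigenvalues)) @ eigenvectors.conj().T

def relative_entropy(rho, Gamma):
    """D(rho || Gamma) = tr[rho ln rho] - tr[rho ln Gamma] in nats; Gamma > 0 may be unnormalized."""
    populations = np.linalg.eigvalsh(rho)
    populations = populations[populations > 1e-12]  # convention: 0 ln 0 = 0
    entropy_term = np.sum(populations * np.log(populations))
    return float(entropy_term - np.trace(rho @ matrix_log(Gamma)).real)

def capacity_objective(channel, sigma, Gamma_in, Gamma_out):
    """Objective of (30): D(E(sigma) || Gamma_out) - D(sigma || Gamma_in)."""
    return relative_entropy(channel(sigma), Gamma_out) - relative_entropy(sigma, Gamma_in)

def random_state(dimension, rng):
    """Random full-rank density matrix (illustrative sampling only)."""
    G = rng.normal(size=(dimension, dimension)) + 1j * rng.normal(size=(dimension, dimension))
    rho = G @ G.conj().T
    return rho / np.trace(rho).real

# Hypothetical qubit (assumption): energies 0 and 1, beta = 1.
beta, excited_energy = 1.0, 1.0
Gamma = np.diag([1.0, np.exp(-beta * excited_energy)])
gamma = Gamma / np.trace(Gamma)  # normalized thermal state

def thermalize(sigma):
    """Fully thermalizing channel E(sigma) = tr[sigma] * gamma."""
    return np.trace(sigma).real * gamma

rng = np.random.default_rng(0)
sampled_values = [capacity_objective(thermalize, random_state(2, rng), Gamma, Gamma)
                  for _ in range(2000)]
value_at_thermal_state = capacity_objective(thermalize, gamma, Gamma, Gamma)

# Sampling only gives a lower bound on the supremum in (30).
lower_bound_on_capacity = max(sampled_values + [value_at_thermal_state])
print(lower_bound_on_capacity)
```

Replacing the random sampling by a convex solver, as the text describes, would turn this lower bound into the actual optimum for general channels.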

The thermodynamic capacity is additive [21]. As a consequence of this property, it is not necessary to include a stabilization over a reference system in the definition of the thermodynamic capacity. That is, had we optimized over bipartite states $σ_{XR}$ with a reference system R for any $Γ_{R}$ , on which the process acts as the identity process, we would be effectively computing $T (E \otimes {id}_{R})$ . However, additivity implies that $T (E \otimes {id}_{R}) = T (E)$ .

Proposition 4.1

(Additivity of thermodynamic capacity [21]). For $Γ_{X}, Γ_{X^{'}}, Γ_{Z}, Γ_{Z^{'}} > 0$ and quantum channels $E_{X \to X^{'}}$ , $F_{Z \to Z^{'}}$ we have

\begin{matrix} T (E \otimes F) = T (E) + T (F) . \end{matrix}

For completeness we provide an independent proof of additivity, to ensure validity in the general setting of abstract $Γ$ operators.

Proof

Let $σ_{X}, τ_{Z}$ be states achieving the optimum in $T (E)$ and $T (F)$, respectively. Then, $σ_{X} \otimes τ_{Z}$ is a candidate for $T (E \otimes F)$, yielding

\begin{matrix} T (E \otimes F) & ⩾ D (E (σ) \otimes F (τ) ‖ Γ_{X^{'}} \otimes Γ_{Z^{'}}) - D (σ \otimes τ ‖ Γ_{X} \otimes Γ_{Z}) \\ = D (E (σ) ‖ Γ_{X^{'}}) - D (σ ‖ Γ_{X}) + D (F (τ) ‖ Γ_{Z^{'}}) - D (τ ‖ Γ_{Z}) \\ = T (E) + T (F) . \end{matrix}

Now, let $ζ_{XZ}$ achieve the optimum for $T (E \otimes F)$. Let $V_{X \to E_{1} X^{'}}$, $W_{Z \to E_{2} Z^{'}}$ be Stinespring isometries of $E$ and $F$ respectively, such that $E (\cdot) = {tr}_{E_{1}} [V (\cdot) V^{†}]$ and $F (\cdot) = {tr}_{E_{2}} [W (\cdot) W^{†}]$. Let $ρ_{E_{1} E_{2} X^{'} Z^{'}} = (V \otimes W) ζ {(V \otimes W)}^{†}$. Then, we have

\begin{matrix} T (E \otimes F) & = D ((E \otimes F) (ζ) ‖ Γ_{X^{'}} \otimes Γ_{Z^{'}}) - D (ζ_{XZ} ‖ Γ_{X} \otimes Γ_{Z}) \\ = H {(E_{1} E_{2} | X^{'} Z^{'})}_{ρ} - tr [ρ_{X^{'} Z^{'}} ln (Γ_{X^{'}} \otimes Γ_{Z^{'}})] + tr [ζ_{XZ} ln (Γ_{X} \otimes Γ_{Z})], \\ = H {(E_{1} E_{2} | X^{'} Z^{'})}_{ρ} - tr [ρ_{X^{'}} ln (Γ_{X^{'}})] - tr [ρ_{Z^{'}} ln (Γ_{Z^{'}})] \\ + tr [ζ_{X} ln (Γ_{X})] + tr [ζ_{Z} ln (Γ_{Z})] \end{matrix}

since $ln (A \otimes B) = ln (A) \otimes 1 + 1 \otimes ln (B)$ . Invoking the chain rule of the von Neumann entropy, and then strong sub-additivity of the entropy, we see that $H {(E_{1} E_{2} | X^{'} Z^{'})}_{ρ} = H {(E_{1} | X^{'} Z^{'})}_{ρ} + H {(E_{2} | E_{1} X^{'} Z^{'})}_{ρ} ⩽ H {(E_{1} | X^{'})}_{ρ} + H {(E_{2} | Z^{'})}_{ρ}$ . Hence, we have

\begin{matrix} (35) & ⩽ H {(E_{1} | X^{'})}_{ρ} - tr [ρ_{X^{'}} ln (Γ_{X^{'}})] + tr [ζ_{X} ln (Γ_{X})] \\ + H {(E_{2} | Z^{'})}_{ρ} - tr [ρ_{Z^{'}} ln (Γ_{Z^{'}})] + tr [ζ_{Z} ln (Γ_{Z})] \\ ⩽ T (E) + T (F), \end{matrix}

where the last inequality holds because the reduced states $ζ_{X}, ζ_{Z}$ are optimization candidates for $T (E)$ and $T (F)$ , respectively. $□$
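The first half of the proof relies on the relative entropy being additive on product states, which in turn follows from $ln (A \otimes B) = ln (A) \otimes 1 + 1 \otimes ln (B)$. The following minimal sketch checks both identities numerically. The dimensions, the diagonal $Γ$ operators and the helper names are hypothetical choices for this example.

```python
import numpy as np

def matrix_log(positive_operator):
    """Logarithm of a positive-definite Hermitian matrix via its eigendecomposition."""
    eigenvalues, eigenvectors = np.linalg.eigh(positive_operator)
    return (eigenvectors * np.log(eigenvalues)) @ eigenvectors.conj().T

def relative_entropy(rho, Gamma):
    """D(rho || Gamma) in nats for a state rho and a positive-definite Gamma."""
    populations = np.linalg.eigvalsh(rho)
    populations = populations[populations > 1e-12]  # convention: 0 ln 0 = 0
    entropy_term = np.sum(populations * np.log(populations))
    return float(entropy_term - np.trace(rho @ matrix_log(Gamma)).real)

def random_state(dimension, rng):
    """Random full-rank density matrix (illustrative sampling only)."""
    G = rng.normal(size=(dimension, dimension)) + 1j * rng.normal(size=(dimension, dimension))
    rho = G @ G.conj().T
    return rho / np.trace(rho).real

rng = np.random.default_rng(1)
# Hypothetical Gamma operators (assumption): a qubit and a qutrit, both diagonal.
Gamma_X = np.diag([1.0, 0.3])
Gamma_Z = np.diag([1.0, 0.5, 0.2])
sigma = random_state(2, rng)
tau = random_state(3, rng)

# Identity ln(A (x) B) = ln(A) (x) 1 + 1 (x) ln(B).
log_of_product = matrix_log(np.kron(Gamma_X, Gamma_Z))
sum_of_logs = np.kron(matrix_log(Gamma_X), np.eye(3)) + np.kron(np.eye(2), matrix_log(Gamma_Z))
log_identity_gap = np.max(np.abs(log_of_product - sum_of_logs))

# Additivity of the relative entropy on product states.
joint = relative_entropy(np.kron(sigma, tau), np.kron(Gamma_X, Gamma_Z))
separate = relative_entropy(sigma, Gamma_X) + relative_entropy(tau, Gamma_Z)
print(log_identity_gap, joint, separate)
```

This only illustrates the achievability direction $T (E \otimes F) ⩾ T (E) + T (F)$; the converse direction rests on strong sub-additivity and is not tested here.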

A special case worth mentioning is when $Γ_{X} = 1_{X}$, $Γ_{X^{'}} = 1_{X^{'}}$, which corresponds to the situation where the Hamiltonians of X and $X^{'}$ are trivial. For any quantum channel $E_{X \to X^{'}}$, let $V_{X \to X^{'} E}$ be a Stinespring dilation isometry with $E_{X \to X^{'}} (\cdot) = {tr}_{E} [V (\cdot) V^{†}]$. Then, we have

\begin{matrix} T (E) = sup_{σ} \{H (σ_{X}) - H (E (σ_{X}))\} = sup_{σ} H {(E | X^{'})}_{V σ V^{†}} . \end{matrix}

That is, the thermodynamic capacity characterizes by how much the channel is capable of reducing the entropy of its input, or equivalently, how much entropy the channel is capable of dumping into the environment when conditioned on the output. We note that the quantity $- T (E)$ has previously been studied in the information theory literature as the entropy gain of quantum channels [35–42]. Our work can be seen as giving a precise operational interpretation to this quantity.
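The two expressions in the trivial-Hamiltonian formula can be compared numerically. The sketch below builds a Stinespring isometry from Kraus operators and checks that $H (σ) - H (E (σ))$ equals $H (E | X^{'})$ evaluated on $V σ V^{†}$. The amplitude damping channel, the input states and all helper names are assumptions chosen for illustration. For full damping the output is always $| 0 ⟩$, so the entropy drop equals $H (σ)$ and the maximally mixed input attains the value $ln 2$.

```python
import numpy as np

def entropy(rho):
    """Von Neumann entropy in nats."""
    populations = np.linalg.eigvalsh(rho)
    populations = populations[populations > 1e-12]  # convention: 0 ln 0 = 0
    return float(-np.sum(populations * np.log(populations)))

def amplitude_damping_kraus(damping):
    """Kraus operators of a qubit amplitude damping channel (example channel, an assumption)."""
    K0 = np.array([[1.0, 0.0], [0.0, np.sqrt(1.0 - damping)]])
    K1 = np.array([[0.0, np.sqrt(damping)], [0.0, 0.0]])
    return [K0, K1]

def stinespring_isometry(kraus_operators):
    """V = sum_k K_k (x) |k>_E, with the output ordered as X' (x) E."""
    d_out, d_in = kraus_operators[0].shape
    d_env = len(kraus_operators)
    V = np.zeros((d_out * d_env, d_in), dtype=complex)
    for k, K in enumerate(kraus_operators):
        environment_ket = np.zeros((d_env, 1))
        environment_ket[k, 0] = 1.0
        V = V + np.kron(K, environment_ket)
    return V

def entropy_drop_and_conditional_entropy(kraus_operators, sigma):
    """Return (H(sigma) - H(E(sigma)), H(E|X') on V sigma V^dagger)."""
    d_out = kraus_operators[0].shape[0]
    d_env = len(kraus_operators)
    channel_output = sum(K @ sigma @ K.conj().T for K in kraus_operators)
    V = stinespring_isometry(kraus_operators)
    rho_XE = V @ sigma @ V.conj().T
    # Partial trace over the environment E.
    rho_X = np.einsum('iaja->ij', rho_XE.reshape(d_out, d_env, d_out, d_env))
    drop = entropy(sigma) - entropy(channel_output)
    conditional = entropy(rho_XE) - entropy(rho_X)
    return drop, conditional

maximally_mixed = np.eye(2) / 2.0
drop_full, conditional_full = entropy_drop_and_conditional_entropy(
    amplitude_damping_kraus(1.0), maximally_mixed)
drop_partial, conditional_partial = entropy_drop_and_conditional_entropy(
    amplitude_damping_kraus(0.3), np.diag([0.4, 0.6]))
print(drop_full, conditional_full)
print(drop_partial, conditional_partial)
```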

Optimality

Here, we show that any universal implementation that obeys our stated conditions in Sect. 4.1 must necessarily consume an amount of work that is lower bounded by the thermodynamic capacity. That is, any universal implementation that consumes an amount of work equal to the thermodynamic capacity is optimal. This lower bound is simple to prove, because a universal implementation of a process must necessarily be a good implementation for any individual i.i.d. input state, a situation where the optimal work cost is known [16]. Furthermore, any scheme that satisfies the requirements of Sect. 4 at work cost w per copy counted with standard battery states of Ref. [16], has an effective process $T_{X^{n} \to X^{' n}}$ on the systems that obeys $T (Γ_{X}^{\otimes n}) ⩽ e^{nw} Γ_{X^{'}}^{\otimes n}$ . This is because any thermal operation is in particular a Gibbs-preserving map, and the work cost is characterized by Proposition 3.1. The following shows that for any such implementation, the work consumed w per copy cannot be less than the thermodynamic capacity of the process.

Proposition 4.2

Let $ϵ > 0$ , $Γ_{X}, Γ_{X^{'}} > 0$ , $E_{X \to X^{'}}$ a completely positive, trace-preserving map, and $T_{X^{n} \to X^{' n}}$ a completely positive, trace non-increasing map such that we have $‖ T - E^{\otimes n} ‖_{⋄} / 2 ⩽ ϵ$ . For $w \in R$ such that $T_{X^{n} \to X^{' n}} (Γ_{X}^{\otimes n}) ⩽ e^{nw} Γ_{X^{'}}^{\otimes n}$ , we have in the limit $n \to \infty$ that $w ⩾ T (E)$ .

Proof

Let $T$ with $\frac{1}{2} {‖ E - T ‖}_{⋄} ⩽ ϵ$ , $σ_{X}$ be a quantum state, and ${| σ ⟩}_{X R_{X}} = σ_{X}^{1 / 2} {| Φ ⟩}_{X : R_{X}}$ . Then, by definition of the diamond norm it must hold that $D (E (σ_{X R_{X}}), T (σ_{X R_{X}})) ⩽ ϵ$ , which implies that $P (E (σ_{X R_{X}}), T (σ_{X R_{X}})) ⩽ \sqrt{2 ϵ}$ . We have that $T$ is a valid optimization candidate for the definition of the coherent relative entropy and thus

\begin{matrix} - {\hat{D}}_{X^{n} \to X^{' n}}^{\sqrt{2 ϵ}} (E_{X \to X^{'}}^{\otimes n} (σ_{X R_{X}}^{\otimes n}) ‖ Γ_{X}^{\otimes n}, Γ_{X^{'}}^{\otimes n}) ⩽ n w . \end{matrix}

For $n \to \infty$ , we can employ the asymptotic equipartition of the coherent relative entropy (27) to see that

\begin{matrix} D (E (σ_{X}) ‖ Γ_{X^{'}}) - D (σ_{X} ‖ Γ_{X}) ⩽ w . \end{matrix}

Since this inequality holds for all $σ_{X}$ , we deduce that $T (E) ⩽ w$ . $□$
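The proof converts a trace-distance bound $ϵ$ into a purified-distance bound $\sqrt{2 ϵ}$. The following minimal sketch tests this conversion on randomly sampled pairs of states. It assumes normalized states, for which the purified distance is $\sqrt{1 - F^{2}}$ with $F = ‖ \sqrt{ρ} \sqrt{σ} ‖_{1}$; the sub-normalized case relevant for trace non-increasing maps is not covered. All function names are hypothetical.

```python
import numpy as np

def psd_sqrt(operator):
    """Square root of a positive semidefinite Hermitian matrix."""
    eigenvalues, eigenvectors = np.linalg.eigh(operator)
    eigenvalues = np.clip(eigenvalues, 0.0, None)
    return (eigenvectors * np.sqrt(eigenvalues)) @ eigenvectors.conj().T

def trace_distance(rho, sigma):
    """Half the trace norm of rho - sigma."""
    return 0.5 * float(np.sum(np.abs(np.linalg.eigvalsh(rho - sigma))))

def fidelity(rho, sigma):
    """F = || sqrt(rho) sqrt(sigma) ||_1 for normalized states."""
    singular_values = np.linalg.svd(psd_sqrt(rho) @ psd_sqrt(sigma), compute_uv=False)
    return float(np.sum(singular_values))

def purified_distance(rho, sigma):
    """P = sqrt(1 - F^2) for normalized states (assumption of this sketch)."""
    return float(np.sqrt(max(0.0, 1.0 - fidelity(rho, sigma) ** 2)))

def random_state(dimension, rng):
    """Random full-rank density matrix (illustrative sampling only)."""
    G = rng.normal(size=(dimension, dimension)) + 1j * rng.normal(size=(dimension, dimension))
    rho = G @ G.conj().T
    return rho / np.trace(rho).real

rng = np.random.default_rng(2)
margins = []
for _ in range(500):
    rho, sigma = random_state(3, rng), random_state(3, rng)
    # Margin sqrt(2 D) - P should be non-negative.
    margins.append(np.sqrt(2.0 * trace_distance(rho, sigma)) - purified_distance(rho, sigma))
worst_margin = min(margins)
print(worst_margin)
```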

Construction #1: Trivial Hamiltonians

Statement and proof sketch

Instead of constructing explicitly an implementation that satisfies the requirements of Sect. 4, one might hope that the implementation could be given implicitly as the solution of a semi-definite program representing an entropy measure. This proof idea was indeed exploited in other contexts in Refs. [23, 43]. Here, we define the one-shot entropy-like quantity

\begin{matrix} W_{X \to X^{'}}^{ϵ} (E_{X \to X^{'}} ‖ Γ_{X}, Γ_{X^{'}}) = min_{\begin{matrix} T (Γ_{X}) ⩽ e^{w} Γ_{X^{'}} \\ \frac{1}{2} {∥T - E∥}_{⋄} ⩽ ϵ \end{matrix}} w, \end{matrix}

where $T_{X \to X^{'}}$ ranges over all trace non-increasing, completely positive maps. The proof strategy would then be to relate this entropy measure to the coherent relative entropy, and to exploit known properties of the latter in the i.i.d. regime to provide an upper bound to the expression

\begin{matrix} \frac{1}{n} W_{X^{n} \to X^{' n}}^{ϵ} (E_{X^{n} \to X^{' n}}^{\otimes n} ‖ Γ_{X}^{\otimes n}, Γ_{X^{'}}^{\otimes n}) . \end{matrix}

Should this upper bound behave like $T (E)$ to leading order, then the $T$ equal to the optimal solution to (40) defines an implementation in terms of Gibbs-preserving maps thanks to Proposition 3.1. It turns out that this proof strategy works well in the special case of trivial Hamiltonians, but fails in the general case.

The core technical statement that underlies our Construction #1 is summarized in the following theorem.

Theorem 5.1

Let $E_{X \to X^{'}}$ be a completely positive, trace-preserving map, and $ϵ > 0$ . Then we have

\begin{matrix} lim_{n \to \infty} \frac{1}{n} W_{X^{n} \to X^{' n}}^{ϵ} (E_{X^{n} \to X^{' n}}^{\otimes n} ‖ 1_{X^{n}}, 1_{X^{' n}}) = T (E), \end{matrix}

where $T (E) = {max}_{σ_{X}} \{H (σ_{X}) - H (E (σ_{X}))\}$ .

This implementation is constructed by taking the implicit optimal solution $T_{X^{n} \to X^{' n}}$ in the semi-definite program (40) for $\frac{1}{n} W_{X^{n} \to X^{' n}}^{ϵ} (E_{X \to X^{'}}^{\otimes n} ‖ 1_{X^{n}}, 1_{X^{' n}})$ , and using Proposition 3.1 to construct an associated Gibbs-preserving map acting on battery states via (25). In summary, for any $δ^{'} > 0$ , for n large enough and choosing any $m, m^{'}$ such that $m - m^{'} ⩽ n T (E) + δ^{'}$ , the full implementation map in terms of $T_{X^{n} \to X^{' n}}$ becomes

\begin{matrix} Φ_{X^{n} W \to X^{' n} W} (\cdot) & = T_{X^{n} \to X^{' n}} ({tr}_{W} [P_{W}^{m} (\cdot)]) \otimes τ_{W}^{m^{'}} . \end{matrix}

We emphasise that Theorem 5.1 exactly covers the entropy gain of quantum channels as studied in [35–42].

Proof

(Theorem 5.1) By using the post-selection technique (Theorem 2.1) and recalling that the fixed-input state case is given by the coherent relative entropy, we find

\begin{matrix} W_{X^{n} \to X^{' n}}^{ϵ} (E_{X \to X^{'}}^{\otimes n} ‖ 1_{X^{n}}, 1_{X^{' n}}) ⩽ - {\hat{D}}_{X^{n} \to X^{' n}}^{ϵ / poly (n)} (E_{X \to X^{'}}^{\otimes n} (ζ_{X^{n} R_{X}^{n}}) ‖ 1_{X^{n}}, 1_{X^{' n}}) . \end{matrix}

In the case of trivial Hamiltonians, the coherent relative entropy reduces to the smooth max-entropy (cf. [16, Props. 28 and 26] and also Ref. [44]). More precisely, we have

\begin{matrix} {\hat{D}}_{X \to X^{'}}^{ϵ} (ρ_{X^{'} R_{X}} ‖ 1_{X}, 1_{X^{'}}) ⩾ - H_{\max}^{c ϵ^{α}} {(E | X^{'})}_{ρ} + g (ϵ), \end{matrix}

where ${| ρ ⟩}_{X^{'} R_{X} E}$ is a pure state, where $c > 0$ , $0 < α < 1$ , $g (ϵ)$ are universal and do not depend on the state or the dimensions of the systems, and the smooth max-entropy is defined as

\begin{matrix} H_{\max}^{ϵ} {(E | X^{'})}_{ρ} & = min_{P (\hat{ρ}, ρ) ⩽ ϵ} H_{\max} {(E | X^{'})}_{\hat{ρ}} ; \\ H_{\max} {(E | X^{'})}_{\hat{ρ}} & = max_{0 \leq ω_{X^{'}} \leq 1} ln ‖ {\hat{ρ}}_{E X^{'}}^{1 / 2} ω_{X^{'}}^{1 / 2} ‖_{1}^{2} . \end{matrix}

Thus, we have

\begin{matrix} (44) ⩽ H_{\max}^{ϵ^{α} / poly (n)} {(E^{n} | X^{' n})}_{ρ} + g (ϵ), \end{matrix}

where $ρ_{X^{' n} E^{n}} = V_{X \to X^{'} E}^{\otimes n} ζ_{X^{n}} {(V^{†})}^{\otimes n} = \int d σ {(V σ V^{†})}^{\otimes n}$ and $V_{X \to X^{'} E}$ is a Stinespring dilation isometry of $E_{X \to X^{'}}$ as $E_{X \to X^{'}} (\cdot) = {tr}_{E} [V_{X \to X^{'} E} (\cdot) V^{†}]$. At this point we invoke two facts. First, note that the de Finetti state can be written as a mixture of only $poly (n)$ i.i.d. states, instead of a continuous average (Theorem 2.1): There exists a set $\{σ_{i}\}$ of at most $poly (n)$ states and a distribution $\{p_{i}\}$ such that $ζ_{X^{n}} = \sum_{i} p_{i} σ_{i}^{\otimes n}$. Second, we invoke the property that the conditional max-entropy is quasi-convex up to a penalty term, namely, that the conditional max-entropy of $\sum_{i} p_{i} ρ_{i}$ is less than or equal to the maximum over the set of max-entropies corresponding to each $ρ_{i}$, plus a term proportional to the logarithm of the number of terms in the sum [45, Lemma 11]. Hence, with $ρ_{i} = V σ_{i} V^{†}$, we get

\begin{matrix} (48) ⩽ max_{i} H_{\max}^{ϵ^{α} / poly (n)} {(E^{n} | X^{' n})}_{ρ_{i}^{\otimes n}} + ln (poly (n)) + g (ϵ) . \end{matrix}

Now, we are in business because the max-entropy is evaluated on an i.i.d. state, and we know that it asymptotically goes to the von Neumann entropy in this regime [46]. Also, ${lim}_{n \to \infty} (1 / n) \{ln (poly (n)) + g (ϵ)\} = 0$ and hence

\begin{matrix} lim_{n \to \infty} \frac{1}{n} W_{X^{n} \to X^{' n}}^{ϵ} (E_{X \to X^{'}}^{\otimes n} ‖ 1_{X^{n}}, 1_{X^{' n}}) & ⩽ max_{i} H {(E | X^{'})}_{ρ_{i}} \\ = max_{i} \{H (σ_{i}) - H (E (σ_{i}))\} \\ ⩽ max_{σ} \{H (σ) - H (E (σ))\} \\ = T (E) \end{matrix}

noting that $H (E | X^{'}) = H (E X^{'}) - H (X^{'}) = H (X) - H (X^{'})$ . $□$

Challenges for extension to non-trivial Hamiltonians

Naturally, one might ask whether it is possible to extend this proof to the case of non-trivial $Γ$ operators. Interestingly, this is not possible, at least not in a naive way. The problem is that we need a quasi-convexity property of the form

\begin{matrix} - {\hat{D}}_{X \to X^{'}}^{ϵ} (E_{X \to X^{'}} (σ_{X R_{X}}) ‖ Γ_{X}, Γ_{X^{'}}) \\ \overset{?}{⩽} max_{i} (- {\hat{D}}_{X \to X^{'}}^{ϵ} (E_{X \to X^{'}} (σ_{X R_{X}}^{i}) ‖ Γ_{X}, Γ_{X^{'}})) + (penalty), \end{matrix}

where $σ_{X} = \sum p_{i} σ_{X}^{i}$ and ${| σ ⟩}_{XR} = σ_{X}^{1 / 2} {| Φ ⟩}_{X : R_{X}}$ , $| σ^{i} ⟩_{XR} = {(σ_{X}^{i})}^{1 / 2} {| Φ ⟩}_{X : R_{X}}$ , and where the $(penalty)$ term scales in a favourable way in n, say of order $ln (poly (M))$ where M is the number of terms in the convex decomposition as for the max-entropy. In fact, Eq. (51) is false, as can be shown using an explicit counterexample on a two-level system which we present below. As this example is based on physical reasons, the coherent relative entropy is not even approximately quasi-convex. We note that a priori we cannot rule out a quasi-convexity property that might have a penalty term that depends on properties of the $Γ$ operators, yet such a term is likely to scale unfavourably with n.

Our example is as follows. Consider a two-level system with a Hamiltonian H with energy levels $| 0 ⟩, | 1 ⟩$ at corresponding energies $E_{0} = 0$ and $E_{1} > 0$. The corresponding $Γ$ operator is $Γ = g_{0} | 0 ⟩ ⟨ 0 | + g_{1} | 1 ⟩ ⟨ 1 |$ with $g_{0} = 1$, $g_{1} = e^{- β E_{1}}$. Consider the process that erases the input and creates the output state $| + ⟩$, where we define $| \pm ⟩ = [| 0 ⟩ \pm | 1 ⟩] / \sqrt{2}$. That is, we consider the process $E (\cdot) = tr [\cdot] | + ⟩ ⟨ + |$. Suppose the input state is maximally mixed, $σ = 1 / 2$, such that $ρ_{X^{'} R_{X}} = {| + ⟩ ⟨ + |}_{X^{'}} \otimes 1_{R_{X}} / 2$. If $E_{0} = 0$ and $E_{1} \to \infty$, then this process requires a lot of work; intuitively, with probability 1/2 we start in the ground state $| 0 ⟩$ and need to prepare the output state $| + ⟩$ which has high energy.

For $ϵ = 0$ , we can see this because the input state is full rank, hence $T = E$ ; then $E (Γ) = tr [Γ] | + ⟩ ⟨ + |$ and the smallest $α$ such that $E (Γ) ⩽ α Γ$ is given by

\begin{matrix} α / tr [Γ] = ‖ Γ^{- 1 / 2} | + ⟩ ⟨ + | Γ^{- 1 / 2} ‖_{\infty} = ⟨ + | Γ^{- 1} | + ⟩ = (g_{0}^{- 1} + g_{1}^{- 1}) / 2 \\ = (1 + e^{β E_{1}}) / 2 ⩾ e^{β E_{1}} / 2 . \end{matrix}

Noting that $tr [Γ] ⩾ 1$ , we have $α ⩾ e^{β E_{1}} / 2$ , and hence the energy cost of the transformation $1 / 2 \to | + ⟩$ is

\begin{matrix} energy cost = - β^{- 1} {\hat{D}}_{X \to X^{'}} (E_{X \to X^{'}} (σ_{X R_{X}}) ‖ Γ, Γ) = β^{- 1} ln α ⩾ E_{1} - β^{- 1} ln (2) . \end{matrix}

Clearly, this work cost can become arbitrarily large if $E_{1} \to \infty$ . On the other hand, we can perform the transformation $| + ⟩ \to | + ⟩$ obviously at no work cost; similarly, $| - ⟩ \to | + ⟩$ can be carried out by letting the system time-evolve under its own Hamiltonian for exactly the time interval required to pick up a relative phase $(- 1)$ between the $| 0 ⟩$ and $| 1 ⟩$ states. This also costs no work because it is a unitary operation that commutes with the Hamiltonian. We thus have our counter-example to the quasi-convexity of the coherent relative entropy. The transformation $1 / 2 \to | + ⟩$ is very hard, but the individual transformations $| \pm ⟩ \to | + ⟩$ are trivial, noting that $1 / 2 = (1 / 2) | + ⟩ ⟨ + | + (1 / 2) | - ⟩ ⟨ - |$ .

We show in “Appendix D” how to make the above claim robust against an accuracy tolerance $ϵ \geq 0$ .
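The $ϵ = 0$ counterexample above can be checked numerically. The following minimal sketch computes the smallest $α$ with $E (Γ) ⩽ α Γ$ for the erase-and-prepare-$| + ⟩$ process, compares the resulting cost with the bound $E_{1} - β^{- 1} ln (2)$, and verifies that free time evolution maps $| - ⟩$ to $| + ⟩$. The numerical values of $β$ and $E_{1}$, the convention $ħ = 1$ and the variable names are assumptions made for this example.

```python
import numpy as np

# Hypothetical parameters (assumption): beta = 1, E_0 = 0, E_1 = 5, hbar = 1.
beta, excited_energy = 1.0, 5.0
Gamma = np.diag([1.0, np.exp(-beta * excited_energy)])
plus = np.array([1.0, 1.0]) / np.sqrt(2.0)
minus = np.array([1.0, -1.0]) / np.sqrt(2.0)
plus_projector = np.outer(plus, plus)
minus_projector = np.outer(minus, minus)

# E(Gamma) = tr[Gamma] |+><+| and alpha = tr[Gamma] <+| Gamma^{-1} |+>.
image_of_Gamma = np.trace(Gamma) * plus_projector
alpha = np.trace(Gamma) * (plus @ np.linalg.inv(Gamma) @ plus)

def smallest_eigenvalue(operator):
    """Smallest eigenvalue of a Hermitian matrix (eigvalsh sorts in ascending order)."""
    return float(np.linalg.eigvalsh(operator)[0])

# alpha * Gamma - E(Gamma) is positive semidefinite, but a smaller alpha fails.
slack_at_alpha = smallest_eigenvalue(alpha * Gamma - image_of_Gamma)
slack_below_alpha = smallest_eigenvalue(0.99 * alpha * Gamma - image_of_Gamma)

energy_cost = np.log(alpha) / beta
lower_bound = excited_energy - np.log(2.0) / beta

# Free evolution for time pi / E_1 flips the relative phase, mapping |-> to |+>.
evolution_time = np.pi / excited_energy
free_unitary = np.diag(np.exp(-1j * np.array([0.0, excited_energy]) * evolution_time))
overlap = abs(np.vdot(plus, free_unitary @ minus))

# The hard input 1/2 is an equal mixture of the two easy inputs |+> and |->.
mixture = 0.5 * plus_projector + 0.5 * minus_projector
print(energy_cost, lower_bound, overlap)
```

Increasing `excited_energy` makes `energy_cost` grow linearly, while the two pure-state transformations remain free.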

Construction #2: Gibbs-Preserving Maps

Statement and proof sketch

Here, we present a general construction of a universal implementation of an i.i.d. process using Gibbs-preserving maps according to the requirements of Sect. 4.1. The idea is to explicitly construct an implementation using a novel notion of quantum typicality. We introduce notions of quantum typicality that apply to quantum processes and universally capture regions of the Hilbert space where the conditional entropy (respectively the relative entropy difference) has a given value. This generalizes existing notions of typical projectors to a quantum typical operator that applies to bipartite states, is relative to a $Γ$ operator, and universal.

The main result behind the construction in this section is the following theorem.

Theorem 6.1

Let $Γ_{X}, Γ_{X^{'}} > 0$ , $E_{X \to X^{'}}$ be a completely positive, trace-preserving map, and $ϵ > 0$ . Then, for $δ > 0$ and $n \in N$ large enough there exists a completely positive map $T_{X^{n} \to X^{' n}}$ such that:

(i)
$T_{X^{n} \to X^{' n}}$ is trace non-increasing;
(ii)
$‖ T_{X^{n} \to X^{' n}} - E_{X \to X^{'}}^{\otimes n} ‖_{⋄} ⩽ ϵ$ ;
(iii)
$T_{X^{n} \to X^{' n}} (Γ_{X}^{\otimes n}) ⩽ e^{n [T (E) + 4 δ + n^{- 1} ln (poly (n))]} Γ_{X^{'}}^{\otimes n}$ .

Note that we have $n^{- 1} ln (poly (n)) \to 0$ as $n \to \infty$ , and that we can take $δ \to 0$ after taking $n \to \infty$ . Thanks to Proposition 3.1, the mapping $T_{X^{n} \to X^{' n}}$ defines an implementation of the i.i.d. process $E_{X \to X^{'}}^{\otimes n}$ in terms of Gibbs-preserving maps and a battery, whose work cost rate is given to leading order by the thermodynamic capacity $T (E)$ after taking $δ \to 0$ .

As for Construction #1, the full Gibbs-preserving map implementing the required process is assembled in two steps, first constructing the map $T_{X^{n} \to X^{' n}}$ in Theorem 6.1 and then using Proposition 3.1 to obtain the full Gibbs-preserving map. Let $V_{X \to X^{'} E}$ be a Stinespring dilation isometry of $E_{X \to X^{'}}$ . For $δ > 0$ , we introduce a universal conditional and relative typical smoothing operator $M_{E^{n} X^{' n}}^{x, δ}$ (see later Definition 6.1 and Proposition 6.1) with $x = - n T (E)$ and relative to $Γ_{X^{'} E} \equiv V Γ_{X} V^{†}$ and $Γ_{X^{'}}$ . The map $T_{X^{n} \to X^{' n}}$ is then constructed as

\begin{matrix} T_{X^{n} \to X^{' n}} (\cdot) & = {tr}_{E^{n}} [M_{E^{n} X^{' n}}^{x, δ} V_{X \to X^{'} E}^{\otimes n} (\cdot) (V_{X \leftarrow X^{'} E}^{†})^{\otimes n} (M_{E^{n} X^{' n}}^{x, δ})^{†}] . \end{matrix}

Finally, we employ Proposition 3.1 to construct an associated Gibbs-preserving map acting on battery states via (25). For any $δ^{'} > 0$ , for n large enough and choosing any $m, m^{'}$ such that $m - m^{'} ⩽ n T (E) + 4 δ + n^{- 1} ln poly (n) + δ^{'}$ , the full implementation map in terms of $T_{X^{n} \to X^{' n}}$ becomes

\begin{matrix} Φ_{X^{n} W \to X^{' n} W} (\cdot) & = T_{X^{n} \to X^{' n}} ({tr}_{W} [P_{W}^{m} (\cdot)]) \otimes τ_{W}^{m^{'}} . \end{matrix}

Construction via universal conditional and relative typicality

The main ingredient of our proof is a notion of a universal conditional and relative typical smoothing operator that enables us to discard events that are very unlikely to appear in the process while accounting for how much they contribute to the overall work cost. This operator is inspired by similar constructions in Refs. [47, 48]. However, in addition to being “relative” as in [47], our smoothing operator is also simultaneously “conditional” and “universal”.

Definition 6.1

Let $Γ_{AB}, Γ_{B}^{'} ⩾ 0$ and $x \in R$ . A universal conditional and relative typical smoothing operator $M_{A^{n} B^{n}}^{x, δ}$ with parameter $δ > 0$ is an operator on $A^{n} B^{n}$ that satisfies the following conditions:

(i)
$(M_{A^{n} B^{n}}^{x, δ})^{†} M_{A^{n} B^{n}}^{x, δ} ⩽ 1$ ;
(ii)
There exists $ξ > 0$ independent of n with the following property: For any pure state ${| ρ ⟩}_{ABR}$ with $ρ_{AB}$ (respectively $ρ_{B}$ ) in the support of $Γ_{AB}$ (respectively $Γ_{B}^{'}$ ) and such that $D (ρ_{AB} ‖ Γ_{AB}) - D (ρ_{B} ‖ Γ_{B}^{'}) ⩾ x$ , it holds that
$\begin{matrix} Re \{{⟨ ρ |}_{ABR}^{\otimes n} M_{A^{n} B^{n}}^{x, δ} {| ρ ⟩}_{ABR}^{\otimes n}\} ⩾ 1 - poly (n) exp (- n ξ) ; \end{matrix}$ (55)
(iii)
${tr}_{A^{n}} [M_{A^{n} B^{n}}^{x, δ} Γ_{AB}^{\otimes n} (M_{A^{n} B^{n}}^{x, δ})^{†}] ⩽ poly (n) e^{- n (x - 4 δ)} Γ_{B}^{' \otimes n}$ .

Note that the smoothing operator is defined as a general operator of norm bounded by one, as opposed to the usual definition of typical subspaces or typical projectors. The main reason is that it is not known to us in general if such an object can be chosen to be a projector. By using the real part in Point (ii) above, we ensure that a process that applies the operator $M_{A^{n} B^{n}}^{x, δ}$ preserves coherences when it is applied to a superposition of several states $\{{| ρ ⟩}_{ABR}^{\otimes n}\}$. This property would not have been ensured if instead, we had merely asserted that $M_{A^{n} B^{n}}^{x, δ} {| ρ ⟩}_{ABR}^{\otimes n}$ and ${| ρ ⟩}_{ABR}^{\otimes n}$ have high absolute value overlap or are close in fidelity. If $M_{A^{n} B^{n}}^{x, δ}$ is a projector then the expression reduces to $tr (M_{A^{n} B^{n}}^{x, δ} ρ)$ as one usually considers for projectors on typical subspaces.

The core technical statement of Construction #2 is to show the existence of a universal conditional and relative smoothing operator, which is as follows.

Proposition 6.1

Let $Γ_{AB}, Γ_{B}^{'} ⩾ 0$ , $x \in R$ , as well as $n \in N$ and $δ > 0$ . There exists a universal conditional and relative typical smoothing operator $M_{A^{n} B^{n}}^{x, δ}$ that is furthermore permutation-invariant. Moreover, if $[Γ_{AB}, 1_{A} \otimes Γ_{B}^{'}] = 0$ , then $M_{A^{n} B^{n}}^{x, δ}$ can be chosen to be a projector satisfying $[M_{A^{n} B^{n}}^{x, δ}, Γ_{B}^{' \otimes n}] = 0$ and $[M_{A^{n} B^{n}}^{x, δ}, Γ_{AB}^{\otimes n}] = 0$ .

In the following, we present the proof of Theorem 6.1 based on the existence of such a smoothing operator, as guaranteed by Proposition 6.1. The more technical proof of Proposition 6.1 is then given in Sect. 6.3.

Proof

(Theorem 6.1). Let $V_{X \to X^{'} E}$ be a Stinespring dilation of $E_{X \to X^{'}}$ into an environment system $E ≃ X \otimes X^{'}$ . For $n \in N$ we need to find a suitable candidate implementation $T_{X^{n} \to X^{' n}}$ . Let

\begin{matrix} x = - max_{σ_{X}} {D (E (σ_{X}) ‖ Γ_{X^{'}}) - D (σ_{X} ‖ Γ_{X})} = - T (E) . \end{matrix}

For any $δ > 0$ let $M_{E^{n} X^{' n}}^{x, δ}$ be the operator constructed by Proposition 6.1, with the system E playing the role of the system A, with $V_{X \to X^{'} E} Γ_{X} V_{X \leftarrow X^{'} E}^{†}$ as $Γ_{AB}$ and with $Γ_{X^{'}}$ as $Γ_{B}^{'}$ . Now, define

\begin{matrix} T_{X^{n} \to X^{' n}} (\cdot) = {tr}_{E^{n}} [M_{E^{n} X^{' n}}^{x, δ} V_{X \to X^{'} E}^{\otimes n} (\cdot) (V_{X \leftarrow X^{'} E}^{†})^{\otimes n} (M_{E^{n} X^{' n}}^{x, δ})^{†}] \end{matrix}

noting that $T_{X^{n} \to X^{' n}}$ is trace non-increasing by construction thanks to Property (i) of Definition 6.1.

Let ${| σ ⟩}_{X R_{X}}$ be any pure state, and define ${| ρ ⟩}_{X^{'} E R_{X}} = V_{X \to X^{'} E} {| σ ⟩}_{X R_{X}}$ . By construction, $D (ρ_{E X^{'}} ‖ (V_{X \to X^{'} E} Γ_{X} V^{†})) - D (ρ_{X^{'}} ‖ Γ_{X^{'}}) = D (σ_{X} ‖ Γ_{X}) - D (E (σ_{X}) ‖ Γ_{X^{'}}) ⩾ x$ . Then Property (ii) of Proposition 6.1 tells us that there exists a $ξ > 0$ independent of both $ρ$ and n such that

\begin{matrix} Re \{{⟨ ρ |}_{X^{'} E R_{X}}^{\otimes n} M_{E^{n} X^{' n}}^{x, δ} {| ρ ⟩}_{X^{'} E R_{X}}^{\otimes n}\} ⩾ 1 - poly (n) exp (- n ξ) . \end{matrix}

The conditions of Proposition 2.3 are fulfilled, with $W_{X^{n} \to X^{' n} E^{n}} = M_{E^{n} X^{' n}}^{x, δ} V_{X \to X^{'} E}^{\otimes n}$, thanks furthermore to the fact that $M_{E^{n} X^{' n}}^{x, δ}$ is permutation-invariant as guaranteed by Proposition 6.1. Hence, we have

\begin{matrix} \frac{1}{2} ‖ T_{X^{n} \to X^{' n}} - E_{X \to X^{'}}^{\otimes n} ‖_{⋄} ⩽ poly (n) exp (- n ξ / 2) . \end{matrix}

For $n \in N$ large enough this becomes smaller than any fixed $ϵ > 0$ . Furthermore, by Property (iii) of Definition 6.1, we have that

\begin{matrix} T_{X^{n} \to X^{' n}} (Γ_{X}^{\otimes n}) & = {tr}_{E^{n}} [M_{E^{n} X^{' n}}^{x, δ} (V_{X \to X^{'} E} Γ_{X} V_{X \leftarrow X^{'} E}^{†})^{\otimes n} {(M_{E^{n} X^{' n}}^{x, δ})}^{†}] \\ ⩽ poly (n) e^{- n (x - 4 δ)} Γ_{X^{'}}^{\otimes n} \end{matrix}

as required. $□$

Universal conditional and relative typical smoothing operator

We now turn to the proof of Proposition 6.1, giving an explicit construction of a universal conditional and relative typical smoothing operator. As the proof of Proposition 6.1 is quite lengthy, it can be instructive to consider a simpler version of our typical smoothing operator which applies in the case where the Hamiltonians are trivial. We carry out this analysis in “Appendix E”.

Proof

(Proposition 6.1). First, we claim that we can assume $Γ_{AB} > 0$ and $Γ_{B}^{'} > 0$ without loss of generality. Indeed, if either operator is not positive definite, then we can first construct the operator ${\tilde{M}}_{A^{n} B^{n}}^{x, δ}$ associated with modified operators ${\tilde{Γ}}_{AB} > 0$ and ${\tilde{Γ}}_{B}^{'} > 0$ where all the zero eigenvalues of $Γ_{AB}$ and $Γ_{B}^{'}$ are replaced by some arbitrary fixed strictly positive constant (e.g., one); we can then set $M_{A^{n} B^{n}}^{x, δ} = P_{B^{n}}^{Γ^{'}} {\tilde{M}}_{A^{n} B^{n}}^{x, δ} P_{A^{n} B^{n}}^{Γ}$ , where $P_{A^{n} B^{n}}^{Γ}$ (respectively $P_{B^{n}}^{Γ^{'}}$ ) is the projector onto the support of $Γ_{AB}^{\otimes n}$ (respectively $Γ_{B}^{' \otimes n}$ ). The operator $M_{A^{n} B^{n}}^{x, δ}$ constructed in this way satisfies all of the required properties. For the remainder of this proof we thus assume that $Γ_{AB} > 0$ and $Γ_{B}^{'} > 0$ .

Let $\{R_{A^{n} B^{n}}^{k}\}$ be the POVM constructed by Proposition 2.2 for $H_{AB} = - ln (Γ_{AB})$ . Similarly, let $\{S_{B^{n}}^{ℓ}\}$ be the corresponding POVM constructed in Proposition 2.2 for $H_{B}^{'} = - ln (Γ_{B}^{'})$ . Also, as before, we denote by $Π_{A^{n} B^{n}}^{λ}$ and by $Π_{B^{n}}^{μ}$ the projectors on the Schur–Weyl blocks labelled by the Young diagrams $λ \in Young (d_{A} d_{B}, n)$ and $μ \in Young (d_{B}, n)$ . Let

\begin{matrix} M_{A^{n} B^{n}}^{x, δ} = \sum_{\begin{matrix} k, ℓ, λ, μ : \\ k - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ \end{matrix}} S_{B^{n}}^{ℓ} Π_{B^{n}}^{μ} Π_{A^{n} B^{n}}^{λ} R_{A^{n} B^{n}}^{k} . \end{matrix}

Note that $[S_{B^{n}}^{ℓ}, Π_{B^{n}}^{μ}] = 0$ because $S_{B^{n}}^{ℓ}$ is permutation-invariant, and $[1_{A^{n}} \otimes S_{B^{n}}^{ℓ}, Π_{A^{n} B^{n}}^{λ}] = 0$ because $1_{A^{n}} \otimes S_{B^{n}}^{ℓ}$ is permutation-invariant. Recall also that $[1_{A^{n}} \otimes Π_{B^{n}}^{μ}, Π_{A^{n} B^{n}}^{λ}] = 0$ for the same reason. The operator $M_{A^{n} B^{n}}^{x, δ}$ is permutation-invariant by construction. Then, we have

\begin{matrix} (M_{A^{n} B^{n}}^{x, δ})^{†} M_{A^{n} B^{n}}^{x, δ} & = \sum_{\begin{matrix} k, ℓ, λ, μ, \\ k^{'}, ℓ^{'}, λ^{'}, μ^{'} : \\ k - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ \\ k^{'} - \bar{H} (λ^{'}) - ℓ^{'} + \bar{H} (μ^{'}) ⩾ x - 4 δ \end{matrix}} R_{A^{n} B^{n}}^{k} Π_{A^{n} B^{n}}^{λ} Π_{B^{n}}^{μ} S_{B^{n}}^{ℓ} S_{B^{n}}^{ℓ^{'}} Π_{B^{n}}^{μ^{'}} Π_{A^{n} B^{n}}^{λ^{'}} R_{A^{n} B^{n}}^{k^{'}} \\ & = \sum_{\begin{matrix} k, k^{'}, ℓ, λ, μ : \\ k - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ \\ k^{'} - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ \end{matrix}} R_{A^{n} B^{n}}^{k} (Π_{A^{n} B^{n}}^{λ} Π_{B^{n}}^{μ} S_{B^{n}}^{ℓ}) R_{A^{n} B^{n}}^{k^{'}} \\ & = \sum_{k, k^{'}} R_{A^{n} B^{n}}^{k} (\sum_{\begin{matrix} ℓ, λ, μ : \\ k - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ \\ k^{'} - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ \end{matrix}} Π_{A^{n} B^{n}}^{λ} Π_{B^{n}}^{μ} S_{B^{n}}^{ℓ}) R_{A^{n} B^{n}}^{k^{'}} \\ & ⩽ \sum_{k, k^{'}} R_{A^{n} B^{n}}^{k} R_{A^{n} B^{n}}^{k^{'}} \\ & = \sum_{k} R_{A^{n} B^{n}}^{k} = 1_{A^{n} B^{n}} \end{matrix}

recalling that the operators $(Π_{A^{n} B^{n}}^{λ}, Π_{B^{n}}^{μ}, S_{B^{n}}^{ℓ})$ form a commuting set of projectors, and where in the third line the inner sum is taken to be the zero operator if no triplet $(ℓ, λ, μ)$ satisfies the given constraints. This shows Property (i).

Now, consider any state ${| ρ ⟩}_{ABR}$ , where R is any reference system, and assume that $D (ρ_{AB} ‖ Γ_{AB}) - D (ρ_{B} ‖ Γ_{B}^{'}) ⩾ x$ . Rewrite this condition as

\begin{matrix} x ⩽ - H (ρ_{AB}) - tr [ρ_{AB} ln Γ_{AB}] + H (ρ_{B}) + tr [ρ_{B} ln Γ_{B}^{'}] . \end{matrix}

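We write

\begin{matrix} {⟨ ρ |}_{ABR}^{\otimes n} M_{A^{n} B^{n}}^{x, δ} {| ρ ⟩}_{ABR}^{\otimes n} = \sum_{\begin{matrix} k, ℓ, λ, μ : \\ k - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ \end{matrix}} {⟨ ρ |}_{ABR}^{\otimes n} S_{B^{n}}^{ℓ} Π_{B^{n}}^{μ} Π_{A^{n} B^{n}}^{λ} R_{A^{n} B^{n}}^{k} {| ρ ⟩}_{ABR}^{\otimes n} = ▪_{1} + ▪_{2}, \end{matrix}

where we define

\begin{matrix} ▪_{1} & = \sum_{\begin{matrix} k, ℓ, λ, μ : \\ k ⩾ - tr [ρ_{AB} ln Γ_{AB}] - δ \\ \bar{H} (λ) ⩽ H (ρ_{AB}) + δ \\ \bar{H} (μ) ⩾ H (ρ_{B}) - δ \\ ℓ ⩽ - tr [ρ_{B} ln Γ_{B}^{'}] + δ \end{matrix}} {⟨ ρ |}_{ABR}^{\otimes n} S_{B^{n}}^{ℓ} Π_{B^{n}}^{μ} Π_{A^{n} B^{n}}^{λ} R_{A^{n} B^{n}}^{k} {| ρ ⟩}_{ABR}^{\otimes n} ; \end{matrix} (66a)

\begin{matrix} ▪_{2} & = \sum_{\begin{matrix} k, ℓ, λ, μ : \\ k - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ, \\ \text{and at least one condition in (66a) is violated} \end{matrix}} {⟨ ρ |}_{ABR}^{\otimes n} S_{B^{n}}^{ℓ} Π_{B^{n}}^{μ} Π_{A^{n} B^{n}}^{λ} R_{A^{n} B^{n}}^{k} {| ρ ⟩}_{ABR}^{\otimes n}, \end{matrix} (66b)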

further noting that the conditions in the sum defining $▪_{1}$ indeed imply that $k - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ - tr [ρ_{AB} ln Γ_{AB}] - H (ρ_{AB}) + tr [ρ_{B} ln Γ_{B}^{'}] + H (ρ_{B}) - 4 δ ⩾ x - 4 δ$ . We first consider $▪_{1}$ . Define the projectors

\begin{matrix} X_{1} & = \sum_{k ⩾ - tr [ρ_{AB} ln Γ_{AB}] - δ} R_{A^{n} B^{n}}^{k} ; & X_{1}^{⊥} & = 1 - X_{1} ; \end{matrix} (67a)

\begin{matrix} X_{2} & = \sum_{\bar{H} (λ) ⩽ H (ρ_{AB}) + δ} Π_{A^{n} B^{n}}^{λ} ; & X_{2}^{⊥} & = 1 - X_{2} ; \end{matrix} (67b)

\begin{matrix} X_{3} & = \sum_{\bar{H} (μ) ⩾ H (ρ_{B}) - δ} Π_{B^{n}}^{μ} ; & X_{3}^{⊥} & = 1 - X_{3} ; \end{matrix} (67c)

\begin{matrix} X_{4} & = \sum_{ℓ ⩽ - tr [ρ_{B} ln Γ_{B}^{'}] + δ} S_{B^{n}}^{ℓ} ; & X_{4}^{⊥} & = 1 - X_{4}, \end{matrix} (67d)

and observe that

\begin{matrix} Re \{▪_{1}\} = Re \{{⟨ ρ |}_{ABR}^{\otimes n} (X_{4} X_{3} X_{2} X_{1}) {| ρ ⟩}_{ABR}^{\otimes n}\} . \end{matrix}

Thanks to Proposition 2.2, we have $‖ X_{1}^{⊥} {| ρ ⟩}_{ABR}^{\otimes n} ‖ ⩽ 2 exp (- n η / 2)$ , recalling that $‖ P | ψ ⟩ ‖ = \sqrt{tr [P ψ]}$ , and hence

\begin{matrix} Re \{{⟨ ρ |}_{ABR}^{\otimes n} X_{4} X_{3} X_{2} X_{1} {| ρ ⟩}_{ABR}^{\otimes n}\} \\ = Re \{{⟨ ρ |}_{ABR}^{\otimes n} X_{4} X_{3} X_{2} {| ρ ⟩}_{ABR}^{\otimes n}\} - Re \{{⟨ ρ |}_{ABR}^{\otimes n} X_{4} X_{3} X_{2} X_{1}^{⊥} {| ρ ⟩}_{ABR}^{\otimes n}\} \\ ⩾ Re \{{⟨ ρ |}_{ABR}^{\otimes n} X_{4} X_{3} X_{2} {| ρ ⟩}_{ABR}^{\otimes n}\} - 2 exp (- n η / 2) \end{matrix}

using Cauchy–Schwarz to assert that $Re (⟨ χ | ψ ⟩) ⩽ | ⟨ χ | ψ ⟩ | ⩽ ‖ | χ ⟩ ‖ ‖ | ψ ⟩ ‖$ . Similarly, using Proposition 2.1, we have $‖ X_{2}^{⊥} {| ρ ⟩}_{ABR}^{\otimes n} ‖ ⩽ poly (n) exp (- n η / 2)$ . Also, we have $‖ X_{3}^{⊥} {| ρ ⟩}_{ABR}^{\otimes n} ‖ ⩽ poly (n) exp (- n η / 2)$ , and $‖ X_{4}^{⊥} {| ρ ⟩}_{ABR}^{\otimes n} ‖ ⩽ 2 exp (- n η / 2)$ , yielding

\begin{matrix} Re \{{⟨ ρ |}_{ABR}^{\otimes n} X_{4} X_{3} X_{2} {| ρ ⟩}_{ABR}^{\otimes n}\} & ⩾ Re \{{⟨ ρ |}_{ABR}^{\otimes n} X_{4} X_{3} {| ρ ⟩}_{ABR}^{\otimes n}\} - poly (n) exp (- n η / 2) ; \end{matrix}

\begin{matrix} Re \{{⟨ ρ |}_{ABR}^{\otimes n} X_{4} X_{3} {| ρ ⟩}_{ABR}^{\otimes n}\} & ⩾ Re \{{⟨ ρ |}_{ABR}^{\otimes n} X_{4} {| ρ ⟩}_{ABR}^{\otimes n}\} - poly (n) exp (- n η / 2) ; \end{matrix}

\begin{matrix} Re \{{⟨ ρ |}_{ABR}^{\otimes n} X_{4} {| ρ ⟩}_{ABR}^{\otimes n}\} & ⩾ 1 - 2 exp (- n η / 2) . \end{matrix}

We take all these $η$ ’s to be the same, by choosing if necessary the minimum of the four possibly different $η$ s. Hence, we have

\begin{matrix} Re \{▪_{1}\} ⩾ 1 - poly (n) exp (- n η / 2) . \end{matrix}
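Bounds of the form $poly (n) exp (- n η / 2)$ recur throughout this proof, and it may help to see numerically why the polynomial prefactor is harmless. The following Python sketch assumes, purely for illustration, that the polynomial is the number of types of length-n strings over a d-letter alphabet, which is at most $(n + 1)^{d}$; the actual polynomial in the proof is not specified here, and the function names and parameter values are hypothetical.

```python
import math

def num_types(n: int, d: int) -> int:
    """Number of types (empirical distributions) of length-n strings over d letters."""
    return math.comb(n + d - 1, d - 1)

def error_bound(n: int, d: int, eta: float) -> float:
    """Illustrative bound poly(n) * exp(-n*eta/2), with poly(n) = num_types(n, d)."""
    return num_types(n, d) * math.exp(-n * eta / 2)

if __name__ == "__main__":
    # The bound is vacuous (larger than 1) for small n and decays once n >> 1/eta.
    for n in (10, 100, 1000, 2000):
        print(n, num_types(n, 4), error_bound(n, 4, eta=0.05))
```

For small n the bound exceeds 1 and carries no information; the exponential factor only takes over for n well above $1 / η$, which is why the statements hold for n large enough.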

Now we consider the term $▪_{2}$ . We know that

\begin{matrix} ∥R_{A^{n} B^{n}}^{k} {| ρ ⟩}_{ABR}^{\otimes n}∥ & ⩽ exp (- n η / 2) & if k < - tr [ρ_{AB} ln Γ_{AB}] - δ ; \end{matrix}

74a

\begin{matrix} ∥Π_{A^{n} B^{n}}^{λ} {| ρ ⟩}_{ABR}^{\otimes n}∥ & ⩽ poly (n) exp (- n η / 2) & if \bar{H} (λ) > H (ρ_{AB}) + δ ; \end{matrix}

74b

\begin{matrix} ∥S_{B^{n}}^{ℓ} {| ρ ⟩}_{ABR}^{\otimes n}∥ & ⩽ exp (- n η / 2) & if ℓ > - tr [ρ_{B} ln Γ_{B}^{'}] + δ ; \end{matrix}

74c

\begin{matrix} ∥Π_{B^{n}}^{μ} {| ρ ⟩}_{ABR}^{\otimes n}∥ & ⩽ poly (n) exp (- n η / 2) & if \bar{H} (μ) < H (ρ_{B}) - δ, \end{matrix}

74d

recalling that $‖ P | ψ ⟩ ‖ = \sqrt{tr [P ψ]}$ . So, for each term in the sum (66b), we have

\begin{matrix} |{⟨ ρ |}_{ABR}^{\otimes n} (S_{B^{n}}^{ℓ} Π_{B^{n}}^{μ} Π_{A^{n} B^{n}}^{λ} R_{A^{n} B^{n}}^{k}) {| ρ ⟩}_{ABR}^{\otimes n}| & = |({⟨ ρ |}_{ABR}^{\otimes n} S_{B^{n}}^{ℓ} Π_{B^{n}}^{μ} Π_{A^{n} B^{n}}^{λ}) (R_{A^{n} B^{n}}^{k} {| ρ ⟩}_{ABR}^{\otimes n})| \\ & ⩽ ∥R_{A^{n} B^{n}}^{k} {| ρ ⟩}_{ABR}^{\otimes n}∥ \cdot ∥(S_{B^{n}}^{ℓ} Π_{B^{n}}^{μ} Π_{A^{n} B^{n}}^{λ}) {| ρ ⟩}_{ABR}^{\otimes n}∥ \\ & ⩽ poly (n) exp (- n η / 2) \end{matrix}

75

using the Cauchy–Schwarz inequality and because at least one of the four conditions is violated, causing at least one of the two norms to decay exponentially (noting also that $S_{B^{n}}^{ℓ}, Π_{B^{n}}^{μ}, Π_{A^{n} B^{n}}^{λ}$ all commute). Because there are at most $poly (n)$ terms in the sum, we have

\begin{matrix} |▪_{2}| ⩽ poly (n) exp (- n η / 2) . \end{matrix}

Hence, we have

\begin{matrix} Re \{{⟨ ρ |}_{ABR}^{\otimes n}, M_{A^{n} B^{n}}^{x, δ}, {| ρ ⟩}_{ABR}^{\otimes n}\} & = Re \{▪_{1}\} + Re \{▪_{2}\} \\ ⩾ Re \{▪_{1}\} - |▪_{2}| \\ ⩾ 1 - poly (n) exp (- n η / 2) \end{matrix}

proving Property (ii) for $ξ = η / 2$ . Note that $ξ$ does not depend on the state ${| ρ ⟩}_{ABR}$ . Now, we prove Property (iii). Using Lemma B.1 and dropping some subsystem indices for readability, we have

\begin{matrix} {tr}_{A^{n}} [M_{A^{n} B^{n}}^{x, δ} Γ_{AB}^{\otimes n} (M_{A^{n} B^{n}}^{x, δ})^{†}] \\ ⩽ poly (n) \sum_{\begin{matrix} k, ℓ, λ, μ : \\ k - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ \end{matrix}} {tr}_{A^{n}} [S^{ℓ} Π^{μ} Π^{λ} R^{k} Γ^{\otimes n} R^{k} Π^{λ} Π^{μ} S^{ℓ}] . \end{matrix}

Recall that, using Proposition 2.2 and Lemma 2.2,

\begin{matrix} R_{A^{n} B^{n}}^{k} Γ_{AB}^{\otimes n} & ⩽ e^{- n k} R_{A^{n} B^{n}}^{k} ⩽ e^{- n k} 1_{A^{n} B^{n}} ; \end{matrix}

\begin{matrix} Π_{B^{n}}^{μ} {tr}_{A^{n}} [Π_{A^{n} B^{n}}^{λ}] Π_{B^{n}}^{μ} & ⩽ poly (n) exp (n (\bar{H} (λ) - \bar{H} (μ))) 1_{B^{n}} ; \end{matrix}

\begin{matrix} S_{B^{n}}^{ℓ} & ⩽ e^{n ℓ} S_{B^{n}}^{ℓ} Γ_{B}^{' \otimes n} ⩽ e^{n ℓ} Γ_{B}^{' \otimes n} \end{matrix}

further recalling that $[R_{A^{n} B^{n}}^{k}, Γ_{AB}^{\otimes n}] = 0$ and $[S_{B^{n}}^{ℓ}, Γ_{B}^{' \otimes n}] = 0$ . Combining these together yields

\begin{matrix} (78) & ⩽ poly (n) \sum_{\begin{matrix} k, ℓ, λ, μ : \\ k - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ \end{matrix}} e^{- n k} S^{ℓ} Π^{μ} {tr}_{A^{n}} [Π_{A^{n} B^{n}}^{λ}] Π^{μ} S^{ℓ} \\ ⩽ \sum_{\begin{matrix} k, ℓ, λ, μ : \\ k - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ \end{matrix}} poly (n) e^{- n k + n (\bar{H} (λ) - \bar{H} (μ))} S_{B^{n}}^{ℓ} \\ ⩽ \sum_{\begin{matrix} k, ℓ, λ, μ : \\ k - \bar{H} (λ) - ℓ + \bar{H} (μ) ⩾ x - 4 δ \end{matrix}} poly (n) e^{- n (k - \bar{H} (λ) + \bar{H} (μ) - ℓ)} Γ_{B}^{' \otimes n} \\ ⩽ poly (n) e^{- n (x - 4 δ)} Γ_{B}^{' \otimes n} . \end{matrix}

Finally, suppose that $[Γ_{AB}, Γ_{B}^{'}] = 0$ , meaning that we can choose a simultaneous eigenbasis for $Γ_{AB}$ and $Γ_{B}^{'}$ . Then the operator $M_{A^{n} B^{n}}^{x, δ}$ is a projector, as can be seen in (62), since in that case $\{S_{B^{n}}^{ℓ}\}, \{Π_{B^{n}}^{μ}\}, \{Π_{A^{n} B^{n}}^{λ}\}, \{R_{A^{n} B^{n}}^{k}\}$ are all complete sets of projectors, all elements of which commute pairwise between different sets. Furthermore, $Γ_{B}^{' \otimes n}$ and $Γ_{AB}^{\otimes n}$ both commute with all of these projectors and therefore also with $M_{A^{n} B^{n}}^{x, δ}$ . $□$

Construction #3: Thermal Operations

Statement and proof sketch

We now present a construction of a universal thermodynamic implementation of a time-covariant i.i.d. process, using the framework of thermal operations instead of Gibbs-preserving maps.

Theorem 7.1

Let X be a quantum system, $H_{X}$ a Hermitian operator, $β ⩾ 0$ , $E_{X \to X}$ a completely positive, trace-preserving map satisfying

\begin{matrix} E_{X \to X} (e^{- i H_{X} t} (\cdot) e^{i H_{X} t}) = e^{- i H_{X} t} E_{X \to X} (\cdot) e^{i H_{X} t} for all t \in R . \end{matrix}

Let $ϵ > 0$ . Let $δ > 0$ be small enough and $n \in N$ be large enough. Then, there exists an information battery W, a thermal operation $Φ_{X^{n} W}$ , and battery states $τ_{W}^{(i)}$ and $τ_{W}^{(f)}$ such that:

(i)
The effective work process $T_{X^{n} \to X^{n}}$ associated with $Φ_{X^{n} W}$ and $(τ_{W}^{(i)}, τ_{W}^{(f)})$ satisfies
$\begin{matrix} \frac{1}{2} {‖ T_{X^{n} \to X^{n}} - E_{X \to X}^{\otimes n} ‖}_{⋄} ⩽ ϵ ; \end{matrix}$ 84
(ii)
The work cost per copy satisfies
$\begin{matrix} lim_{δ \to 0} lim_{n \to \infty} \frac{1}{n} [w (τ_{W}^{(i)}) - w (τ_{W}^{(f)})] = T (E) . \end{matrix}$ 85
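As a sanity check on the quantity $T (E)$ appearing in (ii), recall that the thermodynamic capacity is the maximal difference in relative entropy to the thermal state between the output and the input of the channel, $T (E) = max_{σ} \{D (E (σ) ‖ e^{- β H}) - D (σ ‖ e^{- β H})\}$. The Python sketch below evaluates this expression for a classical two-level channel by a grid search over diagonal input states. Restricting to a two-level system and to diagonal inputs is an assumption made for illustration (for a general quantum channel it would only give a lower bound), and all function names are hypothetical.

```python
import math

def rel_entropy_to_gibbs(p, energies, beta):
    """D(p || Gamma) in nats for diagonal p and unnormalized Gamma_i = exp(-beta*E_i)."""
    return sum(pi * (math.log(pi) + beta * e)
               for pi, e in zip(p, energies) if pi > 0.0)

def apply_channel(channel, p):
    """channel[i][j] is the probability of output level i given input level j."""
    return [sum(row[j] * p[j] for j in range(len(p))) for row in channel]

def capacity_diagonal(channel, energies, beta, grid=1000):
    """Grid search of D(E(p)||Gamma) - D(p||Gamma) over two-level diagonal inputs."""
    best = -math.inf
    for i in range(grid + 1):
        p = [i / grid, 1.0 - i / grid]
        gain = (rel_entropy_to_gibbs(apply_channel(channel, p), energies, beta)
                - rel_entropy_to_gibbs(p, energies, beta))
        best = max(best, gain)
    return best

RESET = [[1.0, 1.0], [0.0, 0.0]]     # every input is mapped to level 0
IDENTITY = [[1.0, 0.0], [0.0, 1.0]]  # nothing happens

if __name__ == "__main__":
    print(capacity_diagonal(RESET, [0.0, 0.0], beta=1.0))     # close to ln 2
    print(capacity_diagonal(IDENTITY, [0.0, 1.0], beta=1.0))  # 0.0
```

For the reset channel with a trivial Hamiltonian the search returns ln 2 nats per copy, attained at the maximally mixed input, which is Landauer's cost of erasing one bit; the identity channel costs nothing.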

The main idea in the present construction is to first carry out a Stinespring dilation unitary explicitly using suitable ancillas as the environment system, and then to apply a conditional erasure process that resets the ancillas to a standard state while using the output of the process as side information. The idea of implementing a process in this fashion was also employed in Ref. [13].

Our core technical contribution for Construction #3 is to show how to build a thermodynamic protocol for universal conditional erasure, using the idea of position-based decoding [19, 49–55]. The assembly of the full thermal operation is slightly more involved than in Constructions #1 and #2, because we cannot use Proposition 3.1. The construction is illustrated in Fig. 2, using a conditional erasure primitive whose construction is illustrated in Fig. 1.

Fig. 2 — The conditional erasure procedure in Figure 1 can be used to construct an i.i.d. implementation of a given time-covariant process (Theorem 7.1). First we apply an energy-conserving Stinespring dilation of the process on all input copies, using a zero-initialized ancilla as environment system E for each copy. We then invoke the conditional erasure procedure $R_{E^{n} X^{n} J}$ to reset $E^{n}$ to the thermal state $γ_{E}^{\otimes n}$ using $X^{' n}$ as a memory, while extracting work using an information battery J. Here, the projector that can distinguish $ρ_{E X^{'}}^{\otimes n}$ from $1_{E^{n}} \otimes ρ_{X^{' n}}$ is the universal conditional typical projector given by Proposition E.2. The fact that $R_{E^{n} X^{n} J}$ preserves the correlations ${[E (σ_{XR})]}^{\otimes n}$ between the memory (output systems $X^{' n}$ ) and the reference $R^{n}$ ensures that the process is implemented accurately. The amount of work extracted by $R_{E^{n} X^{n} J}$ is $m \sim n [β F_{E} + T (E)]$ but $\sim n β F_{E}$ work has to be paid to prepare the initially pure $E^{n}$ ancillas, where $β F_{E} = - ln tr (e^{- β H_{E}})$ . The overall work extracted is $\sim T (E)$ per copy

Fig. 1 — Construction of the thermal operation for universal conditional erasure using position-based decoding [19], illustrating the construction in the proof of Proposition 7.1 and Lemma 7.1. We define a map $R_{SMJ}$ that acts on a system S to reset, a quantum memory M and a register J, which is promised to be initialized in the uniformly mixed state $e^{- m} 1_{e^{m}}$ of rank $e^{m}$ for a fixed and known value of m. A state $ρ_{SM}$ of the system and the memory is purified by a reference system R (not pictured). The map $R_{SMJ}$ outputs the system S in a state close to the thermal state $γ_{S}$ and the register J in a state close to the pure state ${| 0 ⟩}_{J}$ , all while ensuring that $ρ_{MR}$ remains unchanged (up to small errors), for all states $ρ_{SM}$ in a given class of states $S_{SM}$ . The routine is provided a POVM effect $P_{SM}$ whose task is to distinguish $ρ_{SM}$ from $γ_{S} \otimes ρ_{M}$ in a hypothesis test for all $ρ_{SM} \in S_{SM}$ . As long as m is not too large (as determined by how well $P_{SM}$ can perform this distinguishing), the procedure completes successfully. To implement $R_{SMJ}$ (shaded region) we involve $e^{m}$ ancillas $A = A_{1} \dots A_{e^{m}}$ with $A_{j} ≃ S$ , each initialized in the thermal state $γ_{A_{j}} = γ_{S}$ . Then S and $A_{j}$ are coherently swapped ( $F_{S A_{j}}$ ) conditioned on the value stored in J. If m is not too large, a POVM ${Ω_{MA}^{j}}$ can infer the value j stored in J, up to a small error; the POVM is constructed from $P_{SM}$ . We then coherently reset the J register to zero by conditioning on this outcome (up to a small error). The full procedure is a thermal operation where the ancillas are the heat bath and J is an information battery such that m work has been extracted in units of pure nats (see main text)

Universal conditional erasure

Conditional erasure is a task that is of independent interest because it generalizes Landauer’s erasure principle to situations where a quantum memory is available. A protocol for thermodynamic conditional erasure of a system using a memory as quantum side information was given in ref. [56] for trivial Hamiltonians. Here, we study the problem of finding a universal protocol for conditional erasure, whose accuracy is guaranteed for any input state on n copies of a system, and where the system and memory Hamiltonians can be arbitrary.

Definition 7.1

(Universal conditional erasure). Consider two systems S, M. Let $σ_{S}$ be a fixed state, let $S_{SM} = {ρ_{SM}}$ be an arbitrary set of states on $S \otimes M$ , and let $δ^{'} ⩾ 0$ . A universal conditional $δ^{'}$ -erasure process of S using M as side information is a completely positive, trace non-increasing map $T_{S M \to S M}$ such that for all $ρ_{SM} \in S_{SM}$ , and writing ${| ρ ⟩}_{SMR}$ a purification of $ρ_{SM}$ , we have

\begin{matrix} F (T_{S M \to S M} (ρ_{SMR}), σ_{S} \otimes ρ_{MR}) ⩾ 1 - δ^{'} . \end{matrix}

We provide a thermodynamic protocol for universal conditional erasure.

Proposition 7.1

Let S, M be systems with Hamiltonians $H_{S}, H_{M}$ and let $γ_{S}$ refer to the thermal state on S. Let $S_{SM}$ be an arbitrary set of states on $S \otimes M$ . Let $m ⩾ 0$ such that $e^{m}$ is integer. Let $P_{SM}$ be a Hermitian operator satisfying $0 ⩽ P_{SM} ⩽ 1$ and $[P_{SM}, H_{S} + H_{M}] = 0$ , and assume that there exists $κ, κ^{'} ⩾ 0$ such that for all $ρ_{SM} \in S_{SM}$ we have

\begin{matrix} tr [P_{SM} ρ_{SM}] & ⩾ 1 - κ ; \end{matrix}

87a

\begin{matrix} tr [P_{SM} (γ_{S} \otimes ρ_{M})] & ⩽ \frac{κ^{'}}{e^{m}} . \end{matrix}

87b

Then, there exists a thermal operation $R_{S M J \to S M J}$ acting on the systems SM and an information battery J, such that the effective work process $T_{S M \to S M}$ of $R_{S M J \to S M J}$ with respect to the battery states $(τ_{J}^{m}, {| 0 ⟩}_{J})$ is a universal conditional $(2 κ + 4 κ^{'})$ -erasure process with $σ_{S} = γ_{S}$ for the set of states $S_{SM}^{'}$ , where $S_{SM}^{'}$ is the convex hull of $S_{SM}$ .

The proof of Proposition 7.1 is developed in the rest of this section. We start by reformulating the ideas of the convex-split lemma, position-based decoding, and catalytic decoupling schemes [19, 49–55] to form a protocol for universal conditional erasure. The underlying ideas of the following lemma are the same as, e.g., in Ref. [19]. Yet our technical statement differs in some aspects, which is why we provide a proof for completeness. The setting is depicted in Fig. 1.

Lemma 7.1

(Conditional erasure unitary using position-based decoding). Consider two systems S, M and fix $m ⩾ 0$ such that $e^{m}$ is integer. Let J be a large register of dimension at least $2 e^{m}$ , and choose a fixed basis ${{| j ⟩}_{J}}$ . Now, let $γ_{S}$ be any state, $S_{SM}$ an arbitrary set of quantum states on $S \otimes M$ , $P_{SM}$ a Hermitian operator satisfying $0 ⩽ P_{SM} ⩽ 1$ , and assume that there exists $κ, κ^{'} ⩾ 0$ such that for all $ρ_{SM} \in S_{SM}$ the conditions (87) hold. Furthermore, let $A = A_{1} \otimes \dots \otimes A_{e^{m}}$ be a collection of ancilla systems with each $A_{j} ≃ S$ , and let $A^{'} = A_{1}^{'} \otimes \dots \otimes A_{e^{m}}^{'}$ be a copy of the full collection of ancilla systems. We write a purification of $γ_{A_{j}}$ on $A_{j}^{'}$ as ${| γ ⟩}_{A_{j} A_{j}^{'}} = γ_{A_{j}}^{1 / 2} {| Φ ⟩}_{A_{j} : A_{j}^{'}}$ . Let $S_{SM}^{'}$ be the convex hull of $S_{SM}$ . Then, there exists a unitary operator $W_{S M A J \to S M A J}^{(m)}$ satisfying the following property: For any reference system R, for any pure tripartite state ${| ρ ⟩}_{SMR}$ with $ρ_{SM} \in S_{SM}^{'}$ , and for any ${| j ⟩}_{J}$ with $1 ⩽ j ⩽ e^{m}$ , we have

\begin{matrix} Re \{({⟨ \hat{τ^{j}} (ρ_{SMR}) |}_{R M S A A^{'}} \otimes {⟨ 0 |}_{J}) W_{SMAJ}^{(m)} ({| ρ ⟩}_{RMS} \otimes {| γ ⟩}_{A_{\cdot} A_{\cdot}^{'}}^{\otimes e^{m}} \otimes {| j ⟩}_{J})\} ⩾ 1 - (2 κ + 4 κ^{'}), \end{matrix}

88

where we have defined

\begin{matrix} {| \hat{τ^{j}} (ρ_{SMR}) ⟩}_{R M S A A^{'}} = {| ρ ⟩}_{A_{j} M R} \otimes {| γ ⟩}_{S A_{j}^{'}} \otimes {[{| γ ⟩}^{\otimes (e^{m} - 1)}]}_{A A^{'} \setminus A_{j} A_{j}^{'}} \end{matrix}

and by the notation $A A^{'} \setminus A_{j} A_{j}^{'}$ we refer to all $A A^{'}$ systems except $A_{j} A_{j}^{'}$ . Moreover, for any observables $H_{S}$ , $H_{M}$ such that $[P_{SM}, H_{S} + H_{M}] = 0$ , the unitary $W_{SMAJ}^{(m)}$ may be chosen such that $[H_{S} + H_{M} + \sum_{j} H_{A_{j}}, W_{SMAJ}^{(m)}] = 0$ , where $H_{A_{j}} = H_{S}$ .

Intuitively, we absorb the initial randomness present in the register J, e.g., given to us by the environment in a mixed state, and return it in a pure state; J can therefore be identified as an information battery. Similarly, A can be identified as a heat bath.

Proof

First observe that we can assume $S_{SM}$ to be a convex set, because any convex combination of states in $S_{SM}$ also satisfies the conditions (87). For the rest of the proof we assume without loss of generality that $S_{SM} = S_{SM}^{'}$ .

The operator W is defined in two steps. The first operation simply consists of conditionally swapping S with $A_{j}$ , depending on the value stored in J. Then, we infer from MA which j we swapped S with, in order to coherently reset the register J back to the zero state (approximately). We define the first unitary operation as $W^{(1)}$ , acting on systems SAJ, as

\begin{matrix} W_{SAJ}^{(1)} = \sum_{j} F_{S A_{j}} \otimes {| j ⟩ ⟨ j |}_{J}, \end{matrix}

where $F_{S A_{j}}$ denotes the swap operator between the two designated systems. Observe that $W^{(1)}$ maps $ρ$ onto $\hat{τ^{j}}$ according to

\begin{matrix} W_{SAJ}^{(1)} ({| ρ ⟩}_{RMS} \otimes {| γ ⟩}_{A_{\cdot} A_{\cdot}^{'}}^{\otimes e^{m}} \otimes {| j ⟩}_{J}) \\ = {| ρ ⟩}_{R M A_{j}} \otimes {| γ ⟩}_{S A_{j}^{'}} \otimes {[{| γ ⟩}^{\otimes (e^{m} - 1)}]}_{A A^{'} \setminus A_{j} A_{j}^{'}} \otimes {| j ⟩}_{J} \\ = {| \hat{τ^{j}} ⟩}_{S R M A A^{'}} \otimes {| j ⟩}_{J} . \end{matrix}

The second step is more tricky. We need to infer from the systems MA alone which j was stored in J. Fortunately the answer is provided in the form of position-based decoding [19], using a pretty good measurement. Define

\begin{matrix} Λ_{MA}^{j} = P_{M A_{j}} \otimes 1_{A \setminus A_{j}} \end{matrix}

such that $\{Λ_{MA}^{j}\}$ is a set of positive operators. We can form a POVM $\{Ω_{MA}^{j}\}_{j} \cup \{Ω_{MA}^{⊥}\}$ by normalizing the $Λ^{j}$ ’s as follows:

\begin{matrix} Ω_{MA}^{j} & = Λ_{MA}^{- 1 / 2} Λ_{MA}^{j} Λ_{MA}^{- 1 / 2} ; & Λ_{MA} & = \sum_{j} Λ_{MA}^{j} ; & Ω_{MA}^{⊥} & = 1 - \sum_{j} Ω_{MA}^{j} . \end{matrix}

We would now like to lower bound $tr [Ω_{MA}^{j} {\hat{τ^{j}}}_{MA}]$ . Following the proof of [19, Theorem 2], we first invoke the Hayashi–Nagaoka inequality [57], which states that for any operators $0 ⩽ A ⩽ 1$ , $B ⩾ 0$ , we have

\begin{matrix} 1 - {(A + B)}^{- 1 / 2} A {(A + B)}^{- 1 / 2} ⩽ 2 (1 - A) + 4 B . \end{matrix}

Applying this inequality with $A = Λ_{MA}^{j}$ and $B = \sum_{j^{'} \neq j} Λ_{MA}^{j^{'}}$ we obtain

\begin{matrix} tr [(1 - Ω_{MA}^{j}) {\hat{τ^{j}}}_{MA}] & ⩽ 2 tr [(1 - Λ_{MA}^{j}) {\hat{τ^{j}}}_{MA}] + 4 \sum_{j^{'} \neq j} tr [Λ_{MA}^{j^{'}} {\hat{τ^{j}}}_{MA}] \\ & ⩽ 2 tr [(1 - P_{SM}) ρ_{SM}] + 4 e^{m} tr [P_{SM} (γ_{S} \otimes ρ_{M})] \\ & ⩽ 2 κ + 4 κ^{'} . \end{matrix}
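The Hayashi–Nagaoka inequality used above can be checked by hand in the special case where A and B commute: both operators are then diagonal in a common basis, and the operator inequality reduces to the scalar inequality $1 - a / (a + b) ⩽ 2 (1 - a) + 4 b$ for each pair of eigenvalues $0 ⩽ a ⩽ 1$ , $b ⩾ 0$ . The Python sketch below scans a grid of such pairs. It covers only this commuting case, which is an assumption made for illustration and not the general setting of the proof; the function names, the grid, and the convention that the left-hand side equals 1 when $a + b = 0$ (inverse taken on the support) are hypothetical choices.

```python
def hn_gap(a: float, b: float) -> float:
    """Right-hand side minus left-hand side of the scalar Hayashi-Nagaoka inequality."""
    # With a + b = 0 the (pseudo-)inverse vanishes, so the left-hand side is 1.
    lhs = 1.0 - a / (a + b) if a + b > 0.0 else 1.0
    rhs = 2.0 * (1.0 - a) + 4.0 * b
    return rhs - lhs

def min_gap(steps: int = 200, b_max: float = 5.0) -> float:
    """Smallest gap over a grid with a in [0, 1] and b in [0, b_max]."""
    return min(hn_gap(i / steps, b_max * j / steps)
               for i in range(steps + 1)
               for j in range(steps + 1))

if __name__ == "__main__":
    # A non-negative minimum means the inequality holds on the whole grid.
    print(min_gap())
```

The gap closes only at $a = 1$ , $b = 0$ , that is, when the test accepts the correct position with certainty and never fires on any wrong position.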

Now, let ${SHIFT}_{J} (x) = \sum_{j} {| j + x ⟩ ⟨ j |}_{J}$ denote the SHIFT operation on the J register, modulo $e^{m}$ ; note that $({SHIFT}_{J} (x))^{†} = {SHIFT}_{J} (- x)$ . We define

\begin{matrix} W_{MAJ}^{(2)} & = (\sum_{j} Ω_{MA}^{j} \otimes {SHIFT}_{J} (- j)) ; & W_{SMAJ}^{'} & = W_{MAJ}^{(2)} W_{SAJ}^{(1)} \end{matrix}

and we see that $W^{' †} W^{'} ⩽ 1$ thanks to Proposition B.3. Then, we have

\begin{matrix} W_{SMAJ}^{'} ({| ρ ⟩}_{RMS} \otimes {| γ ⟩}_{A_{\cdot} A_{\cdot}^{'}}^{\otimes e^{m}} \otimes {| j ⟩}_{J}) \\ = (\sum_{j^{'}} Ω_{MA}^{j^{'}} \otimes {SHIFT}_{J} (- j^{'})) ({| \hat{τ^{j}} ⟩}_{S R M A A^{'}} \otimes {| j ⟩}_{J}) \\ = \sum_{j^{'}} (Ω_{MA}^{j^{'}} {| \hat{τ^{j}} ⟩}_{R M S A A^{'}}) \otimes {| j - j^{'} ⟩}_{J} . \end{matrix}
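The two steps defined above (conditional swap, then decode and shift) can be caricatured classically. In the Python sketch below, the contents of S and of the ancillas are plain labels, the decoder is an idealized error-free test that simply looks for the label, and positions are numbered from 0 rather than from 1. All of these are simplifying assumptions for illustration: the sketch ignores coherence, the reference system, and the decoding error $2 κ + 4 κ^{'}$ , and every function name is hypothetical.

```python
def conditional_swap(system, ancillas, j):
    """Step 1 (classical analogue of W^(1)): swap S with ancilla A_j."""
    ancillas = list(ancillas)
    system, ancillas[j] = ancillas[j], system
    return system, ancillas

def decode_position(ancillas, marker):
    """Step 2a: guess which ancilla now holds the marked content (idealized test)."""
    return ancillas.index(marker)

def shift(register, x, dim):
    """Classical analogue of SHIFT_J(x): j -> (j + x) mod dim."""
    return (register + x) % dim

def erase(system, j, dim, thermal="gamma"):
    """Swap S into position j, decode the position, and shift the register by minus the guess."""
    ancillas = [thermal] * dim
    system, ancillas = conditional_swap(system, ancillas, j)
    guess = decode_position(ancillas, marker="rho")
    return system, shift(j, -guess, dim), ancillas

if __name__ == "__main__":
    for j in range(4):
        print(j, erase("rho", j, dim=4))
```

Whatever value j the register started with, S ends up holding a thermal label and the register ends at 0: a correct guess $j^{'} = j$ lands on $| j - j^{'} ⟩ = | 0 ⟩$ , which is the branch that the overlap with ${⟨ 0 |}_{J}$ picks out.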

Thanks to Proposition C.1, the operator $W_{SMAJ}^{'}$ can be completed to a full unitary $W_{SMAJ}$ by using an extra qubit in the J register, and such that ${⟨ 0 |}_{J} W_{SMAJ} {| j ⟩}_{J} = {⟨ 0 |}_{J} W_{SMAJ}^{'} {| j ⟩}_{J}$ for all $j = 1, \dots, e^{m}$ (with the convention that ${| j ⟩}_{J}$ for $j ⩽ e^{m}$ forces the extra qubit to be in the zero state). So, recalling (95),

\begin{matrix} ({⟨ \hat{τ^{j}} |}_{R M S A A^{'}} \otimes {⟨ 0 |}_{J}) W_{SMAJ} ({| ρ ⟩}_{RMS} \otimes {| γ ⟩}_{A_{\cdot} A_{\cdot}^{'}}^{\otimes e^{m}} \otimes {| j ⟩}_{J}) \\ = ({⟨ \hat{τ^{j}} |}_{R M S A A^{'}} \otimes {⟨ 0 |}_{J}) W_{SMAJ}^{'} ({| ρ ⟩}_{RMS} \otimes {| γ ⟩}_{A_{\cdot} A_{\cdot}^{'}}^{\otimes e^{m}} \otimes {| j ⟩}_{J}) \\ = {⟨ \hat{τ^{j}} |} Ω_{MA}^{j} {| \hat{τ^{j}} ⟩}_{R M S A A^{'}} \\ ⩾ 1 - (2 κ + 4 κ^{'}) . \end{matrix}

To prove the last part of the claim, let $H_{S}, H_{M}$ be observables such that $[P_{SM}, H_{S} + H_{M}] = 0$ and $[H_{S}, γ_{S}] = 0$ . Let $H_{A_{j}} = H_{S}$ and we write $H_{A} = \sum_{j} H_{A_{j}}$ . For all j, we have

\begin{matrix} [H_{S} + H_{M} + H_{A}, Λ_{MA}^{j}] = [H_{S} + \sum_{j^{'} \neq j} H_{A_{j^{'}}}, Λ_{MA}^{j}] + [H_{M} + H_{A_{j}}, P_{M A_{j}}] = 0 . \end{matrix}

This implies that $[H_{S} + H_{M} + H_{A}, Λ_{MA}] = 0$ , and in turn $[H_{S} + H_{M} + H_{A}, Λ_{MA}^{- 1 / 2}] = 0$ , and thus also $[H_{S} + H_{M} + H_{A}, Ω^{j}] = 0$ . Hence, we have

\begin{matrix} [H_{S} + H_{M} + H_{A}, W_{MAJ}^{(2)}] = 0 . \end{matrix}

Clearly, $[H_{S} + H_{M} + H_{A}, W_{SAJ}^{(1)}] = 0$ , and hence $[H_{S} + H_{M} + H_{A}, W_{SMAJ}^{'}] = 0$ . Using Proposition C.2 instead of Proposition C.1, we may further enforce $[H_{S} + H_{M} + H_{A}, W_{SMAJ}] = 0$ , as required. $□$

We now give the proof of Proposition 7.1.

Proof

(Proposition 7.1). Let $W_{SMAJ}^{(m)}$ be the energy-conserving unitary as in Lemma 7.1 and define the thermal operation

\begin{matrix} R_{SMJ} (\cdot) = {tr}_{A} [W_{SMAJ}^{(m)} ((\cdot) \otimes γ_{A}) W_{SMAJ}^{(m) †}] . \end{matrix}

101

Identifying J as an information battery, the associated effective work process of $R_{SMJ}$ with respect to $(τ_{J}^{m}, {| 0 ⟩}_{J})$ is

\begin{matrix} T_{S M \to S M} (\cdot) & = {tr}_{A} [{⟨ 0 |}_{J} W_{SMAJ}^{(m)} ((\cdot) \otimes γ_{A} \otimes τ_{J}^{m}) W_{SMAJ}^{(m) †} {| 0 ⟩}_{J}] . \end{matrix}

102

Let $ρ_{SM} \in S_{SM}^{'}$ and let ${| ρ ⟩}_{SMR}$ be a purification of $ρ_{SM}$ . We have that the state vector

\begin{matrix} e^{- m / 2} \sum_{j = 1}^{e^{m}} {⟨ 0 |}_{J} W_{SMAJ}^{(m)} ({| ρ ⟩}_{SMR} \otimes {| γ ⟩}_{A A^{'}}^{\otimes e^{m}} \otimes {| j ⟩}_{J}) \otimes {| j ⟩}_{R_{J}} \end{matrix}

103

is a purification of $T_{S M \to S M} (ρ_{SMR})$ , where $R_{J}$ is an additional register. Similarly, the state vector

\begin{matrix} e^{- m / 2} \sum_{j = 1}^{e^{m}} | \hat{τ^{j}} (ρ_{SMR}) ⟩_{R M S A A^{'}} \otimes {| j ⟩}_{R_{J}} \end{matrix}

104

is a purification of $γ_{S} \otimes ρ_{MR}$ . Then, with Uhlmann’s theorem we find

\begin{matrix} F (T_{S M \to S M} (ρ_{SMR}), γ_{S} \otimes ρ_{MR}) \\ ⩾ e^{- m} \sum_{j = 1}^{e^{m}} Re \{({⟨ \hat{τ^{j}} (ρ_{SMR}) |}_{R M S A A^{'}} \otimes {⟨ 0 |}_{J}) W_{SMAJ}^{(m)} ({| ρ ⟩}_{RMS} \otimes {| γ ⟩}_{A_{\cdot} A_{\cdot}^{'}}^{\otimes e^{m}} \otimes {| j ⟩}_{J})\} \\ ⩾ 1 - (2 κ + 4 κ^{'}), \end{matrix}

105

making use of (88). $□$

Construction via universal conditional erasure

This section is devoted to the proof of Theorem 7.1. The strategy is to exploit the fact that time-covariant processes admit a Stinespring dilation with an energy-conserving unitary using an environment system with a separate Hamiltonian. This property enables us to map the problem of implementing such a process directly to a conditional erasure problem with a system and memory that are non-interacting.

The following lemma formalizes the property of time-covariant processes we make use of. Various proofs of this lemma can be found in [58, 59, Appendix B] and [60, Theorem 25].

Lemma 7.2

(Stinespring dilation of covariant processes [58–60]). Let X be a quantum system with Hamiltonian $H_{X}$ , and $E_{X \to X}$ be a completely positive, trace-preserving map that is covariant with respect to time evolution. That is, for all t we have

\begin{matrix} E_{X \to X} (e^{- i H_{X} t} (\cdot) e^{i H_{X} t}) = e^{- i H_{X} t} E_{X \to X} (\cdot) e^{i H_{X} t} . \end{matrix}

106

Then, there exists a system E with Hamiltonian $H_{E}$ including an eigenstate ${| 0 ⟩}_{E}$ of zero energy, as well as a unitary $V_{E X \to E X}$ such that

\begin{matrix} E_{X \to X} (\cdot) = {tr}_{E} [V ({| 0 ⟩ ⟨ 0 |}_{E} \otimes (\cdot)) V^{†}] \end{matrix}

107

as well as $V (H_{X} + H_{E}) V^{†} = H_{X} + H_{E}$ .

We provide an additional proof in “Appendix A”. The main idea behind the construction in the following proof of Theorem 7.1 is depicted in Fig. 2.

Proof

(Theorem 7.1) Thanks to Lemma 7.2, there exists an environment system E with Hamiltonian $H_{E}$ , as well as an energy-conserving unitary $V_{XE}$ and a state ${| 0 ⟩}_{E}$ of zero energy such that (107) holds. Let $F_{E} = - β^{- 1} ln (Z_{E})$ with $Z_{E} = tr [e^{- β H_{E}}]$ . We define

\begin{matrix} x = min_{σ} \{D (σ ‖ e^{- β H_{X}}) - D (E (σ) ‖ e^{- β H_{X}})\} = - T (E) . \end{matrix}

108

Writing $ρ_{XE} = V_{XE} ({| 0 ⟩ ⟨ 0 |}_{E} \otimes σ_{X}) V_{XE}^{†}$ , we have that $x = {min}_{σ_{X}} \{- H (σ_{X}) + β tr [σ_{X} H_{X}] + H (ρ_{X}) - β tr [ρ_{X} H_{X}]\}$ . Since $tr [σ_{X} H_{X}] = tr [({| 0 ⟩ ⟨ 0 |}_{E} \otimes σ_{X}) (H_{X} + H_{E})] = tr [ρ_{XE} (H_{X} + H_{E})]$ , where the second equality holds because $V_{XE}$ conserves energy, and since $H (σ_{X}) = H (ρ_{XE})$ because the ancilla is pure and $V_{XE}$ is unitary, we see that

\begin{matrix} x = min_{σ_{X}} \{- H (ρ_{XE}) + H (ρ_{X}) + β tr [ρ_{E} H_{E}]\} . \end{matrix}

109

Observe that for any such $ρ_{XE}$ , we have

\begin{matrix} - H {(E | X)}_{ρ} + β tr [ρ_{E} H_{E}] & ⩾ - H {(E)}_{ρ} + β tr [ρ_{E} H_{E}] + ln (Z_{E}) - ln (Z_{E}) \\ & = D (ρ_{E} ‖ γ_{E}) + β F_{E} ⩾ β F_{E} \end{matrix}

110

using the sub-additivity of the von Neumann entropy, $H {(E | X)}_{ρ} ⩽ H {(E)}_{ρ}$ , and the fact that the relative entropy between normalized states is non-negative. Hence, we have $x ⩾ β F_{E}$ .

Let

\begin{matrix} S_{E^{n} X^{n}} = \{ρ_{EX}^{\otimes n} : ρ_{EX} = V_{XE} ({| 0 ⟩ ⟨ 0 |}_{E} \otimes σ_{X}) V_{XE}^{†} for some σ_{X}\}, \end{matrix}

111

noting that for all $ρ_{EX}^{\otimes n} \in S_{E^{n} X^{n}}$ , we have $D (ρ_{EX} ‖ e^{- β (H_{X} + H_{E})}) - D (ρ_{X} ‖ e^{- β H_{X}}) = D (σ ‖ e^{- β H_{X}}) - D (E (σ) ‖ e^{- β H_{X}}) ⩾ x$ . Let $P_{E^{n} X^{n}}^{x, δ}$ be the universal typical and relative conditional operator furnished by Proposition 6.1, where $Γ_{X} = e^{- β H_{X}}$ and $Γ_{XE} = e^{- β (H_{X} + H_{E})} = Γ_{X} \otimes Γ_{E}$ with $Γ_{E} = e^{- β H_{E}}$ . Since $Γ_{XE}$ commutes with $1_{E} \otimes Γ_{X}$ , Proposition 6.1 guarantees that $P_{E^{n} X^{n}}^{x, δ}$ is a projector which furthermore commutes with $Γ_{XE}^{\otimes n}$ and $Γ_{X}^{\otimes n}$ . We proceed to show that $P_{E^{n} X^{n}}^{x, δ}$ can perform a hypothesis test between $ρ_{EX}^{\otimes n}$ and $γ_{E}^{\otimes n} \otimes ρ_{X}^{\otimes n}$ . Recalling Definition 6.1 we have

\begin{matrix} tr [P_{E^{n} X^{n}}^{x, δ} ρ_{EX}^{\otimes n}] & ⩾ 1 - κ, \end{matrix}

112

with $κ = poly (n) e^{- n η}$ for some $η > 0$ independent of $ρ$ and n. By construction we have $1_{X} \otimes Γ_{E} = Γ_{X}^{- 1 / 2} Γ_{XE} Γ_{X}^{- 1 / 2}$ , and so thanks to Point (iii) of Definition 6.1 we can compute

\begin{matrix} {tr}_{E^{n}} [P_{E^{n} X^{n}}^{x, δ} Γ_{E}^{\otimes n}] & = (Γ_{X}^{- 1 / 2})^{\otimes n} {tr}_{E^{n}} [P_{E^{n} X^{n}}^{x, δ} Γ_{XE}^{\otimes n}] (Γ_{X}^{- 1 / 2})^{\otimes n} \\ ⩽ poly (n) exp (- n (x - 4 δ)) 1_{X^{n}}, \end{matrix}

113

where we furthermore used the fact that $P_{E^{n} X^{n}}^{x, δ}$ commutes with $Γ_{XE}^{\otimes n}$ and with $Γ_{X}^{\otimes n}$ . We therefore see using $γ_{E} = Γ_{E} / tr [Γ_{E}]$ that

\begin{matrix} tr [P_{E^{n} X^{n}}^{x, δ} ρ_{X}^{\otimes n} \otimes γ_{E}^{\otimes n}] & ⩽ \frac{1}{tr [Γ_{E}^{\otimes n}]} poly (n) exp (- n (x - 4 δ)) tr [ρ_{X}^{\otimes n}] \\ = poly (n) exp (- n (x - β F_{E} - 4 δ)) . \end{matrix}

114

Let

\begin{matrix} e^{m} = ⌊ exp \{n (x - β F_{E} - 4 δ - η)\} ⌋, \end{matrix}

115

such that $tr [P_{E^{n} X^{n}}^{x, δ} ρ_{X}^{\otimes n} \otimes γ_{E}^{\otimes n}] ⩽ e^{- m} κ^{'}$ by choosing $κ^{'} = poly (n) e^{- n η}$ .

Now let J be a register of dimension at least $2 e^{m}$ and let $R_{E^{n} X^{n} J}$ be the thermal operation furnished by Proposition 7.1 for $S = E^{n}$ , $M = X^{n}$ , $S_{E^{n} X^{n}}$ , $P_{E^{n} X^{n}}^{x, δ}$ , m, $κ$ , and $κ^{'}$ as defined above. Here, we have assumed that $x > β F_{E}$ , and that furthermore $δ, η$ are small enough such that $4 δ + η < (x - β F_{E})$ ; if instead $x = β F_{E}$ then we can set $e^{m} = 1$ and $R_{E^{n} X^{n} J} (\cdot) = {tr}_{E^{n}} (\cdot) \otimes γ_{E}^{\otimes n}$ (which is a thermal operation) in the following.

We proceed to show that the effective work process $T_{E^{n} X^{n} \to E^{n} X^{n}}^{R}$ of $R_{E^{n} X^{n} J}$ with respect to $(τ_{J}^{m}, {| 0 ⟩}_{J})$ is close to the partial trace map $T_{E^{n} X^{n} \to E^{n} X^{n}}^{(0)} (\cdot) = {tr}_{E^{n}} (\cdot) \otimes γ_{E}^{\otimes n}$ in diamond distance. We invoke the post-selection technique (Theorem 2.1) to show this. Let $ζ_{E^{n} X^{n}}$ be the de Finetti state, which via (21) can be written as a convex combination of a finite number of i.i.d. states

\begin{matrix} ζ_{E^{n} X^{n}} = \sum p_{i} ϕ_{i}^{\otimes n} . \end{matrix}

116

Hence $ζ_{E^{n} X^{n}}$ lies in the convex hull of $S_{E^{n} X^{n}}$ , and from Proposition 7.1 and Definition 7.1 we see that for a purification ${| ζ ⟩}_{E^{n} X^{n} R}$ of $ζ_{E^{n} X^{n}}$ we have

\begin{matrix} F (T_{E^{n} X^{n} \to E^{n} X^{n}}^{R} (ζ_{E^{n} X^{n} R}), γ_{E}^{\otimes n} \otimes {tr}_{E^{n}} (ζ_{E^{n} X^{n} R})) \geq 1 - (2 κ + 4 κ^{'}) . \end{matrix}

117

Using $D (ρ, σ) \leq \sqrt{1 - F (ρ, σ)}$ along with Theorem 2.1 we find

\begin{matrix} \frac{1}{2} {‖ T_{E^{n} X^{n} \to E^{n} X^{n}}^{R} - T_{E^{n} X^{n} \to E^{n} X^{n}}^{(0)} ‖}_{⋄} ⩽ \sqrt{2 κ + 4 κ^{'}} = poly (n) e^{- n η / 2} . \end{matrix}

118

We can start piecing together the full process. Our overall protocol needs to (a) bring in a heat bath $E^{n}$ , i.e., ancillas initialized in their thermal state, (b) prepare the states ${| 0 ⟩}_{E}^{\otimes n}$ on the ancillas using an auxiliary information battery (denoted by $W^{'}$ below), (c) apply the energy-conserving unitary $V_{XE}^{\otimes n}$ , (d) apply $R_{E^{n} X^{n} J}$ using an information battery J initialized in the state $τ_{J}^{m}$ , and (e) discard the ancillas.

As explained in Sect. 3, there exists a thermal operation ${\tilde{Φ}}_{E^{n} W^{'}}$ on the ancillas and an information battery $W^{'}$ along with battery states $(τ_{W^{'}}^{(1)}, τ_{W^{'}}^{(2)})$ such that ${\tilde{Φ}}_{E^{n} W^{'}} (γ_{E}^{\otimes n} \otimes τ_{W^{'}}^{(1)}) = {| 0 ⟩ ⟨ 0 |}_{E}^{\otimes n} \otimes τ_{W^{'}}^{(2)}$ and with $w (τ_{W^{'}}^{(1)}) - w (τ_{W^{'}}^{(2)})$ arbitrarily close to $- β n F_{E}$ . Now let $W = J \otimes W^{'}$ , $τ_{W}^{(i)} = τ_{W^{'}}^{(1)} \otimes τ_{J}^{m}$ , $τ_{W}^{(f)} = τ_{W^{'}}^{(2)} \otimes {| 0 ⟩ ⟨ 0 |}_{J}$ , and define

\begin{matrix} Φ_{X^{n} W} (\cdot) & = {tr}_{E^{n}} [R_{E^{n} X^{n} J} (V_{XE}^{\otimes n} {\tilde{Φ}}_{E^{n} W^{'}} ((\cdot) \otimes γ_{E}^{\otimes n}) {(V_{XE}^{\otimes n})}^{†})] . \end{matrix}

119

The map $Φ_{X^{n} W}$ is a thermal operation because it is a concatenation of thermal operations. The overall heat bath is formed of the systems $E^{n}$ , the ancillas $A^{n}$ used in the implementation of $R_{E^{n} X^{n} J}$ , as well as the implicit heat bath used in the implementation of ${\tilde{Φ}}_{E^{n} W^{'}}$ . The system $W = J \otimes W^{'}$ is the information battery. We can verify that the associated effective work process with respect to $(τ_{W}^{(i)}, τ_{W}^{(f)})$ is

\begin{matrix} T_{X^{n}} (\cdot) & = 〈 0 |_{J} {tr}_{E^{n}} [R_{E^{n} X^{n} J} (V_{XE}^{\otimes n} {tr}_{W^{'}} [P_{W^{'}}^{(2)} {\tilde{Φ}}_{E^{n} W^{'}} ((\cdot) \otimes τ_{W^{'}}^{(1)} \otimes τ_{J}^{m} \otimes γ_{E}^{\otimes n})] {(V_{XE}^{\otimes n})}^{†})] | 0 〉_{J} \\ = {tr}_{E^{n}} [〈 0 |_{J} R_{E^{n} X^{n} J} ([V_{XE}^{\otimes n} ((\cdot) \otimes {| 0 ⟩ ⟨ 0 |}_{E}^{\otimes n}) {(V_{XE}^{\otimes n})}^{†}] \otimes τ_{J}^{m}) | 0 〉_{J}] \\ = {tr}_{E^{n}} [T_{E^{n} X^{n}}^{R} (V_{XE}^{\otimes n} ((\cdot) \otimes {| 0 ⟩ ⟨ 0 |}_{E}^{\otimes n}) {(V_{XE}^{\otimes n})}^{†})] \\ = {tr}_{E^{n}} [V_{XE}^{\otimes n} ((\cdot) \otimes {| 0 ⟩ ⟨ 0 |}_{E}^{\otimes n}) {(V_{XE}^{\otimes n})}^{†}] + Δ_{X^{n}} (\cdot) \\ = E_{X \to X}^{\otimes n} (\cdot) + Δ_{X^{n}} (\cdot), \end{matrix}

120

where $Δ_{X^{n}} (\cdot) = {tr}_{E^{n}} (T_{X^{n} E^{n}}^{R} (\cdot) - T_{X^{n} E^{n}}^{(0)} (\cdot))$ satisfies $(1 / 2) ‖ Δ_{X^{n}} ‖_{⋄} ⩽ poly (n) e^{- n η / 2}$ . Therefore for any fixed $ϵ$ and for n large enough we have $(1 / 2) ‖ T_{X^{n}} - E_{X \to X}^{\otimes n} ‖_{⋄} ⩽ ϵ$ .

The associated work cost per copy satisfies

\begin{matrix} lim_{δ \to 0} lim_{n \to \infty} \frac{1}{n} [w (τ_{W}^{(i)}) - w (τ_{W}^{(f)})] & = lim_{δ \to 0} lim_{n \to \infty} \frac{1}{n} [w (τ_{W^{'}}^{(1)}) - w (τ_{W^{'}}^{(2)}) - m] \\ & = lim_{δ \to 0} lim_{n \to \infty} \frac{1}{n} [- n β F_{E} - n (x - β F_{E} - 4 δ - η) + υ] \\ & = - x = T (E), \end{matrix}

121

recalling (115), where $0 ⩽ υ ⩽ 2$ accounts for the rounding error in (115) and a possible arbitrarily small difference between $- n β F_{E}$ and $w (τ_{W^{'}}^{(1)}) - w (τ_{W^{'}}^{(2)})$ , and recalling that $η \to 0$ as $δ \to 0$ . $□$

Discussion

Our results fit in the line of research extending results in thermodynamics from state-to-state transformations to quantum processes. Implementations of quantum processes are difficult to construct because they need to reproduce the correct correlations between the output and the reference system, and not only produce the correct output state. Here, we have seen that it is nevertheless possible to implement any quantum process at an optimal work cost: Any implementation that would use less work would violate the second law of thermodynamics on a macroscopic scale. As a special case this also provides an operational interpretation of the minimal entropy gain of a channel [35–42].

Our three constructions of optimal implementations of processes are valid in different settings, and it remains unclear if they can be unified in a single protocol that presents the advantages of all three constructions. Namely, is it possible to use a physically well-justified framework, e.g. thermal operations, to universally implement any i.i.d. process? We expect this to be possible only if an arbitrary amount of coherence is allowed, in analogy with the entanglement embezzling state required in the reverse Shannon theorem [22, 23].

Finally, the notion of quantum typicality that we have introduced in Definition 6.1 and Proposition 6.1 might be interesting in its own right. We anticipate that similar considerations might provide pathways to smooth other information-theoretic quantities [54, 61, 62] and to study the joint typicality conjecture [26, 63–66].

Acknowledgements

The authors thank Álvaro Alhambra, David Ding, Patrick Hayden, Rahul Jain, David Jennings, Martí Perarnau-Llobet, Mark Wilde, and Andreas Winter for discussions. PhF acknowledges support from the Swiss National Science Foundation (SNSF) through the Early PostDoc.Mobility Fellowship No. P2EZP2_165239 hosted by the Institute for Quantum Information and Matter (IQIM) at Caltech, from the IQIM which is a National Science Foundation (NSF) Physics Frontiers Center (NSF Grant PHY-1733907), from the Department of Energy Award DE-SC0018407, from the Swiss National Science Foundation (SNSF) through the NCCR QSIT and through Project No. 200020_16584, and from the Deutsche Forschungsgemeinschaft (DFG) Research Unit FOR 2724. FB is supported by the NSF. This work was completed prior to MB and FB joining the AWS Center for Quantum Computing.

Funding

Open Access funding enabled and organized by Projekt DEAL.

Appendix

A Missing proofs

Proof

(Lemma 2.2). A useful expression for $Π_{A^{n} B^{n}}^{λ}$ may be obtained following [25, Section V]

\begin{matrix} Π_{A^{n} B^{n}}^{λ} & = \frac{dim (Q_{λ})}{s_{λ} (diag (λ / n))} \int d U_{AB} \, Π_{A^{n} B^{n}}^{λ} {(U_{AB} \, diag {(λ / n)}_{AB} \, U_{AB}^{†})}^{\otimes n} Π_{A^{n} B^{n}}^{λ} \\ & ⩽ poly (n) e^{n \bar{H} (λ)} \int d U_{AB} \, {(U_{AB} \, diag {(λ / n)}_{AB} \, U_{AB}^{†})}^{\otimes n}, \end{matrix}

121

recalling that $Π_{A^{n} B^{n}}^{λ}$ commutes with any i.i.d. state, with $s_{λ} (X) = tr [q_{λ} (X)]$ and using bounds on $dim (Q_{λ})$ and $s_{λ} (diag (λ / n))$ derived in Ref. [25]. Here, $d U_{AB}$ denotes the Haar measure over all unitaries acting on $H_{AB}$ , normalized such that $\int d U_{AB} = 1$ . We then have

\begin{matrix} {tr}_{A^{n}} [Π_{A^{n} B^{n}}^{λ}] ⩽ poly (n) e^{n \bar{H} (λ)} \int d U_{AB} \, {tr}_{A^{n}} [{(U_{AB} \, diag {(λ / n)}_{AB} \, U_{AB}^{†})}^{\otimes n}] . \end{matrix}

122

Observe that for any state $ω_{B}$ , we have

\begin{matrix} ‖ Π_{B^{n}}^{λ^{'}} ω_{B}^{\otimes n} Π_{B^{n}}^{λ^{'}} ‖_{\infty} & = ‖ {[q_{λ^{'}} (ω_{B}) \otimes 1_{P_{λ^{'}}}]}_{λ^{'}} ‖_{\infty} \\ & = ‖ q_{λ^{'}} (ω_{B}) ‖_{\infty} ⩽ tr [q_{λ^{'}} (ω_{B})] \\ & ⩽ poly (n) e^{- n \bar{H} (λ^{'})} \end{matrix}

123

as derived e.g. in [25, Eq. (9)], and thus for any state $ω_{B}$ ,

\begin{matrix} Π_{B^{n}}^{λ^{'}} ω_{B}^{\otimes n} Π_{B^{n}}^{λ^{'}} ⩽ poly (n) e^{- n \bar{H} (λ^{'})} Π_{B^{n}}^{λ^{'}} . \end{matrix}

124

Hence, we get

\begin{matrix} Π_{B^{n}}^{λ^{'}} {tr}_{A^{n}} [Π_{A^{n} B^{n}}^{λ}] Π_{B^{n}}^{λ^{'}} \\ ⩽ poly (n) e^{n \bar{H} (λ)} \int d U_{AB} \, Π_{B^{n}}^{λ^{'}} {({tr}_{A} [U_{AB} \, diag {(λ / n)}_{AB} \, U_{AB}^{†}])}^{\otimes n} Π_{B^{n}}^{λ^{'}} \\ ⩽ poly (n) e^{n \bar{H} (λ)} \int d U_{AB} \, poly (n) e^{- n \bar{H} (λ^{'})} Π_{B^{n}}^{λ^{'}} \\ = poly (n) e^{n (\bar{H} (λ) - \bar{H} (λ^{'}))} Π_{B^{n}}^{λ^{'}}, \end{matrix}

125

as required. $□$

Proof

(Proposition 2.1) The Fannes–Audenaert continuity bound [67, 68] of the entropy states that for any $δ^{'} > 0$ there exists $ξ (δ^{'}) > 0$ such that for any quantum states $ρ, σ$ with $D (ρ, σ) ⩽ δ^{'}$ we have

\begin{matrix} | H (ρ) - H (σ) | ⩽ ξ (δ^{'}), \end{matrix}

126

and furthermore $ξ (δ^{'})$ decreases strictly monotonically to zero as $δ^{'} \to 0$ . Now, let $δ > 0$ , let $ξ^{- 1}$ be the inverse function of $ξ$ , and let $δ^{'} = ξ^{- 1} (δ)$ . Consider the set of Young diagrams $Λ_{δ^{'}} = \{λ \in Young (d_{A}, n) : D (diag (λ / n), ρ) ⩽ δ^{'}\}$ . For all $λ \in Λ_{δ^{'}}$ , we have that $| H (ρ) - \bar{H} (λ) | ⩽ δ$ thanks to the Fannes–Audenaert inequality. Then, we have

\begin{matrix} tr [(\sum_{λ : \bar{H} (λ) \in [H (ρ) \pm δ]} Π_{A^{n}}^{λ}) ρ_{A}^{\otimes n}] ⩾ tr [(\sum_{λ \in Λ_{δ^{'}}} Π_{A^{n}}^{λ}) ρ_{A}^{\otimes n}] \end{matrix}

127

because all terms in the sum in the right hand side are included in the sum on the left hand side. We may now invoke [24, Eq. (6.23)] to see that

\begin{matrix} (128) ⩾ 1 - poly (n) exp \{- n η\}, \end{matrix}

128

where $η = δ^{' 2} / 2$ . $□$
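The continuity bound used in this proof can be checked numerically. The sketch below assumes the standard explicit form of the Fannes–Audenaert bound, $ξ (t) = t ln (d - 1) + h (t)$ with $h$ the binary entropy in nats and $t$ the trace distance; the helper names `random_state`, `entropy`, `trace_distance` and `xi` are illustrative and not taken from the paper.

```python
import numpy as np

def random_state(d, rng):
    """Random density matrix built from a complex Gaussian matrix."""
    g = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
    rho = g @ g.conj().T
    return rho / np.trace(rho).real

def entropy(rho):
    """Von Neumann entropy in nats."""
    p = np.linalg.eigvalsh(rho)
    p = p[p > 1e-12]
    return float(-np.sum(p * np.log(p)))

def trace_distance(rho, sigma):
    """D(rho, sigma) = (1/2) * ||rho - sigma||_1 for Hermitian inputs."""
    return 0.5 * float(np.sum(np.abs(np.linalg.eigvalsh(rho - sigma))))

def xi(t, d):
    """Assumed explicit bound t*ln(d-1) + h(t), valid for 0 <= t <= 1 - 1/d."""
    h = 0.0 if t in (0.0, 1.0) else -t * np.log(t) - (1 - t) * np.log(1 - t)
    return float(t * np.log(d - 1) + h)

rng = np.random.default_rng(0)
d = 4
rho = random_state(d, rng)
sigma = 0.9 * rho + 0.1 * random_state(d, rng)  # a state close to rho
t = trace_distance(rho, sigma)
gap = abs(entropy(rho) - entropy(sigma))
print(f"D = {t:.4f}, |H(rho) - H(sigma)| = {gap:.4f}, xi(D) = {xi(t, d):.4f}")
```

On the interval $0 ⩽ t ⩽ 1 - 1 / d$ this $ξ$ is increasing in $t$, which is what allows the proof to invert it.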

Proof

(Proposition 2.2). The fact that there are only $poly (n)$ elements follows because there are only so many types. Property (ii) holds by definition. Property (iv) holds because $e^{- n (k \pm δ)}$ is the minimum / maximum eigenvalue of $Γ_{A}^{\otimes n}$ in the subspace spanned by $R_{A^{n}}^{\approx_{δ} h}$ . Finally, we need to show Property (iii): This follows from a large deviation analysis. More precisely, let $Z_{j}$ for $j = 1, \dots, n$ be random variables where $Z_{j}$ represents the measurement outcome of $H_{A}$ on the j-th system of the i.i.d. state $ρ_{A}^{\otimes n}$ . By Hoeffding’s inequality, we have that

\begin{matrix} Pr [|(1 / n) \sum Z_{j} - tr [ρ_{A} H_{A}]| > δ] & ⩽ 2 exp (- \frac{2 n δ^{2}}{Δ H_{A}^{2}}) ⩽ 2 exp (- \frac{n δ^{2}}{2 ‖ H_{A} ‖_{\infty}^{2}}), \end{matrix}

129

where $Δ H_{A}$ is the difference between the maximum and minimum eigenvalue of $H_{A}$ , and $Δ H_{A} ⩽ 2 {‖ H_{A} ‖}_{\infty}$ . Thus, the event consisting of the outcomes k satisfying $| k - tr [ρ_{A} H_{A}] | ⩽ δ$ happens with probability at least $1 - 2 e^{- n η}$ , proving (16). $□$
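As a rough illustration of the large deviation step, the following sketch evaluates both forms of the Hoeffding bound and compares them with a simulated energy measurement on a two-level system. The two-level system, its excited-state population `p_excited`, and all function names are assumptions made for the example only.

```python
import math
import random

def hoeffding_bounds(n, delta, e_min, e_max):
    """Return (bound with the spectral spread, weaker bound with the operator norm)."""
    spread = e_max - e_min                   # Delta H_A
    norm = max(abs(e_min), abs(e_max))       # ||H_A||_inf
    tight = 2 * math.exp(-2 * n * delta**2 / spread**2)
    loose = 2 * math.exp(-n * delta**2 / (2 * norm**2))
    return tight, loose

def empirical_failure_rate(n, delta, p_excited, e_excited, trials, seed=0):
    """Fraction of trials in which the mean measured energy deviates by more than delta."""
    rng = random.Random(seed)
    mean = p_excited * e_excited             # tr[rho_A H_A] for energies 0 and e_excited
    failures = 0
    for _ in range(trials):
        total = sum(e_excited for _ in range(n) if rng.random() < p_excited)
        if abs(total / n - mean) > delta:
            failures += 1
    return failures / trials

tight, loose = hoeffding_bounds(n=200, delta=0.1, e_min=0.0, e_max=1.0)
empirical = empirical_failure_rate(n=200, delta=0.1, p_excited=0.3,
                                   e_excited=1.0, trials=2000)
print(f"empirical = {empirical:.4f}, Hoeffding = {tight:.4f}, norm-based = {loose:.4f}")
```

The norm-based form is weaker but depends only on $‖ H_{A} ‖_{\infty}$, which is the quantity carried through the rest of the argument.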

Proof

(Proposition 2.3) We use the post-selection technique (Theorem 2.1) to bound the diamond norm distance between $T_{X^{n} \to X^{' n}}$ and $E_{X \to X^{'}}^{\otimes n}$ . Let ${| ζ ⟩}_{X^{n} {\bar{R}}^{n} R^{'}}$ be the purification of the de Finetti state given by (21). Calculate

\begin{matrix} Re \{{⟨ ζ |}_{X^{n} {\bar{R}}^{n} R^{'}} {(V_{X \to E X^{'}}^{\otimes n})}^{†} W_{X^{n} \to E^{n} X^{' n}} {| ζ ⟩}_{X^{n} {\bar{R}}^{n} R^{'}}\} \\ = \sum p_{i} Re \{{⟨ ϕ_{i} |}_{X \bar{R}}^{\otimes n} {(V_{X \to E X^{'}}^{\otimes n})}^{†} W_{X^{n} \to E^{n} X^{' n}} {| ϕ_{i} ⟩}_{X \bar{R}}^{\otimes n}\} \\ ⩾ 1 - poly (n) exp (- n η) \end{matrix}

130

which implies, recalling that $F (| ψ ⟩, | ϕ ⟩) = | ⟨ ψ | ϕ ⟩ | ⩾ Re \{⟨ ψ | ϕ ⟩\}$ and that ${(1 - x)}^{2} ⩾ 1 - 2 x$ ,

\begin{matrix} F^{2} (V_{X \to E X^{'}}^{\otimes n} {| ζ ⟩}_{X^{n} {\bar{R}}^{n} R^{'}}, W_{X^{n} \to E^{n} X^{' n}} {| ζ ⟩}_{X^{n} {\bar{R}}^{n} R^{'}}) ⩾ 1 - poly (n) exp (- n η) \end{matrix}

131

and hence

\begin{matrix} P (V_{X \to E X^{'}}^{\otimes n} {| ζ ⟩}_{X^{n} {\bar{R}}^{n} R^{'}}, W_{X^{n} \to E^{n} X^{' n}} {| ζ ⟩}_{X^{n} {\bar{R}}^{n} R^{'}}) ⩽ poly (n) exp (- n η / 2) . \end{matrix}

132

Recalling the relations between the trace distance and the purified distance, and noting that these distance measures cannot increase under the partial trace, we obtain

\begin{matrix} D (T (ζ_{X^{n} {\bar{R}}^{n} R^{'}}), E^{\otimes n} (ζ_{X^{n} {\bar{R}}^{n} R^{'}})) ⩽ P (T (ζ_{X^{n} {\bar{R}}^{n} R^{'}}), E^{\otimes n} (ζ_{X^{n} {\bar{R}}^{n} R^{'}})) \\ ⩽ P (W_{X^{n} \to E^{n} X^{' n}} {| ζ ⟩}_{X^{n} {\bar{R}}^{n} R^{'}}, V_{X \to E X^{'}}^{\otimes n} {| ζ ⟩}_{X^{n} {\bar{R}}^{n} R^{'}}) ⩽ poly (n) exp (- n η / 2) . \end{matrix}

133

The post-selection technique then asserts that

\begin{matrix} \frac{1}{2} {‖ T - E^{\otimes n} ‖}_{⋄} ⩽ poly (n) exp (- n η / 2) \end{matrix}

134

as claimed. $□$
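The chain of inequalities in this proof can be illustrated on a small example. The sketch below takes two nearby pure states on $X \otimes E$ (random vectors chosen for illustration, not the de Finetti state of the proof), and checks that a real overlap of $1 - x$ gives $F^{2} ⩾ 1 - 2 x$ and $P ⩽ \sqrt{2 x}$, and that the trace distance of the reduced states on X does not exceed the purified distance of the pure states.

```python
import numpy as np

def reduced_state_X(psi, d_x, d_e):
    """Reduced state on X of a pure state vector on X (x) E."""
    m = psi.reshape(d_x, d_e)
    return m @ m.conj().T

def trace_distance(rho, sigma):
    """D(rho, sigma) = (1/2) * ||rho - sigma||_1 for Hermitian inputs."""
    return 0.5 * float(np.sum(np.abs(np.linalg.eigvalsh(rho - sigma))))

rng = np.random.default_rng(1)
d_x, d_e = 3, 4
psi = rng.normal(size=d_x * d_e) + 1j * rng.normal(size=d_x * d_e)
psi /= np.linalg.norm(psi)
noise = rng.normal(size=d_x * d_e) + 1j * rng.normal(size=d_x * d_e)
phi = psi + 0.05 * noise                     # a small perturbation of psi
phi /= np.linalg.norm(phi)

overlap = np.vdot(psi, phi)                  # <psi|phi>
x = 1 - overlap.real                         # so that Re<psi|phi> = 1 - x
fid_sq = abs(overlap) ** 2                   # F^2 for pure states
purified = float(np.sqrt(max(0.0, 1 - fid_sq)))
reduced = trace_distance(reduced_state_X(psi, d_x, d_e),
                         reduced_state_X(phi, d_x, d_e))
print(f"x = {x:.4f}, F^2 = {fid_sq:.4f}, P = {purified:.4f}, D on X = {reduced:.4f}")
```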

Proof

(Lemma 7.2). Let $V_{X \to X E}^{'}$ be any Stinespring dilation isometry of $E_{X \to X}$ , such that $E_{X \to X} (\cdot) = {tr}_{E} [V_{X \to X E}^{'} (\cdot) V^{' †}]$ . For the input state ${| Φ ⟩}_{X : R_{X}}$ , consider the output state ${| φ ⟩}_{X E R_{X}}$ corresponding to first time-evolving by some time t, and then applying $V^{'}$

\begin{matrix} {| φ ⟩}_{X E R_{X}} = V^{'} e^{- i H_{X} t} {| Φ ⟩}_{X : R_{X}} = e^{- i V^{'} H_{X} V^{' †} t} V^{'} {| Φ ⟩}_{X : R_{X}} . \end{matrix}

135

Now, let us define $| φ^{'} ⟩_{X E R_{X}} = e^{- i H_{X} t} V^{'} {| Φ ⟩}_{X : R_{X}}$ . By the covariance property of $E_{X \to X}$ both $| φ ⟩$ and $| φ^{'} ⟩$ have the same reduced state on $X R_{X}$ . Hence, they are related by some unitary $W_{E}^{(t)}$ on the system E which in general depends on t

\begin{matrix} {| φ ⟩}_{X E R_{X}} = W_{E}^{(t)} {| φ^{'} ⟩}_{X E R_{X}} . \end{matrix}

136

We have

\begin{matrix} {tr}_{X} [V^{'} e^{- i H_{X} t} Φ_{X : R_{X}} e^{i H_{X} t} V^{' †}] = W_{E}^{(t)} {tr}_{X} [V^{'} Φ_{X : R_{X}} V^{' †}] W_{E}^{(t) †} \end{matrix}

137

so $W_{E}^{(t)}$ must define a representation of time evolution, at least on the support of the operator ${tr}_{X} [V^{'} Φ_{X : R_{X}} V^{' †}]$ . Hence, we may write $W_{E}^{(t)} = e^{- i H_{E} t}$ for some Hamiltonian $H_{E}$ , and from (137), we have for all t

\begin{matrix} V_{X \to X E}^{'} e^{- i H_{X} t} = e^{- i (H_{X} + H_{E}) t} V_{X \to X E}^{'} . \end{matrix}

138

Expanding for infinitesimal t we obtain

\begin{matrix} V_{X \to X E}^{'} H_{X} = (H_{X} + H_{E}) V_{X \to X E}^{'} . \end{matrix}

139

Let ${| 0 ⟩}_{E}$ be an eigenvector of $H_{E}$ corresponding to the eigenvalue zero; if $H_{E}$ does not contain an eigenvector with eigenvalue equal to zero, we may trivially add a dimension to the system E to accommodate this vector. Then, the operator $V_{X \to X E}^{'} {⟨ 0 |}_{E}$ maps each state of a subset of energy levels of XE to a corresponding energy level of the same energy on XE; it may thus be completed to a fully energy-preserving unitary $V_{X E \to X E}$ . More precisely, let $\{{| j ⟩}_{X}\}$ be a complete set of eigenvectors of $H_{X}$ with energies $h_{j}$ . Then $| ψ_{j}^{'} ⟩_{XE} = V_{X \to X E}^{'} {| j ⟩}_{X}$ is an eigenvector of $H_{X} + H_{E}$ of energy $h_{j}$ thanks to (140). We have two orthonormal sets $\{{| 0 ⟩}_{E} \otimes {| j ⟩}_{X}\}$ and $\{| ψ_{j}^{'} ⟩_{XE}\}$ in which the j-th vector of each set has the same energy; we can thus complete these sets into two bases $\{| χ_{i} ⟩_{XE}\}$ , $\{| χ_{i}^{'} ⟩_{XE}\}$ of eigenvectors of $H_{X} + H_{E}$ , where the i-th element of either basis has exactly the same energy. This defines a unitary $V_{X E \to X E} = \sum_{i} | χ_{i}^{'} ⟩_{XE} {⟨ χ_{i} |}_{XE}$ that is an extension of $V_{X \to X E}^{'} {⟨ 0 |}_{E}$ , and that satisfies all the conditions of the claim. $□$

B. Technical Lemmas

Lemma B.1

(Pinching-like operator inequality). Let $\{E^{i}\}_{i = 1}^{M}$ be a collection of M operators and $T ⩾ 0$ . Then, we have

\begin{matrix} (\sum E^{i}) T (\sum E^{j †}) ⩽ M \sum E^{i} T E^{i †} . \end{matrix}

140

Proof

Call our system S and consider an additional register C of dimension $| C | = M$ , and let ${| χ ⟩}_{C} = M^{- 1 / 2} \sum_{k = 1}^{M} {| k ⟩}_{C}$ . Then, we have

\begin{matrix} (\sum E_{S}^{i}) T_{S} (\sum E_{S}^{j †}) & = {tr}_{C} [(\sum E_{S}^{i} \otimes {| i ⟩}_{C}) T_{S} (\sum E_{S}^{j †} \otimes {⟨ j |}_{C}) (1_{S} \otimes (M {| χ ⟩ ⟨ χ |}_{C}))] \\ & ⩽ M {tr}_{C} [(\sum E_{S}^{i} \otimes {| i ⟩}_{C}) T_{S} (\sum E_{S}^{j †} \otimes {⟨ j |}_{C}) (1_{S} \otimes 1_{C})] \\ & = M \sum E_{S}^{i} T_{S} E_{S}^{i †}, \end{matrix}

141

using ${| χ ⟩ ⟨ χ |}_{C} ⩽ 1_{C}$ . $□$
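A quick numerical sanity check of Lemma B.1 can be written as follows. The operators and the positive operator T are random matrices chosen for illustration; the check confirms that $M \sum E^{i} T E^{i †} - (\sum E^{i}) T (\sum E^{j †})$ has no negative eigenvalue up to rounding.

```python
import numpy as np

rng = np.random.default_rng(2)
d, M = 4, 3

# M arbitrary (not necessarily Hermitian) operators E^i
ops = [rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d)) for _ in range(M)]

# A positive semi-definite operator T
g = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
T = g @ g.conj().T

total = sum(ops)
lhs = total @ T @ total.conj().T                  # (sum_i E^i) T (sum_j E^j)^dagger
rhs = M * sum(E @ T @ E.conj().T for E in ops)    # M * sum_i E^i T E^i dagger
min_eig = float(np.linalg.eigvalsh(rhs - lhs).min())
print(f"smallest eigenvalue of rhs - lhs: {min_eig:.3e}")
```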

Lemma B.2

(Gentle measurement). Let $ρ$ be a sub-normalized quantum state and $0 ⩽ Q ⩽ 1$ . For $tr [Q ρ] ⩾ 1 - δ$ we then have

\begin{matrix} P (ρ, Q^{1 / 2} ρ Q^{1 / 2}) ⩽ \sqrt{2 δ} . \end{matrix}

142

This is a cruder statement than that of, e.g., [69, Lemma 7], allowing for a more straightforward proof.

Proof

We have

\begin{matrix} \bar{F} (ρ, Q^{1 / 2} ρ Q^{1 / 2}) ⩾ F (ρ, Q^{1 / 2} ρ Q^{1 / 2}) & = tr [\sqrt{ρ^{1 / 2} (Q^{1 / 2} ρ Q^{1 / 2}) ρ^{1 / 2}}] \\ & = tr [Q^{1 / 2} ρ] ⩾ tr [Q ρ] ⩾ 1 - δ . \end{matrix}

143

Then, we get $P (ρ, Q^{1 / 2} ρ Q^{1 / 2}) ⩽ \sqrt{1 - {(1 - δ)}^{2}} ⩽ \sqrt{2 δ}$ . $□$
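The steps of this proof can be reproduced numerically. The sketch below assumes the usual definitions $\bar{F} (ρ, σ) = ‖ \sqrt{ρ} \sqrt{σ} ‖_{1} + \sqrt{(1 - tr ρ) (1 - tr σ)}$ and $P = \sqrt{1 - {\bar{F}}^{2}}$ for sub-normalized states, which are not restated in this appendix; the state and the operator Q are random examples.

```python
import numpy as np

def psd_sqrt(a):
    """Square root of a positive semi-definite matrix via its eigendecomposition."""
    w, v = np.linalg.eigh(a)
    return (v * np.sqrt(np.clip(w, 0.0, None))) @ v.conj().T

def fidelity(rho, sigma):
    """F(rho, sigma) = || sqrt(rho) sqrt(sigma) ||_1 (sum of singular values)."""
    return float(np.linalg.svd(psd_sqrt(rho) @ psd_sqrt(sigma), compute_uv=False).sum())

rng = np.random.default_rng(3)
d = 4
g = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
rho = g @ g.conj().T
rho /= np.trace(rho).real                     # normalized state

# An operator 0 <= Q <= 1 close to the identity, so that tr[Q rho] is close to 1
h = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
h = h @ h.conj().T
Q = np.eye(d) - 0.05 * h / np.linalg.norm(h, 2)

delta = 1 - np.trace(Q @ rho).real
sqrt_Q = psd_sqrt(Q)
post = sqrt_Q @ rho @ sqrt_Q                  # sub-normalized post-measurement state
F = fidelity(rho, post)
slack = (1 - np.trace(rho).real) * (1 - np.trace(post).real)
F_bar = F + np.sqrt(max(0.0, slack))
P = float(np.sqrt(max(0.0, 1 - F_bar**2)))
print(f"delta = {delta:.4f}, F = {F:.4f}, P = {P:.4f}, sqrt(2 delta) = {np.sqrt(2 * delta):.4f}")
```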

Proposition B.3

(Controlled-unitary using a POVM). Let $\{Q^{j}\}$ be a set of positive semi-definite operators on a system X satisfying $\sum Q^{j} ⩽ 1$ , $\{U^{j}\}$ be a collection of unitaries on a system Y, and

\begin{matrix} W_{XY} = \sum_{j} Q_{X}^{j} \otimes U_{Y}^{j} . \end{matrix}

144

Then, we have $W^{†} W ⩽ 1$ .

Proof

Using an additional register K, define

\begin{matrix} V_{X \to X K} = \sum \sqrt{Q^{j}} \otimes {| j ⟩}_{K} . \end{matrix}

145

Then, we have $V^{†} V = \sum Q^{j} ⩽ 1$ . Clearly, $V V^{†} ⩽ 1_{XK}$ because $V V^{†}$ and $V^{†} V$ have the same non-zero eigenvalues. Now, let

\begin{matrix} W = V^{†} (\sum 1_{X} \otimes U_{Y}^{j} \otimes {| j ⟩ ⟨ j |}_{K}) V . \end{matrix}

146

Because the middle term in parentheses is unitary, we manifestly have $W^{†} W ⩽ 1$ . $□$
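Proposition B.3 can be checked on a random instance. In the sketch below the operators $Q^{j}$ are obtained by rescaling random positive matrices so that $\sum Q^{j} = 0.9 \cdot 1$; this particular construction and the helper names are assumptions made for the example.

```python
import numpy as np

def psd_power(a, p):
    """Matrix power of a positive definite matrix via its eigendecomposition."""
    w, v = np.linalg.eigh(a)
    return (v * np.clip(w, 1e-15, None) ** p) @ v.conj().T

def random_unitary(d, rng):
    """Unitary factor of the QR decomposition of a complex Gaussian matrix."""
    q, _ = np.linalg.qr(rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d)))
    return q

rng = np.random.default_rng(4)
d_x, d_y, n_out = 3, 2, 4

# Random positive operators, rescaled so that sum_j Q^j = 0.9 * identity <= 1
raw = []
for _ in range(n_out):
    g = rng.normal(size=(d_x, d_x)) + 1j * rng.normal(size=(d_x, d_x))
    raw.append(g @ g.conj().T)
s_inv_half = psd_power(sum(raw), -0.5)
Qs = [0.9 * s_inv_half @ a @ s_inv_half for a in raw]
Us = [random_unitary(d_y, rng) for _ in range(n_out)]

# W_XY = sum_j Q^j (x) U^j
W = sum(np.kron(Q, U) for Q, U in zip(Qs, Us))
op_norm = float(np.linalg.norm(W, 2))         # largest singular value of W
print(f"||W||_inf = {op_norm:.4f}")
```

A largest singular value of at most one is equivalent to $W^{†} W ⩽ 1$.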

C. Dilation of Energy-Conserving Operators to Unitaries

This appendix collects a few technical lemmas on constructing an energy-conserving unitary that extends a given operator of norm at most one.

Proposition C.1

Let $W_{X}$ be an operator on a system X, such that $W^{†} W ⩽ 1$ . Then, there exists a unitary operator $U_{XQ}$ acting on X and a qubit Q such that for any ${| ψ ⟩}_{X}$ ,

\begin{matrix} {⟨ 0 |}_{Q} U_{XQ} ({| ψ ⟩}_{X} \otimes {| 0 ⟩}_{Q}) = W_{X} {| ψ ⟩}_{X} . \end{matrix}

147

That is, any operator W with ${‖ W ‖}_{\infty} ⩽ 1$ can be dilated to a unitary, with a post-selection on the output.

Proof

Setting $V_{X \to X Q} = W \otimes {| 0 ⟩}_{Q} + \sqrt{1 - W^{†} W} \otimes {| 1 ⟩}_{Q}$ , we see that $V^{†} V = W^{†} W + 1 - W^{†} W = 1_{X}$ , and hence $V_{X \to X Q}$ is an isometry. We can complete this isometry to a unitary $U_{XQ}$ that acts as V on the support of $1_{X} \otimes {| 0 ⟩ ⟨ 0 |}_{Q}$ and that maps the support of $1_{X} \otimes {| 1 ⟩ ⟨ 1 |}_{Q}$ onto the complementary space to the image of V. It then follows that for any ${| ψ ⟩}_{X}$ , we have $U_{XQ} ({| ψ ⟩}_{X} \otimes {| 0 ⟩}_{Q}) = V_{X \to X Q} {| ψ ⟩}_{X} = (W_{X} {| ψ ⟩}_{X}) \otimes {| 0 ⟩}_{Q} + (\dots) \otimes {| 1 ⟩}_{Q}$ , and the claim follows. $□$
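The construction in this proof can be sketched numerically. The code below builds the isometry V from a random contraction W and completes it to a unitary by appending an orthonormal basis of the complement of the image of V, obtained here from a full singular value decomposition. The basis ordering $Q \otimes X$ (qubit index first) is a convention chosen for the example so that ${⟨ 0 |}_{Q} U {| 0 ⟩}_{Q}$ is the top-left block.

```python
import numpy as np

def psd_sqrt(a):
    """Square root of a positive semi-definite matrix via its eigendecomposition."""
    w, v = np.linalg.eigh(a)
    return (v * np.sqrt(np.clip(w, 0.0, None))) @ v.conj().T

rng = np.random.default_rng(5)
d = 3
W = rng.normal(size=(d, d)) + 1j * rng.normal(size=(d, d))
W *= 0.8 / np.linalg.norm(W, 2)               # enforce W^dagger W <= 1

# V = W (x) |0>_Q + sqrt(1 - W^dagger W) (x) |1>_Q, stacked in the ordering Q (x) X
V = np.vstack([W, psd_sqrt(np.eye(d) - W.conj().T @ W)])

# The last d left singular vectors span the complement of the image of V
left, _, _ = np.linalg.svd(V, full_matrices=True)
U = np.hstack([V, left[:, d:]])               # columns: inputs with Q = 0, then Q = 1

isometry_error = float(np.linalg.norm(V.conj().T @ V - np.eye(d)))
unitarity_error = float(np.linalg.norm(U.conj().T @ U - np.eye(2 * d)))
block_error = float(np.linalg.norm(U[:d, :d] - W))
print(isometry_error, unitarity_error, block_error)
```

The completion is not unique; any orthonormal basis of the complement gives a valid $U_{XQ}$.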

Proposition C.2

Let X be a quantum system with Hamiltonian $H_{X}$ and $W_{X}$ be an operator with $W^{†} W ⩽ 1$ as well as $[W_{X}, H_{X}] = 0$ . Then, there exists a unitary operator $U_{XQ}$ acting on X and a qubit Q with $H_{Q} = 0$ , that satisfies $[U_{XQ}, H_{X}] = 0$ such that

\begin{matrix} {⟨ 0 |}_{Q} U_{XQ} {| 0 ⟩}_{Q} = W_{X} . \end{matrix}

148

That is, any energy-preserving operator W with ${‖ W ‖}_{\infty} ⩽ 1$ can be dilated to an energy-preserving unitary on an ancilla with a post-selection on the output.

Proof

First we calculate $[W^{†} W, H_{X}] = W^{†} [W, H_{X}] + [W^{†}, H_{X}] W = 0 - {[W, H_{X}]}^{†} W = 0$ . This implies that $[\sqrt{1 - W^{†} W}, H_{X}] = 0$ , as $W^{†} W$ and $\sqrt{1 - W^{†} W}$ have the same eigenspaces. We define

\begin{matrix} V_{X \to X Q} = W \otimes {| 0 ⟩}_{Q} + \sqrt{1 - W^{†} W} \otimes {| 1 ⟩}_{Q} . \end{matrix}

149

The operator $V_{X \to X Q}$ is an isometry, because $V^{†} V = W^{†} W + 1 - W^{†} W = 1_{X}$ . Furthermore, we have

\begin{matrix} V_{X \to X Q} H_{X} & = (W_{X} H_{X}) \otimes | 0 ⟩ + (\sqrt{1 - W^{†} W} H_{X}) \otimes | 1 ⟩ \end{matrix}

150

\begin{matrix} = (H_{X} W_{X}) \otimes | 0 ⟩ + (H_{X} \sqrt{1 - W^{†} W}) \otimes | 1 ⟩ = H_{X} V_{X \to X Q} \end{matrix}

151

and thus we find $[V_{X \to X Q}, H_{X}] = 0$ . Let $\{{| j ⟩}_{X}\}$ be an eigenbasis of $H_{X}$ , and let $| ψ_{j}^{'} ⟩_{XQ} = V_{X \to X Q} {| j ⟩}_{X}$ , noting that both ${| j ⟩}_{X}$ and $| ψ_{j}^{'} ⟩_{XQ}$ have the same energy. The two collections of vectors $\{{| j ⟩}_{X} \otimes {| 0 ⟩}_{Q}\}$ and $\{| ψ_{j}^{'} ⟩_{XQ}\}$ can thus be completed into two bases $\{| χ_{i} ⟩_{XQ}\}$ and $\{| χ_{i}^{'} ⟩_{XQ}\}$ of eigenvectors of $H_{X} + H_{Q}$ where the i-th elements of both bases have the same energy. Define finally $U_{XQ} = \sum_{i} | χ_{i}^{'} ⟩ ⟨ χ_{i} |_{XQ}$ , noting that by construction $U_{XQ} {| 0 ⟩}_{Q} = V_{X \to X Q}$ and $[U_{XQ}, H_{X}] = 0$ . $□$
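One explicit energy-preserving completion, which is not the basis-matching construction of the proof but satisfies the same conclusions, is the block form often called the Halmos dilation, $U = \begin{pmatrix} W & \sqrt{1 - W W^{†}} \\ \sqrt{1 - W^{†} W} & - W^{†} \end{pmatrix}$ in the ordering $Q \otimes X$. The sketch below checks on an example with degenerate energy levels (chosen for illustration) that this U is unitary, commutes with $H_{X}$, and has W as its ${⟨ 0 |}_{Q} \cdot {| 0 ⟩}_{Q}$ block.

```python
import numpy as np

def psd_sqrt(a):
    """Square root of a positive semi-definite matrix via its eigendecomposition."""
    w, v = np.linalg.eigh(a)
    return (v * np.sqrt(np.clip(w, 0.0, None))) @ v.conj().T

rng = np.random.default_rng(6)

# Hamiltonian with a two-fold and a three-fold degenerate level
H = np.diag([0.0, 0.0, 1.0, 1.0, 1.0])

# W acts arbitrarily within each energy eigenspace, hence [W, H] = 0
W = np.zeros((5, 5), dtype=complex)
W[:2, :2] = rng.normal(size=(2, 2)) + 1j * rng.normal(size=(2, 2))
W[2:, 2:] = rng.normal(size=(3, 3)) + 1j * rng.normal(size=(3, 3))
W *= 0.8 / np.linalg.norm(W, 2)               # enforce W^dagger W <= 1

eye = np.eye(5)
Wd = W.conj().T
U = np.block([[W, psd_sqrt(eye - W @ Wd)],
              [psd_sqrt(eye - Wd @ W), -Wd]])

H_total = np.kron(np.eye(2), H)               # H_Q = 0, ordering Q (x) X
commutator_norm = float(np.linalg.norm(U @ H_total - H_total @ U))
unitarity_error = float(np.linalg.norm(U.conj().T @ U - np.eye(10)))
block_error = float(np.linalg.norm(U[:5, :5] - W))
print(commutator_norm, unitarity_error, block_error)
```

Every block of this U is a function of W and $W^{†}$, so commutation with $H_{X}$ is inherited directly from $[W_{X}, H_{X}] = 0$.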

D. Robust Counterexample Against Extensions of Construction #1

In this appendix we show that the counterexample of Sect. 5.2 is robust to small errors on the process. The process is $E_{X \to X^{'}} (\cdot) = tr [\cdot] | + ⟩ ⟨ + |$ , where $| + ⟩ = [| 0 ⟩ + | 1 ⟩] / \sqrt{2}$ with $| 0 ⟩, | 1 ⟩$ energy eigenstates of respective energies $E_{0} = 0$ , $E_{1} > 0$ ; we write $H_{X} = \sum_{j = 0, 1} E_{j} | j ⟩ ⟨ j |$ and $Γ_{X} = e^{- β H_{X}}$ . The initial state on X and a reference system $R_{X} ≃ X$ is the maximally entangled state ${| σ ⟩}_{X R_{X}} = [| 00 ⟩ + | 11 ⟩] / \sqrt{2} = {| Φ ⟩}_{X : R_{X}} / \sqrt{2}$ .

We seek a map $T_{X \to X^{'}}$ such that

\begin{matrix} P (T_{X \to X^{'}} (σ_{X R_{X}}), E_{X \to X^{'}} (σ_{X R_{X}})) ⩽ ϵ \quad \text{and} \quad T_{X \to X^{'}} (Γ_{X}) ⩽ α Γ_{X^{'}}, \end{matrix}

152

for an $α$ that is independent of $E_{0}, E_{1}$ . Here we have $X ≃ X^{'}$ and $Γ_{X} = Γ_{X^{'}}$ .

Let $ρ_{X^{'} R_{X}} = E_{X \to X^{'}} (σ_{X R_{X}})$ . From (153) we find $\frac{1}{2} {‖ T_{X \to X^{'}} (σ_{X R_{X}}) - ρ_{X^{'} R_{X}} ‖}_{1} ⩽ ϵ$ , which in turn implies that $(1 / 4) ‖ T_{X \to X^{'}} (Φ_{X : R_{X}}) - {| + ⟩ ⟨ + |}_{X^{'}} \otimes 1_{R_{X}} ‖_{1} ⩽ ϵ$ , and hence that $T_{X \to X^{'}} (\cdot) = {tr [\cdot] | + ⟩ ⟨ + |}_{X^{'}} + Δ (\cdot)$ for some Hermiticity preserving map $Δ (\cdot)$ satisfying $\frac{1}{2} {‖ Δ (Φ_{X R_{X}}) ‖}_{1} ⩽ 2 ϵ$ .

Let $Δ_{\pm} ⩾ 0$ be the positive and negative parts of $Δ (Γ) = Δ_{+} - Δ_{-}$ , noting that $tr (Δ_{-}) ⩽ tr (Δ_{-}) + tr (Δ_{+}) = {‖ Δ (Γ) ‖}_{1} = {‖ {tr}_{R_{X}} (Γ_{R_{X}}^{1 / 2} Δ (Φ_{X : R_{X}}) Γ_{R_{X}}^{1 / 2}) ‖}_{1}$ , defining $Γ_{R_{X}}$ as the transpose of $Γ_{X}$ onto the system $R_{X}$ , and continuing the computation we obtain $tr (Δ_{-}) ⩽ ‖ Γ_{R_{X}}^{1 / 2} Δ (Φ_{X : R_{X}}) Γ_{R_{X}}^{1 / 2} ‖_{1} ⩽ ‖ Γ_{R_{X}} ‖_{\infty} {‖ Δ (Φ_{X : R_{X}}) ‖}_{1} ⩽ 4 ϵ$ , using the fact that $‖ Γ_{X} ‖_{\infty} = {max}_{j} \{e^{- β E_{j}}\} = 1$ .

To complete this argument we define the hypothesis testing relative entropy [70–74] in its form as presented in [75]. For any sub-normalized quantum state $ρ$ and for any positive semi-definite operator $σ$ whose support contains the support of $ρ$ , we define it via the following equivalent optimizations, which are semi-definite programs [76] in terms of the primal variable $Q ⩾ 0$ and the dual variables $μ, X ⩾ 0$ :

\begin{matrix} e^{- D_{H}^{η} (ρ ‖ σ)} = & \text{minimize:} & η^{- 1} tr [Q σ] & = & \text{maximize:} & μ - η^{- 1} tr [X] \\ & \text{subject to:} & Q ⩽ 1 & & \text{subject to:} & μ ρ ⩽ σ + X . \\ & & tr [Q ρ] ⩾ η & & & \end{matrix}

153

The condition $T_{X \to X^{'}} (Γ) ⩽ α Γ$ implies that $α Γ ⩾ tr [Γ] | + ⟩ ⟨ + | + Δ (Γ) ⩾ | + ⟩ ⟨ + | - Δ_{-}$ . Hence, we have that $α^{- 1} | + ⟩ ⟨ + | ⩽ Γ + Δ_{-} / α$ . Hence, for any $0 < η ⩽ 1$ to be fixed later, $μ = α^{- 1}$ is feasible for the dual problem (154) defining the hypothesis testing entropy $D_{H}^{η} (| + ⟩ ⟨ + | ‖ Γ)$ , and $e^{- D_{H}^{η} (| + ⟩ ⟨ + | ‖ Γ)} ⩾ α^{- 1} - tr [Δ_{-} / α] / η ⩾ α^{- 1} (1 - 4 ϵ / η)$ . Thus, we have $ln (α) ⩾ D_{H}^{η} (| + ⟩ ⟨ + | ‖ Γ) + ln (1 - 4 ϵ / η)$ . Choosing $η = 8 ϵ$ yields $ln (1 - 4 ϵ / η) = - ln (2)$ .

On the other hand, by definition we have $e^{- D_{H}^{η} (| + ⟩ ⟨ + | ‖ Γ)} ⩽ tr [Q Γ] / η$ for any $0 ⩽ Q ⩽ 1$ satisfying $tr [Q | + ⟩ ⟨ + |] ⩾ η$ ; with $Q = 2 η | 1 ⟩ ⟨ 1 |$ we obtain $e^{- D_{H}^{η} (| + ⟩ ⟨ + | ‖ Γ)} ⩽ 2 e^{- β E_{1}}$ and thus $D_{H}^{η} (| + ⟩ ⟨ + | ‖ Γ) ⩾ β E_{1} - ln (2)$ .

Then, $ln (α) ⩾ - ln (2) + β E_{1} - ln (2) = - 2 ln (2) + β E_{1}$ . Now let $α$ be the optimal candidate in the coherent relative entropy ${\hat{D}}_{X \to X^{'}}^{ϵ} (ρ_{X^{'} R_{X}} ‖ Γ, Γ) = - ln (α)$ . We finally see that the transformation of the maximally mixed state $1_{X} / 2$ to $| + ⟩$ may require arbitrarily much energy if $E_{1} \to \infty$ , even for a small $ϵ > 0$ , since

\begin{matrix} \text{energy cost} = - β^{- 1} {\hat{D}}_{X \to X^{'}}^{ϵ} (ρ_{X^{'} R_{X}} ‖ Γ, Γ) = β^{- 1} ln (α) ⩾ E_{1} - 2 β^{- 1} ln (2) . \end{matrix}

154
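The arithmetic of this appendix can be traced with a few lines of code. The sketch below evaluates the primal candidate $Q = 2 η | 1 ⟩ ⟨ 1 |$ and the resulting lower bound on the energy cost for several values of $E_{1}$; the function names and the sample values of $β$ and $ϵ$ are illustrative only.

```python
import math

def energy_cost_lower_bound(e1, beta):
    """Lower bound E_1 - 2 ln(2) / beta on the energy cost."""
    return e1 - 2 * math.log(2) / beta

def hypothesis_test_candidate(e1, beta, eta):
    """Evaluate Q = 2*eta*|1><1| in the primal problem for D_H^eta(|+><+| || Gamma).

    Returns tr[Q |+><+|] (which must be at least eta) and the resulting
    lower bound -ln(tr[Q Gamma] / eta) on D_H^eta.
    """
    overlap = 2 * eta * 0.5                        # |<1|+>|^2 = 1/2
    value = 2 * eta * math.exp(-beta * e1) / eta   # tr[Q Gamma] / eta
    return overlap, -math.log(value)

beta, eps = 1.0, 0.01
eta = 8 * eps                                      # the choice made in the text
for e1 in (1.0, 5.0, 25.0):
    overlap, d_h_bound = hypothesis_test_candidate(e1, beta, eta)
    ln_alpha_bound = d_h_bound + math.log(1 - 4 * eps / eta)
    print(f"E1 = {e1:5.1f}: D_H >= {d_h_bound:.4f}, "
          f"energy cost >= {ln_alpha_bound / beta:.4f}")
```

The candidate requires $2 η ⩽ 1$ so that $Q ⩽ 1$, which holds here since $η = 0.08$.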

E. Universal Conditional Typical Projector for Trivial Hamiltonians

In the case of trivial Hamiltonians, Definition 6.1 can be simplified. We call the corresponding object a universal conditional typical projector.

Definition E.1

Consider two systems with Hilbert spaces $H_{A}, H_{B}$ and let $s \in R$ . We define a universal conditional typical projector $P_{A^{n} B^{n}}^{s, δ}$ with parameter $δ > 0$ as a projector acting on ${(H_{A} \otimes H_{B})}^{\otimes n}$ such that:

(i)
There exists $η > 0$ independent of n such that for any quantum state $ρ_{AB}$ with $H {(A | B)}_{ρ} ⩽ s$ , we have
$\begin{matrix} tr [P_{A^{n} B^{n}}^{s, δ} ρ_{AB}^{\otimes n}] ⩾ 1 - poly (n) exp (- n η) ; \end{matrix}$ 155
(ii)
${tr}_{A^{n}} [P_{A^{n} B^{n}}^{s, δ}] ⩽ poly (n) e^{n (s + 2 δ)} 1_{B^{n}}$ .

Observe that we choose to define the object in Definition E.1 as a projector whereas we only require the object in Definition 6.1 to be an operator of norm at most 1. The reason is that while we can prove that a projector satisfying the conditions of Definition E.1 exists, we are currently not able to guarantee the existence of a projector satisfying the criteria of Definition 6.1.

Proposition E.2

Consider two systems A, B and let $s \in R$ . For any $δ > 0$ and $n \in N$ there exists a universal conditional typical projector $P_{A^{n} B^{n}}^{s, δ}$ that is permutation-invariant.

The proof of Proposition E.2 is developed in the rest of this appendix. To understand why the projector of Definition E.1 is conditional—as well as for a simple illustration of its use—consider the smooth Rényi-zero conditional max-entropy, also known as the smooth alternative max-entropy [11]. It is defined for a bipartite state $ρ_{AB}$ as

\begin{matrix} {\hat{H}}_{\max}^{ϵ} {(A | B)}_{ρ} = min_{\hat{ρ} \approx_{ϵ} ρ} ln ‖ {tr}_{A} [Π_{AB}^{{\hat{ρ}}_{AB}}] ‖_{\infty}, \end{matrix}

156

where $Π_{AB}^{{\hat{ρ}}_{AB}}$ is the projector onto the support of ${\hat{ρ}}_{AB}$ , and where the optimization ranges over sub-normalized states ${\hat{ρ}}_{AB}$ which are $ϵ$ -close to $ρ_{AB}$ in purified distance. We may understand the i.i.d. behaviour of this quantity as follows. For $δ > 0$ and $n \in N$ let $P_{A^{n} B^{n}}^{s, δ}$ be a universal conditional typical projector with $s = H {(A | B)}_{ρ}$ . We define ${\hat{ρ}}_{A^{n} B^{n}} = P^{s, δ} ρ_{AB}^{\otimes n} P^{s, δ}$ . Then, we have ${\hat{ρ}}_{A^{n} B^{n}} \approx_{ϵ} ρ_{AB}^{\otimes n}$ for $n \in N$ large enough, thanks to Property (i) and the gentle measurement lemma (Lemma B.2). On the other hand, using Property (ii) we have

\begin{matrix} \frac{1}{n} {\hat{H}}_{\max}^{ϵ} {(A^{n} | B^{n})}_{ρ^{\otimes n}} ⩽ \frac{1}{n} ln ‖ {tr}_{A^{n}} [P^{s, δ}] ‖_{\infty} ⩽ H {(A | B)}_{ρ} + 2 δ + \frac{1}{n} ln (poly (n)) \end{matrix}

157

such that taking the limits $n \to \infty$ and $δ \to 0$ , we get that the smooth Rényi-zero conditional entropy is asymptotically upper bounded by the von Neumann conditional entropy in the i.i.d. regime.

We proceed to construct a universal conditional typical projector based on ideas from Schur–Weyl duality. The construction presented here is similar to, and inspired by, techniques put forward in earlier work [22, 24–26, 47, 48].

Proof

(Proposition E.2) Let

\begin{matrix} P_{A^{n} B^{n}}^{s, δ} = \sum_{\begin{matrix} λ, λ^{'} : \\ \bar{H} (λ) - \bar{H} (λ^{'}) ⩽ s + 2 δ \end{matrix}} (1_{A^{n}} \otimes Π_{B^{n}}^{λ^{'}}) Π_{A^{n} B^{n}}^{λ}, \end{matrix}

158

where the respective projectors $Π_{B^{n}}^{λ^{'}}$ , $Π_{A^{n} B^{n}}^{λ}$ refer to Schur–Weyl decompositions of $H_{B}^{\otimes n}$ and of ${(H_{A} \otimes H_{B})}^{\otimes n}$ , respectively, $λ \in Young (d_{A} d_{B}, n)$ and $λ^{'} \in Young (d_{B}, n)$ . Observe that $P_{A^{n} B^{n}}^{s, δ}$ is a projector: Each term in the sum is a projector as a product of two commuting projectors (Lemma 2.1), and each term of the sum acts on a different subspace of ${(H_{A} \otimes H_{B})}^{\otimes n}$ . The projector $P_{A^{n} B^{n}}^{s, δ}$ corresponds to the measurement of the two commuting POVMs $\{Π_{A^{n} B^{n}}^{λ}\}$ and $\{Π_{B^{n}}^{λ^{'}}\}$ , and testing whether or not the event $\bar{H} (λ) - \bar{H} (λ^{'}) ⩽ s + 2 δ$ is satisfied. Also by construction $P_{A^{n} B^{n}}^{s, δ}$ is permutation-invariant.

For any $ρ_{AB}$ with $H {(A | B)}_{ρ} ⩽ s$ , the probability that the measurement of $P_{A^{n} B^{n}}^{s, δ}$ fails on $ρ_{AB}^{\otimes n}$ can be upper bounded as follows. The passing event $\bar{H} (λ) - \bar{H} (λ^{'}) ⩽ s + 2 δ$ is implied in particular by the two events (a) $\bar{H} (λ) ⩽ H {(A B)}_{ρ} + δ$ and (b) $\bar{H} (λ^{'}) ⩾ H {(B)}_{ρ} - δ$ happening simultaneously, recalling that $H {(A B)}_{ρ} - H {(B)}_{ρ} = H {(A | B)}_{ρ} ⩽ s$ . The probability of event (a) failing is

\begin{matrix} Pr [\bar{H} (λ) > H {(A B)}_{ρ} + δ] ⩽ poly (n) exp (- n η) \end{matrix}

159

as given by Proposition 2.1, and similarly for event (b)

\begin{matrix} Pr [\bar{H} (λ^{'}) < H {(B)}_{ρ} - δ] ⩽ poly (n) exp (- n η) . \end{matrix}

160

We can use the same $η$ in both cases by picking the lesser of the two values given by Proposition 2.1, if necessary. Note furthermore that $η > 0$ does not depend on $ρ$ . Hence with this $η$ , for any $ρ_{AB}$ we have

\begin{matrix} tr [P_{A^{n} B^{n}}^{s, δ} ρ_{AB}^{\otimes n}] ⩾ 1 - poly (n) exp (- n η) \end{matrix}

161

as required.

For the second property, we use Lemma 2.2 to write

\begin{matrix} {tr}_{A^{n}} [P_{A^{n} B^{n}}^{s, δ}] & = \sum_{\begin{matrix} λ, λ^{'} : \\ \bar{H} (λ) - \bar{H} (λ^{'}) ⩽ s + 2 δ \end{matrix}} Π_{B^{n}}^{λ^{'}} {tr}_{A^{n}} [Π_{A^{n} B^{n}}^{λ}] Π_{B^{n}}^{λ^{'}} \\ ⩽ \sum_{\begin{matrix} λ, λ^{'} : \\ \bar{H} (λ) - \bar{H} (λ^{'}) ⩽ s + 2 δ \end{matrix}} poly (n) e^{n (\bar{H} (λ) - \bar{H} (λ^{'}))} 1_{B^{n}} \\ ⩽ poly (n) e^{n (s + 2 δ)} 1_{B^{n}} \end{matrix}

162

recalling that there are only $poly (n)$ many possible Young diagrams and hence at most so many terms in the sum. $□$
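The counting argument that closes this proof can be made concrete. The sketch below enumerates Young diagrams with n boxes and at most d rows, checks the crude bound ${(n + 1)}^{d}$ on their number, and evaluates $\bar{H} (λ)$ under the assumption that it denotes the Shannon entropy of the normalized diagram $λ / n$ in nats; the function names are illustrative.

```python
import math

def young_diagrams(n, d):
    """All partitions of n into at most d non-increasing parts, padded with zeros."""
    def rec(remaining, max_part, rows_left):
        if rows_left == 0:
            if remaining == 0:
                yield ()
            return
        for part in range(min(remaining, max_part), -1, -1):
            for rest in rec(remaining - part, part, rows_left - 1):
                yield (part,) + rest
    return list(rec(n, n, d))

def diagram_entropy(lam):
    """Entropy in nats of the normalized diagram lam / n (assumed meaning of H-bar)."""
    n = sum(lam)
    return -sum((k / n) * math.log(k / n) for k in lam if k > 0)

d = 3
for n in (6, 12, 24):
    diagrams = young_diagrams(n, d)
    largest = max(diagram_entropy(lam) for lam in diagrams)
    print(f"n = {n:2d}: {len(diagrams):3d} diagrams (bound {(n + 1) ** d}), "
          f"max entropy {largest:.4f} <= ln d = {math.log(d):.4f}")
```

The number of pairs $(λ, λ^{'})$ in the sum is therefore also polynomial in n, which is all the final inequality uses.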

F. Universal Conditional Erasure for n Copies and Trivial Hamiltonians

Corollary F.1

(Thermodynamic protocol for universal conditional erasure for n copies). Let S, M be systems and let $σ_{S}$ be the maximally mixed state on S. Let $s < ln (d_{S})$ , where $d_{S}$ is the dimension of S, and let $δ > 0$ be small enough. Let $n \in N$ be large enough. Let J be a large enough information battery and let m be any number with $m ⩽ n (ln (d_{S}) - s - 3 δ)$ such that $e^{m}$ is an integer.

Then, there exists $η^{'} > 0$ and a thermal operation $R_{S^{n} M^{n} J \to S^{n} M^{n} J}$ acting on the systems $S^{n} M^{n} J$ , such that the effective work process $T_{S^{n} M^{n} \to S^{n} M^{n}}$ of $R_{S^{n} M^{n} J \to S^{n} M^{n} J}$ with respect to the battery states $(τ_{J}^{m}, {| 0 ⟩}_{J})$ is a universal conditional $(poly (n) e^{- n η^{'}})$ -erasure process resetting $S^{n}$ to the state $σ_{S}^{\otimes n}$ with respect to the set of states $S_{S^{n} M^{n}}^{'}$ , where $S_{S^{n} M^{n}}^{'}$ is the convex hull of $S_{S^{n} M^{n}} = \{ρ_{SM}^{\otimes n} : H {(S | M)}_{ρ} ⩽ s\}$ .

The case where $s = ln (d_{S})$ is uninteresting as we cannot hope to extract any work. In such cases one can simply set $m = 0$ and take $R_{S^{n} M^{n} J}$ to be the thermal operation that completely thermalizes $S^{n}$ .

Proof

This is in fact a relatively straightforward application of Proposition 7.1 over n copies of SM. Let $P_{S^{n} M^{n}}^{s, δ}$ be given by Proposition E.2. We seek $κ, κ^{'}$ that satisfy (87). We can choose $κ = poly (n) exp \{- n η (δ)\}$ thanks to Definition E.1. Furthermore for any $ρ_{SM}^{\otimes n} \in S_{S^{n} M^{n}}$ we have

\begin{matrix} tr [P_{S^{n} M^{n}} {(\frac{1_{S}}{d_{S}} \otimes ρ_{M})}^{\otimes n}] ⩽ poly (n) e^{n (s + 2 δ)} d_{S}^{- n} tr [ρ_{M}^{\otimes n}] & = poly (n) e^{- n (ln (d_{S}) - s - 2 δ)} \\ & ⩽ \frac{poly (n) e^{- n δ}}{e^{m}} \end{matrix}

163

and thus we may take $κ^{'} = poly (n) e^{- n δ}$ . Finally, $η^{'}$ is given as $η^{'} = min \{δ, η (δ)\}$ . $□$
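For concreteness, the admissible battery parameter m in Corollary F.1 can be computed as follows. The function name `battery_charge` and the sample values are illustrative assumptions; the sketch simply picks the largest m below the stated bound for which $e^{m}$ is an integer.

```python
import math

def battery_charge(n, d_s, s, delta):
    """Largest m <= n (ln d_S - s - 3 delta) such that e^m is an integer.

    Returns the pair (m, bound). Suitable for moderate n only, since
    exp(bound) must fit in a float.
    """
    bound = n * (math.log(d_s) - s - 3 * delta)
    if bound < 0:
        raise ValueError("no admissible m: need s + 3*delta <= ln(d_S)")
    levels = math.floor(math.exp(bound))     # e^m, a positive integer
    return math.log(levels), bound

m, bound = battery_charge(n=20, d_s=2, s=0.3, delta=0.01)
print(f"m = {m:.4f} <= {bound:.4f}, i.e. m / n = {m / 20:.4f} per copy")
```

As n grows, $m / n$ approaches $ln (d_{S}) - s - 3 δ$ because the rounding changes m by less than $ln (2)$.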

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Goold J, Huber M, Riera A, del Rio L, Skrzypczyk P. The role of quantum information in thermodynamics—a topical review. J. Phys. A: Math. Theor. 2016;49(14):143001. doi: 10.1088/1751-8113/49/14/143001. [DOI] [Google Scholar]
2.Brandão FGSL, Horodecki M, Oppenheim J, Renes JM, Spekkens RW. Resource theory of quantum states out of thermal equilibrium. Phys. Rev. Lett. 2013;111(25):250404. doi: 10.1103/PhysRevLett.111.250404. [DOI] [PubMed] [Google Scholar]
3.Brandão F, Horodecki M, Ng N, Oppenheim J, Wehner S. The second laws of quantum thermodynamics. Proc. Natl. Acad. Sci. 2015;112(11):3275. doi: 10.1073/pnas.1411728112. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Chitambar E, Gour G. Quantum resource theories. Rev. Mod. Phys. 2019;91(2):025001. doi: 10.1103/RevModPhys.91.025001. [DOI] [Google Scholar]
5.Janzing D, Wocjan P, Zeier R, Geiss R, Beth T. Thermodynamic cost of reliability and low temperatures: tightening Landauer’s principle and the second law. Int. J. Theor. Phys. 2000;39(12):2717. doi: 10.1023/A:1026422630734. [DOI] [Google Scholar]
6.Faist P, Oppenheim J, Renner R. Gibbs-preserving maps outperform thermal operations in the quantum regime. New J. Phys. 2015;17(4):043003. doi: 10.1088/1367-2630/17/4/043003. [DOI] [Google Scholar]
7.Åberg J. Truly work-like work extraction via a single-shot analysis. Nat. Commun. 2013;4:1925. doi: 10.1038/ncomms2712. [DOI] [PubMed] [Google Scholar]
8.Horodecki M, Oppenheim J. Fundamental limitations for quantum and nanoscale thermodynamics. Nat. Commun. 2013;4:2059. doi: 10.1038/ncomms3059. [DOI] [PubMed] [Google Scholar]
9.Renner, R.: Security of quantum key distribution. Ph.D. thesis, ETH Zürich (2005). 10.3929/ethz-a-005115027
10.Tomamichel, M.: A framework for non-asymptotic quantum information theory. Ph.D. thesis, ETH Zurich (2012). 10.3929/ethz-a-7356080
11.Tomamichel M. Quantum Information Processing with Finite Resources. Berlin: Springer; 2016. [Google Scholar]
12.Chubb CT, Tomamichel M, Korzekwa K. Beyond the thermodynamic limit: finite-size corrections to state interconversion rates. Quantum. 2018;2:108. doi: 10.22331/q-2018-11-27-108. [DOI] [Google Scholar]
13.Faist P, Dupuis F, Oppenheim J, Renner R. The minimal work cost of information processing. Nat. Commun. 2015;6:7669. doi: 10.1038/ncomms8669. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.îrstoiu, C., Jennings, D.: Global and local gauge symmetries beyond lagrangian formulations (2017). arXiv:1707.09826
15. Ben Dana K, García Díaz M, Mejatty M, Winter A. Resource theory of coherence: beyond states. Phys. Rev. A. 2017;95(6):062327. doi: 10.1103/PhysRevA.95.062327.
16. Faist P, Renner R. Fundamental work cost of quantum processes. Phys. Rev. X. 2018;8(2):021011. doi: 10.1103/PhysRevX.8.021011.
17. Smith, G.: Quantum channel capacities. In: IEEE Information Theory Workshop, pp. 1–5 (2010). doi: 10.1109/CIG.2010.5592851
18. Christandl M, König R, Renner R. Postselection technique for quantum channels with applications to quantum cryptography. Phys. Rev. Lett. 2009;102(2):020504. doi: 10.1103/PhysRevLett.102.020504.
19. Anshu A, Jain R, Warsi NA. Building blocks for communication over noisy quantum networks. IEEE Trans. Inf. Theory. 2019;65(2):1287. doi: 10.1109/TIT.2018.2851297.
20. Faist P, Berta M, Brandão F. Thermodynamic capacity of quantum processes. Phys. Rev. Lett. 2019;122(20):200601. doi: 10.1103/PhysRevLett.122.200601.
21. Navascués M, García-Pintos LP. Nonthermal quantum channels as a thermodynamical resource. Phys. Rev. Lett. 2015;115(1):010405. doi: 10.1103/PhysRevLett.115.010405.
22. Bennett CH, Devetak I, Harrow AW, Shor PW, Winter A. The quantum reverse Shannon theorem and resource tradeoffs for simulating quantum channels. IEEE Trans. Inf. Theory. 2014;60(5):2926. doi: 10.1109/TIT.2014.2309968.
23. Berta M, Christandl M, Renner R. The quantum reverse Shannon theorem based on one-shot information theory. Commun. Math. Phys. 2011;306(3):579. doi: 10.1007/s00220-011-1309-7.
24. Harrow, A.W.: Applications of coherent classical communication and the Schur transform to quantum information theory. Ph.D. thesis, Massachusetts Institute of Technology (2005)
25. Haah J, Harrow AW, Ji Z, Wu X, Yu N. Sample-optimal tomography of quantum states. IEEE Trans. Inf. Theory. 2017;63(9):5628. doi: 10.1109/TIT.2017.2719044.
26. Nötzel, J.: A solution to two party typicality using representation theory of the symmetric group (2012). arXiv:1209.5094
27. Tomamichel M, Colbeck R, Renner R. Duality between smooth min-and max-entropies. IEEE Trans. Inf. Theory. 2010;56(9):4674. doi: 10.1109/TIT.2010.2054130.
28. Nielsen MA, Chuang IL. Quantum Computation and Quantum Information. Cambridge: Cambridge University Press; 2000.
29. Szilard L. Über die Entropieverminderung in einem thermodynamischen System bei Eingriffen intelligenter Wesen. Z. Phys. 1929;53(11–12):840. doi: 10.1007/BF01341281.
30. Boyd SP, Vandenberghe L. Convex Optimization. Cambridge: Cambridge University Press; 2004.
31. Pitchford, A., Granade, C., Nation, P.D., Johansson, R.J.: QuTiP 4.1.0 (2016). http://qutip.org
32. Johansson J, Nation P, Nori F. QuTiP 2: a Python framework for the dynamics of open quantum systems. Comput. Phys. Commun. 2013;184(4):1234. doi: 10.1016/j.cpc.2012.11.019.
33. Andersen, M.S., Dahl, J., Vandenberghe, L.: CVXOPT 1.1.9 (2016). https://cvxopt.org/
34. Ramakrishnan N, Iten R, Scholz VB, Berta M. Computing quantum channel capacities. IEEE Trans. Inf. Theory. 2021;67(2):946. doi: 10.1109/TIT.2020.3034471.
35. Alicki, R.: Isotropic quantum spin channels and additivity questions (2004). arXiv:quant-ph/0402080
36. Devetak I, Junge M, King C, Ruskai MB. Multiplicativity of completely bounded p-norms implies a new additivity result. Commun. Math. Phys. 2006;266(1):37. doi: 10.1007/s00220-006-0034-0.
37. Holevo, A.S.: The entropy gain of quantum channels. In Proceedings of the 2011 IEEE International Symposium on Information Theory. IEEE, pp. 289–292 (2011). doi: 10.1109/ISIT.2011.6034107
38. Holevo AS. The entropy gain of infinite-dimensional quantum evolutions. Dokl. Math. 2010;82(2):730. doi: 10.1134/S1064562410050133.
39. Holevo AS. On the Choi-Jamiolkowski correspondence in infinite dimensions. Theor. Math. Phys. 2011;166(1):123. doi: 10.1007/s11232-011-0010-5.
40. Holevo AS. Quantum Systems, Channels, Information. Berlin: De Gruyter; 2012.
41. Buscemi F, Das S, Wilde MM. Approximate reversibility in the context of entropy gain, information gain, and complete positivity. Phys. Rev. A. 2016;93(6):062314. doi: 10.1103/PhysRevA.93.062314.
42. Gour, G., Wilde, M.M.: Entropy of a quantum channel: definition, properties, and application. In Proceedings of the 2020 IEEE International Symposium on Information Theory. IEEE, pp. 1903–1908 (2020). doi: 10.1109/ISIT44484.2020.9174135
43. Berta M, Renes JM, Wilde MM. Identifying the information gain of a quantum measurement. IEEE Trans. Inf. Theory. 2014;60(12):7987. doi: 10.1109/TIT.2014.2365207.
44. Faist, P.: Quantum coarse-graining: An information-theoretic approach to thermodynamics. Ph.D. thesis, ETH Zürich (2016). doi: 10.3929/ethz-a-010695790
45. Morgan C, Winter A. “Pretty strong” converse for the quantum capacity of degradable channels. IEEE Trans. Inf. Theory. 2014;60(1):317. doi: 10.1109/TIT.2013.2288971.
46. Tomamichel M, Colbeck R, Renner R. A fully quantum asymptotic equipartition property. IEEE Trans. Inf. Theory. 2009;55(12):5840. doi: 10.1109/TIT.2009.2032797.
47. Bjelakovic, I., Siegmund-Schultze, R.: Quantum Stein’s lemma revisited, inequalities for quantum entropies, and a concavity theorem of Lieb (2003). arXiv:quant-ph/0307170
48. Berta M, Lemm M, Wilde MM. Monotonicity of quantum relative entropy and recoverability. Quantum Inf. Comput. 2015;15(15&16):1333.
49. Anshu A, Devabathini VK, Jain R. Quantum communication using coherent rejection sampling. Phys. Rev. Lett. 2017;119(12):120506. doi: 10.1103/PhysRevLett.119.120506.
50. Anshu A, Jain R, Warsi NA. A one-shot achievability result for quantum state redistribution. IEEE Trans. Inf. Theory. 2018;64(3):1425. doi: 10.1109/TIT.2017.2776112.
51. Anshu A, Jain R, Warsi NA. A generalized quantum Slepian–Wolf. IEEE Trans. Inf. Theory. 2018;64(3):1436. doi: 10.1109/TIT.2017.2786348.
52. Anshu A, Jain R, Warsi NA. Convex-split and hypothesis testing approach to one-shot quantum measurement compression and randomness extraction. IEEE Trans. Inf. Theory. 2019;65(9):5905. doi: 10.1109/TIT.2019.2915242.
53. Majenz C, Berta M, Dupuis F, Renner R, Christandl M. Catalytic decoupling of quantum information. Phys. Rev. Lett. 2017;118(8):080503. doi: 10.1103/PhysRevLett.118.080503.
54. Anshu A, Berta M, Jain R, Tomamichel M. Partially smoothed information measures. IEEE Trans. Inf. Theory. 2020;66(8):5022. doi: 10.1109/TIT.2020.2981573.
55. Berta M, Majenz C. Disentanglement cost of quantum states. Phys. Rev. Lett. 2018;121:190503. doi: 10.1103/PhysRevLett.121.190503.
56. del Rio L, Åberg J, Renner R, Dahlsten O, Vedral V. The thermodynamic meaning of negative entropy. Nature. 2011;474(7349):61. doi: 10.1038/nature10123.
57. Hayashi M, Nagaoka H. General formulas for capacity of classical-quantum channels. IEEE Trans. Inf. Theory. 2003;49(7):1753. doi: 10.1109/TIT.2003.813556.
58. Scutaru H. Some remarks on covariant completely positive linear maps on C*-algebras. Rep. Math. Phys. 1979;16(1):79. doi: 10.1016/0034-4877(79)90040-5.
59. Keyl M, Werner RF. Optimal cloning of pure states, testing single clones. J. Math. Phys. 1999;40(7):3283. doi: 10.1063/1.532887.
60. Marvian Mashhad, I.: Symmetry, asymmetry and quantum information. Ph.D. thesis, University of Waterloo (2012). https://hdl.handle.net/10012/7088
61. Fang K, Wang X, Tomamichel M, Berta M. Quantum channel simulation and the channel’s smooth max-information. IEEE Trans. Inf. Theory. 2020;66(4):2129. doi: 10.1109/TIT.2019.2943858.
62. Gour G, Winter A. How to quantify a dynamical quantum resource. Phys. Rev. Lett. 2019;123:150401. doi: 10.1103/PhysRevLett.123.150401.
63. Dutil, N.: Multiparty quantum protocols for assisted entanglement distillation. Ph.D. thesis, McGill University, Montréal (2011)
64. Drescher, L., Fawzi, O.: On simultaneous min-entropy smoothing. In 2013 IEEE International Symposium on Information Theory. IEEE, pp. 161–165 (2013). doi: 10.1109/ISIT.2013.6620208
65. Sen, P.: A one-shot quantum joint typicality lemma (2018). arXiv:1806.07278
66. Anshu A, Berta M, Jain R, Tomamichel M. A minimax approach to one-shot entropy inequalities. J. Math. Phys. 2019;60:122201. doi: 10.1063/1.5126723.
67. Fannes M. A continuity property of the entropy density for spin lattice systems. Commun. Math. Phys. 1973;31(4):291. doi: 10.1007/BF01646490.
68. Audenaert KMR. A sharp continuity estimate for the von Neumann entropy. J. Phys. A: Math. Theor. 2007;40(28):8127. doi: 10.1088/1751-8113/40/28/S18.
69. Berta M, Christandl M, Colbeck R, Renes JM, Renner R. The uncertainty principle in the presence of quantum memory. Nat. Phys. 2010;6(9):659. doi: 10.1038/nphys1734.
70. Wang L, Renner R. One-shot classical-quantum capacity and hypothesis testing. Phys. Rev. Lett. 2012;108(20):200501. doi: 10.1103/PhysRevLett.108.200501.
71. Tomamichel M, Hayashi M. A hierarchy of information quantities for finite block length analysis of quantum tasks. IEEE Trans. Inf. Theory. 2013;59(11):7693. doi: 10.1109/TIT.2013.2276628.
72. Matthews W, Wehner S. Finite blocklength converse bounds for quantum channels. IEEE Trans. Inf. Theory. 2014;60(11):7317. doi: 10.1109/TIT.2014.2353614.
73. Buscemi F, Datta N. The quantum capacity of channels with arbitrarily correlated noise. IEEE Trans. Inf. Theory. 2010;56(3):1447. doi: 10.1109/TIT.2009.2039166.
74. Brandão FGSL, Datta N. One-shot rates for entanglement manipulation under non-entangling maps. IEEE Trans. Inf. Theory. 2011;57(3):1754. doi: 10.1109/TIT.2011.2104531.
75. Dupuis, F., Kraemer, L., Faist, P., Renes, J.M., Renner, R.: Generalized entropies. In: XVIIth international congress on mathematical physics, pp. 134–153 (2013). doi: 10.1142/9789814449243_0008
76. Watrous J. Semidefinite programs for completely bounded norms. Theory Comput. 2009;5(11):217. doi: 10.4086/toc.2009.v005a011.
