Quantum Data Processing
Quantum Data Processing
Abstract
We prove a data processing inequality for quantum communication channels, which states that processing
a received quantum state may never increase the mutual information between input and output states.
I. Introduction
Our starting point was the question whether Holevo’s upper bound could be viewed as
arXiv:quant-ph/9907081v1 26 Jul 1999
a special case of a more general theorem, the quantum data processing inequality (that
should hold). An inequality of this kind states for an information transfer via (quantum)
communication channels that if a received state is “processed”, this may never increase
the mutual information between input and output states.
We soon realized that a quantum data processing inequality is a corollary of Uhlmann’s
Monotonicity Theorem (see Sec. 5) which is a generalization of a theorem of Lieb ([7]).
Like in the proof of the classical data processing inequality (which plays around with con-
ditional mutual information1 ), convexity properties play a central role. Here, we will need
such properties “for operators” (see Sec. 3).
The next theorem shows that the latter examples give essentially (i.e. up to limits) all
Pick functions:
The authors are with Fakultät für Mathematik, Universität Bielefeld, Postfach 100131, 33501 Bielefeld.
E-mail: ahlswede@mathematik.uni-bielefeld.de, loeber@mathematik.uni-bielefeld.de
1
cf. [2], p. 55 or [1], p. 32
2
This theorem is from [4], p. 20 ff. Its proof transforms the Pick function with an ap-
propriate Möbius transformation (and its inverse) to a function which maps the unit disc
into itself and which has a positive real part. This real part is a positive harmonic function.
Definition 2.5: For an open interval (a, b) ⊆ R let C(a, b) , H + ∪ H − ∪ (a, b) and
Remark 2.6: P (a, b) is a convex cone. Moreover, ϕ ∈ P (a, b) implies that ϕ|(a,b) is a
monotonically increasing real function. (As φ maps H + into H + and H − into H − it has
d
to be real on the real axis. Let now ϕ = u + iv and z = x + iy. By definition dy v(x) ≥ 0,
d
and the Cauchy-Riemann differential equations imply that dx u(x) ≥ 0, too.)
Remark 2.8: Let ϕ ∈ P a Pick function and µ its corresponding Borel measure (cf. (1)).
Let (a, b) ∈ R an interval. Then:
This remark is again from [4] (p. 26), like the next example (p. 27):
√ R0 √
x
Example 2.9: z= √1 + ( 1 − x
) dx
2 −∞ x−z x2 +1 π
notation we have
a1 0 f (a1 ) 0
A= .. f
7→ f (A) = .. .
. .
0 an 0 f (an )
B≤C ⇔ B −1 ≥ C −1 ⇔ −B −1 ≤ −C −1 .
Theorem 3.6: Let f ≥ 0 a continuous and operator monotone function on [0, ∞), and
let x ∈ L(H)+ and a ∈ L(H) with ||a||op ≤ 1. Then:
1 1
Proof: Let b , (1 − aa∗ ) 2 and c , (1 − a∗ a) 2 , and define operators
x 0 a b
X, , U,
0 0 c −a∗
Remark 3.7: We call functions f that fulfill (2) operator concave. Indeed, a non-negative
continuous function on [0, ∞) that is operator concave is also operator monotone (cf. [5],
p. 232). Furthermore, operator concave functions f P fulfill Jensen’s Inequality:
+ ∗
P ∗
∀ x
P ∗ 1 , . . . , x n ∈ L(H) and a1 , . . . , an ∈ L(H) with i ai ai ≤ 1 it holds: f ( i ai xi ai ) ≥
a
i i f (x )a
i i .
Proof:
a∗i xi ai 0 · · · 0
P
x1 0
a1
i
0
.. , A , ... 0 ⇒ A∗ XA =
X , . ..
. 0
0 xn an
0
Therefore:
P ∗ P ∗
f ( i ai xi ai ) 0 · · · 0 i ai f (xi )ai 0 · · · 0
0 0
= f (A∗ XA) ≥ A∗ f (X)A =
.
.. .
..
0 (2)
0
0 0
Corollary 3.8: Let f ≥ 0 a continuous and operator monotone function on [0, ∞), and let
x ∈ L(H)+ and a : H′ → H a (lin.) contraction (||a||op ≤ 1). It holds: f (a∗ xa) ≥ a∗ f (x)a.
Example 3.9: For 0 ≤ µ ≤ 1 we have (a∗ xa)µ ≥ a∗ xµ a (with notation from Cor. 3.8).
Definition 4.1: A finite quantum system A is a finite dimensional C ∗ -algebra, i.e. a self-
adjoint subalgebra (with identity) of some L(H), where dim(H) < ∞.
5
A physical state (on this system) is an element A ∈ A+ , L(H)+ ∩A for which tr (A) = 1.
a1 0
are positive. A physical map of finite quantum systems A, B is a trace preserving and
completely positive C-linear map α∗ : A → B.
Fortunately, the completely positive maps have a nice representation by Stinespring’s The-
orem (see [3], p. 137 or [9] for a proof):
Theorem 4.4: Let α∗ : A → L(H) a completely positive map of finite quantum systems.
Then
α∗ (A) = V ∗ ρ(A)V (∀ A ∈ A)
for some representation ρ of A on a finite dimensional Hilbert space K and a linear map
V : H → K. Here, the representation ρ is an (algebra-)homomorphism with ρ(A∗ ) = ρ(A)∗
for all A ∈ A, and ρ(1) = 1.
then
tr (β(x)∗ S2t β(x)T21−t ) ≤ tr (x∗ S1t xT11−t ) ∀ 0 ≤ t ≤ 1, x ∈ A1 .
Proof: We remark that A1 and A2 are Hilbert spaces with inner product < a, b > =
tr (a∗ b).
1 1
Define a linear map V : A1 → A2 by aT12 7→ β(a)T22 for all a ∈ A1 .
V is a contraction, in fact:
1 1
||β(a)T22 ||2 = tr (T2 β(a)∗ β(a)) ≤ tr (T2 β(a∗ a)) ≤ tr (T1 a∗ a) = ||aT12 ||2 .
1
− 12
For i = 1, 2 define ∆i ∈ L(Ai )+ by ∆i (aTi2 ) , Si aTi for all a ∈ Ai .
1 1 1
− 21 1
− 12
(It is positive because < aTi |∆i |aTi > = < aTi |Si aTi
2 2 2
> = tr (Ti2 a∗ Si aTi ) =
tr (a∗ Si a) ≥ 0.)
1 1
−t
It holds ∆ti (aTi2 ) = Sit aTi2 (t ≥ 0), and V ∗ ∆2 V ≤ ∆1 :
1 1 1 1 1
− 21
< aT12 |V ∗ ∆2 V |aT12 > = < β(a)T22 |∆2 |β(a)T22 > = < β(a)T22 |S2 β(a)T2 >
1 1
= tr (β(α)∗ S2 β(a)) ≤ tr (S2 β(αα∗ )) ≤ tr (S1 αα∗ ) = < aT12 |∆1 |aT12 > .
So2 , V ∗ ∆t2 V ≤ (V ∗ ∆2 V )t ≤ ∆t1 , and
1 1 1 1
−t
tr (T22 β(x)∗ S2t β(x)T22 ) = < xT12 |V ∗ ∆t2 V |xT12 >
1 1 1 1
−t
≤ < xT12 |∆t1 |xT12 > = tr (T12 x∗ S1t xT12 ) .
Definition 5.3: Given a physical state A ∈ A+ we denote in the sequel its support,
i.e. the projector on the space spanned by the eigenvectors corresponding to non-zero
eigenvalues, by supp A.
Two physical states A, B ∈ A+ have divergence
(
tr (A(log A − log B)), if supp A ≤ supp B
D(A||B) ,
∞, otherwise.
2
We use Ex. 3.9 (like we promised above).
7
To reduce the claim to Theorem 5.4 notice that tr (α∗ (Ai ) · ) = α( tr (Ai · )) = tr (Ai α∗ (·)).
I(A; D∗ ◦ W∗ ) ≤ I(A; W∗ ) .
8
We remind the reader that Holevo’s result is from 1973 whereas Uhlmann’s Monotonicity
Theorem is from 1977. Both use analytical considerations for the proofs. There is also an
“elementary” proof of Holevo’s Upper Bound, using only information theoretical consid-
erations, in [11].
VII. Acknowledgments
We thank Andreas Winter for fruitful discussions about quantum information theory.
References
[1] Th. M. Cover, J. A. Thomas, “Elements of Information Theory,” Wiley-Interscience, New York, 1991.
[2] I. Csiszár, J. Körner, “Information Theory: Coding Theorems for Discrete Memoryless Systems”, Academic
Press, London, 1981.
[3] E. B. Davis, “Quantum Theory of Open Systems”, Academic Press, London, 1976.
[4] W. F. Donoghue (Jr.), “Monotone Matrix Functions and Analytic Continuation”, Springer, Berlin, 1970.
[5] F. Hansen, G. K. Pederson, “Jensen’s Inequality and Löwner’s Theorem”, Math. Ann., vol. 258, pp. 229-241,
1982.
[6] A. S. Holevo, “Bounds for the Quantity of Information Transmitted by a Quantum Communication Channel”,
Probl. Peredachi Inform., vol. 9, no. 3, pp. 3-11, 1973. (Engl. transl.: Probl. of Inf. Transm., vol. 9, no. 3,
pp. 177-183, 1973).
[7] E. H. Lieb, “Convex Trace Functions and the Wigner-Yanase-Dyson Conjecture”, Advan. Math., vol. 11,
pp. 267-288, 1973.
[8] M. Ohya, D. Petz, “Quantum Entropy and Its Use”, (Texts and Monographs in Physics), Springer, Berlin,
1993.
[9] W. F. Stinespring, “Positive Functions on C ∗ -algebras”, Proc. Am. Math. Soc., vol. 6, pp. 211-216, 1955.
[10] A. Uhlmann, “Relative Entropy and the Wigner-Yanase-Dyson-Lieb Concavity in an Interpolation Theory”,
Commun. math. Phys., vol. 54, pp. 21-32, 1977.
[11] A. Winter, “Coding Theorem and Strong Converse for Quantum Channels”, to appear in: IEEE Trans. In-
form. Theory, vol. 45, no. 7, 1999.