Hostname: page-component-cd9895bd7-gbm5v Total loading time: 0 Render date: 2024-12-27T08:34:12.627Z Has data issue: false hasContentIssue false

Nonexpansive and noncontractive mappings on the set of quantum pure states

Published online by Cambridge University Press:  17 January 2024

Michiya Mori
Affiliation:
Graduate School of Mathematical Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro-ku, Tokyo 153-8914, Japan (mmori@ms.u-tokyo.ac.jp) Interdisciplinary Theoretical and Mathematical Sciences Program (iTHEMS), RIKEN, 2-1 Hirosawa, Wako, Saitama 351-0198, Japan
Peter Šemrl
Affiliation:
Institute of Mathematics Physics and Mechanics, Jadranska 19, SI-1000 Ljubljana, Slovenia (peter.semrl@fmf.uni-lj.si)
Rights & Permissions [Opens in a new window]

Abstract

Wigner's theorem characterizes isometries of the set of all rank one projections on a Hilbert space. In metric geometry, nonexpansive maps and noncontractive maps are well-studied generalizations of isometries. We show that under certain conditions Wigner symmetries can be characterized as nonexpansive or noncontractive maps on the set of all projections of rank one. The assumptions required for such characterizations are injectivity or surjectivity and they differ in the finite and the infinite-dimensional case. Motivated by a recently obtained optimal version of Uhlhorn's generalization of Wigner's theorem, we also give a description of nonexpansive maps which satisfy a condition that is much weaker than surjectivity. Such maps do not need to be Wigner symmetries. The optimality of all presented results is shown by counterexamples.

Type
Research Article
Copyright
Copyright © The Author(s), 2024. Published by Cambridge University Press on behalf of The Royal Society of Edinburgh

1. Introduction and statement of main results

Throughout this note, $H$ is a complex Hilbert space and ${\mathcal {P}}(H)$ denotes the collection of rank one projections on $H$. We will always assume that $\dim H \ge 2$. In the mathematical foundations of quantum mechanics, projections of rank one represent pure states of a quantum system, and the quantity ${\rm tr}\, (PQ)$, the trace of the product $PQ$, $P,\,Q \in {\mathcal {P}}(H)$, is the so-called transition probability between two pure states. The famous Wigner's unitary–antiunitary theorem [Reference Wigner11] describes the general form of bijective transformations of ${\mathcal {P}}(H)$ which preserve the transition probability.

We may state Wigner's theorem in terms of isometry. Recall that a mapping $\phi$ from a metric space $(X,\, d)$ into itself is called an isometry if

\[ d(\phi (x),\phi (y)) = d(x,y), \quad x, y \in X. \]

In this note, we consider the distance $d(P,\, Q)=\|P-Q\|$ in ${\mathcal {P}}(H)$, where $\|\cdot \|$ denotes the operator norm. One can easily verify that $\|P-Q\| = \sqrt {1 - {\rm tr}\, (PQ)}$ holds true for every pair $P,\,Q \in {\mathcal {P}}(H)$. Hence, Wigner's theorem can be formulated in the following way: Let $\phi : {\mathcal {P}}(H) \to {\mathcal {P}} (H)$ be a bijective map such that

\[ \| \phi (P) - \phi (Q) \| = \| P-Q \|, \quad P,Q \in {\mathcal{P}} (H). \]

Then there exists a unitary or an antiunitary operator $U : H \to H$ such that

\[ \phi (P) = UPU^\ast, \quad P \in {\mathcal{P}} (H). \]

Every map of this form will be called a Wigner symmetry. After all, Wigner's theorem says that every surjective isometry from ${\mathcal {P}}(H)$ onto itself is a Wigner symmetry. More generally, it is known as a nonbijective version of Wigner's theorem that every isometry from ${\mathcal {P}}(H)$ into itself is of the form $\phi (P) = UPU^\ast$, $P \in {\mathcal {P}} (H)$, for some linear or conjugate-linear isometry $U : H \to H$ (see e.g. [Reference Gehér3]). For more detailed explanation with many other references and some recent improvements of Wigner's theorem, we refer to [Reference Fošner, Kuzma, Kuzma and Sze2, Reference Gehér and Mori4Reference Šemrl9].

In this note, we will deal with the question what happens if we replace isometries in Wigner's theorem with maps that are nonexpansive or noncontractive. Recall that a mapping $\phi$ from a metric space $(X,\, d)$ to itself is said to be nonexpansive (resp. noncontractive) if

\[ d(\phi (x),\phi (y)) \leq d(x,y) \quad (\textrm{resp.}\,d(\phi (x),\phi (y)) \geq d(x,y)), \ x, y \in X. \]

From the viewpoint of quantum mechanics, a nonexpansive (resp. noncontractive) mapping of ${\mathcal {P}}(H)$ can be interpreted as a transition probability nondecreasing (resp. nonincreasing) mapping.

The following is a well-known fact from metric geometry [Reference Burago, Burago and Ivanov1, Theorems 1.6.14 and 1.6.15].

Theorem 1.1 Let $(X,\, d)$ be a compact metric space.

  1. 1. A noncontractive map from $X$ into itself is a surjective isometry.

  2. 2. A surjective nonexpansive map from $X$ onto itself is an isometry.

Corollary 1.2 Assume that $\dim H<\infty$. A mapping $\phi \colon {\mathcal {P}}(H)\to {\mathcal {P}}(H)$ is a Wigner symmetry if one of the following holds.

  1. 1. $\phi$ is noncontractive.

  2. 2. $\phi$ is surjective and nonexpansive.

  3. 3. $\phi$ is injective and nonexpansive.

Proof. If $\dim H<\infty$, then $\mathcal {P}(H)$ is compact. Therefore, if $\phi$ satisfies one of the first two conditions, then theorem 1.1 and Wigner's theorem imply that $\phi$ is a Wigner symmetry. To consider the third condition, let $\phi \colon {\mathcal {P}}(H)\to {\mathcal {P}}(H)$ be an injective nonexpansive map. Then $\phi$ is an injective continuous map from a compact connected manifold $\mathcal {P}(H)$ into itself. By the invariance of domain, its image is open. By the continuity, its image is compact. Hence, $\phi$ is surjective and satisfies the second condition, so it is a Wigner symmetry.

We study nonexpansive or noncontractive maps of ${\mathcal {P}}(H)$ under various additional assumptions, including the case of $\dim H=\infty$. We start with a result on noncontractive maps.

Theorem 1.3 Let $\phi : \mathcal {P} (H) \to \mathcal {P} (H)$ be a surjective map such that

\[ \| \phi (P) - \phi (Q) \| \ge \| P - Q \| \]

for every pair $P,\,Q \in \mathcal {P}(H)$. Then $\phi$ is a Wigner symmetry.

Looking at the first item of corollary 1.2, in the finite-dimensional case we see that the same conclusion holds without the surjectivity assumption. However, in the infinite-dimensional case the surjectivity assumption is essential (example 4.1).

We next consider nonexpansive maps.

Theorem 1.4 Let $\phi : \mathcal {P} (H) \to \mathcal {P} (H)$ be a surjective map such that

\[ \| \phi (P) - \phi (Q) \| \le \| P - Q \| \]

for every pair $P,\,Q \in \mathcal {P}(H)$. Then $\phi$ is a Wigner symmetry.

In contrast, we will see in example 4.2 that an injective nonexpansive map of ${\mathcal {P}}(H)$ can be far from a Wigner symmetry if $\dim H=\infty$.

For $P,\,Q \in \mathcal {P}(H)$, we write $P \perp Q$ if $PQ=0$ which is equivalent to $\| P - Q\| = 1$. For every vector $x \in H$ of norm one, we denote by $P_x$ the rank one projection onto the linear span of $x$. We will say that a subset $\{ P_\alpha \, : \, \alpha \in \mathcal {J} \} \subset \mathcal {P}(H)$ is an orthogonal system of projections of rank one, OSP, if $P_\alpha \perp P_\beta$ whenever $\alpha \not =\beta$, and we will say that it is a complete orthogonal system of projections of rank one, COSP, if it is an OSP and there is no rank one projection $Q$ that is orthogonal to each $P_\alpha$. If $P_\alpha = P_{x_\alpha }$ for some unit vectors $x_\alpha$, $\alpha \in \mathcal {J}$, then $\{ P_\alpha \, : \, \alpha \in \mathcal {J} \} \subset \mathcal {P}(H)$ is a COSP if and only if $\{ x_\alpha \, : \, \alpha \in \mathcal {J} \}$ is an orthonormal basis of $H$.

Under the additional assumption $\dim H\geq 3$, Uhlhorn's improvement of Wigner's theorem [Reference Uhlhorn10] states that every bijective map $\phi : \mathcal {P} (H) \to \mathcal {P} (H)$ with the property that for every pair $P,\,Q \in {\mathcal {P}} (H)$ we have $P \perp Q \iff \phi (P) \perp \phi (Q)$ is a Wigner symmetry. This result can be further improved. A recently obtained optimal version [Reference Šemrl9] reads as follows: Assume that $H$ is separable and $\phi : \mathcal {P} (H) \to \mathcal {P} (H)$ is a map such that for every pair $P,\,Q \in {\mathcal {P}} (H)$ we have $P \perp Q \Rightarrow \phi (P) \perp \phi (Q)$. Suppose that there exists a COSP $\{ P_1,\, P_2,\, \ldots \} \subset {\mathcal {P}} (H)$ such that $\{ \phi (P_1 ),\, \phi (P_2 ),\, \ldots \} \subset {\mathcal {P}} (H)$ is a COSP. Then $\phi$ is a Wigner symmetry. In particular, if $3\leq \dim H<\infty$, then every map $\phi : \mathcal {P} (H) \to \mathcal {P} (H)$ with the property that for every pair $P,\,Q \in {\mathcal {P}} (H)$ we have $P \perp Q \Rightarrow \phi (P) \perp \phi (Q)$ is a Wigner symmetry.

Note that every noncontractive map $\phi : \mathcal {P} (H) \to \mathcal {P} (H)$ satisfies $P \perp Q \Rightarrow \phi (P) \perp \phi (Q)$, $P,\,Q \in {\mathcal {P}} (H)$. It follows that in the separable case a noncontractive map $\phi$ on ${\mathcal {P}} (H)$ with the property that there is a COSP that is mapped by $\phi$ onto some COSP must be a Wigner symmetry. The situation becomes much more interesting when we consider nonexpansive maps, as described below.

We first choose and fix an orthonormal basis $\{e_\alpha \,:\, \alpha \in \mathcal {J}\}$ in $H$. For each projection $P\in {\mathcal {P}}(H)$ onto ${\mathbb {C}}(\sum _{\alpha \in \mathcal {J}}c_\alpha e_\alpha )$, we define $\Phi (P)$ to be the projection onto ${\mathbb {C}}(\sum _{\alpha \in \mathcal {J}}|c_\alpha |e_\alpha )$. Clearly, $\Phi$ is well-defined, and it maps every element of COSP $\{ P_{e_\alpha }\, : \, \alpha \in \mathcal {J}\}$ to itself. We claim that $\Phi : {\mathcal {P}} (H) \to {\mathcal {P}} (H)$ is a nonexpansive map. Equivalently, we need to show that

\[ {\rm tr}\, (\Phi (P) \Phi (Q)) \ge {\rm tr}\, ( P Q ), \ \ \ P,Q \in \mathcal{P}(H). \]

This can be easily checked by the inequality

\[ \sqrt{{\rm tr}\, (P_vP_w)}= \left|\sum_{\alpha \in \mathcal{J}}v_\alpha\overline{w_\alpha}\right|\leq \sum_{\alpha \in \mathcal{J}}|v_\alpha||w_\alpha|=\sqrt{{\rm tr}\, (\Phi(P_v)\Phi(P_w))} \]

for each pair of unit vectors $v=\sum _{\alpha \in \mathcal {J}} v_\alpha e_\alpha,\, w=\sum _{\alpha \in \mathcal {J}}w_\alpha e_\alpha \in H$. Now we state the main result of this note.

Theorem 1.5 Assume that $\dim H \ge 3$. Let $\phi \colon {\mathcal {P}}(H)\to {\mathcal {P}}(H)$ be a mapping satisfying

\[ \|\phi(P)-\phi(Q)\|\leq \|P-Q\|,\quad P,Q\in {\mathcal{P}}(H), \]

and assume that there exists a COSP $\{ Q_\alpha \, : \, \alpha \in \mathcal {J} \} \subset {\mathcal {P}} (H)$ such that $\{ \phi (Q_\alpha ) \, : \, \alpha \in \mathcal {J} \}$ is a COSP in ${\mathcal {P}} (H)$. Then either $\phi$ is a Wigner symmetry, or there exist unitary operators $U,\,V : H \to H$ such that

(1.1)\begin{equation} \phi (P)= V \Phi (UPU^\ast) V^\ast , \quad P \in \mathcal{P}(H) . \end{equation}

In fact, we will prove a stronger result (proposition 3.3) on an arbitrary nonexpansive map $\phi \colon {\mathcal {P}}(H)\to {\mathcal {P}}(H)$ whose image contains a COSP. If $H$ is finite-dimensional, $\dim H = n$, then we identify $H$ with $\mathbb {C}^n$ and $\mathcal {P}(\mathbb {C}^n)$ with the set of all idempotent hermitian rank one $n \times n$ matrices. Then the map $\Phi \colon {\mathcal {P}}({\mathbb {C}}^n)\to {\mathcal {P}}({\mathbb {C}}^n)$ with respect to the standard orthonormal basis of ${\mathbb {C}}^n$ is given by the ‘entrywise absolute value’

(1.2)\begin{equation} \Phi(P)=[\, |p_{ij}|\, ]_{1\leq i,j\leq n}\text{ for } P=[p_{ij}]_{1\leq i,j\leq n}\in {\mathcal{P}}({\mathbb{C}}^n). \end{equation}

Every nonexpansive map $\phi \colon {\mathcal {P}}({\mathbb {C}}^n)\to {\mathcal {P}}({\mathbb {C}}^n)$ such that $\phi ({\mathcal {P}}({\mathbb {C}}^n))$ contains a COSP is either a Wigner symmetry or of form (1.1). It is somewhat surprising that the assumption $n \ge 3$ is indispensable. The general form of nonexpansive maps $\phi$ on $\mathcal {P}(\mathbb {C}^2)$ such that $\phi ({\mathcal {P}}({\mathbb {C}}^2))$ contains a COSP is described in proposition 2.7.

The next section will be devoted to some auxiliary results. In the third section, we will present proofs of the main results. The last section will be devoted to counterexamples showing the optimality of our main theorems.

2. Preliminary results

In what follows, we occasionally use the following elementary facts. They are well-known and easily verified, so we omit the proofs.

Lemma 2.1 A $2\times 2$ projection of rank one is of the form

\[ \begin{bmatrix} p & z\sqrt{p (1-p)} \\ \overline{z}\sqrt{p (1-p)} & 1-p \end{bmatrix} \]

for some real number $p$, $0 \leq p \leq 1$, and complex number $z$ of modulus one.

Lemma 2.2 Let $v,\, w\in H$ be unit vectors. Then the projections $P_v$ and $P_w$ in ${\mathcal {P}}(H)$ onto ${\mathbb {C}} v$ and ${\mathbb {C}} w$, respectively, satisfy

\[ {\rm tr}\, (P_vP_w) = |\langle v,w\rangle|^2,\quad \|P_v-P_w\| = \sqrt{1-|\langle v,w\rangle|^2}. \]

We start with a basic property of nonexpansive maps on ${\mathcal {P}}(H)$.

Lemma 2.3 Let $\phi : \mathcal {P} (H) \to \mathcal {P} (H)$ be a map such that

(2.1)\begin{equation} \| \phi (P) - \phi (Q) \| \le \| P - Q \|, \quad P,Q \in \mathcal{P}(H) , \end{equation}

and let $n$ be a positive integer. Assume that $\{P_1 ,\, \ldots,\, P_n\} \subset \mathcal {P}(H)$ is an OSP contained in the image of $\phi$. If $Q_1,\, \ldots,\, Q_n\in {\mathcal {P}}(H)$ satisfy $\phi (Q_j)=P_j$, $j=1,\, \ldots,\, n$, then $\{Q_1 ,\, \ldots,\, Q_n\} \subset \mathcal {P}(H)$ is also an OSP. Moreover, for every $Q \in \mathcal {P}(H)$ we have

\[ Q \le Q_1 + \cdots + Q_n \Rightarrow \phi (Q) \le P_1 + \cdots + P_n. \]

Proof. For $1\leq i< j\leq n$, we know

\[ 1=\|P_i-P_j\|=\|\phi(Q_i)-\phi(Q_j)\|\leq \|Q_i-Q_j\|\leq 1 , \]

which implies that $\{Q_1 ,\, \ldots,\, Q_n\} \subset \mathcal {P}(H)$ is an OSP.

Assume that $Q \le Q_1 + \cdots + Q_n$. Then $Q = Q( Q_1 + \cdots + Q_n )$, and consequently,

(2.2)\begin{equation} 1 = {\rm tr}\, Q = {\rm tr}\, (Q Q_1) + \cdots + {\rm tr}\, (Q Q_n) . \end{equation}

Let $\{ e_1 ,\, \ldots,\, e_n \}$ be an orthonormal set of vectors such that $P_j = P_{e_j}$, $j=1,\, \ldots,\, n$. Then with respect to the direct sum decomposition

\[ H = {\rm span} \, \{ e_1, \ldots , e_n \} \oplus \{ e_1, \ldots , e_n \}^\perp, \]

the rank one projections $P_j$, $j=1,\, \ldots,\, n$, have matrix representations

\[ P_j=\phi (Q_j) = \begin{bmatrix} E_{jj} & 0 \\ 0 & 0 \end{bmatrix}. \]

Here, $E_{jj}$ stands for the $n \times n$ matrix whose all entries are zero but the $(j,\,j)$-entry which is equal to $1$. Let

\[ \phi (Q) = \begin{bmatrix} \begin{bmatrix} q_{11} & q_{12} & \cdots & q_{1n} \\ q_{21} & q_{22} & \cdots & q_{2n} \\ \vdots & \vdots & \ddots & \vdots \\ q_{n1} & q_{n2} & \cdots & q_{nn} \end{bmatrix} & A \\ A^\ast & B \end{bmatrix} = \begin{bmatrix} R & A \\ A^\ast & B \end{bmatrix} \]

be the corresponding matrix representation of the rank one projection $\phi (Q)$.

Obviously, ${\rm tr}\, (\phi (Q) \phi (Q_j) ) = q_{jj}$. From (2.1) we infer that ${\rm tr}\, (QQ_j) \le {\rm tr}\, (\phi (Q) \phi (Q_j) )$, $j=1,\, \ldots,\, n$. Hence, by (2.2) we have

\[ 1 \le {\rm tr}\, (\phi (Q) \phi (Q_1)) + \cdots + {\rm tr}\, (\phi( Q) \phi( Q_n)) = q_{11} + \cdots + q_{nn} \le {\rm tr} \, \phi (Q) = 1, \]

and therefore, $q_{11} + \cdots + q_{nn} = 1$. This further implies that $B$ is a positive trace zero operator, and consequently, $B=0$. Using the fact that $\phi (Q)$ is positive, we conclude that $A= 0$. Hence, $R$ is a rank one projection and

\[ \phi (Q) = \begin{bmatrix} R & 0 \\ 0 & 0 \end{bmatrix}\le \begin{bmatrix} I & 0 \\ 0 & 0 \end{bmatrix} = P_1 + \cdots + P_n .\]

From the first part of this lemma, we immediately see the following.

Corollary 2.4 Let $\phi \colon {\mathcal {P}}(H)\to {\mathcal {P}}(H)$ be a nonexpansive mapping. Then the following two conditions are equivalent.

  • $\phi$ maps some OSP of ${\mathcal {P}}(H)$ onto a COSP of ${\mathcal {P}}(H)$.

  • There is a COSP of ${\mathcal {P}}(H)$ that is contained in the image of $\phi$.

If in addition $\dim H<\infty$, then the above conditions are also equivalent to the condition that $\phi$ maps some COSP of ${\mathcal {P}}(H)$ onto a COSP of ${\mathcal {P}}(H)$.

Let $S^1$ denote the set of all complex numbers of modulus 1. We consider a nonexpansive map on $S^1$, that is, a function $g : S^1 \to S^1$ satisfying

\[ \lvert g(z_1) - g(z_2)\rvert \leq \lvert z_1-z_2\rvert , \quad z_1, z_2 \in S^1. \]

Clearly, this condition is equivalent to

(2.3)\begin{equation} {\rm Re}\, (g(z_1)\overline{g(z_2)}) \geq {\rm Re}\, (z_1\overline{z_2}), \quad z_1, z_2 \in S^1. \end{equation}

We give an easy lemma.

Lemma 2.5 Let $g : S^1 \to S^1$ be nonexpansive. Then one of the following holds:

  • There is $c\in S^1$ such that $g(z)=cz$, $z\in S^1$.

  • There is $c\in S^1$ such that $g(z)=c\overline {z}$, $z\in S^1$.

  • The image $g(S^1)\subset S^1$ is compact, connected and contained in some closed half-circle.

Proof. If $g$ is surjective, then the second item of theorem 1.1 implies that $g$ is a surjective isometry. It follows that $g$ satisfies one of the first two options. Assume that $g$ is not surjective. The continuity of $g$ implies that $g(S^1)$ is compact and connected. We may assume by rotation that $g(1)=1$ and $g(S^1)=\{e^{i\theta }\,:\, 0\leq \theta \leq \theta _0\}$ with $0\leq \theta _0<2\pi$, without loss of generality. For every point $z$ of $S^1$, there is a path in $S^1$ from $1$ to $z$ with length at most $\pi$. Therefore, the assumption that $g$ is nonexpansive implies that $\theta _0\leq \pi$.

The following corollary will be used in the proof of proposition 3.1.

Corollary 2.6 Let $g : S^1 \to S^1$ be nonexpansive. If in addition $g$ is a group homomorphism, i.e. if $g(zw)=g(z)g(w)$, $z,\,w\in S^1$, then one of the following holds:

  • $g(z)=z$, $z\in S^1$.

  • $g(z)=\overline {z}$, $z\in S^1$.

  • $g(S^1)=\{1\}$.

Proof. Trivial in the case of the first two options in lemma 2.5. If the third option in lemma 2.5 holds, then $g(S^1)$ is connected and contained in some closed half-circle. This together with the additional assumption clearly shows that $g(S^1)=\{1\}$.

Let $g\colon S^1\to S^1$ be a nonexpansive map. We define a map $\tau : \mathcal {P}(\mathbb {C}^2) \to \mathcal {P}(\mathbb {C}^2)$ in the following way: We first set

\[ \tau (E_{11}) = E_{11} \text{ and } \tau (E_{22}) = E_{22}. \]

Let $P$ be an arbitrary rank one projection with $P\not = E_{11},\, E_{22}$. By lemma 2.1,

\[ P = \begin{bmatrix} p & z\sqrt{p (1-p)} \\ \overline{z}\sqrt{p (1-p)} & 1-p \end{bmatrix} \]

for some real number $p$, $0 < p < 1$, and $z\in S^1$, and for such $P$ we define

\[ \tau (P) =\begin{bmatrix} p & g(z)\sqrt{p (1-p)} \\ \overline{g(z)} \sqrt{p (1-p)} & 1-p \end{bmatrix}. \]

Any such map will be called a standard nonexpansive map of $\mathcal {P}(\mathbb {C}^2)$.

We claim that every such map satisfies

\[ \| \tau (P_1) - \tau (P_2) \| \le \| P_1 - P_2 \|, \quad P_1 ,P_2 \in \mathcal{P}(\mathbb{C}^2). \]

Equivalently, we need to show that

(2.4)\begin{equation} {\rm tr}\, (\tau (P_1 ) \tau (P_2 )) \ge {\rm tr}\, ( P_1 P_2 ), \ \ \ P_1 ,P_2 \in \mathcal{P}(\mathbb{C}^2). \end{equation}

It is trivial to verify the above inequality if any of $P_1 ,\,P_2$ is equal to any of $E_{11},\, E_{22}$. If

\[ P_j = \begin{bmatrix} p_j & z_j\sqrt{p_j (1-p_j)} \\ \overline{z_j}\sqrt{p_j (1-p_j)} & 1-p_j \end{bmatrix}, \ \ \ j=1,2, \]

with $0 < p_j < 1$, $z_j\in S^1$, $j=1,\,2$, then a straightforward calculation shows that the verification of (2.4) reduces to (2.3).

Note that $\tau$ is a Wigner symmetry if one of the first two options in lemma 2.5 holds. If the third option holds, then $\tau$ is noninjective and nonsurjective.

Proposition 2.7 Let $\phi : \mathcal {P}(\mathbb {C}^2) \to \mathcal {P}(\mathbb {C}^2)$ be a map satisfying

\[ \| \phi (P) - \phi (Q) \| \le \| P - Q \|, \quad P,Q \in \mathcal{P}(\mathbb{C}^2), \]

and assume that there exists a COSP of ${\mathcal {P}}({\mathbb {C}}^2)$ that is contained in the image of $\phi$. Then there exist $2 \times 2$ unitary matrices $U,\,V$ and a standard nonexpansive map $\tau : \mathcal {P}(\mathbb {C}^2) \to \mathcal {P}(\mathbb {C}^2)$ such that

\[ \phi (P)= V \tau (UPU^\ast) V^\ast , \quad P \in \mathcal{P}(\mathbb{C}^2) . \]

To show this proposition, we first see by corollary 2.4 that there is a COSP $\{Q_1 ,\, Q_2\} \in \mathcal {P}(\mathbb {C}^2)$ such that $\{P_1,\, P_2\}$ is also a COSP, where $\phi (Q_j) = P_j$, $j=1,\,2$. Thus, there exist $2 \times 2$ unitary matrices $U$ and $V$ such that $U^\ast E_{jj} U = Q_j$ and $V^\ast P_j V = E_{jj}$, $j=1,\,2$. It follows that the map $P \mapsto V^\ast \phi (U^\ast P U)V$, $P\in \mathcal {P}(\mathbb {C}^2)$, sends matrices $E_{11}$ and $E_{22}$ to themselves. Therefore, proposition 2.7 reduces to

Proposition 2.8 Let $\phi : \mathcal {P}(\mathbb {C}^2) \to \mathcal {P}(\mathbb {C}^2)$ be a map satisfying

\[ \| \phi (P) - \phi (Q) \| \le \| P - Q \|, \quad P,Q \in \mathcal{P}(\mathbb{C}^2), \]

and $\phi (E_{11})=E_{11}$, $\phi (E_{22})=E_{22}$. Then $\phi$ is a standard nonexpansive map.

Proof. From $\| \phi (P) - \phi (Q) \| \le \| P - Q \|$, $P,\,Q \in \mathcal {P}(\mathbb {C}^2)$, we infer that ${\rm tr}\, (\phi (P) \phi (Q)) \ge {\rm tr}\, (PQ)$. Let $P$ be an arbitrary rank one projection with $P\not = E_{11},\, E_{22}$. By lemma 2.1,

(2.5)\begin{equation} P = \begin{bmatrix} p & z \sqrt{p (1-p)} \\ \overline{z} \sqrt{p (1-p)} & 1-p \end{bmatrix} \end{equation}

for some real number $p$, $0 < p < 1$, and some complex number $z \in S^1$. Denote

(2.6)\begin{equation} Q = \phi (P) = \begin{bmatrix} q & w \sqrt{q (1-q)} \\ \overline{w} \sqrt{q (1-q)} & 1-q \end{bmatrix}, \end{equation}

where $0 \le q \le 1$ and $w \in S^1$. We have

\[ q = {\rm tr}\, ( \phi (P) \phi (E_{11})) \ge {\rm tr}\, (P E_{11}) = p, \]

and similarly, $1-q \ge 1-p$, and consequently, $p=q$.

In particular, there exists a function $g : S^1 \to S^1$ such that

\[ \phi (T_u) = T_{g(u)},\text{ where } T_u=\begin{bmatrix} \dfrac{1}{2} & \dfrac{1}{2} u \\ \dfrac{1}{2} \overline{u} & \dfrac{1}{2} \end{bmatrix},\ \ \ \text{for every}\ \ u \in S^1. \]

From

\[ {\rm tr}\, (T_{g(u_1)} T_{g(u_2)} )={\rm tr}\, (\phi (T_{u_1}) \phi (T_{u_2}) ) \ge {\rm tr}\, (T_{u_1} T_{u_2} ),\ \ \ u_1, u_2\in S^1, \]

we deduce that $g\colon S^1\to S^1$ is nonexpansive.

Taking $P$, $Q$ as in (2.5) and (2.6) (hence $p=q$), and applying

\[ {\rm tr}\, (T_{g(z)} \phi (P))={\rm tr}\, (\phi (T_z) \phi (P)) \ge {\rm tr}\, (T_z P ), \]

we conclude that $w = g (z)$. That is, for every pair $p,\, z$, $0 < p < 1$, $z \in S^{1}$, we have

\[ \phi \left( \begin{bmatrix} p & z \sqrt{p (1-p)}\\ \overline{z} \sqrt{p (1-p)} & 1-p \end{bmatrix} \right) = \begin{bmatrix} p & g(z) \sqrt{p (1-p)} \\ \overline{g(z)} \sqrt{p (1-p)} & 1-p \end{bmatrix}. \]

The proof is complete.

Remark 2.9 In the next section, we will need the following straightforward consequence. Let $\phi \colon {\mathcal {P}}({\mathbb {C}}^n)\to {\mathcal {P}}({\mathbb {C}}^n)$ be a nonexpansive map and $P \in {\mathcal {P}}({\mathbb {C}}^n)$ any projection of rank one. Let further $r$ be a positive integer, $1 \le r < n$. We write $P$ in the block form

\[ P = \left[ \begin{matrix} R_1 & N \cr N^\ast & R_2 \end{matrix} \right] \]

where $R_1$ is an $r \times r$ matrix. Since both $R_1$ and $R_2$ are positive and of rank at most one, and because $P$ is a trace one matrix, we have $R_1 = \lambda P_1$ and $R_2 = (1-\lambda ) P_2$ for some rank one projections $P_1$ and $P_2$ and for some real $\lambda$, $0 \leq \lambda \leq 1$.

If there exist $r \times r$ projection $Q_1$ and $(n-r) \times (n-r)$ projection $Q_2$ such that

\[ \phi \left( \left[ \begin{matrix} P_1 & 0\cr 0 & 0 \end{matrix} \right] \right) = \left[ \begin{matrix} Q_1 & 0\cr 0 & 0 \end{matrix} \right] \ \ \ {\rm and} \ \ \phi \left( \left[ \begin{matrix} 0 & 0\cr 0 & P_2 \end{matrix} \right] \right) = \left[ \begin{matrix} 0 & 0\cr 0 & Q_2 \end{matrix} \right], \]

then the argument used in the first part of the proof of proposition 2.8, combined with lemma 2.3, yields that $\phi (P)$ is a matrix of the form

\[ \phi (P) = \left[ \begin{matrix} \lambda Q_1 & * \cr * & (1 - \lambda ) Q_2 \end{matrix} \right]. \]

3. Proofs of the main results

We first give a proof of theorem 1.5 in the finite-dimensional case. Thus, we assume that $n\geq 3$ and consider a nonexpansive mapping $\phi \colon {\mathcal {P}}({\mathbb {C}}^n)\to {\mathcal {P}}({\mathbb {C}}^n)$ which maps some COSP onto a COSP. As in the proof of proposition 2.7, it suffices to prove the following proposition.

Proposition 3.1 Assume $n\geq 3$. Let $\phi \colon {\mathcal {P}}({\mathbb {C}}^n)\to {\mathcal {P}}({\mathbb {C}}^n)$ be a mapping satisfying

\[ \|\phi(P)-\phi(Q)\|\leq \|P-Q\|,\quad P,Q\in {\mathcal{P}}({\mathbb{C}}^n), \]

and assume that $\phi (E_{jj})=E_{jj}$, $j=1,\, 2,\, \ldots,\, n$.

Then there is an $n \times n$ diagonal unitary matrix $U$ whose $(1,\,1)$-entry is $1$ such that one of the following three possibilities holds true:

\[ \phi (P)= U P U^\ast , \quad P \in \mathcal{P}(\mathbb{C}^n) , \]
\[ \phi (P)= U P^t U^\ast , \quad P \in \mathcal{P}(\mathbb{C}^n) , \]

where $A^t$ denotes the transpose of a matrix $A$, or

\[ \phi (P)= U \Phi (P) U^\ast , \quad P \in \mathcal{P}(\mathbb{C}^n) , \]

where $\Phi$ is given by (1.2).

Proof. We first consider the case that $n=3$. Lemma 2.3 and proposition 2.8 imply that there is a nonexpansive mapping $f_{12}\colon S^1\to S^1$ such that

\[ \phi\left( \begin{bmatrix} p & z \sqrt{p (1-p)} & 0\\ \overline{z} \sqrt{p (1-p)} & 1-p & 0\\ 0 & 0 & 0 \end{bmatrix} \right) =\begin{bmatrix} p & f_{12}(z) \sqrt{p (1-p)} & 0\\ \overline{f_{12}(z)} \sqrt{p (1-p)} & 1-p & 0\\ 0 & 0 & 0 \end{bmatrix} \]

for every $p\in [0,\, 1]$ and $z\in S^1$. Similarly, there are nonexpansive mappings $f_{13},\, f_{23}\colon S^1\to S^1$ such that

\[ \phi\left( \begin{bmatrix} p & 0 & z \sqrt{p (1-p)} \\ 0 & 0 & 0\\ \overline{z} \sqrt{p (1-p)} & 0 & 1-p \end{bmatrix} \right) =\begin{bmatrix} p & 0 & f_{13}(z) \sqrt{p (1-p)} \\ 0 & 0 & 0\\ \overline{ f_{13}(z)} \sqrt{p (1-p)} & 0 & 1-p \end{bmatrix} \]

and

\[ \phi\left( \begin{bmatrix} 0 & 0 & 0\\ 0 & p & z \sqrt{p (1-p)} \\ 0 & \overline{z} \sqrt{p (1-p)} & 1-p \\ \end{bmatrix} \right) =\begin{bmatrix} 0 & 0 & 0\\ 0 & p & f_{23}(z) \sqrt{p (1-p)} \\ 0 & \overline{f_{23}(z)} \sqrt{p (1-p)} & 1-p\\ \end{bmatrix} \]

for every $p\in [0,\, 1]$ and $z\in S^1$.

Let $p\in [0,\, 1]$ and $z\in S^1$. Using remark 2.9 for the pair of orthogonal rank one projections

\[ \begin{bmatrix} p & z \sqrt{p (1-p)} & 0\\ \overline{z} \sqrt{p (1-p)} & 1-p & 0\\ 0 & 0 & 0 \end{bmatrix} \ \ \ {\rm and} \ \ \ E_{33}, \]

we see the following: For every $P\in {\mathcal {P}}({\mathbb {C}}^3)$ that is of the form

\[ \begin{bmatrix} \lambda p & \lambda z\sqrt{p (1-p)} & *\\ \lambda\overline{z} \sqrt{p (1-p)} & \lambda(1-p) & * \\ * & * & 1-\lambda \end{bmatrix} \]

with $0\leq \lambda \leq 1$, the projection $\phi (P)$ is of the form

\[ \begin{bmatrix} \lambda p & \lambda f_{12}(z)\sqrt{p (1-p)} & *\\ \lambda\overline{f_{12}(z)} \sqrt{p (1-p)} & \lambda(1-p) & * \\ * & * & 1-\lambda \end{bmatrix}. \]

We repeat the same arguments with $E_{11}$ and $E_{22}$ instead of $E_{33}$. It follows that the three mappings $f_{12},\, f_{13},\, f_{23}$ satisfy

\[ \phi\left( \begin{bmatrix} \lambda_1^2 & z\lambda_1\lambda_2 & w\lambda_1\lambda_3\\ \overline{z} \lambda_1\lambda_2 & \lambda_2^2 & \overline{z}w\lambda_2\lambda_3\\ \overline{w} \lambda_1\lambda_3 & z\overline{w} \lambda_2\lambda_3 & \lambda_3^2 \end{bmatrix} \right) =\begin{bmatrix} \lambda_1^2 & f_{12}(z)\lambda_1\lambda_2 & f_{13}(w)\lambda_1\lambda_3\\ \overline{f_{12}(z)} \lambda_1\lambda_2 & \lambda_2^2 & f_{23}(\overline{z}w)\lambda_2\lambda_3\\ \overline{f_{13}(w)} \lambda_1\lambda_3 & \overline{f_{23}(\overline{z}w)} \lambda_2\lambda_3 & \lambda_3^2 \end{bmatrix} \]

for any nonnegative real numbers $\lambda _1,\,\lambda _2,\, \lambda _3$ with $\lambda _1^2+\lambda _2^2+\lambda _3^2=1$ and $z,\, w\in S^1$. Since the right-hand side belongs to ${\mathcal {P}}({\mathbb {C}}^3)$, we obtain

\[ f_{12}(z)f_{23}(\overline{z}w)=f_{13}(w),\ \ \ z,w\in S^1. \]

By substituting $1$ for $z$, we see that $f_{12}(1)f_{23}(w)=f_{13}(w)$, and by substituting $1$ for $w$, we see that $f_{12}(z)f_{23}(\overline {z})=f_{13}(1)$. It follows that $f_{13}(1)\overline {f_{23}(\overline {z})} f_{23}(\overline {z}w)=f_{12}(1)f_{23}(w)$. Thus, we obtain $g(zw)= g(z)g(w)$ for every pair $z,\,w\in S^1$, where $g(z):=\overline {f_{13}(1)}f_{12}(1)f_{23}(z)$, $z\in S^1$. In other words, $g\colon S^1\to S^1$ is a group homomorphism.

Since $f_{23}$ is nonexpansive, so is $g$. Therefore, corollary 2.6 implies that one of the three possibilities $g(z)=z$ for every $z\in S^1$, $g(z)\!=\!\overline {z}$ for every $z\in S^1$, or $g(S^1)\!=\!\{1\}$ holds true. It is now straightforward to conclude that

  • $\phi$ is a Wigner symmetry induced by a diagonal unitary matrix whose $(1,\,1)$-entry is $1$ if one of the first two options holds, and

  • there is a diagonal unitary matrix $U$ whose $(1,\,1)$-entry is $1$ such that $\phi ([p_{ij}]_{1 \le i,j \le 3})=U[\, |p_{ij}| \, ]_{1 \le i,j \le 3}U^*$ for every $P\in {\mathcal {P}}({\mathbb {C}}^3)$ if the third option holds.

This completes the proof in the special case when $n=3$.

Now let us consider the general case by induction on $n$. We assume that the theorem holds for $n-1$, $n\ge 4$. By lemma 2.3 and the induction hypothesis, we can assume with no loss of generality that either

(3.1)\begin{equation} \phi \left( \left[ \begin{matrix} P & 0 \cr 0 & 0 \cr \end{matrix} \right] \right) = \left[ \begin{matrix} P & 0 \cr 0 & 0 \cr \end{matrix} \right], \ \ \ P \in \mathcal{P}(\mathbb{C}^{n-1}) , \end{equation}

or

(3.2)\begin{equation} \phi \left( \left[ \begin{matrix} P & 0 \cr 0 & 0 \cr \end{matrix} \right] \right) = \left[ \begin{matrix} \Phi (P) & 0 \cr 0 & 0 \cr \end{matrix} \right], \ \ \ P \in \mathcal{P}(\mathbb{C}^{n-1}) . \end{equation}

Here we use the same symbol $\Phi$ for two different maps, one on ${\mathcal {P}} (\mathbb {C}^{n-1})$ and the other one on ${\mathcal {P}} (\mathbb {C}^{n})$, both of them sending any projection $P$ of rank one to a rank one projection $Q$ whose entries are the absolute values of the corresponding entries of $P$.

Applying lemma 2.3 and the induction hypothesis once again, we see that there is a diagonal $(n-1) \times (n-1)$ unitary matrix $U$ whose $(1,\,1)$-entry is $1$ such that one of the following three possibilities occurs:

(3.3)\begin{align} \phi \left( \left[ \begin{matrix} 0 & 0 \cr 0 & P \cr \end{matrix} \right] \right) & = \left[ \begin{matrix} 0 & 0 \cr 0 & UPU^\ast \cr \end{matrix} \right] , \ \ \ P \in \mathcal{P}(\mathbb{C}^{n-1}) ,\\ \phi \left( \left[ \begin{matrix} 0 & 0 \cr 0 & P \cr \end{matrix} \right] \right) & = \left[ \begin{matrix} 0 & 0 \cr 0 & UP^tU^\ast \cr \end{matrix} \right] , \ \ \ P \in \mathcal{P}(\mathbb{C}^{n-1}), \nonumber \end{align}

or

(3.4)\begin{equation} \phi \left( \left[ \begin{matrix} 0 & 0 \cr 0 & P \cr \end{matrix} \right] \right) = \left[ \begin{matrix} 0 & 0 \cr 0 & U\Phi(P)U^\ast \cr \end{matrix} \right] , \ \ \ P \in \mathcal{P}(\mathbb{C}^{n-1}) . \end{equation}

By considering projections of the form

\[ \left[ \begin{matrix} 0 & 0 & 0 \cr 0 & P & 0 \cr 0 & 0 & 0 \cr \end{matrix} \right] \]

for $P\in \mathcal {P}(\mathbb {C}^{n-2})$, we see that (3.1) implies (3.3), and that (3.2) implies (3.4). Moreover, in both cases all diagonal entries but the $(n-1,\, n-1)$-entry of $U$ are $1$. Thus, by considering the mapping

\[ P\mapsto \left[ \begin{matrix} 1 & 0 \cr 0 & U^* \cr \end{matrix} \right]\phi(P)\left[ \begin{matrix} 1 & 0 \cr 0 & U \cr \end{matrix} \right] \]

instead of $\phi$, we may additionally assume $U=I$.

Let $P =[p_{ij}]_{1 \le i,j \le n}\in {\mathcal {P}} (\mathbb {C}^{n})$ be any rank one projection. Then one may take a projection $Q\in {\mathcal {P}} (\mathbb {C}^{n-1})$ such that

\[ P=\left[ \begin{matrix} (1-p_{nn}) Q & * \cr * & p_{nn} \cr \end{matrix} \right] . \]

If (3.1) holds, then we have

\[ \phi \left( \left[ \begin{matrix} Q & 0 \cr 0 & 0 \cr \end{matrix} \right] \right) = \left[ \begin{matrix} Q & 0 \cr 0 & 0 \cr \end{matrix} \right]. \]

Thus, remark 2.9 together with the assumption $\phi (E_{nn})=E_{nn}$ shows that

(3.5)\begin{equation} \phi (P) = \phi\left(\left[ \begin{matrix} (1-p_{nn}) Q & * \cr * & p_{nn} \cr \end{matrix} \right]\right) = \left[ \begin{matrix} [p_{ij} ]_{1 \le i,j \le n-1} & * \cr * & p_{nn} \cr \end{matrix} \right] . \end{equation}

Similarly, we obtain

(3.6)\begin{equation} \phi (P) = \left[ \begin{matrix} p_{11} & * \cr * & [p_{ij} ]_{2 \le i,j \le n} \cr \end{matrix} \right] \end{equation}

because $\phi (E_{11})=E_{11}$ and (3.3) holds with $U=I$. Since $\phi (P)$ is a projection of rank one, (3.5) and (3.6) imply that $\phi (P)=P$ holds for every $P \in {\mathcal {P}} (\mathbb {C}^{n})$ with $P\not \le E_{11}+E_{nn}$. By the continuity of $\phi$, we see that $\phi (P)=P$ holds for every $P\in {\mathcal {P}} (\mathbb {C}^{n})$.

In a parallel manner, if (3.2) holds, then we have

\[ \phi (P) = \left[ \begin{matrix} [\, |p_{ij}|\, ]_{1 \le i,j \le n-1} & * \cr * & p_{nn} \cr \end{matrix} \right] =\left[ \begin{matrix} p_{11} & * \cr * & [\, |p_{ij}|\, ]_{2 \le i,j \le n} \cr \end{matrix} \right] \]

for every $P =[p_{ij}]_{1 \le i,j \le n}\in {\mathcal {P}} (\mathbb {C}^{n})$. This leads to $\phi (P)=\Phi (P)$ for every $P\in {\mathcal {P}}({\mathbb {C}}^n)$.

Remark 3.2 The following facts can be verified easily.

The unitary matrix $U$ in the statement of proposition 3.1 is unique. It cannot happen that two different possibilities in the statement of proposition 3.1 are fulfilled simultaneously.

Let $U$ be a diagonal $n \times n$ unitary matrix. Define $V\colon {\mathbb {C}}^n \to {\mathbb {C}}^n$ by $Vx = U\overline {x}$, $x\in {\mathbb {C}}^n$, where $\overline {x}=(\overline {x_i})_{1\leq i\leq n}$ for $x=(x_i)_{1\leq i\leq n}\in {\mathbb {C}}^n$. Then $V$ is an antiunitary operator satisfying

\[ U P^t U^\ast{=}VPV^\ast, \quad P \in \mathcal{P}(\mathbb{C}^n). \]

Let $\phi : \mathcal {P} (H) \to \mathcal {P} (H)$ be a nonexpansive map. Assume that there is an OSP $\{ Q_{\alpha } \, : \, \alpha \in \mathcal {J} \} \subset \mathcal {P}(H)$ such that $\{ P_{\alpha } \, : \, \alpha \in \mathcal {J} \} \subset \mathcal {P}(H)$ is a COSP, where $P_{\alpha }=\phi (Q_\alpha )$, $\alpha \in \mathcal {J}$. Corollary 2.4 implies that this assumption is fulfilled if and only if some COSP is contained in the image of $\phi$. Note that the latter condition is much weaker than the surjectivity of $\phi$.

We take a unit vector $e_\alpha \in P_\alpha H$ for each $\alpha \in \mathcal {J}$. Then $\{e_\alpha \,:\, \alpha \in \mathcal {J}\}$ is an orthonormal basis of $H$. A unitary operator $V : H \to H$ is said to be diagonal (with respect to the orthonormal basis $\{e_\alpha \,:\, \alpha \in \mathcal {J}\}$) if there exist complex numbers $\{z_\alpha \,:\, \alpha \in \mathcal {J}\} \subset S^1$ such that $Ve_\alpha = z_\alpha e_\alpha$ for every $\alpha \in \mathcal {J}$. For each projection $P\in {\mathcal {P}}(H)$ onto ${\mathbb {C}}(\sum _{\alpha \in \mathcal {J}}c_\alpha e_\alpha )$, $\Phi (P)$ stands for the projection onto ${\mathbb {C}}(\sum _{\alpha \in \mathcal {J}}|c_\alpha |e_\alpha )$. For a closed subspace $K\subset H$, we identify ${\mathcal {P}}(K)$ with a subset of ${\mathcal {P}}(H)$ in a natural manner.

Proposition 3.3 Let $\dim H\geq 3$ and let $\phi : \mathcal {P} (H) \to \mathcal {P} (H)$ be a map such that

\[ \| \phi (P) - \phi (Q) \| \le \| P - Q \| \]

for every pair $P,\,Q \in \mathcal {P}(H)$. Let $\{ Q_{\alpha } \, : \, \alpha \in \mathcal {J} \} \subset \mathcal {P}(H)$ be an OSP such that $\{ P_{\alpha } \, : \, \alpha \in \mathcal {J} \} \subset \mathcal {P}(H)$ is a COSP, where $P_{\alpha }=\phi (Q_\alpha )$, $\alpha \in \mathcal {J}$. Let $K:=(\bigvee _{\alpha \in \mathcal {J}} Q_\alpha )H$ and let $\Phi$ be as above. Then one of the following holds:

  • $K=H$ and $\phi$ is a Wigner symmetry.

  • There are a bijective linear isometry $U\colon K\to H$ and a diagonal unitary operator $V : H \to H$ such that $\phi (P)= V \Phi (UPU^*) V^\ast$ for all $P\in {\mathcal {P}}(K)$.

Proof. Let $\{ f_\alpha \, : \, \alpha \in \mathcal {J} \}$ be an orthonormal basis of $K$ such that the image of each $Q_\alpha$ is spanned by $f_\alpha$ and $\{ e_\alpha \, : \, \alpha \in \mathcal {J} \}$ an orthonormal basis of $H$ such that the image of each $P_\alpha$ is spanned by $e_\alpha$. We first define the linear bijective isometry $U : K \to H$ by $U f_\alpha = e_\alpha$, $\alpha \in \mathcal {J}$. Observe that $UQ_\alpha U^*= P_{\alpha }=\phi (Q_\alpha )$ holds for every $\alpha \in \mathcal {J}$. We choose and fix distinct elements $\alpha _1,\, \alpha _2,\, \alpha _3 \in \mathcal {J}$. Using proposition 3.1 and lemma 2.3 (see also remark 3.2) we see that for every finite subset $\mathcal {K} \subset \mathcal {J}$ satisfying $\alpha _1,\, \alpha _2,\, \alpha _3 \in \mathcal {K}$, either

  • there exists a uniquely determined unitary or antiunitary operator $V_\mathcal {K} : {\rm span}\, \{ e_\alpha \, : \, \alpha \in \mathcal {K} \} \to {\rm span}\, \{ e_\alpha \, : \, \alpha \in \mathcal {K} \}$ such that $V_\mathcal {K} {e_{\alpha _1}} = e_{\alpha _1}$ and $\phi (Q) = V_\mathcal {K} UQU^\ast V_\mathcal {K}^\ast$ for every $Q\in {\mathcal {P}}(K)$ with $Q \le \sum _{\alpha \in \mathcal {K}} Q_\alpha$; or

  • there exists a uniquely determined unitary operator $V_\mathcal {K} : {\rm span}\, \{ e_\alpha \, : \, \alpha \in \mathcal {K} \} \to {\rm span}\, \{ e_\alpha \, : \, \alpha \in \mathcal {K} \}$ such that $V_\mathcal {K} {e_{\alpha _1}} = e_{\alpha _1}$ and $\phi (Q) = V_\mathcal {K} \Phi (UQU^\ast ) V_\mathcal {K}^\ast$ for every $Q\in {\mathcal {P}}(K)$ with $Q \le \sum _{\alpha \in \mathcal {K}} Q_\alpha$.

Obviously, if $\mathcal {K}_1 ,\, \mathcal {K}_2 \subset \mathcal {J}$ are finite subsets satisfying $\alpha _1,\, \alpha _2,\, \alpha _3\in \mathcal {K}_1 \subset \mathcal {K}_2$ then either we have for both sets $\mathcal {K}_1$ and $\mathcal {K}_2$ the first possibility above, or we have for both sets the second possibility. In both cases $V_{\mathcal {K}_1}$ is the restriction of the operator $V_{\mathcal {K}_2}$. It follows that either

  • there exists a bijective linear or conjugate-linear isometry $W : K \to H$ such that $\phi (P) = WPW^\ast$ for every $P \in \mathcal {P}(K)$ that is dominated by a sum of finitely many projections in $\{Q_{\alpha }\,:\, \alpha \in \mathcal {J}\}$; or

  • there exists a diagonal unitary operator $V: H \to H$ such that $\phi (P) = V \Phi (UPU^\ast ) V^\ast$ for every $P \in \mathcal {P}(K)$ that is dominated by a sum of finitely many projections in $\{Q_{\alpha }\,:\, \alpha \in \mathcal {J}\}$.

By the continuity of $\phi$, we see that one of the above possibilities holds for every $P \in \mathcal {P}(K)$.

Assume that the first option holds and $K\neq H$. Let $v\in K$ and $w\in K^{\perp }$ be unit vectors and $0< c\leq 1$ a real number. Then we may take a unique projection $Q\in {\mathcal {P}}(K)$ such that $\phi (P_{cv+\sqrt {1-c^2}w}) = \phi (Q)$. For every unit vector $u\in K$ with $Qu=0$, the projection $\phi (P_{u})=WP_u W^*$ is orthogonal to $WQW^*=\phi (Q)=\phi (P_{cv+\sqrt {1-c^2}w})$. Since $\phi$ is nonexpansive, we see that $P_{u}$ is orthogonal to $P_{cv+\sqrt {1-c^2}w}$. Hence $u$ is orthogonal to $v$. Because this is true for every $u \in K$ satisfying $Qu=0$, we conclude that $Q=P_v$, thus $\phi (P_{cv+\sqrt {1-c^2}w}) = \phi (P_v)$.

Since $P_{cv+\sqrt {1-c^2}w}$ converges to $P_w$ when $c$ tends to zero and since $v\in K$ and $w\in K^{\perp }$ are arbitrary unit vectors, we see that $\phi$ is not continuous at every point of ${\mathcal {P}}(K^\perp )$. This contradicts our assumption, and we obtain $K=H$. Thus, we have shown that $\phi$ is a Wigner symmetry.

Proof Proof of theorem 1.5

In the formulation of theorem 1.5 the orthonormal basis $\{ e_\alpha \, : \, \alpha \in \mathcal {J} \}$ is fixed in advance. In proposition 3.3 the orthonormal basis $\{ e_\alpha \, : \, \alpha \in \mathcal {J} \}$ was determined using a given CONS. Thus, in order to show that theorem 1.5 follows from proposition 3.3, we only need to check that if $\{ e_\alpha \, : \, \alpha \in \mathcal {J} \}$ and $\{ e'_\alpha \, : \, \beta \in \mathcal {J}' \}$ are two orthonormal bases of $H$ and $\Phi$ and $\Phi '$ are the corresponding nonexpansive maps, then there exists a unitary operator $W : H \to H$ such that $W\Phi (P) W^\ast = \Phi ' (WPW^\ast )$, $P \in \mathcal {P}(H)$. There exists a bijection $\tau : \mathcal {J} \to \mathcal {J}'$. It is straightforward to verify that the unitary operator $W$ given by $We_{\alpha } = e'_{\tau (\alpha )}$, $\alpha \in \mathcal {J}$, has the desired property.

Proof Proof of theorem 1.4

If $\dim H=2$ then the second item of corollary 1.2 leads to the desired conclusion. In what follows we assume that $\dim H\geq 3$. The surjectivity of $\phi$ together with corollary 2.4 implies that $\phi$ satisfies the assumption of proposition 3.3.

Assume that $\phi$ is of the form as in the second option of the statement of proposition 3.3. The goal of this proof is to show that $\phi$ is never surjective. From now on, we use the same symbols as in proposition 3.3. After replacing the map $\phi$ by $P \mapsto V^\ast \phi (P) V$, $P \in {\mathcal {P}} (H)$, we may assume that $V=I$. Fix three indices $\alpha _1,\, \alpha _2,\, \alpha _3\in \mathcal {J}$. Fix any complex numbers $c_1,\, c_2,\, c_3\in {\mathbb {C}}$ of modulus one such that $0\in {\mathbb {C}}$ is contained in the interior of the convex hull of $\{c_1,\, c_2,\, c_3\}$. For example, we may choose the cube roots of $1$.

We can find $\varepsilon >0$ such that $D(0,\,\varepsilon ) = \{ z \in \mathbb {C} \, : \, |z| < \varepsilon \}$ is contained in the convex hull of $\{c_1,\, c_2,\, c_3\}$. Then there exists $\delta >0$ such that $D(0,\, \varepsilon /2 )$ is contained in the convex hull of $\{c'_1,\, c'_2,\, c'_3\}$ for all complex numbers $c'_1,\, c'_2,\, c'_3$ satisfying $| c'_j - c_j | < \delta$.

Let $P\in {\mathcal {P}}(H)$ denote the projection onto ${\mathbb {C}}(c_1e_{\alpha _1}+c_2e_{\alpha _2}+c_3e_{\alpha _3})$. We can find $\eta >0$ such that if $P'\in {\mathcal {P}}(H)$ satisfies $\|P-P'\|<\eta$, then we may take $c'_1,\, c'_2,\, c'_3 \in {\mathbb {C}}$ and $v\in \{e_{\alpha _1} ,\, e_{\alpha _2},\, e_{\alpha _3} \}^\perp$ such that $P'$ is the projection onto ${\mathbb {C}}(c_1'e_{\alpha _1}+c_2'e_{\alpha _2}+c_3'e_{\alpha _3}+v)$ and

\[ | c'_j - c_j | < \delta \text{ and } \|v \| < \frac{\varepsilon}{2}. \]

Let $P'\in {\mathcal {P}}(H)$ be a projection satisfying $\|P-P'\|<\eta$ and let $c'_1,\, c'_2,\, c'_3 \in {\mathbb {C}}$ and $v\in \{e_{\alpha _1} ,\, e_{\alpha _2},\, e_{\alpha _3} \}^\perp$ be as above. Let further $\{a_\alpha \,:\, \alpha \in \mathcal {J}\setminus \{\alpha _1,\,\alpha _2,\, \alpha _3\}\}$ be any family of nonnegative real numbers such that $\sum _{\alpha \in \mathcal {J}\setminus \{\alpha _1,\,\alpha _2,\, \alpha _3\}} a_{\alpha }^2< 1$. Then

\[ \left| \left\langle v, \sum_{\alpha\in \mathcal{J}\setminus\{\alpha_1,\alpha_2, \alpha_3\}} a_{\alpha}e_\alpha \right\rangle \right| < \frac{\varepsilon}{2} . \]

Thus, we can find $a_{\alpha _1},\, a_{\alpha _2},\, a_{\alpha _3}>0$ such that $a_{\alpha _1}+ a_{\alpha _2}+ a_{\alpha _3}=1$ and

\[ a_{\alpha_1}c_1'+a_{\alpha_2}c_2'+a_{\alpha_3}c_3'={-} \left\langle v, \sum_{\alpha\in \mathcal{J}\setminus\{\alpha_1,\alpha_2, \alpha_3\}} a_{\alpha}e_\alpha \right\rangle \]

hold. It follows that the projection $E$ onto ${\mathbb {C}}(\sum _{\alpha \in \mathcal {J}} a_{\alpha }e_\alpha )$ is orthogonal to $P'$. Observe that the following holds: For any family of complex numbers $\{b_\alpha \,:\, \alpha \in \mathcal {J}\}$, with $|b_\alpha |=a_\alpha$ for all $\alpha \in \mathcal {J}$, the projection $F$ onto ${\mathbb {C}}(\sum _{\alpha \in \mathcal {J}} b_{\alpha }f_\alpha )$ satisfies $\phi (F)=E$. By the assumption that $\phi$ is nonexpansive, we see that such an $F$ is orthogonal to any projection in $\phi ^{-1}(P')$.

The arbitrariness of $\{a_\alpha \,:\, \alpha \in \mathcal {J}\setminus \{\alpha _1,\,\alpha _2,\, \alpha _3\}\}$ and $\{b_\alpha \,:\, \alpha \in \mathcal {J}\}$ ensures that the one-dimensional image of any projection in $\phi ^{-1}(P')$ is actually orthogonal to $K$. It follows that $\phi ^{-1}(\{P'\in {\mathcal {P}}(H)\,:\, \|P-P'\|<\eta \})\subset {\mathcal {P}}(K^{\perp })$. The continuity of $\phi$ implies that the left-hand side is open in ${\mathcal {P}}(H)$. However, the interior of ${\mathcal {P}}(K^{\perp })$ is empty in ${\mathcal {P}}(H)$. Therefore, we obtain $\phi ^{-1}(\{P'\in {\mathcal {P}}(H)\,:\, \|P-P'\|<\eta \})=\emptyset$. In particular, $\phi$ is not surjective.

Proof Proof of theorem 1.3

Obviously, $\phi$ is bijective. By theorem 1.4, the inverse of $\phi$ is a Wigner symmetry. Therefore, $\phi$ is a Wigner symmetry.

4. Optimality of results

The surjectivity assumption is essential in theorem 1.3.

Example 4.1 Assume $\dim H=\infty$. Then there is a noncontractive map $\phi : \mathcal {P}(H) \to \mathcal {P}(H)$ such that no linear or conjugate-linear isometry $U : H \to H$ satisfies $\phi (P) = UPU^\ast$ for every $P \in \mathcal {P}(H)$.

Proof. Note that an infinite-dimensional Hilbert space $H$ can be identified with the orthogonal direct sum of two copies of $H$, $H \equiv H \oplus H$. Then a map $\phi$ on $\mathcal {P}(H)$ can be considered as a map from $\mathcal {P}(H)$ to $\mathcal {P}(H \oplus H)$. We represent operators in $\mathcal {P}(H \oplus H)$ with $2 \times 2$ operator matrices. Choose and fix $\emptyset \neq \mathcal {A} \subsetneq \mathcal {P}(H)$. It is trivial to verify that the map $\phi : \mathcal {P}(H) \to \mathcal {P}(H \oplus H)$ given by

\[ \phi (P) =\begin{bmatrix} P & 0 \\ 0 & 0 \end{bmatrix}, \ \ \ P \in \mathcal{P}(H) \setminus \mathcal{A}, \]

and

\[ \phi (P) = \begin{bmatrix} 0 & 0 \\ 0 & P \end{bmatrix}, \ \ \ P \in \mathcal{A} \]

is noncontractive but there is no linear or conjugate-linear isometry $U : H \to H \oplus H$ such that $\phi (P) = UPU^\ast$ for every $P \in \mathcal {P}(H)$.

We next give a wild example of an injective nonexpansive map.

Example 4.2 Let $H$ be an infinite-dimensional separable Hilbert space. There is an injective nonexpansive map $\phi : \mathcal {P} (H) \to \mathcal {P} (H)$ such that no linear or conjugate-linear isometry $U : H \to H$ satisfies $\phi (P) = UPU^\ast$ for every $P \in \mathcal {P}(H)$.

Proof. We will identify $\mathcal {P}(H)$ with the projective space $\mathbb {P} (H) = \{ [x] \, : \, x \in H \setminus \{ 0 \} \}$ over the Hilbert space $H$. Here, $[x]$ denotes the one-dimensional subspace of $H$ spanned by $x$. Of course, we identify a projection of rank one with its image. By lemma 2.2, we need to find an injective map $\phi : \mathbb {P} (H) \to \mathbb {P} (H)$ with the property that for any unit vectors $x,\,y \in H$ and for any unit vectors $u \in \phi ([x])$ and $v \in \phi ([y])$ we have

(4.1)\begin{equation} | \langle u,v \rangle | \ge | \langle x,y \rangle |, \end{equation}

but there is no linear or conjugate-linear isometry $U : H \to H$ satisfying $\phi ([x]) = [Ux]$ for every unit vector $x\in H$.

We choose and fix a sequence $(P_n) \in \mathcal {P}(H)$ with the property that the set $\{ P_n \, : \, n \in \mathbb {N} \}$ is dense in $\mathcal {P}(H)$. Let $(x_n) \subset H$ be a sequence of unit vectors such that the image of $P_n$ is the linear span of $x_n$, $n=1,\,2,\,\ldots$. We next choose and fix an orthonormal set $\{ e_1 ,\, f_1 ,\, e_2 ,\, f_2 ,\, \ldots \}$ in $H$. For any unit vector $x\in H$, we set $| \langle x,\, x_n \rangle | = t_n$, $n=1,\,2,\,\ldots$, and define

\[ \phi ([x]) = \left[ \sum_{n=1}^\infty \frac{t_n}{2^{n/2}} e_n + \sum_{n=1}^\infty \frac{\sqrt{ 1 - t_{n}^2}}{2^{n/2}} f_n \right]. \]

It is trivial to see that $\phi$ is well-defined.

Let $x,\,y \in H$ be any unit vectors. Set $| \langle x,\, x_n \rangle | = t_n$ and $| \langle y,\, x_n \rangle | = s_n$, $n=1, 2,\,\ldots$. Denote

\[ u = \sum_{n=1}^\infty \frac{t_n}{2^{n/2}} e_n + \sum_{n=1}^\infty \frac{\sqrt{ 1 - t_{n}^2}}{2^{n/2}} f_n \]

and

\[ v = \sum_{n=1}^\infty \frac{s_n}{2^{n/2}} e_n + \sum_{n=1}^\infty \frac{\sqrt{ 1 - s_{n}^2}}{2^{n/2}} f_n . \]

A straightforward computation shows that $u$ and $v$ are unit vectors. Clearly, $[u]= \phi ([x])$ and $[v] =\phi ([y])$. We need to show (4.1). We have

\[ | \langle u,v \rangle | = \sum_{n=1}^\infty \frac{1}{2^n} ( t_n s_n + \sqrt{ 1 - t_{n}^2} \, \sqrt{ 1 - s_{n}^2} ) \]

and therefore it is enough to check that

(4.2)\begin{equation} | \langle x,y \rangle | \le t_n s_n + \sqrt{ 1 - t_{n}^2} \, \sqrt{ 1 - s_{n}^2}, \quad n=1,2,\ldots. \end{equation}

Let $z\in H$ be an arbitrary unit vector. Denote $w_1 = x - \langle x,\,z \rangle z$ and $w_2 = y - \langle y,\,z \rangle z$. Then

\begin{align*} | \langle x,y \rangle | & = | \langle x,z \rangle \overline{ \langle y,z \rangle} + \langle w_1,w_2 \rangle |\\ & \le | \langle x,z \rangle | \, | \langle y,z \rangle | + \| w_1 \| \, \| w_2 \|\\ & = | \langle x,z \rangle | \, | \langle y,z \rangle | + \sqrt{ 1 - | \langle x,z \rangle |^2} \, \sqrt{ 1 - | \langle y,z \rangle |^2}. \end{align*}

Substituting $z= x_n$, we get the desired inequality (4.2). Moreover, if $[x]\neq [y]$, then the assumption that $\{ P_n \, : \, n \in \mathbb {N} \}$ is dense in $\mathcal {P}(H)$ implies that $\{x_n,\, x,\, y\}$ is linearly independent for some $n$. For such an $n$ and $z=x_n$, we see that $\{w_1,\, w_2\}$ is linearly independent. Thus, we have strict inequality in (4.2) and (4.1). It follows that there is no linear or conjugate-linear isometry $U : H \to H$ satisfying $\phi ([x]) = [Ux]$ for every unit vector $x\in H$.

It remains to show that $\phi$ is injective. Here we prefer to work with $\mathcal {P}(H)$ rather than with $\mathbb {P} (H)$. Thus, let $P$ and $Q$ be rank one projections such that $\phi (P) = \phi (Q)$. Then

\[ {\rm tr}\, (P P_n) = {\rm tr}\, (Q P_n), \quad n=1,2,\ldots , \]

and since $\{ P_n : n=1,\,2,\,\ldots \}$ is dense in $\mathcal {P}(H)$, we have

\[ {\rm tr}\, (P R) = {\rm tr}\, (Q R) \]

for every $R \in \mathcal {P}(H)$. Therefore, $P=Q$, as desired.

Our last example will show that when the second possibility in proposition 3.3 occurs, it may happen that $K$ is a proper subspace of $H$. Indeed, let $H$ be an infinite-dimensional Hilbert space and $\{ f_\alpha \, : \, \alpha \in \mathcal {J} \}$ an orthonormal set of vectors such that $K = \overline { {\rm span}\, \{ f_\alpha \, : \, \alpha \in \mathcal {J} \} }$ is a proper subspace of $H$. Let further $\{ e_\alpha \, : \, \alpha \in \mathcal {J} \}$ be an orthonormal basis of $H$ and $U : K \to H$ a bijective linear isometry defined by $U f_\alpha = e_\alpha$, $\alpha \in \mathcal {J}$. For each projection $P\in {\mathcal {P}}(H)$ onto ${\mathbb {C}}(\sum _{\alpha \in \mathcal {J}}c_\alpha e_\alpha )$, $\Phi (P)$ stands for the projection onto ${\mathbb {C}}(\sum _{\alpha \in \mathcal {J}}|c_\alpha |e_\alpha )$. Fix an index $\alpha _0\in \mathcal {J}$, and for each unit vector $w=\sum _{\alpha \in \mathcal {J}} b_{\alpha } f_\alpha + v$, $v\in K^{\perp }$, let $\phi (P_w)$ be the projection onto ${\mathbb {C}}(\sum _{\alpha \in \mathcal {J}\setminus \{ \alpha _0 \}} |b_{\alpha }|e_\alpha + \sqrt {\|v\|^2+|b_{\alpha _0}|^2}e_{\alpha _0})$. We denote $P_{f_{\alpha }} = Q_\alpha$ and $P_{e_{\alpha }} = P_\alpha$, $\alpha \in \mathcal {J}$. Then clearly, $\phi (Q_\alpha ) = P_\alpha$, $\alpha \in \mathcal {J}$, and $\phi (P) = \Phi (UPU^\ast )$, $P \in \mathcal {P}(K)$. It remains to show that $\phi$ is a nonexpansive map, that is, we have to verify that

\[ {\rm tr} \, ( \phi (P_w) \phi (P_z)) \ge {\rm tr}\, (P_w P_z) \]

for every pair of unit vectors $w,\,z \in H$. Let

\[ w=\sum_{\alpha\in \mathcal{J}} b_{\alpha} f_\alpha + v \text{ and } z=\sum_{\alpha\in \mathcal{J}} c_{\alpha} f_\alpha + u, \]

$v,\,u \in K^\perp$, be such vectors. Applying the Cauchy–Schwarz inequality in the two-dimensional Euclidean space, we get

\[ | b_{\alpha_0} | \, | c_{\alpha_0} | + \| v \| \, \| u \| \le \sqrt{ \| v \|^2 + |b_{\alpha_0}|^2 } \ \sqrt{ \| u \|^2 + |c_{\alpha_0}|^2 }. \]

Therefore,

\begin{align*} \sqrt{ {\rm tr}\, ( P_w P_z ) } & = | \langle w,z \rangle | = \left| \sum_{\alpha \in \mathcal{J}} b_\alpha \overline{ c_\alpha} + \langle v,u \rangle \right| \le \sum_{\alpha \in \mathcal{J}} | b_\alpha | \, | c_\alpha| + | \langle v,u \rangle |\\ & \le | b_{\alpha_0} | \, | c_{\alpha_0} | + \| v \| \, \| u \| + \sum_{\alpha \in \mathcal{J}\setminus \{ \alpha_0 \} } | b_\alpha | \, | c_\alpha| \\ & \le \sqrt{ \| v \|^2 + |b_{\alpha_0}|^2 } \ \sqrt{ \| u \|^2 + |c_{\alpha_0}|^2 } \ + \sum_{\alpha \in \mathcal{J}\setminus \{ \alpha_0 \} } | b_\alpha | \, | c_\alpha| \\ & =\sqrt{ {\rm tr}\, ( \phi( P_w ) \phi( P_z )) } . \end{align*}

Acknowledgements

The first author was supported by JSPS KAKENHI grant number 22K13934. The second author was supported by grants J1-2454 and P1-0288 from ARRS, Slovenia.

References

Burago, D., Burago, Y. and Ivanov, S.. A course in metric geometry, Graduate Studies in Mathematics, Vol. 33 (American Mathematical Society, Providence, RI, 2001).CrossRefGoogle Scholar
Fošner, A., Kuzma, B., Kuzma, T. and Sze, N.-S.. Maps preserving matrix pairs with zero Jordan product. Linear Multi. Algebra 59 (2011), 507529.CrossRefGoogle Scholar
Gehér, G. P.. An elementary proof for the non-bijective version of Wigner's theorem. Phys. Lett. A 378 (2014), 20542057.CrossRefGoogle Scholar
Gehér, G. P. and Mori, M.. The structure of maps on the space of all quantum pure states that preserve a fixed quantum angle. Int. Math. Res. Not. IMRN 2022 (2022), 1200312029.CrossRefGoogle Scholar
Gehér, G. P. and Šemrl, P.. Isometries of Grassmann spaces, II. Adv. Math. 332 (2018), 287310.CrossRefGoogle Scholar
Molnár, L.. Generalization of Wigner's unitary-antiunitary theorem for indefinite inner product spaces. Comm. Math. Phys. 210 (2000), 785791.Google Scholar
Pankov, M.. Wigner-type theorems for Hilbert Grassmannians, London Mathematical Society Lecture Note Series, Vol. 460 (Cambridge University Press, Cambridge, 2020).CrossRefGoogle Scholar
Pankov, M. and Vetterlein, T.. A geometric approach to Wigner-type theorems. Bull. London Math. Soc. 53 (2021), 16531662.CrossRefGoogle Scholar
Šemrl, P.. Wigner symmetries and Gleason's theorem. J. Phys. A 54 (2021), 315301, 6pp.CrossRefGoogle Scholar
Uhlhorn, U.. Representation of symmetry transformations in quantum mechanics. Ark. Fysik 23 (1963), 307340.Google Scholar
Wigner, E. P.. Gruppentheorie und ihre Anwendung auf die Quantenmechanik der Atomspektrum (Frederik Vieweg und Sohn, Braunschweig, 1931).CrossRefGoogle Scholar