Intuitive role of the polar decomposition in proof of Uhlmann's theorem for fidelity

Question

I have read the Wikipedia article which relates the polar decomposition to a complex number being split into its modulus and phase but this analogy isn't very intuitive to me.

In Nielsen and Chuang, the polar decomposition is especially important in proving results around fidelity. If I have states $\rho \in H_{R}$ and $\sigma \in H_{Q}$, Uhlmann's theorem shows that the best purification that preserves fidelity $F(\rho, \sigma)$ is some state in $R\otimes Q$ and gives us a method to find it. Set state $ m = \sum_i\vert i_R\rangle\vert i_Q\rangle$. Now

$\vert\psi\rangle = 1\otimes \sqrt{\rho}\vert m\rangle$

$\vert\phi\rangle = 1\otimes \sqrt{\sigma}V^{\dagger}\vert m\rangle$

gives the purification that preserves fidelity. Here, the polar decomposition of $\sqrt{\rho}\sqrt{\sigma}$ is $|\sqrt{\rho}\sqrt{\sigma}|V$.

While I can follow the proof in Nielsen and Chuang, I have no intuition about what the polar decomposition is doing here and why this particular purification preserves fidelity.

I'm looking for intuition only, so proofs are unimportant. Could someone help with simple example states and how they transform under different purifications and why the polar decomposition-based purification works best?

Norbert Schuch · Accepted Answer · 2018-11-13T09:07:37.163

(I will give the argument with formulas for now, hopefully I find time for some pictures later.)

Let $|m\rangle$ be the (unnormalized) maximally entangled state. Then, a purification of $\rho$ is given by $$ |\rho\rangle_{AB}=(\sqrt{\rho}_A\otimes1\!\!1_B)|m\rangle\ , $$ and correspondingly for $\sigma$ -- this can be seen most easily by first tracing the $B$ system from $|m\rangle$, which gives $1\!\!1$, which implies that the partial trace of $|\rho\rangle_{AB}$ is $\sqrt{\rho}1\!\!1\sqrt{\rho}=\rho$.

Now there is a unitary degree of freedom in the purification -- $B$ can apply any unitary and it is the same purification (as $B$ is being traced anyway). Let's use it for the purification of $\sigma$: $$ |\sigma\rangle = (\sqrt{\sigma}\otimes U)|m\rangle\ . $$

Now what is the overlap of the two states? It is $$ \langle\rho|\sigma\rangle = \langle m| \sqrt{\rho}\sqrt{\sigma}\otimes U|m\rangle\ .$$ It is not straightforward to work out that this equals $$ \langle\rho|\sigma\rangle = \mathrm{tr}[\sqrt{\rho}\sqrt{\sigma}U^T]\ .$$ Now we want this quantity to be maximal. When is it maximal? It is maximal if $\sqrt{\rho}\sqrt{\sigma}U^T = |\sqrt{\rho}\sqrt{\sigma}|$, i.e., $\bar U$ is given by the polar decomposition of $\sqrt{\rho}\sqrt{\sigma}$. (Here -- and in the question -- $|X|$ denotes the positive part of the polar decomposition of $X$, which is $\sqrt{X^\dagger X}$, different from the absolute value function defined on the eigenvalues.)

(The last statement holds since $|\mathrm{tr}[XU]|\le \|X\|_1\|U\|_\infty = \|X\|_1$ -- this is Hölder's inequality -- where $\|X\|_1=\mathrm{tr}|X|$ is the sum of the singular values. Alternatively (following @glS) one can use the SVD of $X$, $X=\sum \sigma_i|s_i\rangle\langle q_i|$: $$ |\mathrm{tr}[XU]|=|\sum \sigma_i\langle q_i|U|s_i\rangle| \le \sum \sigma_i|\langle q_i|U|s_i\rangle| \le \sum \sigma_i\ , $$ where we have used that $|\langle q_i|U|s_i\rangle|\le1$, since $||q_i\rangle|=1$ and $|U|s_i\rangle|=||s_i\rangle|=1$.)

score 3 · Answer 2 · edited Nov 12 '18 at 15:13

Purifications play an important role in the theory of density matrices (or more generally quantum states) because they provide a geometric tool in the explanation and description algebraic relations.

(I'll be following here Bengtsson and Życzkowski's reasoning in derivation of the fidelity formula (section 9.4)).

A positive $ N \times N$ matrix $\rho$ can be (nonuniquely) written as: $\rho = A^{\dagger} A$ Where $A$ is an $N \times M$ matrix. The matrix $A$ can be viewed as a vector in $\mathbb{C}^N \otimes \mathbb{C}^M$, i.e. a purification. Of course, the purification is not unique and any matrix of the form: $$A' = A U$$ where $U$ is an $M\times M$ unitary matrix, is an equally good purification.

Now, in physics, distance functions between points in various spaces can be frequently obtained as the solutions of variational principles, or optimization problems. This is the method adopted by Bengtsson and Życzkowski to define the fidelity function:

Since the multiplication by a unitary matrix $A' = A U$ doesn't change the matrix norm, the various purifications of a density matrix can be viewed as points on a sphere, given purifications of two density matrices: $\rho_1 = A_1^{\dagger} A_1$ and $\rho_2 = A_2^{\dagger} A_2$, a reasonable distance function can be taken as the spherical geodesic distance between their purification vectors: $$\cos d = \frac{1}{2} \operatorname{Tr}(A_1A_2^{\dagger} + A_2A_1^{\dagger})$$ Where $d$ is the angle between the two directions $A_1$, $A_2$. Now, since the above formula depends on the purification, it is reasonable to define the distance between the density matrices (the fidelity) as the maximum distance as we run over all the purifications: $$F(\rho_1, \rho_2) = \max_{U, V} \frac{1}{2} \operatorname{Tr}(A_1 U V^{\dagger} A_2^{\dagger} + A_2 V U^{\dagger} A_1^{\dagger})$$

The solution of the above optimization problem which depends only on the density matrices and not on the specific purifications(as explicitly done by Bengtsson and Życzkowski) is exactly the fidelity function.

Uhlmann extensively uses purifications in his work; one of the most important objects defined by him using purifications is the Uhlmann's phase which is the generalization of Berry's geometric phase for mixed states (and which has numerous applications in quantum computing). Please see the following review of his.

glS · Answer 3 · 2019-11-08T12:47:34.867

Short version

Consider the two following observations:

Given a state $\rho$, the problem of finding purifications of $\rho$ is equivalent to that of finding matrices $A$ such that $\rho=AA^\dagger$. The purifications of $\rho$ are then the vectorisations of these $A$ (i.e. a vector $\Psi$ is a purification of $\rho$ iff its matrix of coefficients $\Psi_{ij}$ satisfies $\Psi\Psi^\dagger=\rho$). In this notation, the overlap between two purifications $A,B$ is written as $\operatorname{Tr}(A^\dagger B$).
We want to find the value of $\operatorname{Tr}|\sqrt\rho\sqrt\sigma|$. As shown in this other answer, this is equivalent to finding the unitary $U$ maximising $\lvert\operatorname{Tr}(U\sqrt\rho\sqrt\sigma)\rvert$, and the unitary achieving this maximum turns out to equal $V^\dagger$ where $V$ is the unitary in the polar decomposition of $\sqrt\rho\sqrt\sigma$.

Observation (2) above tells us why finding the fidelity reduces to a maximisation problem, and how the polar decomposition (or more generally, the singular values) enters into it, while observation (1) tells us why we can understand the terms in this maximisation as overlaps of purifications of the states.

Consider then the $U$ such that $F(\rho,\sigma)=\operatorname{Tr}(U\sqrt\rho\sqrt\sigma)$, and define $A\equiv U\sqrt\rho$ and $B=\sqrt\sigma$. Then $F(\rho,\sigma)=\operatorname{Tr}(AB^\dagger)$, and thus $F(\rho,\sigma)$ equals the overlap between the purification of $\rho$ corresponding to $A$ and the purification of $\sigma$ corresponding to $B$.

Detailed version

The starting observation is that there are two "natural" ways to write down $\sqrt\rho\sqrt\sigma$ (or any other product of two normal operators): using their spectral decompositions, and using the singular value decomposition of their product: $$\sqrt\rho\sqrt\sigma=\sum_{jk} \sqrt{\lambda_j\mu_k}\lvert\lambda_j\rangle\!\langle\lambda_j\rvert \mu_k\rangle\!\langle \mu_k\rvert = \sum_m s_m \lvert s_m^L\rangle\!\langle s_m^R\rvert,$$ where $\rho=\sum_j\lambda_j\lvert\lambda_j\rangle\!\langle\lambda_j\rvert$ and $\sigma=\sum_k\mu_k\lvert\mu_k\rangle\!\langle\mu_k\rvert$ are the spectral decompositions of $\rho$ and $\sigma$, and I denoted with $s_m$ the singular values of $\sqrt\rho\sqrt\sigma$, and with $\lvert s_m^{L(R)}\rangle$ the left (right) singular vectors of $\sqrt\rho\sqrt\sigma$. Note that using these definitions, we have $\lvert\sqrt\rho\sqrt\sigma\rvert=\sum_m s_m \lvert s_m^R\rangle\!\langle s_m^R\rvert$ (if using the definition $\lvert A\rvert\equiv\sqrt{A^\dagger A}$, otherwise replace $R$ with $L$ if you want to define $\lvert A\rvert\equiv\sqrt{AA^\dagger}$), and thus $\operatorname{tr}\lvert\sqrt\rho\sqrt\sigma\rvert=\sum_m s_m$.

Let us now denote with $\lvert\psi_\rho\rangle$ and $\lvert\psi_\sigma\rangle$ a pair of purifications of $\rho$ and $\sigma$. These can be written in general as $$\lvert\psi_\rho\rangle=\sum_k \sqrt{\lambda_k}\lvert\lambda_k\rangle\otimes\lvert u_k\rangle, \\ \lvert\psi_\sigma\rangle=\sum_k \sqrt{\mu_k}\lvert\mu_k\rangle\otimes\lvert v_k\rangle, $$ for arbitrary orthonormal bases $\{u_k\}_k, \{v_k\}_k$. We can then write their overlap as $$\langle\psi_\rho\rvert\psi_\sigma\rangle = \sum_{jk}\sqrt{\lambda_j\mu_k} \langle\lambda_j\rvert\mu_k\rangle\langle u_k\rvert v_k\rangle = \sum_{jk}\langle\lambda_j\rvert\sqrt\rho\sqrt\sigma\lvert\mu_k\rangle \langle u_j\rvert v_k\rangle, $$ where I exploited the fact that $\sqrt\rho\lvert\lambda_j\rangle=\sqrt{\lambda_j}\lvert\lambda_j\rangle$ and $\sqrt\sigma\lvert\mu_k\rangle=\sqrt{\mu_k}\lvert\mu_k\rangle$. Using the singular value decomposition of $\sqrt\rho\sqrt\sigma$ we thus get

\begin{align}\langle\psi_\rho\rvert\psi_\sigma\rangle &= \sum_{jkm} s_m \langle\lambda_j\rvert s_m^L\rangle\!\langle s_m^R\rvert\mu_k\rangle \langle u_j\rvert v_k\rangle \\ &= \sum_m s_m \langle s_m^R\rvert \Bigg(\underbrace{\sum_k \lvert \mu_k\rangle\!\langle \bar{v}_k\rvert }_{U_1}\Bigg) \Bigg(\underbrace{\sum_j \lvert\bar{u}_j\rangle\!\langle \lambda_j\rvert}_{U_2}\Bigg) \lvert s_m^L\rangle, \end{align} where I denoted with $\lvert \bar{u}_j\rangle$ the complex conjugate vector of $\lvert u_j\rangle$, so that $\langle u_j\rvert v_k\rangle=\langle \bar{v}_k\rvert \bar{u}_j\rangle$.

Uhlmann's theorem is almost straightforward from this. The triangle inequality gives $\lvert\langle\psi_\rho\rvert\psi_\sigma\rangle\rvert\le \sum_m s_m$ because matrix elements of unitary matrices are always less than $1$ in modulus, and the inequality is saturated when $U_1 U_2=\sum_m \lvert s_m^R\rangle\!\langle s_m^L\rvert\equiv \mathcal U_{PD}^\dagger$. In terms of the purification vectors $\lvert u_j\rangle,\lvert v_k\rangle$, this happens when $$\langle u_j\rvert v_k\rangle = \langle \mu_k \rvert \Bigg(\sum_m \lvert s_m^R\rangle \!\langle s_m^L\rvert \Bigg) \lvert \lambda_j\rangle = \langle \mu_k\rvert \mathcal U_{PD}^\dagger\rvert \lambda_j\rangle.$$ Note that here $\mathcal U_{PD}$ is the unitary matrix that you get out of the polar decomposition, that is, the $V$ in your post. We can thus conclude that the purifications that saturate the inequality are those of the form \begin{align} \lvert u_j\rangle = V\lvert\bar{\lambda}_j\rangle, \qquad \lvert v_k\rangle = V\mathcal U_{PD}^*\lvert\bar{\mu}_k\rangle, \end{align} or, equivalently, \begin{align} \lvert u_j\rangle = V\lvert j\rangle, \qquad \lvert v_k\rangle = V\lvert j\rangle \langle \mu_k\rvert\mathcal U_{PD}^\dagger\lvert\lambda_j\rangle, \end{align} for any unitary $V$.

So, in conclusion, what does this tell us? That the vectors $\lvert u_j\rangle,\lvert v_k\rangle$ that make the purifications the most aligned are determined by the overlap between the eigenvectors of $\sigma$, and the eigenvectors of $\rho$ rotated through unitary that maps the right singular vectors of $\sqrt\rho\sqrt\sigma$ into the left ones.

Why is this the case? I have no idea, mostly because I don't know of any easy way to relate the SVD of a product of two operators with their eigenvectors.

Intuitive role of the polar decomposition in proof of Uhlmann's theorem for fidelity

3 Answers3

Short version

Detailed version

Linked