What is the most general way to describe post-measurement states?

Question

Background

Generally speaking, the description of post-measurement states associated with a POVM seems to always pass through, in some form or another, the formalism of Kraus operators. For example:

Measurement from an arbitrary channel — Consider a general channel $\Phi$ with Kraus representation $\Phi(\rho)=\sum_a A_a \rho A_a^\dagger$. The collection of operators $A_a^\dagger A_a$ is then always a POVM. Furthermore, via Stinespring, $\Phi$ can always be written as $\Phi(\rho)=\operatorname{Tr}_2[V_\Phi \rho V_\Phi^\dagger]$ with the isometry $V_\Phi$ defined as $$V_\Phi = \sum_a A_a\otimes |a\rangle, \qquad V_\Phi |\psi\rangle = \sum_a A_a |\psi\rangle\otimes |a\rangle,$$ so that $$V_\Phi \rho V_\Phi^\dagger = \sum_{a,b} A_a \rho A_b^\dagger\otimes |a\rangle\!\langle b|.$$ This allows interpreting $\Phi$ as the act of performing the measurement $\{A_a^\dagger A_a\}$, with post-measurement states (up to renormalisation) $A_a \rho A_a^\dagger$, and this measurement can be realised concretely by evolving the state through the isometry $V_\Phi$, and then measuring the second degree of freedom in the computational basis $\{|a\rangle\}_a$.
"General measurements" formalism — The so-called formalism of general measurements, which given a POVM with elements written as $A_a^\dagger A_a$ prescribes post-measurement states of the form $$\frac{A_a \rho A_a^\dagger}{\operatorname{Tr}[A_a^\dagger A_a \rho]},$$ seems to be essentially equivalent to the latter approach.
Defining post-measurement states from a POVM — If we are instead given a POVM $\mu$, there are infinitely many ways to decompose its elements as $\mu_a=A_a^\dagger A_a$ for a collection of Kraus operators $A_a$, and thus multiple ways to define post-measurement states corresponding to measuring $\mu$, as discussed in this answer to a related question. The gist, again, is that we can consider the channel $\Phi$ with $$\Phi(\rho) = \sum_a A_a \rho A_a^\dagger\otimes |a\rangle\!\langle a|,$$ which provides a natural way to formalise having both classical and quantum outcomes for the measurement. This kind of channel is known in the literature as a quantum instrument.

On the other hand, given a POVM $\mu$, consider an entanglement-breaking channel $\Phi$ of the form $$\Phi(\rho) = \sum_a \langle \mu_a ,\rho\rangle \sigma_a\equiv \sum_a \operatorname{Tr}(\mu_a\rho) \sigma_a.$$ To some degree, this kind of channel seems to be an even more apt description of a measurement process: we see directly how each outcome $a$ corresponds to a post-measurement state $\sigma_a$. In contrast, the formalism with Kraus operators connects measurement outcomes probabilities and post-measurement states more tightly: $A_a \rho A_a^\dagger$ describes both at the same time, and we can't choose arbitrary post-measurement states to attach to each outcome $a$ (although we do have some freedom, as discussed in the linked post above).

Question

This brings me to the question: why should we consider as the most general description of post-measurement outcomes the formalism with Kraus operators (in one of the shapes outlined above, which I'm considering as all essentially equivalent for the purpose of this discussion)? Why not describe instead post-measurement states via generic entanglement breaking channels attached to the POVM, which would allow to describe more general situations as far as the allowed post-measurement states are concerned?

I have a vague hunch that a possible answer lies in the fact that an entanglement breaking channel of the form outlined above might have a Choi rank larger than the number of outcomes in the corresponding POVM, and thus in some sense would describe a different measurement with a larger number of possible outcomes. I'm not completely sure how to properly formalise this though.

Adam Zalcman · Answer 1 · 2022-09-01T21:08:35.487

TL;DR: There are three possible fates that befall quantum information$^1$ subjected to measurement: it may be converted to classical information, it may be lost in the environmental degrees of freedom or it may be retained in the system under study. Entanglement-breaking channels describe processes in which quantum information is either lost or converted to classical information. Therefore, this type of channel fails to account for the measurements that preserve some of the original quantum information. Moreover, in a world where all measurements are described by entanglement-breaking channels, standard scheme for quantum error correction doesn't work.

Naimark dilation

The above idea is formalized in Naimark dilation theorem which states that any measurement may be realized as a unitary on the Hilbert space $\mathcal{H}_A\otimes\mathcal{H}_E\otimes\mathcal{H}_M$ describing the system under study $A$, its environment $E$ and a measurement apparatus $M$, followed by a projective measurement on $M$ and a partial trace over $E$. The partial trace can be thought of as a projective measurement whose results have been lost due to our lack of control over the environmental degrees of freedom.

This allows us to classify measurements in terms of quantum information they preserve in $A$, the quantum information they lose in $E$ and the classical information they produce in $M$. For example, a unitary on $A$ preserves all quantum information in $A$, loses nothing in $E$ and obtains no classical information in $M$. As another example, a fully depolarizing channel $\mathcal{D}(\rho)=\frac{I}{N}$ loses all input quantum information while obtaining no classical information.

Partial measurements

Similarly, a projective measurement obtains some classical information while losing no quantum information to $E$, i.e. it keeps all information within $A$ and $M$. If the projective measurement is degenerate then the measurement outcome does not uniquely identify the post-measurement state so some information remains in $A$. If the measurement is non-degenerate then the outcome completely determines the post-measurement state and thus all information passes from $A$ to $M$. In both cases, the measurement is efficient in the sense that no information is lost in $E$.

Suppose we measure $|\psi\rangle=\alpha|00\rangle+\beta|01\rangle+\gamma|10\rangle+\delta|11\rangle$ using $P_0=\mathrm{diag}(1, 0, 0, 1)$ and $P_1=\mathrm{diag}(0, 1, 1, 0)$ and obtain outcome $0$. The post-measurement state is $\frac{\alpha|00\rangle+\delta|11\rangle}{\sqrt{|\alpha|^2+|\delta|^2}}$. Note that the set of all possible post-measurement states associated with this outcome is infinite. Thus, the knowledge of the measurement outcome does not uniquely identify the post-measurement state. In fact, the set contains an infinite number of pure states. Therefore, the measurement leaves some "uncollapsed" quantum information behind. At the same time, the classical information it obtains is only partial - it yields the parity of the two qubits but fails to distinguish between $|00\rangle$ and $|11\rangle$.

This possibility is the reason why general measurement cannot be described using generic entanglement breaking channels. Those channels correspond to an extreme special case of the measurement process. Namely, the case where all quantum information is either lost or converted to classical information. This can be shown more rigorously by noting that on one hand the output of any entanglement breaking channel is a convex combination of a finite set of pure states and on the other hand the set of post-measurement states of a partial measurement, like the parity measurement above, may contain an infinite number of pure states.

Proof by quantum error correction

There is an interesting connection to quantum error correction. If entanglement-breaking channels of the form $$ \Phi(\rho) = \sum_i\mathrm{tr}(\mu_i\rho) \sigma_i\tag1 $$ were sufficient to describe general measurements, then the standard scheme for quantum error correction - which entails syndrome measurements for error diagnostics followed by active error correction$^2$ - would be impossible.

To see this, let $\mathcal{E}:L(\mathbb{C}^2)\to\mathcal{H}$ denote an operation that encodes a logical qubit into a quantum system with Hilbert space $\mathcal{H}$, let $\mathcal{N}:\mathcal{H}\to\mathcal{H}$ represent the noise in the system, let the set of non-trace-preserving operations $\mathcal{S}_a:\mathcal{H}\to\mathcal{H}$ represent the syndrome measurements labeled by syndromes $a$, let $\mathcal{R}_a:\mathcal{H}\to\mathcal{H}$ represent the unitary recovery operations conditional on the classical syndrome information $a$ and finally let $\mathcal{D}:\mathcal{H}\to L(\mathbb{C}^2)$ denote the decoding operation. Quantum error correction is successful if $$ \mathcal{D}(\mathcal{R}_a(\mathcal{S}_a(\mathcal{N}(\mathcal{E}(\rho)))))=p(a|\rho)\cdot\rho\tag2 $$ where $p(a|\rho)=\mathrm{tr}(\mathcal{S}_a(\mathcal{N}(\mathcal{E}(\rho))))$ is the probability of measuring syndrome $a$.

Now, suppose that for some $a$, the measurement operation $\mathcal{S}_a$ can be written as $\mathcal{S}_a(\rho) = \sum_i\mathrm{tr}(\mu_i\rho)|\psi_i\rangle\langle\psi_i|$ for some finite set of pure states $\{|\psi_i\rangle\}$. In this case, $(2)$ fails, because the left-hand side is a convex combination of a finite set of states $\{\mathcal{D}(\mathcal{R}_a(|\psi_i\rangle\langle\psi_i|))\}$ while for any pure $\rho$ the right-hand side is an extreme point of the set of single-qubit states and thus not a convex combination of any finite set of states that doesn't include $\rho$. The case of $\mathcal{S}_a$ of the form $\mathcal{S}_a(\rho) = \sum_i\mathrm{tr}(\mu_i\rho)\sigma_i$ for arbitrary $\sigma_i$ can be reduced to the preceding case by expanding $\sigma_i$ as a convex combination of pure states.

Thus, the fact that general quantum measurement allows for the possibility of obtaining partial information about a quantum state while leaving behind a state collapsed to a subspace whose dimension is greater than one is key to the feasibility of quantum error correction.

^{$^1$ Here, we use the term "information" informally as "the thing that information processing devices process". Similarly, "quantum information" is understood informally as "the stuff that quantum information processing devices process". In particular, here the latter term does not denote von Neumann entropy.

$^2$ In practice, active error correction is avoided and replaced by classical bookkeeping using the Pauli frame.}

score 0 · Answer 2 · answered Aug 18 '24 at 16:01

A few observations:

Saying that an "entanglement breaking (EB) channel describes the measurement process" is just not a very meaningful statement. First of all, it being a single channel as I defined it, it can't be describing post-measurement states. It could possibly describe the post-measurement state when we don't know the outcome. However, that is not the case in general, as explained by the other answer and as I will explicitly show in my examples below.
The most general way to describe measurements with their post-measurement states is via quantum instruments: channels of the form $\Phi(\rho)=\sum_i \Phi_i(\rho)\otimes \mathbb{P}_i$ with $\Phi_i$ completely positive maps such that $\sum_i \Phi_i$ is CPTP. These can give rise to entanglement breaking channel in the sense that $\sum_i \Phi_i$ might be EB. This is typically the case when the measurement is "maximally destructive". These channels are also connected to the POVM $\mu$ describing the same measurement via $\mu_i= \Phi_i^\dagger(I)$. This is also discussed in this other answer.
How destructive a measurement is determins how reversible or close to reversible the channels describing post-measurement states can be.

E.g. in the extreme case of a trivial $m$-outcome POVM, the quantum instrument needs to have $\Phi_i^\dagger(I)=I/m$, which can be satisfied by $\Phi_i(\rho)=\frac1m U_i\rho U_i^\dagger$ for any set of (not necessarily distinct) unitaries $U_i$. In such situation the act of measurement is perfectly reversible (and in the limit case of $U_i=U_j$ there's no measurement feedback at all). It's still possible to implement the measurement in other ways that are not reversible, but that's unavoidable, see again this answer for more details on this point.

We can also characterise the other end of the spectrum: consider a rank-1 POVM, with elements $\mu_a = w_a \mathbb{P}_{\psi_a}$ with $w_a>0$ such that $\sum_a w_a= d$, with $d$ dimension of the space. That means $\Phi_a^\dagger(I)=w_a \mathbb{P}_{\psi_a}$, which in turn implies that $\Phi_a^\dagger$ must be a replacement map: $\Phi_a^\dagger(\rho) = \operatorname{tr}(\rho)w_a\mathbb{P}_{\psi_a}$, and thus also $\Phi_a(\rho) = w_a I \langle\psi_a|\rho|\psi_a\rangle$. In other words, all post-measurement states must be the equal to each other, and to the maximally mixed state.

Here's a few explicit examples that I think clarify the issue:

Example with projective POVM

Consider a two-qubit system, where we want to measure a single-qubit input in the first register. To perform this measurement, we perform a unitary evolution through a Hadamard gate on the first qubit followed by a CNOT with the second qubit as target, and then measure the second register on the computational basis.

If the second qubit starts in $|0\rangle$, and the input qubit in the first register is $|\psi\rangle=\alpha|0\rangle+\beta|1\rangle$, then the output is (neglecting normalisation): $$|\psi\rangle|0\rangle\to (\alpha+\beta)|00\rangle+(\alpha-\beta)|11\rangle.$$ It follows that measuring the second qubit in the computational basis completely collapses the first qubit. The POVM $\mu\equiv \{\mu_0,\mu_1\}$ describing the situation is projective with elements $\mu_0=\mathbb{P}_+$ and $\mu_1=\mathbb{P}_-$. The state of the first register ignoring the measurement outcome can be described by an entanglement breaking channel: $$\Phi(\rho) = \langle \mathbb{P}_+,\rho\rangle \mathbb{P}_0 + \langle\mathbb{P}_-,\rho\rangle \mathbb{P}_1, \qquad \mathbb{P}_\psi\equiv |\psi\rangle\!\langle\psi|.$$ We can characterise both post-measurement states and outcome probabilities with a quantum instrument as: $$\Phi_{\rm QI}(\rho) = \Phi_0(\rho)\otimes \mathbb{P}_0+\Phi_1(\rho)\otimes \mathbb{P}_1, \quad \Phi_0(\rho)\equiv \langle\mathbb{P}_{+},\rho\rangle \mathbb{P}_{0}, \quad \Phi_1(\rho)\equiv \langle\mathbb{P}_{-},\rho\rangle \mathbb{P}_{1}.$$ Note that starting from the POVM $\mu$, we could have obtained the Kraus operators $A_0=|0\rangle\!\langle +|$ and $A_1=|1\rangle\!\langle -|$, and written the post-measurement states as $\Phi_i(\rho)= A_i \rho A_i^\dagger$. The POVM is then successfully recovered from these maps as $\Phi_i^\dagger(I)=\mu_i$.

Example with trivial two-outcome POVM

On the other hand, suppose after the same evolution the second register is measured in the eigenbasis of $\sigma_X$ instead. Or equivalently, say we're applying a Hadamard gate to the second register before measuring in the computational basis. The evolution then reads: $$|\psi\rangle|0\rangle \to [(\alpha+\beta)|0\rangle+(\alpha-\beta)|1\rangle]\otimes |0\rangle + [(\alpha+\beta)|0\rangle-(\alpha-\beta)|1\rangle]\otimes |1\rangle.$$ In this case the outcome probabilities are $\|(\alpha+\beta)|0\rangle\pm(\alpha-\beta)|1\rangle\|^2=1/2$, and thus the corresponding POVM is the trivial one: $\mu_0=\mu_1=I/2$. In other words, this measurement doesn't "steal" any amount of information from the input state. As a quantum instrument this measurement is described by $$\Phi_{\rm QI}(\rho) = \sum_{i=0,1}\Phi_i(\rho)\otimes\mathbb{P}_i, \\ \Phi_i(\rho) \equiv \frac14(I \langle I,\rho\rangle + Z \langle X,\rho\rangle \pm X \langle Z,\rho\rangle \mp Y\langle Y,\rho\rangle).$$ In fact, one can also just observe directly from the output state that $\Phi_i$ are unitary channels up to weight factors: $\Phi_i(\rho)=\frac12 U_i \rho U_i^\dagger$ with $$U_0 = H\equiv \frac1{\sqrt2}\begin{pmatrix}1&1\\1&-1\end{pmatrix}, \quad U_1 = ZH= \frac1{\sqrt2}\begin{pmatrix}1&1\\-1&1\end{pmatrix}.$$ This is not accidental: if the measurement is trivial, no information is stolen from the input state, which means the effect of the measurement is reversible, i.e. unitary.

Finally, ignoring the outcome the post-measurement state is $$\Phi(\rho) = \frac12(U_0\rho U_0^\dagger + U_1 \rho U_1^\dagger),$$ which is a mixed-unitary channel. This channel is again entanglement-breaking, as its Choi is $J(\Phi) = \mathbb{P}_0\otimes\mathbb{P}_++\mathbb{P}_1\otimes\mathbb{P}_-$ (see also here about EB channels).

However, the same identical POVM, $\mu_i=I/2$, can correspond to a quantum instrument which gives a non-EB post-measurement outcome-agnostic state. This is borderline trivial: the trivial measurement can be implemented by having the second register not interact with the input state at all. In which case the instrument would just be $\Phi_{\rm QI}(\rho) = \frac12\rho+\frac12\rho=\rho$, which is also the same as the state you get ignoring the measurement outcome. And of course, the identity channel is not entanglement breaking.

Example with a partially depolarising channel

As shown here, the depolarising channel for a single qubit, $\Phi_p(\rho)=p\rho+(1-p)\operatorname{tr}(\rho) I/2$, can be implemented via the Stinespring isometry: $$V = \begin{pmatrix} \sqrt{\frac{1+3p}{4}} \,I \\ \sqrt{\frac{1-p}{4}} \,Z \\ \sqrt{\frac{1-p}{2}} E_{01} \\ \sqrt{\frac{1-p}{2}} E_{10} \end{pmatrix} = \sum_{i=1}^4 |i\rangle\otimes A_i.$$ This isometry acts on a three-qubit space, with the first qubit being our input, and two ancillary qubits needed to obtain the channel. Let's consider a situation where we first evolve the state through this isometry $V$, and then measure in the computational basis the last two qubits.

We thus have four possible outcomes, and conditionally to observign the $i$-th outcome the first qubit becomes, up to renormalisation, $\Phi_i(\rho)\equiv A_i \rho A_i^\dagger$. More explicitly, we have $$\Phi_1(\rho) = \frac{1+3p}{4} \rho, \qquad \Phi_2(\rho) = \frac{1-p}{4} Z\rho Z, \\ \Phi_3(\rho) = \frac{1-p}{2} \mathbb{P}_0 \langle\mathbb{P}_1,\rho\rangle, \qquad \Phi_4(\rho) = \frac{1-p}{2} \mathbb{P}_1 \langle\mathbb{P}_0,\rho\rangle.$$ The corresponding POVM is then $\mu_i=\Phi_i^\dagger(I)$, that is, $$ \mu_1 = \frac{1+3p}{4} I, \quad \mu_2 = \frac{1-p}{4} I, \quad \mu_3 = \frac{1-p}{2} \mathbb{P}_1, \quad \mu_4 = \frac{1-p}{2} \mathbb{P}_0. $$ The channel describing the state when we do not know the measurement outcome is $\Phi=\sum_i \Phi_i$, which here is by construction nothing but the original depolarising channel $\Phi_p$. It therefore follows that in this case the post-measurement state is not given by an entanglement breaking channel, except for the trivial case $p=0$.