Three ways to look at the Bell/GHZ experiment

In 1964 John Stewart Bell proposed an experiment to determine whether the results of quantum measurements were truly random, or governed by hidden variables, i.e. state that exists prior to the measurement, but which we don’t have access to. The experiment involved creating a large number of EPR pairs, and firing them at two observers, Alice and Bob, who measure their photon’s polarisation, choosing the $\updownarrow$ direction or the $\nearrow\llap\swarrow$ direction at random. Determining the result of the experiment involves doing a complex statistical calculation to see if something called Bell’s inequality is satisfied or violated. The Bell experiment was first performed by in 1982 by Alain Aspect, and the result, as most commonly interpreted, is that hidden variables can only exist if Quantum Mechanics is non-local, i.e. if it supports faster-than-light causality!

Some time after Bell proposed his experiment, Greenberger, Horne, and Zeilinger suggested an alternative one to answer the same question. Theirs involved three entangled particles instead of two, but did away with the need for any complicated statistics. We shall consider three ways to look at this experiment. All three approaches lead to the same predictions, but they provide different insights into what is going on.

Version 1: Three spins

The mathematics

Version 2: The quantum computer

The mathematics

Version 3: Treating Alice, Bob, and Charlie as quantum mechanical systems

The mathematics

Version 1: Three spins

This version is easy to analyse but provides few hints as to why the results are seen, or how to find a way out of the non-locality paradox.

In this version three spins are entangled in the state $(\lvert \uparrow\uparrow\uparrow \rangle + \lvert \downarrow\downarrow\downarrow \rangle)/\sqrt 2$ and are sent, respectively, to Alice, Bob, and Charlie who are in different locations. After receipt of their spin, Alice, Bob, and Charlie each toss a coin and measure their spin along the X-axis if they get tails, and the Y-axis if they get heads. Critically, all three do this simultaneously so that no information about the coin toss or the measurement result can reach the others before they choose an axis or make a measurement.

The results are that Alice, Bob, and Charlie observe a 50% chance of getting a +1 and a 50% chance of a -1, and this is independent of whether they got H or T in the coin toss. However, when all three get tails (TTT) multiplying together the measurements always results in +1, and when just one gets tails (THH, HTH, or HHT) multiplying them always results in -1. So, for example (TTT, -1, -1, 1) is possible, but (TTT, -1, -1, -1) is not. This result seems to either rule out hidden variables or allow for non-local causality. But why?

Alice knows she gets +1 and -1 each half the time. As far as she is concerned there are two possibilities: 1) the result is determined by hidden variables before she makes her measurement; 2) the result is chosen at random at the exact moment the measurement is made. She can rule out (1) by a simple thought experiment. If it were the case then she could label the results she would get if she were to measure along the X-axis $X_1$, and the result she would get if she were to measure only the Y-axis as Y1. Likewise Bob could label his possible results as $X_2$, and $Y_2$, and Charlie could label his $X_3$, $Y_3$. Alice knows from past experience that $X_1Y_2Y_3$, $Y_1X_2Y_3$ , and $Y_1Y_2X_3$ are always ‑1. But this would imply that $X_1X_2X_3$ is also always -1 as it is what you get if you multiply the former 3 terms together. Since $X_1X_2X_3$ is in fact always +1 Alice can rule out (1). But if her result is chosen only at the moment she makes the measurement then there must be some faster-than-light communication to Bob and Charlie. For example, if they all decide to measure X and Alice measures a -1, then Bob and Charlie's spins need to get updated instantaneously so that they cannot both measure the same value, and if she measures +1 then Bob and Charlie's spins need to get updated instantaneously so that they must measure the same value.

The mathematics

Let’s see if we can work out why this is the result predicted.

A short reprise on quantum measurement: A measurement in quantum mechanics corresponds to a Hermitian operator. These always have a set of orthonormal eigenvectors, and so any state can be decomposed as a sum of these. For example, $\sigma_z$ is the operator corresponding to measurement of a spin along the Z-axis. It's eigenvectors are $\lvert\uparrow\rangle$ and $\lvert\downarrow\rangle$. If $\lvert\Psi\rangle$ is a spin pointing along the X-axis then $\lvert\Psi\rangle = \left(\lvert\uparrow\rangle + \lvert\downarrow\rangle\right)/\sqrt 2$. According to the Born rule, eigenvalues are the possible measurement results, and the probability of each result is given by the square of the magnitude of the corresponding eigenvector's coefficient. So in this example if we were to measure $\sigma_z$ when the state is $\lvert\Psi\rangle$ we'd have a 50% chance of getting +1 and a 50% chance of getting -1. A consequence of this rule is that if the original state is an eigenvector of the measurement if, and only if, the result is 100% known in advance.

In the case where we have 3 entangled spins the space is a product space $V\otimes V\otimes V$, where $V$ is the space for a single spin, and the measurement that Alice, Bob, and Charlie would make if they got HTH would be the product operator $\sigma_y\otimes\sigma_x\otimes\sigma_y$. Now, $\sigma_x$ takes $\lvert\uparrow\rangle$ to $\lvert\downarrow\rangle$ and vice versa, whereas $\sigma_y$ takes $\lvert\uparrow\rangle$ to $i\lvert\downarrow\rangle$ and $\lvert\downarrow\rangle$ to $-i\lvert\uparrow\rangle$. This means that $\sigma_y\otimes\sigma_x\otimes\sigma_y$ takes $\left(\lvert\uparrow\uparrow\uparrow\rangle+\lvert\downarrow\downarrow\downarrow\rangle\right)/\sqrt 2$ to

$$
\begin{align*}
&\frac{1}{\sqrt 2}(
\sigma_y\lvert\uparrow\rangle\otimes
\sigma_x\lvert\uparrow\rangle\otimes
\sigma_y\lvert\uparrow\rangle +
\sigma_y\lvert\downarrow\rangle\otimes
\sigma_x\lvert\downarrow\rangle\otimes
\sigma_y\lvert\downarrow\rangle) = \\
&\frac{1}{\sqrt 2}(
i^2
\lvert\downarrow\rangle\otimes
\lvert\downarrow\rangle\otimes
\lvert\downarrow\rangle +
(-i)^2
\lvert\uparrow\rangle\otimes
\lvert\uparrow\rangle\otimes
\lvert\uparrow\rangle) = \\
-&\frac{1}{\sqrt 2}(\lvert\uparrow\uparrow\uparrow\rangle +
\lvert\downarrow\downarrow\downarrow\rangle)
\end{align*}
$$

This tells us that $(\lvert\uparrow\uparrow\uparrow\rangle + \lvert\downarrow\downarrow\downarrow\rangle)/\sqrt 2$ is an eigenvector of $\sigma_y\otimes\sigma_x\otimes\sigma_y$ with eigenvalue -1, i.e. that when the coins come up HTH, the product of the measurements is 100% guaranteed to be -1, and likewise for THH, and HHT.

A similar calculation shows that $(\lvert\uparrow\uparrow\uparrow\rangle + \lvert\downarrow\downarrow\downarrow\rangle)/\sqrt 2$ is also an eigenvector of $\sigma_x\otimes\sigma_x\otimes\sigma_x$, but this time with eigenvalue 1. This tells us that when Alice, Bob, and Charlie throw TTT the product of their measurements is 100% guaranteed to be +1.

Version 2: The quantum computer

This version is a bit more confusing but more clearly illustrates how destructive interference causes some results to be eliminated from the set of possible outcomes.

In this version we use a quantum computer. Instead of spins $\lvert \uparrow \rangle$ and $\lvert \downarrow \rangle$ we talk about qbits $\lvert 0 \rangle$ and $\lvert 1 \rangle$, and instead of products of +1s and -1s we talk about the parities of 3-tuples such as 011.

In the diagram above, each horizontal line represents one qbit, and each rectangle is a "gate". Unlike measurements, which "collapse" the wavefunction, gates represent a change-over-time of the wavefunction. Measurement is non-reversible since, for example, a measurement of a single qbit system leaves it in just one of two states, whereas the original state space is infinite. By contrast, a gate is completely reversible, in the same way that the Schrödinger equation is symmetric in forward and reverse time.

The mathematics

Gates are represented by the following unitary operators:

The first gate applies to just Alice's qbit and is called a Hadamard gate. It takes

$\lvert 0\rangle$ to $(\lvert 0\rangle + \lvert 1\rangle)/\sqrt 2$, and
$\lvert 1\rangle$ to $(\lvert 0\rangle - \lvert 1\rangle)/\sqrt 2$

The second and third gates are CNOT, or Controlled NOT, operators. These apply to the product space of two qbits. CNOT takes

$\lvert 0x\rangle$ to $\lvert 0x\rangle$, and
$\lvert 1x\rangle$ to $\lvert 1\bar{x}\rangle$.

The $R_x(\pi/2)$ gates rotate a single qbit about the X-axis in the Bloch sphere. The exponents$h_A$, $h_B$, and $h_C$ indicate that Alice, Bob, and Charlie should only implement this gate if they toss heads. This gate takes

$\lvert 0 \rangle$ to $(\lvert 0\rangle -i \lvert 1\rangle)/\sqrt 2$, and
$\lvert 1\rangle$ to $(-i\lvert 0\rangle + \lvert 1\rangle)/\sqrt 2$.

The $R_y(-\pi/2)$ gates rotate a single qbit about the Y-axis in the Bloch sphere. The exponents $t_A$, $t_B$, and $t_C$ indicate that Alice, Bob, and Charlie should only implement this gate if they toss tails. This gate takes

$\lvert 0 \rangle$ to $(\lvert 0\rangle - \lvert 1\rangle)/\sqrt 2$, and
$\lvert 1\rangle$ to $(\lvert 0\rangle + \lvert 1\rangle)/\sqrt 2$.

The final rectangles represent measurement along the Z-axis, which is defined by the eigenvectors $\lvert 0\rangle$ and $\lvert 1\rangle$.

With this background we can calculate each of the states $\lvert\Psi_1\rangle$, $\lvert\Psi_2\rangle$, $\lvert\Psi_3\rangle$, $\lvert\Psi_4\rangle_{hht}$, and $\lvert\Psi_5\rangle_{hht}$ (the hht subscript indicates that this is the situation in which the coin tosses obtained the result HHT).

$$
\require{cancel}
\begin{align*}
\lvert \Psi_1 \rangle &= \lvert 000\rangle \\
\\
\lvert \Psi_2 \rangle &= \frac{1}{\sqrt 2}(\lvert 000\rangle +
\lvert 100\rangle) \\
\\
\lvert \Psi_3 \rangle &= \frac{1}{\sqrt 2}(\lvert 000\rangle +
\lvert 111\rangle) \\
\\
\lvert \Psi_4 \rangle_{hht} &= \frac{1}{2\sqrt 2}(\\
&\lvert 000\rangle -i\lvert 010\rangle -i \lvert 100\rangle -
\lvert 110 \rangle \\
-&\lvert 001\rangle -i\lvert 011\rangle -i \lvert 101\rangle +
\lvert 111 \rangle) \\
\\
\lvert \Psi_5 \rangle_{hht}&= \frac{1}{4}(\\
&\cancel{\lvert 000\rangle} -\lvert 001\rangle - i\lvert
010\rangle +i\cancel{\lvert 011 \rangle} \\
-i&\lvert 100\rangle +i\cancel{\lvert 101\rangle} -
\cancel{\lvert 110\rangle} +\lvert 111 \rangle \\
-&\cancel{\lvert 000\rangle} -\lvert 001\rangle -i\lvert
010\rangle -i\cancel{\lvert 011 \rangle} \\
-i&\lvert 100\rangle -i\cancel{\lvert 101\rangle}
+\cancel{\lvert 110\rangle} +\lvert 111 \rangle
)
\end{align*}
$$

This shows that only parity 1 combinations remain. A similar calculation for $\lvert \Psi_5 \rangle_{ttt}$ leaves only parity 0 combinations! This shows how the possible outcomes can be described to be as a result of interference.

Version 3: Treating Alice, Bob, and Charlie as quantum mechanical systems

Is there any reason we cannot treat everything in the experiment, including Alice, Bob, and Charlie, as quantum mechanical? We know the participants are perfectly isolated from each other by distance and the speed of light, so there’s really no problem in this. In this treatment the initial state can be represented as a superposition in which each term represents a pre-determined outcome. The only problem is that you have to believe in the multiverse.

The qbits Alice, Bob, and Charlie start off $\lvert h \rangle$, for heads, giving an initial state for the entire system of $\lvert \Psi_1 \rangle =\lvert hhh 000 \rangle$. To get to $\lvert \Psi_2 \rangle$ the lower subsystem is entangled in the 3-qbit Bell state $(\lvert 000 \rangle + \lvert 111 \rangle)/\sqrt 2$ as before, but in addition to this each of Alice, Bob, and Charlie are transformed by Hadamard gates. This leaves the entire system in the product state

$$
\lvert \Psi_2\rangle = \frac{1}{4}\sum_{a,b,c
\in \{h,t\}}{\lvert abc \rangle} \otimes (\lvert 000 \rangle +
\lvert 111 \rangle)
$$

The $R_x(\pi/2)$ rotation gates are now controlled gates that only occur if the corresponding participant state is $\lvert h \rangle$. The $X$ gate swaps $\lvert h \rangle$ and $\lvert t \rangle$, so the $R_y(-\pi/2)$ are also controlled but only occur if the participant state is $\lvert t \rangle$. In this version $\lvert \Psi_3 \rangle$ is a complicated state like

$$
\lvert \Psi_3\rangle = \sum_{a,b,c \in
\{h,t\}}\sum_{x,y,z \in \{0,1\}} \psi_{abcxyz}\lvert abcxyz
\rangle
$$

in which certain coefficients, such as $\psi_{ttt111}$ or $\psi_{hht110}$ have value 0. This is the multiverse interpretation in which collapse never occurs!

The mathematics

The above picture shows no interaction with any classical system and so we are allowed to treat all the gates as a single unitary operator $U$. If we were to append a measurement in the computational basis, its operator would be $Z^{\otimes 6}$ where $Z$ is the pauli Z matrix. We could then treat the entire circuit as a single measurement. What operator $A$ would that correspond to?

We know that if we were to start off with state $\lvert \Psi_1 \rangle = U^\dagger \lvert hhh000 \rangle$ the final result would be 100% guaranteed to be $\lvert hhh000 \rangle$. A similar thing would be the case for all the other combinations of 3 h’s and t’s, and 3 0’s and 1’s. The only initial states which have known final states following a measurement are its eigenvectors. This means we know the eigenvectors of $A$ must be

$$
U^\dagger \lvert abcxyz \rangle \ \dots a,b,c \in \{h,t\}, x,y,z \in \{0,1\}
$$

Knowing the eigenvectors and eigenvalues uniquely identifies the operator, so we must have that

$$
A = U^\dagger Z ^{\otimes 6} U
$$

In the illustration above we did not start off in an eigenstate of $A$, but instead the state

$$
\lvert \Psi_1 \rangle = \lvert hhh000
\rangle .
$$

If we were to decompose this using the orthonormal set of eigenvectors $U^\dagger \lvert abcxyz \rangle $ we would find that all the terms where there’s one t and two h’s in $abc$, and $xyz$ has parity zero, are missing. Likewise all the terms where there are three t’s in $abc$, and $xyz$ has parity one, are missing.

Viewed this way there is no need to explain the outcomes of any collapse as no collapse occurs. Subjectively when Alice, Bob, and Charlie come together to compare results, all that happens is that they determine which term in the original superposition they were in all along.

Search This Blog

Dork Scratchings