quantum algorithms:
In quantum physics/quantum information theory, What came to be called Bell’s inequality (Bell 1964) is an inequality satisfied by the three pairwise correlation functions between three random variables defined on one and the same classical probability space. As such, it is an elementary statement about classical probability theory which as been argued (Pitowsky 1989a) to have been known already to Boole (1854).
The point of the argument by Bell 1964 was to highlight that when taking these three random variables to be the results of quantum measurements of the spin of an electron along three pairwise non-orthogonal axes (as in the Stern-Gerlach experiment) then quantum theory predicts that this inequality is violated – implying that there is no single classical probability space (called a hidden variable in the context of interpretations of quantum mechanics) on which these three quantum measurement-results are jointly random variables.
A number of experiments have sought to check Bell’s inequalities in quantum physics (“Bell tests”) and all claim to have verified that it is indeed violated in nature (see Aspect 2015), as predicted by quantum theory.
Bell’s inequality has been and is receiving an enormous amount of attention, first in discussions of interpretations of quantum mechanics, but more recently and more concretely also in the context of quantum information theory.
The following is fairly verbatim recap of the original argument in Bell 1964. For a streamlined re-statement see further below.
Let us denote the result A of a measurement that is determined by a unit vector, $\vec{a}$, and some parameter $\lambda$ as $A(\vec{a},\lambda)=\pm 1$ where we further suppose that the outcome of the measurement is either +1 or -1. Likewise, we may do the same for the result B of a second measurement, i.e. $B(\vec{b},\lambda)$. We further make the vital assumption that the result B does not depend on $\vec{a}$ and likewise A does not depend on $\vec{b}$.
Before proceeding, we should note that $\lambda$ here plays the role of a “hidden” parameter or variable. We say it is “hidden” because its precise nature is not known. However, it is still a very real parameter with a probability distribution $\rho(\lambda)$. The expectation value of the product of the two measurements is
Because $\rho$ is a normalized probability distribution,
and because $A(\vec{a},\lambda)=\pm 1$ and $B(\vec{b},\lambda)=\pm 1$, P cannot be less than -1. It can be equal to -1 at $\vec{a}=\vec{b}$ only if $A(\vec{a},\lambda)=\pm 1 = -B(\vec{a},\lambda)=\pm 1$ except at a set of points $\lambda$ of zero probability. Thus we can write (1) as
If we introduce a third unit vector $\vec{c}$ we can find the difference between the correlation of $\vec{a}$ to the two other unit vectors,
Rearranging this we may write (2) as
Given the limitations we have placed on the value of A, we may write
But the second term on the right is simply $P(\vec{b},\vec{c})$ and thus
which is the original form of Bell’s inequality. Note that this may be written in terms of correlation coefficients,
where a, b, and c are now settings on the measurement apparatus.
The original derivation of Bell’s inequalities involved the use of a Stern-Gerlach device that measures spin along an axis. Suppose $\sigma_{1}$ and $\sigma_{2}$ are spins. The result, A, of measuring $\sigma_{1}\cdot\vec{a}$ is then interpreted as being entirely determined by $\vec{a}$ and $\lambda$. Likewise for B and $\sigma_{2}\cdot\vec{b}$. It is also important to remember that the result B does not depend on $\vec{a}$ and likewise A does not depend on $\vec{b}$.
For a singlet state (that is a state with total spin of zero), the quantum mechanical expectation value of measurements along two different axes (see the Wigner derivation below for a more intuitive explanation of the physical nature of this) is
In theory this ought to equal $P(\vec{a},\vec{b})$ but in practice it does not. It is important to remember that we are using classical reasoning throughout our derivations of the various forms of Bell’s inequalities.
The setup envisioned here consists of pairs of spin-1/2 particles produced in singlet states that then each pass through separate Stern-Gerlach (SG) devices. Since they are in singlet states, if we measured the first particle of a pair to be aligned with a given axis, say $\vec{a}$, then the second should be measured to be anti-aligned with that same axis, giving a total spin of zero.
In practice we are dealing with beams of particles and thus we can never be absolutely certain that correlated pairs are measured simultaneously and so we ultimately are making statistical predictions. Nevertheless, in a given sample consisting of a large-enough number of randomly distributed spin-1/2 particles, we can be certain that, for example, a definite number are aligned with an axis $\vec{a}$ while a definite number are aligned with an axis $\vec{b}$.
Now take an individual particle and suppose that, for this particle, if we measured $\sigma\cdot\vec{a}$ we would obtain a +1 with certainty (meaning it is aligned with $\vec{a}$) but if we instead chose to measure $\sigma\cdot\vec{b}$ we would obtain a -1 with certainty (meaning it is anti-aligned with $\vec{b}$). Notationally we refer to such a particle as belonging to type $(\vec{a}+,\vec{b}-)$. Clearly for a given pair of particles in a singlet state, if particle 1 is of type $(\vec{a}+,\vec{b}-)$, then particle 2 must be of type $(\vec{a}-,\vec{b}+)$.
For beams of correlated particles measuring along only two axes, we should expect to get a roughly evenly balanced distribution of types as follows:
There is a very important assumption implied here. Suppose a particular pair belongs to the first grouping, that is if an observer A decides to measure the spin along $\vec{a}$ for particle 1, he or she necessarily obtains a plus sign (corresponding to it being aligned with $\vec{a}$) regardless of any measurement observer B may make on particle 2. This is the principle of locality: A’s result is predetermined independently of B’s choice of what to measure.
Now suppose we introduce a third axis, $\vec{c}$, so that we can have, for example, particles of type $(\vec{a}+,\vec{b}+,\vec{c}-)$ corresponding to being aligned if measured on $\vec{a}$ and $\vec{b}$ and anti-aligned on $\vec{c}$. Further let us “count” the pairs that fall into the various groupings and label the populations as follows:
Let’s suppose that observer A finds particle 1 is aligned with $\vec{a}$, i.e. $\vec{a}+$, and that observer B finds particle 2 is aligned with $\vec{b}$, i.e. $\vec{b}+$. From the above table it is clear that the pair belong to either population 3 or 4. Note that because $N_{i}$ is positive semi-definite we must be able to construct relations like, for instance,
Now let $P(\vec{a}+;\vec{b}+)$ be the probability that, in a random selection, A finds particle 1 to be $\vec{a}+$ and B finds particle 2 to be $\vec{b}+$. In terms of populations, we have
Similarly we have
and
The positivity condition (3) then becomes
This is Wigner’s form of Bell’s inequality.
As we mentioned before, we have used purely classical reasoning to derive the two forms of Bell’s inequality that we have thusfar encountered. Recall that the context within which the above were derived was the Stern-Gerlach experiment are we are measuring along axes of the magnetic field. As such, there are angles between these various axes. Thus the quantum mechanically-derived probabilities corresponding to (4), (5), and (6) are
and
respectively. Bell’s inequality, (7), then becomes
From a geometric point of view, this inequality is not always possible. For example, suppose, for simplicity that $\vec{a}$, $\vec{b}$, and $\vec{c}$ lie in a plane and suppose that $\vec{c}$ bisects $\vec{a}$ and $\vec{b}$, i.e.
Then (8) is violated for $0 \lt \theta \lt \frac{\pi}{2}$. For example, if $\theta = \frac{\pi}{4}$, (8) would become $0.500 \le 0.292$ which is absurd!
A transparent and compact way to derive the actual inequality of Bell 1964 (adjusting the original argument only slightly for mathematical elegance) is reviewed in Khrennikov 2008, §10.1, which we broadly follow:
Given
a probability space $(\Lambda, d\rho)$ with
three random variables taking values in $\{\pm 1\}$ (regarded inside the real numbers):
then the correlation functions
satisfy this inequality:
(where $\left\vert-\right\vert$ denotes the absolute value)
Recall that the expectation value of a random variable $P \,\colon\, \Lambda \longrightarrow \mathbb{R}$ is given by its Lebesgue integral against the probability measure:
and that $d\rho$ being a probability measure implies the normalization
Moreover, the assumption (9) that the random variables $S_i$ take values in $\{\pm 1\}$ immediately implies for all $i,j \,in\, \{1,2,3\}$ that
Together this implies – by repeatedly using the Cauchy-Schwarz inequality – the bounds:
and thus, in particular:
for any random variable $P \,\colon\, \Lambda \to \mathbb{R}$.
Using these (evident) ingredients, we directly compute as follows
This is the inequality (11).
Other theorems about the foundations and interpretation of quantum mechanics include:
The original article:
Review:
John F. Clauser, Abner Shimony, Bell’s theorem. Experimental tests and implications, Rep. Prog. Phys. 41 (1978) 1881 [doi:10.1088/0034-4885/41/12/002]
Greg Kuperberg, section 1.6.2 of: A concise introduction to quantum probability, quantum mechanics, and quantum computation (2005) [pdf, pdf]
Valter Moretti, Thm. 4.49 of: Fundamental Mathematical Structures of Quantum Theory, Springer (2019) [doi:10.1007/978-3-030-18346-2]
and on a background of quantum logic:
Further on experimental verification:
Relation to the Kochen-Specker theorem:
See also:
Wikipedia, Bell’s theorem
Wikipedia, Bell test
Wikipedia, Leggett-Garg inequality
Stanford Encyclopedia of Philosophy, Bell’s theorem (url)
In relation to the Grothendieck inequality:
Boris S. Tsirelson, Quantum analogues of the Bell inequalities. The case of two spatially separated domains, Journal of Soviet Mathematics 36 (1987) 557–570 [doi:10.1007/BF01663472]
Boris S. Tsirelson, Some results and problems on quantum Bell-type inequalities Hadronic Journal Supplement 8 4 (1993) 329-345 [pdf, pdf web]
(but see the erratum here)
Wikipedia, Tsirelson’s bound
In the generality of quantum field theory:
On Bell inequalities in particle physics and possible relation to the weak gravity conjecture:
On BRST invariant Bell inequality in gauge field theory:
Identification of Bell’s inequalities with much older inequalities in classical probability theory, due to George Boole‘s The Laws of Thought, was pointed out by (among others, called the “probabilistic opposition” in Khrennikov 2007, p. 3) by:
Itamar Pitowsky, From George Boole To John Bell — The Origins of Bell’s Inequality, in: Bell’s Theorem, Quantum Theory and Conceptions of the Universe, Fundamental Theories of Physics 37 Springer (1989) [doi:10.1007/978-94-017-0849-4_6]
Itamar Pitowsky, Quantum Probability – Quantum Logic, Lecture Notes in Physics 321, Springer (1989) [doi:10.1007/BFb0021186]
Luigi Accardi, The Probabilistic Roots of the Quantum Mechanical Paradoxes, in: The Wave-Particle Dualism, Fundamental Theories of Physics 3 Springer (1984) [doi:10.1007/978-94-009-6286-6_16]
reviewed in:
Elemer E Rosinger, George Boole and the Bell inequalities [arXiv:quant-ph/0406004]
Andrei Khrennikov, Bell’s inequality: Physics meets Probability [arXiv:0709.3909]
Andrei Khrennikov, Bell-Boole Inequality: Nonlocality or Probabilistic Incompatibility of Random Variables?, Entropy 10 2 (2008) 19-32 [doi:10.3390/entropy-e10020019]
Last revised on August 3, 2023 at 12:47:12. See the history of this page for a list of all contributions to it.