Skip to main content
Royal Society Open Science logoLink to Royal Society Open Science
. 2018 May 30;5(5):180526. doi: 10.1098/rsos.180526

Quantum correlations are weaved by the spinors of the Euclidean primitives

Joy Christian 1,
PMCID: PMC5990804  PMID: 29893385

Abstract

The exceptional Lie group E8 plays a prominent role in both mathematics and theoretical physics. It is the largest symmetry group associated with the most general possible normed division algebra, namely, that of the non-associative real octonions, which—thanks to their non-associativity—form the only possible closed set of spinors (or rotors) that can parallelize the 7-sphere. By contrast, here we show how a similar 7-sphere also arises naturally from the algebraic interplay of the graded Euclidean primitives, such as points, lines, planes and volumes, which characterize the three-dimensional conformal geometry of the ambient physical space, set within its eight-dimensional Clifford-algebraic representation. Remarkably, the resulting algebra remains associative, and allows us to understand the origins and strengths of all quantum correlations locally, in terms of the geometry of the compactified physical space, namely, that of a quaternionic 3-sphere, S3, with S7 being its algebraic representation space. Every quantum correlation can thus be understood as a correlation among a set of points of this S7, computed using manifestly local spinors within S3, thereby extending the stringent bounds of ±2 set by Bell inequalities to the bounds of ±22 on the strengths of all possible strong correlations, in the same quantitatively precise manner as that predicted within quantum mechanics. The resulting geometrical framework thus overcomes Bell’s theorem by producing a strictly deterministic and realistic framework that allows a locally causal understanding of all quantum correlations, without requiring either remote contextuality or backward causation. We demonstrate this by first proving a general theorem concerning the geometrical origins of the correlations predicted by arbitrarily entangled quantum states, and then reproducing the correlations predicted by the EPR-Bohm and the GHZ states. The raison d’être of strong correlations turns out to be the Möbius-like twists in the Hopf bundles of S3 and S7.

Keywords: quantum correlations, local causality, Bell’s theorem, spinors, quaternions, octonions

1. Introduction

The central source of intrinsic coherence, geometrical elegance and empirical success of Einstein’s theory of gravity is undoubtedly its strict adherence to local causality [1,2]. Indeed, despite the phenomenal empirical success of Newton’s theory of gravity for over two centuries [3], its founding on the unexplained ‘action at a distance’ was a reason enough for Einstein to search for its locally causal generalization. Today, we face a similar challenge in search of a theory that may unify quantum theory with Einstein’s theory of gravity. But in sharp contrast to Einstein’s theory, quantum theory seems to harbour a peculiar form of non-signalling non-locality, as noticed long ago by Einstein et al. [4]. They hoped, however, that quantum theory can be completed into a locally causal theory with addition of ‘hidden’ parameters or supplementary variables. Today such a hope of completing quantum theory into a realistic and local theory envisaged by Einstein is widely believed to have been dashed by Bell’s theorem [5,6], its variants [7] and the related experimental investigations [813]. Indeed, the claim of Bell’s theorem is remarkably comprehensive in scope: no physical theory which is local and realistic as hoped for by Einstein can reproduce all of the strong correlations predicted by quantum mechanics [14].

By contrast, our primary concern in this paper is not Bell’s theorem, but understanding the origins and strengths of all quantum correlations in terms of the algebraic, geometrical and topological properties of the physical space in which we are confined to perform our experiments. In our view, Bell’s theorem is a distraction that prevents us from understanding the true origins of quantum correlations, especially because it is neither a theorem in the strict mathematical sense, nor a result within quantum theory itself. Indeed, not a single concept from quantum theory is used in the derivation of the Bell-CHSH inequalities [5,6,15]. It is, in fact, an argument that depends on a number of physical assumptions about what is and what is not possible within any locally causal theory, and these assumptions can be and have been questioned before [16,17]. Consequently, by circumventing Bell’s argument, in this paper we set out to explain the origins and strengths of all quantum correlations within a locally causal framework of octonion-like spinors, which are constructed using a geometric algebra [18,19] of rudimentary Euclidean primitives, such as points, lines, planes and volumes. This is accomplished by recognizing and overcoming two neglected shortcomings of Bell’s argument [5,6,14,15,20]. The first of these shortcomings, which is discussed in greater detail in §4.2, amounts to averaging over measurement events in the derivation of the experimentally violated absolute bound of 2 on the CHSH string of expectation values that are impossible to occur in any possible world, classical or quantum, stemming from a mistaken application of the criterion of reality propounded by EPR [4,7]. The second shortcoming of Bell’s argument stems from the unjustified identification of the image {+1,−1} of the measurement functions, which represents the actual measurement results in Bell’s prescription [5,6], with the co-domain of these functions, which is neither specified by Bell explicitly nor observable directly in the so-called Bell-test experiments [814]. An explicit specification of the latter, however, is a prerequisite for the very definition of a mathematical function [16]. By contrast, in our prescription (3.1) of measurement results discussed in §3.1, the locally unobservable co-domain S7 of the measurement functions is explicitly specified with considerable detail, without compromising Bell’s pristine bivalued prescription, ±1, for the actually observed measurement results. It incorporates the Clifford algebraic properties of the physical space in which all such experiments are necessarily situated and performed [813,20].

As noted above, however, our primary focus in this paper is not on Bell’s theorem, but on understanding the origins and strengths of all quantum correlations as a consequence of the geometry and topology of the physical space (or more generally of space–time). As quantum correlations are necessarily observed within the confines of space–time, it is natural to view them as correlations among measurement events in space–time—i.e. among the ‘clicks’ of a set of detectors configured within space–time. On the other hand, what is actually recorded in the Bell-test experiments are coincidence counts among bivalued measurement results, observed simultaneously within space at a given time [813]. Therefore, without loss of generality, we will restrict our analysis to the physical space. With that in mind, in the next section we extensively review the algebraic properties of the compactified physical space, captured in definition (2.16), which is a quaternionic 3-sphere, and construct its algebraic representation space (2.60), which is an octonion-like 7-sphere. As such parallelizable 3- and 7-spheres play a vital role (see footnote 3) in our local-realistic framework, we have devoted a brief appendix (appendix A) to discuss their wider significance in physics and mathematics at a pedagogical level [16]. Our central theorem concerning the origins of all quantum correlations is then stated and proved in §3.2.

The proof presented in §3.2 includes a local-realistic derivation of the simplest yet emblematic quantum correlations—namely, those predicted by the rotationally invariant singlet or EPR-Bohm state—the strengths of which are well known to violate the theoretical bounds of ±2 set by the Bell-CHSH inequalities, in the Bell-test experiments [813]. Then, in §3.3, we derive the closely related Tsirel’son’s bounds of ±22 on the strengths of all quantum correlations within our framework. In the subsequent §3.4, we then explain the geometrical reasons for the fragility of quantum correlations as a counterpart of that of quantum entanglement. This brings us to §3.5 in which we derive the strong correlations predicted by the rotationally non-invariant 4-particle GHSZ state [7], together with the proof of Bell’s condition of factorizability within S7 in appendix B (a similar proof also goes through within S3).

In §4.1, we then point out that the predictions of our local-realistic S7 framework is not in conflict with what is actually observed in the Bell-test experiments [813], as they simply reproduce the predictions of quantum mechanics [57]. In the subsequent §4.2, in the light of the widespread belief in Bell’s theorem, we reveal a serious oversight in Bell’s argument in some detail, independently of the constructive counterexamples provided by our S7 model for the strong correlations. This brings us to §4.3 in which we present an analytical disproof of the GHZ variant of Bell’s theorem, which does not involve Boole-type mathematical inequalities (see footnote 5) used by Bell in his original argument. In §5, we then present event-by-event numerical simulations of the 2-particle EPR-Bohm and 4-particle GHSZ correlations predicted by our local-realistic framework based on S7. Finally, in §6, we summarize our findings.

2. Modern perspective on the Euclidean primitives

In physical experiments—which are usually confined to the three-dimensional physical space by necessity—we often measure relevant quantities by setting up a Cartesian coordinate system {x,y,z} in that space. Mathematically, this is equivalent to identifying the Euclidean space E3 with a threefold product of the real line, IR3. In practice, we sometimes even think of IR3 as the Euclidean space. Euclid himself, however, did not think of E3 in terms of such a Cartesian triple of real numbers. He defined a representation of E3 axiomatically, in terms of primitive geometric objects such as points and lines, together with a list of their properties, from which his theorems of geometry follow.

It is, however, not always convenient to model the physical space in the spirit of Euclid. Therefore, in practice, we tend to identify E3 with IR3 whenever possible. But there is no intrinsic way of identifying the two spaces in this manner without introducing an unphysical element of arbitrarily chosen coordinate system. This difficulty is relevant for understanding the origins of quantum correlations, for time and again we have learned that careless introduction of unphysical ideas in physics could lead to distorted views of the physical reality [16,21]. An intrinsic, coordinate-free representation of the Euclidean space is surely preferable, if what is at stake is the very nature of the physical reality (cf. §4.2).

Fortunately, precisely such a representation of E3 was proposed by Grassmann in 1844 [19]. In the Euclidean spirit, the basic elements of this powerful algebraic representation of E3 are not coordinate systems, but points, lines, planes and volumes, all treated on equal footing. Given a set {ex,ey,ez} of basis vectors representing lines in E3, the algebra of corresponding vector space is constructed as follows. One begins by defining a volume element (or a trivector) in E3:

I3:=exeyez, 2.1

with {ei} being a set of anti-commuting orthonormal vectors in IR3 such that ejei=−eiej for any i,j=x,y, or z. More generally, the unit vectors ei satisfy the fundamental geometric or Clifford product in this (by definition) associative algebra,

eiej=eiej+eiej, 2.2

with

eiej:=12{eiej+ejei}, 2.3

being the symmetric inner product and

eiej:=12{eiejejei}, 2.4

being the anti-symmetric outer product, implying (eiej)2=−1. Any vector xE3 is then a solution of the equation

I3x=0. 2.5

The normalized volume element I3 thus represents an element of the highest grade in the corresponding algebra, namely grade-3. It is also referred to as a pseudo-scalar, dual to the scalar, which in turn is the lowest possible grade in the algebra:

1=I3I3, 2.6

where the conjugate I3:=ezeyex=I3 is the ‘reverse’ of I3 implying (I3)2=−1, and the duality relationship between the elements Ω~ and Ω of arbitrary grades is defined as

Ω~:=ΩI3, 2.7

with the norm ∥ ∥ of Ω and scalar part 〈 〉s of the product of mixed-grade vectors X and Y of n-components defined as

Ω:=ΩΩΩΩsandXYs=l=0nXlYls. 2.8

Thus, for example, the orthonormal vectors ek of grade-1 can be easily recovered from the unit bivectors eiej of grade-2 using the above duality relation:

ek=(eiej)I3=(eiej)I3. 2.9

In three-dimensional Euclidean space, there are thus basis elements of four different grades: an identity element ei2=1 of grade-0, three orthonormal vectors ei of grade-1, three orthonormal bivectors ejek of grade-2 and a trivector eiejek of grade-3. Respectively, they represent points, lines, planes and volumes in E3, as shown in figure 1. Since in IR3, there are 23=8 ways to combine the vectors ei using the geometric product (2.2) such that no two products are linearly dependent, the resulting algebra, Cl3,0, is a linear vector space of 23=8 dimensions, spanned by these graded bases:

Cl3,0=span{1,ex,ey,ez,exey,ezex,eyez,exeyez}. 2.10

This algebra intrinsically characterizes the Euclidean space E3 without requiring a coordinate system, by the bijection

F:IR3:=span{ex,ey,ez}IR8:=span{1,ex,ey,ez,exey,ezex,eyez,exeyez}=Cl3,0. 2.11

Figure 1.

Figure 1.

Euclidean subspaces such as points (p), lines (a), areas (ab) and volumes (abc) are taken in Clifford algebra Cl3,0 as primitive entities constituting the Euclidean space E3, with each subspace specified only by its magnitude, direction and orientation (or handedness), providing a unified and complete algebraic framework of directed numbers across dimensions, spanned by the geometric product ab=ab+ab, reminiscent of a complex number.

2.1. One-point compactification of the three-dimensional Euclidean Space

The physical space represented by the above algebraic model is, however, not quite satisfactory. Stemming from an arbitrarily chosen origin, its points run off to infinity along every radial direction [19]. Moreover, there is no reason for these infinitely many infinities—which can be approached from infinitely many possible different directions—to be distinct from one another. It is therefore natural to assume that one and the same infinity is encountered along any radial direction, and identify it with a single point. One way to achieve this is by compactifying the space E3 by adding a single point to it at infinity. This well-known procedure of one-point compactification is illustrated in figure 2.

Figure 2.

Figure 2.

One-point compactification of the Euclidean space E3 by means of a stereographic projection onto S3R4.

Intuitively, this procedure is not difficult to understand with a two-dimensional analogue of E3. Imagine a stretchable balloon, which is topologically a two-dimensional surface, S2 (cf. figure 3). If we surgically remove a single point from this surface and stretch the remainder out to infinity in every radial direction (like an infinite bed-sheet), then it provides an intuitive model for the two-dimensional Euclidean space, E2. The one-point, or Alexandroff compactification of E2 is an inverse of this process, whereby all points at infinity from all possible radial directions in E2 are brought together again and identified with the previously removed point, thereby reconstructing the S2-balloon from an E2-bed-sheet.

Figure 3.

Figure 3.

Stereographic projection of S2 onto the plane of R2. Both S2 and R2 contain infinite number of points. Each point p of S2 is mapped to a point p′ of R2, except North pole, which has no meaningful finite image under this projection.

Similarly, figure 2 depicts an inverse stereographic projection of E3 onto a unit 3-sphere, S3, by the embedding map ϕ:E3S3, which is given by

ϕ(xE3)=(2x2+1)x+(2x2x2+1)x^4, 2.12

where two of the dimensions of E3 are suppressed in the figure, and x^4 represents the fourth dimension in the embedding space IR4. The crucial observation here is that, as an arbitrary vector xE3 from the origin approaches infinity, it is mapped to the same point e located at (0,2), thereby closing the non-compact space E3 into the compact 3-sphere. By shifting the origin to (0,1), the above set of points can be inscribed by a radial 4-vector originating from (0,1) as

ψ(xE3)=(2x2+1)x+(2x2x2+11)x^4. 2.13

The magnitude of this vector then confirms the unity of the radius of our conformally embedded 3-sphere within IR4:

1=ψ(xE3)=radius ofS3IR4. 2.14

The embedding operator ϕ(x) [or ψ(x)] thus transforms the entire space E3 into a unit 3-sphere within IR4, thereby accomplishing a one-point compactification of E3:

S3=E3{e}. 2.15

Such a conformal mapping is angle-preserving in the sense that a small angle between two curves on S3 projects to the same angle between the projected curves on E3, with a circle of any size on S3 projecting to an exact circle on E3.

Now the tangent bundle of S3 happens to be trivial: TS3=S3×IR3. This renders the tangent space at each point of S3 to be isomorphic to IR3. Consequently, local experiences of the experimenters within S3 are no different from those of their counterparts within E3. The global topology of S3, however, is clearly different from that of IR3 [16,17]. In particular, the triviality of the bundle TS3 means that S3 is parallelizable. As a result, a global anholonomic frame can be defined on S3 that fixes each of its points uniquely. Such a frame renders S3 diffeomorphic to the group SU(2)—i.e. to the set of all unit quaternions:

S3:={q(θ,r):=cosθ2+ξ(r)sinθ2|q(θ,r)=1}, 2.16

where ξ(r) is a bivector rotating about r∈IR3 with the rotation angle θ in the range 0≤θ<4π. In terms of the even sub-algebra of (2.10), the bivector ξ(r)∈S3 can be parametrized by the dual vector r=rxex+ryey+rzez∈IR3 as

ξ(r):=(I3r)=rx(I3ex)+ry(I3ey)+rz(I3ez)=rxeyez+ryezex+rzexey, 2.17

with ξ2(r)=−1. Each configuration of any rotating rigid body can thus be represented by a quaternion q(θ,r), which in turn can always be decomposed into a product of two bivectors, say ξ(u) and ξ(v), belonging to an S2S3,

ξ(u)ξ(v)=cosθ2+ξ(r)sinθ2, 2.18

in accordance with the bivector subalgebra [18]

ξaξb=δabc=13ϵabcξc, 2.19

with θ being its rotation angle from q(0,r)=1. Note also that q(θ,r) reduces to ±1 as θ2κπ for κ=0,1 or 2.

2.2. Conformal completion of the Euclidean primitives

Our interest now lies in the point e which represents the multitude of infinities of E3. Within three dimensions, we continue to view it as a dimensionless point and take its algebraic counterpart to be a non-zero vector of zero norm:

e0,bute2=ee=0e2=0. 2.20

Such a vector that is orthogonal to itself is called a null vector in Conformal Geometric Algebra1 [18]. It is introduced to represent both finite points in space as well as points at infinity [19]. As points thus defined are null-dimensional or dimensionless, addition of e into the algebraic structure of E3 does not alter the latter’s dimensions but only its point-set topology, rendering it diffeomorphic to a closed, compact, simply connected 3-sphere, as we discussed above.

Equipped with e, we are now ready to rebuild the compactified Euclidean space and its algebraic representation as follows. We begin by identifying the set {exey,ezex,eyez} of bivectors as the orthonormal basis of the space E3:

E3=span{exey,ezex,eyez}. 2.21

Using the orthonormality and anti-commutativity of the vectors ei, the product of the basis bivectors works out to be

(exey)(ezex)(eyez)=exeyezexeyez=1. 2.22

The associativity of geometric product then allows us to rediscover the volume form I3 for the Euclidean space (2.21):

(exey)(ezex)(eyez)=(exeyez)(exeyez)=(exeyez)2=:(I3)2=1. 2.23

As it stands, this volume form is open and has the topology of IR3. But we can now close it with the null vector e:

Ic:=I3e=exeyeze, 2.24

where we have used the subscript c on Ic to indicate that it is a volume element of the compact 3-sphere, S3. As we noted earlier, in the Euclidean space the reverse of I3 is I3=I3. Likewise in the conformal space the reverse of Ic is

Ic=I3e=I3e=Ic. 2.25

As a result, in the conformal space the general duality operation between elements Ω~ and Ω of any grade is given by

Ω~c:=ΩIc=ΩI3e. 2.26

This allows us, in particular, to work out the dual elements of all of the basis bivectors in (2.21) in the conformal space:

exeyI3e=exeyezeyexe=eze, 2.27
ezexI3e=ezexezeyexe=eye 2.28
andeyezI3e=eyezezeyexe=exe. 2.29

Moreover, analogous to how the dual of +1 in E3 is −I3, the dual of +1 in the conformal space also works out to be

(+1)Ic=I3e=Ic. 2.30

We have thus worked out the conformal counterparts of all of the basis elements appearing in the algebraic vector space (2.10). Putting them together we can now formalize the desired algebraic representation of our conformal space as

K=span{1,exey,ezex,eyez,exe,eye,eze,I3e}. 2.31

Evidently, not unlike (2.10), this vector space too is eight-dimensional. Unlike (2.10), however, it is closed and compact. The three-dimensional physical space—i.e. the compact 3-sphere we discussed above—can now be viewed as embedded in the four-dimensional ambient space, IR4, as depicted in figure 2. In this higher dimensional space, e is then a unit vector,

e2=ee=1e2=1, 2.32

and the corresponding algebraic representation space (2.31) is nothing but the eight-dimensional even sub-algebra of the 24=16-dimensional Clifford algebra Cl4,0. Thus, a one-dimensional subspace—represented by the unit vector e in the ambient space IR4—represents a null-dimensional space—i.e. the infinite point of E3—in the physical space S3.

2.3. Orientation of representation space as a binary degree of freedom

Before we explore the properties of the above vector space, let us endow it with one more degree of freedom without which it is unjustifiably restrictive. To that end, we first recall what is meant by an orientation of a vector space [22]:

Definition of orientation: An orientation of a finite-dimensional vector space Vn is an equivalence class of ordered basis, say {b1,…,bn}, which determines the same orientation of Vn as the basis {b1,…,bn} if bi=ωijbj holds with det(ωij)>0, and the opposite orientation of Vn as the basis {b1,…,bn} if bi=ωijbj holds with det(ωij)<0.

Thus, each positive-dimensional real vector space has precisely two possible orientations, which we will denote as λ=+1 or λ=−1. More generally an oriented smooth manifold consists of that manifold together with a choice of orientation for each of its tangent spaces. It is worth noting that orientation is a relative concept. The orientation of a tangent space Vn of a manifold defined by the equivalence class of ordered basis such as {b1,…,bn} is meaningful only with respect to that defined by the equivalence class of ordered basis {b1,…,bn}, and vice versa.

Now in geometric algebra the choice of the sign of the unit pseudoscalar amounts to choosing an orientation of the space [18,19]. In our three-dimensional Euclidean space defined in (2.21) with an orthonormal set of unit bivector basis, I3=exeyez picks out the right-handed orientation for E3. The convention usually is to assume such a right-handed set of basis bivectors (or vectors) ab initio. But the algebra itself does not fix the handedness of the basis. In our presentation above, we could have equally well started out with a left-handed set of bivectors in (2.21) by letting −I3 instead of +I3 select the basis. Instead of the representation space (2.31), we would have then ended up with the space

K+=span{1,+exey,+ezex,+eyez,+exe,+eye,+eze,+I3e}. 2.33

On the other hand, in the light of the above definition of orientation, the representation space (2.31) can be written as

K=span{1,exey,ezex,eyez,exe,eye,eze,I3e}. 2.34

It is easy to verify that the bases of K+ and K are indeed related by an 8×8 diagonal matrix whose determinant is (−1)7<0. Consequently, K+ and K indeed represent right- and left-oriented vector spaces, respectively, in accordance with our definition of orientation. We can therefore leave the orientation unspecified and write K± as

Kλ=span{1,λexey,λezex,λeyez,λexe,λeye,λeze,λI3e},λ2=1λ=±1. 2.35

2.4. Representation space Kλ remains closed under multiplication

As an eight-dimensional linear vector space, Kλ has some remarkable properties. To begin with, Kλ is closed under multiplication. Suppose X and Y are two unit vectors in Kλ. Then X and Y can be expanded in the basis of Kλ as

X=X0+X1λexey+X2λezex+X3λeyez+X4λexe+X5λeye+X6λeze+X7λI3e 2.36

and

Y=Y0+Y1λexey+Y2λezex+Y3λeyez+Y4λexe+Y5λeye+Y6λeze+Y7λI3e, 2.37

and using (2.8) they can be normalized as

X2=μ=07Xμ2=1andY2=ν=07Yν2=1. 2.38

Now it is evident from the multiplication table above (table 1) that if X,YKλ, then so is their product Z=XY:

Z=Z0+Z1λexey+Z2λezex+Z3λeyez+Z4λexe+Z5λeye+Z6λeze+Z7λI3e=XY. 2.39

More importantly, we shall soon see that for vectors X and Y in Kλ (not necessarily unit) the following relation holds:

XY=XY. 2.40

In particular, this means that for any two unit vectors X and Y in Kλ with the geometric product Z=XY we have

Z2=ρ=07Zρ2=1. 2.41

One of the important observations here is that, without loss of generality, we can restrict our representation space to a set of unit vectors in Kλ. We are then dealing with a unit 7-sphere as an algebraic representation of the compactified physical space (2.16). If, for convenience, we now identify the basis elements of Kλ (in order) with the ordered elements of the following set:

{ζ0,ζ1,ζ2,ζ3,ζ4,ζ5,ζ6,ζ7}, 2.42

then the algebra generated by them—which has been explicitly displayed in table 1—can be succinctly rewritten as

ζμζν={1}δμ7{δμν}+λρ=17[fμνρ+{1}δρ7lμνρ]ζρ,μ,ν=1,2,,7, 2.43

where fμνρ is a totally anti-symmetric permutation tensor with only non-vanishing independent components being

f123=f246=f365=f415=+1, 2.44

and similarly lμνρ is a totally symmetric permutation tensor with only non-vanishing independent components being

l176=l257=l347=1. 2.45

The eight-dimensional multi-vectors X and Y within Kλ can now be expanded more conveniently in the basis (2.42) as

X=μ=07XμζμandY=ν=07Yνζν. 2.46

Table 1.

Multiplication table for a ‘Conformal Geometric Algebra’ (see footnote 1) of E3. Here, I3=exeyez, e2=+1 and λ=±1.

graphic file with name rsos180526-i1.jpg

2.5. Representation space as a set of orthogonal pairs of quaternions

In his seminal works, Clifford introduced the concept of dual numbers, z, analogous to complex numbers, as follows:

z=r+dε,whereε0butε2=0. 2.47

Here ε is the dual operator, r is the real part and d is the dual part [19]. Similar to how the ‘imaginary’ operator i is introduced in the complex number theory to distinguish the ‘real’ and ‘imaginary’ parts of a complex number, Clifford introduced the dual operator ε to distinguish the ‘real’ and ‘dual’ parts of a dual number. The dual number theory can be extended to numbers of higher grades, including to numbers of composite grades, such as quaternions:

Qz=qr+qdε, 2.48

where qr and qd are quaternions and Qz is a dual-quaternion (or in Clifford’s terminology, Qz is a bi-quaternion). Recall that, as defined in (2.16), the set of all quaternions is a 3-sphere, which can be normalized to radius ϱ and rewritten as

S3={qr:=q0+q1λexey+q2λezex+q3λeyez|qr=ϱ}. 2.49

Consider now a second, dual copy of the set of quaternions within Kλ, corresponding to the fixed orientation λ=+1:

S3={qd:=q7+q6exey+q5ezex+q4eyez|qd=ϱ}. 2.50

If we now identify ε with the duality operator Ic=λI3e used in (2.26), then (in the reverse additive order) we have

ελI3ewithε2=+1(sinceeis a unit vector withinKλ) 2.51

and

qdεqdλI3e=q4λexe+q5λeye+q6λeze+q7λI3e, 2.52

which is a multi-vector ‘dual’ to the quaternion qd at infinity. Note that we continue to write ε as if it were a scalar because it commutes with qd. Comparing (2.50) and (2.52) with (2.35) we can now rewrite Kλ as a set of paired quaternions (figure 4):

Kλ={Qz:=qr+qdε|Qz=2ϱ}. 2.53

Now the normalization of Qz in fact necessitates that every qr be orthogonal to its dual qd:

Qz=2ϱqrqd+qdqr=0, 2.54

or equivalently, qrqds=0 (i.e. qrqd is a pure quaternion). We can see this by working out the product of Qz with Qz while using ε2=+1, which gives

QzQz=(qrqr+qdqd)+(qrqd+qdqr)ε. 2.55

Now, using the definition of q in (2.16), it is not difficult to see that qrqr=qdqd=ϱ2, reducing the above product to

QzQz=2ϱ2+(qrqd+qdqr)ε. 2.56

It is thus clear that for QzQz to be a scalar qrqd+qdqr must vanish, or equivalently qr must be orthogonal to qd.

Figure 4.

Figure 4.

An illustration of the eight-dimensional plane of Kλ, which may be interpreted as an Argand diagram for a pair of quaternions.

But there is more to the normalization condition qrqd+qdqr=0 then meets the eye. It also leads to the crucial norm relation (2.40), which is at the very heart of the only possible four normed division algebras associated with the four parallelizable spheres S0, S1, S3 and S7 (cf. appendix A). To verify it, consider a product of two different members of the set Kλ,

Qz1Qz2=(qr1qr2+qd1qd2)+(qr1qd2+qd1qr2)ε, 2.57

together with their individual definitions

Qz1=qr1+qd1εandQz2=qr2+qd2ε. 2.58

If we now work out the products Qz1Qz1, Qz2Qz2 and (Qz1Qz2)(Qz1Qz2), then, thanks to the orthogonality condition qrqd+qdqr=0, the norm relation is not difficult to verify:

Qz1Qz2=Qz1Qz2. 2.59

Without loss of generality, we can now restrict our algebraic representation space Kλ to a unit 7-sphere by setting the radius ϱ of S3 to 1/2. In what follows S7 will provide the conformal (see footnote 1) counterpart of the algebra Cl3,0 given in (2.10):

KλS7:={Qz:=qr+qdεQz=1andqrqd+qdqr=0}, 2.60

where ε=λI3e, ε2=e2=+1,

qr=q0+q1λexey+q2λezex+q3λeyezandqd=q7+q6exey+q5ezex+q4eyez, 2.61

so that

Qz=q0+q1λexey+q2λezex+q3λeyez+q4λexe+q5λeye+q6λeze+q7λI3e. 2.62

Needless to say, since all Clifford algebras are associative division algebras by definition, unlike the non-associative octonionic algebra, the 7-sphere we have constructed here corresponds to an associative (but of course non-commutative) algebra.

Thus, to summarize this section, we started out with the observation that the correct model of the physical space is provided by the algebra of Euclidean primitives, such as points, lines, planes and volumes, as discovered by Grassmann and Clifford in the nineteenth century. We then recognized the need to ‘close’ the Euclidean space with a non-zero null vector e representing its infinities, thereby compactifying E3 to a 3-sphere, S3. The corresponding algebraic representation space of E3 then turned out to be a unit 7-sphere, S7. It is quite remarkable that S3 and S7, which are the two spheres associated with the only two non-trivially possible normed division algebras, namely the quaternionic and octonionic algebras [16,2329], emerge in this manner from the elementary algebraic properties of the Euclidean primitives (cf. appendix A). Unlike the non-associative octonionic algebra and the exceptional Lie groups such as E8 it gives rise to, however, the compact 7-sphere we have arrived at corresponds to an associative Clifford (or geometric) algebra [19,21], as noted above. And yet, as we shall soon see, it is sufficient to explain the origins of all quantum correlations. It remains to be seen what role, if any, the exceptional groups such as G2 and E8 may eventually play when the current framework is developed further.

3. Derivation of quantum correlations from Euclidean primitives

3.1. Constructing measurement functions in the manner of Bell

In order to derive quantum correlations predicted by arbitrary quantum states, our first task is to construct a set of measurement functions of the form:

±1=N(n,λ):IR3×ΛS7IR8. 3.1

These functions describe local detections of binary measurement results, N(n,λ)=±1, by some analysers fixed along freely chosen directions represented by the vectors n. They are of the same realistic and deterministic form as that considered by Bell2 [57], except for their locally unobservable co-domain, which we have taken to be the algebraic representation space S7 constructed above, embedded in IR8. For an explicit construction of the functions N(n,λ), let us consider the following multi-vector in IR8 analogous to (2.62):

Nz=n0+{n1λexey+n2λezex+n3λeyez}+{n4λexe+n5λeye+n6λeze}+n7λI3e 3.2
n0+λξ(nr)+λξ(nd)ε+λn7ε+ 3.3
n0+λD(nr,nd,n7), 3.4

where

D(nr,nd,n7):=ξ(nr)+ξ(nd)ε+n7ε+, 3.5
S3bivectorξ(nr):=n1exey+n2ezex+n3eyezI3nr, 3.6
S3bivectorξ(nd):=n6exey+n5ezex+n4eyezI3nd, 3.7
pseudoscalarε+:=I3e, 3.8
3D vectornr:=n3ex+n2ey+n1ez,withnr=n12+n22+n32=12 3.9
and3D vectornd:=n4ex+n5ey+n6ez,withnd=n42+n52+n62=12. 3.10

Next, consider the non-scalar part N(nr,nd,−n7,λ) of the above S7-vector,

Nz=n0+N(nr,nd,n7,λ), 3.4

so that

N(nr,nd,n7,λ)=λD(nr,nd,n7)D(nr,nd,n7)=λN(nr,nd,n7,λ), 3.11

since λ2=1. For our purposes, it will suffice to represent the detectors with the special case of this non-scalar part for which n7≡0:

N(nr,nd,0,λ)=λD(nr,nd,0)D(nr,nd,0)=λN(nr,nd,0,λ), 3.12

where

D(nr,nd,0):=ξ(nr)+ξ(nd)ε+=(I3nr)+(I3nd)ε+=I3{nr+ndε+}. 3.13

Next recall that, although global topology of S3 is different from that of IR3, local experiences of experimenters within S3 are no different from those of their counterparts within IR3, not the least because the tangent space at any point of S3 is isomorphic to IR3. With this in mind, we identify the counterparts of measurement directions n within E3 with the dual vectors nr+ndε+ within its algebraic representation space S7. Then n relates to D(nr,nd,0) as

S3S2ξ(n)=I3nD(nr,nd,0)S5S7. 3.14

This allows us to identify the anti-symmetric part D(nr,nd,0) in (3.2) as a detector of the physical system represented by N(nr,nd,0,λ), originating in the initial state λ and producing the measurement results N(n,λ)=±1 along freely chosen unit directions nnr+ndε+ within IR3. Indeed, using the definitions (3.4)–(3.14), it is easy to verify that

N2(nr,nd,0,λ)=λ2D2(nr,nd,0)=D2(nr,nd,0)=1. 3.15

In general, for two vectors a and b the geometric product N(ar,ad,0,λ)N(br,bd,0,λ) is highly non-trivial, as we saw in (2.43):

N(ar,ad,0,λ)N(br,bd,0,λ)=arbradbdN(ar×br+ad×bd,ar×bd+ad×br,arbd+adbr,λ). 3.16

Unlike the general case, however, since we wish to identify the external vectors aar+adε+ and bbr+bdε+ with the measurement directions within E3, the following constraints induced by their scalar product naturally hold:

ab:=12{ab+ba}=(arbr+adbd)+(arbd+adbr)ε+{arbd=adbr=0andarbr=adbd=12cosθab, 3.17

which are consistent with ab=1 for the special case a=b and the normalization conditions for ar and bd, giving

N(ar,ad,0,λ)N(br,bd,0,λ)=arbradbdN(ar×br+ad×bd,ar×bd+ad×br,0,λ). 3.18

Labelling the experimental trials with index k, we can now define the measurement functions (3.1) as maps of the form

S7±1=N(n,λk):IR3×{λk}S7IR8. 3.19

These maps can be realized for the freely chosen measurement directions, specified by the vectors such as a and b, as

S7A(a,λk):=limsr1arsd1ad{±D(ar,ad,0)N(sr1,sd1,0,λk)}={1ifλk=+1±1ifλk=1}together withA(a,λk)=0 3.20

and

S7B(b,λk):=limsr2brsd2bd{D(br,bd,0)N(sr2,sd2,0,λk)}={±1ifλk=+11ifλk=1}together withB(b,λk)=0. 3.21

Here we have assumed that orientation λ=±1 of S7 is a fair coin. Evidently, the functions A(a,λk) and B(b,λk) define local, realistic and deterministically determined measurement outcomes [17,30]. Apart from the common cause λk originating in the overlap of the backward lightcones of A(a,λk) and B(b,λk), the event A=±1 depends only on a freely chosen measurement direction a [5,6]. And likewise, apart from the common cause λk, the event B=±1 depends only on a freely chosen measurement direction b. In particular, the function A(a,λk) does not depend on either b or B, and the function B(b,λk) does not depend on either a or A. This leads us to the following remarkable theorem.

3.2. Quantum correlations from the algebra of Euclidean primitives

Theorem 3.1 —

Every quantum mechanical correlation can be understood as a classical, local, realistic and deterministic correlation among a set of points of S7 constructed above, represented by maps of the form defined in (3.20) and (3.21).

Proof. —

Recall that—as von Neumann recognized in his classic analysis [31]—regardless of the model of physics one is concerned with—whether it is the quantum mechanical model or a hidden variable model—it is sufficient to consider expectation values of the observables measured in possible states of the physical systems, since probabilities are but expectation values of the indicator random variables. Thus, probability P(E) of event E is expectation value E(MANIE),

P(E)=E(MANIE), 3.22

of the indicator random variable MANIE defined as

MANIE:={1ifEoccurs0otherwise. 3.23

Conversely, the expectation value of MANIE is

E(MANIE)=1×P(E)+0×{1P(E)}P(E)+{1P(E)}=P(E). 3.24

Thus, every statement involving probabilities can be translated into a statement involving expectation values, and vice versa. In what follows we shall, therefore, work exclusively with expectation values, because our primary goal here is to trace the origins of the quantum correlations to the algebraic and geometrical properties of the Euclidean primitives.

To that end, consider an arbitrary quantum state |ΨH of a system, where H is a Hilbert space of arbitrary dimensions—not necessarily finite. Apart from their usual quantum mechanical meanings, we impose no restrictions on either |Ψ〉 or H. In particular, the state |Ψ〉 can be as entangled as one may wish [16]. Next, consider a self-adjoint operator O^(n1,n2,n3,n4,n5,) on this Hilbert space, parametrized by arbitrary number of local contexts n1, n2,n3,n4,n5, etc. The quantum mechanically expected value of this observable in the state |Ψ〉 is then defined by

EQ.M.(n1,n2,n3,n4,n5,)=Ψ|O^(n1,n2,n3,n4,n5,)|Ψ. 3.25

More generally, if the system is in a mixed state, then its quantum mechanically expected value can be expressed as

EQ.M.(n1,n2,n3,n4,n5,)=Tr{W^O^(n1,n2,n3,n4,n5,)}, 3.26

where W^ is a statistical operator of unit trace representing the state of the system. Setting n1=aar+adε+, n2=bbr+bdε+, etc., the corresponding local-realistic expectation value for the same system can be written as

EL.R.(a,b,c,d,)=ΛA(a,λ)B(b,λ)C(c,λ)D(d,λ)ρ(λ)dλ, 3.27

where the binary measurement functions N(n,λk) are defined in equation (3.20) and the overall probability distribution ρ(λ),

withΛρ(λ)dλ=1for allλΛ, 3.28

is in general a continuous function of λ. Since in our framework λ=±1 is a fair coin, the above integral simplifies to

EL.R.(a,b,c,d,)=limm[1mk=1mA(a,λk)B(b,λk)C(c,λk)D(d,λk)]. 3.29

We shall soon prove, however, that—thanks to the definitions like (3.21)—this average is geometrically equivalent to

EL.R.(a,b,c,d,)=limm[1mk=1mN(ar,ad,0,λk)N(br,bd,0,λk)N(cr,cd,0,λk)N(dr,dd,0,λk)]. 3.30

Moreover, as we saw in §2.4 the representation space Kλ defined in (2.60), with or without the constraints in (3.18), remains closed under multiplication, the product appearing in the expectation (3.31) is equivalent to the product

N(xr,xd,0,λ)N(yr,yd,0,λ)=xryrxdydN(xr×yr+xd×yd,xr×yd+xd×yr,0,λ), 3.31

for some vectors x and y, depending in general on the measurement directions a, b, c, d, etc. Consequently, we have

EL.R.(a,b,c,d,)=limm[1mk=1mA(a,λk)B(b,λk)C(c,λk)D(d,λk)] 3.32
=limm[1mk=1mN(ar,ad,0,λk)N(br,bd,0,λk)N(cr,cd,0,λk)N(dr,dd,0,λk)] 3.33
=limm[1mk=1mN(xr,xd,0,λk)N(yr,yd,0,λk)] 3.34
=xryrxdydlimm[1mk=1mN(xr×yr+xd×yd,xr×yd+xd×yr,0,λk)] 3.35
=xryrxdydlimm[1mk=1mλk]D(xr×yr+xd×yd,xr×yd+xd×yr,0) 3.36
=cosθxy(a,b,c,d,)0, 3.37

because λk, as in (3.13), is a fair coin. Here, equation (3.37) follows from equation (3.36) by using equation (3.13), which now takes the form

N(xr×yr+xd×yd,xr×yd+xd×yr,0,λk)=λkD(xr×yr+xd×yd,xr×yd+xd×yr,0), 3.12

together with λk=±1. We can now identify the above local-realistic expectation with its quantum mechanical counterpart:

Ψ|O^(a,b,c,d,)|Ψ=EL.R.(a,b,c,d,)=cosθxy(a,b,c,d,). 3.38

This identification proves our main theorem: Every quantum mechanical correlation can be understood as a classical, local, deterministic and realistic correlation among a set of points of the representation space S7Kλ described above (cf. theorem 3.1).

It is instructive to evaluate the sum in equation (3.35) somewhat differently to bring out the fundamental role played by the orientation λk in the derivation of the strong correlations (3.38). Instead of assuming λk=±1 to be an orientation of S7 as our starting point, we may view it as specifying the ordering relation between N(xr,xd,0,λk=±1) and N(yr,yd,0,λk=±1) and the corresponding detectors D(xr,xd,0) and D(yr,yd,0) with 50/50 chance of occurring, and only subsequently identify it with the orientation of S7. Then, using relations (3.13) and (3.32), the sum in equation (3.35) can be evaluated directly by recognizing that in the right- and left-oriented S7 the following geometrical relations hold:

N(xr,xd,0,λk=+1)N(yr,yd,0,λk=+1)=xryrxdydN(xr×yr+xd×yd,xr×yd+xd×yr,0,λk=+1)=xryrxdydD(xr×yr+xd×yd,xr×yd+xd×yr,0)=D(xr,xd,0)D(yr,yd,0) 3.39

and

N(xr,xd,0,λk=1)N(yr,yd,0,λk=1)=xryrxdydN(xr×yr+xd×yd,xr×yd+xd×yr,0,λk=1)=xryrxdyd+D(xr×yr+xd×yd,xr×yd+xd×yr,0)=yrxrydxdD(yr×xr+yd×xd,yr×xd+yd×xr,0)=D(yr,yd,0)D(xr,xd,0). 3.40

Changes in the orientation λk thus alternates the algebraic order of N(xr,xd,0,λk=±1) and N(yr,yd,0,λk=±1) relative to the algebraic order of the detectors D(xr,xd,0) and D(yr,yd,0). Consequently, the sum (3.35) reduces to

EL.R.(a,b,c,d,)=limm[1mk=1mN(xr,xd,0,λk)N(yr,yd,0,λk)]=12{N(xr,xd,0,λk=+1)N(yr,yd,0,λk=+1)}+12{N(xr,xd,0,λk=1)N(yr,yd,0,λk=1)}=12{D(xr,xd,0)D(yr,yd,0)}+12{D(yr,yd,0)D(xr,xd,0)}=12{xryr+yrxr}12{xdyd+ydxd}=xryrxdyd=xy=cosθxy(a,b,c,d,), 3.41

because the orientation λk of S7 is a fair coin. Here xy=12{xy+yx} is the standard definition of the inner product.

Evidently, the above method of calculating suggests that a given initial state λ of the physical system can indeed be viewed as specifying an ordering relation between N(nr,nd,0,λ) and the detectors D(nr,nd,0) that measure it:

N(xr,xd,0,λk=+1)N(yr,yd,0,λk=+1)=D(xr,xd,0)D(yr,yd,0) 3.42

or

N(xr,xd,0,λk=1)N(yr,yd,0,λk=1)=D(yr,yd,0)D(xr,xd,0). 3.43

Then, using the right-hand sides of equations (3.41) and (3.42), the above pair can be reduced to the combined relation

N(xr×yr+xd×yd,xr×yd+xd×yr,0,λk)=λkD(xr×yr+xd×yd,xr×yd+xd×yr,0), 3.44

which is identical to relation (3.13) for normalized vectors. We have thus proved that the ordering relations (3.44) and (3.45) between N(nr,nd,0,λ) and D(nr,nd,0) are equivalent to the alternatively possible orientations of S7.

3.2.1. Special case of a two-level system entangled in the singlet state

Now, to complete the above proof of theorem 3.1, we must prove the step from equations (3.30) to (3.31). To that end, let us first consider observations of the spins of only two spin-12 particles produced in a decay of a single spinless particle as shown in figure 5. After the decay, the two emerging spin-12 particles move freely in opposite directions, subject to spin measurements along freely chosen unit directions a and b, which may be located at a space-like distance from one another [30]. As initially the emerging pair has zero net spin, its quantum mechanical state is described by the entangled singlet state

|Ψz=12{|z,+1|z,2|z,1|z,+2}, 3.45

with σz|z,±〉=±|z,±〉 describing the eigenstates of the Pauli spin ‘vector’ σ in which the particles have spin ‘up’ or ‘down’ along z-axis, in the units of =2. Our interest lies in comparing the quantum mechanical predictions,

EQ.M.Ψz(a,b)=Ψz|σ1aσ2b|Ψz=cosθab, 3.46

together with

EQ.M.Ψz(a)=Ψz|σ1aMANI|Ψz=0andEQ.M.Ψz(b)=Ψz|MANIσ2b|Ψz=0, 3.47

of spin correlations between the two subsystems, with those derived within our locally causal framework, regardless of the relative distance between the two remote locations represented by the unit detection vectors a and b. Here MANI is the identity matrix. The corresponding locally causal description of this emblematic system within our framework thus involves only two contexts, n1=aar+adε+ and n2=bbr+bdε+, with measurement results defined by the functions

S7A(a,λk):=limsr1arsd1ad{D(ar,ad,0)N(sr1,sd1,0,λk)}={+1ifλk=+11ifλk=1}together withA(a,λk)=0 3.48

and

S7B(b,λk):=limsr2brsd2bd{+N(sr2,sd2,0,λk)D(br,bd,0)}={1ifλk=+1+1ifλk=1}together withB(b,λk)=0, 3.49

where s1sr1+sd1ε+ and s2sr2+sd2ε+ represent the directions of the two spins emerging from the source.

Figure 5.

Figure 5.

A spin-less neutral pion decays into an electron–positron pair (such a photon-less decay is quite rare but not impossible, and will suffice for our theoretical purposes here). Measurements of spin components on each separated fermion are performed at remote stations 1 and 2, providing binary outcomes along arbitrary directions such as a and b.

Next, recalling that physically all bivectors ξ(n)∈S2S3 represent spins [16,17], we require that the total spin-zero angular momentum for the initial or ‘complete’ state associated with the above measurement functions is conserved,

total real spin:=λξ(sr1)+λξ(sr2)=0sr1=sr2sr 3.50

and

total dual spin:=λξ(sd1)+λξ(sd2)=0sd1=sd2sd, 3.51

just as it is in the EPR-Bohm-type experiment depicted in figure 5. For N(sr,sd,0,λk), this is equivalent to the condition

N(sr1,sd1,0,λk)+N(sr2,sd2,0,λk)=0N(sr1,sd1,0,λk)=N(sr2,sd2,0,λk). 3.52

In the light of the product rule (3.19) for anti-symmetric elements, the above condition is also equivalent to the condition

N(sr1,sd1,0,λk)N(sr2,sd2,0,λk)={N(sr,sd,0,λk)}2=N2(sr,sd,0,λk)=1. 3.53

In the next subsection, we will derive this condition geometrically as a natural consequence of the twist in the Hopf bundle of S3. Here, it leads to the following statistical equivalence, which can be viewed also as a geometrical identity:

limm[1mk=1mA(a,λk)B(b,λk)]limm[1mk=1mN(ar,ad,0,λk)N(br,bd,0,λk)]. 3.54

Given definitions (3.50) and (3.51), there is more than one way to prove this identity. In the following, we will use one such way. But it can also be proved by simply taking the limits in (3.50) and (3.51) while maintaining (3.54), and then using equation (3.13). Then the computation of correlations between A(a,λk)=±1 and B(b,λk)=±1 works out as

EL.R.EPR(a,b)=limm[1mk=1mA(a,λk)B(b,λk)] 3.55
=limm[1mk=1m{limsr1arsd1ad[D(ar,ad,0)N(sr1,sd1,0,λk)]}{limsr2brsd2bd[N(sr2,sd2,0,λk)D(br,bd,0)]}] 3.56
=limm[1mk=1m{limsr1arsd1adlimsr2brsd2bd[D(ar,ad,0){N(sr1,sd1,0,λk)N(sr2,sd2,0,λk)}D(br,bd,0)]}] 3.57
=limm[1mk=1m{limsr1arsd1adlimsr2brsd2bd[λkN(ar,ad,0,λk){1}λkN(br,bd,0,λk)]}] 3.58
=limm[1mk=1m{limsr1arsd1adlimsr2brsd2bd[+(λk)2N(ar,ad,0,λk)N(br,bd,0,λk)]}] 3.59
=limm[1mk=1mN(ar,ad,0,λk)N(br,bd,0,λk)] 3.60
=arbradbdlimm[1mk=1mN(ar×br+ad×bd,ar×bd+ad×br,0,λk)] 3.61
=arbradbdlimm[1mk=1mλk]D(ar×br+ad×bd,ar×bd+ad×br,0) 3.62
=cosθab0. 3.63

Here equation (3.58) follows from equation (3.57) by substituting the functions A(a,λk) and B(b,λk) from their definitions (3.50) and (3.51); equation (3.59) follows from equation (3.58) by using the ‘product of limits equal to limits of product’ rule (which can be verified by noting that the same multivector results from the limits in equations (3.58) and (3.59)); equation (3.60) follows from equation (3.59) by using (i) relations (3.13) (thus rewriting all anti-symmetric elements in the same bases), (ii) the associativity of the geometric product and (iii) the consequence (3.55) of the conservation of the spin angular momenta in S3; equation (3.61) follows from equation (3.60) by recalling that scalars λk commute with the elements of all grades; equation (3.62) follows from equation (3.61) because λ2=+1, and by removing the superfluous limit operations; equation (3.63) follows from equation (3.62) by using the geometric product (3.19); equation (3.64) follows from equation (3.63) by using relations (3.13); and finally equation (3.65) follows from equation (3.64) by using equation (3.18) and because the scalar coefficient of D vanishes in the m limit since λk is a fair coin. This proves that singlet correlations (see footnote 3) are correlations among the scalar points of a quaternionic S3.

As we did above for the general case, let us again evaluate the sum in equation (3.62) somewhat differently to bring out the crucial role played by λk in the derivation of the correlations (3.65). Using relations (3.13) and (3.19), sum (3.62) can be evaluated directly by recognizing that in the right- and left-oriented S7 the following geometrical relations hold:

N(ar,ad,0,λk=+1)N(br,bd,0,λk=+1)=arbradbdN(ar×br+ad×bd,ar×bd+ad×br,0,λk=+1)=arbradbdD(ar×br+ad×bd,ar×bd+ad×br,0)=D(ar,ad,0)D(br,bd,0) 3.64

and

N(ar,ad,0,λk=1)N(br,bd,0,λk=1)=arbradbdN(ar×br+ad×bd,ar×bd+ad×br,0,λk=1)=arbradbd+D(ar×br+ad×bd,ar×bd+ad×br,0)=brarbdadD(br×ar+bd×ad,br×ad+bd×ar,0)=D(br,bd,0)D(ar,ad,0). 3.65

Changes in λk thus alternate the relative order of D(ar,ad,0) D(br,bd,0). As a result, sum (3.62) reduces to

EL.R.EPR(a,b)=limm[1mk=1mN(ar,ad,0,λk)N(br,bd,0,λk)]=12{N(ar,ad,0,λk=+1)N(br,bd,0,λk=+1)}+12{N(ar,ad,0,λk=1)N(br,bd,0,λk=1)}=12{D(ar,ad,0)D(br,bd,0)}+12{D(br,bd,0)D(ar,ad,0)}=12{arbr+brar}12{adbd+bdad}=arbradbd=ab=cosθab, 3.66

because the orientation λk of S7 is a fair coin. Here ab=12{ab+ba} is the standard definition of the inner product.

The above method of calculating the correlations suggests that a given initial state λ of the physical system can be viewed also as specifying an ordering relation between N(nr,nd,0,λ) and the detectors D(nr,nd,0) that measure it:

N(ar,ad,0,λk=+1)N(br,bd,0,λk=+1)=D(ar,ad,0)D(br,bd,0) 3.67

or

N(ar,ad,0,λk=1)N(br,bd,0,λk=1)=D(br,bd,0)D(ar,ad,0). 3.68

Then, using the right-hand sides of equations (3.66) and (3.67), the above pair can be reduced to the combined relation

N(ar×br+ad×bd,ar×bd+ad×br,0,λk)=λkD(ar×br+ad×bd,ar×bd+ad×br,0), 3.69

which is equivalent to relation (3.13) for normalized vectors. We have thus proved that the ordering relations (3.69) and (3.70) between N(nr,nd,0,λ) and D(nr,nd,0) are equivalent to the alternatively possible orientations of S7.

3.2.2. Conservation of the initial spin-0 from the twist in the Hopf bundle of S3

Note that, apart from the initial state λk, the only other assumption used in the above derivation is that of the conservation of spin angular momentum (3.55). These two assumptions are necessary and sufficient to dictate the singlet correlations:

EL.R.EPR(a,b)=limm[1mk=1mA(a,λk)B(b,λk)]=cosθab. 3.70

The conservation of spin, however, can be understood in terms of the twist in the Hopf bundle of S3SU(2). Recall that locally (in the topological sense) S3 can be written as a product S2×S1, but globally it has no cross-section [32,33]. It can be viewed also as a principal U(1) bundle over S2, with the points of its base space S2 being the elements of the Lie algebra su(2), which are pure quaternions or bivectors [16,30,34]. The product of two such bivectors are, in general, non-pure quaternions of the form (2.18), and are elements of the group SU(2) itself. That is to say, they are points of the bundle space S3, whose elements are the preimages of the points of the base space S2 [32,33]. These preimages are 1-spheres, S1, called Hopf circles, or Clifford parallels [35]. As these 1-spheres are the fibres of the bundle, they do not share a single point in common. Each circle threads through every other circle in the bundle as shown in figure 6, making them linked together in a highly non-trivial configuration. This configuration can be quantified by the following relation among the fibres [34]:

eiψ=eiϕeiψ+, 3.71

where e and e+, respectively, are the U(1) fibre coordinates above the two hemispheres H and H+ of the base space S2, with spherical coordinates (0≤θ<π,0≤ϕ<2π); ϕ is the angle parametrizing a thin strip HH+ around the equator of S2 [θπ/2] and e is the transition function that glues the two sections H and H+ together, thus constituting the 3-sphere. It is evident from equation (3.73) that the fibres match perfectly at the angle ϕ=0 (modulo 2π), but differ from each other at all intermediate angles ϕ. For example, e and e+ differ by a minus sign at the angle ϕ=π. Now to derive the conservation of spin (3.55), we rewrite exponential relation (3.73) in our notation as

{ξ(ar)ξ(sr1)}={ξ(ar)ξ(br)}{ξ(sr2)ξ(br)}, 3.72

by identifying the angles ηarsr1 and ηsr2br between ar and sr1 and sr2 and br with the fibres ψ and ψ+, and the angle ηarbr between ar and br with the generator of the transition function e on the equator of S2. Here, we have used the sign conventions to match the sign conventions in our definitions (3.50) and (3.51) and the correlations (3.65). The above representation of equation (3.73) is not as unusual as it may appear at first sight once we recall that geometric products of the bivectors appearing in it are all non-pure quaternions, which can be parametrized to take the exponential form

ξ(u)ξ(v)=(λIu)(λIv)=cos(ηuv)+uvuvsin(ηuv)=exp{uvuvηuv}. 3.73

Multiplying both sides of equation (3.74) from the left with ξ(ar) and noting that all unit bivectors square to −1, we obtain

ξ(sr1)=ξ(br)ξ(sr2)ξ(br). 3.74

Multiplying the numerator and denominator on the r.h.s. of this similarity relation with −ξ(br) from the right and ξ(br) from the left then leads to the conservation of the spin angular momentum, just as we have specified in equation (3.52):

λξ(sr1)=λξ(sr2)sr1=sr2. 3.75

Similarly, we can derive analogous conservation law for the zero spin within the dual 3-sphere, as specified in equation (3.53):

λξ(sd1)=λξ(sd2)sd1=sd2. 3.76

Given the conservation laws derived in equations (3.77) and (3.78), we can combine them to arrive at the net condition (3.55):

N(sr1,sd1,0,λk)N(sr2,sd2,0,λk)=1, 3.77

which was used in equation (3.60) to derive the strong correlations (3.65). We have thus shown that the conservation of spin angular momentum is not an additional assumption, but follows from the very geometry and topology of the 3-sphere.

Figure 6.

Figure 6.

The tangled web of linked Hopf circles depicting the geometrical and topological non-trivialities of the 3-sphere.

In fact, it is not difficult to see from the twist in the Hopf bundle of S3, captured in equation (3.74), that if we set ar=br (or equivalently ηarbr=0) for all fibres, then S3 reduces to the trivial bundle S2×S1, since then the fibre coordinates ηarsr1 and ηsr2br would match up exactly on the equator of S2 [θπ/2]. In general, however, for arbr, S3S2×S1. For example, when ar=−br (or equivalently when ηarbr=π) there will be a sign difference between the fibres at that point of the equator [3234]. That in turn would produce a twist in the bundle analogous to the twist in a Möbius strip. It is this non-trivial twist in the S3 bundle that is responsible for the observed sign flips in the product AB of measurement results, from AB=1 for ar=br to AB=+1 for ar=−br, as evident from the correlations (3.65). In the appendix of the first chapter of [16] this is illustrated in a toy model of Alice and Bob in a Möbius world. But while the twist in a Möbius strip is in the S1 worth of parallel lines that make up the untwisted cylinder, the twist in S3 is in the arrangement of the S2 worth of circles that make up that 3-sphere (cf. figure 6) [34].

3.2.3. The general case of arbitrarily entangled quantum state

We now proceed to generalize the above 2-particle case3 to the most general case of arbitrarily entangled quantum state considered in (3.26). To this end, let us consider any arbitrary number of measurement results corresponding to those in (3.26) and (3.27):

A(a,λk)B(b,λk)C(c,λk)D(d,λk)E(e,λk)F(f,λk)G(g,λk), 3.78

with each pair such as (C,D) defined for the contexts such as n3=ccr+cdε+ and n4=ddr+ddε+:

S7C(c,λk):=limtr1crtd1cd{D(cr,cd,0)N(tr1,td1,0,λk)}={+1ifλk=+11ifλk=1}together withC(c,λk)=0 3.79

and

S7D(d,λk):=limtr2drtd2dd{+N(tr2,td2,0,λk)D(dr,dd,0)}={1ifλk=+1+1ifλk=1}together withD(d,λk)=0. 3.80

If the number of measurement results happens to be odd instead of even, then the product of an even number of results can be first evaluated, and then that factor can be paired with the remaining result, as done in equation (3.120).

It is important to recall here the elementary fact that any experiment of any kind in physics can always be reduced to a series of questions with ‘yes’/‘no’ answers, represented by binary measurement outcomes of the form (3.80)–(3.82). Therefore, the measurement framework we have developed here is completely general and applicable to any physical experiment.

Now, as in the EPR-Bohm-type experiment with a singlet state discussed above (cf. figure 5 and equations (3.54), (3.55) and (3.79)), for each pair of measurement outcomes such as (3.82) the twist in the Hopf bundle of S3 dictates the condition

N(tr1,td1,0,λk)+N(tr2,td2,0,λk)=0N(tr1,td1,0,λk)=N(tr2,td2,0,λk), 3.81

or equivalently the condition

N(tr1,td1,0,λk)N(tr2,td2,0,λk)={N(tr,td,0,λk)}2=N2(tr,td,0,λk)=1. 3.82

Consequently, by following the steps analogous to those in equations (3.50)–(3.62), we arrive at the geometrical equivalence

limm[1mk=1mC(c,λk)D(d,λk)]limm[1mk=1mN(cr,cd,0,λk)N(dr,dd,0,λk)], 3.83

for each pair (C,D) of measurement outcomes. As a result, the correlations among the outcomes (3.80) take the form

EL.R.(a,b,c,d,)=limm[1mk=1mA(a,λk)B(b,λk)C(c,λk)D(d,λk)] 3.84
=limm[1mk=1mN(ar,ad,0,λk)N(br,bd,0,λk)N(cr,cd,0,λk)N(dr,dd,0,λk)] 3.85
=limm[1mk=1mN(xr,xd,0,λk)N(yr,yd,0,λk)] 3.86
=xryrxdydlimm[1mk=1mN(xr×yr+xd×yd,xr×yd+xd×yr,0,λ)] 3.87
=xryrxdydlimm[1mk=1mλk]D(xr×yr+xd×yd,xr×yd+xd×yr,0) 3.88
=cosθxy(a,b,c,d,)0, 3.89

because λk is a fair coin. We can now identify this locally causal expectation with its quantum mechanical counterpart:

Ψ|O^(a,b,c,d,)|Ψ=EL.R.(a,b,c,d,)=cosθxy(a,b,c,d,). 3.90

This completes the proof of the theorem for the general quantum state stated at the beginning of §3.2.

3.3. Derivation of Tsirel’son’s bounds on the correlation strength

Let us now investigate the bounds on the strengths of the local-realistic correlations (3.91) by deriving Tsirel’son’s bounds [17] for arbitrary quantum states [16]. To this end, instead of (3.80) consider an alternative set of measurement results such as

A(a,λk)B(b,λk)C(c,λk)D(d,λk)E(e,λk)F(f,λk)G(g,λk), 3.91

with each pair such as (C,D) defined for contexts such as n3=c′↔cr+cdε+ and n4=d′↔dr+ddε+. The correlation between these results can then be derived following steps analogous to those in the previous subsection:

EL.R.(a,b,c,d,)=limm[1mk=1mA(a,λk)B(b,λk)C(c,λk)D(d,λk)] 3.92
=limm[1mk=1mN(ar,ad,0,λk)N(br,bd,0,λk)N(cr,cd,0,λk)N(dr,dd,0,λk)] 3.93
=limm[1mk=1mN(xr,xd,0,λk)N(yr,yd,0,λk)] 3.94
=xryrxdydlimm[1mk=1mN(xr×yr+xd×yd,xr×yd+xd×yr,0,λ)] 3.95
=xryrxdydlimm[1mk=1mλk]D(xr×yr+xd×yd,xr×yd+xd×yr,0) 3.96
=cosθxy(a,b,c,d,)0. 3.97

In particular, in equation (3.96) we then have the relation

EL.R.(a,b,c,d,)=EL.R.(x,y)=limm[1mk=1mN(xr,xd,0,λk)N(yr,yd,0,λk)]. 3.98

Needless to say, we are free to choose the contexts different from the primed and unprimed ones chosen in (3.93) and (3.80), as well as any combinations and/or mixtures of them, such as (a,b′,c′′,d′′′,e′′′′,…). Consequently, we may consider the following four relations corresponding to some alternative combinations of measurement contexts so that

EL.R.(x,y)=limm[1mk=1mN(xr,xd,0,λk)N(yr,yd,0,λk)], 3.99
EL.R.(x,y)=limm[1mk=1mN(xr,xd,0,λk)N(yr,yd,0,λk)], 3.100
EL.R.(x,y)=limm[1mk=1mN(xr,xd,0,λk)N(yr,yd,0,λk)] 3.101
andEL.R.(x,y)=limm[1mk=1mN(xr,xd,0,λk)N(yr,yd,0,λk)]. 3.102

Using the above four expressions, the corresponding Bell-CHSH string of expectation values [17], namely the coefficient

EL.R.(x,y)+EL.R.(x,y)+EL.R.(x,y)EL.R.(x,y) 3.103

corresponding to this fully general case of arbitrary number of contexts and measurement results, can be written as

EL.R.(x,y)+EL.R.(x,y)+EL.R.(x,y)EL.R.(x,y)=limm[1mk=1m{N(xr,xd,0,λk)N(yr,yd,0,λk)+N(xr,xd,0,λk)N(yr,yd,0,λk)+k=1mN(xr,xd,0,λk)N(yr,yd,0,λk)N(xr,xd,0,λk)N(yr,yd,0,λk)}]. 3.104

But as N(xr,xd,0,λk) and N(yr,yd,0,λk) represent two independent equatorial points of an S6 within S7, we take them to belong to two disconnected ‘sections’ of the bundle S5×S1 (i.e. two disconnected S5S6), satisfying

[N(xr,xd,0,λk),N(yr,yd,0,λk)]=0xrandydIR3, 3.105

which is equivalent to anticipating null outcomes along the directions xr×yd exclusive to both xr and yd. If we now square the integrand of equation (3.106), use the above commutation relations, and use the fact that all N(nr,nd,0,λk) square to −1, then the absolute value of the above Bell-CHSH string (3.105) leads to the following variance inequality:

|EL.R.(x,y)+EL.R.(x,y)+EL.R.(x,y)EL.R.(x,y)|limm[1mk=1m{4+4Txx(λk)Tyy(λk)}], 3.106

where the classical commutators

Txx(λk):=12[N(xr,xd,0,λk),N(xr,xd,0,λk)]=N(xr×xr+xd×xd,xr×xd+xd×xr,0,λk) 3.107

and

Tyy(λk):=12[N(yr,yd,0,λk),N(yr,yd,0,λk)]=N(yr×yr+yd×yd,yr×yd+yd×yr,0,λk), 3.108

are the geometric measures of the torsion within S7 [16,17]. Thus, it is the non-vanishing torsion T within S7—the parallelizing torsion which makes the Riemann curvature of this representation space vanish—that is responsible for the stronger-than-linear correlations. We can see this from equation (3.108) by setting T=0, and in more detail as follows.

Using the above expressions for the intrinsic torsions Txx(λk) and Tyy(λk) and defining the unnormalized vectors

ur:=(xr×xr+xd×xd)andud:=(xr×xd+xd×xr) 3.109

and

vr:=(yr×yr+yd×yd)andvd:=(yr×yd+yd×yr), 3.110

together with uv:=urvr+udvd analogous to ab:=arbr+adbd given in equation (3.18), we have the product

Txx(λk)Tyy(λk)=urvrudvdN(ur×vr+ud×vd,ur×vd+ud×vr,0,λk)=uvN(ur×vr+ud×vd,ur×vd+ud×vr,0,λk). 3.111

As a result, we have

limm[1mk=1mTxx(λk)Tyy(λk)]=uvlimm[1mk=1mN(ur×vr+ud×vd,ur×vd+ud×vr,0,λk)]=uvlimm[1mk=1mλk]D(ur×vr+ud×vd,ur×vd+ud×vr,0)=uv0=urvrudvd, 3.112

where u and v are unnormalized vectors. Using the constraints analogous to those expressed in equation (3.18), we then have

limm[1mk=1mTxx(λk)Tyy(λk)]=urvrudvd=(x×x)(y×y), 3.113

upon using a vector identity. Consequently, substituting the above value in the variance inequality (3.108), it simplifies to

|EL.R.(x,y)+EL.R.(x,y)+EL.R.(x,y)EL.R.(x,y)|21(x×x)(y×y). 3.114

Finally, since trigonometry dictates the geometrical bounds −1≤(x×x′)⋅(y′×y)≤+1, this inequality reduces to

|EL.R.(x,y)+EL.R.(x,y)+EL.R.(x,y)EL.R.(x,y)|22, 3.115

exhibiting the bounds on all possible correlations. This result can also be derived directly from the correlations (3.91):

|EL.R.(x,y)+EL.R.(x,y)+EL.R.(x,y)EL.R.(x,y)|=|cosθxycosθxycosθxy+cosθxy|22. 3.116

Let us stress again that these bounds are completely general, valid for any quantum state, such as the one in equation (3.27).

3.4. Fragility of strong correlations increases with number of contexts

As we saw in equation (3.65), in the case of two contexts the scalar part of the geometric product N(ar,ad,0,λk)N(br,bd,0,λk) is

cosθxy(a,b)=arbradbd=ab=cosθab. 3.117

And it is this scalar part that captures the pattern of strong correlations exhibited by the singlet system. Analogously, for three contexts the scalar part of the corresponding geometric product N(ar,ad,0,λk)N(br,bd,0,λk)N(cr,cd,0,λk) works out to give

cosθxy(a,b,c)=ar{(br×cr)+(bd×cd)}+ad{(br×cd)+(bd×cr)}, 3.118

with the geometric complexity of the scalar part now increased considerably. And for four contexts the scalar part of the geometric product

N(ar,ad,0,λk)N(br,bd,0,λk)N(cr,cd,0,λk)N(dr,dd,0,λk)

works out to be even more intricate:

cosθxy(a,b,c,d)=(arbr)(crdr)+(adbd)(crdr)+(arbr)(cddd)+(adbd)(cddd)(ar×br+ad×bd)(cr×dr+cd×dd)(ar×bd+ad×br)(cr×dd+cd×dr), 3.119

because

N(ar,ad,0,λ)N(br,bd,0,λ)=arbradbdN(ar×br+ad×bd,ar×bd+ad×br,0,λ) 3.120

and

N(cr,cd,0,λ)N(dr,dd,0,λ)=crdrcdddN(cr×dr+cd×dd,cr×dd+cd×dr,0,λ). 3.121

Needless to say, this pattern of increased geometrical complexity continues with the addition of each new context. As a result, the fragility of the strong correlations also increases rapidly with the number of contexts. This is easy to see already from the above scalar part for just four contexts. It is easy to see that even a slight change, such as ar±Δar, in only one of the four contexts ar+adε+ would lead to a dramatic change in the pattern of the corresponding correlation.

3.5. Reproducing the strong correlations exhibited by the GHSZ states

Now, as a second example of strong correlations, consider the four-particle Greenberger–Horne–Zeilinger state (or the GHSZ state [7]):

|Ψz=12{|z,+1|z,+2|z,3|z,4|z,1|z,2|z,+3|z,+4}. 3.122

Unlike the singlet state, this entangled state is not rotationally invariant [7]. There is a privileged direction, and it is taken to be the z-direction of the experimental set-up [7]. The z-direction thus represents the axis of anisotropy of the system. The quantum mechanical expectation value of the product of the four outcomes of the spin components in this state—namely, the products of finding the spin of particle 1 along a, the spin of particle 2 along b, etc.—is given by

EQ.M.Ψz(a,b,c,d):=Ψz|σaσbσcσd|Ψz. 3.123

This expectation value has been calculated in appendix F of [7]. In the spherical coordinates—with angles such as θa and ϕa representing the polar and azimuthal angles, respectively, of the direction a, b, etc.—it works out to be

EQ.M.Ψz(a,b,c,d)=cosθacosθbcosθccosθdsinθasinθbsinθcsinθdcos(ϕa+ϕbϕcϕd). 3.124

Our goal now is to reproduce this result within our locally causal framework described above (see also ch. 6 of [16]). To this end, we note that the state (3.124) represents, not a two-level, but a four-level quantum system (see footnote 3). Each of the two pairs of the spin-12 particles it represents has four alternatives available to it. These alternatives can be represented by a state-vector of the form

|ψ=γ1|+++γ2|++γ3|++γ4|, 3.125

where γ1, γ2, γ3 and γ4 are complex numbers satisfying |γ1|2+|γ2|2+|γ3|2+|γ4|2=1, which is equivalent to defining a unit 7-sphere, with |γ1|2, |γ2|2, |γ3|2 and |γ4|2 being the probabilities of actualizing the states |++〉, |+−〉, |−+〉 and |−−〉, respectively. Therefore, we may begin with four local maps of the form

S7A(a,λk):=limsr1arsd1ad{D(ar,ad,0)N(sr1,sd1,0,λk)}={+1ifλk=+11ifλk=1}withA(a,λk)=0, 3.126
S7B(b,λk):=limsr2brsd2bd{+N(sr2,sd2,0,λk)D(br,bd,0)}={1ifλk=+1+1ifλk=1}withB(b,λk)=0, 3.127
S7C(c,λk):=limtr1crtd1cd{D(cr,cd,0)N(tr1,td1,0,λk)}={+1ifλk=+11ifλk=1}withC(c,λk)=0 3.128
andS7D(d,λk):=limtr2drtd2dd{+N(tr2,td2,0,λk)D(dr,dd,0)}={1ifλk=+1+1ifλk=1}withD(d,λk)=0, 3.129

together with their geometric product

(AaBbCcDd)(λk)=A(a,λk)B(b,λk)C(c,λk)D(d,λk)=±1S7 3.130

(cf. appendix B), and the corresponding conservation laws

N(sr1,sd1,0,λk)N(sr2,sd2,0,λk)={N(sr,sd,0,λk)}2=N2(sr,sd,0,λk)=1 3.131

and

N(tr1,td1,0,λk)N(tr2,td2,0,λk)={N(tr,td,0,λk)}2=N2(tr,td,0,λk)=1. 3.132

As we saw above, the expectation value of the product of the outcomes A(a,λk), B(b,λk), C(c,λk) and D(d,λk) then works out to be the scalar part of the geometric product N(ar,ad,0,λk)N(br,bd,0,λk)N(cr,cd,0,λk)N(dr,dd,0,λk), as spelled out in equation (3.121). Using a simple vector identity, this expectation value can be further simplified to take the form

EL.R.GHZ(a,b,c,d)=(arbr)(crdr)+(adbd)(crdr)+(arbr)(cddd)+(adbd)(cddd)(arcr)(brdr)+(brcr)(ardr)(arcd)(brdd)+(brcd)(ardd)(adcr)(bddr)+(bdcr)(addr)(adcd)(bddd)+(bdcd)(addd)(arcr)(bddd)+(bdcr)(ardd)(arcd)(bddr)+(bdcd)(ardr)(adcr)(brdd)+(brcr)(addd)(adcd)(brdr)+(brcd)(addr). 3.133

Upon using the constraints in equation (3.18) to set the terms involving arcd, etc., to zero, this expected value reduces to

EL.R.GHZ(a,b,c,d)=(arbr)(crdr)+(adbd)(crdr)+(arbr)(cddd)+(adbd)(cddd)(arcr)(brdr)+(brcr)(ardr)(adcd)(bddd)+(bdcd)(addd)(arcr)(bddd)+(bdcd)(ardr)+(brcr)(addd)(adcd)(brdr). 3.134

Then, again using the constraints in equation (3.18) to identify arbr with adbd, etc., the expected value takes the form

EL.R.GHZ(a,b,c,d)=2(arbr)(crdr)+2(adbd)(cddd)2(arcr)(brdr)+2(brcr)(ardr)+2(bdcd)(addd)2(adcd)(bddd). 3.135

Next, in order to satisfy the above constraints, we relate the external measurement directions a, b, c and d∈IR3, chosen freely by the experimenters, with the directions ar, ad, etc., within our representation space S7Kλ, as follows:

A(ax,ay,az,λk)=±1S7,to be detected byD(ar;ad;0)=D(ax24,+ay24,0;0,0,az24;0), 3.136
B(bx,by,bz,λk)=±1S7,to be detected byD(br;bd;0)=D(+bx24,+by24,0;0,0,+bz24;0), 3.137
C(cx,cy,cz,λk)=±1S7,to be detected byD(cr;cd;0)=D(+cx24,+cy24,0;0,0,+cz24;0) 3.138
andD(dx,dy,dz,λk)=±1S7, to be detected byD(dr;dd;0)=D(+dx24,dy24,0;0,0,dz24;0). 3.139

Here the 4th roots of 2 in the denominators of D (instead of 2 as in equation (3.9)) arise because the product of four factors, N(ar,ad,0,λk)N(br,bd,0,λk) N(cr,cd,0,λk) N(dr,dd,0,λk), instead of two, N(ar,ad,0,λk)N(br,bd,0,λk), is involved in calculation (3.87) of the correlation, while maintaining the unity of the radius of S7. Note also that components of only external vectors are involved in the definitions of the four detectors. And they do not mix with each other, so that Bell’s condition of local causality, or parameter independence [5,6], is strictly respected throughout. Substituting these coordinate values into the remaining vectors in the expected value (3.137) then reduces that value to

EL.R.GHZ(a,b,c,d)=+azbzczdzaybycydyaxbycxdyaybxcydxaxbxcxdx+axbxcydyaxbycydxaybxcxdy+aybycxdx. 3.140

In the spherical coordinates—with angles θa and ϕa representing, respectively, the polar and azimuthal angles of the direction a, etc., for all four measurement directions—this expression of the expected value can be further simplified to

EL.R.GHZ(a,b,c,d)=cosθacosθbcosθccosθdsinθasinθbsinθcsinθdcos(ϕa+ϕbϕcϕd). 3.141

This is exactly the quantum mechanical prediction (3.126) for the four-particle GHZ state (3.124). We have derived this prediction, however, as purely geometric effects within our locally causal framework. The GHZ correlations thus simply exhibit the classical, deterministic, local and realistic correlations among four points of our representation space S7.

4. Bell’s theorem, its experimental tests and the GHSZ variant

4.1. Bell-test experiments: from inceptions to loophole-free advances

Contrary to what we have demonstrated above, it is widely believed that the so-called Bell-test experiments—from their initial conceptions summarized in the classic review paper by Clauser & Shimony [20] to their state-of-the-art ‘loophole-free’ variants [813]—undermine any prospects of a locally causal understanding of quantum correlations. It is important to appreciate, however, that all such experiments simply confirm the predictions of quantum mechanics. They neither contradict the quantum mechanical predictions nor go beyond them in any sense. Moreover, as in §i, we have reproduced all of the quantum mechanical predictions for the singlet state exactly, the Bell-test experiments [813] do not contradict the predictions of our model either. Rather, they simply corroborate them.

More precisely, in the analysis of all such experiments one averages over ‘coincidence counts’ to calculate expectation values in the form

E(a,b)=limn1[1nk=1nA(a,λk)B(b,λk)][C++(a,b)+C(a,b)C+(a,b)C+(a,b)][C++(a,b)+C(a,b)+C+(a,b)+C+(a,b)]=cosθab, 4.1

where C+−(a,b), etc., represent the number of simultaneous occurrences of detections +1 along a and −1 along b, etc. In addition, they observe individual results A(a,λk) and B(b,λk) at each remote station to find that on average

A(a,λk)=0andB(b,λk)=0. 4.2

Finally, they observe that Bell-CHSH inequalities [15] with the absolute bound of 2 are exceeded by a factor of 2:

22E(a,b)+E(a,b)+E(a,b)E(a,b)+22. 4.3

Relations (4.1)–(4.3) are precisely the predictions of quantum mechanics for the singlet state (3.47). In practice, however, it is often difficult to perform such experiments with a pair of spin-12 particles considered in figure 5. For this reason, the usual preference for performing the Bell-test experiments is to measure correlations in a pair of linearly polarized photons instead of in a pair of spin-12 particles. But predictions (4.1)–(4.3) of quantum mechanics do not change for either choice, apart from a factor of 2 in the angular-dependence of the correlations (cf. eqns (1) and (33) of [36]). In this paper, however, we are not concerned about the practical difficulties in performing the experiments, and therefore the original reformulation of the EPR argument by Bohm in terms of a pair of spin-12 particles, such as the electron–positron pair depicted in figure 5, is sufficient for our purposes. What is more important to appreciate is the fact that our S7 model predicts precisely the relations (4.1)–(4.3) for the entangled state (3.47), as can be verified from our predictions (3.65), (3.50), (3.51) and (3.117).

Thus, the crucial difference between the predictions of our S7 model and those of quantum mechanics is not in the observational content, but in the interpretation of the latter in terms of non-locality.4 And this interpretation depends entirely on the argument put forward by Bell and his followers [5,6,20]. This argument, however, is fatally flawed, as we now demonstrate.

4.2. Surprising oversight in the derivation of the Bell-CHSH inequalities

From the outset let us stress that Bell’s so-called theorem is by no means a ‘theorem’ in the sense that word is used by mathematicians but rather a word-statement, which claims that no physical theory which is realistic as well as local in the strict senses espoused by Einstein [4] and later formulated by Bell (see footnote 2) [5,6] can reproduce all of the statistical predictions of quantum theory [14]. This word-statement is based on ‘violations’ of certain mathematical inequalities, which are derived by considering four incompatible EPR-Bohm-type experiments, and without using a single concept from quantum theory. While the bounds thus derived on the inequalities are exceeded by the predictions of quantum theory and ‘violated’ in actual experiments, their derivation happens to be marred by a serious conceptual oversight.

To appreciate this, consider the standard EPR type spin-12 experiment, as proposed by Bohm and later used by Bell to prove his theorem. Alice is free to choose a detector direction a or a′ and Bob is free to choose a detector direction b or b′ to detect spins of the fermions they receive from a common source, at a space-like distance from each other. The objects of interest then are the bounds on the sum of possible averages put together in the manner of CHSH [15],

E(a,b)+E(a,b)+E(a,b)E(a,b), 4.4

with each average defined as

E(a,b)=limn1[1nk=1nA(a,λk)B(b,λk)]Ak(a)Bk(b), 4.5

where A(a,λk)Ak(a)=±1 and B(b,λk)Bk(b)=±1 are the respective measurement results of Alice and Bob. Now, since Ak(a)=±1 and Bk(b)=±1, the average of their product is 1Ak(a)Bk(b)+1. As a result, we can immediately read off the upper and lower bounds on the string of the four averages considered above in (4.4):

4Ak(a)Bk(b)+Ak(a)Bk(b)+Ak(a)Bk(b)Ak(a)Bk(b)+4. 4.6

This should have been Bell’s final conclusion. However, by continuing, Bell overlooked something that is physically unjustifiable. He replaced the above sum of four separate averages of real numbers with the following single average:

E(a,b)+E(a,b)+E(a,b)E(a,b)Ak(a)Bk(b)+Ak(a)Bk(b)+Ak(a)Bk(b)Ak(a)Bk(b). 4.7

As innocuous as this step may seem mathematically, it is in fact an illegitimate step physically, because what is being averaged on its r.h.s. are unobservable and unphysical quantities. But it allows us to reduce the sum of four averages to

Ak(a){Bk(b)+Bk(b)}+Ak(a){Bk(b)Bk(b)}. 4.8

And as Bk(b)=±1, if |Bk(b)+Bk(b)|=2, then |Bk(b)Bk(b)|=0, and vice versa [17]. Consequently, using Ak(a)=±1, it is easy to conclude that the absolute value of the above average cannot exceed 2, just as Bell concluded5 :

2Ak(a)Bk(b)+Ak(a)Bk(b)+Ak(a)Bk(b)Ak(a)Bk(b)+2. 4.9

Let us now try to understand why the replacement in (4.7) is illegitimate.6 To begin with, Einstein’s (or even Bell’s own) notion of local realism does not, by itself, demand this replacement. As this notion is captured already in the very definition (see footnote 2) [5,6] of the functions A(a,λk), the l.h.s. of (4.7) satisfies the demand of local realism perfectly well. Nor can a possible statistical independence of the four separate averages on the l.h.s. of (4.7) justify their replacement with the single average on its r.h.s., at the expense of what is physically possible in the actual experiments. To be sure, mathematically there is nothing wrong with a replacement of four separate averages with a single average. Indeed, every school child knows that the sum of averages is equal to the average of the sum. But this rule of thumb is not valid in the above case, because (a,b), (a,b′), (a′,b) and (a′,b′) are mutually exclusive pairs of measurement directions, corresponding to four incompatible experiments. Each pair can be used by Alice and Bob for a given experiment, for all runs 1 to n, but no two of the four pairs can be used by them simultaneously. This is because Alice and Bob do not have the ability to make measurements along counterfactually possible pairs of directions such as (a,b) and (a,b′) simultaneously. Alice, for example, can make measurements along a or a′, but not along a and a′ at the same time.

But this fact is rather devastating for Bell’s argument, because it means that his replacement (4.7) is illegitimate. Consider, for example, a specific run of the EPR-Bohm-type experiment and the corresponding quantity being averaged in (4.7):

Ak(a)Bk(b)+Ak(a)Bk(b)+Ak(a)Bk(b)Ak(a)Bk(b). 4.10

Here the index k=1 now represents a specific run of the experiment. But since Alice and Bob have only two particles at their disposal for each run, only one of the four terms of the above sum is physically meaningful. In other words, the above quantity is physically meaningless, because Alice, for example, cannot align her detector along a and a′ at the same time. And likewise, Bob cannot align his detector along b and b′ at the same time. What is more, this will be true for all possible runs of the experiment, or equivalently for all possible pairs of particles. Which implies that all of the quantities listed below, as they appear in average (4.9), are unobservable, and hence physically meaningless:

A1(a)B1(b)+A1(a)B1(b)+A1(a)B1(b)A1(a)B1(b),A2(a)B2(b)+A2(a)B2(b)+A2(a)B2(b)A2(a)B2(b),A3(a)B3(b)+A3(a)B3(b)+A3(a)B3(b)A3(a)B3(b),A4(a)B4(b)+A4(a)B4(b)+A4(a)B4(b)A4(a)B4(b),An(a)Bn(b)+An(a)Bn(b)+An(a)Bn(b)An(a)Bn(b).

But as each of the quantities above is physically meaningless, their average appearing on the r.h.s. of (4.7), namely

Ak(a)Bk(b)+Ak(a)Bk(b)+Ak(a)Bk(b)Ak(a)Bk(b), 4.11

is also physically meaningless7 [5,6,14,20]. That is to say, no physical experiment can ever be performed—even in principle—that can meaningfully allow to measure or evaluate the above average, as none of the above list of quantities could have experimentally observable values. Therefore the innocuous looking replacement (4.7) made by Bell is, in fact, illegal.

On the other hand, it is important to note that each of the averages appearing on the l.h.s. of replacement (4.7),

E(a,b)=limn1[1nk=1nA(a,λk)B(b,λk)]Ak(a)Bk(b), 4.12
E(a,b)=limn1[1nk=1nA(a,λk)B(b,λk)]Ak(a)Bk(b), 4.13
E(a,b)=limn1[1nk=1nA(a,λk)B(b,λk)]Ak(a)Bk(b) 4.14
andE(a,b)=limn1[1nk=1nA(a,λk)B(b,λk)]Ak(a)Bk(b), 4.15

is a perfectly well defined and observable physical quantity. Therefore, the bounds (4.6) on their sum are harmless. These bounds of {−4,+4}, however, have never been violated in any experiment. Indeed, nothing can violate them.

In summary, Bell and his followers derive the upper bound of 2 on the CHSH string of averages by an illegal move. In the middle of their derivation, they unjustifiably replace an observable, and hence physically meaningful quantity,

Ak(a)Bk(b)+Ak(a)Bk(b)+Ak(a)Bk(b)Ak(a)Bk(b), 4.16

with an experimentally unobservable, and hence physically entirely meaningless quantity (regardless of the method):

Ak(a)Bk(b)+Ak(a)Bk(b)+Ak(a)Bk(b)Ak(a)Bk(b). 4.17

If they do not make this illegitimate replacement, then the absolute upper bound on the CHSH string of averages is 4, not 2. And the absolute upper bound of 4 has never been exceeded—and can never be exceeded—in any experiment [17].

One may suspect that the above conclusion is perhaps an artefact of the discrete version (4.5) of the expectation values E(a,b). Perhaps it can be ameliorated if we considered the CHSH sum (4.4) in the following continuous form:

ΛA(a,λ)B(b,λ)dρ(λ)+ΛA(a,λ)B(b,λ)dρ(λ)+ΛA(a,λ)B(b,λ)dρ(λ)ΛA(a,λ)B(b,λ)dρ(λ), 4.18

where Λ is the space of all hidden variables λ, and ρ(λ) is the probability measure of λ [57]. Written in this form, it is now easy to see that the above CHSH sum of expectation values is both mathematically and physically identical to

Λ[A(a,λ){B(b,λ)+B(b,λ)}+A(a,λ){B(b,λ)B(b,λ)}]dρ(λ). 4.19

But as the above two integral expressions are identical to each other, we can use the second expression without loss of generality to prove that the criterion of reality used by Bell is unreasonably restrictive compared to that of EPR.

To begin with, expression (4.19) involves an integration over fictitious quantities (see footnote 7) such as A(a,λ){B(b,λ)+B(b,λ)} and A(a,λ){B(b,λ)B(b,λ)}. These quantities are not parts of the space of all possible measurement outcomes such as A(a,λ), A(a,λ), B(b,λ), B(b,λ), etc.; because that space—although evidently closed under multiplication—is not closed under addition. Since each function B(b,λ) is by definition either +1 or −1, their sum such as B(b,λ)+B(b,λ) can only take values from the set {−2,0,+2}, and therefore it is not a part of the unit 2-sphere representing the space of all possible measurement results. Consequently, the quantities A(a,λ){B(b,λ)+B(b,λ)} and A(a,λ){B(b,λ)B(b,λ)} appearing in the integrand of (4.19) do not themselves exist, despite the fact that A(a,λ), A(a,λ), B(b,λ) and B(b,λ) exist, at least counterfactually, in accordance with the hypothesis of local realism. This is analogous to the fact that the set O:={1,2,3,4,5,6} of all possible outcomes of a die throw is not closed under addition. For example, the sum 3+6 is not a part of the set O.

But there is also a much more serious physical problem with Bell’s version of reality. As noted above, the quantities A(a,λ){B(b,λ)+B(b,λ)} and A(a,λ){B(b,λ)B(b,λ)} are not physically meaningful in any possible physical world, classical or quantum. That is because B(b,λ) and B(b,λ) can coexist with A(a,λ) only counterfactually, since b and b′ are mutually exclusive directions. If B(b,λ) coexists with A(a,λ), then B(b,λ) cannot coexist with A(a,λ), and vice versa. But in the proof of his theorem Bell assumes that both B(b,λ) and B(b,λ) can coexist with A(a,λ) simultaneously. That is analogous to being in New York and Miami at exactly the same time (see footnote 7). But no reasonable criterion of reality can justify such an unphysical demand. The EPR criterion of reality most certainly does not demand any such thing.

In conclusion, as the two integrands of (4.19) are physically meaningless, the stringent bounds of ±2 on expression (4.18) are also physically meaningless [30]. They are mathematical curiosities, without any relevance for the question of local realism.

Corollary 4.1 —

It is not possible to be in two places at once.

It is instructive to consider the converse of the above argument. Consider the following hypothesis (see footnote 7): It is possible—at least momentarily—to be in two places at once—for example, in New York and Miami—at exactly the same time.

From this hypothesis, it follows that in a world in which it is possible to be in two places at once, it would be possible for Bob to detect a component of spin along two mutually exclusive directions, say b and b′, at exactly the same time as Alice detects a component of spin along the direction a, or a′. If we denote the measurement functions of Alice and Bob by A(a,λ) and B(b,λ), respectively, then we can posit that in such a world it would be possible for the measurement event like A(a,λ) observed by Alice to coexist with both the measurement events B(b,λ) and B(b,λ) that are otherwise only counterfactually observable by Bob, where λ is the initial state of the singlet system. Therefore, hypothetically, we can represent such a simultaneous event observed by Alice and Bob by a random variable

X(a,b,b,λ):=A(a,λ){B(b,λ)+B(b,λ)}=+2,or0,or2, 4.20

notwithstanding the fact that there are in fact only two localized particles available to Alice and Bob for each run of their EPR-Bohm-type experiment. It is also worth stressing here that in our familiar macroscopic world (after all the vectors a and b represent macroscopic directions) such a bizarre space–time event is never observed, because the measurement directions a and b freely chosen by Alice and Bob are mutually exclusive macroscopic measurement directions in physical space.

Likewise, nothing prevents Alice and Bob in such a bizarre world to simultaneously observe an event represented by

Y(a,b,b,λ):=A(a,λ){B(b,λ)B(b,λ)}=+2,or0,or2. 4.21

And of course nothing prevents Alice and Bob in such a bizarre world to simultaneously observe the sum of the above two events as a single event (i.e. four simultaneous clicks of their four detectors), represented by the random variable

Z(a,a,b,b,λ):=X(a,b,b,λ)+Y(a,b,b,λ)=+2or2. 4.22

Consider now a large number of such initial states λ and corresponding simultaneous events like Z(a,a′,b,b′,λ). We can then calculate the expected value of such an event occurring in this bizarre world, by means of the integral

ΛZ(a,a,b,b,λ)dρ(λ)=Λ[A(a,λ){B(b,λ)+B(b,λ)}+A(a,λ){B(b,λ)B(b,λ)}]dρ(λ), 4.23

where Λ is the space of all hidden variables λ and ρ(λ) is the corresponding normalized probability measure of λ∈Λ.

Note that we are assuming nothing about the hidden variables λ. They can be as non-local as we do not like. They can be functions of A and B, as well as of a and b. In which case we would be dealing with a highly non-local model:

Λλ=f(a,a,b,b,A,B). 4.24

Next we ask: What are the upper and lower bounds on the expected value (4.23)? The answer is given by (4.22). Since Z(a,a′,b,b′,λ) can only take two values, −2 and +2, the bounds on its integration over ρ(λ) are necessarily

2Λ[A(a,λ)B(b,λ)+A(a,λ)B(b,λ)+A(a,λ)B(b,λ)A(a,λ)B(b,λ)]dρ(λ)+2. 4.25

But using the addition property of anti-derivatives, this expected value can be written as a sum of four expected values,

ΛA(a,λ)B(b,λ)dρ(λ)+ΛA(a,λ)B(b,λ)dρ(λ)+ΛA(a,λ)B(b,λ)dρ(λ)ΛA(a,λ)B(b,λ)dρ(λ), 4.26

despite our allowing of λ(a,a,b,b,A,B) to be non-local. As a result, (4.25) can be written in a familiar form as

2E(a,b)+E(a,b)+E(a,b)E(a,b)+2. 4.27

Note that the only hypothesis used to derive these stringent bounds of ±2 is the one stated above: It is possible—at least momentarily—to be in two places at once. Locality was never assumed; nor was the realism of EPR compromised.

Now we perform the experiments and find that our results exceed the bounds of ±2 we found in (4.27) theoretically:

22E(a,b)+E(a,b)+E(a,b)E(a,b)+22. 4.28

Consequently, we conclude that the hypothesis we started out with must be false: We do not actually live in a bizarre world in which it is possible—even momentarily—to be in New York and Miami at exactly the same time. This is what Bell proved. He proved that we do not live in such a bizarre world. But EPR never demanded, nor hoped that we do.

To summarize our corollary, Bell inequalities are usually derived by assuming locality and realism, and therefore violations of the Bell-CHSH inequality are usually taken to imply violations of either locality or realism, or both. But we have derived the Bell-CHSH inequality above by assuming only that Bob can measure along the directions b and b′ simultaneously while Alice measures along either a or a′, and likewise Alice can measure along the directions a and a′ simultaneously while Bob measures along either b or b′, without assuming locality. The violations of the Bell-CHSH inequality therefore simply confirm the impossibility of measuring along b and b′ (or along a and a′) simultaneously.

4.3. The Greenberger, Horne and Zeilinger variant of Bell’s theorem without involving inequalities

Apart from Bell’s argument discussed above, there is also an argument, originally proposed by Greenberger, Horne and Zeilinger (GHZ), that purports to prove the impossibility of any local-realistic understanding of quantum correlations. A remarkable feature of their argument is that, unlike Bell’s argument, it does not involve either inequalities or statistics [7]. Instead, they consider the quantum mechanical expectation value (3.126) for a restricted case in which the measurement settings a, b, c and d are confined to the xy-plane. In that case, the expectation value (3.126) for the state (3.124) simplifies to

EQ.M.GHZ(a,b,c,d)|xy=cos(ϕa+ϕbϕcϕd). 4.29

Then, for ϕa+ϕbϕcϕd=0, the above expectation value reduces to −1 for all runs, and thus even for a single run of the experiment. Similarly, for ϕa+ϕbϕcϕd=π the above expectation value reduces to +1 for all runs, and thus even for a single run of the experiment. This is quite similar to the condition EQ.M.EPR(a,b)=1 or +1 for the 2-particle state (3.47) for the specific settings a=b and a=−b, respectively, for all runs, and thus even for a single run [7]. These are the conditions of perfect anti-correlations and perfect correlations predicted by quantum mechanics. The claim of GHZ is that the corresponding conditions ABCD=1 and ABCD=+1 for respective settings are impossible to reproduce for a single run of the experiment within any locally causal theory. In what follows we disprove this claim, first analytically, and then by a complete event-by-event numerical simulation.

4.3.1. Analytical disproof of the GHZ argument

What we wish to prove is ABCD=1 for any given run of the experiment, for a, b, c and d confined to the xy-plane with ϕa+ϕbϕcϕd=0 (cf. eqns (8) and (11a) of the GHSZ paper [7]). As the case ABCD=+1 for ϕa+ϕbϕcϕd=π follows similarly, it will suffice to prove only the ABCD=1 case. We start with our equations (3.128)–(3.131), which define the functions A=±1, B=±1, C=±1 and D=±1. Now, as derived in §3.5, the expectation value (3.143) follows from the very construction of these functions as a geometrical identity within our model. Therefore, we can use this geometrical identity to demonstrate that ABCD=1 for the chosen settings, for which it reduces to

EL.R.GHZ(a,b,c,d)|xy=limm[1mk=1mA(a,λk)B(b,λk)C(c,λk)D(d,λk)]=cos(ϕa+ϕbϕcϕd). 4.30

In fact, for the chosen settings this identity reduces simply to EL.R.GHZ(a,b,c,d)=ABCD=1 (see eqn (10a) of GHSZ [7] for details). This tells us that the average of the number ABCD is a constant, and it is equal to −1. But that is mathematically possible only if ABCD=1 for all runs, for the chosen settings. But if ABCD=1 for all runs, then ABCD=1 holds also for any given run. Therefore, ABCD=1 for any run, for the chosen settings, QED.

5. Numerical simulations of EPR-Bohm and GHZ correlations

While our analytical result (3.143) for the general 4-particle GHZ correlations stands on its own, we have nevertheless verified the specific correlations (4.30) in an event-by-event numerical simulation of our S7 model using the settings prescribed by GHSZ discussed above, but without the restrictions they imposed on the azimuthal or ϕn-angles [7]. The code for this simulation is reproduced in §5.2, and the graph generated by it is depicted in figure 7.

Figure 7.

Figure 7.

Plot of an event-by-event numerical simulation of the 4-particle GHZ correlations predicted by the S7 model. The settings in this simulation were confined to the xy-plane, with the azimuthal angle ϕabcd=ϕa+ϕbϕcϕd.

It is evident from this graph that the predictions of the S7 model match exactly with the predictions of quantum mechanics (i.e. with the negative cosine curve), despite the S7 model being manifestly local-realistic. In particular, it is evident from the graph that the product ABCD takes both positive and negative values for the settings chosen by GHSZ, contrary to their impossibility claim. What is more, the graph has been generated by 200 000 runs of the simulated 4-particle experiment. Thus, statistically our simulation is far more robust compared to the mere 250 or so events observed in the ‘loophole-free’ experiments [813]. As some familiarity with the languages of Geometric Algebra [18] and the GAViewer program [39] is prerequisite for understanding our simulation, we first present a simpler simulation of the 2-particle singlet correlations (3.65), which, as we discussed in previous sections (cf. §i), are also predicted by our S7 model:

5.1. Numerical simulation of the 2-particle EPR-Bohm correlations

The following code (as well as the one in §5.2) was written in collaboration with Carl F. Diether III.

5.1.

The graph generated by this simulation is shown in figure 8. It is evident from it that the predictions of S7 model match exactly with those of quantum mechanics (i.e. with the negative cosine curve), despite the model being local-realistic.

Figure 8.

Figure 8.

Plot of an event-by-event numerical simulation of the EPR-Bohm correlations predicted by the S7 model. The setting vectors a and b in this simulation were confined to the xy-plane as is usually done in the Bell-test experiments [813].

5.2. Numerical simulation of the 4-particle GHZ correlations

It is now straightforward to generalize the above code to simulate the 4-particle GHZ correlations (4.30) as follows:

5.2.

In addition to the above simulations with all settings confined to the xy-plane, we have also built three-dimensional simulations for both the 2-particle EPR-Bohm state and the 4-particle GHZ state using arbitrary settings, not confined to the xy-plane [43].

6. Concluding remarks

Any experiment in physics can be reduced to a series of elementary questions with possible ‘yes’ or ‘no’ answers. These answers in turn may be observed as ‘clicks’ of event-detectors, as is usually done in the EPR-Bohm-type correlation experiments [31]. When we compare such answers—possibly recorded by remotely located observers—we find that they are correlated in a remarkably disciplined manner, with the strength of the correlations exceeding the expectations based on Bell’s theorem [57,14]. The natural question then is: Why are these answers correlated in such a disciplined manner when in quantum mechanics there appears to be no predetermined cause dictating the correlations? In this paper, we have shown that the discipline and strength exhibited in the correlation experiments are natural consequences of the fact that the three-dimensional physical space in which all experiments are conducted respects the symmetries of a Clifford-algebraic 7-sphere, which arises from an associative interplay of the graded Euclidean primitives, such as points, lines, planes and volumes. These primitives provide the basis for the conformal (see footnote 1) geometry of the physical space (2.16), namely that of a quaternionic 3-sphere, S3, embedded in an eight-dimensional Clifford-algebraic manifold, Kλ. They allow us to understand the origins and strengths of all quantum correlations locally, as aspects of the geometry of the compactified physical space S3, with S7Kλ being its algebraic representation space. Thus, every quantum correlation can be understood as a correlation among a set of points of this S7. We have demonstrated this by proving a comprehensive theorem about the geometric origins of the correlations predicted by arbitrary quantum states:

EL.R.(a,b,c,d,)=limm[1mk=1mA(a,λk)B(b,λk)C(c,λk)D(d,λk)]=cosθxy(a,b,c,d,). 6.1

We have also proved within our framework that the strengths of these correlations are bounded by Tsirel’son’s bounds:

|EL.R.(x,y)+EL.R.(x,y)+EL.R.(x,y)EL.R.(x,y)|21(x×x)(y×y)22. 6.2

We have then explicitly reproduced the strong correlations predicted by the EPR-Bohm state within our framework,

EL.R.EPR(a,b)=limm[1mk=1mA(a,λk)B(b,λk)]=cosθab,together withEL.R.EPR(n)=limm[1mk=1mN(n,λk)]=0, 6.3

as well as explicitly reproduced the strong correlations predicted by the 4-particle GHZ state:

EL.R.GHZ(a,b,c,d)=limm[1mk=1mA(a,λk)B(b,λk)C(c,λk)D(d,λk)]=cosθacosθbcosθccosθdsinθasinθbsinθcsinθdcos(ϕa+ϕbϕcϕd),together withEL.R.GHZ(n)=limm[1mk=1mN(n,λk)]=0. 6.4

We have also presented two event-by-event numerical simulations of the correlations (6.3) and (6.4) predicted within our local-realistic framework. The comprehensive theorem we have proved dictates that—at least in principle—it is always possible to locally reproduce the strong correlations predicted by any arbitrary quantum state. The raison d’être for the strength of the correlations turns out to be the non-trivial twist in the Hopf bundle of S3 [16,17], or in its algebraic representation space S7. Given the fact that we started out our analysis with the most primitive elements of the physical space in the spirit of Euclid’s elements for geometry, our demonstration suggests that the quantum correlations observed in Nature are best viewed as consequences of space–time, rather than space–time as an emergent property of quantum entanglement.

Acknowledgements

The author thanks Carl F. Diether III for his generous help with the simulations of the 2-particle EPR-Bohm and 4-particle GHSZ correlations predicted by the S7 model, by adapting the original GAViewer code of Albert Jan Wonnink for the simulations of the earlier models based on S3.

Appendix A. Local causality and the normed division algebras

In §2, we started out by reviewing the algebraic properties of the physical space, which was crystallized in definition (2.16) and turned out to be a quaternionic 3-sphere. We then constructed the algebraic representation space (2.60) of this physical space, which turned out to be an octonion-like 7-sphere. In the subsequent sections, we saw the crucial role played by these two spheres in understanding the existence, origins and strengths of all quantum correlations. But what is so special about the three and seven dimensions? Why is the vector cross product definable only in three and seven dimensions and no other? Why are R, C, H and O the only possible normed division algebras? Why are only the three- and seven-dimensional spheres non-trivially parallelizable out of infinitely many possible spheres? Why is it possible to derive all quantum mechanical correlations as local-realistic correlations among the points of only the 3- and 7-spheres?

The answers to all of these questions are intimately connected to the notion of factorizability introduced by Bell within the context of his theorem [5,6]. Mathematicians have long been asking: When is a product of two squares itself a square: x2y2=z2? If the number z is factorizable, then it can be written as a product of two other numbers, z=xy, and then the above equality is seen to hold for the numbers x, y and z. For ordinary numbers, this is easy to check. The number 8 can be factorized into a product of 2 and 4, and we then have 64=82=(2×4)2=22×42=64. But what about sums of squares? A more profound equality holds, in fact, for a sum of two squares times a sum of two squares as a third sum of two squares:

(x12+x22)(y12+y22)=(x1y1x2y2)2+(x1y2+x2y1)2=z12+z22. A 1

There is also an identity like this for the sums of four squares. It was first discovered by Euler, and then rediscovered and popularized by Hamilton in the nineteenth century through his work on quaternions. It is also known that Graves and Cayley independently discovered a similar identity for the sums of eight squares. This naturally leads to the question of whether the product of two sums of squares of n different numbers can be a sum of n different squares? In other words, does the following equality hold in general for any n?

(x12+x22++xn2)(y12+y22++yn2)=z12+z22++zn2. A 2

It turns out that this equality holds only for n=1,2,4 and 8. This was proved by Hurwitz in 1898 ([44], see also discussion in ch. 7 of [16]). It reveals a deep and surprising fact about the world we live in. Much of what we see around us, from elementary particles to distant galaxies, is an inevitable consequence of this simple mathematical fact. The world is the way it is because the above equality holds only for n=1,2,4 and 8. For example, the above identity is equivalent to the existence of a division algebra of dimension n over the field R of real numbers. Indeed, if we define vectors x=(x1,…,xn), y=(y1,…,yn) and z=x*y in Rn such that zis are functions of xjs and yks determined by (A 2), then we have

xy=xy. A 3

Thus, the division algebras R (real), C (complex), H (quaternion) and O (octonion) we use in much of our science are intimately related to the dimensions n=1,2,4 and 8. Moreover, from the equation of a unit sphere in m dimensions,

x02+x12+x22++xm2=1, A 4

it is easy to see that the four parallelizable spheres S0, S1, S3 and S7 correspond to n=1,2,4 and 8, which are the dimensions of the respective embedding spaces of these four spheres. What is not so easy to see, however, is the fact that there is a deep connection between Hurwitz’s theorem and the quantum correlations (cf. ch. 7 of [16]).

As we saw, quantum correlations are inevitable consequences of the non-trivial geometry and topology of S7, which in turn is the largest parallelizable sphere permitted by Hurwitz’s theorem. In the language of Hopf fibrations, S7 is locally (but not globally) equal to the product S4×S3, and thus is a Hopf bundle made up of 4-sphere worth of 3-spheres with a non-trivial twist in the bundle. Similarly, S3 is locally (but not globally) equal to the product S2×S1, and thus is a Hopf bundle made up of 2-sphere worth of 1-spheres with a non-trivial twist in the bundle. Thus, the innocent looking algebraic equality (A 2) has far-reaching consequences, not only for the edifice of mathematics, but also for that of quantum physics. In fact, S7 turns out to be both necessary and sufficient for understanding the strong correlations locally. That may seem surprising, but this necessity stems from the profound relationship between the normed division algebras and the parallelizability of the four spheres we noted above. Quantum correlations thus exist and exhibit the remarkable strengths they do because equality (A 2) holds only for n=1,2,4 and 8.

Appendix B. Proof of Bell’s condition of factorizability within S7

In a deterministic hidden-variable framework, local causality can be specified by Bell’s condition of factorizability, as in eqn (3.4) of [20]. For the GHZS measurement results within 7-sphere, this condition is given by equation (3.132):

(AaBbCcDd)(λk)=A(a,λk)B(b,λk)C(c,λk)D(d,λk)=±1S7. B 1

To prove it, we begin with four maps of the following form, which—without the limits—are generic elements of S7:

S7A(a,λk):=limsr1arsd1ad{D(ar,ad,0)N(sr1,sd1,0,λk)}={+1ifλk=+11ifλk=1}, B 2
S7B(b,λk):=limsr2brsd2bd{+N(sr2,sd2,0,λk)D(br,bd,0)}={1ifλk=+1+1ifλk=1}, B 3
S7C(c,λk):=limtr1crtd1cd{D(cr,cd,0)N(tr1,td1,0,λk)}={+1ifλk=+11ifλk=1} B 4
andS7D(d,λk):=limtr2drtd2dd{+N(tr2,td2,0,λk)D(dr,dd,0)}={1ifλk=+1+1ifλk=1}. B 5

But since S7 remains closed under multiplication, the geometric product of these four maps (or, for that matter, the geometric product of any number of such maps)—because of the ‘product of limits equal to limits of product’ rule—is necessarily a fifth map of the form

S7X(x,λk):=limurxrudxd{D(xr,xd,0)N(ur,ud,0,λk)}={+1ifλk=+11ifλk=1}. B 6

Consequently, as a consequence of the non-trivial twists (3.133) and (3.134) in the Hopf bundle of S7, we necessarily have

(AaBbCcDd)(λk)=A(a,λk)B(b,λk)C(c,λk)D(d,λk)=X(x,λk)=±1S7. B 7

Footnotes

1

The conformal space we are considering is an in-homogeneous version of the space usually studied in Conformal Geometric Algebra [19]. It can be viewed as an eight-dimensional subspace of the 32-dimensional representation space postulated in Conformal Geometric Algebra. The larger representation space results from a homogeneous freedom of the origin within E3, which is neither necessary nor useful for our purposes here.

2

Readers not familiar with Bell’s locally causal (or local-realistic) framework [57] may benefit from reviewing it from the appendix of [30] before proceeding further.

3

It is worth noting here that correlations predicted by the two-level systems can be reproduced also within a quaternionic 3-sphere model without any reference to the general 7-sphere framework presented here, as we have shown elsewhere [30]. In fact, even highly non-trivial Hardy-type correlations can also be reproduced within the quaternionic 3-sphere model, as we have demonstrated in ch. 6 of [16]. The 3-sphere framework, however, is rather restrictive. It can accommodate the singlet correlations and Hardy-type correlations, but cannot reproduce more intricate correlations, such as, for example, those predicted by the rotationally non-invariant GHZ states (cf. §3.5). On the other hand, the 7-sphere framework is more general and comprehensive. This is because S7 is made of S4 worth of 3-spheres, with a highly non-trivial twist in the bundle. In other words, in the language of Hopf fibration, S7 is fibrated by S3 over the base manifold S4. Thus, each of the many fibres of S7 that make it up is itself an S3. It is, therefore, not surprising that S7 framework presented in this paper is more complete and is able to reproduce quantum correlations more comprehensively (cf. appendix A). Moreover, as we saw in §2, the algebraic and geometrical properties of the physical space are captured more completely by the octonion-like representation space S7 constructed in equation (2.60), and not by the three-dimensional conformal physical space S3 of equation (2.16) itself.

4

It is, however, possible to distinguish between the two interpretations in a macroscopic experiment. Such a macroscopic experiment has been proposed in [17]. If realized, it will explore whether or not Bell inequalities are violated for a manifestly local, classical system, without involving either quantum entanglement or quantum superposition.

5

A similar inequality was first considered by Boole in 1862, but without the interpretation attributed to it by Bell (cf. [37,38]).

6

In the derivation of the absolute bounds on the Bell-CHSH correlator, such as those in equation (4.9), one usually employs factorized probabilities of observing binary measurement results rather than the actual measurement results we have used in our derivation. But employing probabilities in that manner only manages to obfuscate the conceptual flaw in Bell’s argument we intend to bring out here.

7

The possible space-like separated events being averaged in (4.8) cannot possibly occur in any possible world, classical or quantum. To appreciate this elementary fact, consider the following homely analogy: Imagine a couple, say Jack and Jill, who decide to separate while in Kansas City, and travel to the West and East Coasts respectively. Jack decides to travel to Los Angeles, while Jill cannot make up her mind and might travel to either New York or Miami. So while Jack reaches Los Angeles, Jill might reach either New York or Miami. Thus, there are two possible destinations for the couple. Either Jack reaches Los Angeles and Jill reaches New York, or Jack reaches Los Angeles and Jill reaches Miami. Now suppose that, upon reaching New York, Jill decides to buy either apple juice or orange juice. And likewise, upon reaching Miami, Jill decides to buy either apple juice or orange juice. Consequently, there are following four counterfactually possible events that can realistically occur, at least in our familiar world: (i) While Jack reaches Los Angeles and buys apple juice, Jill reaches New York and buys apple juice; Or, (ii) while Jack reaches Los Angeles and buys apple juice, Jill reaches New York and buys orange juice; Or, (iii) while Jack reaches Los Angeles and buys apple juice, Jill reaches Miami and buys apple juice; Or, (iv) while Jack reaches Los Angeles and buys apple juice, Jill reaches Miami and buys orange juice. So far so good. But what is being averaged in (4.8) are impossible events of the following kind: (v) While Jack reaches Los Angeles and buys apple juice, Jill reaches New York and buys apple juice and Jill reaches Miami and buys orange juice at exactly the same time! Needless to say, no such events can possibly occur in any possible world, even counterfactually. In particular, Einstein’s conception of local realism by no means demands such absurd or impossible events in any possible world [4]. It is therefore not at all surprising why the unphysical bounds of ±2 on the CHSH sum of expectation values obtained by averaging over the absurd events like (4.10) are not respected in the actual experiments [813].

Data accessibility

The codes for the two simulations are included in §5.

Competing interests

The author declares no competing interests.

Funding statement

Not applicable.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The codes for the two simulations are included in §5.


Articles from Royal Society Open Science are provided here courtesy of The Royal Society

RESOURCES