Abstract
Hess and Philipp recently claimed [(2001) Proc. Natl. Acad. Sci. USA 98, 14224–14227 and 14228–14233] that proofs of Bell's theorem have overlooked the possibility of time dependence in local hidden variables, hence the theorem has not been proven true. Moreover they present what is claimed to be a local realistic model of the EPR correlations. If this is true then Bell's theorem is not just unproven, but false. We refute both claims. First, we explain why time is not an issue in Bell's theorem, and second, we show that their hidden variables model violates Einstein separability. Hess and Philipp have overlooked the freedom of the experimenter to choose settings of a measurement apparatus at will: any setting could be in force during the same time period.
Keywords: Bell-type delayed choice experiment, EPR correlations, locality
The recent papers by Hess and Philipp (1–3) have drawn a lot of attention, especially since refs. 1 and 2 were featured on Nature's web pages and from there reached the popular press in many countries. It is claimed that Bell (4) overlooked the role of time, and hence his theorem, that quantum mechanics is incompatible with local realism, is unproven. This claim is supported with an apparent counterexample: a supposedly local hidden variables model for the singlet correlations.
While Hess and Philipp correctly state that time was ignored in the original derivation of Bell's theorem, we show here that this is irrelevant. Moreover we show that Hess and Philipp's hidden variables model (the HP model) is not local at all. Hess and Philipp do not take proper account of a crucial element in the proof of Bell's theorem: the free choice of the experimenter in the laboratory, and of the theoretician imagining a Gedankenexperiment, between several possible experiments. We shall convert this freedom into a statistical independence assumption and show how it plays an essential role in obtaining Bell's theorem. We refer to the result of this free choice, in what follows, as the experimental setting or just the setting.
Hess and Philipp's (1, 2) main criticism of Bell is his lack of consideration of time, but as we shall see below time plays an entirely trivial role in both the formulation and the proof of Bell's theorem. One may think of the correlations as determined by repeating a single measurement in many different laboratories simultaneously. In each laboratory, settings are chosen at random just once and are in force throughout that measurement.
Time naturally does play a role in real-world experiments in which correlations are determined over time by repeated measurements. Memory effects may lead to dependencies and possible time variation should be accounted for. After a time delay, information from one location can arrive at another. However, by results of Gill (5), none of these effects harm experimental conclusions, provided the experimenter exploits his freedom to repeatedly randomize the settings.
Since Hess and Philipp's criticism of Bell is unsubstantiated, their purported counterexample to Bell's theorem must be in error. Indeed this is the case. Hidden deep in the elaborate construction outlined in ref. 2 is an elementary error. To be specific, a crucial hidden variable, which correlates the measurement results at the two spatially separated ends of the experiment, is not explicitly included in the list of variables on which the model is built. Hess and Philipp take pains to show that the other hidden variables of the model can be made to satisfy “Einstein's separability,” but they forget about the missing undercover variable. We show here that this variable has a probability distribution that depends on the settings of the measuring apparatuses in both wings of the experiment. Therefore, Einstein's separability is not satisfied, i.e. the model is nonlocal.
We first present a proof of Bell's theorem and discuss its assumptions, emphasizing aspects of freedom and local disturbances. We conclude by explaining the error in Hess and Philipp's hidden variables model.
Freedom and Local Disturbances in Bell's Theorem
Fig. 1 gives a schematic view of one trial (one pair of photons) in a Bell-type experiment, in particular, as in the experiment of Weihs et al. (6), who for the first time fully implemented Bell's requirement “the filter settings are chosen during the flight of the photons” (see ref. 4, p. 151, figure 7). We will use the words photons, polarizer, and so on, but of course the picture could be applied to many different physical realizations of the Bell singlet or other suitably entangled state. In the two wings of the experiment, a setting, denoted as A and B, is freely and independently chosen by the local observers, and fed into the local polarizer. In each wing, it is enough to have two possible settings. We shall give them the labels 1, 2; i.e., A can take the value 1 or 2 and likewise B. A convenient realization of that free and independent choice is for A and B to be outputs of two independent randomizers. The setting in each wing is fed into the measurement device (the polarizer), just before a quantum particle arrives from a distant source at the device. The measurement results in an outcome ±1. The whole process from random choice to registering the outcome takes less time than it takes a light signal to traverse from one wing of the experiment to the other. Thus the setting used in the left wing of the experiment is not available in the right wing till after the outcome there is settled, and vice versa. We will denote the outcome left by X and right by Y; these are variables taking values ±1.
Fig 1.
One trial of a Bell-type experiment.
So far we have just introduced notation for four random variables, A, B, X, Y, which actually get observed when the experiment (just one trial) is carried out. We next make some assumptions concerning these variables. We describe what is meant by local realism and show how this implies a relationship between various probabilities concerning these observable variables.
The first element of local realism is realism itself. By this we mean any mathematical-physical model, or a scientific standpoint, which allows one to introduce a further eight variables into the model so far, which we denote by Xij, Yij, where i, j = 1, 2, and which are such that
![]() |
In words: one may conceive, as a thought experiment or as part of a mathematical model, of what the measurement outcomes would be, under any of the possible measurement settings; you get to see the outcomes corresponding to the actually selected settings. No hidden variables appear anywhere in our argument beyond these eight. Given a (possibly stochastic) hidden variables theory, for instance that of ref. 2, one will be able to define our eight variables as (possibly random) functions of the variables in that theory. According to the hypothesis of realism, in each individual experiment these variables coexist independently of which experiment is actually performed on either side.
Next, we introduce locality. The following is supposed to hold for all i, j:
![]() |
That is to say, the outcome that you would see left, under either setting, does not depend on which setting might be chosen right, and vice versa. Working under the locality assumption we write
![]() |
Recall that X and Y are the actual outcomes corresponding to the actual settings A and B. Under realism and locality we therefore have
![]() |
Contained in the above is an assumption of local disturbances. When Alice sends her chosen label A = i to her measurement device and Bob sends his chosen label B = j to his, they will likely cause some further unintended disturbance. Implicit in the above is the assumption that any disturbance on the left, as far as it influences the outcome left, is not related to the coin toss nor to the potential outcomes on the right, and vice versa.
Realism and locality can be made artificially true by a mathematical trick: simply define all Xij ≡ X, and Yij ≡ Y! However, this unphysical choice is ruled out by the next assumption, which we call freedom, often only tacit in presentations of Bell's theorem: the generation of the local settings A and B, by randomizers or by free choice of the observers, is physically independent of the system consisting of the source and the two detectors combined that produces the potential outcomes X1, X2, Y1, Y2. The actually observed outcomes can be therefore treated as merely selected by independent setting choices as X ≡ XA, Y ≡ YB.
Let us elaborate a bit more on the freedom assumption. Consider the following rather complicated procedure of choosing the local settings: in the left wing of the experiment, Alice shuffles and cuts a pack of cards, and decides on the basis of the chosen card (red or black) how to encode a subsequent coin toss (H or T) as setting 1 or 2, or vice versa. She selects a coin from her purse and tosses it, using the encoding just determined in order to feed either a 1 or a 2 into the communication line to “her” polarizer. Far away, and simultaneously, Bob follows a similar procedure. Better still, he uses other randomization devices such as a roulette wheel or dartboard (he is a very poor darts player), or pseudo-random number generator with seed chosen by tossing dice, or some other physical random number generator. Note the freedom that the two persons have in: how many times to shuffle their pack of cards, which coin to pick from their purse, and so on. And notice also the complexity of the path, which, even if one believes it is essentially deterministic, results in the choice 1 or 2, to be fed into each polarizer. Finally, please note the complete independence of the above procedure from the workings of the source and detectors of the correlated pairs of photons. We will assume that the complete procedure used to generate A and B may be mathematically modeled as independent, fair coin tosses, thus: each of the four possible values 11, 12, 21, 22 of the pair AB are equally likely. In ref. 5 the randomizers were systems based on quantum optics.
We now procede to prove Bell's inequality, and from this, Bell's theorem. The first step is to note a logical fact: arrange the four binary variables X1, Y1, X2, Y2 at the corners of a square, in the sequence just given. Each of the four sides of the square connects one of the Xi with one of the Yj. Now for each pair Xi, Yj ask the question: do these variables take the same value, or are they different (see Fig. 2). One sees immediately that any three equalities imply the fourth; and also that any three inequalities imply the fourth. It follows that the number of equalities always equals 0, 2 or 4. For an algebraic proof of this fact, note that the value of XiYj encodes the equality or inequality of the variables Xi and Yj, while (X1Y2) = (X1Y1)(X2Y1)(X2Y2).
Fig 2.
The number of equalities is even.
Next, define 1{…} as the indicator variable of a specified event; that is to say, the random variable that takes the value 1 if the event happens, 0 if not. Consider
![]() |
By what we have just said, Δ can only take on the values 0 and −2, hence its expected value is not greater than 0. Now, the expected value of a linear combination of random variables equals the same linear combination of the expected values of each variable separately. Moreover, the expected value of an indicator variable (which only takes the values 0 and 1) is equal to the probability of the value 1, thus equals the probability of the event in question. Therefore
![]() |
If the randomization procedures used by Alice and Bob are of high quality and the physical processes involved are in a physical sense independent of the workings of the source and the local detectors, then the hidden variables (X1, X2, Y1, Y2) are statistically independent of (A, B). Putting it formally
![]() |
![]() |
The technical meaning of statistical independence of (X1, X2, Y1, Y2) from (A, B) is that in repetitions of the experiment the relative frequencies with which the quadruple (X1, X2, Y1, Y2) takes on any of its 24 possible values, remain the same within each subensemble defined by each of the four possible values of the pair (A, B). Statistical independence can also be called complete lack of correlation. The notion is symmetric: the left and right sides of Eq. 7 can be interchanged.
Now consider the conditional probability Pr{X = Y | A = i, B = j} that the outcomes left and right are equal, given any pair of measurement settings i and j. We write this for short as Pr{X = Y | AB = ij}. By local realism, this equals Pr{Xi = Yj | AB = ij}. But by statistical independence this conditional probability is the same as the unconditional probability Pr{Xi = Yj}. Therefore we obtain Bell's inequality:
![]() |
But quantum mechanics makes the prediction, for the familiar choice of state and polarizer settings, that the expression on the left side of this inequality equals
− 1 ≫ 0. Hence Bell's theorem: if quantum mechanics holds, local realism is untenable.
Hess and Philipp's Objections to Bell
Having presented Bell's theorem in this compact form we proceed to discuss Hess and Philipp's difficulties with it. First of all, the only independence we needed was between the chosen polarizer settings, on the one hand, and the physical system of polarizers and source, on the other. Under a local realistic model, any kinds of dependencies of any hidden variables within each of the two measurement devices is allowed, as is any dependence of either one separately on the source.
Second, we did not mention time in our derivation at all because it was completely irrelevant. Our derivation concerned each run of the experiment. We did not compare actual outcomes under different settings at different times, but potential outcomes under different settings at the same time. Therefore, the argument in ref. 1 (equations 8 and 9) or in ref. 2 (end of paragraph after equation 11) is completely beside the point.
To emphasize this point, consider (as a thought experiment) repeating the measurement procedure just described, not as a sequence of successive repetitions at the same locations, but in a million laboratories all over the galaxy. The prediction of local realism is that when we collect the one million sets of observed quadruples (A, B, X, Y) together and compute four relative frequencies estimating the four conditional probabilities Pr{X = Y | AB = ij}, they will satisfy (up to statistical error) Bell's inequality. It is of no importance that the distribution of hidden variables at different locations of the experiment might vary.
Hess and Philipp make a large number of criticisms of the assumptions of Bell's theorem with the main theme being that variables at both locations can vary in time in a dependent way, leading to dependence between the outcomes, which Bell supposedly did not take account of. Before turning to their model, we confront our formalization of the metaphysical assumptions of local realism with the idea of time variation.
How could time variation invalidate the freedom assumption? One would have to argue that because of systematic long-time periodicities in the various component physical systems concerned, the outcomes of a complex series of events involving a card shuffle, a coin toss, and the free will of an experimenter at one location are interdependent and highly correlated with the potential outcome of a certain polarization measurement at a distant location. A good experimental design, with rigorous randomization of the choice of settings, makes this totally implausible.
There is an equally implausible way in which time variation could invalidate the locality assumption, connected to what we called local disturbances above. When we select a 1 or a 2 on a measurement device, by pressing the appropriate button, we have supposed that only our choice has an impact on the subsequent physics at this location. However, it is clear that at the same time we will be introducing a small but uncontrolled disturbance alongside the intended binary input. In conceptual repetitions of the experiment, the length of time our finger presses the button, how hard we press the button, when precisely we do it, and so on, will vary, and each time a different disturbance is introduced into the measurement device. Could it be that this disturbance actually carries with it information about the setting being chosen in the far wing of the experiment? Well, perhaps there is a physics in which the outcomes of coin tosses and polarization measurements are determined long in advance of the events and are encoded in minute variations in timing and pressure, so that the setting being generated by Bob is in fact already “known” at Alice's location and is unwittingly introduced by her into her apparatus along with her own coin toss. Then the outcome left, under different hypothetical settings right, could differ, and our locality assumption would fail, even though the statistical independence between the “nominal” instructions A, B and the “hidden variables” Xij, Yij still held.
HP Model: A Counterexample to Bell?
Hess and Philipp (2) construct an elaborate hidden variables model for the EPR correlations. As we will argue below, in contradiction to their claim, the HP model is nonlocal. They make a number of errors, all centering around a variable m that they treat as a mere (time)-index and the related variable i, which they forget to treat as one of the hidden variables of the model (which is why it is missing in all their formulas except equation 26). As we shall show, the undercover hidden variable i, shared at the two measurement stations of the Bell experiment, has a probability distribution depending on the local polarizer orientations at both stations, and therefore is nonlocal!
An impatient reader may easily notice the inconsistency concerning the variable m in the logic of the HP model, by carefully reading the five lines above equation 35 of ref. 2. Here, A and B are the functions that determine the outcomes in the left and right wings of the experiment from the settings a and b and from various hidden variables and functions thereof:
“Label the corresponding functions A and B as A(m) and B(m) and consider the index (m) a function of the source parameter λ = (λ1, λ2) and the time operators O
, O
. Then the functions A(m) and B(m) can be considered as functions of a, λ, Λ
, and b, λ, Λ
, respectively.”
The first sentence introduces the variable A(m), with m being a function of λ and “time operators” O
and O
. This means that the outcome A depends, among other parameters, on λ, a and b, where b is the setting in the other wing of the experiment! However, in the next sentence the authors write “A(m) can be considered as function of a, λ, Λ
.” They do not notice that A(m) also depends on m and m depends on a and b. Hess and Philipp are inconsistent with the variable m, which on the one hand is treated merely as an index, determining the precise time point of the measurement, while in mathematics it plays a less innocent role.
For the patient reader, we now go into the mathematics of the HP model in more detail. The central role in the model building is played by the theorem on page 14231 and equations 15–19 of ref. 2. The reader should have this paper close to hand in the following discussion. We shall show that the main formula of the theorem, equation 18, should be rewritten, to reveal all relevant hidden variables, in the following form
![]() |
The reader will find definitions of all these objects in ref. 2; however, for the experienced reader, the notation and physical meaning of the entries to the formula should be evident. The proof of the theorem goes smoothly up to equations 26 and 27. Here, a quick glance tells a mathematically experienced reader that there is something missing. The definition in equation 26 should be replaced by
![]() |
simply because, following the arguments of Hess and Philipp, the function κ does depend on i and j. Furthermore, one immediately notices that because of the Kronecker delta an even better choice would be
![]() |
Therefore their next formula (equation 27) defining the density ρab should be reformulated as
![]() |
Since the role of the function κ is to partition the domains of the local weight functions σ and τ, it is more convenient to define new weights σa(u, i) = σa(u)⋅1{i − 1 ≤ u < i} and τb(v, i) = τb(v)⋅1{i − 1 ≤ v < i}, so that
![]() |
This formula fully reveals the true nature of the distribution ρab. The hidden variables u and v are local ones, whereas the very important variable i is at both locations, and as such it has correlating power.
Now we come to the crucial point: the probability distribution of the variable i depends on both a and b, therefore it is of nonlocal nature. For simplicity, let us compute using the elaborate formulas 24 and 25 of ref. 2 the marginal distribution of i, for the fixed value of i = −2, that is
![]() |
For i = −2 the domains of σa(u, i) and τb(v, i) are −3 ≤ u < −2 and −3 ≤ v < −2 respectively, and therefore only the top formulas in equations 24 and 25 are relevant, giving
![]() |
i.e., the distribution is overtly nonlocal!
The subsequent elaborate transformations of the model of the theorem, which are made with the use of the additional variable m, are performed by Hess and Philipp to arrange that the marginal density of the local variables u and v can be made uniform (unnumbered formula below equation 36). This is all irrelevant. The correlating undercover hidden variable i, ignored by notational oversight, has a non-local distribution. This distribution cannot be made local by the introduction of the additional variable m, because the nonlocal character of the construction in the theorem is retained by the final construction in the case m = 1! Therefore the final full model is nonlocal.
Conclusions
Time is not an issue in the proof of Bell's theorem. What is crucial is the freedom of the experimenter to choose either of two settings at the same time. Hess and Philipp's hidden variables model is nonlocal.
Acknowledgments
We are grateful to Wayne C. Myrvold whose paper∥ helped us understand the HP construction. M.Ż. acknowledges KBN Grant 5 P03B 088 20 and the Austrian-Polish programme Quantum Communication and Quantum Information (2002–2003). This research was also supported by the Austrian Science Fund FWF under Grant SFB 65.
Myvold, W. C., Proceedings of 18th Biennial Meeting of the Philosophy of Science Association, to be held Nov. 7–10, 2002, Milwaukee, WI.
References
- 1.Hess K. & Philipp, W. (2001) Proc. Natl. Acad. Sci. USA 98, 14224-14227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hess K. & Philipp, W. (2001) Proc. Natl. Acad. Sci. USA 98, 14228-14233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Hess K. & Philipp, W. (2002) Europhys. Lett. 57, 775-781. [Google Scholar]
- 4.Bell J. S., (1987) Speakable and Unspeakable in Quantum Mechanics (Cambridge Univ. Press, Cambridge, U.K.).
- 5.Gill R. D., (2002) Mathematical Statistics and Applications: Festschrift for Constance van Eeden (Lecture Notes—Monograph Series), eds. Moore, M., Leger, C. & Froda, S. (Institute of Mathematical Statistics, Hayward, CA), in press.
- 6.Weihs G., Jennewein, T., Simon, C., Weinfurter, H. & Zeilinger, A. (1998) Phys. Rev. Lett. 81, 5039-5043. [DOI] [PubMed] [Google Scholar]






















