Skip to main content
Royal Society Open Science logoLink to Royal Society Open Science
. 2024 Mar 20;11(3):231953. doi: 10.1098/rsos.231953

Contextual measurement model and quantum theory

Andrei Khrennikov 1,
PMCID: PMC10977392  PMID: 38550757

Abstract

We develop a contextual measurement model (CMM) that is used for the clarification of the quantum foundations. This model matches Bohr’s views on the role of experimental contexts. CMM is based on a contextual probability theory that is connected with generalized probability theory. CMM covers measurements in classical, quantum and semi-classical physics. The CMM formalism is illustrated by a few examples. We consider the CMM framing of classical probability, the von Neumann measurement theory and the quantum instrument theory. CMM can also be applied outside of physics, e.g. in cognition, decision-making and psychology, the so-called quantum-like modelling.

Keywords: contextual measurement model, quantum foundations, contextual probability, generalized probability, quantum instruments, quantum-like modelling

1. Introduction

The interrelation of quantum and classical probability theories is a very complex foundational issue involving interpretational, mathematical and philosophical questions. Research in this area is characterized by the diversity of views, opinions and mathematical formalisms (e.g. [124]). We remark that, generally, quantum mechanics (QM) is characterized by a diversity of interpretations.

My own understanding is that quantum probability is a machinery for probability update, analogous to classical Bayesian inference [2534]. In contrast to the latter, quantum probability inference is not based on the Bayes formula for conditional probability. Quantum probability theory is a theory of probability inference with a special class of probability update transformations given by projections or quantum instruments. It is natural to create a general probabilistic framework that covers both the classical and quantum ones. Such generalization can come with a global panorama, as from the top of a mountain, one can enjoy a panorama of the whole city and, through this panorama, connect districts that otherwise look totally separated. In this way, it is easier to find similarities and differences in district plans and the architecture of buildings. As just one of the possible machines for probability update, the quantum probabilistic formalism would lose its mystery.

One such ‘panoramic framework’ is the contextual measurement model (CMM) based on contextual probability space. Its development was initiated in [25], continued in a series of authors’ works (e.g. [2628]) and summarized in a monograph [29]. In these studies, the main emphasis was on the modification of the formula of total probability (FTP)—its transformation into FTP with an interference term, expressing interference of probabilities, e.g. in the two-slit experiment. In my previous studies, the contextual probability approach was partially shadowed by an appeal to von Mises’ frequency theory of probability [22,30] and the realization of experimental contexts as von Mises collectives.

In this paper, CMM’s development is continued towards the abstract contextual formalization of other basic features of quantum probability, such as order and response replicability effects (RREs) in sequential measurements, entanglement, the violation of the Bell inequalities and establishing coupling with quantum instruments theory as well as with linear space representation (LSR) of generalized probability theory.

CMM is the basis of the Växjö interpretation of QM [25,29,31,32]—one of the contextual probabilistic interpretations. Since the probability update is also an information update, the Växjö interpretation is part of the information interpretation of QM. This paper presents CMM consistently in the most general form by highlighting its basic properties, such as interference of probabilities, order effect (OE), entanglement and violation of the Bell inequalities.

The abstract CMM formalism is illustrated by a few examples. We start with the CMM framing of classical probability theory [35,36], which serves as the basis of classical statistical physics and thermodynamics. Then, we consider the von Neumann [1,37] quantum measurement theory with observables given by Hermitian operators and the state update of the projective type and represent it as CMM. The quantum instrument theory is the generalization of the von Neumann theory permitting state updates of the non-projective type, and it can also be represented as CMM. We also show the connection of generalized probability theory with the state space consisting of probability measures with CMM. Finally, LSR for contextual probability space is constructed using the construction dating back to Mackey.

CMM can also be applied outside of physics, in so-called quantum-like modelling (e.g. [38,39]). This is a rapidly developing area of research stimulated by the recent quantum information revolution. In quantum-like modelling, the quantum methodology is applied to cognition, decision-making, psychology, game theory, economics and finance, and AI. Universally, quantum-like models need not be based on the complex Hilbert space formalism. They can employ other contextual probability calculi and CMMs [38].

1.1. Contextuality of probability

From the mathematical viewpoint, the essence of the problem lies in generalizations of conditional probability and probability update. On the way to such rethinking of the interrelation between the classical and quantum probability theories, I was strongly influenced by Ballentine who treated all quantum probabilities as conditional probabilities [14,15,1820]. Later, I learned that this was also Koopman’s viewpoint [4]. It is interesting that Kolmogorov (who in 1933 formalized classical probability in the measure-theoretic framework [35,36]) advertised this conditional viewpoint even for classical probability. This viewpoint was especially clearly described in his early works preceding the monograph [35]. Unfortunately, these works (in Russian and published in proceedings) are practically unknown; see [22] for references and details. I got to know about these ‘pre-axiomatic’ studies of Kolmogorov from Shiryaev and Bulinski, former students of Kolmogorov. But, even monograph [35] ([36]) contains the statement that, in modern words, can be formulated as a statement about the contextuality of probability. Kolmogorov’s message is that it is meaningless to speak about probability without determining a complex of experimental conditions and measurement context. This Kolmogorov position matches well with Bohr’s statement on the contextuality of measurement outcomes that is the cornerstone of his complementarity principle [40] (which is better to call the contextuality–complementarity principle [41,42]).

Unfortunately, Kolmogorov’s original message on the contextuality of probability was practically ignored in the further development of classical probability theory. A mathematical work on probability is typically started by fixing one probability measure, without mentioning that it corresponds to some measurement context. The contextuality component of Bohr’s statements on complementarity was not emphasized in quantum foundational research; typically, the Bohr complementarity principle is reduced to the wave–particle duality.

Following Bohr [40], Kolmogorov [35,36], Koopman [4], Accardi [14,15] and Ballentine [1720], I introduced the contextual probability space and CMM based on it [29]. The main idea behind this approach is to operate solely with contexts and observables and to exclude physical systems from consideration.

A measurement context consists of a pre-measurement context C , an observable A and a post-measurement context CA=x corresponding to the outcome A=x , i.e. a triple (C,A,CA=x) . Transformation CCA=x can be described as a map TA(x):CC , where C is the set of pre-measurement contexts. Alike theory of quantum instruments, we call the pair IA=(A,TA) a contextual instrument. The latter is the basic mathematical component of measurement theory. It is meaningless to formulate it solely in terms of observables. The same observable A can be a component of various instruments describing different measurement procedures for A . So, an observable is a theoretical quantity expressing some features of pre-measurement contexts.

In QM, one operates with the notion of ‘state’ not ‘context’. These notions are similar but have some inetrpretational differences (see appendix A for the discussion).

We also mention Feynman’s contextual analysis of the two-slit experiment in [2,3]. He presented the purely probabilistic picture of this fundamental experiment of QM and expressed the interference phenomenon as interference of probabilities. Mathematically, he described this situation as the violation of additivity of probability, the classical formula is disturbed by an additional term, the interference term. In classical probability, the combination of additivity and the Bayes formula for conditional probability leads to FTP, which plays an important role in probability inference. Following this line, Feynman’s conclusion can be rewritten as the violation of classical FTP. The difference between classical and quantum probability models can be moved from the violation of additivity of probability to the violation of the Bayes formula for conditional probability—quantum probability is additive, but conditioning is not Bayesian. The quantum FTP is a perturbation of classical FTP with an additional term—the interference term. The main distinguishing feature of Feynman’s presentation of the two-slit experiment and, generally, quantum interference is its contextual structure. He operates with three contexts C1,C2,C12 : the first slit is open and the second is closed, vice versa, and both slits are open. Quantum probabilistic specialty is expressed not via LSR of states and observables but in a purely contextual probabilistic way.

We make a remark on the notion of contextuality. In the modern quantum information literature, the notion of contextuality is reduced to the contextuality of the joint measurement of a few quantum observables. This sort of contextuality was considered by Bell in [43,44] in his analysis of the violation of the Bell inequalities (although he did not use the term ‘contextuality’; it seems that this term was introduced in the book of Beltrametti and Cassinelli [45]).

Feynman’s contextuality [2,3] is more general and, in fact, coincides with Bohr’s contextuality [40]. In my works including those on the Växjö interpretation, I followed Bohr and Feynman: context as a complex of all physical conditions involved in an experiment.

1.2. Linear space versus contextual frameworks for probability

Our foundational pathway is opposite to the pathway leading to the generalized probability theories [511,21,45] (see [24] for a review), which is directed towards the creation of general LSR of probability and the measurement process. The LRS also provides a panoramic view that covers both classical and quantum probabilities and observables. This is the linear panorama that illuminates the place of the quantum probability and measurement formalism among other linear models.

We remark that there are several different approaches to generalized probability theory and corresponding measurement theory, but all of them are either equivalent or only slightly different. One of them is the Davies–Lewis [7] operational probability theory grounded on LRS with the base norm spaces, a class of partially ordered linear spaces. The corresponding measurement theory is formulated within instrument theory [7,8,11,12]; in particular, observables are mathematically represented as positive operator-valued measures (POVMs). They are widely used in quantum information theory [4652] (also see [39,53] for applications to cognition and decision making). Another approach is to start with an abstract definition of state space, a convex subset of a linear space. It goes back to Gudder’s work [9] who constructed the operational representation of quantum states and observables starting with a pre-convex structure. Under natural condition, this approach leads to a convex state space and LRS for the latter. As was shown by Ozawa [11], these two formulations (Davies–Lewis and Gudder) are actually equivalent. In §6, we explore the Davies–Lewis approach for operational LRS of the measurement model with states given by classical probability measures. Then, we express this model in the form of CMM.

The closest to CMM is the model in which one starts with all possible probabilities that can be generated in an experiment, conditioned on preparation and measurement procedures [6]. Then, one proceeds to LSR. This construction can be employed to construct LSR for CMM (§7). However, this is done just to show the connection with previously developed theories. 1

CMM development is important for quantum foundations. CMM demystifies quantum theory by reducing its probabilistic counterpart to a tool for probability update and inference (cf. with QBism [5457]); CMM diminishes the role of pure states, in complete agreement with the statistical (ensemble) interpretation of QM; in CMM, quantum interference is just an additive perturbation of classical FTP due to the interplay between a few measurement contexts; the violation of the Bell inequality has the same origin; contextual entanglement is naturally coupled to classical dependence of random variables. The latter demystifies entanglement. This is very important for the resolution of the century-long debate on quantum non-locality; see appendix B for a comparison of CMMs with and without LSRs.

On the other hand, LSR is very convenient from a mathematical viewpoint (simply linear algebra), and operating within the LSR framework is useful in concrete mathematical calculations. However, the calculations should be completed by the critical analysis on the connection of the mathematical LSR constructions with physics. In quantum-like modelling, a similar problem arises—the problem of matching between the output of the Hilbert space formalism and some psychological effects in decision-making [5862].

Our CMM can be considered the most general probabilistic framework for measurement; in particular, the notion of contextual probability space is based on the first three axioms of Mackey’s theory [6]. Then, Mackey moves towards quantum logic by constraining the model with additional axioms. This path makes theory mathematically elegant but, at the same time, more complex and the basic probabilistic components are blurred by additional mathematical constructions.

2. Contextual measurement model

2.1. Contextual probability space

Definition 2.1. A contextual probability space is a triple Σ=(C,O,P), where C and O are sets of pre-measurement contexts and observables and P is the space of the corresponding probability distributions.

In physics, pre-measurement contexts can be associated with preparation procedures. 2 Each observable A has its range of values XA; for simplicity, consider discrete observables, i.e. having finite ranges of values. The following considerations are straightforwardly extended to observables with arbitrary ranges of values. For an observable A and pre-measurement context C, denote the probability of an outcome xXA as PCA(x)PC(A=x). By the definition of a probability distribution,

PCA(x)0,xXAPCA(x)=1. (2.1)

The range set XA can be endowed with the algebra of all its subsets FA. We set PCA(G)=xGPCA(x),GFA. This is a probability measure on FA. In the definition of Σ, the symbol P denotes the collection of such probability measures,

𝒫={PCA:C𝒞,A𝒪}

(see Axiom 1 in [6]). Elements of P are called contextual probabilities. These are the analogues of the conditional probabilities in the classical (Kolmogorov [36]) probability model. But, we reserve the term ‘conditional probability’ for a special class of contextual probabilities generated by context updates.

It is natural to assume (see Axiom 2 in [6]) that two observables having the same probability distribution for all contexts should coincide, i.e.

PCA1=PCA2 for any CCA1=A2. (2.2)

We also assume (see Axiom 2 in [6]) that two contexts having the same probability distribution for all observables should coincide, i.e.

PC1A=PC2A for any AOC1=C2. (2.3)

The average of an observable AO (with XAR) with respect to a pre-measurement context CC is defined as

ACE[A|C]=xXAxPC(A=x). (2.4)

2.2. Context update and conditional probability

Measurement of an observable A with the concrete outcome x in a pre-measurement context C updates this pre-measurement context:

CCA=x. (2.5)

In terms of preparation procedures, we can consider a measurement as a subsequent preparation procedure; context CA=x is the measurement of A and filtering with respect to the fixed outcome x. 3 It is natural to consider this map only for contexts belonging to the set

CA(x)={C:PC(A=x)>0}. (2.6)

If PC(A=x)=0, then the post-measurement context is not well defined. Thus, each observable A and its outcome x determine a map

TA(x):CC,CCA=x=TA(x)C, (2.7)

with the domain of definition CA(x).

The delicate point of measurement theory is that generally an observable does not determine the context update map unequally. An observable A can be measured via different measurement procedures, and each procedure generates its own context update map. A pair IA = (observable, context update map) = ( A,TA) is called a contextual instrument (cf. §5); a pair (C,IA) or a triple (C,A,TA) is called a measurement context. We stress once again that a variety of instruments can be associated with the same observable A:IA=(A,TA),IA=(A,TA),IA=(A,TA),..... We stress that all these update maps have the same domain of definition determined by the observer A ; see equation (2.6).

Typically, one fixes some class of context update maps. In the von Neumann [1,37] measurement theory (§4), this is the class of normalized projections. In quantum instrument theory (§5), these are quantum channels, or more generally, in the theory of Davis–Levis instruments, these are positive trace-preserving maps.

We emphasize that von Neumann’s measurement theory is very special: here, by fixing an observable and a Hermitian operator A^, we automatically fix the update map—via operator’s spectral family. This special situation leads to the illusion that an observable determines the update map. We repeat that generally, this is not the case.

Definition 2.2. Let Σ=(C,O,P) be a contextual probability space. A CMM is a pair M=(Σ,I), where Σ is a contextual probability space and I is a collection of contextual instruments.

CMM is a set of measurement contexts, i.e. triples (C,A,TA). CMM generates the notion of the conditional probability:

Definition 2.3. Consider a measurement context (C,A,TA). Let it generate the output A=x and the corresponding context update, CCA=x=TA(x)C. Consider the measurement of another observable B under the condition A=x, i.e. with respect to the context CA=x. The conditional probability is given by the formula

PC,IA(B=y|A=x)PCA=x(B=y)=PTA(x)C(B=y),CCA(x). (2.8)

We note that this definition involves context update only for the A observable; different contextual instruments IA,IA,..., induce their own probabilistic conditioning, PC,IA(B=y|A=x),PC,IA(B=y|A=x),.... For simplicity, we shall typically omit the index IA of dependence on the concrete instrument.

2.3. Contextual formula of total probability

Now, we point out that generally, contextual probability differs from the classical Kolmogorov probability [36]. One of the basic classical laws of probability is the law of total probability formulated in the form of FTP:

P(B=y)=xXAP(B=y|A=x)P(A=x). (2.9)

In classical probability theory, contextual probability is identified with the conditional one (§3), and the contextual-conditional analogue of FTP has the form

PC(B=y)=xXAPC(B=y|A=x)PC(A=x); (2.10)

see equation (3.4). However, in a general contextual probability space, this formula can be violated:

PC(B=y)xXAPC(B=y|A=x)PC(A=x)= (2.11)
xXAPCA=x(B=y)PC(A=x).

The difference between the left-hand side (l.h.s.) and right-hand side (r.h.s.) determines the degree of context disturbance due to its update; it can serve as a measure of nonclassicality of a contextual model:

δC(B=y|A)=PC(B=y)xXAPC(B=y|A=x)PC(A=x). (2.12)

We call this quantity the interference term [25,29]. This consideration can be formalized in the contextual FTP (with an interference term):

PC(B=y)=xXAPC(B=y|A=x)PC(A=x)+δC(B=y|A). (2.13)

The equality of the interference term to zero is a necessary condition of the classical probabilistic representation of a contextual probability model, but it is not a sufficient condition [29].

Let us, for the moment, jump to §4, where the von Neumann quantum measurement model is treated as CMM. Consider dichotomous observables A=x1,x2 and B=y1,y2. In this case, the interference term has the form

δC(B=y|A)=
2cosθPC(B=y|A=x1)PC(A=x1)PC(B=y|A=x2)PC(A=x2), (2.14)

where context C is identified with the quantum state ψ (for simplicity, consider contexts corresponding to pure states) and the angle θ=θ(B=y|A;ψ). For dichotomous observables, even in general CMM, it is useful to write the interference term as

δC(B=y|A)=
2λPC(B=y|A=x1)PC(A=x1)PC(B=y|A=x2)PC(A=x2), (2.15)

where λ=λ(B=y|A;C). If |λ|1, then this is the trigonometric interference, and the interference term has the form of equation (2.14). If |λ|1, then this is the hyperbolic interference, and the interference term can be represented as

δC(B=y|A)=
±2coshPC(B=y|A=x1)PC(A=x1)PC(B=y|A=x2)PC(A=x2). (2.16)

In the quantum framework, such interference can be generated by quantum instruments [63]. In general CMM, we can employ the hyperbolic version of QM [29].

2.4. Conditional joint probability distribution and order effect

For observables A1,A2O, the conditional joint probability distribution (JPD) is defined by

PC(A1=x1,A2=x2)=PC(A1=x1)PC(A2=x2|A1=x1). (2.17)

We remark that this is really a probability distribution, i.e. x1,x2PC(A1=x1,A2=x2)=1. We can also define JPD for inverse order of measurements, PC(A2=x2,A1=x1)=PC(A2=x2)PC(A1=x1|A2=x2). Observables A1,A2 show OE with respect to context C, if

PC(A1=x1,A2=x2)PC(A2=x2,A1=x1), (2.18)

for at least one pair of outcomes (x1,x2); otherwise, there is no OE in context C.

We remark that OE was actively investigated in decision-making and psychology, both theoretically and experimentally; in particular, within quantum-like modelling—the applications of quantum methodology and formalism to decision-making and psychology [5862].

2.5. Conditional compatibility

In the absence of OE, we have

PC(A1=x1,A2=x2)=PC(A2=x2,A1=x1),xjXAj, (2.19)

i.e.

PC(A1=x1)PC(A2=x2|A1=x1)=PC(A2=x2)PC(A1=x1|A2=x2). (2.20)

In this case, we call the observables conditionally compatible for context CC , and their JPD is defined by equation (2.19). We remark that the marginals of JPD coincide with the probability distributions PCAi.

In the von Neumann CMM MQVN (§4) with observables and state updates of the projection type, conditional compatibility for all possible pre-measurement contexts (given by density operators) is equivalent to the commonly considered compatibility of observables and their representation by commutative Hermitian operators.

By considering conditional JPD, we do not assume that the observables A1 and A2 are jointly measurable. We consider sequential measurements, say first A1 then A2 or vice versa. We remark that, in fact, precisely, this experimental setup is realized in the Bell experiments. Here, the instances of time for the measurement’s outputs on subsystems coincide with zero probability; always the click of the photo-detector for the subsytem S1 is before the click of the photo-detector for the subsystem S2 or vice versa, and the time window serves for click pairing.

Conditional compatibility implies the Bayes formula for the conditional probability:

PC(A2=x2|A1=x1)=PC(A1=x1,A2=x2)PC(A1=x1), (2.21)
PC(A1=x1|A2=x2)=PC(A1=x1,A2=x2)PC(A2=x2). (2.22)

The equality given in equation (2.20) implies the Bayes theorem for probability inference. Let the outcomes of the observable A2 label some hypotheses, H1,...,Hm. Then, equation (2.20) is written as

PC(A2=Hj|A1=x1)=PC(A2=Hj)PC(A1=x1|A2=Hj)PC(A1=x1). (2.23)

The Bayes formula for conditional probability equations (2.21) and (2.22) implies the validity of the classical FTP, i.e. the Bayes theorem can be written in the standard form:

PC(A2=Hj|A1=x1)=PC(A2=Hj)PC(A1=x1|A2=Hj)HiPC(A1=x1|Hi)PC(A2=Hi). (2.24)

2.6. Replicability and response replicability

Observable A shows replicability for the context C, if

PC(A=x,A=x)=PC(A=x), (2.25)

or

PCA=x(A=x)=1. (2.26)

Observable A shows replicability if equation (2.25) holds for any CCA(x),xXA.

In quantum-like modelling, the following effect plays an important role. Observables A1 and A2 show RRE with respect to context C, if

PC(A1=x1,A2=x2,A1=x1)=PC(A1=x1,A2=x2), (2.27)
PC(A2=x2,A1=x1,A2=x2)=PC(A2=x2,A1=x1) (2.28)

for all pairs of outcomes (x1,x2). This is a kind of memory effect. The challenging problem for quantum-like modelling was the combination of OE and RRE [60]. It was solved in articles [61,62] within quantum instrument theory.

2.7. Correlations and Bell-type inequalities

Consider a pair of conditionally compatible observables A,BO (with XA,XBR). Their correlation with respect to a context CC is defined as

ABCE[AB|C]=x,yxyPC(A=x,B=y). (2.29)

The most popular Bell-type inequality is the CHSH inequality. We consider this inequality within CMM. There are given four observables Ai,Bj,i,j=1,2, valued in [1,1]; observables in the pairs (Ai,Bj) are conditionally compatible for some context C with JPDs PCAi,Bj. The CHSH inequality has the form

|A1B1C+A2B1C+A1B2CA2B2C|2, (2.30)

i.e.

|x,yxy(PCA1,B1(x,y)+PCA2,B1(x,y)+PCA1,B2(x,y)PCA2,B2(x,y))|2. (2.31)

If there exists a probability measure PC such that JPDs PCAi,Bj can be obtained as its marginals, e.g.

PCA1,B1(x,y)=x2,y2PC(x,x2,y,y2),

then the CHSH inequality holds true. However, if such PC does not exist, then this inequality can be violated, and the maximum of its l.h.s. with respect to contexts CC and observables A1,A2,B1,B2O valued in [1,1] can approach the value of 4, maxCHSH=4. This maximum depends on CMM. For von Neumann CMM MQVN, this is maxCHSH=22.

2.8. Functions of observables

Suppose that all observables are valued in multidimensional real space, and we remove (for the moment) the restriction that observables’ ranges of values are finite. We consider probability measures on the σ -algebra B of the Borel sets, which is generated by all semi-open intervals, (α1,β1]××(αn,βn]. So, PCA is a probability measure on B. A function f:RnRm is called B -measurable if for any Borel subset G of Rm its preimage f1(G) is a Borel subset of Rn. Only such functions are considered.

Following Mackey [6] (Axiom 3), we assume that for each AO with the range of values Rn and a Borel function f:RnRm , there exists B=BfO such that, for any CC and Borel set GRm,

PCB(G)=PCA(f1(G)). (2.32)

Such observable is uniquely defined, due to condition (2) (Mackey’s Axiom 2), and it can be denoted as B=f(A).

Two observables A1 and A2 are called functionally compatible (jointly measurable) if there exists an observable A and functions fi such that Ai=fi(A). For the von Neumann CMM MQVN (§4), functional compatibility is equivalent to compatibility and, hence, conditional compatibility. Generally, in CMM, the interrelation between these two notions is complex, and we shall not proceed to a detailed comparison.

2.9. Entanglement of contextual instruments

Entanglement is typically considered as one of the distinguishing features of LSR; from my viewpoint, the association of entanglement with LSR and the tensor product structures shadow its physical nature; its mathematical description is identified with physics. As was shown in the articles [64,65], the notion of entanglement can be formalized in the purely probabilistic framework and dissociated from the tensor product and generally from LSR.

By starting with such a probabilistic approach to the notion of entanglement, the authors of [64,65] proceed towards its complex Hilbert space realization. Now, we present the purely probabilistic picture of entanglement. The main value of the contextual probabilistic realization of entanglement is in the clarification of its foundational meaning. At the same time, the use of LSR can essentially simplify concrete calculations. However, one should be careful with the connection of the mathematical structures of LSR with physics (or in quantum-like modelling such as psychology and decision-making).

Consider CMM M=(Σ,I), where Σ=(C,O,P) is a contextual probability space and I is a collection of contextual instruments of this model, i.e. pairs (observable, state update map). Consider two contextual instruments IA=(A,TA) and IB=(B,TB).

Definition 2.4. In pre-measurement context CC, the outcome B=β depends on the outcomes of A if for at least two values of A,α=αi,αj, the corresponding conditional probabilities do not coincide:

PC(B=β|A=αi)PC(B=β|A=αj) (2.33)

Thus, the probability to get the outcome B=β if the preceding A -measurement had the outcome A=αi differs from the probability to get the same outcome B=β if the preceding A -measurement had the outcome A=αj. We remark that the update map TB is not involved in this definition, i.e. one can consider just an observable B without referring to the corresponding instrument IB. For symmetry reason, we consider two instruments.

We note that the outcome B=β does not depend on the outcomes of the observable A if

PC(B=β|A=αi)=PC(B=β|A=αj),for all pairsαi,αj, (2.34)

i.e. the conditional probability for this outcome is constant with respect to the outcomes of A. Denote it PC(B=β|A). The following natural question arises. Does the probability PC(B=β|A) coincide with the unconditional probability PC(B=β)?

Definition 2.5. Two instruments IA and IB are called AB -entangled in CC or C is AB -entangled, if all outcomes of the B -observable depend on outcomes of the A -observable, i.e. for all β condition (2.33) holds for some αi,αj.

Concerning the notation ‘ AB -entangled’, it would be better to write ‘ IAIB -entangled’ to emphasize that this is the entanglement of instruments and not simply the observables but to make the notation compact, we proceed with ‘ AB -entangled’. The order of observables is important. Generally, AB -entanglement does not imply BA -entanglement. This is a purely probabilistic definition that does not involve LSR and can be applied to any statistical physical theory. This definition formalizes the dependence of observables. We introduce the following quantitative measure of entanglement.

Definition 2.6. For contextual instruments IA and IB and pre-measurement context C, AB -concurrence of conditional probabilities is defined as

λAB(ψ)=βαα|PC(B=β|A=α)PC(B=β|A=α)|. (2.35)

The crucial issue is that AB -concurrence depends on a pair of instruments.

Proposition 2.7. For dichotomous observables A,B=±, dependencies of the values B= and B=+ on the outcomes of A are equivalent. Thus, each dependence is equivalent to the AB -entanglement.

Proof. In the state context C, the value B= depends on the outcomes of A if

PC(B=|A=+)PC(B=|A=). (2.36)

This automatically implies that even the value B=+ depends on the outcomes of A,

PC(B=+|A=+)=1-PC(B=-|A=+)
1-PC(B=-|A=-)=PC(B=+|A=-),

i.e. instruments IA and IB are entangled in the context C.

As was already pointed out, in articles [64,65], contextual probabilistic entanglement can be realized in the complex Hilbert space and, in this way, connected with the ordinary notion of entanglement. In the LSR representation, the main distinguishing feature of AB -entanglement (definition 2.5) is that it is associated with a pair of instruments, IA,IB. The standard definition of entanglement is coupled with the tensor product structure and not with two concrete instruments (observables).

For simplicity, let us consider CMM MQVN (§4) with von Neuman observables [37]; in this CMM, an observable A^, a Hermitian operator, automatically determines the state update map through its spectral family. So, there is no need to operate with instruments; one can solely operate with observables. In this CMM (which is typically used in entanglement studies), contextual probabilistic entanglement is associated with the pairs of observables, i.e. two observables are entangled or disentangled in some state (context) C=ρ^, where ρ^ denotes a density operator. The main mathematical features of AB -entanglement and ordinary tensor product-based entanglement are similar, but some essential differences can be found [64,65].

The probabilistic viewpoint on the ‘EPR-paradox’ [66] is presented in Schrödinger’s papers [67,68], which initiated the modern theory of entanglement. However, this theory ignores the important message of Schrödinger: entanglement characterizes the probability update for the outcomes of observable B conditioned on the outcomes of observable A. In the framework of [67,68], it is meaningless to speak about the entanglement without specifying the observables. The state update—the Hilbert space representation of the probability update—encodes the procedure of conditional prediction. For Schrödinger, quantum formalism is a mathematical machinery for probability prediction (as in the Växjö interpretation or QBism), and a quantum state is a part of such machinery. We can say that Schrödinger interpreted quantum probabilities as conditional (contextual) probabilities and entanglement as contextual probabilistic entanglement. However, this is my private interpretation of Schrödinger’s views, and many experts in quantum foundations may disagree with me.

By following Schrödinger [67,68], in article [64], we considered a special sort of contextual probabilistic entanglement that matches perfectly with the Schrödinger analysis of the EPR argument.

Definition 2.8. For CC, instruments IA and IB are perfectly conditionally correlated for values (A=α,B=β) if the conditional probability to get the outcome B=β and if the preceding A -measurement had the outcome A=α equal to 1:

PC(B=β|A=α)=1. (2.37)

More generally, consider observables with values (αi) and (βi) and some set Γ of pairs (αi,βj).

Definition 2.8a (EPR entanglement). Let CC. If instruments IA and IB are perfectly conditionally correlated for all pairs belonging to Γ, then they are called EPR-entangled with respect to set G in the context C.

We are interested in sets Γ such that each of α and β values appears in the pairs once and only once. We call such EPR entanglement complete.

For example, for two dichotomous observables with α,β=±, we consider, e.g. the set of the pairs (A=+,B=),(A=,B=+), in short, A=B EPR entanglement or the pairs (A=+,B=+),(A=,B=),A=B EPR entanglement. We analyse such EPR entanglements.

Let us start with A=B EPR entanglement, i.e. PC(B=|A=+)=1 and PC(B=+|A=)=1. Thus, PC(B=+|A=+)=0 and PC(B=|A=)=0, and PC(B=|A=+)=1PC(B=|A=)=0 and PC(B=+|A=)=1PC(B=+|A=+)=0. In this case, EPR-entangled instruments are automatically entangled in the sense of definition 2.5.

Now turn to A=B EPR entanglement, i.e. PC(B=+|A=+)=1 and PC(B=|A=)=1. Thus, PC(B=|A=+)=0 and PC(B=+|A=)=0, and PC(B=+|A=+)=1PC(B=+|A=)=0 and PC(B=|A=)=1PC(B=|A=+)=0. And again, EPR-entangled instruments are automatically entangled in the sense of definition 2.5.

Thus, EPR entanglement is just a very special case of AB -entanglement.

For dichotomous observables, the AB -concurrence of conditional probabilities, equation (2.35), has the form CAB(ψ)=|P(B=+|A=)P(B=+|A=+)|+|P(B=|A=)P(B=|A=+)|, and hence it can be written as

λAB(ψ)=2|P(B=+|A=)P(B=+|A=+)|. (2.38)

From this formula, we immediately obtain the following characterization of maximally AB -entangled states:

Proposition 2.9. AB -concurrence of conditional probabilities approaches its maximal value, λAB(ψ)=2, if and only if the instruments are EPR-entangled in the pre-measurement context C.

2.10. Distinguishing features of contextual measurement models

We list the probabilistic constraints that can be used to distinguish different CMMs (theoretically and experimentally):

  • violation of FTP

  • OE

  • RRE

  • OE+RRE

  • violation of Bell inequalities

2.11. Interpretations of contextual probability

Probability is characterized by the diversity of interpretations [22]. We now discuss the interpretations of contextual probability. We start with the remark that mathematically, a contextual probability space Σ=(C,O,P) cannot be described as single Kolmogorov probability space: Σ is a bunch of such spaces. However, fixed CC and AO can be realized within some probability space K=(Ω,F,P) with the realization of observable A by a random variable a:ΩXA; its probability distribution coincides with PCA, i.e.

PCA(α)=P(ωΩ:a(ω)=α),αXA.

We note that a contextual probability space Σ can be represented as

Σ=C,AKC,A, (2.39)

where KC,A is a Kolmogorov probability space for describing the measurement of the observable A in the pre-measurement context C.

Therefore, one can assign any interpretation used for probability defined in the measure-theoretic framework to the contextual probability. The main interpretation employed in classical and quantum physics is the frequency interpretation. In the Kolmogorov theory [35,36], this interpretation is mathematically rooted to the strong law of large numbers. Another basic interpretation of probability in physics is the statistical (ensemble) interpretation. By this interpretation, Ω=ΩC represents an ensemble of systems prepared for measurement, and the probability measure P=PA depends on the observable A. Finally, we mention the subjective interpretation. It is widely employed in decision-making and psychology but was not used in physics until QBism was invented.

In the contextual probability, we need not represent a pre-measurement context C by an ensemble of systems. Instead, we can consider a sequence of measurements of an observable A in the same pre-measurement context C,

xxC,A=(x1,...,xN,...,...), (2.40)

where xj(XA={α1,...,αm}) are the measurement’s outcomes. Such sequence determines the frequencies of realizations of the concrete values,

νN(αj)=nN(αj)/N, (2.41)

where νN(αj) is the number of the measurement’s outcomes with the fixed value αj. The probability to obtain the value αj in a sequence of measurements x is defined as the limit

PCA(αj)limNνN(αj). (2.42)

This is the straightforward frequency introduction of probability. The deep mathematical theory of frequency probability was developed by von Mises [30] (see also [22] for an introduction). A sequence x generated by observations is called a collective. Von Mises’ theory is a theory of collectives. Instead of operations on sets, as done in the Kolmogorov measure-theoretic theory, von Mises constructed a probability theory based on operations with collectives. We remark that PC(A=αj)PCA(αj) can be considered as the probability generated by the collective xxC,A, i.e. PCA(αj)=PxC,A(αj). It is important to note that not all collectives are compatible or combinable in von Mises’ terminology. Two collectives x and y are combinable if their combination

z=(z1,...,zN,...),zj=(xj,yj), (2.43)

is also a collective, and the probability distributions Px and Py are marginal for the probability distribution Pz, i.e.

Px(αj)=βiPz(αj,βi),Pz(βi)=αjPz(αj,βi). (2.44)

The frequency probability theory contains the notion of conditional probability that is similar to CMM’s conditioning, and the post-measurement context CA=a corresponds to the post-measurement collective xC,A=a (see [22,30] for details).

In contrast to the Kolmogorov measure-theoretic probability theory, in the von Mises frequency probability theory, classical FTP is violated [22,29], and the probabilities can interfere and generate the additive perturbation of FTP in the form of the interference term. The presence of incombinable collectives leads to the violation of the Bell-type inequalities [22,29].

The notion of a collective was the seed for the future growing theory of random sequences. Besides the existence of the limits for frequencies, equation (2.42), a collective is characterized by the stability of these limits with respect to place selections within a sequence x , i.e. the limit probability is the same for all subsequences of x for a special class of place selections. However, von Mises’ definition of place selection was criticized for non-rigorousness. Its critical analysis was very fruitful and led to the modern theory of randomness (e.g. [69]). In particular, the monograph presents a ‘light version’ of the von Mises theory. In physics (at least in quantum physics), one does not analyse in the random structure of the sequences of the measurement’s outcomes. In principle, in QM, one can proceed with ‘light-collectives’ determined solely by the existence of limits (2.42). The calculus of such ‘light-collectives’ can be explored for the description of the probabilistic structure of QM [69]. The first version of the contextual probability was presented in such a ‘light-frequency’ framework.

Finally, we note that von Neumann [1,37] pointed to the von Mises [30] frequency probability as the probabilistic foundation for QM. This is a complex foundational issue.

3. Contextual measurement model for Kolmogorov theory

Let K=(Ω,F,P) be a Kolmogorov probability space [36]. Here, Ω is a set of any origin, F is a collection of its subsets forming σ -algebra, i.e. F is closed with respect to countable unions and intersections and the operation of complement. (If Ω is finite, then F is a collection of all its subsets and P is a probability measure on F. )

Set C={CF:P(C)0}. This is the set of contexts. For each context C, the Bayes formula defines the conditional probability measure

PC(G)=P(GC)/P(C),G.

We highlight that the statistical mixtures of contexts are not determined, i.e. for subsets C1,C2 of Ω and weights p1,p20,p1+p2=1, there is no subset C of Ω that can be identified with the weighted sum p1C1+p2C2.

As an illustrative example, consider some agricultural region Ω , and as contexts, consider its sub-fields (some areas). Generally, there is no field of the form p1C1+p2C2. In applications to decision-making and cognition, one can meet the situations such that p1C1+p2C2 is determined only for a few pairs of weights p1,p2. This situation is related to the poorness of the set of possible experimental contexts.

The set of observables O is the set of (discrete) random variables, a:ΩXa, where Xa is a finite set (discrete random variables are considered for simplicity). Denote this set by the symbol Rd. For xXa, we set Ωa=x={ωΩ:a(ω)=x}. The contextual probability coincides with the conditional probability given by the Bayes formula:

PCa(x)PC(a=x)=P(Ωa=xC)/P(C). (3.1)

Thus, the set of probability distributions P={PCa:CC,aO}.

For any set DF and random variable aRd,xXa, we define that set

Da=x={ωD:a(ω)=x}=DΩa=x

and the map

Ta(x):FF,DDa=x=Ta(x)D. (3.2)

For any context CC and random variable aRd,xXa, we define the family of contexts

Ca(x)={CC:PC(a=x)>0}={CC:P(Ca=x)>0}.

Each random variable a and its outcome x determine a map

Ta(x):CC,CCa=x=Ta(x)C, (3.3)

with the domain of definition Ca(x).

Thus, classical CMM Mcl consists of measurement contexts composed of pre-measurement contexts—elements of F with non-zero probabilities, observables—(discrete) random variables and context update maps, T={Ta(x)}. Here, each observable a, random variable, determines uniquely the context update maps Ta(x), and, hence, the contextual instrument.

The conditional probability is given by the Bayes formula:

PC(b=y|a=x)=P(Ωb=yΩa=xC)/P(Ωa=xC)
=P(Ωb=yCa=x)/P(Ca=x)=PCa=x(b=y).

Since, for each CC,PC is a probability measure, for any pair of random variables a,b , we have the following version of FTP (§2.3):

PC(b=y)=xXaPC(b=y|a=x)PC(a=x)= (3.4)
xXaPCa=x(b=y)PC(a=x).

In this measurement model, all observables are compatible, conditional JPD coincides with JPD (again by Bayes formula); no pair of observables show OE, since

PC(a1=x1,a2=x2)=PC(a2=x2,a1=x1)=
PC(ωΩ:a1(ω)=x1,a2(ω)=x2).

All observables show repeatability, since PC(a=x,a=x)=PC(a=x), and RRE, since

PC(a1=x1,a2=x2,a1=x1)=PC(ωΩ:a1(ω)=x1,a2(ω)=x2,a1(ω)=x1)=
PC(a1=x1,a2=x2).

The Bell inequalities are not violated, since their derivation is based on the existence of JPD.

We can summarize the properties of classical CMM by referring to the aforementioned list of possibilities:

  • violation of FTP—no

  • OE—no

  • violation of replicability—no

  • RRE—yes

  • OE+RRE—yes

  • violation of Bell inequalities—no

One of the problems of the above contextual representation of the classical probability is that the uniqueness conditions (2.2) and (2.3), Mackey’s axiom 2, can be violated, i.e. generally,

PCa1=PCa2 for any CCa1=a2. (3.5)
PC1a=PC2a for any aOC1=C2. (3.6)

This problem can be easily resolved in the standard way (see below).

Let us consider a Kolmogorov probability space K=(Ω,F,P) with a complete probability measure, i.e. any subset of a set DF,P(D)=0, also belongs to F. We recall that the symmetric difference of two sets D1 and D2 is defined as

D1ΔD2=(D1D2)(D2D1)=(D1D2)(D1D2).

For D1,D2F, we set D1D2 if P(D1ΔD2)=0. This is an equivalence relation on F; it splits F into disjoint equivalence classes. We denote the set of equivalence classes by the symbol F~ and the equivalence class of zero probability sets by the symbol Z~. The set of pre-measurement contexts is C~=F~Z~, i.e. all equivalent classes of sets from F of non-zero measure.

We also modify the class of observables. Two random variables are equivalent, a1a2, if P(ωΩ:a1(ω)a2(ω))=0. This is the equivalence relation on the space of random variables; in our consideration, these are discrete random variables Rd. So, Rd is split into disjoint classes of equivalent random variables, and we denote the set of these classes by the symbol R~d and set O~=R~d.

For a pre-measurement context C~C~ and observable a~R~d , we define the probability distribution PC~a~(x)=PC(a=x) for some representatives CC~ and aR~d (the correctness of this definition is proved below), and set P~={PC~a~}. The modified classical contextual probability space is the triple Σ~=(C~,R~d,P~).

The map Ta(x), see equation (3.3), generates a map of F~ into itself

T~a(x):F~F~. (3.7)

Set C~a(x)={C~C~:T~a(x)C~C~}. Then, T~a(x):C~a(x)C~. A measurement context is a triple (C~,a~,T~a). Modified classical CMM M~cl is given by the set of such measurement contexts.

Now, we demonstrate that in M~cl , the uniqueness conditions (2.2) and (2.3), Mackey’s axiom 2, hold true. We assume that the ranges of values of random variables are subsets of a set X; for simplicity, let X=R. First, we note that if D1,D2D~, then P(D1)=P(D2). We have P(D1)=P((D1D2)(D1D2))=P(D1D2)=P(D2). Let now C1,C2C~, then, as we have seen, P(C1)=P(C2); also, for any GF,P(GC1)=P(GC1C2)=P(GC2). Hence, PC1(G)=PC2(G). So, conditional probability measure PC does not depend on the choice of a representative CC~ , and it can be denoted as PC~.

We show that implication (2.2) holds. Let for random variables a1,a2,PCa1=PCa2 for any context C. Let, for some x,P(Ωa1=x)>0. Take C=Ωa1=x, then

1=PΩa1=x(Ωa1=x)=P(Ωa2=xΩa1=x)/P(Ωa1=x),

i.e.

P(Ωa2=xΩa1=x)=P(Ωa1=x)=P((Ωa2=xΩa1=x)(Ωa1=xΩa2=x).

Hence, P(Ωa1=xΩa2=x)=0 and symmetrically P(Ωa2=xΩa1=x)=0. The sets Ωai=x,i=1,2, belong to the same equivalence class. This implies that the random variables also belong to the same equivalence class.

Now, we turn to implication (2.3). Let, for any random variable a,PC1a=PC2a. select a as the characteristic function of the set C1. Then, PC1a(1)=1=P(C1C2)/P(C2), i.e. P(C1C2)=P(C2)=P((C1C2)(C2C1), i.e. P(C2C1)=0 and symmetrically P(C1C2)=0, i.e. contexts belong to the same equivalence class.

4. Contextual measurement model for von Neumann observables

We restrict the consideration to finite-dimensional Hilbert state spaces. The space of pre-measurement contexts is mathematically represented as the space of density operators D , i.e. C=D, and observables are Hermitian operators (von Neumann observables [1,37]). We denote the space of Hermitian operators by the symbol LH, i.e. O=LH. This real linear space is endowed the with scalar product A^|B^=TrA^B^.

The operator A^LH has the spectral decomposition: A^=xXAxE^A(x), where E^A(x) is the orthogonal projection on the subspace HA(x) composed of eigenvectors with eigenvalue x, and XA is operator’s spectral set. Then,

PρA(x)Pρ(A=x)=TrE^A(x)ρ^=TrE^A(x)ρ^E^A(x), (4.1)
TA(x):DD,ρ^A=xTA(x)ρ^=E^A(x)ρ^E^A(x)TrE^A(x)ρ^E^A(x), (4.2)

with the domain of definition CA(x)={ρ^D:Pρ(A=x)>0}.

Thus, P={PρA:ρD,ALH} and T={TA(x):ALH,xXA}. We remark that observable A uniquely determines the family of maps TA(x),xXA by equality (4.2). So, measurement contexts can be represented by pairs (ρ^,A^),ρ^D,A^LH. We denote this CMM by the symbol MQVN.

In this CMM, one need not define separately the context update maps, they are automatically encoded in observables. On the one hand, this simplifies theory. On the other hand, this is the misleading path in measurement theory, cf. with quantum instrument theory.

In CMM MQVN, the conditional probability is given by the formula

Pρ(B=y|A=x)=TrE^B(y)TA(x)ρ^=TrE^B(y)E^A(x)ρ^E^A(x)TrE^A(x)ρ^E^A(x). (4.3)

It can be rewritten as

Pρ(B=y|A=x)=TrE^B(y)E^A(x)ρ^E^A(x)E^B(y)TrE^A(x)ρ^E^A(x). (4.4)

In this LSR-based CMM, it is convenient to introduce the maps:

IA(x)ρ^=E^A(x)ρ^E^A(x). (4.5)

Then, the above formulas can be rewritten as

Pρ(A=x)=TrIA(x)ρ^, (4.6)
TA(x)ρ^=1Pρ(A=x)IA(x)ρ^ (4.7)

or

IA(x)ρ^=Pρ(A=x)TA(x)ρ^; (4.8)

and the conditional probability is written in the form

Pρ(B=y|A=x)=TrIB(y)IA(x)ρ^TrIA(x)ρ^, (4.9)

and conditional JPD as

Pρ(A=x,B=y)=TrIB(y)IA(x)ρ^. (4.10)

These formulas lead to the quantum instrument theory (§5): (A,IA(x)) is a special quantum instrument, and (A,TA(x)) is the corresponding contextual instrument.

We note that in CMM MQVN probabilities determine contexts (states) and observables (operators), i.e. equations (2.2) and (2.3) hold (Mackey’s axiom 2).

Let PρA1(x)=PρA1(x) for all ρ^D and real x. Then, Tr(EA1(x)EA2(x))ρ^=EA1(x)EA2(x)|ρ^=0. Hence, EA1(x)=EA2(x) for any x; so A^1=A^2. Thus, an observable can be identified with the set of probability distributions PA={PρA:ρ^D}.

Now let Pρ1A(x)=Pρ2A(x) for all A^LH and real x. Then, TrEA(x)(ρ^1ρ^2)=EA(x)|ρ^1ρ^2=0. Hence, ρ^1=ρ^2. Thus, a quantum state can be identified with the set of probability distributions Pρ={PρA:ALH}.

In the von Neumann measurement theory, two observables are compatible if they are represented by commuting operators A^,B^:[A^,B^]=0. Compatibility is interpreted as guaranteeing the possibility of joint measurement of these observables; their JPD is given by the formula:

PρA,B=TrE^A(x)E^B(y)ρ=TrE^B(y)E^A(x)ρ. (4.11)

In fact, this is the separate axiom—a complement to the Born rule [37]. For compatible observables, JPD and conditional JPD coincide. The conditional JPD is given by

Pρ(A=x,B=y)=Pρ(A=x)Pρ(B=y|A=x)= (4.12)
Pρ(A=x)PρA=x(B=y)==TrE^B(y)E^A(x)ρE^A(x)=
TrE^A(x)E^B(y)E^A(x)ρ=TrE^B(y)E^A(x)ρ.

In particular, for compatible observables, there is no OE for any state ρ. If operators A^1,A^2 do not commute, then there exists a state ρ^ showing OE for these observables, i.e. Pρ(A=x,B=y)Pρ(B=y,A=x).

Each observable A shows replicability, e.g.

Pρ(A=x,A=x)=TrE^A(x)E^A(x)ρ^E^A(x)=TrE^A(x)ρ^=Pρ(A=x). (4.13)

If observables are compatible, then for any ρ^ , they show RRE, e.g.

Pρ(A1=x1,A2=x2,A1=x1)=TrE^A1(x1)E^A2(x2)E^A1(x1)ρ^E^A1(x)E^A2(x2)=
TrE^A2(x2)E^A1(x1)ρ^E^A1(x)=Pρ(A1=x1,A2=x2).

We highlight that it is impossible to combine OE and RRE within CMM MQVN [60] (in the finite-dimensional case).

FTP can be violated; classical FTP is additively perturbed by the interference term; for instance, consider a pure state |ψ, and then

Pψ(B=β)=α,αψ|E^A(α)E^B(β)E^A(α)|ψ (4.14)
=αPψ(B=β|A=α)Pψ(A=α)+δψ(B=y|A),

where

δψ(B=y|A)=ααψ|E^A(α)E^B(β)E^A(α)|ψ. (4.15)

On the r.h.s., the first summand corresponds to classical FTP, and the second one is the interference term; it quantifies the degree of non-classicality for this CMM; see §2.3 for FTP in the general contextual probabilistic framework.

Consider dichotomous observables, A=x1,x2 and B=y1,y2, of the von Neumann type. In this case, the interference term has the form

δψ(B=y|A)=
2cosθPψ(B=y|A=x1)Pψ(A=x1)Pψ(B=y|A=x2)Pψ(A=x2), (4.16)

where the angle θ=θ(B=y|A;ψ).

We can summarize the properties of von Neumann CMM with the list of possibilities presented above:

  • iolation of FTP—yes (equations (4.14) and (4.15)).

  • OE—yes. This is a straightforward consequence of the existence of incompatible observables represented by non-commuting operators.

  • iolation of replicability—no (equation (4.13)).

  • RRE—no. This is again a consequence of the incompatibility of some observables. Consider non-commuting operators A^ and B^. For simplicity, assume that they have non-degenerate spectra; consider the orthonormal bases (ejA) and (ejB) consisting of operators’ eigenvectors with eigenvalues (aj) and (bj). At least for one pair of indexes k,m,0<|ekA|emB|<1. Let the initial state be a pure state |ψ. Suppose that the measurement of observable A gives the outcome ak and the sequential measurement observable B gives the outcome bm. Let observable A be measured again. Then, the probability to get again the outcome ak, given by |ekA|emB|2, does not equal one. The case of operators with degenerate spectra is studied in the article [60]. It is important to note that we consider only the finite-dimensional case.

  • OE+RRE—no. Observables demonstrating OE should be incompatible. But, for incompatible observables, RRE is violated.

  • Violation of Bell inequalities—yes [43,44].

5. Contextual measurement model for quantum instruments

In this section, we present CMM MQI , where a measurement process is mathematically described by quantum instrument theory [7,8,11,12,4652]. This CMM extends CMM MQVN , where a measurement process is mathematically described by a Hermitian operator (von Neumann observable).

The space of linear Hermitian operators LH is a real Hilbert space. We consider linear operators acting in it to be superoperators. A superoperator T is called positive if it maps the set of positive semidefinite operators onto itself: for g0,T(g)0.

Consider an observable A with a finite range of values X. Its measurements can be performed with various apparatuses; for each apparatus, the corresponding measurement procedure is mathematically described in the following way.

Any map xI(x), where for each xX, the map I(x) is a positive superoperator, and

I(X)xXI(x):DD (5.1)

is called a quantum instrument. It determines some observable; we denote it by the symbol A ; see equation (5.9).

The probability of the output A=x is given by the generalized Born rule in the form

Pρ(A=x)=Tr[I(x)ρ]. (5.2)

We note that the measurement with the output A=x generates the state update through the transformation

ρρA=x=TA(x)ρI(x)ρTrI(x)ρ, (5.3)

with the domain of definition CA(x)={ρ^D:Pρ(A=x)>0}.

Let

I(x)ρ=E^(x)ρE^(x), (5.4)

where (E^(x)) are projections giving the orthogonal decomposition of I. Such an instrument is called a projection instrument.

The most natural generalization of projection instruments is an atomic instrument. Let (V^(x)) be a family of linear operators constrained by the normalization condition:

xV^(x)V^(x)=I. (5.5)

An atomic quantum instrument is a superoperator of the form

ρI(x)ρ=V^(x)ρV^(x). (5.6)

Applications of the quantum instrument theory to quantum information are typically restricted by the use of atomic instruments.

The space of Hermitian operators LH is the real Hilbert space, i.e. for each linear operator acting in LH (superoperator), its adjoint is well defined. The generalized Born rule can be written as

Pρ(A=x)=I(x)ρ|I=ρ|I(x)I=I(x)I|ρ, (5.7)

where I is the unit operator and I(x) is the superoperator that is adjoint to I(x) in Hilbert space LH. Hence, the generalized Born rule has the form

Pρ(A=x)=TrA^(x)ρ, (5.8)

where

A^(x)=I(x)I. (5.9)

Operators A^(x),xX are called effects; they are positive semi-definite Hermitian and sum up to the unit operator:

xXA^(x)=I.

The family of operators A=(A^(x),xX) is called a POVM; for a subset Δ of X, we set

A^(Δ)=xΔA^(x)0;

this is an additive operator-valued measure, i.e. for Δ1,Δ2X,Δ1Δ2=,

A^(Δ1Δ2)=A^(Δ1)+A^(Δ2).

Instruments of the projection type, equation (5.4), determine the special class of POVMs, projection-valued measures (PVMs).

Two POVMs A=(A^(x),xX) and B=(B^(y),yY) are called compatible, if there exists a POVM C=(C^(x,y),(x,y)X×Y) such that

A^(x)=yYC^(x,y),B^(x)=xXC^(x,y). (5.10)

The compatibility is interpreted as guarantying the possibility of the joint measurement of these observables, and their JPD is given by generalization of the Born rule for compatible von Neumann observables:

PρA,B(x,y)=TrC^(x,y)ρ^. (5.11)

In the contextual probability space, contexts are mathematically represented by density operators (quantum states), C=D, and observables by POVMs (also known as generalized quantum observables), and the probability distributions are determined by the generalized Born rule, equation (5.2). CMM MQI is endowed by quantum instrument maps updating quantum states (contexts) due to the measurement feedback, equation (5.3).

Consider POVM A^=(A^(x)) and all quantum instruments generating it via (5.9). Then, the corresponding state (context) update maps are defined by equality (5.3). The same POVM, a generalized observable, can be coupled to a variety of such maps. Therefore, the commonly used approach highlighting POVMs as generalized observables is ambiguous. POVMs are just byproducts generated by quantum instruments.

Quantum instruments considered earlier were invented in the article [7] (see also monograph [8]), and these are Davies–Levis instruments, so MQI=MQI;DL. In quantum information theory, one uses the special class of quantum instruments given by completely positive maps I(x); we denote the corresponding CMM MQI;O, where I use the index ‘O’ to mention Masanao Ozawa who contributed so much to the theory of such quantum instruments [11,12,4649]. It is commonly assumed that the instruments belonging to MQI;DLMQI;O are non-physical. I debated this question with Masanao Ozawa, and he firmly stays on this position. As was proved by him, only completely positive instruments can be realized through the indirect measurement scheme [11]. This scheme is adequate for quantum measurement processes, and any deviation from this scheme is non-physical. Nevertheless, it might be that in quantum-like modelling, even instruments that are not completely positive can find applications. Such applications would lead to the modification of the indirect measurement scheme, may be through the consideration of non-unitary interactions.

We remark that instrument maps I(x) are linear in the Hilbert space LH. In terms of context (state) update, these maps can be written as

I(x)=Pρ(A=x)TA(x). (5.12)

Hence, in a quantum instrument, CMM scaling of the update map TA(x) by probability Pρ(A=x) is a linear map. Generally, LSR-CMM with the context (state) space C=D can be endowed with context (state) update maps such that scaling (5.12) can be nonlinear. CMM MQI;NL with nonlinear context update maps might be useful for quantum-like modelling. It is interesting to find concrete applications of MQI;NL. Of course, such applications would lead to the modification of the indirect measurement scheme.

We can summarize properties of quantum instrument CMM, both models MQI;DL and MQI;O. Since the quantum instrument model extends the von Neumann model, the majority of properties follow automatically from its properties.

  • violation of FTP—yes

  • OE—yes

  • violation of replicability—yes

  • RRE—no

  • OE+RRE—generally no, but can be realized by special instruments

  • violation of Bell inequalities—yes

So, MQI differs from MQVN with respect to replicability and OE+RRE combination. Replicability in MQVN is a consequence of projection state update. Such an update is idempotent, and thus the value of an observable is replicable. Model MQI permits more general updates; generally, they are not idempotent. The possibility to reproduce OE+RRE combination was demonstrated in the article [61]. This is a technically complicated construction within the theory of quantum instruments, and it is impossible to present this construction in the present paper.

It is interesting to find a property distinguishing MQI;DL and MQI;O through an experimental test, i.e. some experimentally testable property such that only completely positive instruments have it.

One of the important features of the von Neumann model is a coincidence of JPD and conditional JPD for compatible observables. In contrast, the instrument model shows that generally, the situation is not simple at all. Consider two instruments IA(x) and IB(y) such that their observables are POVMs of the PVM type, i.e. A^=(E^A(x)) and B^=(E^B(y)). They are jointly measurable and the JPD is given by equation (4.11). The conditional JPD is given by

Pρ(A=x,B=y)=Pρ(A=x)Pρ(B=y|A=x)= (5.13)
Pρ(A=x)PρA=x(B=y)=TrB(x)A(x)ρ.

The r.h.s. of equations (4.11) and (5.13) coincide only if the instrument superoperators are of the projection type, i.e. I(x)ρ=E^(x)ρE^(x).

Moreover, in this case, two projection-type observables, PVMs, can have a variety of conditional probability distributions corresponding to different instruments generating them by the rule E^(x)=I(x)I.

6. Ordered space measurement model with probability measure states

In this section, we connect the generalized probability theory (the Davies–Lewis approach [7]) for probability measures with CMM. Here, we use the ordered linear space approach. This is the concrete application of the universal scheme based on the abstract framework of ordered linear spaces.

Consider the space M of all real-valued measures on some set Ω with a σ -algebra of subsets F, i.e. MM(Ω,F). Real linear space M has the natural order structure and the positive cone M+ consisting of non-negative measures. Consider the elements of this cone given by probability measures, i.e. μ0 and μ(Ω)=1; we denote this set by the symbol S; this is the set of states and S is a convex subset of M. The latter is endowed with the variation norm, ||μ||=var(μ) , and it is a Banach space. Consider its dual space M, the space of continuous linear functionals f:MR. We denote by A the subset of M, consisting of functionals mapping S into [0,1]. Elements of A are called effects, and these are basic observables. They can be described solely in terms of the state space S as affine functionals valued in [0,1], i.e. AA(S).

Consider the functional uM defined as μu|μ=μ(Ω). Its characteristic property is that u|μ=1 for any state μS.

Let X={x1,..,xm} be a finite set and let A=(A(xi),i=1,...,m) where A(xi)A(S) and A(X)xXA(x)=u. Such vectors of functionals are analogues of POVMs; we call them M -POVMs. These are observables of the contextual probability space Σmeasure with contexts C=S , and the set of probability distributions P defined as

PμA(x)PC(A=x)=A(x)|μ.

As we learn from the quantum instrument theory, the basic elements of measurement procedures are not observables but instruments. Let L(M) denote the space of continuous linear operators, J:MM. The M -instrument with the range of values X is a map I:XL(M) such that I(x)M+M+ and

(X)x(x):𝒮𝒮.

Each instrument determines the state update map

μTA(x)μ1I(x)μ(Ω)I(x)μ=1u|I(x)μI(x)μ,

and the probability distribution

Pμ(A=x)=u|I(x)μ.

The domain of definition of the state update map TA(x) is given by the set of probability measures CA(x)={μS:Pμ(A=x)>0}.

Let J:MM be a continuous linear operator. Then, its adjoint operator J is well-defined, J:MM and

Jf|μ=f|Jμ.

Set

A(x)=(x)u.

Then, for each x,A(x) is an effect, i.e. A(x)A(S). So, each M -instrument determines a M -POVM.

The M -CMM consists of context (states) given by probability measures and POVM-observables with state updates given by instruments.

7. Linear space representation for contextual probability space

The state space is given by the set S , the set of possible measurement outcomes of an observable quantity is denoted by X. Let a system be in a state sS. A probability p(x,s) is assigned to any possible outcome xX. Thus, we have a function

p:X×S[0,1].

To each outcome xX and state sS, this function is a probability of the outcome x for the system that is in the state s. The generalized probability model is a triple (S,p(,),X). We denote by Φ[0,1] the space of function from X to [0,1]. By considering state s as a variable, we obtain the map

SΦ[0,1],ss()=p(,s). (7.1)

It is natural to assume that each state sS determines the probability distribution uniquely, i.e.

p(x,s1)=p(x,s2) for any xXs1=s2. (7.2)

Under this assumption, the map (7.1) is injection. Thus, each state s can be mapped to a function belonging to space Φ[0,1], and it will be denoted by the same symbol s. Consider now the vector space Φ of all real-valued functions on X. So, S is identified with a subset of this functional space. Consider its closed convex hull S¯. The vectors from it are all possible probabilistic mixtures (convex combinations) of states in S.

Each xX defines a linear functional on Φ,ϕfx(ϕ)=ϕ(x). If ϕ=sS¯, then fx(s)=s(x)[0,1], i.e. fx:S¯[0,1]. This is an affine functional on the convex set S¯. It describes a measurement outcome, and fx(s)=p(x,s) is the probability for this outcome in state s.

We denote by A(S¯) the space of all affine functionals

f:S¯[0,1].

In particular, for any xX,fxA(S¯). Any functional fA(S¯) describes an outcome of some observable, and thus f(s) is the probability for that outcome in state s.

In QM, S¯ is the set of density operators, and elements of A(S¯) are called effects—components of POVMs, f(ρ)=TrE^fρ^, where E^f is the effect corresponding to the affine functional f.

The elements of A(S¯) are called effects. It is typically assumed that there exists an element u of A(S¯) such that u(s)=1 for any sS¯. It is an analogue of quantum observable given by the unit operator I. Consider the point-wise order structure on A(S¯),fg if f(s)g(s) for any state s. Thus, any observable fA(S¯) is majorated by u,0fu. A discrete measurement is represented by a set of effects (fi) such that ifi=u.

We now connect LSR to the contextual probabilistic model. We assume that all observables have the same range of values X. The straightforward intention is to set S=C×O . Let, as above, Φ[0,1] denote the space of functions from X to [0,1]. We map S into Φ[0,1],s=(C,A)PCA. However, generally, this map is not injection: PC1A1(x)=PC2A2(x) for all xX does not imply that C1=C2 and A1=A2. So, such straightforward construction seems to be non-proper for our aim.

We modify it by setting X=O×X, the elements of X are pairs x= (observable, outcome) =(A,x). We now use the symbols Φ[0,1] and Φ for functions from X[0,1] and to real line, respectively. Each context C can be represented as a vector belonging to Φ[0,1],C(x)=C(A,x)=PCA(x). Due to equation (2.3), embedding of the set of contexts C into Φ[0,1] is injection. Again, we denote by C¯ the convex hull of C. Each point x= (observable, outcome) =(A,x) determines the affine functional CfA,x(C)=PCA(x)[0,1]. Now fix AO and consider the family of functionals F=(FA(x)=fA,x:xX). This is the representation of observable A.

So, any contextual probability model can be realized like COM—an observational COM.

8. Concluding remarks

As was emphasized in §1, CMM can be considered as the most general probabilistic model for measurement. It can also be considered as a minimalist restructuring of Mackey’s project [6]. Mackey proceeded to quantum logic, and this made the mathematical construction more complicated. One may even say that mathematics shadowed measurement theory. Surprisingly, even this minimalist model (CMM) has a complex structure and represents the basic elements of quantum probability and measurement theory, e.g. interference of probability, OE, entanglement and the violation of the Bell inequalities.

CMM can be employed not only in quantum foundations but also in quantum-like modelling that can employ contextual probability calculi and CMMs that are not based on the complex Hilbert space formalism.

Finally, we reproduce a list of the basic properties that can be used to classify CMMs:

  • violation of FTP

  • OE

  • violation of replicability

  • RRE

  • OE+RRE

  • violation of Bell inequalities.

Although two basic quantum measurement models, von Neumann’s model MQVN and the quantum instruments model MQI, can be distinguished with respect to violation of replicability and OE+RRE combination, they both violate FTP, demonstrate OE and violate Bell inequalities. It would be interesting to find other CMMs that are distinguished, for example, with respect to the violation of FTP and Bell inequalities.

Acknowledgements

This paper is the completion of the long project on contextual probability and quantum physics that started with my discussions with Kolmogorov and Mackey and later with Accardi, Ballentine, Gudder, Mittelstaedt, Ohya, Shiryaev and Volovich. During recent years, I discussed with Plotnitsky the Bohr complementarity principle and with Ozawa quantum instrument realization of measurement theory. Since 2000, I was involved in critical and stimulated debates with Fuchs on QBism and subjective probability in QM. During my visits to Vienna, I had exciting conversations with Rauch and Zeilinger on the (non)realism, (non)locality and (non)contextuality of QM. All these discussions stimulated my thinking on contextual measurement theory and probability.

Appendix A. Terminology: context versus state

We make the following remark about the terminology ‘context versus state’. Since QM operates with the notion ‘state’, generalized probability theory also employes this terminology. However, even in QM using the term ‘state’ is ambiguous. It matches the orthodox Copenhagen interpretation by which a state is treated as the state of an individual quantum system, say the state of an electron—one concrete electron. Many experts consider this interpretation of the quantum state as leading to paradoxes and mismatching with the statistical nature of quantum phenomena. This is a complicated foundational issue, since the leading supporters of the orthodox Copenhagen interpretation also consider QM as a statistical theory in which the state of an individual system encodes the statistics of the coming experimental runs.

For example, Einstein, Koopman, Margenau, Blohintzev and Ballentine and nowadays, for example, Ballian, Nieuwenhuizen and Khrennikov use the so-called statistical (or ensemble) interpretation of QM. By this interpretation, a quantum state represents statistical properties of an ensemble of identically prepared systems. So, whose state? The state of an ensemble? In the operational approach ‘state’ corresponds to a preparation procedure. It seems that the term ‘state’ borrowed from the orthodox Copenhagen interpretation does not match to the statistical and operational interpretations of QM. In the generalized probability theory, the term ‘state’ is typically associated with a preparation procedure or a class of equivalent preparation procedures. However, this meaning of the state is not highlighted, and the output of the generalized probability theory is often projected onto the orthodox Copenhagen interpretation, i.e. this theory interpreted as a theory about the structure of the state space of individual quantum systems. Therefore, in the Växjö interpretation, we prefer to use the notion of context as a complex of experimental conditions, and pre-measurement context can be associated with a class of equivalent preparation procedures (as is done in the consistent presentation of the generalized probability theory), and measurement context is the combination of the preparation, measurement and state update generated by measurement feedback with the fixed outcome.

In contrast to the generalized probability theory employing LSR, we do not assume that the set of pre-measurement contexts C contains contexts generated by statistical mixtures (see Axiom 4 in Mackey’s book [6]), i.e. for C1,C2C and p1,p20,p1+p2=1, the set C need not contain a context that can be identified with p1C1+p2C2. Proceeding without the mixture axiom illuminates the difference between the state and the context; consider, for example, the ‘basic contextual probability representation’ of the classical Kolmogorov probability space (§3). Here, contexts are not probability distributions but elements of the ( σ -)algebra. Generally, a context provides a finer description of the measurement setup than a probability distribution.

Appendix B. Contextual measurement model with versus without linear space representation

Why is it useful to proceed in contextual probabilistic framework as far as possible without appealing to linear space representation?

I start with some remarks on the uncritical use of LSR:

  • LSR shadows the essence of the quantum probability formalism as the machinery for probability inference.

  • LSR for classical probability, through the use of the linear space of measures with the positive cone of non-negative measures and convex state space of probability measures, seems to be inadequate to Kolmogorov’s theory [35,36] based on conditioning (contextualization) with Bayes’ formula (§3).

  • LSR generates (through the creation of convex linear hull and its closure) a plenty of unphysical states and observables [18]), operating with them led, for instance, to von Neumann’s no-go theorem [37]. 4

  • The picture that quantum probability theory is just one LSR of probability diminishes the exclusiveness of linearity in QM. One loses the physical ground for the latter, LSR becomes just a part of the mathematical apparatus of QM.

  • Linking entanglement to the LSR tensor product structure shadows its contextual probabilistic nature and supports the ambiguous statements on quantum non-locality.

  • Recently, the mathematical formalism of quantum theory, especially probability, started to be widely applied outside of physics, e.g. in cognition, psychology, social and political sciences, and economics and finance, the so-called quantum-like modelling (e.g. [39]). In such models, the set of possible states (pre-measurement contexts) is not as rich as in physics. In quantum-like modelling, even the possibility of preparing statistical mixtures is not evident, i.e. proceeding towards convex structures might be misleading.

Footnotes

1

Another approach to the generation of the complex Hilbert space representation of CMM is developed in a series of authors’ works (e.g. [29,54,55]). It is based on the contextual version of FTP, FTP with an interference term.

2

In applications outside of physics, in so-called quantum-like modelling, not all pre-measurement contexts can be straightforwardly represented in the form of a preparation procedure; here, we operate with mental, social and financial pre-measurement contexts. Within the statistical (ensemble) interpretation of QM, contexts are represented as ensembles of similarly prepared systems.

3

In terms of ensembles, A-measurement is performed for systems of initially prepared ensemble, and then systems generating the outcome A=x form new ensemble.

4

Generally, some outputs of quantum information theory obtained in the abstract LSR framework might be its artifacts without coupling to physical reality. The critical analysis of connection of LSR mathematics and physics is needed.

Ethics

This work did not require ethical approval from a human subject or animal welfare committee.

Data accessibility

This article has no additional data.

Declaration of AI use

I have not used AI-assisted technologies in creating this article.

Authors’ contributions

A.K.: conceptualization, data curation, formal analysis, funding acquisition, investigation, methodology, project administration, resources, software, supervision, validation, visualization, writing—original draft, writing—review and editing.

Conflict of interest declaration

I declare I have no competing interests.

Funding

No funding has been received for this article.

References

  • 1. Von Neumann J. 1932. Mathematische Grundlagen der Quantenmechanik. Berlin, Germany: Springer-Verlag. [Google Scholar]
  • 2. Feynman RP. 1951. The concept of probability in quantum mechanics. In Proc. 2nd Berkeley Symp. on Mathematical Statistics and Probability. Berkeley, CA: University of California Press. [Google Scholar]
  • 3. Feynman R, Hibbs A. 1965. Quantum mechanics and path integrals. New York, NY: McGraw-Hill. [Google Scholar]
  • 4. Koopman BO. 1955. Quantum theory and the foundations of probability. In Applied probability (ed. MacColl LA), pp. 97–102, New York, NY: McGraw-Hill. ( 10.1090/psapm/007) [DOI] [Google Scholar]
  • 5. Mackey GW. 1957. Quantum mechanics and Hilbert space. Am. Math. Mon. 64 , 45–57. ( 10.1080/00029890.1957.11989120) [DOI] [Google Scholar]
  • 6. Mackey GN. 1963. Mathematical foundations of quantum mechanics. New York, NY: Benjamin, Inc. [Google Scholar]
  • 7. Davies EB, Lewis JT. 1970. An operational approach to quantum probability. Commun. Math. Phys. 17 , 239–260. ( 10.1007/BF01647093) [DOI] [Google Scholar]
  • 8. Davies EB. 1976. Quantum theory of open systems. London, UK: Academic Press. [Google Scholar]
  • 9. Gudder SP. 1973. Convex structures and operational quantum mechanics. Commun. Math. Phys. 29 , 249–264. ( 10.1007/BF01645250) [DOI] [Google Scholar]
  • 10. Gudder SP. 2014. Stochastic methods in quantum mechanics. Mineola, NY: Courier Corporation. [Google Scholar]
  • 11. Ozawa M. 1980. Optimal measurements for general quantum systems. Rep. Math. Phys. 18 , 11–28. ( 10.1016/0034-4877(80)90036-1) [DOI] [Google Scholar]
  • 12. Ozawa M. 2016. Probabilistic interpretation of quantum theory. New Gener. Comput. 34 , 125–152. ( 10.1007/s00354-016-0205-2) [DOI] [Google Scholar]
  • 13. Accardi L. 1981. Topics in quantum probability. Phys. Rep. 77 , 169–192. ( 10.1016/0370-1573(81)90070-3) [DOI] [Google Scholar]
  • 14. Accardi L. 1984. The probabilistic roots of the quantum mechanical paradoxes. In The wave-particle dualism: a tribute to Louis de Broglie on his 90th birthday pp. 297–330, Dordrecht, The Netherlands: Springer. ( 10.1007/978-94-009-6286-6) [DOI] [Google Scholar]
  • 15. Accardi L. 1997. Urne e camaleonti. Rome, Italy: Il Saggiatore. [Google Scholar]
  • 16. Accardi L. 2022. New challenges for classical and quantum probability. Entropy 24 , 1502. ( 10.3390/e24101502) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Ballentine L. 1986. Techniques and ideas in quantum measurement theory. Ann. New York Acad. Sci. 480 , 382–392. [Google Scholar]
  • 18. Ballentine LE. 1970. The statistical interpretation of quantum mechanics. Rev. Mod. Phys. 42 , 358–381. ( 10.1103/RevModPhys.42.358) [DOI] [Google Scholar]
  • 19. Ballentine LE. 2014. Quantum mechanics: a modern development. Singapore: WSP. [Google Scholar]
  • 20. Ballentine LE. 2001. Interpretations of probability and quantum theory. In Foundations of probability and physics, quantum probability and white noise analysis (ed Khrennikov A), pp. 71–84, Singapore: WSP. [Google Scholar]
  • 21. Svozil K. 1998. Quantum logic. New York, NY: Springer Science and Business Media. [Google Scholar]
  • 22. Khrennikov A. 2009. Interpretations of probability. Berlin, Germany: De Gruyter. ( 10.1515/9783110213195) [DOI] [Google Scholar]
  • 23. Goyal P, Knuth KH, Skilling J. 2010. Origin of complex quantum amplitudes and Feynman’s rules. Phys. Rev. A 81 , 022109. ( 10.1103/PhysRevA.81.022109) [DOI] [Google Scholar]
  • 24. Holik F, Massri C, Plastino A, Sáenz M. 2021. Generalized probabilities in statistical theories. Quant. Rep. 3 , 389–416. ( 10.3390/quantum3030025) [DOI] [Google Scholar]
  • 25. Khrennikov A. 2001. Origin of quantum probabilities. In Foundations of probability and physics (ed. Khrennikov A), pp. 180–200, Singapore: WSP. [Google Scholar]
  • 26. Khrennikov A. 2003. Contextual viewpoint to quantum stochastics. J. Math. Phys. 44 , 2471–2478. ( 10.1063/1.1570952) [DOI] [Google Scholar]
  • 27. Khrennikov A. 2003. Representation of the Kolmogorov model having all distinguishing features of quantum probabilistic model. Phys. Lett. A 316 , 279–296. ( 10.1016/j.physleta.2003.07.006) [DOI] [Google Scholar]
  • 28. Khrennikov AY. 2007. A formula of total probability with the interference term and the Hilbert space representation of the contextual Kolmogorovian model. Theory Probab. Appl. 51 , 427–441. ( 10.1137/S0040585X97982505) [DOI] [Google Scholar]
  • 29. Khrennikov A. 2009. Contextual approach to quantum formalism. Berlin, Germany: Springer. [Google Scholar]
  • 30. von Mises R. 1957. Probability, statistics and truth. London, UK: Macmillan. [Google Scholar]
  • 31. Khrennikov A. 2004. Växjö interpretation of quantum mechanics. In Quantum theory: reconsideration of foundations (ed. Khrennikov A), pp. 163–170, Växjö, Sweden: Växjö University Press. [Google Scholar]
  • 32. Khrennikov A. 2004. Växjö interpretation-2003: realism of contexts. In Quantum theory: reconsideration of foundations (ed Khrennikov A), pp. 323–338, Växjö, Sweden: Växjö University Press. [Google Scholar]
  • 33. Haven E, Khrennikov A. 2016. Quantum probability and the mathematical modelling of decision-making. Philos. Trans. A. Math. Phys. Eng. Sci. 374 , 20150105. ( 10.1098/rsta.2015.0105) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Haven E, Khrennikov A. 2016. Statistical and subjective interpretations of probability in quantum-like models of cognition and decision making. J. Math. Psychol. 74 , 82–91. ( 10.1016/j.jmp.2016.02.005) [DOI] [Google Scholar]
  • 35. Kolmogoroff AN. 1933. Grundbegriffe der Wahrscheinlichkeitsrechnung. Berlin, Germany: Springer. [Google Scholar]
  • 36. Kolmogorov AN. 1956. Foundations of the theory of probability. New York, NY: Chelsea Publication Company. [Google Scholar]
  • 37. Von Neumann J. 1955. Mathematical foundations of quantum mechanics. Princeton, NJ: Princeton University Press. [Google Scholar]
  • 38. Khrennikov A. 2010. Ubiquitous quantum structure: from psychology to finances. Berlin, Germany: Springer. [Google Scholar]
  • 39. Khrennikov AY. 2023. Open quantum systems in biology, cognitive and social sciences. London, UK: Springer Nature. [Google Scholar]
  • 40. Bohr N. 1987. The philosophical writings of Niels Bohr. Woodbridge, UK: Ox Bow Press. [Google Scholar]
  • 41. Khrennikov A. 2017. Bohr against bell: complementarity versus nonlocality. Open Phys. 15 , 734–738. ( 10.1515/phys-2017-0086) [DOI] [Google Scholar]
  • 42. Khrennikov A. 2017. After Bell. Fortschritte Der Physik 65 , 6–N8. ( 10.1002/prop.201600044) [DOI] [Google Scholar]
  • 43. Bell JS. 1966. On the problem of hidden variables in quantum theory. Rev. Mod. Phys 38 , 447–452. ( 10.1103/RevModPhys.38.447) [DOI] [Google Scholar]
  • 44. Bell JS, Aspect A. 2004. Speakable and unspeakable in quantum mechanics, 2nd edn. Cambridge, UK: Cambridge University Press. ( 10.1017/CBO9780511815676) [DOI] [Google Scholar]
  • 45. Beltrametti EG, Cassinelli C. 1983. The logic of quantum mechanics. SIAM Rev. 25 , 429–431. ( 10.1137/1025105) [DOI] [Google Scholar]
  • 46. Ozawa M. 1984. Quantum measuring processes of continuous observables. J. Math. Phys. 25 , 79–87. ( 10.1063/1.526000) [DOI] [Google Scholar]
  • 47. Ozawa M. 1986. On information gain by quantum measurements of continuous observables. J. Math. Phys. 27 , 759–763. ( 10.1063/1.527179) [DOI] [Google Scholar]
  • 48. Ozawa M. 1995. Mathematical characterizations of measurement statistics. In Quantum communications and measurement (eds Belavkin VP, Hirota O, Hudson RL), pp. 109–117, Boston, MA: Springer. ( 10.1007/978-1-4899-1391-3_11) [DOI] [Google Scholar]
  • 49. Ozawa M. 1997. An operational approach to quantum state reduction. Ann. Phys. 259 , 121–137. ( 10.1006/aphy.1997.5706) [DOI] [Google Scholar]
  • 50. Chiribella G, D’Ariano GM, Perinotti P. 2009. Realization schemes for quantum instruments in finite dimensions. J. Math. Phys. 50 , 042101. ( 10.1063/1.3105923) [DOI] [Google Scholar]
  • 51. D’Ariano GM, Chiribella G, Perinotti P. 2016. Quantum theory from first principles: an informational approach. Cambridge, UK: Cambridge University Press. ( 10.1017/9781107338340) [DOI] [Google Scholar]
  • 52. D’Ariano GM, Perinotti P, Tosini A. 2022. Incompatibility of observables, channels and instruments in information theories. J. Phys. A: Math. Theor. 55 , 394006. ( 10.1088/1751-8121/ac88a7) [DOI] [Google Scholar]
  • 53. Khrennikov A. 2015. Quantum-like model of unconscious-conscious dynamics. Front. Psychol. 6 , 997–1010. ( 10.3389/fpsyg.2015.00997) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54. Fuchs CA. 2002. Quantum mechanics as quantum information (and only a little more). In Quantum theory: reconsideration of foundations (ed. Khrennikov A), pp. 463–543, Växjö, Sweden: Växjö University Press. [Google Scholar]
  • 55. Fuchs CA. 2002. The anti-Växjö interpretation of quantum mechanics. In Quantum theory: reconsideration of foundations (ed. Khrennikov A), pp. 99–116, Växjö, Sweden: Växjö University Press. [Google Scholar]
  • 56. Fuchs CA, Schack R. 2011. A quantum-bayesian route to quantum-state space. Found. Phys. 41 , 345–356. ( 10.1007/s10701-009-9404-8) [DOI] [Google Scholar]
  • 57. Fuchs CA. 2023. Qbism, where next? (https://arxiv.org/abs/2303.01446)
  • 58. Wang Z, Busemeyer JR. 2013. A quantum question order model supported by empirical tests of an a priori and precise prediction. Top Cogn. Sci. 5 , 689–710. ( 10.1111/tops.12040) [DOI] [PubMed] [Google Scholar]
  • 59. Wang Z, Solloway T, Shiffrin RM, Busemeyer JR. 2014. Context effects produced by question orders reveal quantum nature of human judgments. Proc. Acad. Sci. USA 111 , 9431–9436. ( 10.1073/pnas.1407756111) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60. Khrennikov A, Basieva I, Dzhafarov EN, Busemeyer JR. 2014. Quantum models for psychological measurements: an unsolved problem. PLoS One 9 , e110909. ( 10.1371/journal.pone.0110909) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Ozawa M, Khrennikov A. 2019. Application of theory of quantum instruments to psychology: combination of question order effect with response replicability effect. Entropy 22 , 37. ( 10.3390/e22010037) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Ozawa M, Khrennikov A. 2021. Modeling combination of question order effect, response replicability effect, and QQ-equality with quantum instruments. J. Math. Psychol. 100 , 102491. ( 10.1016/j.jmp.2020.102491) [DOI] [Google Scholar]
  • 63. Khrennikov AY, Loubenets ER. 2004. On relations between probabilities under quantum and classical measurements. Found. Phys. 34 , 689–704. ( 10.1023/B:FOOP.0000019631.84010.a6) [DOI] [Google Scholar]
  • 64. Basieva I, Khrennikov A. 2022. Conditional probability framework for entanglement and its decoupling from tensor product structure. J. Phys. A: Math. Theor. 55 , 395302. ( 10.1088/1751-8121/ac8bb3) [DOI] [Google Scholar]
  • 65. Khrennikov A, Basieva I. 2023. Entanglement of observables: quantum conditional probability approach. Found. Phys. 53 , 84. ( 10.1007/s10701-023-00725-7) [DOI] [Google Scholar]
  • 66. Einstein A, Podolsky B, Rosen N. 1935. Can quantum-mechanical description of physical reality be considered complete? Phys. Rev. 47 , 777–780. ( 10.1103/PhysRev.47.777) [DOI] [Google Scholar]
  • 67. Schrödinger E. 1935. Die gegenwörtige situation in der quantenmechanik. Naturwissenschaften 23 , 823–828. ( 10.1007/BF01491914) [DOI] [Google Scholar]
  • 68. Schrödinger E. 1980. The present situation in quantum mechanics: a translation of Schrödinger’s ‘Cat Paradox’ paper (by J. D. Trimmer). Proc. Am. Philos. Soc. 124 , 323–338. [Google Scholar]
  • 69. Khrennikov A. 2016. Probability and randomness. Quantum versus classical. Singapore: WSP. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

This article has no additional data.


Articles from Royal Society Open Science are provided here courtesy of The Royal Society

RESOURCES