Abstract
This paper introduces pragmatic hypotheses and relates this concept to the spiral of scientific evolution. Previous works determined a characterization of logically consistent statistical hypothesis tests and showed that the modal operators obtained from this test can be represented in the hexagon of oppositions. However, despite the importance of precise hypothesis in science, they cannot be accepted by logically consistent tests. Here, we show that this dilemma can be overcome by the use of pragmatic versions of precise hypotheses. These pragmatic versions allow a level of imprecision in the hypothesis that is small relative to other experimental conditions. The introduction of pragmatic hypotheses allows the evolution of scientific theories based on statistical hypothesis testing to be interpreted using the narratological structure of hexagonal spirals, as defined by Pierre Gallais.
Keywords: hypothesis tests, precise hypotheses, pragmatic hypotheses
1. Introduction
Standard hypothesis tests can lead to immediate logical incoherence, which makes their conclusions hard to interpret. This incoherence occurs because such tests have only two possible outcomes. Indeed, Izbicki and Esteves [1] shows that there exists no two-valued test that satisfies desirable statistical properties and is also logically coherent.
In order to overcome such an impossibility result, Esteves et al. [2] propose agnostic hypothesis tests, which have three possible outputs: (A) accept the hypothesis, say H, (E) reject H, or (Y) remain agnostic about H. These tests can be made logically coherent while preserving desirable statistical properties. For instance, both conditions are satisfied by the Generalized Full Bayesian Significance Test (GFBST). Furthermore, Stern et al. [3] show that the GFBST’s modal operators and their respective negations can be represented by vertices of the hexagon of oppositions [4,5,6,7,8,9], which is depicted in Figure 1.
Figure 1.
Hexagons of opposition for statistical modalities.
This paper complements the above static representation with an analysis of the GFBST in the dynamic evolution of scientific theories. The analysis is based on the metaphor of evolutive hexagonal spirals [10,11], in which the logical modalities associated to scientific theories change over time, as in Figure 2. Our key point in this paradigm is reconciling two apparently contradictory facts. On the one hand, precise or sharp hypotheses, that is, hypotheses that have a priori zero probability are central in scientific theories [12,13]. On the other hand, the GFBST never accepts (A) precise hypotheses. These observations lead to the apparent paradox that, if the GFBST were used to test scientific theories, then the acceptance step in the spiral of scientific theories would be forfeited.
Figure 2.
Gallais’ evolutionary spiral.
We overcome this paradox by proposing the concept of a “pragmatic hypothesis” associated to a precise hypothesis. Although precise hypotheses are commonly obtained from mathematical theories used in areas of science and technology [12,13], the associated pragmatic hypothesis is an imprecise hypothesis that is sufficiently good from the practical purpose of an end-user of the theories. For instance, Newtonian theory assumes a gravitational force of magnitude given by the equation , where the gravitational constant G has a precise value. However, the current Committee on Data for Science and Technology (CODATA) value for the gravitational constant is , which includes a standard deviation for the last significant digits, . Thus, it may be reasonable for a given end-user to assume that the theoretical form of the last equation is exact, but that, pragmatically, the constant G can only be known up to a chosen precision. As a result, one might wish to test an imprecise hypothesis associated to the scientific hypothesis of interest [14,15].
This article advocates for the conceptual distinction between a precise scientific theory and an associated pragmatic hypotheses. The alternate use of precise and pragmatic versions of corresponding statistical hypotheses enables the GFBST to (pragmatically) accept scientific hypotheses. Moreover, this alternate use allows the GFBST to track the evolution of scientific theories, as interpreted in the context of Gallais’ hexagonal spirals.
Our main goal in this paper is to formalize testing procedures for a theory taking into consideration the level of precision that is appropriate for a given end-user. To handle this problem, we consider the end-user’s predictions about an experiment of his interest. The variation in these predictions can be explained by a combination of the level of imprecision in the theory and by properties of the end-user’s experiment. For instance, the latter source of variation is influenced by properties of the equipment, including the precision, accuracy, and resolution of measuring devices [16,17], and also error bounds for fundamental constants and calibration factors [18,19,20,21,22,23,24,25,26]. We propose to choose a pragmatic hypothesis in such a way that the imprecision in the end-user’s predictions is mostly due to his experimental conditions and not due to the level of imprecision in the theory that he uses.
In order to develop this argument, Section 2 first adapts Gallais’s metaphor of hexagonal spirals to the evolution of science. Next, Section 3 proposes three methods of decomposing the variability in an end-user’s predictions into the level of precision of the theory he uses and his experimental conditions. Section 3.1 and Section 3.2 use these decompositions in order to build pragmatic hypothesis. They build pragmatic hypotheses for simple hypotheses and then prove that there exists a single way of extending this construction to composite hypotheses while preserving logical coherence in simultaneous hypothesis testing. This methodology is illustrated in Section 4. All proofs are found in Appendix A.
2. Gallais’ Hexagonal Spirals and the Evolution of Science
Following a well-established tradition in structural semantics and narratology [27,28], Gallais and Pollina [10] propose that many classical medieval tales follow the same organizational pattern. More precisely, these narratives exhibit an underlying “intellectual structure” and are organized according to an underlying archetypal format or prototypical pattern. This pattern includes both static and a dynamical aspects. From a static perspective, the logical structure of the narrative is such that each arch is represented by a vertex of the “hexagon of oppositions” [4]. The static hexagon of oppositions is depicted in Figure 1 and represents in each vertex a modal operator among necessity (□), possibility (⋄), contingency (), and their negations (¬). These modal operators are structured according to three axes of opposition, ; a triangle of contrariety, ; another triangle of sub-contrariety, ; and several edges of subalteration ). From a dynamical perspective, the temporal evolution of the narrative follows a spiral (Figure 2) that unwinds (se déroule) around concentric and expanding hexagons of opposition [10,11].
Because the evolution of science can also be conceived as following a spiral pattern [29], its analysis can benefit from the structure in the works by the authors of [10,11]. From a static perspective, the logical modalities induced by agnostic hypothesis tests [3] can be represented in the hexagon of oppositions. From a dynamic perspective, scientific theories evolve as a spiral which unwinds around the following states:
- Extant thesis: This vertex represents a standing paradigm, an accepted theory using well-known formalisms and familiar concepts, relying on accredited experimental means and methods, etc. In fact, the concepts of a current paradigm may become so familiar and look so natural that they become part of a reified ontology. That is, there is a perceived correspondence between concepts of the theory and “dinge-an-sich” (things-in-themselves) as seen in nature [29,30].
- Analysis: This vertex represents the moment when some hypotheses of the standing theory are put in question. At this moment, possible alternatives to the standing hypotheses may still be only vaguely defined.
- Antithesis: This vertex represents the moment when some laws of the standing theory have to be rejected. Such a rejection of old laws may put in question the entire world-view of the current paradigm, opening the way for revolutionary ideas, as described in the next vertex.
- Apothesis/ Prosthesis: This vertex is the locus of revolutionary freedom. Alternative models are considered, and specific (precise) forms investigated. There is intellectual freedom to set aside and dispose of (apothesis) old preconceptions, prejudices and stereotypes, and also to explore and investigate new paths, to put together (prosthesis) and try out new concepts and ideas.
- Synthesis: It is at this vertex that new laws are formulated; this is the point of Eureka moment(s). A selection of old and and new concepts seem to click into place, fitting together in the form of new laws, laws that are able to explain new phenomena and incorporate objects of an expanded reality.
- Enthesis: At this vertex new laws, concepts and methods must enter and be integrated into a consistent and coherent system. At this stage many tasks are performed in order to combine novel and traditional pieces or to accommodate original and conventional components into an well-integrated framework. Finally, new experimental means and methods are developed and perfected, allowing the new laws to be corroborated.
- New Thesis: At this vertex, the new theory is accepted as the standard paradigm that succeeds the preceding one (). Acceptance occurs after careful determination of fundamental constants and calibration factors (including their known precision), metrological and instrumentational error bounds, etc. At later stages of maturity, equivalent theoretical frameworks may be developed using alternative formalisms and ontologies. For example, analytical mechanics offers variational alternatives that are (almost) equivalent to the classical formulation of Newtonian mechanics [31]. Usually, these alternative worldviews reinforce the trust and confidence on the underlying laws. Nevertheless, the existence of such alternative perspectives may also foster exploratory efforts and investigative works in the next cycle in evolution.
Table 1 applies this spiral structure to the evolution of the theories of orbital astronomy and chemical affinity. The evolution of orbital astronomy has been widely studied [32]. The evolution of chemical affinity is presented in greater detail in Stern [29], Stern and Nakano [33].
Table 1.
Evolution of orbital astronomy and chemical affinity.
Vertex | Orbital Astronomy | Chemical Affinity |
---|---|---|
- Enthesis/ - Thesis |
Ptolemaic/Copernican cycles and epicycles |
Geoffroy affinity table and highest rank substitution |
- Analysis | Circular or oval orbits? | Ordinal or numeric affinity? |
- Antithesis | Non-circular orbits | Non-ordinal affinity |
- Apothesis /Prosthesis |
Elliptic planetary orbits, focal centering of sun |
Integer affinity values, for arithmetic recombination |
- Synthesis | Kepler laws! | Morveau rules and tables! |
- Enthesis - Thesis |
Vortex physics theories, Keplerian astronomy |
Affinity + stoichiometry substitution reactions |
- Analysis | Tangential or radial forces? | Total or partial reaction? |
- Antithesis | Non-tangential forces | Non-total substitutions |
- Apothesis/Prosthesis | Radial attraction forces, inverse square of distance | Reversible reactions, equilibrium conditions |
- Synthesis | Newton laws! | Mass-Action kinetics! |
- Enthesis/ - Thesis |
Newtonian mechanics & variational equivalents |
Thermodynamic theories for reaction networks |
The above spiral structure highlights that a statistical methodology should be able to obtain each of the six modalities in the hexagon of oppositions. Before an acceptance vertex (A) in the hexagon is reached by the spiral of scientific evolution, theoretically precise or sharp hypotheses must be formulated. However, a logically coherent hypothesis test, such as the GFBST, can choose solely between rejecting or remaining agnostic (i.e., corroborating) such sharp hypotheses. Once the evolving theory becomes (part of) a well-established paradigm, the GFBST can be used with the goal of accepting non-sharp hypotheses in the context of the same paradigm, a context that includes fundamental constants and calibration factors (and their respective uncertainties), metrological error bounds, specified accuracies of scientific intrumentation, etc. The non-sharp versions of sharp hypotheses used in such tests are called pragmatic, and their formulation is developed in the following sections.
3. Pragmatic Hypotheses
In order to derive pragmatic hypotheses from precise ones, it is necessary to define an idealized future experiment. Let be an unknown parameter of interest, which is used to express scientific hypotheses and that takes values in the parameter space, . A scientific hypothesis takes the form , where . Whenever there is no ambiguity, and are used interchangeably. Also, the determination of is useful for predicting an idealized future experiment, , which takes values in . The uncertainty about Z depends on by means of , the probability measure over when it is known that , .
Often, it is sufficient for an end-user to determine a pragmatic hypothesis, that is, that the parameter lies in a set of plausible values, which is larger than the null hypothesis. This set can be chosen in such a way that the variation over predictions about a future experiment is mostly due to experimental conditions rather than to the imprecision in the value of the parameter. This section formally develops a methodology for determining these pragmatic hypotheses.
In order to compare two parameter values, we use a “predictive dissimilarity”, , which is a function, , such that measures how much the predictions made for based on diverge from the ones made based on . We define and compare three possible choices for such a dissimilarity.
Definition 1.
The Kullback–Leibler predictive dissimilarity, , is
that is, is the relative entropy between and .
Example 1
(Gaussian with known variance.). Let be a random vector with a multivariate Gaussian distribution:
When and ,
(1)
The KL dissimilarity evaluates the distance between the predictive probability distributions for the future experiment under two parameter values, and . Although the KL dissimilarity is general, it can be challenging to interpret. In particular, it can be hard to establish the quality of the predictions for based on when is actually generated from and . A more interpretable dissimilarity is obtained by taking to measure how far are the best predictions for based on and . In this case, if one makes a prediction for based on , , and was actually generated using , then guarantees that will be at most apart from the best possible prediction. Such a dissimilarity is discussed in the following definition.
Definition 2
(Best prediction dissimilarity—BP.). Let be such that is the best prediction for given that . For example, one can take
where is such that measures how bad predicts when . The “best prediction dissimilarity”, , measures how badly predicts relatively to when . Formally,
where is a motononic function. The choice of g in a particular setting aims at improving the interpretation of the best prediction dissimilarity criterion.
Example 2
(BP under quadratic form.). Let , , and be a positive definite matrix. Define the quadratic form induced by to be and
The optimal prediction under is . It follows that
In particular, . Therefore,
(2) In this example, BP can be put in the same scale as by taking . Also, two choices of are of particular interest. When , Equation (2) simplifies to
(3) Similarly, when is the identity matrix, Equation (2) simplifies to
(4) Equation (4) admits an intuitive interpretation. The larger the value of , the more is dispersed and the harder it is to predict its value. Also, measures how far apart are the best prediction for under and . That is, captures that, if one predicts assuming that when it is actually , then the error with respect to the best prediction is increased as a function of the distance between the predictions over the dispersion of .
Example 3
(Gaussian with known variance.). Consider Example 1 and let be as in Example 2. It follows from Equation (4) that when is the identity matrix,
(5) Similarly, it follows from Equation (3) that when ,
(6) Conclude from Equation (6) that, if and , then . Also, when , and , both Equations (5) and (6) simplify to
(7) In some situations, is the average of m independent observations distributed as . In this case, . It follows from Equation (5) that when is the identity, and when .
Although is more interpretable then , it also relies on more tuning variables, such as , , and g. A balance between these features is obtained by a third predictive dissimilarity, which evaluates how easy it is to recover the value of between or based on .
Definition 3
(Classification distance—CD.). Let be such that
assigns to each possible outcome of the future experiment , in which the values of θ, , or make the experimental result more likely. The classification distance between , , and is defined as
is the best Bayes utility in an hypothesis test of against using a uniform prior for θ and the 0/1 utility [15]. By subtracting from this quantity, varies between 0 and and is a distance. Also,
where and is the -distance between probability measures.
Example 4
(Gaussian with known variance.). Consider Examples 1 and 3, when , , obtain
(8) Note that, in this case, would be the same as if, instead of taking , one chose .
Although analytical expressions for CD are generally not available, it is possible to approximate it via numerical integration methods.
The choice between predictive dissimilarity functions depends on the type of guarantee the end-user wishes to obtain. In particular, , , and is not an exhaustive list of dissimilarities. However, some of their properties can be useful in obtaining a choice. For instance, although yields a metric on the parameter space, and are dissimilarities between probability functions. That is, although will generate pragmatic hypothesis that have parameter values numerically close to a given , and will yield pragmatic hypothesis that have parameter values lead to similar prediction about . Also, although evaluates similarity between predictions from an information theoretic perspective, evaluates them from a perspective of hypothesis tests.
3.1. Singleton Hypotheses
We start by defining the pragmatic hypothesis associated to a singleton hypothesis. A singleton hypothesis is one in which the parameter assumes a single value, such as . In this case, the pragmatic hypothesis associated to is the set of points whose dissimilarity to is at most , as formalized below.
Definition 4
(Pragmatic hypothesis for a singleton.). Let , be a predictive dissimilarity function and . The pragmatic hypothesis for , , is
Note that for , .
Example 5
(Gaussian with known variance). Consider Examples 1 and 3 when , and . It follows from Equations (1), (7) and (8) that
Note that the size of each of the pragmatic hypothesis is proportional to . This occurs because each predictive dissimilarity functions makes the prediction error due to the unknown parameter value small with respect to that due to the data variability, .
3.2. Composite Hypotheses
Next, we consider pragmatic hypotheses for general hypotheses , where .
Definition 5.
For each hypothesis , predictive dissimilarity and , is the pragmatic hypothesis associated to induced by and ϵ. Whenever and ϵ are clear or not relevant to the result, we write instead of .
In order to construct these pragmatic hypotheses, we use logically coherent agnostic hypothesis tests. For each hypothesis, an agnostic hypothesis test can either reject it (1), accept it (0), or remain agnostic (1/2) [34]. Esteves et al. [2] shows that an agnostic hypothesis test is logically coherent if and only if it is based on a region estimator. Such tests are presented in Definition 7 and illustrated in Figure 3.
Definition 6.
Let denote the sample space of the data used to test a hypothesis. A region estimator is a function, , where is the power set of Θ.
Definition 7
(Agnostic test based on a region estimator.). The agnostic test based on the region estimator R for testing , , such that
Figure 3.
is an agnostic test based on the region estimator for testing .
Besides the logical conditions on the hypothesis test, one might also impose logical restraints on how pragmatic hypotheses are constructed. For instance, let A and B be two hypothesis, such that B logically entails A, that is, . If a logically coherent test accepts B, then it also accepts A. This property is called monotonocity [1,35,36,37]. One might also impose that , such that if a logically coherent hypothesis test accepts , then it should also accept . Similarly, let be a collection of hypothesis which cover A, that is, . If a logically coherent hypothesis test rejects every , then it rejects A. This property is called union consonance. One might also impose that is such that, if a logically coherent hypothesis test rejects for every i, then it should also reject . The above conditions define the logical coherence of a procedure for constructing pragmatic hypotheses.
Definition 8.
A procedure for constructing pragmatic hypothesis, , is logically coherent if, for every logically coherent hypothesis test ϕ and sample point x:
If for some , then .
If for every and , then .
In order to motivate the above definition, consider that the frequencies of , , and in a given population are , , and , respectively. Note that is a subset of , which denotes the Hardy–Weinberg equilibrium. That is, if the frequencies , , and are, respectively, 0.25, 0.5, and 0.25, then the population follows the Hardy–Weinberg equilibrium. As a result, if one pragmatically accepts that the population satisfies the specified proportions, then one might also wish to pragmatically accept that the population follows the Hardy–Weinberg. Similarly, if one pragmatically rejects for every that the frequencies of , , and are, respectively, , , and , then one might also wish to pragmatically reject that the population follows the Hardy–Weinberg equilibrium. These conditions are assured in Definition 8.
In a logically coherent procedure for constructing pragmatic hypotheses, the pragmatic hypothesis associated to a composite hypothesis is completely determined by the pragmatic hypotheses associated to simple hypotheses. This result is presented in Theorem 1.
Theorem 1.
A procedure for constructing pragmatic hypothesis, , is logically coherent if and only if, for every hypothesis, , .
Using Theorem 1, it is possible to determine a logically coherent procedure for constructing pragmatic hypotheses by determining only the pragmatic hypothesis associated to simple hypothesis, such as in Section 3.1. Theorem 1 is illustrated in Section 4. One can also obtain the following general relation between predictive dissimilarities.
Lemma 1.
.
Besides being logically coherent, it is often desirable in statistics [38,39] and in science [12,13] for a procedure to be invariant to reparametrization, so as to ensure that the procedure reaches the same conclusions whatever the coordinate system is used to specify both the sample and the parameter spaces. For instance, the pragmatic hypothesis that is obtained using the International metric system should be compatible to the one that is obtained using the English metric system. Invariance to reparametrization is formally presented in Definition 10.
Definition 9.
is a reparameterization of if there exists a bijective function, , such that for every , .
Definition 10.
Let be a reparametrization of by a bijective function, . Also, let and be predictive dissimilarity functions. The functions and are invariant to the reparametrization if for every logically coherent procedure for constructing pragmatic hypotheses, ,
Definition 10 states that, if is an hypothesis and invariance to reparametrization holds, then the pragmatic hypothesis obtained in a reparametrization of , say , is the same as the transformed pragmatic hypothesis associated to , . Theorem 2 presents a sufficient condition for obtaining invariance to reparametrization.
Theorem 2.
Let be a reparameterization of given by a bijective function, f. If and satisfy , then and are invariant to this reparametrization.
Corollary 1.
If and are the same choice between KL, BP, or CD, then and are invariant to every reparametrization.
The procedures for constructing pragmatic hypotheses induced by and also satisfy an additional property given by Theorem 3.
Theorem 3.
Let , where ’s are i.i.d. and is identifiable [40,41]. Also, let and be the dissimilarities calculated using . If is logically coherent, then, for every and ,
- (i)
and are non-increasing sequences of sets
- (ii)
and .
Theorem 3 states that the sequence of pragmatic hypotheses for induced by is non-increasing if the dissimilarity is evaluated by either KL or CD. The greater the number of observable quantities , the easier it is to distinguish two parameter values and , and therefore the smaller the amount of parameters that are taken as close to . Also, as the sample size goes to infinity, the pragmatic hypothesis associated to converges to to . In other words, for each , no other parameter value can predict infinitely many observable quantities with a precision sufficiently close to that of .
4. Applications
In the following, pragmatic hypotheses for standard statistical problems are derived. Coscrato et al. [42] provide additional examples and methods for obtaining pragmatic hypotheses.
Example 6
(Gaussian with unknown variance.). Consider the setting from Example 5, but with unknown and . In this case, the parameter is . Consider the composite hypothesis , which is often written as . In this case, let and . Proceeding as in Example 5, it follows that
The rectangular shape of these pragmatic hypotheses seems to be unreasonable, as, for instance, whether a point is close to does not depend on . This is a consequence of the choice of δ in Example 5.
Figure 4 presents the pragmatic hypotheses for and when and , and using the KL and CD dissimilarities. Contrary to BP, the hypotheses obtained from these dissimilarities do not have a rectangular shape. In particular, the triangular shape of the pragmatic hypotheses for is such that the closer is to 0, the smaller the range of values for μ that are included in the pragmatic hypothesis. This behavior might be desirable, as when is small, there is little uncertainty about the value of , and consequently a narrow interval of values of μ can predict Z with precision ϵ.
Example 7
(Hardy–Weinberg equilibrium). Let , where , , and . The Hardy–Weinberg (HW) hypothesis [43], , which is depicted in the red curve in Figure 5 satisfies
If , and , then it follows from Example 2 that
The pragmatic hypotheses that are obtained using , , and for the HW hypothesis are depicted in Figure 5. The choice between BP or KL and CD has a large impact over the shape of the pragmatic hypotheses. Although, for BP, the width of the pragmatic hypothesis is approximately uniform along the HW curve, the width of the pragmatic hypotheses obtained using and is smaller towards the edges of the HW curve. This behavior could be expected, as towards the edges of the HW curve, has the smallest variability. The figure also depicts the challenge in calibrating . Although the pragmatic hypotheses for and have similar sizes when using , this result was obtained for while using .
The pragmatic hypotheses in Figure 5 are further tested using data from Brentani et al. [44], which is presented in Table 2. This study had the goal of verifying association between the APOE-ϵ4 gene and Alzheimer disease. The lower panels of Figure 5 present the 80% HPD regions for the distribution of this gene in each of the eight groups observed in the study. Additionally, they present two simulated datasets, 9 and 10. Groups 9 and 10 were generated by populations that were, respectively, not under and under the HW equilibrium. Group 9 and 10 fall, respectively, outside and inside of the pragmatic hypothesis.
Example 8
(Bioequivalence). Assume that , with σ known. We derive the pragmatic hypothesis for , that is, for . Such a test might be used in a bioequivalence study, where X and Y are the concentrations of an active ingredient in a generic (test) drug medication and in the brand name (reference) medication [45], respectively. As is composite, it helps to derive the pragmatic hypothesis of its constituents.
In order to do so, let , , , and . If and , then
Hence, , which is a circle with center and radius , as depicted on the left panel of Figure 6. In this case, the pragmatic hypothesis is the Tier 1 Equivalence Test hypothesis suggested by the US Food and Drug Administration [45]. The pragmatic hypothesis for is obtained by taking the union of the pragmatic hypotheses associated to its constituents, as illustrated in the right panel of Figure 6. Specifically,
The pragmatic hypothesis for using KL is obtained similarly. Note that
Therefore, and
The pragmatic hypothesis for that is obtained using CD has no analytic expression. However, by observing that , it is possible to show that there exists a monotonically increasing function, , such that
That is, the pragmatic hypothesis associated to have the same shape as in the right panel of Figure 6. They differ solely on how many standard deviations correspond to the width of the pragmatic hypothesis.
Figure 4.
Pragmatic hypotheses in Example 6 for with KL (upper), CD (lower), , and . is represented by a red line in all figures.
Figure 5.
Pragmatic hypotheses obtained for the HW equilibrium, depicted in red, using , for BP and CD and for KL. The blue regions indicate the pragmatic hypothesis for HW and (top) and for HW (bottom). The lower, middle, and right panels were obtained, respectively, with BP, KL, and CD. The green regions in the right panels represents 80% HPD regions for the genotype distribution of each of the eight groups collected by Brentani et al. [44] and two simulated datasets.
Table 2.
Genotype counts for the eight groups in Brentani et al. [44]. Also, the decision of the GFBST agnostic hypothesis test [2] for testing in each group the pragmatic Hardy–Weinberg equilibrium hypothesis with . The decisions are the same for , , and .
AA | AD | DD | Decision | |
---|---|---|---|---|
1 | 4 | 18 | 94 | Agnostic |
2 | 6 | 53 | 74 | Accept |
3 | 57 | 118 | 100 | Agnostic |
4 | 58 | 97 | 48 | Agnostic |
5 | 120 | 361 | 194 | Agnostic |
6 | 206 | 309 | 142 | Accept |
7 | 110 | 148 | 44 | Accept |
8 | 34 | 22 | 12 | Agnostic |
9 | 198 | 282 | 520 | Reject |
10 | 641 | 314 | 45 | Accept |
Figure 6.
Pragmatic hypotheses using BP in Example 8 when σ is known.
5. Final Remarks
The spiral structure studied in the work by the authors of [10] can be used to describe scientific evolution. However, in order for the analogy to be complete, it is necessary to indicate what types of scientific theories or hypotheses are effectively tested in the acceptance vertex of the hexagon of oppositions. We defend that these are pragmatic hypotheses, which are sufficiently precise for the end-user of the theory.
In order to make this statement formal, we introduce three methods for constructing a pragmatic hypothesis associated to a precise hypothesis. These methods are based on three predictive dissimilarity functions: KL, BP and CD. Each of these methods have different advantages. For instance, the scale of BP and CD is more interpretable than KL, making it easier to determine whether the former are large or small. On the other hand, BP relies on the definition of more functions than KL and CD, such as in Definition 2. If these function are chosen inadequately, then the shape of the resultant pragmatic hypothesis might be counterintuitive or meaningless. Finally, CD often does not have an analytic expression. It relies on numerical integration over the sample space, which can be taxing in high dimensions.
The applications at Section 4 present adequate choices of metrics and bounds (defining pragmatic hypotheses) for some given theoretical and experimental setups. Nevertheless, the authors did not propose a general or automated recipe for making these choices, nor do they think this to be a feasible goal. In future research, the authors intend to explore a variety of application cases, some using historical data of important experiments, and discuss possible choices of metrics and bounds for each case. The authors hope that, in time, the accumulation of such examples will provide useful guidelines for the good use of methods developed in this paper, in the same way that the statistical literature provides useful guidelines for choosing good statistical models for practical applications.
Acknowledgments
The authors are grateful for the support of the Institute of Mathematics and Statistics of the University of São Paulo (IME-USP), and the Department of Statistics of UFSCar—The Federal University of São Carlos. Finally, the authors are grateful for advice and comments received from anonymous referees; from participants of the 6th World Congress on the Square of Opposition, held on November 1 to 5, 2018, at Chania, Crete, having as main organizers Jean-Yves Béziau and Ioannis Vandoulakis; and from participants of the 39th International Workshop on Bayesian Inference and Maximum Entropy Methods, held on June 30 to July 5, 2019, at Garching bei München, Bavaria, having as main organizers Udo von Toussaint and Roland Preuss.
Appendix A. Proofs
Proof of Lemma 1.
Let . It follows from Theorem 1 that there exist , such that . Conclude from Pinsker’s inequality that , that is, . □
Proof of Theorem 1.
Let be logically coherent. Pick an arbitrary and note that, if , then . Since is logically coherent, conclude that , that is, . Since was arbitrary, conclude that
(A1) Next, let . For every , . Since is logically coherent, , that is, . Conclude that
(A2) It follows from Equations (A1) and (A2) that . It also follows from direct calculation that, if , then is logically coherent. □
Proof of Theorem 2.
Let
□
Proof of Theorem 3.
Since the ’s are i.i.d., . It follows that
Thus, is a non-increasing sequence of sets. It follows that
where the next-to-last equality follows from the assumption that is identifiable. The proofs for the divergence follows from the fact that . □
Author Contributions
All authors contributed equally to this paper.
Funding
This work was partially supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) grants PQ 306943-2017-4, 301206-2011-2, and 301892-2015-6; and Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP) grants 2019/11321-9, 2017/03363-8, 2014/25302-2, CEPID-2013/07375-0, and CEPID-2014/50279-4. This study was also financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES), Finance Code 001.
Conflicts of Interest
The authors declare no conflict of interest.
References
- 1.Izbicki R., Esteves L.G. Logical consistency in simultaneous statistical test procedures. Log. J. IGPL. 2015;23:732–758. doi: 10.1093/jigpal/jzv027. [DOI] [Google Scholar]
- 2.Esteves L.G., Izbicki R., Stern J.M., Stern R.B. The logical consistency of simultaneous agnostic hypothesis tests. Entropy. 2016;18:256. doi: 10.3390/e18070256. [DOI] [Google Scholar]
- 3.Stern J., Izbicki R., Esteves L., Stern R. Logically-Consistent Hypothesis Testing and the Hexagon of Oppositions. Log. J. IGPL. 2017;25:741–757. doi: 10.1093/jigpal/jzx024. [DOI] [Google Scholar]
- 4.Blanché R. Structures Intellectuelles: Essai sur l’Organisation Systématique des Concepts. Vrin; Paris, France: 1966. (In French) [Google Scholar]
- 5.Béziau J.Y. The power of the hexagon. Log. Univers. 2012;6:1–43. doi: 10.1007/s11787-012-0046-9. [DOI] [Google Scholar]
- 6.Béziau J.Y. Opposition and order. In: Béziau J.Y., Gan-Krzywoszynska K., editors. New Dimensions of the Square of Opposition. Philosophia Verlag; Munich, Germany: 2015. pp. 1–11. [Google Scholar]
- 7.Carnielli W., Pizzi C. Modalities and Multimodalities (Logic, Epistemology, and the Unity of Science) Springer; Berlin, Germany: 2008. [Google Scholar]
- 8.Dubois D., Prade H. On several representations of an uncertain body of evidence. In: Gupta M., Sanchez E., editors. Fuzzy Information and Decision Processes. Elsevier; Amsterdam, The Netherlands: 1982. pp. 167–181. [Google Scholar]
- 9.Dubois D., Prade H. From Blanché’s Hexagonal Organization of Concepts to Formal Concept Analysis and Possibility Theory. Log. Univers. 2012;6:149–169. doi: 10.1007/s11787-011-0039-0. [DOI] [Google Scholar]
- 10.Gallais P., Pollina V. Hegaxonal and Spiral Structure in Medieval Narrative. Yale Fr. Stud. 1974;51:115–132. doi: 10.2307/2929682. [DOI] [Google Scholar]
- 11.Gallais P. Dialectique Du Récit Mediéval: Chrétien de Troyes et l’Hexagone Logique. Rodopi; Amsterdam, The Netherlands: 1982. (In French) [Google Scholar]
- 12.Stern J.M. Symmetry, Invariance and Ontology in Physics and Statistics. Symmetry. 2011;3:611–635. doi: 10.3390/sym3030611. [DOI] [Google Scholar]
- 13.Stern J.M. Continuous versions of Haack’s Puzzles: Equilibria, Eigen-States and Ontologies. Log. J. IGPL. 2017;25:604–631. doi: 10.1093/jigpal/jzx017. [DOI] [Google Scholar]
- 14.DeGroot M.H., Schervish M.J. Probability and Statistics. Pearson Education; London, UK: 2012. [Google Scholar]
- 15.Berger J.O. Statistical Decision Theory and Bayesian Analysis. Springer Science & Business Media; Berlin, Germany: 2013. [Google Scholar]
- 16.Bucher J.L. The Metrology Handbook. 2nd ed. ASQ Quality Press; Milwaukee, WI, USA: 2012. [Google Scholar]
- 17.Czichos H., Saito T., Smith L. Springer Handbook of Metrology and Testing. 2nd ed. Springer; Berlin/Heidelberg, Germany: 2011. [Google Scholar]
- 18.Cohen R., Crowe K., DuMond J. The Fundamental Constants of Physics. CODATA/Interscience Publishers; Geneva, Switzerland: 1957. [Google Scholar]
- 19.Cohen R. Mathematical Analysis of the Universal Physical Constants. Il Nuovo Cimento. 1957;6:187–214. doi: 10.1007/BF02724772. [DOI] [Google Scholar]
- 20.Lévy-Leblond J. On the conceptual nature of the physical constants. Il Nuovo Cimento. 1977;7:187–214. [Google Scholar]
- 21.Pakkan M., Akman V. Hypersolver: A graphical tool for commonsense set theory. Inform. Sci. 1995;85:43–61. doi: 10.1016/0020-0255(94)00114-Q. [DOI] [Google Scholar]
- 22.Akman V., Pakkan M. Nonstandard set theories and information management. J. Intell. Inf. Syst. 1996;6:5–31. doi: 10.1007/BF00712384. [DOI] [Google Scholar]
- 23.Wainwright M.J. Ph.D. Thesis. Massachusetts Institute of Technology; Cambridge, MA, USA: Apr, 2002. Stochastic Processes on Graphs with Cycles: Geometric and Variational Approaches. [Google Scholar]
- 24.Bishop C.M. Pattern Recognition and Machine Learning (Information Science and Statistics) Springer; Heidelberg, Germany: 2006. [Google Scholar]
- 25.Iordanov B. International Conference on Web-Age Information Management. Springer; Berlin, Germany: 2010. HyperGraphDB: A generalized graph database; pp. 25–36. [Google Scholar]
- 26.Gelman A., Vehtari A., Jylänki P., Sivula T., Tran D., Sahai S., Blomstedt P., Cunningham J.P., Schiminovich D., Robert C. Expectation propagation as a way of life: A framework for Bayesian inference on partitioned data. arXiv. 20141412.4869 [Google Scholar]
- 27.Greimas A. Structural Semantics: An Attempt at a Method. University of Nebraska Press; Lincoln, NE, USA: 1983. [Google Scholar]
- 28.Propp V. Morphology of the Folktale. University of Texas Press; Austin, TX, USA: 2000. [Google Scholar]
- 29.Stern J.M. Jacob’s Ladder and Scientific Ontologies. Cybern. Human Knowing. 2014;21:9–43. [Google Scholar]
- 30.Stern J.M. Constructive Verification, Empirical Induction, and Falibilist Deduction: A Threefold Contrast. Information. 2011;2:635–650. doi: 10.3390/info2040635. [DOI] [Google Scholar]
- 31.Abraham R., Marsden J.E. Foundations of Mechanics. Addison-Wesley; Boston, MA, USA: 2013. [Google Scholar]
- 32.Hawking S. The Illustrated On the Shoulders of Giants: The Great Works of Physics and Astronomy. Running Press; Philadelphia, PA, USA: 2004. [Google Scholar]
- 33.Stern J.M., Nakano F. Optimization Models for Reaction Networks: Information Divergence, Quadratic Programming and Kirchhoff’s Laws. Axioms. 2014;3:109–118. doi: 10.3390/axioms3010109. [DOI] [Google Scholar]
- 34.Coscrato V., Izbicki R., Stern R.B. Agnostic tests can control the type I and type II errors simultaneously. [(accessed on 9 September 2019)];Braz. J. Probab. Stat. 2019 Available online: https://www.imstat.org/wp-content/uploads/2019/01/BJPS431.pdf.
- 35.Izbicki R., Fossaluza V., Hounie A.G., Nakano E.Y., de Braganca Pereira C.A. Testing allele homogeneity: The problem of nested hypotheses. BMC Genet. 2012;13:103. doi: 10.1186/1471-2156-13-103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Da Silva G.M., Esteves L.G., Fossaluza V., Izbicki R., Wechsler S. A bayesian decision-theoretic approach to logically-consistent hypothesis testing. Entropy. 2015;17:6534–6559. doi: 10.3390/e17106534. [DOI] [Google Scholar]
- 37.Fossaluza V., Izbicki R., da Silva G.M., Esteves L.G. Coherent hypothesis testing. Am. Statist. 2017;71:242–248. doi: 10.1080/00031305.2016.1237893. [DOI] [Google Scholar]
- 38.Pereira C.A.B., Stern J.M., Wechsler S. Can a Signicance Test be Genuinely Bayesian? Bayesian Anal. 2008;3:79–100. doi: 10.1214/08-BA303. [DOI] [Google Scholar]
- 39.Stern J.M., Pereira C.A.D.B. Bayesian Epistemic Values: Focus on Surprise, Measure Probability! Log. J. IGPL. 2014;22:236–254. doi: 10.1093/jigpal/jzt023. [DOI] [Google Scholar]
- 40.Casella G., Berger R.L. Statistical Inference. Duxbury Press; Pacific Grove, CA, USA: 2002. [Google Scholar]
- 41.Wechsler S., Izbicki R., Esteves L.G. A Bayesian look at nonidentifiability: A simple example. Am. Statist. 2013;67:90–93. doi: 10.1080/00031305.2013.778787. [DOI] [Google Scholar]
- 42.Coscrato V., Esteves L.G., Izbicki R., Stern R.B. Interpretable hypothesis tests. arXiv. 20191904.06605 [Google Scholar]
- 43.Hardy G. Mendelian proportions in a mixed population. 1908. Yale J. Biol. Med. 2003;76:79. [PMC free article] [PubMed] [Google Scholar]
- 44.Brentani H., Nakano E.Y., Martins C.B., Izbicki R., Pereira C.A.d.B. Disequilibrium coefficient: A Bayesian perspective. Stat. Appl. Genet. Mol. 2011;10 doi: 10.2202/1544-6115.1636. [DOI] [Google Scholar]
- 45.Chow S.C., Song F., Bai H. Analytical similarity assessment in biosimilar studies. AAPS J. 2016;18:670–677. doi: 10.1208/s12248-016-9882-5. [DOI] [PMC free article] [PubMed] [Google Scholar]