Skip to main content
PLOS One logoLink to PLOS One
. 2024 Jul 25;19(7):e0307866. doi: 10.1371/journal.pone.0307866

A Bayesian network for modelling the Lady tasting tea experiment

Gang Xie 1,*
Editor: Gonzalo A Ruz2
PMCID: PMC11271908  PMID: 39052687

Abstract

A cup of tea can be made in one of the two ways: the milk or the tea infusion was first added to the cup. The Lady Tasting Tea experiment consists in mixing eight cups of tea, four in one way and four in the other, and presenting them to the Lady for judgment in a random order. This short article presents a Bayesian Network (BN) for modelling the Lady Tasting Tea experiment that provides a comprehensive perspective in inferential analysis of all the data samples possibly generated from the experiment. More specifically, with respect to a prior distribution of three possible levels (pure guessing, 75% sure, and 100% sure) of the Lady’s ability in correctly deciding how a served cup of tea has been made, the proposed BN model enables us to calculate the posterior probabilities of any judgment outcomes possibly made by the Lady.

Introduction

The story and the experiment of the lady tasting tea were employed by R.A. Fisher to open the second chapter of his seminal book, ’The Design of Experiments’ [1] and it is commonly regarded as the symbolic beginning of the test of significance paradigm in modern statistical analysis practice. In 2001, David Salsburg published his book ‘The Lady Tasting Tea: How Statistics Revolutionized Science in the Twentieth Century,’ [2] which was well-received by a wide range of readers. Twenty years later, John Richardson provided ‘A closer look at the lady tasting tea’ [3], in which he quoted Fisher’s description of the story and the experiment [1]:

A lady declares that by tasting a cup of tea made with milk she can discriminate whether the milk or the tea infusion was first added to the cup. We will consider the problem of designing an experiment by means of which this assertion can be tested …

Our experiment consists in mixing eight cups of tea, four in one way and four in the

other, and presenting them to the subject for judgment in a random order. The subject has been told in advance of what the test will consist, namely that she will be asked to taste eight cups, that these shall be four of each kind, and that they shall be presented to her in a random order … Her task is to divide the 8 cups into two sets of 4, agreeing, if possible, with the treatments received.

In order to calculate the probability or chance of the lady correctly identifying all 8 cups, assuming she had made her assessments purely by guessing (namely, the null hypothesis: Pr(correct | each cup) = 0.5), Fisher considered the experiment consisting of 70 equally likely possible outcomes (8! / 4! / 4! = 70). Therefore, the answer is Pr(all correct | 8 cups) = 1/70 ≈ 0.0143 (namely, the p-value, Pr(data | hypothesis) = 0.0143 (4dp) for a one-sided test). Following this approach, it can be shown that, out of 70 all possible outcomes, there are 16 possible cases of guessing 6 cups correctly (4 x 4 = 16); 36 possible cases of guessing 4 cups correctly (6 x 6 = 36); 16 possible cases of guessing 2 cups correctly (4 x 4 = 16); and one possible case of guessing all 8 cups incorrectly. Hence, the probability distribution of the experimental outcomes can be derived and the corresponding p-values be calculated.

However, the very purpose of the lady tasting tea experiment is actually to assess the assumed ability level (hypothesis) of correctly identifying the cups served to this lady based on her performance (the observed sample data) and many of us would agree that this is beyond what p-values can do [4,5]. This motivated the development of this Bayesian network (BN) which is able to provide us the posterior probabilities for the hypothesized ability levels given any one set of the possible samples of the observed cup identification results.

Method and results

Originally developed as a modelling tool from artificial intelligence since late 1980s, today Bayesian Networks (BNs) have found their applications range across the sciences, industries and government organizations [6,7]. Formally, a BN model is a graphical representation, i.e., a directed acyclic graph (DAG), of a joint probability distribution of a set of random variables in which each variable is represented by a node and the dependency relationship is represented by a link/edge for two associated variables [6,7]. Bayesian Network got its name because it can be considered as a mechanism for automatically applying Bayes’ theorem to complex problems. The Bayes Theorem (or Bayes Rule) is a mathematical statement which expresses the interrelationships between the conditional, marginal, and joint probability distributions of random variables as defined in the following formula [8]:

Pr(B|A)=Pr(A|B)Pr(B)Pr(A)=Pr(A,B)Pr(A), (1)

Where A and B are two random variables/events; Pr(A) and Pr(B) are the marginal probability distributions of A and B, respectively; Pr(B|A) is the conditional probability distribution of B given A; Pr(A|B) is the conditional probability distribution of A given B; and Pr(A, B) is the joint probability distribution of A and B. In a complex model that involves many variables, through the Bayes Theorem, a BN model quantifies the local dependency relationships between a variable (node) and its parent variables (nodes) and then all local dependency relationships are integrated based on the probability chain rule so that the joint distribution of the global (i.e., overall) interrelationships among all variables can be determined/characterised [7,9]. Because a BN model represents the joint distribution of all variables included in the model, any one or a subset of these variables can be selected as the target variable(s) (equivalent to the ‘response’ variable in a regression model), allowing for various inferential analyses to be performed by assuming different scenarios based on the ‘findings’ of the remaining variables.

Essentially, a BN model follows a machine learning approach for data analysis. Although the theoretical foundation and computational algorithms underlying BNs are highly involved in subjects such as computer science, mathematics and statistics, the applications of BN models are very intuitive and relatively straightforward because of the availability of many well tested BN application software packages [7,9]. In this study, the BN model was developed using the popular BN software Netica [10] and a static picture of the model was shown in Fig 1.

Fig 1. A Bayesian network with an uniform prior distribution for modelling the experiment of the Lady tasting tea as described in ‘The design of experiments’ [1].

Fig 1

Note that each variable in a BN model is represented by a node. The link between two nodes represents the dependency relationship between two variables. The middle column of each node is a percentage totalling 100%, representing the analysis outcomes of each level within a node. The last column is a graphical representation of the percentage values for each level, which are shown as distribution bars. The vertical dotted lines are markers, which are equally spaced to aid in visualising the comparative heights of the distribution bars. As shown in Fig 1, there are 17 nodes/variables in the BN model: the eight nodes in (bottom row) blue colour represent the sequence of eight cups being served each with 50% of probability being either pouring tea in first or milk first; the eight (middle row) yellow nodes represent the corresponding assessment outcome for each of the eight cups–tea first or milk first as identified by the lady; the pink node at the top is the variable specifying the prior distribution of the lady’s ability to make a correct identification–the default setting is a uniform distribution with three possible ability levels: pure guessing (0.5), 100% sure (1), or an imperfect but true ability to make correct decision (0.75). Here, the concept of probability has been defined as ‘propensity’ while either the ‘relative frequency’ or ‘personal belief’ should only be considered as two alternative/different ways of estimating the magnitude of probability rather than the competitive metaphysical definitions of probability [11]. This BN model therefore is a joint probability distribution of all these 17 nodes/variables (with a total of 3573 conditional probability values) which fully characterises and represents the probabilistic and statistical properties of the lady tasting tea experiment. The BN model’s structure has been manually specified as shown graphically in Fig 1. Note that, the ‘Ability to test’ and ‘cup 1’ are two root nodes which do not have any nodes to depend on. Once these two root nodes were set up, other nodes may be set up according to the lady tasting tea experimental design. In particular, the ways of links connecting nodes and the values input into the conditional probability tables with those eight (bottom row) blue nodes ensure that the served cups shall be four of each kind and the serve of each cup in random order; depending on the status of the prior probability node (the pink node at the top) and the (bottom row) blue nodes, the set of eight (middle row) yellow nodes enables a researcher to model the lady’s tasting and identification processes. Although the default setting with the prior probability node assumes a uniform prior distribution as shown in Fig 1, the researcher can easily specify a non-uniform prior distribution for subsequent inferential analysis by employing Netica’s enter finding calibration function as shown in Fig 2 [12].

Fig 2. The Bayesian network presented in Fig 1 being set up as a model with a non-uniform prior distribution for further inferential analyses.

Fig 2

With the BNs shown in Figs 1 and 2, various inferential analyses can be performed with respect to the lady tasting tea experiment. In particular, with respect to any set of served eight cups of tea (i.e., evidence results presented in (bottom row) blue nodes), the posterior probabilities for the three possible ‘ability to test’ options will be calculated for any set of testing outcomes (i.e., evidence results given in (middle row) test outcome nodes). For example, suppose a testing scenario as this: the set of served eight cups of tea was TMMTMMMMTT (T = Tea first and M = Milk first in preparing the tea) and the lady correctly identified all eight cups as shown in Fig 3. The BN model with uniform prior distribution would return the posterior probability distribution as P(0.5|D) = 0.012, P(0.75|D) = 0.149, and P(1|D) = 0.839 as shown by the (top) pink node in Fig 3. That is, for this testing scenario, the observation evidence showed that the lady’s ability to make a correct identification is much more likely to be 100% certain than partially certain (75:25) or totally by guessing (50:50).

Fig 3. The posterior probability distribution of the lady’s ability to test based on a testing scenario of 100% correct identification of the served set of eight cups of tea in the order of TMMTMMTT.

Fig 3

Compared to a binomial trial with n = 8 experimental design, the lady tasting tea experimental design is much more difficult to model because the former consists of eight independent Bernoulli trials while the latter involves dependent trials. Therefore, different from a binomial trial design, in the lady tasting tea experiment, the posterior probability distribution depends on three factors: the order of the served cups, and both the number and the order of the correctly identified cups. For example, in Fig 4, it is a scenario that the lady supposed having correctly identified six of the eight cups served in the same order of Fig 3. The resulting posterior probability distribution were: P(0.5|D) = 0.419, P(0.75|D) = 0.581, and P(1|D) = 0. However, as shown in Fig 5, a different order of six correctly identified cups scenario ended up with a different posterior probability distribution: P(0.5|D) = 0.243, P(0.75|D) = 0.757, and P(1|D) = 0. Of course, if we would like to accept a non-uniform prior distribution BN model as in Fig 2, an extra layer of complication will add in which renders different posterior probability distributions with respect to the corresponding scenarios assumed above.

Fig 4. The posterior probability distribution of the lady’s ability to test based on a testing scenario of six of eight cups were correctly identified with the tasting identification order as MMMTTMTT (middle row test outcome nodes) against the same (as of Fig 3) served order of TMMTMMTT (bottom row blue nodes).

Fig 4

Hence, the tasting assessment results are: wrong, correct, correct, correct, wrong, correct, correct, correct.

Fig 5. The posterior probability distribution of the lady’s ability to test based on a testing scenario of six of eight cups were correctly identified with the tasting identification order as TMTTMMTM (middle row test outcome nodes) against the same (as of Fig 3) served order of TMMTMMTT (bottom row blue nodes).

Fig 5

Hence, the tasting assessment results are: correct, correct, wrong, correct, correct, correct, correct, wrong.

Discussion

Therefore, this article has presented a BN modelling the lady tasting tea experiment which allows us to perform various inferential analyses, in particular to obtain the posterior probability distribution for any specific set of tea tasting assessment results given a fixed pattern of serving the cups. This BN model can also play a role in providing another empirical case to show the limitations of Fisher’s testing of significance and the much richer applications that the Bayesian statistical analysis can offer. To complete this short article, assuming the lady identified all eight cups correctly, a table of posterior probability distributions was presented by referring to the uniform prior distribution BN as in Fig 1 but by referring to three different settings of prior distributions. Note that, according to Fisher’s test of significance paradigm, the lady in the experiment only has 1/70 ≈ 0.0143 = 1.43% chance to be able to get all eight cups correctly identified by pure guessing; hence, the null hypothesis (i.e., the lady making her assessment based on pure guessing only) should be rejected based on the p-value < 0.05 criterion. However, the nuances of the lady tasting tea experiment told us that the probability of achieving perfect match results could be slightly different if the set of eight cups were served in different order due to the conditional probability nature in its design; furthermore, from a Bayesian perspective, the estimation of the posterior probability of the lady’s ability to make a correct decision also depended on how the prior probability distribution was specified as shown in Table 1. For example, depending on different prior distributions, the estimated posterior probability of pure guessing could be as low as 1.07% or as high as 9.69%!

Table 1. Posterior probability distributions (in percentages) of all 70 possible sets of the making of eight cups of tea that would all be correctly identified under three prior distribution scenarios: Pr3 represents the uniform prior distribution of three possible ability levels 0.5, 0.75, 1; Pr2 represents the uniform prior distribution of binary ability levels 0.5 or 1; Pr2a represents the uniform prior distribution of binary ability levels 0.5 or 0.75 where 0.5 means a pure guessing; 0.75 means 75% of the time the making of the tea will be correctly identified and 1 means the identification will be 100% correct.

cup1 cup2 cup3 cup4 cup5 cup6 cup7 cup8 Pr3=0.5 Pr3=0.75 Pr3=1 Pr2=0.5 Pr2=1 Pr2a=0.5 Pr2a=0.75
Milk Milk Milk Milk Tea Tea Tea Tea 1.07 23.8 75.1 1.41 98.6 4.32 95.7
Milk Milk Milk Tea Milk Tea Tea Tea 1.14 19 79.9 1.41 98.6 5.68 94.3
Milk Milk Milk Tea Tea Milk Tea Tea 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Milk Milk Milk Tea Tea Tea Milk Tea 1.25 11.6 87.1 1.41 98.6 9.67 90.3
Milk Milk Milk Tea Tea Tea Tea Milk 1.25 11.6 87.1 1.41 98.6 9.67 90.3
Milk Milk Tea Milk Milk Tea Tea Tea 1.14 19 79.9 1.41 98.6 5.69 94.3
Milk Milk Tea Milk Tea Milk Tea Tea 1.2 14.9 83.9 1.41 98.6 7.43 92.6
Milk Milk Tea Milk Tea Tea Milk Tea 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Milk Milk Tea Milk Tea Tea Tea Milk 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Milk Milk Tea Tea Milk Milk Tea Tea 1.2 14.9 83.9 1.41 98.6 7.43 92.6
Milk Milk Tea Tea Milk Tea Milk Tea 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Milk Milk Tea Tea Milk Tea Tea Milk 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Milk Milk Tea Tea Tea Milk Milk Tea 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Milk Milk Tea Tea Tea Milk Tea Milk 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Milk Milk Tea Tea Tea Tea Milk Milk 1.2 14.9 83.9 1.41 98.6 7.43 92.6
Milk Tea Milk Milk Milk Tea Tea Tea 1.14 19 79.9 1.41 98.6 5.67 94.3
Milk Tea Milk Milk Tea Milk Tea Tea 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Milk Tea Milk Milk Tea Tea Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Milk Milk Tea Tea Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Milk Tea Milk Milk Tea Tea 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Milk Tea Milk Tea Milk Tea Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Milk Tea Milk Tea Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Milk Tea Tea Milk Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Milk Tea Tea Milk Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Milk Tea Tea Tea Milk Milk 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Milk Tea Tea Milk Milk Milk Tea Tea 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Milk Tea Tea Milk Milk Tea Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Tea Milk Milk Tea Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Tea Milk Tea Milk Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Tea Milk Tea Milk Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Tea Milk Tea Tea Milk Milk 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Milk Tea Tea Tea Milk Milk Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Tea Tea Milk Milk Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Milk Tea Tea Tea Milk Tea Milk Milk 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Milk Tea Tea Tea Tea Milk Milk Milk 1.14 19 79.9 1.41 98.6 5.67 94.3
Tea Milk Milk Milk Milk Tea Tea Tea 1.14 19 79.9 1.41 98.6 5.67 94.3
Tea Milk Milk Milk Tea Milk Tea Tea 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Tea Milk Milk Milk Tea Tea Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Milk Milk Tea Tea Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Milk Tea Milk Milk Tea Tea 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Tea Milk Milk Tea Milk Tea Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Milk Tea Milk Tea Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Milk Tea Tea Milk Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Milk Tea Tea Milk Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Milk Tea Tea Tea Milk Milk 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Tea Milk Tea Milk Milk Milk Tea Tea 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Tea Milk Tea Milk Milk Tea Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Tea Milk Milk Tea Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Tea Milk Tea Milk Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Tea Milk Tea Milk Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Tea Milk Tea Tea Milk Milk 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Tea Milk Tea Tea Milk Milk Milk Tea 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Tea Tea Milk Milk Tea Milk 1.24 11.6 87.1 1.41 98.6 9.67 90.3
Tea Milk Tea Tea Milk Tea Milk Milk 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Tea Milk Tea Tea Tea Milk Milk Milk 1.14 19 79.9 1.41 98.6 5.67 94.3
Tea Tea Milk Milk Milk Milk Tea Tea 1.2 14.9 83.9 1.41 98.6 7.43 92.6
Tea Tea Milk Milk Milk Tea Milk Tea 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Tea Tea Milk Milk Milk Tea Tea Milk 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Tea Tea Milk Milk Tea Milk Milk Tea 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Tea Tea Milk Milk Tea Milk Tea Milk 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Tea Tea Milk Milk Tea Tea Milk Milk 1.2 14.9 83.9 1.41 98.6 7.43 92.6
Tea Tea Milk Tea Milk Milk Milk Tea 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Tea Tea Milk Tea Milk Milk Tea Milk 1.25 11.6 87.1 1.41 98.6 9.69 90.3
Tea Tea Milk Tea Milk Tea Milk Milk 1.2 14.9 83.9 1.41 98.6 7.43 92.6
Tea Tea Milk Tea Tea Milk Milk Milk 1.14 19 79.9 1.41 98.6 5.69 94.3
Tea Tea Tea Milk Milk Milk Milk Tea 1.25 11.6 87.1 1.41 98.6 9.67 90.3
Tea Tea Tea Milk Milk Milk Tea Milk 1.25 11.6 87.1 1.41 98.6 9.67 90.3
Tea Tea Tea Milk Milk Tea Milk Milk 1.2 14.9 83.9 1.41 98.6 7.42 92.6
Tea Tea Tea Milk Tea Milk Milk Milk 1.14 19 79.9 1.41 98.6 5.68 94.3
Tea Tea Tea Tea Milk Milk Milk Milk 1.07 23.8 75.1 1.41 98.6 4.32 95.7

It is hoped that readers could taste the richness of the inferential analysis results possibly obtained from the lady tasting tea experiment. Furthermore, interested readers may employ the BN model attached to this article to explore more inferential analyses by assuming different scenarios (of whatever their research interest) that are permitted by the lady tasting tea experiment. While the posterior probabilities of the Lady Tasting Tea experiment can be obtained from the BN model proposed in this study, expanding the current model to include more flexible prior distributions presents a challenging task. For example, it would be more realistic to assume that the Lady has more than three ability levels, leading to more realistic posterior probability patterns. This should be considered a limitation of this study and warrants future research work.

Supporting information

S1 File. The Bayesian network model presented in this article is provided as a supporting information file.

The model is saved as “The Lady Tasting Tea_extension.dne”.

(DNE)

pone.0307866.s001.dne (133.7KB, dne)

Data Availability

All relevant data are within the paper.

Funding Statement

The author(s) received no specific funding for this work.

References

  • 1.Fisher RA. The Design of Experiments. Edinburgh: Oliver and Boyd; 1935. [Google Scholar]
  • 2.Salsburg D. The Lady Tasting Tea: How Statistics Revolutionized Science in the Twentieth Century: Henry Holt and Company; 2002. [Google Scholar]
  • 3.Richardson JTE. A closer look at the lady tasting tea. SIGNIFICANCE. 2021(October):34–7. doi: 10.1111/1740-9713.01572 [DOI] [Google Scholar]
  • 4.Cohen J. The Earth Is Round (p < .05). American Psychologist. 1994;49(12):997–1003. [Google Scholar]
  • 5.Wasserstein RL, Lazar NA. The ASA’s Statement on p-Values: Context, Process, and Purpose. The American Statistician. 2016;70(2):129–33. doi: 10.1080/00031305.2016.1154108 [DOI] [Google Scholar]
  • 6.Pearl J. Probabilistic Reasoning in Intelligent Systems. San Mateo, CA, USA: Morgan Kaufmann; 1988. [Google Scholar]
  • 7.Kjærulff UB, Madsen AL. Bayesian Networks and Influence Diagrams: A Guide to Construction and Analysis 2nd ed. New Yor, NY: Springer New York; 2013. [Google Scholar]
  • 8.Upton G, Cook I. A Dictionary of Statistics. 2nd Edition ed: Oxford University Press; 2006. [Google Scholar]
  • 9.Korb KB, Nicholson AE. Bayesian Artificial Intelligence. Second Edition ed: CRC Press, Taylor & Francis Group; 2011. [Google Scholar]
  • 10.Norsys software Corp. Netica 6.09. 2021. [Google Scholar]
  • 11.Bunge M. Four concepts of probability. Applied Mathematical Modelling. 1981;5(5):306–12. [Google Scholar]
  • 12.Norsys software Corp. Netica 6.09 online help manual 2021. Available from: https://www.norsys.com/WebHelp/NETICA.htm. [Google Scholar]

Decision Letter 0

Gonzalo A Ruz

4 Jun 2024

PONE-D-24-03996A Bayesian Network for modelling the Lady Tasting Tea experimentPLOS ONE

Dear Dr. Xie,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Jul 19 2024 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Gonzalo A. Ruz, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

Additional Editor Comments:

It is important that the author be able to comply with what was requested by the reviewers, in particular, what was indicated by reviewer 2, in relation to being able to provide more details on the methodology and experimentation for reproducibility purposes.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: I Don't Know

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: PLOS ONE

A Bayesian Network for modelling the Lady Tasting Tea experiment

The authors present research on the use of BN modelling with Netica BN software using the lady tasting tea experiment—a sensory discrimination test (‘M+N’ method with M=N=4, i.e., the octad method or double-tetrad) as an advantage over Fisher’s exact one-sided test assuming equal probability (p-value=0.0143) for each trial. Motivation for this approach was to provide posterior probabilities for hypothesized ability level given any set of possible samples of the observed results—thus assessing the assumed ability level of correct identification based on observed performance vs basis on p-values.

BN models (joint probability distribution of 17 nodes) are depicted in Figures 1-5 for illustration where the top node (1) represents the assumed prior distribution ‘ability to test’ (either uniform or non-uniform), the bottom nodes (8) represent the sequence of samples (tea or milk first), and the middle nodes (8) represent the corresponding assessment outcomes--applying ‘propensity’ as the concept of probability; the posterior probability distribution and ability to test are depicted at the top node--based on serving order and sample identification results. Inferential statistics can then be applied for any set of served cups with posterior probabilities calculated for any set of outcomes. The dependent (vs independent) nature of this approach is noted (i.e., n=8 independent Bernoulli trials vs ‘octad’ experimental design)—3 dependent factors were identified and exemplified via examples and discussion: order of served cups, both number and order of correctly identified cups, plus prior distribution assumption (uniform or non-uniform).

I believe the authors have done an excellent job in describing an empirical case of the lady tasting tea experiment and exemplifying their use of the BN model in this paper. Figures 1-5 and the Table 1 are clear and well presented. The paper is well organized and easy to read/follow. I have no challenges or questions in regard to the application as highlighted in this paper. The authors may suggest follow-on research to expand upon their ‘hopeful’ acceptance/adoption of the method as another case of limitations of Fisher’s testing of significance. Also, they should call out limitations for this BN modelling approach.

Suggested Edits

INTRODUCTION

Needs reference:

In 2001, David Salsburg published his book ‘The Lady Tasting Tea: How

Statistics Revolutionized Science in the Twentieth Century,’ which was well-received by a

wide range of reader

Lines 81-82 –

Drop can--“can fully characterises and represents the probabilistic and statistical properties of the lady tasting tea experiment.”

METHOD AND RESULTS

Line 85 – change “maybe” to “may be”

DISCUSSION

Line 153 – “Therefore” not needed

Lines 153-154 -- Drop “that”:

Lines 156-158 “Hopefully this BN model can…” – Perhaps state this more positively vs hedging—“This BN model can…”

“Therefore, this article has presented a BN that modelling the lady tasting tea experiment which allows us to perform…”

to

“Therefore, this article has presented a BN modelling the lady tasting tea experiment which allows us to perform…”

Line 162 – comma after paradigm

Line 167 – Change to “were served”

Line 171 – Change to “a different prior distribution”, OR “depending on different prior distributions”

Lines 206-207—Bayesian network model in supporting information—I was not able to access this BN model that is supposed to be in the supporting information to check/assess it.

Reviewer #2: The article seems to have been written somewhat carelessly. It would be good to organize the article and review the notations used (in detail). It is not clear how reproducible and reusable the experiment is. It would be helpful to provide more details about how the program works.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 Jul 25;19(7):e0307866. doi: 10.1371/journal.pone.0307866.r002

Author response to Decision Letter 0


10 Jun 2024

7 June, 2024

To: Gonzalo A. Ruz, Ph.D.

Academic Editor

PLOS ONE

Dear Dr. Gonzalo A. Ruz,

Thank you for the opportunity of revision of our manuscript. We really appreciate the time and effort you and the reviewers committed in reviewing our manuscript and providing the valuable feedback for the revision. We have carefully considered all the comments and have made the necessary revisions to improve the quality of our work. Below, we provide a detailed, point-by-point response to each comment. The reviewers' comments are included in italics, followed by our responses in regular font.

Reviewer #1:

Comment 1: “I believe the authors have done an excellent job in describing an empirical case of the lady tasting tea experiment and exemplifying their use of the BN model in this paper. Figures 1-5 and the Table 1 are clear and well presented. The paper is well organized and easy to read/follow. I have no challenges or questions in regard to the application as highlighted in this paper.”

Response: We are sincerely appreciate your positive feedback and delighted to hear that you found our manuscript to be well-organized, clear, and easy to follow. We are pleased that our description of the empirical case and the use of the BN model met your expectations. Your recognition of the clarity of Figures 1-5 and Table 1 is very encouraging. Thank you for your kind words, which motivate us to continue our research with the same rigor and clarity.

Comment 2: “The authors may suggest follow-on research to expand upon their ‘hopeful’ acceptance/adoption of the method as another case of limitations of Fisher’s testing of significance. Also, they should call out limitations for this BN modelling approach.”

Response: Change has been made by adding a few sentences at the end of Discussion section to suggest directions for follow-on research.

Comment 3: “Suggested Edits

INTRODUCTION

Needs reference:

In 2001, David Salsburg published his book ‘The Lady Tasting Tea: How

Statistics Revolutionized Science in the Twentieth Century,’ which was well-received by a

wide range of reader

Lines 81-82 –

Drop can--“can fully characterises and represents the probabilistic and statistical properties of the lady tasting tea experiment.”

METHOD AND RESULTS

Line 85 – change “maybe” to “may be”

DISCUSSION

Line 153 – “Therefore” not needed

Lines 153-154 -- Drop “that”:

Lines 156-158 “Hopefully this BN model can…” – Perhaps state this more positively vs hedging—“This BN model can…”

“Therefore, this article has presented a BN that modelling the lady tasting tea experiment which allows us to perform…”

to

“Therefore, this article has presented a BN modelling the lady tasting tea experiment which allows us to perform…”

Line 162 – comma after paradigm

Line 167 – Change to “were served”

Line 171 – Change to “a different prior distribution”, OR “depending on different prior distributions” ”

Response: Thank you for your thorough editing check of our manuscript. We have carefully examined all your suggestions and have incorporated them into the revised version.

Comment 4: “Lines 206-207—Bayesian network model in supporting information—I was not able to access this BN model that is supposed to be in the supporting information to check/assess it.”

Response: We have indeed submitted the BN model file as part of the supporting information when the manuscript was first submitted. You may check with the Editor to request the submitted BN model for assessment. Otherwise, we are more than happy to provide the BN to you directly upon receiving permission of the journal Editor.

Reviewer #2:

Comment 1: “The article seems to have been written somewhat carelessly. It would be good to organize the article and review the notations used (in detail). It is not clear how reproducible and reusable the experiment is. It would be helpful to provide more details about how the program works.”

Response: We thank this reviewer for his/her time and effort in reviewing our manuscript and providing the valuable feedback. Since the feedback was generally broad, we have done our best to respond in a more specific manner and hope our responses meet reviewer’s expectations.

The structure and style of our manuscript may not follow exactly the common pattern of a research project journal paper because the very purpose of this study is to provide a novel Bayesian solution to the classic Frequentist hypothesis testing question based on the well-known the Lady Tasting Tea experiment that was first introduced by R. A. Fisher some 100 years ago. In the writing of the manuscript, we implicitly assumed that the target readers were familiar with major statistical paradigms (e.g., frequentist and Bayesian approaches) and Bayesian network (BN). This assumption may have been unrealistic. To address the reviewer’s concern, we have therefore added a few paragraphs in the Method and Results section to give a brief introduction to the theoretical foundation of Bayesian network and how BN works for statistical analysis.

We have employed standard mathematical and statistical notations consistent with year-one undergraduate-level textbooks. However, the concepts such as prior and posterior probabilities may be new to readers who are not familiar with Bayesian statistics. The distinction between different probability definitions (e.g., relative frequency definition versus propensity definition) can be both philosophical and challenging to appreciate.

The analyses conducted using the BN model and the results presented in the manuscript can be fully reproduced. We have indeed submitted the BN model file as part of the supporting information when the manuscript was first submitted. You may check with the Editor to request the submitted BN model for assessment. Otherwise, we are more than happy to provide the BN to you directly upon receiving permission of the journal Editor.

All the changes made to the manuscript can be identified clearly in the marked-up copy of the revised version of our manuscript.

Thank you again for your time and effort in handling our manuscript.

Sincerely,

Gang Xie

Charles Sturt University, Australia

Attachment

Submitted filename: Response to Reviewers_Xie_BNmodel.docx

pone.0307866.s002.docx (16.7KB, docx)

Decision Letter 1

Gonzalo A Ruz

15 Jul 2024

A Bayesian Network for modelling the Lady Tasting Tea experiment

PONE-D-24-03996R1

Dear Dr. Xie,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice will be generated when your article is formally accepted. Please note, if your institution has a publishing partnership with PLOS and your article meets the relevant criteria, all or part of your publication costs will be covered. Please make sure your user information is up-to-date by logging into Editorial Manager at Editorial Manager® and clicking the ‘Update My Information' link at the top of the page. If you have any questions relating to publication charges, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Gonzalo A. Ruz, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Both reviewers are satisfied with the revised version of the manuscript.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I am satisfied with the revision of this paper, though I did not pursue the supplementary file from the editor as I don't have access to the Netica software used (I did not download the free LIMITED version). I trust the supplementary file is complete in order for someone to replicate the research herein. The additional explanation of the methodology should help, though I believe some background in Bayesian analysis makes the paper more accessible.

Reviewer #2: All observations have been addressed, therefore I recommend the manuscript "A Bayesian Network for modelling the Lady Tasting Tea experiment" for publication in PLOS ONE.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

Acceptance letter

Gonzalo A Ruz

17 Jul 2024

PONE-D-24-03996R1

PLOS ONE

Dear Dr. Xie,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Prof. Gonzalo A. Ruz

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File. The Bayesian network model presented in this article is provided as a supporting information file.

    The model is saved as “The Lady Tasting Tea_extension.dne”.

    (DNE)

    pone.0307866.s001.dne (133.7KB, dne)
    Attachment

    Submitted filename: Response to Reviewers_Xie_BNmodel.docx

    pone.0307866.s002.docx (16.7KB, docx)

    Data Availability Statement

    All relevant data are within the paper.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES