Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Sep 30.
Published in final edited form as: Stat Med. 2021 Sep 30;40(22):4770–4771. doi: 10.1002/sim.9157

Rejoinder to “On the robustness of latent class models for diagnostic testing with no gold standard”

Matthew R Schofield 1, Michael J Maze 2,3, John A Crump 4, Matthew P Rubach 5, Renee L Galloway 6, Katrina J Sharples 1
PMCID: PMC8552917  NIHMSID: NIHMS1747327  PMID: 34515367

We thank both Dendukuri and Albert for their insightful commentaries on our article. They have provided food for thought for us (the authors), as well as the wider biostatistical community. We highlight some of the important points that were raised and offer our perspective.

Dendukuri highlights the value of working collaboratively within an interdisciplinary group. They discuss how this can help in the development of a realistic depiction of how the tests operate in the context of the disease in question, making it more likely that we specify and fit statistical models that are fit for purpose and can be applied by practitioners. Using the leptospirosis example in Schofield et al1 they discuss how this could involve the latent classes being defined in terms of levels of IgM and IgG antibodies.

We agree that interdisciplinary collaborations (such as ours) are of tremendous value but note the importance of considering not only the model but also the underlying clinical question. We need to ensure that our models are not providing an “answer to the wrong question” (paraphrasing Tukey2). In the leptospirosis examples in Schofield et al,1 we note the rapid tests were being evaluated for clinical utility in diagnosing leptospirosis disease. Therefore, an ability to interpret the latent classes as “disease (leptospirosis)” and “no disease (no leptospirosis)” is critical. Considering a model where we can interpret the latent classes in terms of IgM and IgG antibodies is of little to no clinical interest.

Dendukuri is critical of the simple model we have evaluated noting it is “unrealistic.” Implicit is the notion that understanding such models is of little value in applied research. While we agree with Dendukuri that the model is often unrealistic for real data, we disagree that exploration of a two-state conditionally independent LCM model is of little value. That idealized latent variables (disease status) may not correspond to the fitted latent variables is a fundamental difficulty for any model involving latent features. We believe that demonstrating potential problems with the interpretation of latent states in simple scenarios can be a powerful tool for understanding limitations of a wider model class.

Albert describes the similarity between our results and those for more complex LCMs, in particular Albert and Dodd.3 We agree with Albert’s summary and appreciate the clarity their comments have brought to the discussion. They also outline possible approaches for alleviating the identified problems with lack of robustness of LCMs.4,5 In particular, they suggest the use of a gold standard, even if only for a subset of our observations. We agree that if possible, consideration should be given in the design of the study to use of a gold-standard test on a subset of participants. It is likely to add expense and complexity, but it could provide confidence in the reliable interpretation of the latent classes as disease and disease free, and thus reliable estimates of sensitivity and specificity. Unfortunately this is not possible for leptospirosis as no gold standard is available. What may be possible is a silver standard,6 a test that has perfect specificity but unknown sensitivity. It is unclear the extent to which a silver standard can alleviate the lack of robustness that we and others have identified. Understanding the properties of such models is an area of future research.

REFERENCES

  • 1.Schofield MR, Maze MJ, Crump JA, Rubach MP, Galloway R, Sharples KJ. Robustness of latent class models for diagnostic testing with no gold standard. Stat Med. 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Tukey JW. The future of data analysis. Ann Math Stat. 1962;33:1–67. [Google Scholar]
  • 3.Albert PS, Dodd LE. A cautionary note on the robustness of latent class models for estimating diagnostic error without a gold standard. Biometrics. 2004;60:427–435. [DOI] [PubMed] [Google Scholar]
  • 4.Albert PS, Dodd LE. On estimating diagnostic accuracy from studies with multiple raters and partial gold standard evaluation. J Am Stat Assoc. 2008;103(481):61–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Zhang B, Chen Z, Albert PS. Estimating diagnostic accuracy of raters without a gold standard by exploiting a group of experts. Biometrics. 2012;68(4):1294–1302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Wu Z, Deloria-Knoll M, Hammitt LL, Zeger SL. Partially latent class models for case–control studies of childhood pneumonia aetiology. J Royal Stat Soc Ser C (Appl Stat). 2016;65:97–114. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES