To the Editor,
We read the paper from Kim et al. [1] published in J Gynecol Oncol. The aim of the study was to compare the diagnostic accuracy of dilatation and curettage (D&C) versus endometrial aspiration biopsy in follow-up evaluation of patients treated with progestin for endometrial hyperplasia (EH). In a prospective multicenter study, the authors reported kappa value to compare.
Although we appreciate this significant study, we would like to raise some methodological issues that can affect the interpretation of results. First, it is important to note that the estimate reported by the authors cannot provide all the information needed for diagnostic accuracy and decision-making in clinical practice, since prevalence of the concordance cells dramatically affect the value of kappa [2,3,4]. Therefore, an alternative approach is to determine sensitivity, specificity, positive and negative predictive values, the likelihood ratios (LR+, ranging from 1 to infinity; the higher the LR+, the more accurate the test; and LR−, ranging from 0 to 1; the lower the LR−, the more accurate the test) and odds ratio (ratio of true to false results), which provides a more accurate estimate [2,5,6]. Second, the authors estimated the diagnostic accuracy by agreement. They mentioned that aspiration biopsy is less accurate than D&C and might not be a reliable method. It should be noted that agreement is typically used to determine reliability (precision). There is a methodological difference between accuracy and reliability. Third, in diagnostic accuracy research, it is essential to evaluate the diagnostic added value, since a diagnostic accuracy of a single test might be excellent, however for clinical purposes it can be worthless. Like evaluating discrimination, it would be possible to estimate the diagnostic added value by receiver operating characteristic (ROC) [2,7,8].
Eventually, any decision in clinical practice needs to evaluate both reliability and accuracy of the test. Without knowledge about the reliability of the test, any judgment would be wrong. Hence, we suggest the authors to estimate both accuracy and reliability of the test by an appropriate method.
Siamak Sabour (https://orcid.org/0000-0002-1928-992X)
Department of Clinical Epidemiology, School of Health and Safety and Safety Promotions and Injury Prevention Research Centre, Shahid Beheshti University of Medical Sciences, Tehran, I.R. Iran. s.sabour@sbmu.ac.ir
Matin Shokrgozar (https://orcid.org/0000-0002-9151-7077)
Department of Clinical Epidemiology, School of Health and Safety, Shahid Beheshti University of Medical Sciences, Tehran, I.R. Iran.
Footnotes
Conflict of Interest: No potential conflict of interest relevant to this article was reported.
- Conceptualization: S.S.
- Methodology: S.S., S.M.
- Supervision: S.S.
- Validation: S.S.
- Writing - original draft: S.S., S.M.
- Writing - review & editing: S.S., S.M.
References
- 1.Kim MK, Seong SJ, Park DC, Hong JH, Roh JW, Kang SB. Comparison of diagnostic accuracy between endometrial curettage and aspiration biopsy in patients treated with progestin for endometrial hyperplasia: a Korean Gynecologic Oncology Group study. J Gynecol Oncol. 2020;31:e51. doi: 10.3802/jgo.2020.31.e51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Grobbee DE, Hoes AW. Clinical epidemiology: principles, methods, and applications for clinical research. Burlington, MA: Jones & Bartlett Learning; 2014. [Google Scholar]
- 3.Sabour S. Reproducibility of semi-automatic coronary plaque quantification in coronary CT angiography with sub-mSv radiation dose; common mistakes. J Cardiovasc Comput Tomogr. 2016;10:e21–2. doi: 10.1016/j.jcct.2016.07.002. [DOI] [PubMed] [Google Scholar]
- 4.Naderi M, Sabour S. Reproducibility of diagnostic criteria associated with atypical breast cytology: a methodological issue. Cytopathology. 2018;29:396. doi: 10.1111/cyt.12560. [DOI] [PubMed] [Google Scholar]
- 5.Sabour S. A common mistake in assessing the diagnostic value of a test: failure to account for statistical and methodologic issues. J Nucl Med. 2017;58:1182–1183. doi: 10.2967/jnumed.115.156745. [DOI] [PubMed] [Google Scholar]
- 6.Šimundić AM. Measures of diagnostic accuracy: basic definitions. EJIFCC. 2009;19:203–211. [PMC free article] [PubMed] [Google Scholar]
- 7.Sabour S, Ghassemi F. Accuracy, validity, and reliability of the infrared optical head tracker (IOHT) Invest Ophthalmol Vis Sci. 2012;53:4776. doi: 10.1167/iovs.12-10324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Sabour S. Reproducibility of dynamic Scheimpflug-based pneumotonometer and its correlation with a dynamic bidirectional pneumotonometry device: methodological issues. Cornea. 2015;34:e14–e15. doi: 10.1097/ICO.0000000000000401. [DOI] [PubMed] [Google Scholar]