To the Editor:
We have read the recent paper by Shi et al. (1), titled “Diagnostic value of volume-based fluorine-18-fluorodeoxyglucose PET/CT parameters for characterizing thyroid incidentaloma,” with great interest. Diagnostic and prognostic models are typically evaluated using measures of accuracy that do not address clinical consequences (2). The receiver operating characteristic curve (discrimination) is developed by varying the cut-off point used to determine which values of the observed variable will be considered abnormal and then plotting the resulting sensitivities against the corresponding false positive rates (3). In this paper, Shi et al. (1) constructed several logistic regression models to assess the clinical value of fluorine-18-fluorodeoxyglucose positron emission tomography/computed tomography (PET/CT) for differentiating malignant from benign focal thyroid incidentaloma (metabolic tumor volume [MTV] 4.0, MTV 3.5, MTV 3.0, MTV 2.5, total lesion glycolysis [TLG] 4.0, TLG 3.5, TLG 3.0, TLG 2.5, etc.). However, the area under the curve [AUC] value just represents the predictive accuracy (4). In clinical settings, the AUC may be a poor measure of performance in risk prediction models in certain clinical scenarios. 1) The models need not be accurate at extreme ranges, and 2) there may be situations in which a model with a higher AUC may not be desirable (5).
Decision curve analysis (DCA) is an increasingly used method for evaluating diagnostic tests and predictive models by integrating the clinical consequences of false positives and false negatives (2). Plotting the net benefit against the threshold probability yields the “decision curve.” In addition to its many other advantages, this method takes into consideration the patient's choice to put themselves at a risk of false negatives or false positives (6). What the decision curve tells you is the range of threshold probabilities for which the prediction model would be of value (7). Therefore, the threshold TLG 4.0 of 2.475 should be interpreted with caution. DCA is recommended to find the optimal threshold for each model (7).
Footnotes
This study was supported by Undergraduate Scientific Research Projects of The Third Clinical School of Guangzhou Medical University (2018A004 and 2018A0017).
References
- 1.Shi H, Yuan Z, Yuan Z, Yang C, Zhang J, Shou Y, et al. Diagnostic value of volume-based fluorine-18-fluorodeoxyglucose PET/CT parameters for characterizing thyroid incidentaloma. Korean J Radiol. 2018;19:342–351. doi: 10.3348/kjr.2018.19.2.342. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Vickers AJ, Elkin EB. Decision curve analysis: a novel method for evaluating prediction models. Med Decis Making. 2006;26:565–574. doi: 10.1177/0272989X06295361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837–845. [PubMed] [Google Scholar]
- 4.Jeong CW, Jeong SJ, Hong SK, Lee SB, Ku JH, Byun SS, et al. Nomograms to predict the pathological stage of clinically localized prostate cancer in Korean men: comparison with western predictive tools using decision curve analysis. Int J Urol. 2012;19:846–852. doi: 10.1111/j.1442-2042.2012.03040.x. [DOI] [PubMed] [Google Scholar]
- 5.Talluri R, Shete S. Using the weighted area under the net benefit curve for decision curve analysis. BMC Med Inform Decis Mak. 2016;16:94. doi: 10.1186/s12911-016-0336-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Allyn J, Allou N, Augustin P, Philip I, Martinet O, Belghiti M, et al. A comparison of a machine learning model with EuroSCORE II in predicting mortality after elective cardiac surgery: a decision curve analysis. PLoS One. 2017;12:e0169772. doi: 10.1371/journal.pone.0169772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Steyerberg EW, Vickers AJ. Decision curve analysis: a discussion. Med Decis Making. 2008;28:146–149. doi: 10.1177/0272989X07312725. [DOI] [PMC free article] [PubMed] [Google Scholar]