Authors’ Reply to: Screening Tools: Their Intended Audiences and Purposes. Comment on “Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study”

Nicolas Munsch; Alistair Martin; Stefanie Gruarin; Jama Nateqi; Isselmou Abdarahmane; Rafael Weingartner-Ortner; Bernhard Knapp

doi:10.2196/26543

letter

. 2021 May 21;23(5):e26543. doi: 10.2196/26543

Authors’ Reply to: Screening Tools: Their Intended Audiences and Purposes. Comment on “Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study”

Nicolas Munsch ¹, Alistair Martin ¹, Stefanie Gruarin ², Jama Nateqi ^2,³, Isselmou Abdarahmane ¹, Rafael Weingartner-Ortner ^1,², Bernhard Knapp ^1,^4,^✉,^#

Editor: Thomas Derrick

PMCID: PMC8178729 PMID: 33989162

We thank Millen et al [1] (Ada Health) for having taken the time to read our recently published COVID-19 symptom checker comparison study [2] and for their letter to the editor. Millen et al [1] identified three opportunities for improvement to our manuscript. We wish to provide an itemized response to their letter as outlined below.

First, Millen et al [1] state that the Symptoma symptom checker is a fundamentally different tool, which “might be useful in some hospital settings as a clinical decision support tool“ but should not be compared to other layperson COVID-19 symptom checkers. We disagree with this assessment. Indeed, the opposite is true: the same Symptoma engine is widely used by laypersons and therefore the comparison as executed in our publication is appropriate.

Second, Millen et al [1] state that Symptoma’s superior accuracy is due to its unique ability “to make use of professional interpretation of clinical findings” and that such data should not be used by any of the symptom checkers. At Symptoma, we have found that the greater the wealth of data flowing into the symptom checker AI (artificial intelligence), the better the output and results generated for our users. Further, Millen et al [1] state that “Symptoma does not indeed perform superiorly” if no clinical findings are taken into account while referring to the appendix of our study [2]. We respectfully disagree with this on several fronts: (1) the appendix shows a different analysis than that referred to by Millen et al [1] and (2) that if the analysis is modified to fit Millen et al’s [1] suggestion, which we consider less appropriate, the ranking still remains unchanged. The F1 scores will be slightly modified (from 0.92 to 0.90 for “high risk” only while it remains at 0.91 for "high risk" and "medium risk") while still ranking Symptoma first.

Lastly, Millen et al [1] suggest that the evaluation of their symptom checker should not be based on “accuracy.” The most prevalent outcome parameter of comparative symptom checker studies is diagnostic accuracy [3-5]. Even though the symptom checker results do not represent a diagnosis, the importance of accurate results as generated by a symptom checker is paramount to its functionality. Other outcome parameters such as the ones suggested by Millen et al [1] are poorly measurable, harbor the potential for bias, and lack comparability.

Abbreviations

AI: artificial intelligence

Footnotes

Conflicts of Interest: All authors are employees of Symptoma GmbH. JN holds shares in Symptoma.

References

1.Millen E, Gilsdorf A, Fenech M, Gilbert S. Screening Tools: Their Intended Audiences and Purposes. Comment on “Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study”. J Med Internet Res. 2021 May;23(5):e26148. doi: 10.2196/26148. https://www.jmir.org/2021/5/e26148. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Munsch N, Martin A, Gruarin S, Nateqi J, Abdarahmane I, Weingartner-Ortner R, Knapp B. Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study. J Med Internet Res. 2020 Oct 06;22(10):e21299. doi: 10.2196/21299. https://www.jmir.org/2020/10/e21299/ [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Semigran HL, Linder JA, Gidengil C, Mehrotra A. Evaluation of symptom checkers for self diagnosis and triage: audit study. BMJ. 2015 Jul 08;351:h3480. doi: 10.1136/bmj.h3480. http://www.bmj.com/lookup/pmidlookup?view=long&pmid=26157077. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Nateqi J, Lin S, Krobath H, Gruarin S, Lutz T, Dvorak T, Gruschina A, Ortner R. [From symptom to diagnosis-symptom checkers re-evaluated : Are symptom checkers finally sufficient and accurate to use? An update from the ENT perspective] HNO. 2019 May;67(5):334–342. doi: 10.1007/s00106-019-0666-y. [DOI] [PubMed] [Google Scholar]
5.Semigran Hannah L, Levine David M, Nundy Shantanu, Mehrotra Ateev. Comparison of Physician and Computer Diagnostic Accuracy. JAMA Intern Med. 2016 Dec 01;176(12):1860–1861. doi: 10.1001/jamainternmed.2016.6001. [DOI] [PubMed] [Google Scholar]

[ref1] 1.Millen E, Gilsdorf A, Fenech M, Gilbert S. Screening Tools: Their Intended Audiences and Purposes. Comment on “Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study”. J Med Internet Res. 2021 May;23(5):e26148. doi: 10.2196/26148. https://www.jmir.org/2021/5/e26148. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref2] 2.Munsch N, Martin A, Gruarin S, Nateqi J, Abdarahmane I, Weingartner-Ortner R, Knapp B. Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study. J Med Internet Res. 2020 Oct 06;22(10):e21299. doi: 10.2196/21299. https://www.jmir.org/2020/10/e21299/ [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref3] 3.Semigran HL, Linder JA, Gidengil C, Mehrotra A. Evaluation of symptom checkers for self diagnosis and triage: audit study. BMJ. 2015 Jul 08;351:h3480. doi: 10.1136/bmj.h3480. http://www.bmj.com/lookup/pmidlookup?view=long&pmid=26157077. [DOI] [PMC free article] [PubMed] [Google Scholar]

[ref4] 4.Nateqi J, Lin S, Krobath H, Gruarin S, Lutz T, Dvorak T, Gruschina A, Ortner R. [From symptom to diagnosis-symptom checkers re-evaluated : Are symptom checkers finally sufficient and accurate to use? An update from the ENT perspective] HNO. 2019 May;67(5):334–342. doi: 10.1007/s00106-019-0666-y. [DOI] [PubMed] [Google Scholar]

[ref5] 5.Semigran Hannah L, Levine David M, Nundy Shantanu, Mehrotra Ateev. Comparison of Physician and Computer Diagnostic Accuracy. JAMA Intern Med. 2016 Dec 01;176(12):1860–1861. doi: 10.1001/jamainternmed.2016.6001. [DOI] [PubMed] [Google Scholar]

PERMALINK

Authors’ Reply to: Screening Tools: Their Intended Audiences and Purposes. Comment on “Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study”

Nicolas Munsch, MSc

Alistair Martin, DPhil

Stefanie Gruarin, MD, PhD

Jama Nateqi, MD

Isselmou Abdarahmane, MSc

Rafael Weingartner-Ortner, MSc

Bernhard Knapp, PhD

Abbreviations

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Authors’ Reply to: Screening Tools: Their Intended Audiences and Purposes. Comment on “Diagnostic Accuracy of Web-Based COVID-19 Symptom Checkers: Comparison Study”

Nicolas Munsch, MSc

Alistair Martin, DPhil

Stefanie Gruarin, MD, PhD

Jama Nateqi, MD

Isselmou Abdarahmane, MSc

Rafael Weingartner-Ortner, MSc

Bernhard Knapp, PhD

Abbreviations

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases