To the editor,
Cough carries an annual prevalence in the population of 10–33% and remains the most common reason for primary care physician visits.1,2 Gastroesophageal reflux disease (GERD) is estimated to be present in 20–40 % of Western adult populations.3 Pulmonary manifestations, such as cough, have been recognized as a potential consequence of GERD, with the prevalence of GERD-associated cough ranging from 10%−40%.4 To best guide treatment, it remains imperative to differentiate cough from underlying GERD versus non-GERD causes. As patients are increasingly health literate, it is common for them to utilize publically available symptom checker applications prior to seeing a healthcare provider5, as such, we aimed to prospectively analyze the diagnostic accuracy of symptom checkers versus doctors to accurately delineate GERD vs. non-GERD cough.
116-consecutive adult patients presented to a medicine clinic with chief complaint of “cough.” A questionnaire was designed navigating cough symptom input for the three most visited online symptom checkers (WebMD, iTriage, FreeMD)6, which included specific questions pertaining to GERD symptoms as part of each cough algorithm. Forms were completed in the office prior to seeing the provider. The provider of 20 years clinical experience (B.B.) independently completed the patient assessment on the initial visit and asked whether GERD was considered the top diagnosis for the patient’s cough symptoms. A panel of three active and equally experienced physicians used the aforementioned patient output data and ranked their top three diagnoses, and whether GERD was listed in the top three diagnoses. The three physician panel was then given the original symptom checker forms and the same patient’s initial clinician visit note, without the assessment or plan listed, and asked whether GERD was part of this differential diagnosis. Statistical comparisons utilizing proportions were then made for the following: initial clinician diagnosis (CD) versus symptom checkers, initial CD versus physician panel with symptom checker output only, and initial CD versus physician panel with symptom checker output and initial clinician office visit note.
116 patients enrolled, 66 females and 50 males. 26 of 116 patients reported GERD symptoms on the questionnaires. There were no differences in gender or age of those reporting GERD symptoms. The office physician diagnosed five patients (5 of 26; 19%) with GERD as the top diagnosis for cough, of those patients listing initial GERD symptoms on symptom checker input. Subsequent symptom checker analysis alone failed to recognize GERD in the patients the office physician diagnosed with GERD as top diagnosis or inappropriately diagnosed GERD in patients not diagnosed by the initial office physician (Table 1). Physicians utilizing just the symptom checker data did not appear to enhance diagnostic performance. Physicians given the symptom checker data plus the blinded visit note did yield mild improvement in diagnostic performance. However, significant discordance remained between in-office clinician diagnosis and symptom checker diagnostic utilization.
Table 1:
Non-GERD | GERD Diagnosis Ranking | |||
---|---|---|---|---|
Comparison | Diagnosis | Rank 1 | Rank 2 | Rank 3 |
Symptom Checker (SC) | ||||
WebMDa | 102 (98) | 7 (0) | 4 (0) | 3 (1) |
iTriagea | 116 (111) | 0 (0) | 0 (0) | 0 (0) |
FreeMDa | 116 (111) | 0 (0) | 0 (0) | 0 (0) |
Doctor + SC | ||||
Doc 1b | 113 (108) | 0 (0) | 1 (0) | 2 (0) |
Doc 2b | 95 (94) | 1 (0) | 9 (3) | 11 (1) |
Doc 3b | 107 (104) | 0 (0) | 2 (1) | 7 (1) |
Doctor + SC + Office Note | ||||
Doc 1Ac | 100 (97) | 2 (1) | 10 (1) | 4 (0) |
Doc 2Ac | 85 (85) | 4 (3) | 10 (2) | 17 (0) |
Doc 3Ac | 110 (107) | 1 (0) | 1 (1) | 4 (1) |
N=116 unique patients. The numbers in the parentheses are the results consistent with the Clinical Diagnosis (CD) by the physician seeing the patient in the initial office visit (the gold standard for comparison). For example for DOC2A, there are 85 patients not diagnosed as GERD and all are consistent with CD results (=0). DOC2A ranked four patients as “1”, three of them were also ranked as “1” by CD. DOC2A ranked 10 patients as “2” while 2 of them were diagnosed as “1” by CD. DOC2A ranked 17 patients as “3” and none of them were diagnosed as “1” by CD.
The clinic physician’s top diagnosis for cough yielded the following diagnosis: asthma, bronchitis, heart failure, chronic obstructive pulmonary disease, influenza, pneumonia, sinusitis, upper respiratory infection, GERD
Patient symptom checker diagnosis versus initial clinical provider diagnosis.
Doctor panel review of patient symptom checker symptoms versus initial clinical provider diagnosis.
Doctor panel review of patient symptom checker symptoms + initial clinical provider visit note (sans assessment and plan) versus initial clinical provider diagnosis.
CD: Clinical Diagnosis
GERD: Gastroesophageal Reflux Disease
Symptom checkers cannot reliably delineate GERD versus non-GERD cough in patients presenting with a chief complaint of cough in an ambulatory medicine clinic setting. Symptom checker and adjunct provider discrepancies also persisted, as some physician panels did not confirm a diagnosis of GERD from the initial clinical encounter, while others believed GERD (not in agreement with initial provider) to be the etiology of patient presenting with cough. The inability of the panel physicians, utilizing symptom checker input and clinic notes, to perform on par with the initial physician seeing the patient may stem from invaluable aspects of the in-person clinical encounter.
Key limitations included our use of a small physician panel in one medicine practice, precluding generalization of these findings across different settings. The primary analysis did not include a control group, assessing physician versus symptom checker diagnostic performance for other etiologies of cough. Real-life, complex, non-textbook patient cases in this analysis may account for poor symptom checker algorithmic diagnostic output, something prior studies have not accounted for6–8, and potentially affecting our use of symptom checkers to help delineate presenting symptoms.
The in-person clinical encounter carriers the potential to ascertain and interpret the patient’s symptoms conveyed in vague and non-buzzword terms, something online symptom checkers still need improvement in. Clinical encounters also carry the ability to correlate patient’s non-verbal physical examination cues, such as a grimace or body language. Physicians are also more likely to obtain more information than they end up documenting, as they may fail to document pertinent negative findings. Both sensitivity and specificity are likely enhanced by clinical encounters versus symptom checkers or retrospectively utilizing clinic notes. Certain diagnoses may carry similar diagnostic potential as symptom checkers, but the overall trend appears to not be the case.
Innovative, open-source, global patient databases, such as The Human Diagnosis Project9, may help refine symptom checker diagnostic algorithms by utilizing large-scale patient data to better incorporate the real-life, complex patient. Open-source systems with clinical testing and authentication may allow a more “real-life” scenario of online symptom checkers, closer mimicking a real-time clinical encounter. With such projects, we may better be able to delineate patient presenting symptoms—such as differentiating the ever-elusive GERD versus non-GERD cough—to better guide treatment strategies.
Acknowledgments
The funding bodies played no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; or preparation, review, or approval of the manuscript. All authors made substantial contributions to the conception, design and interpretation of data, and writing of the manuscript and gave final approval of the version to be published. Dr. AC Berry had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis and is the article guarantor.
Footnotes
Statistical analysis reported in this publication by Dr. Mulekar and Dr. Wang, Department of Mathematics and Statistics was supported by the National Center for Advancing Translational Sciences of the National Institutes of Health under award number UL1TR001417.
None of the authors have financial or proprietary interest in the subject matter of this article.
The article has not been submitted elsewhere or published in its entirety previously. An abstract in brief form has been published in a supplement issue from presentation at the World Congress of Gastroenterology at ACG 2017. October 13–18, 2017. Orlando, FL. Poster was selected as Presidential Award winner. Berry AC, Berry NA, Wang B, Mulekar M, Melvin A, Berry BB. Symptom Checkers vs. Doctors: A Prospective, Head-to-Head Comparison for GERD vs. Non-GERD Cough. American Journal of Gastroenterology 2017; 112: S190. doi:10.1038/ajg.2017.299.
References:
- 1.Laux G, Rosemann T, Körner T, et al. Detaillierte Erfassung von Inanspruchnahme, Morbidität, Erkrankungsverläufen und Ergebnissen durch episodenbezogene Dokumentation in der Hausarztpraxis innerhalb des Projekts CONTENT. Gesundheitswesen. 2007;69:284–291 [DOI] [PubMed] [Google Scholar]
- 2.Chung KF, Pavord ID. Prevalence, pathogenesis, and causes of chronic cough. Lancet. 2008;371:1364–1374. [DOI] [PubMed] [Google Scholar]
- 3.Cohen E, Bolus R, Khanna D, et al. GERD symptoms in the general population: prevalence and severity versus care-seeking patients. Dig Dis Sci. 2014. October;59(10):2488–9246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Irwin RS, Richter JE. Gastroesophageal reflux and chronic cough. Am J Gastroenterol. 2000;95:S9–14. [DOI] [PubMed] [Google Scholar]
- 5.Fox S, Duggan M. Health Online 2013. Internet and American life project. Pew Research Center and California Health Care Foundation, 2013:4. [Google Scholar]
- 6.Semigran HL, Linder JA, Gidengil C, Mehrotra A. Evaluation of symptom checkers for self diagnosis and triage: audit study. BMJ. 2015;351:h3480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Semigran HL, Levine DM, Nundy S, Mehrotra A. Comparison of Physician and Computer Diagnostic Accuracy. JAMA Intern Med. 2016. December 1;176(12):1860–1861. [DOI] [PubMed] [Google Scholar]
- 8.Fraser HSF, Clamp S, Wilson CJ. Limitations of Study on Symptom Checkers. JAMA Intern Med. 2017. May 1;177(5):740–741. [DOI] [PubMed] [Google Scholar]
- 9.Abbasi J Shantanu Nundy, MD: The Human Diagnosis Project. JAMA. 2018. January 23;319(4):329–331. [DOI] [PubMed] [Google Scholar]