Abstract
The aim of this study is to show the concordance of an app-based decision support system and the diagnosis given by spinal surgeons in cases of back pain. 86 patients took part within 2 months. They were seen by spine surgeons in the daily routine and then completed an app-based questionnaire that also led to a diagnosis independently. The results showed a Cramer’s V = .711 (p < .001), which can be taken as a strong relation between the tool and the diagnosis of the medical doctor. Besides, in 67.4% of the cases, the diagnosis was concordant. An overestimation of the severity of the diagnosis occurred more often than underestimation (15.1% vs. 7%). The app-based tool is a safe tool to support healthcare professionals in back pain diagnosis.
Keywords: Decision support system, Back pain, Symptom checker, App-based decision
Introduction
Back pain is one of the leading reasons for people seeking for health care services [1]. Thus, back pain is a serious disease of increasing socio-economic importance [2–5]. Globally, especially low back pain is the main cause of years lived with disability [3]. Every age group is affected by back pain. The incidence of low back pain is 60–90%, and low back pain is the main cause of working disability in most countries [6–10]. Also neck pain is in increasing problem with incidences of between 10.4% and 71.5%, and an annual prevalence varying between 30 and 50% [7, 11–16]. Because of these enormous costs for the health care system, more and more attempts are made to support health care providers with a computerized decision support systems (DSS). The DSS can be considered as one specific form of a symptom checker [10]. The study group of the authors has already developed and published a first use of a computerized algorithm which is based on the next best questions to find out the most possible diagnosis [10]. Also, other institutions use these systems. Already in 2015, Semigran et al. examined 23 symptom checkers for self-diagnoses and triage using 45 standardized patients. The results showed that overall the correct diagnosis was given in 34% of the patients and the correct diagnosis under the top 20 diagnoses was given in 58% [17]. A study in 2018 showed significant medium correlation between the DSS and the medical recommendation. In 49.6% the diagnoses were concordant [10].
The aim of the present study was to redesign the questions and algorithm to improve the concordance between medical diagnosis given by a spinal surgeon and the algorithm tool.
Methods
This non-randomized unblinded correlational study included male and female patients with back pain who visited the Department of Orthopaedics of the University Medical Centre Regensburg between September and November 2020. Inclusion criteria were: consultation because of back pain, German language to understand the questions. Exclusion criteria were missing consent or patients who were not able to take part because of other medical reasons. The study was approved by the Ethics Commission of the University of Regensburg (21.08.18, 18-1007-121, DRKS DRKS00012467) and carried out in accordance with the approved guidelines of the Helsinki Declaration of 1975. A written informed consent was obtained from all study participants. Participation in this study was voluntary.
Patients
After fulfilling the inclusion criteria, 89 patients were included, of whom 86 completed the study. Three patients had to be excluded due to technical problems, so they were not able to complete the questionnaire. From the remaining 86 patients, 40 were female and 46 males. (Table 1).
Table 1.
Women (N = 40) | Men (N = 46) | |
---|---|---|
Age (mean (SD), Min–Max) | 50.90 (± 16.83), 21–84 | 49.39 (15.89), 19–77 |
Weight (mean (SD), Min–Max) | 75.13 (20.23), 52–160 | 88.96 (14.57), 54–123 |
Height (mean, (SD), Min–Max) | 164.38 (17.71), 62–184 | 179.65 (8.21), 162–198 |
App-based questionnaire for back pain
The algorithm of the app-based tool was developed in cooperation with members of Digital Health, medical doctors and psychologists. The idea for the tool algorithm is an algorithm that reflects the spinal surgeons´ way of thinking while he is taking the anamneses. First the algorithm was written down as a big decision tree. Then an app-based question tool was taken, and the questions were used within this frame setting to facilitate the evaluation. Thus, the algorithm and not the tool itself was subject of investigation.
For safety reasons the typical red flag questions have to be answered all with “no”, otherwise the patient is sent directly to an emergency room. Then the patient chooses the area of pain (cervical, thoracic, lumbar, S-I-Joint).
After that two to five dichotomic questions have to be answered with yes or no. These “decision” questions lead to the most likely diagnosis. This diagnosis is then reassured by a block of questions, which are sensitive for the supposed diagnosis. If more than 65% of the answers are “yes”, the diagnosis is confirmed. If not, the second most likely diagnosis is evaluated, by again a block of sensitive questions.
If these are answered with “yes” in more than 65%, this diagnosis is taken, if not the patient it advised to go to see a doctor because a diagnosis cannot be found for certain (“no diagnosis”). See in Table 2 an overview of all possible diagnoses.
Table 2.
Buttock | Lumbar spine | Thoracic spine | Cervical spine | Whole spine |
---|---|---|---|---|
Iliosacral joint block Sacroilitis |
Ankylosing spondylitis Facet joint arthritis Herniated disc Low back pain Spondylodiscitis Osteoporotic vertebra fracture Spinal stenosis Spondylolisthesis |
Ankylosing spondylitis Osteoporotic vertebra fracture Spondylodiscitis Thoracic block |
Cervicocephalic syndrome Herniated disc Myelogelosis Spinal stenosis |
Fibromyalgia Psychologically induced back pain |
Procedure
The study was conducted in the Department of Orthopaedics of a University Medical Centre. All patients were seen by one of two spinal surgeons. Each examination took between 10 and 20 min. The app-based form was filled either before the appointment or directly before the consultation while waiting by the patient itself within a maximum of 5 min. The order was chosen by chance for a smooth integration in the daily clinical routine. There were two experienced consultant spine surgeons of the same experience working together for 10 years, which means that their thinking and decisions are comparable. They were blinded in that way, that they did not have any knowledge of the result of the tool. Thus, both procedures (tool and medical examination) were treated independently.
After a first pilot study of 20 patients, the technical setting was improved using a better tablet. Some mistakes in linking the questions within the algorithm were fixed and the present study started.
Statistical analysis
To calculate the number of necessary cases, the effect size was taken from our former study [10]: With an effect size of phi = 0.717, an error probability of 5% and a power of 95%, the calculated number of cases is 86 [18].
The scored diagnoses were given by the spinal surgeons manually and by the tool as saved data. At first, the correlation between the diagnoses of the tool and the medical diagnosis were calculated. Because data were nominal scaled, and both variables have more than two characteristics, Cramers V was used. All diagnoses were described in the result section. Furthermore, the combination between the medical and the computerized diagnoses were presented for every single diagnosis. A possible difference in the frequency was statistically investigated with the Chi-square test.
“Overestimation” means, that the tool´s diagnosis is more severe than the spinal surgeon´s diagnosis. E.g., the tool says herniated disc and the spinal surgeon says unspecific low back pain. The diagnoses facet joint arthritis, low back pain and Iliosacral joint block were regarded as equal as there is nearly the same first line conservative treatment.
Results
Relation between the tool and the diagnosis of the spinal surgeon:
Table 3 shows descriptively the relation of the single diagnoses given by the tool and the medical doctor.
Table 3.
Diagnosis tool | |||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 11 | 12 | 13 | Total | |
Diagnosis spine surgeon | |||||||||||||
1 | 3 | 0 | 0 | 2 | 0 | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 10 |
2 | 1 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 10 |
3 | 0 | 2 | 8 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 3 | 0 | 15 |
4 | 1 | 0 | 0 | 3 | 0 | 0 | 3 | 0 | 0 | 0 | 2 | 0 | 9 |
5 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
6 | 0 | 0 | 0 | 0 | 0 | 13 | 0 | 0 | 0 | 0 | 0 | 0 | 13 |
7 | 0 | 0 | 0 | 0 | 0 | 0 | 10 | 0 | 0 | 0 | 0 | 0 | 10 |
8 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 4 | 0 | 0 | 2 | 2 | 10 |
10 | 4 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 6 |
11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 |
13 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
Total | 9 | 12 | 8 | 6 | 2 | 13 | 19 | 4 | 2 | 1 | 8 | 2 | 86 |
1 = facet joint arthritis, 2 = spinal stenosis, 3 = herniated disc, 4 = low back pain, 5 = osteoporotic vertebra fracture, 6 = Iliosacral joint block, 7 = thoracic block, 8 = cervical myelogelosis, 9 = spondylodiscitis, 10 = spondylolisthesis, 11 = fibromyalgia, 12 = no diagnosis, 13 = cervicocephalic syndrome
Statistically, a significant relation between the tool and the diagnosis of the medical doctor could be carved out, Cramer’s V = 0.711, p < 0.001, which can be taken as a strong relationship. This relation holds true, if the data were calculated separately for women, Cramer’s V = 0.720, p < 0.001 and men, Cramer’s V = 0.780, p < 0.001.
In 67.4% of the cases the diagnosis between the medical doctors and the DSS were concordant, in 32.6% they were discordant. The difference of the frequencies was statistically significant, χ2 (1, N = 86) = 10.47, p < 0.001).
Analysing the over- and underestimation (fusing the diagnoses: facet joint arthritis, low back pain and Iliosacral joint block) the results showed that in 77.9% of the cases the diagnoses were concordant, in 15.1% they were overestimated, and in 7% they were underestimated by the DSS. The frequencies of the categories were statistically significant, χ2 (2, N = 86) = 77.74, p < 0.001); however, the difference of the frequencies between the categories “overestimation” and “underestimation” was not significant, χ2 (1, N = 19) = 2.58, p = 0.108).
Discussion
The hypothesis that there is a correlation between the diagnoses of the spinal surgeons and the clinical decision support tool holds true. In comparison to former studies, the correlational effects are strong.
Since ages symptom checkers are part of scientific work with the aim to unify and to simplify clinical diagnosis finding or to support patients identifying their problems themselves [19, 20]. In addition, the consortium selfBACK introduced a protocol to be used by patients themselves to promote self-management of low back pain (LBP) [21, 22].
Despite all these efforts only few studies were published in the field of back pain in the last years. In the present study 67.4% of the diagnoses between the medical doctors and the DSS were concordant, analysing the over- and underestimation the results even showed that 77.9% of the diagnoses were concordant.
That means that e.g., pain because facet joint arthritis or low back pain was counted equally. Only 7% of the diagnoses were underestimated by the patients. But none of them was an urgent case. E.g., the tool recognized low back pain and the spinal surgeons diagnosed a spondylolisthesis without surgical indication.
In comparison to other studies the concordance is really strong. In 2015 Semigran et al. evaluated 23 symptom checkers for self-diagnoses and triage [17]. For this evaluation 45 standardized patient vignettes were formed. 15 were each assigned to a group. Group 1 needed immediate help, group 2 need no immediate help and group 3 only required exercises on their own. Overall in the 23 symptom checkers, only in 34% the correct diagnosis was given [17]. In another study by Bison et al., the patients filled the symptom checker and then identify the cause of their knee pain themselves in a list of 2–15 diagnoses. This only worked in 58% of the cases [23].
Further aim of this should not be to replace the medical doctor, but to help patients to have an initial assessment of their symptoms, decision making in outpatient centres or patient telephone hotline. In addition, it could help to triage in emergency rooms. The assessment only takes about 5 min. So, the tool can be used in waiting rooms as stand-alone tablet or on telephone hotlines by medical staff.
Due to the different treatment methods and individual decisions, treatment recommendations were deliberately avoided when developing this tool. However, diagnosis-specific physiotherapy exercises are recommended to patients.
In our former study, it is noticeable that there was a minor number of psychiatric diagnoses, a result which was not in line with the literature because depression and anxiety is related to back pain as well as avoidance behaviour [10, 17, 24]. In the algorithm of this study, only chronic pain syndrome and fibromyalgia are possible diagnoses.
Despite the growing digitalization in all fields, this study is the first for 2 years to evaluate a specific kind of symptom checker for back pain.
Limitations
There are some limitations to the study. Only spinal surgeons who work at the same hospital were included for the clinical examination, which means that most patients have had some treatment before. Next, outpatient centres should be included, as there is usually the first contact to the patient and the doctors are general practitioners and not spine surgeons. Another limitation is that psychiatric diagnoses are not evaluated.
Conclusion
This study showed strong correlation between the diagnoses by spinal surgeons and the clinical decision support system in back pain patients. In a next step it should be evaluated, if the tool can help to reduce the time that passes before a successful treatment is achieved for those patients.
Acknowledgements
We are grateful to Saire Imser and Ramon Rodriguez for their technical assistance.
Author contributions
AB, LCP, JG, FR and PJ made substantial contributions to the conception and design of the study. AB, LCP, PJ and JG participated in the acquisition of data, analysis and statistics. All authors made contributions to the interpretation of data and have been involved in drafting the manuscript. All authors read and approved the final manuscript.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Data availability
The data used to support the findings of this study are available from the corresponding author upon request.
Declarations
Conflict of interest
The development of the decision support system was financially sponsored by Allianz Digital Health GmbH. As it can be seen in the affiliations, one author is employed by this company and was therefore not involved in statistics or data acquisition. The authors declare that they have no further competing financial or non-financial interests.
Ethical approval
The study was approved by the Ethics Commission of the University of Regensburg and carried out in accordance with the approved guidelines (21.08.18, 18–1007-121). Registration in Deutsche Register Klinischer Studien (DRKS), German Clinical Trials Register DRKS00012467 (WHO). All procedures conducted in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the Helsinki declaration of 1964 and its later amendments or comparable ethical standards.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Achim Benditz, Email: achim.benditz@ukr.de.
Loreto C. Pulido, Email: l.pulido@asklepios.com
Joachim Grifka, Email: j.grifka@asklepios.com.
Fabian Ripke, Email: mail@ach-im.de.
Petra Jansen, Email: petra.jansen@ur.de.
References
- 1.Melloh M, Röder C, Elfering A, Theis J-C, Müller U, Staub LP, et al. Differences across health care systems in outcome and cost-utility of surgical and conservative treatment of chronic low back pain: a study protocol. BMC Musculoskelet Disord. 2008;9:81. doi: 10.1186/1471-2474-9-81. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Ikeda T, Sugiyama K, Aida J, Tsuboya T, Watabiki N, Kondo K, et al. Socioeconomic inequalities in low back pain among older people: the JAGES cross-sectional study. Int J Equity Health. 2019 doi: 10.1186/s12939-019-0918-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Traeger AC, Buchbinder R, Elshaug AG, Croft PR, Maher CG. Care for low back pain: Can health systems deliver? Bull World Health Organ. 2019 doi: 10.2471/BLT.18.226050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Clark S, Horton R. Comment Low back pain : a major global challenge Series Low back pain 1. What low back pain is and why we need to pay attention. Lancet. 2018 doi: 10.1016/S0140-6736(18)30725-6. [DOI] [PubMed] [Google Scholar]
- 5.Hartvigsen J, Hancock MJ, Kongsted A, Louw Q, Ferreira ML, Genevay S, et al. What low back pain is and why we need to pay attention. Lancet. 2018 doi: 10.1016/S0140-6736(18)30480-X. [DOI] [PubMed] [Google Scholar]
- 6.Kent PM, Keating JL. The epidemiology of low back pain in primary care. Chiropr Osteopat. 2005 doi: 10.1186/1746-1340-13-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Vos T, Allen C, Arora M, Barber RM, Brown A, Carter A, et al. Global, regional, and national incidence, prevalence, and years lived with disability for 310 diseases and injuries, 1990–2015: a systematic analysis for the Global Burden of Disease Study 2015. Lancet. 2016 doi: 10.1016/S0140-6736(16)31678-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Benyamin RM, Manchikanti L, Parr AT, Diwan S, Singh V, Falco FJE, et al. The effectiveness of lumbar interlaminar epidural injections in managing chronic low back and lower extremity pain. Pain Physician. 2012 doi: 10.2307/501042. [DOI] [PubMed] [Google Scholar]
- 9.Brunner M, Schwarz T, Zeman F, König M, Grifka J, Benditz A. Efficiency and predictive parameters of outcome of a multimodal pain management concept with spinal injections in patients with low back pain: a retrospective study of 445 patients. Arch Orthop Trauma Surg. 2018 doi: 10.1007/s00402-018-2916-y. [DOI] [PubMed] [Google Scholar]
- 10.Benditz A, Faber F, Wenk G, Fuchs T, Salak N, Grifka J, et al. The role of a decision support system in back pain diagnoses: a pilot study. Biomed Res Int. 2019 doi: 10.1155/2019/1314028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Manchikanti L, Nampiaparampil DE, Candido KD, Bakshi S, Grider JS, Falco FJ, et al. Do cervical epidural injections provide longterm relief in neck and upper extremity pain? A systematic review. Pain Physician. 2015 doi: 10.4172/2165-7939.1000177. [DOI] [PubMed] [Google Scholar]
- 12.Manchikanti L, Abdi S, Atluri S. An update of comprehensive evidence-based guidelines for interventional techniques in chronic spinal pain. Part II: guidance and recommendations. Pain Physician. 2013 doi: 10.1007/s00586-015-4179-4. [DOI] [PubMed] [Google Scholar]
- 13.Côté P, Cassidy JD, Carroll L. The factors associated with neck pain and its related disability in the Saskatchewan population. Spine (Phila Pa 1976) 2000 doi: 10.1097/00007632-200005010-00012. [DOI] [PubMed] [Google Scholar]
- 14.Leboeuf-Yde C, Fejer R, Nielsen J, Kyvik KO, Hartvigsen J. Pain in the three spinal regions: The same disorder? Data from a population-based sample of 34,902 Danish adults. Chiropr Man Ther. 2012 doi: 10.1186/2045-709X-20-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Fejer R, Kyvik KO, Hartvigsen J. The prevalence of neck pain in the world population: A systematic critical review of the literature. Eur Spine J. 2006 doi: 10.1007/s00586-004-0864-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Benditz A, Brunner M, Zeman F, Greimel F, Florian V, Boluki D, et al. Effectiveness of a multimodal pain management concept for patients with cervical radiculopathy with focus on cervical epidural injections. Sci Rep. 2017 doi: 10.1038/s41598-017-08350-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Semigran HL, Linder JA, Gidengil C, Mehrotra A. Evaluation of symptom checkers for self diagnosis and triage: audit study. BMJ. 2015;351:h3480. doi: 10.1136/bmj.h3480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Faul F, Erdfelder E, Buchner A, Lang A-G. Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses. Behav Res Methods. 2009;41:1149–1160. doi: 10.3758/BRM.41.4.1149. [DOI] [PubMed] [Google Scholar]
- 19.Mork PJ, Bach K. A decision support system to enhance self-management of low back pain: protocol for the selfBACK project. JMIR Res Protoc. 2018 doi: 10.2196/resprot.9379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Druzdzel MJ, Flynn RR. Decision support systems. Underst Inf Retr Syst Manag Types Stand. 2011 doi: 10.5937/poseko1501295k. [DOI] [Google Scholar]
- 21.Rasmussen CDN, Svendsen MJ, Wood K, Nicholl BI, Mair FS, Sandal LF, et al. App-delivered self-management intervention trial self BACK for people with low back pain: Protocol for implementation and process evaluation. JMIR Res Protoc. 2020 doi: 10.2196/20308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Jarle Mork P, Bach K. Erratum: metadata correction: a decision support system to enhance self-management of low back pain: protocol for the selfback project. JMIR Res Protoc. 2019;8(1):e12180. doi: 10.2196/12180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Bisson LJ, Komm JT, Bernas GA, Fineberg MS, Marzo JM, Rauh MA, et al. How accurate are patients at diagnosing the cause of their knee pain with the help of a web-based symptom checker? Orthop J Sport Med. 2016;4:2325967116630286. doi: 10.1177/2325967116630286. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Wertli MM, Rasmussen-Barr E, Weiser S, Bachmann LM, Brunner F. The role of fear avoidance beliefs as a prognostic factor for outcome in patients with nonspecific low back pain: a systematic review. Spine J. 2014;14:816–36.e4. doi: 10.1016/j.spinee.2013.09.036. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data used to support the findings of this study are available from the corresponding author upon request.