JMIR mHealth and uHealth. 2021 Apr 19;9(4):e26471. doi: 10.2196/26471

Assessing the Quality of Mobile Health-Related Apps: Interrater Reliability Study of Two Guides

Jordi Miró 1,✉,#, Pere Llorens-Vernet 1,#
Editor: Lorraine Buis
Reviewed by: Winfried Schlee, Lei Guo
PMCID: PMC8094021  PMID: 33871376

Abstract

Background

There is a huge number of health-related apps available, and the numbers are growing fast. However, many of them have been developed without any kind of quality control. In an attempt to contribute to the development of high-quality apps and enable existing apps to be assessed, several guides have been developed.

Objective

The main aim of this study was to examine the interrater reliability of a new guide, the Mobile App Development and Assessment Guide (MAG), and compare it with one of the most widely used guides in the field, the Mobile App Rating Scale (MARS). We also examined whether the interrater reliability of the measures is consistent across multiple types of apps and stakeholders.

Methods

In order to study the interrater reliability of the MAG and MARS, we evaluated the 4 most downloaded health apps for chronic health conditions in the medical category of iOS and Android devices (ie, App Store and Google Play). A group of 8 reviewers independently evaluated the quality of the apps using the MAG and MARS. The group included different types of stakeholders (clinical researchers, engineers, health care professionals, and end users as potential patients), chosen as representative of the individuals most knowledgeable about and interested in the use and development of health-related apps. To study the interrater reliability, we calculated the Krippendorff alpha for every category in the 2 guides, for each type of reviewer and every app, separately and combined.

Results

Only a few categories of the MAG and MARS demonstrated high interrater reliability. Although the MAG was found to be superior, there was considerable variation in the scores between the different types of reviewers. The categories with the highest interrater reliability in the MAG were “Security” (α=0.78) and “Privacy” (α=0.73). In addition, 2 other categories, “Usability” and “Safety,” came close to the threshold for acceptable agreement (health care professionals: α=0.62 and α=0.61, respectively). The total interrater reliability of the MAG (ie, for all categories) was 0.45, whereas the total interrater reliability of the MARS was 0.29.

Conclusions

This study shows that some categories of MAG have significant interrater reliability. Importantly, the data show that the MAG scores are better than the ones provided by the MARS, which is the most commonly used guide in the area. However, there is great variability in the responses, which seems to be associated with subjective interpretation by the reviewers.

Keywords: mHealth; mobile health; mobile apps; evaluation studies; rating; interrater reliability; MARS; MAG

Introduction

In recent years, there has been an explosion of interest in the use of mobile devices (eg, smartphones, tablets) [1], alongside huge advances in the development of health-related mobile apps [2]. For example, a total of 325,000 different health-related apps have recently been reported to be available [3]. There are mobile apps for virtually all kinds of health conditions: for example, chronic pain [4,5], cancer [6], diabetes [7], and cardiovascular diseases [8]. This growth has brought considerable benefits not only to patients but also to society at large and at multiple levels. For example, health-related apps help to (1) improve treatment management, (2) facilitate patient-doctor communication, (3) monitor the patient's condition in real time, and (4) improve accessibility to treatment [9-12]. However, there are also a number of caveats, mostly related to the somewhat unsupervised and unregulated nature of the process. It has been suggested that the fact that the field is evolving without much scientific support or guidance [13] not only acts as a barrier to improvement [14] but also, and more importantly, can potentially put an individual’s health at risk [15]. Some of the main problems related to health apps are (1) faulty reminders that make proper treatment follow-up difficult (eg, the instructions on when to do an activity or take medication are not correct [16]); (2) lack of health expert involvement [17]; (3) inappropriate responses to consumer needs (eg, bipolar disorder apps failing to provide any response when asked about extreme mood swings or suicidal ideation [18]); and (4) incorrect medication doses (eg, incorrect calculation of insulin dose from blood glucose values [19]).

In order to overcome the issues health-related apps are facing, some rating scales and guides have been developed (eg, [20,21]). One of the first was the Mobile App Rating Scale (MARS) [22], which is one of the most widely used rating scales for measuring the quality of health-related apps [23-27]. However, the MARS was created from a narrow perspective [28-30]: it was based on an analysis of studies of existing mobile apps and left out information from other relevant sources (eg, standards governing the design of software for medical devices).

Recently, the Mobile App Development and Assessment Guide (MAG) [13] was created to address the problems observed in the available guides, which do not cover current key concerns such as privacy and security, and to help assess health-related apps and guide stakeholders in the development of new quality apps. The MAG was developed using data from all potentially relevant sources and a representative sample of the guidelines, frameworks, and standards in the field of health app development. The MAG has been acknowledged as a good-quality guide by an international and interdisciplinary group of stakeholders [31].

These guides are important in the field because they provide quality scores that are key to identifying the best apps available and distinguishing them from poorly designed ones. However, there are few data on the comparative value and consistency of the small number of guides available. The field would benefit considerably from studies of the guides that are used to develop new apps and to comparatively assess the quality of existing ones.

The main objective of this research was to study and compare the MAG and MARS. More specifically, we aimed to compare the interrater reliability of the 2 measures. We also focused on whether the interrater reliability of the measures is consistent across multiple types of apps and stakeholders.

Methods

App Selection Process

In order to evaluate the interrater reliability of the MAG and MARS across different types of apps, we evaluated the top 4 search results for chronic health conditions in the medical category of the Apple and Android stores (ie, App Store and Google Play, respectively). The search and selection of the apps were conducted in October 2020.

The inclusion criteria were as follows: the app had to be focused on a chronic health condition, be in English or Spanish, and be free to download. We selected chronic health conditions because this is one of the domains in which health apps are becoming most relevant (56% of health apps are intended for this kind of patient [32]). Reports by governmental agencies indicate that chronic health conditions are a major health problem that affects 31% of the population [33-36]. In addition, chronic health conditions are the leading cause of death and disability in both the developed and developing world according to the Global Burden of Disease study. The most important chronic health conditions are low back pain and headache, neoplasms, diabetes and kidney diseases, and cardiovascular diseases [37-40]. We used the following search terms, which are related to the top 4 chronic health conditions in the Global Burden of Disease study [41]: “pain,” “cancer,” “diabetes,” and “cardiovascular.” In this search, we identified 886 apps and excluded 265 that were not related to any of the 4 health conditions of interest. Finally, we selected the 4 most downloaded apps (1 for each chronic health condition), which we then used in this study.

App Evaluation Process

The apps were rated by 8 reviewers during October and November 2020. The reviewers were a group of stakeholders that included clinical researchers, engineers, health care professionals, and end users as potential patients. These groups of stakeholders were identified as representative of the individuals who would be most knowledgeable about and interested in the use and development of health-related apps. The individuals in the “end users/potential patients” and “health care professionals” groups were identified and approached by the authors at the university hospital (while there for a health checkup or while at work, respectively). The individuals in the “clinical researchers” and “engineers” groups were professors or technicians working at the university. Only individuals who agreed to participate and reported having experience in the use of smartphones and health apps were selected. All individuals approached were included. Reviewers received (1) the list of apps, (2) a survey including the items of the MAG and MARS to be evaluated, and (3) specific instructions on how to proceed with the review and evaluation of the apps. To avoid potential interference and help reviewers work independently, and in line with similar studies (eg, [42]), they were not given any other suggestions, indications, or training about the procedure.

For the evaluation, all reviewers downloaded and installed the apps on their personal mobile devices. They then reviewed each of the apps using the specific criteria in the MAG and MARS. In their assessment, the reviewers were instructed to take into account only the content and information provided within the app itself and the stores (ie, App Store and Google Play). This included websites, scientific studies, and other external references as long as they were suggested or mentioned explicitly within the app or the stores. In line with similar procedures [42], the reviewers did not receive any specific training, and although they spent several minutes examining the apps, they were not instructed to use them under realistic conditions. The objective of this procedure was for the reviewers to evaluate the apps in the same way as experts who do not themselves need the apps would.

The MAG [31] has 48 items grouped into 8 categories or domains: usability, privacy, security, appropriateness and suitability, transparency and content, safety, technical support and updates, and technology. The reviewers used each of the items in the categories to assess the quality of the apps and checked whether the apps had the corresponding characteristics and functions (1=yes; 0=no).

The MARS [22] has 23 items that are grouped into 5 categories: engagement, functionality, aesthetics, information quality, and subjective quality. It also has 6 items that are app-specific and can be adapted to include or exclude specific information on the topic of interest. For example, these items have been used to assess the perceived effects on the user’s knowledge, attitudes, and intentions to change as well as the likelihood of changing the identified targeted behaviors in a study of mobile apps supporting heart failure symptom monitoring and self-care management [23]. In this study, we discarded these app-specific items. When using the MARS, the reviewers used each of the items to assess the quality of the apps and scored them using a 5-point rating scale (1=inadequate, 2=poor, 3=acceptable, 4=good, 5=excellent).
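To make the two scoring schemes concrete, the following Python sketch organizes hypothetical ratings for a single app: binary yes/no values for MAG items and 1-5 ratings for MARS items, grouped by category so that agreement can later be computed category by category (see the reliability sketch in the Data Analysis subsection). The reviewer labels, item keys, and values are invented for illustration and are not the actual MAG or MARS items or data.

```python
# Hypothetical per-reviewer ratings for one app, grouped by guide and category.
# MAG items are scored 1 (yes) / 0 (no); MARS items are scored on a 1-5 scale.
# Item keys are placeholders, not the actual wording of either instrument.
mag_ratings = {
    "Privacy": {
        "privacy_policy_item": {"r1": 1, "r2": 1, "r3": 0},
        "informed_consent_item": {"r1": 1, "r2": 1, "r3": 1},
    },
    "Security": {
        "password_management_item": {"r1": 0, "r2": 0, "r3": None},  # None = unanswered
    },
}

mars_ratings = {
    "Engagement": {
        "entertainment_item": {"r1": 4, "r2": 3, "r3": 4},
        "interactivity_item": {"r1": 3, "r2": 3, "r3": 2},
    },
}


def units_for_category(ratings, category):
    """Return one rating unit per item in a category, each unit being a
    {reviewer: value} dict, the shape used by the reliability sketch below."""
    return list(ratings[category].values())
```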

Data Analysis

In order to study and compare the interrater reliability of the MAG and MARS, we calculated the Krippendorff alpha [43,44] for every category in the 2 guides, for each kind of reviewer and every app, separately and combined. The Krippendorff coefficient has been found to be superior to the Cohen coefficient and can be used with any number of reviewers [45-47]. An alpha >0.667 has been identified as showing acceptable agreement [44]; therefore, in this study, we used this figure as the minimum level showing agreement. A negative alpha indicates that agreement was less than could be expected by chance. All data analyses were performed with SPSS v.26 for Windows using the KALPHA macro [48].
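As an illustration of the computation, here is a minimal Python sketch of Krippendorff's alpha for nominal data with missing ratings, which is the situation that arises with the binary MAG items when a reviewer leaves an item unanswered. The actual analysis was run in SPSS with the KALPHA macro [48]; the function and example data below are only illustrative, and the 5-point MARS ratings would call for an ordinal or interval distance function rather than the nominal one used here.

```python
# Minimal sketch of Krippendorff's alpha for nominal ratings with missing
# values, based on the coincidence-matrix formulation (alpha = 1 - Do/De).
# Illustrative only; the study's analysis used the SPSS KALPHA macro.
from collections import Counter
from itertools import permutations
import math


def krippendorff_alpha_nominal(units):
    """units: list of rating units (eg, items), each a {reviewer: value} dict;
    missing ratings are None. Returns alpha, or NaN if it is undefined."""
    coincidences = Counter()  # o_ck: weighted ordered value pairs within units
    for unit in units:
        values = [v for v in unit.values() if v is not None]
        m = len(values)
        if m < 2:
            continue  # a unit rated by fewer than 2 reviewers carries no information
        for c, k in permutations(values, 2):
            coincidences[(c, k)] += 1.0 / (m - 1)

    marginals = Counter()  # n_c: total coincidences involving value c
    for (c, _k), count in coincidences.items():
        marginals[c] += count
    n = sum(marginals.values())
    if n <= 1:
        return math.nan

    # Nominal distance: 0 for identical values, 1 otherwise
    observed = sum(count for (c, k), count in coincidences.items() if c != k)
    expected = sum(marginals[c] * marginals[k]
                   for c in marginals for k in marginals if c != k) / (n - 1)
    if expected == 0:
        return math.nan
    return 1.0 - observed / expected


# Hypothetical example: 3 reviewers rating 4 binary MAG items (1=yes, 0=no);
# reviewer "r3" left the last item unanswered.
items = [
    {"r1": 1, "r2": 1, "r3": 1},
    {"r1": 1, "r2": 0, "r3": 1},
    {"r1": 0, "r2": 0, "r3": 0},
    {"r1": 1, "r2": 1, "r3": None},
]
print(round(krippendorff_alpha_nominal(items), 3))  # about 0.64 for these data
```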

Results

A total of 8 reviewers rated the 4 apps using the MAG and MARS guides. The mobile apps included in the analysis were “Manage My Pain” (ie, pain), “BELONG Beating Cancer Together” (ie, cancer), “mySugr - Diabetes App & Blood Sugar Tracker” (ie, diabetes), and “ASCVD Risk Estimator Plus” (ie, cardiovascular diseases).

The group of reviewers included 2 clinical researchers, 2 engineers, 2 health care professionals, and 2 end users as potential patients. Reviewers’ ages ranged from 24 to 40 years, with an equal distribution of women and men. The clinical researchers, engineers, and health care professionals had been involved in the development of health-related apps, but not in any of the apps or guides used in this study (ie, they had no conflicts of interest). All reviewers were highly educated (all had completed university studies) and were experienced smartphone and mobile app users.

Complete responses were provided for almost all criteria and apps, although a small number of criteria showed a percentage of data completeness that ranged from 78% to 97% (eg, “It has password management mechanisms”; see Multimedia Appendix 1). Tables 1 and 2 show the interrater reliability coefficients by categories and overall for both guides.

Table 1.

Interrater reliability scores when reviewers used the Mobile App Development and Assessment Guide (MAG).

Category | Clinical researchers | Engineers | Health care professionals | End users | Aggregate
Usability | 0.28 | 0.28 | 0.62 | 0.45 | 0.38
Privacy | 0.36 | 0.73 | 0.42 | 0.43 | 0.45
Security | 0.18 | 0.78 | 0.76 | 0.26 | 0.47
Appropriateness and suitability | 0.38 | 0 | –0.15 | 0 | 0.25
Transparency and content | 0 | 1 | –0.40 | –0.36 | 0.15
Safety | 0.59 | 0.51 | 0.61 | –0.23 | 0.33
Technical support and updates | 0.38 | 1 | 1 | 0.76 | 0.30
Technology | 0.44 | 0.45 | –0.05 | 0.45 | 0.39
Total | 0.40 | 0.66 | 0.55 | 0.29 | 0.45

Table 2.

Interrater reliability scores when reviewers used the Mobile App Rating Scale (MARS).

Category | Clinical researchers | Engineers | Health care professionals | End users | Aggregate
Engagement | 0.18 | 0.50 | 0.53 | 0.41 | 0.43
Functionality | 0.24 | 0.52 | 0.40 | –0.38 | 0.19
Aesthetics | 0.42 | 0.26 | 0.23 | –0.14 | 0.17
Information | 0.03 | 0.08 | 0.05 | –0.09 | 0.06
Subjective | 0.57 | 0.41 | –0.08 | 0.54 | 0.43
Total | 0.27 | 0.41 | 0.25 | 0.19 | 0.29

For the MAG, the reviewers’ scores for several categories complied with the criterion. The highest interrater reliability scores were for the categories “Privacy” (engineers: α=0.73) and “Security” (engineers: α=0.78; health care professionals: α=0.76). In addition, 2 other categories, “Usability” and “Safety,” came very close to the threshold (health care professionals: α=0.62 and α=0.61, respectively). The total interrater reliability of the MAG (ie, for all categories) was 0.45 (see Table 1).

For the MARS, none of the reviewers’ scores or the aggregate scores complied with the criterion. The categories with the highest interrater reliability were “Engagement” and “Subjective,” with an overall alpha coefficient of 0.43 in both cases. The total interrater reliability of the MARS (ie, for all categories) was 0.29 (see Table 2).

Tables 3 and 4 show the interrater reliability scores for each mobile app assessed using the MAG and MARS guides. As can be seen, none of the scores complied with the criteria overall or in any category. Nevertheless, the highest interrater reliability scores were for the MAG guide.

Table 3.

Interrater reliability scores for apps when reviewers used the Mobile App Development and Assessment Guide (MAG).

Category | Manage My Pain | BELONG Beating Cancer Together | mySugr - Diabetes App & Blood Sugar Tracker | ASCVD Risk Estimator Plus
Usability | 0.58 | 0.49 | 0.27 | 0.15
Privacy | 0.47 | 0.38 | 0.28 | 0.20
Security | 0.44 | 0.18 | 0.42 | 0.32
Appropriateness and suitability | 1 | 0.42 | 0 | –0.04
Transparency and content | 0.08 | –0.08 | –0.06 | 0.00
Safety | 0 | 0.47 | 0.33 | 0.21
Technical support and updates | 0.10 | 0.57 | 0.16 | 0.10
Technology | 0.17 | 0.36 | 0.12 | 0.45
Total | 0.53 | 0.42 | 0.32 | 0.35

Table 4.

Interrater reliability scores for apps when reviewers used the Mobile App Rating Scale (MARS).

Category | Manage My Pain | BELONG Beating Cancer Together | mySugr - Diabetes App & Blood Sugar Tracker | ASCVD Risk Estimator Plus
Engagement | 0.31 | 0.24 | –0.10 | 0.18
Functionality | 0.27 | 0.05 | –0.02 | 0.16
Aesthetics | –0.05 | –0.03 | –0.07 | 0.12
Information | –0.08 | 0.08 | –0.03 | 0.09
Subjective | 0.55 | 0.44 | 0.16 | 0.14
Total | 0.20 | 0.18 | 0.01 | 0.42

A comparison of the interrater reliability between MAG and MARS is shown in Table 5. Additional supplementary information is also provided on the interrater reliability scores for each item (see Multimedia Appendix 1).

Table 5.

Interrater reliability scores of the Mobile App Development and Assessment Guide (MAG) and the Mobile App Rating Scale (MARS).

Guide and category | Reliability
MAG: Usability | 0.38
MAG: Privacy | 0.45
MAG: Security | 0.47
MAG: Appropriateness and suitability | 0.25
MAG: Transparency and content | 0.15
MAG: Safety | 0.33
MAG: Technical support and updates | 0.30
MAG: Technology | 0.39
MAG: Total | 0.45
MARS: Engagement | 0.43
MARS: Functionality | 0.19
MARS: Aesthetics | 0.17
MARS: Information | 0.06
MARS: Subjective | 0.43
MARS: Total | 0.29

Discussion

Principal Findings

This research is the first to measure the interrater reliability of the MAG [13,31]. We used the MAG to study 4 mobile health–related apps and compared the results with those obtained with the MARS [22], one of the most extensively used guides in the field.

In studies using the Krippendorff alpha, it is customary to require an alpha >0.800. However, an alpha >0.667 has been identified as indicative of acceptable agreement, and anything below that is considered unacceptable [42,44]. The data revealed that few categories reached that score and showed high interrater reliability. This finding is similar to that of other studies (eg, [26,42,46]) that have analyzed this type of guide. Taken as a whole, the findings demonstrate that it is difficult for reviewers to rate the apps in the same or a similar way. First of all, reviewers differed greatly in the amount of time they spent reviewing each app (ranging from 30 to 60 minutes). Thus, it is possible that the time spent on the review process influenced the results of the assessment. Our data did not show differences associated with the amount of time spent on the review; however, we used a small number of reviewers (n=8), so additional research on this issue is warranted. Another potential explanation for the findings is that reviewers do not interact with the apps in the same way, so the apps display different responses and functions [46]. It is therefore unlikely that reviewers will detect all app functions, which leads to differences in the ratings because they might not be assessing exactly the same items. Support for this explanation can be found in the fact that the most objective categories evaluated, those that require less subjective interpretation by the reviewer (eg, “Privacy,” “Security”), are the ones with the highest interrater reliabilities. This finding is similar to the one reported by Powell and colleagues [42], who found that the less judgment required from reviewers, the higher the reliability.

Another important finding of this study was that interrater reliability scores for the MAG were better than for the MARS. Importantly, some of the MAG categories with the highest interrater reliability are not included in the MARS (eg, “Privacy,” “Security,” “Technology”). These are issues that have grown in importance in the field in recent years.

It should also be noted that some MAG categories showed higher interrater reliability than others, but there was considerable variation in the scores between the types of reviewer. This finding suggests that the differences in the interrater reliability scores are related to individual characteristics of the reviewers, such as background or training. This could help explain, at least in part, why engineers showed the highest reliability scores in the category of “Security,” as this is an issue of key interest in the current training of engineers but not of clinical researchers. It also implies that reviewers from different backgrounds are required to assess apps and that reviewers need to be trained. However, it is also possible that the low interrater reliability scores were not only reviewer-related but also app-related. That is, although we selected the 4 most downloaded apps, they may not have been high-quality apps or easy to assess (eg, the functions or properties of the apps were not easy to find or identify). In support of this explanation, some items were not answered by all reviewers in either of the guides (eg, “It has a data recovery system in case of loss”; “It is based on ethical principles and values”). Finally, another nonexclusive explanation for these results could be related to the guides themselves (ie, the MAG and MARS). The fact that the categories that required less interpretation (eg, “Security”) were the ones with the highest interrater reliability would support this explanation. This suggests that the guides must be improved.

The differences in interrater reliability and, more importantly, the low scores found suggest an important underlying problem: it is difficult to create a good guide to help in the development and assessment of health-related apps. On the basis of the results of this study and others (eg, [42,46]), users of health-related apps should use and interpret the results of quality assessments with caution. The guides, as they are, have not been demonstrated to provide a reliable measure of the overall quality of the apps.

The assessment of the quality of health-related apps is very important. Therefore, we must continue working on improving the way assessments are conducted. This may not only require improving the available guides but also working with specialized centers and trained reviewers.

Future Research

Studies are needed to make the available guides psychometrically sound, so future research should focus on how to improve and empirically test interrater reliability. For example, studies should examine whether giving reviewers additional training is enough, or how reviewers’ knowledge and assessment skills can best be improved. They should also establish whether the quality of health-related apps should be assessed by reviewers with different qualifications, training, and backgrounds. Moreover, since subjectivity might be an issue, one area for improvement is the inclusion of clearly defined criteria in the guides. Therefore, research to determine whether understandable and well-defined criteria can improve interrater reliability above and beyond the improvement from reviewer training is warranted. Specifically in relation to the MAG, additional research with more apps of different types is also warranted. This would help ascertain whether and how different types of app influence the reviewers' evaluations. In addition, the criteria and the categories included in the guide deserve specific attention. Studies with additional samples of reviewers, including individuals with chronic health conditions, are needed to evaluate their comprehensibility and appropriateness.

Limitations

This study has a number of limitations that should be taken into account when interpreting the results. First, we studied the interrater reliability of the MAG when it was used to evaluate apps that were available for both Android and iOS. Although the apps are generally the same on both platforms, there may be small differences that influence the user’s experience or performance across platforms and devices. For example, the amount of information displayed or the position and size of some elements (eg, buttons, menus) may differ because of the size of the screen. Second, we used a very limited number of apps. We selected the most downloaded ones, as we thought they would be of better quality and therefore easier for reviewers to assess. However, they may not be high-quality apps or representative of health-related apps, and so may not be suitable for an accurate study of the interrater reliability of the guides. Third, during the period in which the apps were being assessed, they may have been updated or modified, which would have had an unknown impact on the results of the assessments. Fourth, although individuals from different groups participated, they may not be representative. Even though they were extremely knowledgeable in their respective areas, they may or may not be the best individuals to assess the quality of the apps, as none of them had received any training in app assessment or any substantial training in using the MAG or MARS. Thus, it is unclear whether the low interrater reliability is related to the instrument being used, to the lack of training provided to the raters, or both. We decided not to give specific training because we wanted to study whether the MAG and MARS can be reliably used as they are. Previous studies have also used this strategy (eg, [42]). However, future studies should examine whether training can help improve the reviewers’ assessments and the interrater reliability.

Conclusions

Despite the limitations of the study, our findings provide new and important information about the MAG. Of particular consequence is that several categories in the MAG have significant interrater reliability. In addition, the data show that the scores are better than the ones provided by the MARS, the most commonly used guide in the area.

Acknowledgments

This work was partly supported by grants from the Spanish Ministry of Economy, Industry and Competitiveness (RTI2018-09870-B-I00; RED2018-102546-T); European Regional Development Fund, the Government of Catalonia (AGAUR; 2017SGR-1321); Fundación Grünenthal (Spain), Universitat Rovira i Virgili (PFR program); and ICREA-Acadèmia. PL benefitted from a predoctoral fellowship (2019 FI_B2 00151) cofinanced by the Secretaria d’Universitats i Recerca del Departament d’Empresa i Coneixement de la Generalitat de Catalunya, the European Union, and the European Social Fund.

Abbreviations

MAG

Mobile App Development and Assessment Guide

MARS

Mobile App Rating Scale

Appendix

Multimedia Appendix 1

Interrater reliability scores and data completeness for each item.

Footnotes

Conflicts of Interest: None declared.

References


