Abstract
Objectives
To generate an algorithm for systematic development and validation of written patient information in accordance with well-established and validated psychometric and statistical methods that can be applied to different fields of medicine.
Methods
A literature search was carried out in PubMed and Google Scholar. Methods were selected and combined to an algorithm. Feasibility and practicability is tested by the development of patient education materials on “ureteral stenting”.
Results
The algorithm includes 4 study phases. After internal audit expert, readability of the first version is objectified using the Flesch Reading Ease formula. This draft is tested by a few patients performing semi-structured interviews using “The think aloud method” by Someren et al. Content validity is evaluated by a written survey by external consultants in accordance with Lawshe's “Quantitative approach to content validity”. The final leaflet is developed at a consensus meeting and validated by patients based on the Consumer Information Rating Form. The new algorithm could be tested by the development of patient education materials on “ureteral stenting” as a test run.
Conclusion
We developed an algorithm for systematic development and validation of written patient information in accordance with well-established, validated psychometric and statistical methods. This algorithm can be applied to arbitrary fields of medicine.
Keywords: Patient information, Patient education, Validation, Agorithm, Ureteral stenting
Introduction
Written patient education materials (PEMs) represent an indispensable source of patient information. As more and more patients are using the internet to supplement or replace patient-doctor communication, PEMs massively gained in importance during the latest years [1,2]. Well-formulated and well-presented healthcare information promotes health awareness, encourages self-care, and improves the effectiveness of clinical care [3,4,5,6]. Simpson et al. [7] showed that explaining and understanding patient concerns, even when they cannot be resolved, result in a significant decline in anxiety. However, in the everyday clinical setting, patients often do not receive sufficient information [8,9]. Thus, Ley [10] showed that most patients want more information about their medical conditions, available treatments, and possible outcomes of treatment.
Nevertheless, many PEMs suffer from clear limitations regarding comprehensibility and content quality [11,12]. In particular readability has been shown to exceed the levels recommended by the US National Institutes of Health [13] and the American Medical Association [14] very frequently [15,16,17]. As comprehensibility is also influenced by factors which are difficult to measure such as complexity of content, layout, images, and coloration [18,19,20], patient survey seems to be essential for validation. In contrast, a systematic assessment by experts is required to assure content quality.
The aim of our study was to generate an algorithm for systematic development and validation of written patient information in accordance with well-established and validated psychometric and statistical methods that can be applied to different fields of medicine. Morbidity associated with ureteral stenting represents a major concern in urology [21,22]. It has been shown that stent-associated morbidity can be significantly reduced by high-quality patient education and that many patients are not satisfied with the information they obtained [3]. Therefore, “ureteral stenting” served as a topic for a proof-of-principle assessment of the developed algorithm.
Material and Methods
A comprehensive literature search for articles and methods published up to July 2018 was carried out in PubMed and Google Schoolar using different combinations of the following search terms: “patient education material”, “patient information”, “validation”, “content”, “readability”, “comprehensibility”, “assessment”, “development”, “psychometric”, “statistical analysis”, “testing”, “leaflet”. We reviewed the publications found in this way, as well as the references cited in these publications.
Appropriate methods were selected and combined to an algorithm to guide through the process of initial text development, early assessment of content validity and comprehensibility, validation by patient survey and revisions that might be required after failure of validation. Feasibility and practicability of the developed algorithm was tested using the development of PEM on “ureteral stenting” as a proof-of-principle assessment. The study was approved by the local ethic committee and all participants provided written informed consent. Statistical analyses were performed in the R programming language (version 3.2.2) [23].
Results
Review
The literature search revealed several methods applicable for the development and assessment of PEMs. Methods used for the algorithm included the Flesch Reading Ease formula [24] (version adapted for German language by Amstad [25]), “The think aloud method” by Someren et al. [26], Lawshe's “Quantitative approach to content validity” [27], and the Consumer Information Rating Form (CIRF) [28]. These assessments were selected based on practicability, validity and their performance in previous studies and are described and discussed below.
Algorithm Development
The developed algorithm included 4 study phases and is illustrated in Figure 1.
Fig. 1.
Multistep process for development and validation of written patient information (dashes boxes indicate optional steps).
Phase 1: Initial Drafting of Text and Design The initial text and design are developed based on current literature available on the topic (e.g. reviews on indications, outcomes and adverse events). Particular attention should be paid to an easy comprehensibility of text and illustrations and an appealing design and layout. The initial version undergoes an internal audit by experts not involved into the initial drafting. Corrections are considered and consolidated at a first consensus meeting. Readability is then objectified using the Flesch Reading Ease formula [24]. A score between 60 and 69 points corresponding to a text with medium to rather easy understanding is defined as appropriate.
Phase 2: Initial Testing by Patients and Experts The first draft leaflet is tested by a few patients, who preferably have a history of the corresponding medical intervention or problem. Semi-structured interviews are conducted and recorded, using “The think aloud method” by Someren et al. [26].
In addition, content validity is evaluated by a written survey by external consultants in accordance with Lawshe's “Quantitative approach to content validity” [27]. Each section of the PEM can be rated as “essential”, “useful but not essential” or “not necessary”, and the content validity ratio (CVR) is calculated according to the formula CVR = (ne - N/2) / (N/2) [ne: number of experts answering with “essential”; N: total number of experts interviewed]. With a possible range between +1 and −1, positive numbers indicate that at least 50% of all experts rate a chapter as “essential”, and the total CVR is calculated as the median of all ratios. For example, with 10 experts, a total CVR of 0.62 indicates validity, while lower values require content revision. Both patients and experts can give additional suggestions and free comments.
A final version for the validation process is drafted at a second consensus meeting. Re-assessment of readability as described above is performed if relevant changes were made to the text [24].
Phase 3: Validation The 17-item questionnaire CIRF is used for the final validation process, which is performed in patients with a history of the corresponding medical intervention or problem, sufficient language knowledge, and absence of cognitive impairment. The CIRF covers the topics comprehensibility (5 items), utility (6 items) and design quality (6 items). Items concerning comprehensibility and design quality are rated on a Likert scale from 1 (bad) to 5 (good) and items regarding utility are rated on a Likert scale from 1 to 4.
Phase 4: Revision and Repetition of Validation If the required validation threshold is not met, the respective item has to be adjusted and the validation process must be repeated after adjustment. If repetition of one or multiple items is triggered, the sample size has to be adjusted as the respective threshold for the p-value, corrected for multiple testing according to Bonferroni, depending on the number of items, which need reassessment (explained in the section “Algorithm Testing” and Table 1).
Table 1.
Required sample size and adjusted p-values (with Bonferroni correction), maintaining the global significance level at 5%, depending on number of items needing revision and reassessment
| Number of items requiring reassessment | Required sample size | Adjusted p-values (0.05 / number of items) |
|---|---|---|
| 1 | 20 | 0.05 |
| 2 | 26 | 0.025 |
| 3 | 29 | 0.0167 |
| 4 | 32 | 0.125 |
| 5 | 34 | 0.01 |
| 6 | 35 | 0.0833 |
| 7 | 37 | 0.0071 |
| 8 | 38 | 0.00625 |
| 9 | 39 | 0.00556 |
| 10 | 40 | 0.005 |
| 11 | 40 | 0.004545 |
| 12 | 41 | 0.00416 |
| 13 | 42 | 0.00385 |
| 14 | 42 | 0.00357 |
| 15 | 43 | 0.00333 |
| 16 | 44 | 0.003125 |
Algorithm Testing
The algorithm (Fig. 1), we tested in a proof-of-principle assessment by creating and assessing a foldable 3-unit leaflet with a total of 6 A5 pages on ureteral stenting. The first draft was built in accordance to phase 1.
The title page contained the heading (“ureteral stent, patient information”), subtitle (“Information leaflet on frequently asked questions”) and a portrait of two neutral patients (male and female). The actual content was allocated to pages 2 to 5, including 3 photos (cystoscope, stone in the ureter, unused ureteral stent) and 3 schematic illustrations of the urinary tract with and without an indwelling stent. The content was subdivided into the following sections: Why do you need a ureteral stent? All about the procedure (Procedure of ureteral stenting; After the intervention; What you should consider); Symptoms caused by the stent; Warning signs. Page 6 was used for contact details of whom to contact for further treatment and in case of problems, and included free space for contact details or a stamp.
The assessment of comprehensibility revealed an average sentence length of 10.11 words and an average number of syllables per word of 1.86. Thus, the Flesch Reading Ease formula [24] (adapted for German language by Amstad) [25] was calculated as follows: 180 − 10.11 − (58.5 × 1.85) = 61.67. This result corresponds to medium to rather easy readability, which was the target for the final version. The same category was reached after minor revisions in phase 2.
In general, the feedback from all 10 patients in phase 2 was very positive. However, some minor suggestions - usually words that were felt to be difficult to understand - were implemented, based on the patient interviews.
According to a written survey by the external consultants, a high CVR (median 1, range 0.8–1) was found for all of the leaflet's sections. Again, as a consequence of the experts' free comments, some minor changes regarding the wording were implemented at a second consensus meeting.
For the validation phase, the second item of the CIRF subsection “utility” which addresses “contraindications” did not seem to be appropriate for ureteral stenting and was therefore deleted from the questionnaire. PEM was determined to be valid with respect to each item if more than 60% of the patients give an answer in the upper half of the Likert scale, i.e. 4 or 5 points for comprehensibility and design quality, and 3 or 4 points for utility. This hypothesis is examined by a one-sided binomial test for the 16 proportions observed. The global significance level is controlled at 5% by applying a correction for multiple testing according to Bonferroni. Therefore, the null hypothesis for each item is rejected if the corresponding p-value is smaller than 0.05/16, which is reached when 36 out of 44 patients provided an answer in the upper half of the Likert scale. The calculations are planned with a power of 80%, assuming that 85% of the participants would provide answers in the upper half of the Likert scale. This determined a sample size of 44 patients.
If the required validation threshold - a p < 0.3125 - is not met, i.e. more than 8 patients provided an answer in the lower half of the Likert scale, the respective item was to be adjusted.
All of the 44 patients included in the study completed the CIRF and excellent validation results were found (Table 2): with 44 responses (100%) in the upper half of the Likert scale, the best result was achieved for the items C1 (How readable is the text with regard to ease of reading?), C2 (How understandable is the text written?) and D3 (How easy is the font size to read?). The worst ratings with 39 answers (88.6%) in the upper half were the items C3 (How well do you remember the content?) and U5 (How do you find the information on the length of time the ureteral stent stays in place?).
Table 2.
Proportions of patients providing answers in the upper half of the Likert scale for each CIRF item in phase 3
| CRF item | Description | Number of answers in upper half of Likert scale | Proportions (%) | 95% CI | p |
|---|---|---|---|---|---|
| How easy would you say the information in the leaflet is to: | |||||
| C1 | Read | 44 | 100.0 | 91.97–100.00 | < 0.0001 |
| C2 | Understand | 44 | 100.0 | 91.97–100.00 | < 0.0001 |
| C3 | Remember | 39 | 88.6 | 76.02–95.05 | < 0.0001 |
| C4 | Locate | 43 | 97.7 | 88.19–99.88 | < 0.0001 |
| C5 | Keep for the future | 40 | 90.9 | 78.84–96.41 | < 0.0001 |
| Indicate your opinion about how much information was provided and how useful you think this information would be: | |||||
| U1 | Benefits of the intervention | 43 | 97.7 | 88.19–99.88 | < 0.0001 |
| Contraindications of the intervention | not applicable | not applicable | not applicable | not applicable | |
| U2 | Providing contact information | 43 | 97.7 | 88.19–99.88 | < 0.0001 |
| U3 | Precautions that need to be taken | 43 | 97.7 | 88.19–99.88 | < 0.0001 |
| U4 | Adverse effects | 41 | 93.2 | 81.77–97.65 | < 0.0001 |
| U5 | Indwelling time of the stent | 39 | 88.6 | 76.02–95.05 | < 0.0001 |
| Indicate your opinion about the following points concerning the design and structure quality of the leaflet | |||||
| D1 | Organization/structure | 43 | 97.7 | 88.19–99.88 | < 0.0001 |
| D2 | Attractiveness/clearness | 42 | 95.5 | 84.87–98.74 | < 0.0001 |
| D3 | Print size of the text | 44 | 100.0 | 91.97–100.00 | < 0.0001 |
| D4 | Tone/configuration of the text | 43 | 97.7 | 88.19–99.88 | < 0.0001 |
| D5 | Text spacing | 43 | 97.7 | 88.19–99.88 | < 0.0001 |
| D6 | General design | 41 | 93.2 | 81.77–97.65 | < 0.0001 |
C = Comprehensibility; U = utility; D = design quality.
A 95% Wilson confidence interval (CI) and the p-value for the one-sided null hypothesis that the observed proportion is less or equal to 60% is provided.
For all 16 questionnaire items, more than 36 out of 44 patients (validation threshold) provided answers in the upper half of the Likert scale (median 97.7%, range 88.6–100%), and p-values for all items were < 0.0001, with respect to the one-sided null-hypothesis that the proportion with answers in the upper half of the Likert scale would be ≤ 60% (Table 2).
As the p-values for all items were below the required validation threshold of 0.05/16, phase 4 adjustments and re-evaluation of the leaflet were not indicated. Number of patients and p-values that would have been required in case of failure of validation are provided in Table 1.
Discussion
Comprehensibility of PEMs can only be assessed conclusively by patient surveys. Such validations are laborious and therefore, materials going into the validation process should already be thoroughly prepared. As content validity cannot be assessed by patients, this preparation should also include a review by specialists.
The developed algorithm ensures, that PEM that is used for validation already provides a high comprehensibly, readability and content-related quality. It also demonstrates how validation can be performed and how to proceed in case of validation failures. Moreover, statistical concepts are provided for each step.
Our extensive literature review provided rather few methods for the development and validation of PEMs. Nevertheless, some of the methods used in the algorithm could be substituted by similar assessments. Though readability can be assessed by a variety of well-established methods (e.g. Flesch Reading Ease formula, Flesch-Kincaid Grade Level, SMOG Grade Level, Coleman-Liau Index, Gunning Fog Index), we recommend the use of the Flesch Reading Ease formula [24] as this is widely used for the assessment of technical manuals and medical content, easy to handle, and already integrated in the most popular word processing programs.
Though any kind of patient interview might serve as a rough indicator for comprehensibility, “The think aloud method” by Someren et al. [26] provides the advantage of assessing the patients first thoughts without being prone to restraint and avoidance of criticism by speaking aloud any words and ideas in ones mind while completing a task, for example reading a text.
Lawshe's “Quantitative approach to content validity” [27] is a quite well-known method assessing the question of how far a group of experts agree on whether the knowledge measured by an indicator is “essential”, “useful but no essential” or “not necessary” for the measurement of the construct and there are few validated alternatives to this method.
For the final validation we used the CIRF [28], as this is a validated questionnaire which is specific for the assessment of written medical information and includes the whole spectrum of comprehensibility, utility and design of PEMs.
In summary, our selection of methods was based on an extensive literature research and all methods are all validated and generally accepted. Moreover, it is clearly more comprehensive than a survey for the evaluation of PEM leaflets published in 2003 [29].
While the algorithm might seem laborious and time consuming at a first glance, each of the steps can be performed quite quickly and the numbers of patients and experts can be easily adapted based on statistical considerations. In accordance, all of the steps performed in phase 1 and 2 during our pilot run could be performed with little efforts and provided important improvements of the PEM. As a consequence all of the CIRF items met the required validation threshold and no adaptions were necessary, which are likely to raise higher efforts than the simple tests performed in phase 1 and 2.
Conclusion
We developed an algorithm for systematic development and validation of written patient information in accordance with well-established and validated psychometric and statistical methods. This algorithm can be applied to arbitrary fields of medicine and proved to be easy to use and functional.
Acknowledgments
The authors thank Olympus Schweiz AG (Volketswil, Switzerland) for technical support and distribution of the leaflet, especially Petra Rufer, Esther Isenschmid and Ulrik Raether. In German speaking countries, interested physicians can obtain the leaflet on a non-commercial basis via their local Olympus representative. We also thank Alistair Reeves for text editing of the manuscript and the external consultants as well as the Clinical Trials Unit of Cantonal Hospital St. Gallen for their valuable input.
References
- 1.Comescore Reston, United States. Mobile health information audience jumps 125 percent in the past year. http://www.comscore.com/2012/01/mobile-health-information-audience-jumps-125-percent-in-the-past-year, cited July 2018. [Google Scholar]
- 2.Madden M, Fox S. Finding answers online in sickness and in health. Pew Internet and American Life Project. 2006 http://www.pewinternet.org.pewinternet.org/2006/05/02/finding-answers-online-in-sickness-and-in-health/ [Google Scholar]
- 3.Abt D, Warzinek E, Schmid HP, Haile SR, Engeler DS. Influence of patient education on morbidity caused by ureteral stents. Int J Urol. 2015;22:679–683. doi: 10.1111/iju.12782. [DOI] [PubMed] [Google Scholar]
- 4.Coulter A. Evidence based patient information. is important, so there needs to be a national strategy to ensure it. BMJ. 1998;317:225–226. doi: 10.1136/bmj.317.7153.225. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Colledge A, Car J, Donnelly A, Majeed A. Health information for patients: time to look beyond patient information leaflets. J R Soc Med. 2008;101:447–453. doi: 10.1258/jrsm.2008.080149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Coulter A, Ellins J. Effectiveness of strategies for informing, educating, and involving patients. BMJ. 2007;335:24–27. doi: 10.1136/bmj.39246.581169.80. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Simpson M, Buckman R, Stewart M, Maguire P, Lipkin M, Novack D, Till J. Doctor-patient communication: the Toronto consensus statement. BMJ. 1991;303:1385–1387. doi: 10.1136/bmj.303.6814.1385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Korsch BM, Negrete VF. Doctor-patient communication. Sci Am. 1972;227:66–74. doi: 10.1038/scientificamerican0872-66. [DOI] [PubMed] [Google Scholar]
- 9.Johansson K, Leino-Kilpi H, Salantera S, Lehtikunnas T, Ahonen P, Elomaa L, Salmela M. Need for change in patient education: a Finnish survey from the patient's perspective. Patient Educ Couns. 2003;51:239–245. doi: 10.1016/s0738-3991(02)00223-9. [DOI] [PubMed] [Google Scholar]
- 10.Ley P, Giving information to patients . Psychology and Behavioural Medicine. In: Eiser JR, editor. Chichester, John, Wiley and Sons. 2008. [Google Scholar]
- 11.Guan Y, Maloney KA, Roter DL, Pollin TI. Evaluation of the informational content, readability and comprehensibility of online health information on monogenic diabetes. J Genet Couns. 2018;27:608–615. doi: 10.1007/s10897-017-0155-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Smith F, Carlsson E, Kokkinakis D, Forsberg M, Kodeda K, Sawatzky R, Friberg F, Ohlen J. Readability, suitability and comprehensibility in patient education materials for Swedish patients with colorectal cancer undergoing elective surgery: a mixed method design. Patient Educ Couns. 2014;94:202–209. doi: 10.1016/j.pec.2013.10.009. [DOI] [PubMed] [Google Scholar]
- 13.Medlineplus: How to write easy-to-read helath materials. Rockville Pike, United States. https://medlineplus.gov/etr.html
- 14.Weiss BD. Manual for Clinicians. ed 2. Chicago: American Medical Association Foundation and American Medical Association; 2007. Health Literacy and Patient Safety: help patients understand. [Google Scholar]
- 15.Eloy JA, Li S, Kasabwala K, Agarwal N, Hansberry DR, Baredes S, Setzen M. Readability assessment of patient education materials on major otolaryngology association websites. Otolaryngol Head Neck Surg. 2012;147:848–854. doi: 10.1177/0194599812456152. [DOI] [PubMed] [Google Scholar]
- 16.Betschart P, Zumstein V, Bentivoglio M, Engeler D, Schmid HP, Abt D. Readability assessment of online patient education materials provided by the European Association of Urology. Int Urol Nephrol. 2017;49:2111–2117. doi: 10.1007/s11255-017-1695-7. [DOI] [PubMed] [Google Scholar]
- 17.Prabhu AV, Hansberry DR, Agarwal N, Clump DA, Heron DE. Radiation oncology and online patient education materials: deviating from NIH and AMA recommendations. Int J Radiat Oncol Biol Phys. 2016;96:521–528. doi: 10.1016/j.ijrobp.2016.06.2449. [DOI] [PubMed] [Google Scholar]
- 18.Kandula S, Zeng-Treitler Q. Creating a gold standard for the readability measurement of health texts. AMIA Annu Symp Proc. 2008;2008:353–357. [PMC free article] [PubMed] [Google Scholar]
- 19.Michielutte R, Bahnson J, Dignan MB, Schroeder EM. The use of illustrations and narrative text style to improve readability of a health education brochure. J Cancer Educ. 1992;7:251–260. doi: 10.1080/08858199209528176. [DOI] [PubMed] [Google Scholar]
- 20.Houts PS, Doak CC, Doak LG, Loscalzo MJ. The role of pictures in improving health communication: a review of research on attention, comprehension, recall, and adherence. Patient Educ Couns. 2006;61:173–190. doi: 10.1016/j.pec.2005.05.004. [DOI] [PubMed] [Google Scholar]
- 21.Joshi HB, Okeke A, Newns N, Keeley FX, Jr, Timoney AG. Characterization of urinary symptoms in patients with ureteral stents. Urology. 2002;59:511–516. doi: 10.1016/s0090-4295(01)01644-2. [DOI] [PubMed] [Google Scholar]
- 22.Staubli SE, Mordasini L, Engeler DS, Sauter R, Schmid HP, Abt D. Economic aspects of morbidity caused by ureteral stents. Urol Int. 2016;97:91–97. doi: 10.1159/000443379. [DOI] [PubMed] [Google Scholar]
- 23.R Core Team R: a language and environment for statistical computing R Foundation for Statistical Computing, Vienna, Austria. 2015 https://www.r-project.org/ [Google Scholar]
- 24.Flesch R. A new readability yardstick. J Appl Psychol. 1948;32:221–233. doi: 10.1037/h0057532. [DOI] [PubMed] [Google Scholar]
- 25.Amstad T. Wie verstandlich sind unsere Zeitungen (Dissertation)? Universitat Zurich. 1978 [Google Scholar]
- 26.van Someren MW, Barnard YF, Sandberg JA. London: Academic Press; 1994. The think aloud method - a practical guide to modelling cognitive processes. [Google Scholar]
- 27.Lawshe CH. A quantitative approach to content validity. Pers Psychol. 1975;28:563–575. [Google Scholar]
- 28.Koo MM, Krass I, Aslani P. Evaluation of written medicine information: validation of the Consumer Information Rating Form. Ann Pharmacother. 2007;41:951–956. doi: 10.1345/aph.1K083. [DOI] [PubMed] [Google Scholar]
- 29.Gustafsson J, Kalvemark S, Nilsson G, Nilsson JL. A method to evaluate patient information leaflets. Drug Inf J. 2003;37:115–125. [Google Scholar]

