Comparison of Readability of Official Public Health Information About COVID-19 on Websites of International Agencies and the Governments of 15 Countries

Vishala Mishra; Joseph P Dexter

doi:10.1001/jamanetworkopen.2020.18033

. 2020 Aug 18;3(8):e2018033. doi: 10.1001/jamanetworkopen.2020.18033

Comparison of Readability of Official Public Health Information About COVID-19 on Websites of International Agencies and the Governments of 15 Countries

Vishala Mishra ¹, Joseph P Dexter ^2,^✉

¹Multidisciplinary Research Unit, Madras Medical College, Chennai, Tamil Nadu, India

²Neukom Institute for Computational Science, Dartmouth College, Hanover, New Hampshire

Accepted for Publication: July 14, 2020.

Published: August 18, 2020. doi:10.1001/jamanetworkopen.2020.18033

^✉

Corresponding Author: Joseph P. Dexter, PhD, Neukom Institute for Computational Science, Dartmouth College, 27 N Main St, Hanover, NH 03755 (joseph.p.dexter@dartmouth.edu).

Author Contributions: Drs Mishra and Dexter had full access to all of the data in the study and take responsibility for the integrity of the data and the accuracy of the data analysis.

Concept and design: Both authors.

Acquisition, analysis, or interpretation of data: Both authors.

Drafting of the manuscript: Both authors.

Critical revision of the manuscript for important intellectual content: Both authors.

Statistical analysis: Both authors.

Obtained funding: Dexter.

Supervision: Dexter.

Conflict of Interest Disclosures: Dr Dexter reported receiving grants from the American Council of Learned Societies and from the Neukom Institute for Computational Science during the conduct of the study. No other disclosures were reported.

Funding/Support: This work was supported by a Digital Extension Grant from the American Council of Learned Societies and a Neukom Fellowship.

Role of the Funder/Sponsor: The funders had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.

Additional Contributions: Pramit Chaudhuri, PhD (University of Texas at Austin), provided helpful discussions and comments on the manuscript, and Anish Chattopadhyay (AIIMS Patna), Cuiwei Guo, MBA (Northeastern University), and Vasudha Mishra (AIIMS Patna) assisted with data collection and curation. None of these individuals received compensation for this work.

^✉

Corresponding author.

PMCID: PMC7435342 PMID: 32809028

Abstract

This cross-sectional study evaluates the readability of information on COVID-19 on websites of international agencies, governments of 15 countries, and US health departments.

Introduction

Containment strategies for the coronavirus disease 2019 (COVID-19) pandemic have required broad public compliance, yet complex, contradictory, and false information proliferates.¹ The American Medical Association (AMA), National Institutes of Health (NIH), and Centers for Disease Control and Prevention (CDC) recommend that medical information for the public be written at no higher than an eighth-grade reading level.² We evaluated the readability of online information about COVID-19 provided by government and public health agencies and departments.

Methods

For this cross-sectional study, between April 1, 2020, and April 5, 2020, we reviewed 18 websites, including 3 public health agency sites and 15 official government sites of countries with 5000 or more confirmed cases as of April 5 and with guidelines written in English. We identified pages about COVID-19 intended for a general audience, such as lists of frequently asked questions and fact sheets, and extracted the content into text files. This study followed the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline.

Readability grade levels based on 5 formulas (Flesch-Kincaid grade level [FKGL]; Simple Measure of Gobbledygook; Gunning Fog Index; Ford, Caylor, Sticht formula; and Coleman-Liau Index) were calculated using Readability Studio Professional, version 2019.3 (Oleander Software). Measures of syntactic complexity, including mean length of clause and dependent clauses per T-unit, were computed using the L2 Syntactic Complexity Analyzer, version 3.3.3.³

We evaluated web pages against grade level recommendations of the AMA, NIH, and CDC; the CDC pages were evaluated using their health literacy guidelines (a reading level of grade 8, 1 to 2 syllables per word, 8 to 10 words per sentence, and substitution of “everyday” synonyms for 121 difficult terms related to public health).^2,4,5 Sample passages from the websites with varying FKGL are provided in the eAppendix in the Supplement. State literacy data were obtained from a previous survey.⁶

Statistical calculations were performed using Stata, version 13 (StataCorp). A Wilcoxon rank-sum test was used for 2-sample comparisons, and correlation was assessed by Spearman r. Statistical significance was defined as P < .05.

Results

Among all 18 websites evaluated, information about COVID-19 uniformly exceeded the recommended reading level of grades 6 through 8 (Table). All pages (n = 149) drawn from the websites scored above 8.0 by at least 1 metric, and 141 (95%) scored above 8.0 by all 5 metrics. A total of 145 pages (97%) exceeded the syntactic complexity typical for text written at a grade 8 level (mean length of clause, 8.0; dependent clauses per T-unit, 0.2).

Table. Readability of COVID-19 Information From the WHO, CDC, ECDC, and Governments of 15 Countries.

Resource	Web pages, No.	Readability formulas, median (IQR) [range]					Syntactic complexity, median (IQR) [range]^a
Resource	Web pages, No.	FKGL	SMOG	GFI	FORCAST	CLI	MLC	DC/T
WHO	12	11.8 (4.0) [8.7-16.7]	13.6 (3.6) [11.2-17.3]	13.6 (4.7) [10.1-15.6]	11.5 (1.3) [10.4-12.4]	12.4 (3.2) [10.5-16.2]	12.5 (2.6) [10.6-16.3]	0.5 (0.4) [0.3-0.9]
CDC	68	11.0 (3.2) [5.4-16.6]	13.4 (2.5) [8.1-17.4]	12.7 (3.0) [6.8-18.4]	11.4 (0.9) [8.7-13.4]	12.8 (2.9) [8.1-18.6]	11.6 (2.0) [7.4-15.1]	0.5 (0.2) [0.2-2.5]
ECDC	1	13.1	15.0	14.0	11.6	12.9	12.5	0.5
US^b	19	11.5 (1.9) [8.8-14.9]	13.5 (1.4) [11.6-15.7]	12.4 (2.1) [10.7-14.9]	11.3 (0.7) [10.6-12.1]	12.2 (1.8) [10.9-15.2]	11.8 (2.6) [10.1-14.1]	0.6 (0.3) [0.3-0.8]
Australia	14	10.7 (1.6) [8.6-11.7]	12.8 (1.6) [11.2-14.0]	11.8 (1.8) [9.9-13.7]	10.7 (0.5) [9.8-11.4]	11.2 (1.7) [9.5-14.1]	10.9 (1.4) [9.4-15.8]	0.6 (0.3) [0.4-0.8]
Austria	1	12.0	14.1	12.4	11.2	11.9	12.3	0.4
Belgium	1	9.4	11.9	11.3	11.3	11.6	10.3	0.3
Canada	1	10.5	13.1	11.5	11	11.5	10.2	0.6
France	1	9.4	11.9	9.9	10.2	10.9	11.1	0.6
Germany	1	11.7	13.9	12.2	11.2	12.3	12.2	0.4
Israel	1	11.3	13.2	12.4	11.1	11.5	12.5	0.4
Italy	1	12.6	14.2	12.6	11.3	12.5	14.4	0.4
Netherlands	2	7.8 (0.7) [7.4-8.1]	10.2 (0.9) [9.7-10.6]	8.7 (0.9) [8.2-9.1]	10.3 (0) [10.3-10.3]	9.2 (0.6) [8.9-9.5]	9.4 (0.2) [9.3-9.5]	0.5 (0) [0.5-0.5]
Norway	13	11.1 (3.2) [7.0-13.9]	12.3 (1.9) [10.6-15.2]	12.1 (1.9) [10.0-14.5]	10.8 (1.3) [9.8-12.5]	11.4 (3.1) [8.7-15.3]	9.9 (1.9) [7.4-13.9]	0.7 (0.2) [0.4-1.2]
South Korea	1	11.5	13.4	12.4	11.4	12.9	11.6	0.5
Sweden	1	10.6	12.7	11.8	11.1	11.5	10.1	0.6
Switzerland	1	9.2	11.9	10.7	10.6	10.8	10.2	0.5
UK^c	10	11.6 (2.4) [8.4-15.1]	12.9 (1.9) [11.2-16.1]	12.8 (3.2) [9.7-15.2]	10.3 (1.0) [9.6-11.6]	10.8 (1.7) [8.7-13.7]	10.4 (1.1) [9.3-15.0]	0.8 (0.4) [0.5-1.2]

Open in a new tab

Abbreviations: CDC, Centers for Disease Control and Prevention; CLI, Coleman-Liau Index; COVID-19, coronavirus disease 2019; DC/T, dependent clauses per T-unit; ECDC, European Centre for Disease Control and Prevention; FKGL, Flesch-Kincaid grade level; FORCAST, Ford, Caylor, Sticht formula; GFI, Gunning Fog Index; MLC, mean length of clause; SMOG, Simple Measure of Gobbledygook; WHO, World Health Organization.

^{^a}

The MLC measures elaboration at the clause level (number of words per clause), and DC/T measures clausal subordination.

^{^b}

Official guidelines provided by the White House Coronavirus Task Force were included.

^{^c}

Official guidelines provided by Public Health England were included.

Across all CDC pages, the median FKGL was 11.0 (interquartile range [IQR], 3.2; range, 5.4-16.6). Median syllables per word was 1.7 (IQR, 0.2; range, 1.3-2.0), median words per sentence was 15.6 (IQR, 3.2; range, 8.2-31.5), and 67 pages (99%) used at least 1 difficult term. Median number of difficult terms used was 11.0 (IQR, 10.5; range, 0-50).

The FKGL was above 8.0 for every state (Figure). Compared with the CDC, states (median, 24.5; IQR, 19.0; range, 4.0-53.0) used significantly more difficult terms (P < .001). Use of difficult terms by states was correlated with FKGL (Spearman r = 0.36; 95% CI, 0.09-0.58; P = .01). Nine of the 10 states with the highest illiteracy rates had information written above a grade 10 level.⁶

Discussion

We found that official information about COVID-19 exceeded the recommended reading level, exhibited complex syntax, and used technical terminology. The significant difference in use of difficult terms between the CDC and state resources may reflect the influence of federal oversight mandating government communication that is understandable to the public. Limitations included the focus on text, with no evaluation of multimedia communication, and lack of data about actual comprehension or relevant outcomes such as adherence to mitigation strategies.

Nonadherence to readability standards may have a greater influence in communities with lower health literacy, potentially exacerbating the disparate effects of the pandemic. As such, efforts should focus on the urgent development of plain-language COVID-19 resources that conform to established guidelines for clear communication and are more accessible to all audiences.

Supplement.

eAppendix. Sample Passages About COVID-19 Written at Different Flesch-Kincaid Grade Levels

Click here for additional data file.^{(162.2KB, pdf)}

References

1.Paakkari L, Okan O. COVID-19: health literacy is an underestimated problem. Lancet Public Health. 2020;5(5):e249-e250. doi: 10.1016/S2468-2667(20)30086-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Badarudeen S, Sabharwal S. Assessing readability of patient education materials: current role in orthopaedics. Clin Orthop Relat Res. 2010;468(10):2572-2580. doi: 10.1007/s11999-010-1380-y [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Jin T, Lu X, Ni J.. Syntactic complexity in adapted teaching materials: differences among grade levels and implications for benchmarking. The Modern Language Journal. 2020;104(1):192-208. doi: 10.1111/modl.12622 [DOI] [Google Scholar]
4.Centers for Disease Control and Prevention Simply put: a guide for creating easy-to-understand materials. Accessed April 5, 2020. https://www.cdc.gov/healthliteracy/pdf/simply_put.pdf
5.Centers for Disease Control and Prevention Everyday words for public health communication. Accessed April 5, 2020. https://www.cdc.gov/other/pdf/everydaywordsforpublichealthcommunication.pdf
6.Kressin NR, Gunn CM, Battaglia TA. Content, readability, and understandability of dense breast notifications by state. JAMA. 2016;315(16):1786-1788. doi: 10.1001/jama.2016.1712 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement.

eAppendix. Sample Passages About COVID-19 Written at Different Flesch-Kincaid Grade Levels

Click here for additional data file.^{(162.2KB, pdf)}

[zld200136r1] 1.Paakkari L, Okan O. COVID-19: health literacy is an underestimated problem. Lancet Public Health. 2020;5(5):e249-e250. doi: 10.1016/S2468-2667(20)30086-4 [DOI] [PMC free article] [PubMed] [Google Scholar]

[zld200136r2] 2.Badarudeen S, Sabharwal S. Assessing readability of patient education materials: current role in orthopaedics. Clin Orthop Relat Res. 2010;468(10):2572-2580. doi: 10.1007/s11999-010-1380-y [DOI] [PMC free article] [PubMed] [Google Scholar]

[zld200136r3] 3.Jin T, Lu X, Ni J.. Syntactic complexity in adapted teaching materials: differences among grade levels and implications for benchmarking. The Modern Language Journal. 2020;104(1):192-208. doi: 10.1111/modl.12622 [DOI] [Google Scholar]

[zld200136r4] 4.Centers for Disease Control and Prevention Simply put: a guide for creating easy-to-understand materials. Accessed April 5, 2020. https://www.cdc.gov/healthliteracy/pdf/simply_put.pdf

[zld200136r5] 5.Centers for Disease Control and Prevention Everyday words for public health communication. Accessed April 5, 2020. https://www.cdc.gov/other/pdf/everydaywordsforpublichealthcommunication.pdf

[zld200136r6] 6.Kressin NR, Gunn CM, Battaglia TA. Content, readability, and understandability of dense breast notifications by state. JAMA. 2016;315(16):1786-1788. doi: 10.1001/jama.2016.1712 [DOI] [PubMed] [Google Scholar]

PERMALINK

Comparison of Readability of Official Public Health Information About COVID-19 on Websites of International Agencies and the Governments of 15 Countries

Vishala Mishra, MBBS

Joseph P Dexter, PhD

Abstract

Introduction

Methods

Results

Table. Readability of COVID-19 Information From the WHO, CDC, ECDC, and Governments of 15 Countries.

Figure. Readability, Literacy, and Use of Difficult Terms Across All 50 US States.

Discussion

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Comparison of Readability of Official Public Health Information About COVID-19 on Websites of International Agencies and the Governments of 15 Countries

Vishala Mishra, MBBS

Joseph P Dexter, PhD

Abstract

Introduction

Methods

Results

Table. Readability of COVID-19 Information From the WHO, CDC, ECDC, and Governments of 15 Countries.

Figure. Readability, Literacy, and Use of Difficult Terms Across All 50 US States.

Discussion

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases