Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2020 May 30;185:21–25. doi: 10.1016/j.puhe.2020.05.041

Readability of online patient education material for the novel coronavirus disease (COVID-19): a cross-sectional health literacy study

T Szmuda a,c, C Özdemir b,∗,c, S Ali b,c, A Singh b,c, MT Syed b,c, P Słoniewski a
PMCID: PMC7260546  PMID: 32516624

Abstract

Objectives

The internet has become one of the most important resources for the general population when searching for healthcare information. However, the information available is not always suitable for all readers because of its difficult readability. We sought to assess the readability of online information regarding the novel coronavirus disease (COVID-19) and establish whether they follow the patient educational information reading level recommendations.

Study design

This is a cross-sectional study.

Methods

We searched five key terms on Google and the first 30 results from each of the searches were considered for analysis. Five validated readability tests were utilized to establish the reading level for each article.

Results

Of the 150 gathered articles, 61 met the inclusion criteria and were evaluated. None (0%) of the articles met the recommended 5th to 6th grade reading level (of an 11-12-year-old). The mean readability scores were Flesch Reading Ease 44.14, Flesch-Kincaid Grade Level 12.04, Gunning-Fog Index 14.27, Simple Measure of Gobbledygook SMOG Index 10.71, and Coleman-Liau Index 12.69.

Conclusions

Online educational articles on COVID-19 provide information too difficult to read for the general population. The readability of articles regarding COVID-19 and other diseases needs to improve so that the general population may understand health information better and may respond adequately to protect themselves and limit the spread of infection.

Keywords: Coronavirus, COVID-19, Readability, Google, Internet

Highlights

  • Online health information regarding COVID-19 is too difficult to read and understand.

  • Based on past research, health articles too difficult to understand may cause misinformation to spread and public panic.

  • The readability of COVID-19 needs to improve so that the general population may respond adequately to protect themselves and limit the spread of infection.

Introduction

The internet has grown to become one of the most popular resources for people in finding health information.1 In the United States, up to 80% of all adult internet users have looked online to find information about several health issues.2 70% of health seekers express that the information that they have found has influenced them in their decision-making regarding the treatment of their disease.3 Although the information is readily accessible, it's utility may be variable depending on the readability of the information. The mean reading level of adults in the United States has been estimated to be equivalent to that of a 13- to 14-year old.4 Taking this into account, the American Medical Association (AMA) and United States Department of Health and Human Services (USDHHS) have recommended that patient educational information should not exceed the reading level of an 11- to 12-year old.5 However, numerous studies have shown that the reading difficulty of health articles online is much higher than recommended.6, 7, 8 To date, no readability analysis has been performed regarding the novel coronavirus disease (COVID-19).

COVID-19 has resulted in thousands of deaths worldwide and has resulted in more fatalities than the previous two coronavirus epidemics combined (i.e. SARS and MERS). As of March 11, 2020, the outbreak was recognized by the World Health Organization (WHO) as a pandemic.9 Thus, proper public education becomes critical so that patients may prevent and contain the infection. Because the internet is often the first source of information regarding health care for patients, it is critical to evaluate the readability level of the information.10 In this study, we sought to assess the readability of online information regarding COVID-19.

Methods

Search strategy

On March 13, 2020, the Google search engine was used to search for the following five key terms: “Coronavirus,” “COVID-2019,” “SARS CoV-2,” “2019-nCoV,” and “What is the coronavirus”. 7.39 × 10⁹, 4.59 × 10⁹, 1.43 × 10⁸, 1.02 × 10⁸, and 5.08 × 10⁹ search results were identified for each search, respectively. The first 30 results per key term were evaluated because 90% of internet users do not look past this number.11

Inclusion and exclusion criteria

A total of 150 search results were considered for the readability analysis. Articles not in English, duplicates, newspaper articles (because they contained mostly political updates), biomedical journal articles, non–open-access articles (behind a paywall), and statistical websites were excluded.

Readability assessment

All articles were restructured into plain text and all irrelevant material was deleted, such as figures, their legends, images, and references. The analysis was performed using five readability formulas: Flesch Reading Ease (FRE), Flesch-Kincaid Grade Level (FKGL), Gunning-Fog Index (GFI), Simple Measure of Gobbledygook (SMOG) Index, and Coleman-Liau Index (CLI).

The FRE and the FKGL are different formulas that use the average sentence length in words and average syllables per 100 words for the assessment.12

GFI uses the average sentence length and the number of words containing three or more syllables for the calculations. Polysyllabic words are excluded, including proper nouns, a combination of easy words (including hyphenated words), and polysyllable verbs whose third syllable is “es” or “ed”.13

SMOG Index is calculated by counting every polysyllabic word (containing three or more syllables) in sections containing 10 sentences each placed in the beginning, in the middle, and at the end of the text in question.14 In contrast to the other readability formulas, CLI does not take the number of syllables into account. Instead, it makes the assessment based on the average number of letters and sentences per 100 words.15

The FRE score determines the reading ease based on a scale from 0 to 100, where a lower score indicates a higher difficulty (0–30 is very difficult, 30–50 is difficult, 50–60 is fairly difficult, 60–70 is standard, 70–80 is fairly easy, 80–90 is easy, 90–100 is very easy).12 The other four formulas, however, use a scale based on the educational level needed to understand the text. A score less than 6 is regarded as 6th grade reading level (11-12 years), a score of 7 as 7th grade reading level (12-13 years), a score of 8 as 8th grade (13-14 years), 9 as high school freshman (14-15 years), 10 as high school sophomore (15-16 years), 11 as high school junior (16-17 years), 12 as high school senior (17-18 years), 13 as college freshman (18-19 years), 14 as college sophomore (19-20 years), 15 as college junior (20-21 years), 16 as college senior (21-22 years), and 17 or above is regarded as college graduate (someone over the age of 22 years).

Results

Of the 150 articles, 61 were analyzed as they met the inclusion criteria. The mean reading level of the articles was equivalent to a high school senior/college freshman (17 to 18 year olds) (12.4 ± 2.1). The minimum score was 8.8, whereas the maximum score was 20.1. According to the mean FRE score, the articles are considered difficult to read (44 ± 11.5). In Table 1 , the mean scores of the readability formulas are presented. Table 2 shows the distribution of the scores.

Table 1.

Mean score for individual formulas used in the readability assessment of the websites.

Readability formula Mean score Standard deviation
Flesch Reading Ease (FRE) 44.14 11.46
Flesch-Kincaid Grade Level (FKGL) 12.04 2.67
Gunning-Fog Index (GFI) 14.27 2.84
Simple Measure of Gobbledygook (SMOG) Index 10.71 1.96
Coleman-Liau Index (CLI) 12.69 1.86

Table 2.

Distribution of scores within each individual formula.

Readability formula Score Number of websites
Flesch Reading Ease (FRE) Easy (80–100) 0
Average (60–79) 5
Difficult (0–59) 56
Flesch-Kincaid Grade Level (FKGL) Below 6 0
6–10 13
Above 10 48
Gunning-Fog Index (GFI) Below 6 0
6–10 1
Above 10 60
Simple Measure of Gobbledygook (SMOG) Index Below 6 0
6–10 23
Above 10 38
Coleman-Liau Index (CLI) Below 6 0
6–10 7
Above 10 54

All articles were of at least a high school sophomore grade level (15-16 years old). According to the FRE, 91.8% of the articles were found difficult to read. On the FKGL scale, 78.68% of the articles had a readability index of a high school senior grade (17- to 18-year olds). Similarly, on GFI, SMOG, and CLI, the percentage of articles that were above the readability index of a high school senior (17- to 18-year olds) were 98.36%, 62.29%, and 88.52%, respectively.

Table 3 displays that the information provided from websites related to governments, hospitals and health organizations (such as WHO) are also not following the recommendations for educational material. All of the medical articles were written beyond the recommended 5th- to 6th-grade level (11- to 12-year olds).16

Table 3.

Mean readability score of websites related to governments, hospitals/clinics, or health institutions/organizations.

Readability formula Mean score Standard deviation
Flesch Reading Ease (FRE) 47.82 12.76
Flesch-Kincaid Grade Level (FKGL) 11.51 3.06
Gunning-Fog Index (GFI) 13.57 3.10
Simple Measure of Gobbledygook (SMOG) Index 10.17 2.16
Coleman-Liau Index (CLI) 12.65 1.82

Fig. 1 shows a comparison between the overall readability, government health organization readability, and hospital readability.

Fig. 1.

Fig. 1

The mean score for each of the readability formulas used in the assessment of the overall readability (colored blue), government health organizations readability (colored red) and hospital readability (colored purple). Abbreviations: avg, average; gov, government health organizations; FKGL, Flesch-Kincaid Grade Level; FRE, Flesch Reading Ease; GFI, Gunning-Fog Index; GFS, Gunning-Fog Score; CLI, Coleman-Liau Index; SMOG, Simple Measure of Gobbledygook.

Discussion

Readability analysis

We found that online information about COVID-19 is too difficult for the general population to read and comprehend. None of the articles met the 5th- to 6th-grade reading level (11- to 12-year olds) recommended by the AMA and the USDHHS.5 , 16 Most articles (84%) were designated as too difficult to read, which makes it tough for the public to acquire understandable information regarding COVID-19. To reduce the spread of infection and, thus, reduce the burden on a country's healthcare system, a country's population needs access to understandable information online. We urge prominent organizations such as the WHO and the Centers for Disease Control and Prevention to make their online information friendlier to the general public. Easy readability of relevant medical information empowers individuals to take the right steps to protect themselves. This may reduce panic and anxiety especially in the midst of a pandemic. Our findings are novel because our article is the first to do a readability analysis on COVID-19. Morever, we conducted our readability analysis on COVID-19 during the pandemic itself to provide a contemporary and relevant view of the problem.

Context

Medical terminology is an essential factor affecting the readability of a text. Even if the person's educational level is high, long sentences and unfamiliar words can make the text challenging to read. If they are not accustomed to medical literature, misunderstanding and misinformation may occur and the patient may stop researching basic medical care. Therefore, information aiming to educate patients should be clear and understandable. A 2018 systematic review analyzing 157 readability studies found that the readability level of online health articles is incomprehensible for the public.8

Limitations

This study is limited by the constraints of a cross-sectional study. Because the material available on the internet is constantly increasing, the results of our study will only reflect what information was present for the public at this point in time. However, our study brings awareness to the problem; this may influence hospitals and government organizations to reduce the reading difficulty on their websites. Morevoer, this readability study is based on text only. Infographics and videos—which may often enhance the understanding of a text—were removed to perform the analysis. This was another limitation of this study.

Future directions

Previous studies have shown that the quality of health information online is biased, misleading, and poor.17 , 18 Thus, although we assessed the readability of COVID-19 health articles, the quality of these articles still needs to be evaluated. In addition, because YouTube has become a major source of patient information, the quality of information on this platform also needs to be assessed.19, 20, 21, 22 The role and effectiveness of online medical resources concerning telemedicine may also be explored.23

We encourage websites to display COVID-19 infographics and videos as they may be a more friendly way of providing information to the public. Moreover, infographics are easy to potentially share on social media and may help spread the health information about the disease.

Public health implications

Considering Europe is now the epicenter of COVID-19 pandemic, it becomes critical that government sites such as ecdc.europa.eu (European Centre for Disease Prevention and Control) publish advisories in simple understandable terms. Studies show that these articles play a critical role in recommending health safety guidelines, reduce the burden on the healthcare system, and help healthcare workers prioritize managing the disease effectively.24, 25, 26, 27 An analysis of the 1918 influenza pandemic showed that early implementation of certain interventions (e.g. isolation policies, mask ordinances, and bans on public gatherings) reduced influenza transmission.26 We assert that for these interventions to be effective, the public needs access to understandable online health information so that personal measures to contain and prevent the disease may be taken (e.g. by washing heads frequently, practicing respiratory hygiene, and seeking medical care early). Low health literacy has been associated with nonadherence to treatment plans and medical regimens, poor patient self-care, high healthcare costs, and increased risk of hospitalization and mortality.28 Thus we hope that this article serves as a “call to action” for health authorities to provide more comprehensible reading material online.

Conclusion

Online information regarding COVID-19 is too difficult to read and understand as designated by the AMA and the USDHHS.5 This includes websites run by governments and health institutions such as the WHO. Because the internet now is one of our most popular sources of information, it is critical that people are provided with understandable information. Health articles too difficult to understand may cause misinformation to spread, public panic due to a lack of accessible information, and a greater burden on a country's healthcare system.

Author statements

Ethical approval

No ethical approval was required as all the data analyzed were publicly available.

Funding

None reported.

Competing interests

None reported.

References

  • 1.Wald H.S., Dube C.E., Anthony D.C. Vol. 68. Patient Education and Counseling; 2007. pp. 218–224. (Untangling the Web-The impact of Internet use on health care and the physician-patient relationship). [DOI] [PubMed] [Google Scholar]
  • 2.Akerkar S.M., Bichile L.S. Doctor patient relationship: changing dynamics in the information age. J Postgrad Med. 2004;50:120–122. [PubMed] [Google Scholar]
  • 3.The online health care revolution | pew research center. 2020. https://www.pewresearch.org/internet/2000/11/26/the-online-health-care-revolution/ [internet]. [cited 2020 mar 17]. Available from:
  • 4.Kutner Greenberg, Jin Paulsen. 2003. The health literacy of America's adults: results from the 2003 National assessment of adult literacy. [Google Scholar]
  • 5.Cotugna N., Vickery C.E., Carpenter-Haefele K.M. Evaluation of literacy level of patient education pages in health-related journals. J Community Health. 2005 Jun;30(3):213–219. doi: 10.1007/s10900-004-1959-x. [DOI] [PubMed] [Google Scholar]
  • 6.Bernard S., Cooke T., Cole T., Hachani L., Bernard J. Quality and readability of online information about type 2 diabetes and nutrition. J Am Acad Physician Assist. 2018 Nov 1;31(11):41–44. doi: 10.1097/01.JAA.0000546481.02560.4e. http://www.ncbi.nlm.nih.gov/pubmed/30358679 [Internet] [cited 2020 Mar 17]. Available from: [DOI] [PubMed] [Google Scholar]
  • 7.Lee K.C., Berg E.T., Jazayeri H.E., Chuang S.K., Eisig S.B. Online patient education materials for orthognathic surgery fail to meet readability and quality standards. J oral maxillofac surg. 2019 Jan 1;77(1):180.e1–180.e8. doi: 10.1016/j.joms.2018.08.033. http://www.ncbi.nlm.nih.gov/pubmed/30296407 [Internet] [cited 2020 Mar 17]. Available from: [DOI] [PubMed] [Google Scholar]
  • 8.Daraz L., Morrow A.S., Ponce O.J., Farah W., Katabi A., Majzoub A. Readability of online health information: a meta-narrative systematic review. Am J Med Qual. 2018 Sep 1;33(5):487–492. doi: 10.1177/1062860617751639. http://www.ncbi.nlm.nih.gov/pubmed/29345143 [Internet] [cited 2020 Mar 17]. Available from: [DOI] [PubMed] [Google Scholar]
  • 9.World Health Organization Coronavirus (COVID-19) events as they happen. 2020. https://www.who.int/emergencies/diseases/novel-coronavirus-2019/events-as-they-happen [Internet]. [cited 2020 Mar 17]. Available from:
  • 10.Hesse B.W., Moser R.P., Rutten L.J. vol. 362. Massachussetts Medical Society; 2010. Surveys of physicians and electronic health information; pp. 859–860.http://www.ncbi.nlm.nih.gov/pubmed/20200398 (New England journal of medicine). [Internet] [cited 2020 Mar 17]. Available from: [DOI] [PubMed] [Google Scholar]
  • 11.iProspect search engine user behaviour study. 2006. [Google Scholar]
  • 12.Flesch R. A new readability yardstick. J Appl Psychol. 1948 Jun;32(3):221–233. doi: 10.1037/h0057532. [DOI] [PubMed] [Google Scholar]
  • 13.Library H. LibGuides: guilford college writing manual: gunning's fog index.
  • 14.Leonard Grabeel K., Russomanno J., Oelschlegel S., Tester E., Heidel R.E. Computerized versus hand-scored health literacy tools: a comparison of Simple Measure of Gobbledygook (SMOG) and Flesch-Kincaid in printed patient education materials. J Med Libr Assoc. 2018 Jan 1;106(1):38–45. doi: 10.5195/jmla.2018.262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Free coleman liau index readability test tool - readable. 2020. https://www.webfx.com/tools/read-able/coleman-liau-index.html [internet]. [cited 2020 mar 17]. Available from:
  • 16.Health literacy and patient safety: help patients understand. 2007. [Google Scholar]
  • 17.Keselman A., Smith C.A., Murcko A.C., Kaufman D.R. Evaluating the quality of health information in a changing digital ecosystem. J Med Internet Res. 2019 Feb 1;21(2) doi: 10.2196/11129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Sun Y., Zhang Y., Gwizdka J., Trace C.B. vol. 21. Journal of Medical Internet Research; 2019. (Consumer evaluation of the quality of online health information: systematic literature review of relevant criteria and indicators). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Szmuda T., Ali S., Słoniewski P. vol. 130. World Neurosurgery. Elsevier Inc.; 2019. pp. 570–572.https://www.ncbi.nlm.nih.gov/pubmed/31581395 (Letter to the Editor regarding “A quality analysis of disk herniation videos on YouTube”). [cited 2020 Feb 3]. Available from: [DOI] [PubMed] [Google Scholar]
  • 20.Szmuda T., Rosvall P., Hetzger T.V., Ali S., Słoniewski P. YouTube as a source of patient information for hydrocephalus: a content-quality and optimization analysis. World Neurosurg. 2020 Mar 5 doi: 10.1016/j.wneu.2020.02.149. https://www.ncbi.nlm.nih.gov/pubmed/32147561 [Internet] [cited 2020 Mar 7]; Available from: [DOI] [PubMed] [Google Scholar]
  • 21.Chan D.S.Y., Willicombe A., Reid T.D., Beaton C., Arnold D., Ward J. Relative quality of internet-derived gastrointestinal cancer information. J Cancer Educ. 2012 Dec;27(4):676–679. doi: 10.1007/s13187-012-0408-2. http://www.ncbi.nlm.nih.gov/pubmed/22918796 [Internet] [cited 2020 Mar 17]. Available from: [DOI] [PubMed] [Google Scholar]
  • 22.Szmuda T., Özdemir C., Fedorow K., Ali S., Słoniewski P. YouTube as a source of information for narcolepsy: a content-quality and optimization analysis. J Sleep Res. 2020 Apr 21 doi: 10.1111/jsr.13053. https://pubmed.ncbi.nlm.nih.gov/32315117/ [Internet] [cited 2020 Apr 23]. Available from: [DOI] [PubMed] [Google Scholar]
  • 23.Szmuda T., Ali S., Słoniewski P., NsW Group. Telemedicine in neurosurgery during the novel coronavirus (COVID-19) pandemic. Neurol Neurochir Pol. 2020 Apr 22;54(2):207–208. doi: 10.5603/PJNNS.a2020.0038. http://www.ncbi.nlm.nih.gov/pubmed/32319670 [Internet] [cited 2020 Apr 23]; Available from: [DOI] [PubMed] [Google Scholar]
  • 24.Quintana Y., Feightner J.W., Wathen C.N., Sangster M., Marshall J.N. Preventive health information on the Internet. Qualitative study of consumers' perspectives. Can Fam Physician. 2001;47(SEPT):1759–1765. [PMC free article] [PubMed] [Google Scholar]
  • 25.Iverson S.A., Howard K.B., Penney B.K. Impact of internet use on health-related behaviors and the patient-physician relationship: a survey-based study and review. J Am Osteopath Assoc. 2008 Dec;108(12):699–711. [PubMed] [Google Scholar]
  • 26.Hatchett R.J., Mecher C.E., Lipsitch M. Public health interventions and epidemic intensity during the 1918 influenza pandemic. Proc Natl Acad Sci U S A. 2007 May 1;104(18):7582–7587. doi: 10.1073/pnas.0610941104. http://www.ncbi.nlm.nih.gov/pubmed/17416679 [Internet] [cited 2020 Mar 17]. Available from: [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Smith N., Fraser M. Straining the system: novel coronavirus (COVID-19) and preparedness for concomitant disasters. Am J Public Health. 2020 Feb 13:e1–e2. doi: 10.2105/AJPH.2020.305618. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.King A. Vol. 7. Nature Reviews Cardiology; 2010. pp. 473–474.http://www.ncbi.nlm.nih.gov/pubmed/20725102 (Poor health literacy: a “hidden” risk factor). [Internet] [cited 2020 Mar 17]. Available from: [DOI] [PubMed] [Google Scholar]

Articles from Public Health are provided here courtesy of Elsevier

RESOURCES