Frontiers in Human Neuroscience. 2018 Aug 14;12:308. doi: 10.3389/fnhum.2018.00308

Readability of the 100 Most-Cited Neuroimaging Papers Assessed by Common Readability Formulae

Andy W K Yeung 1,*, Tazuko K Goto 2, W Keung Leung 3
PMCID: PMC6104455  PMID: 30158861

Abstract

Background: From time to time, neuroimaging research findings receive press coverage and attention from the general public. Scientific articles should therefore be written in a readable manner to facilitate knowledge translation and dissemination. However, no readability report on neuroimaging articles has been published, unlike for articles in education, medical and marketing journals. As a start, this study therefore aimed to evaluate the readability of the most-cited neuroimaging articles.

Methods: The 100 most-cited articles in neuroimaging identified in a recent study by Kim et al. (2016) were evaluated. Headings, mathematical equations, tables, figures, footnotes, appendices, and reference lists were trimmed from the articles. The remaining text was processed to count the numbers of characters, words and sentences. Five readability indices that indicate the school grade appropriate for a given reading difficulty (Automated Readability Index, Coleman-Liau Index, Flesch-Kincaid Grade Level, Gunning Fog index and Simple Measure of Gobbledygook index) were computed. An average reading grade level (AGL) was calculated by taking the mean of these five indices. The Flesch Reading Ease (FRE) score was also computed. The readability of the trimmed abstracts and full texts was evaluated against the number of authors, country of the corresponding author, total citation count, normalized citation count, article type, publication year, impact factor of the year published and type of journal.

Results: Mean AGL ± standard deviation (SD) of the trimmed abstracts and full texts were 17.15 ± 2.81 (college graduate level) and 14.22 ± 1.66 (college level), respectively. Mean FRE score ± SD of the abstracts and full texts were 15.70 ± 14.11 (college graduate level) and 32.11 ± 8.56 (college level), respectively. Both measures indicated that the full texts were significantly more readable than the abstracts (p < 0.001). Abstract readability was not associated with any of the factors under investigation. ANCOVAs showed that the review/meta-analysis article type (mean AGL ± SD: 16.0 ± 1.4) and a higher impact factor were significantly associated with lower readability of the trimmed full texts surveyed.

Conclusion: Among the 100 most-cited articles in neuroimaging, the full texts appeared to be more readable than the abstracts. Experimental articles and methodology papers were more readable than reviews/meta-analyses. Articles published in journals with higher impact factors were less readable.

Keywords: bibliometrics, information science, neuroimaging, neurosciences, readability

Introduction

The neuroimaging research field frequently witnesses advancements in knowledge, understanding and technology. Scientists now use neuroimaging to understand and treat brain disorders such as depression, and to predict behavior relevant to marketing and policy-making (Poldrack and Farah, 2015). Experimental papers provide knowledge from basic science that enhances fundamental understanding of the biology involved, and from clinical science that contributes to evidence-based clinical practice. For example, resting-state brain connectivity, discovered two decades ago (Biswal et al., 1995), now shows potential for identifying patients with Alzheimer’s disease (de Vos et al., 2018). Methodological papers introduce novel ways to acquire and analyze neuroimaging data that may contribute to the understanding of normal neurophysiology, clinical diagnostics or therapeutic approaches. For instance, the voxel-based morphometry method developed to detect brain structural changes (Ashburner and Friston, 2000) is now applied to study such subtle changes in Parkinson’s disease patients with early cognitive impairment (Gao et al., 2017). Systematic reviews and meta-analyses are also very important for pooling evidence across studies to evaluate whether specific findings are consistently reported, so that particular theories or therapeutic strategies can be established or rejected (Yeung et al., 2017b,d, 2018a). With such a diverse literature, bibliometric reports are valuable because they quantitatively and qualitatively evaluate the peer scientific impact of the academic literature on specific breakthroughs or selected research topics, analyzing publication and citation information as well as the content of the papers (Carp, 2012; Guo et al., 2014; Kim et al., 2016; Yeung, 2017a,b, 2018a,b; Yeung et al., 2018b). In particular, a bibliometric study on the readability of neuroimaging papers would be very helpful, as the literature indicates that translation and communication challenges arise when neuroimaging findings are disseminated to the public (O’Connor et al., 2012; van Atteveldt et al., 2014).

To avoid these translation and communication challenges, neuroimaging and neuroscience articles should be written in a readable manner to better disseminate existing knowledge and novel findings (Illes et al., 2010). From time to time, neuroimaging findings receive press coverage (Herculano-Houzel, 2002; Racine et al., 2005; O’Connor et al., 2012) and influence society through various means, such as substantiating morality claims, justifying healthcare procedures and facilitating policy making (Racine et al., 2005). A 10-year survey of mass media identified over 1,000 news articles reporting neuroimaging research findings, most of which were written by journalists (84%) and press agencies (11%) (Racine et al., 2010). Because neuroimaging affects real-world social contexts and creates news headlines, it is important for people working in the press to understand the underlying scientific findings, interpretations and limitations (Racine et al., 2005; O’Connor et al., 2012). The simplified take-home message translated from brain research to the public should be correct and contain no misconceptions (Beck, 2010; van Atteveldt et al., 2014). Unfortunately, misconceptions are not uncommon. Misconceptions disseminated by journalists may arise for four reasons: poor accuracy or internal consistency of the articles themselves, articles drawing firm conclusions from weak data, inappropriate extrapolation of basic and pre-clinical findings to therapeutic uses, and selective reporting of findings published only in journals with high impact factors (Gonon et al., 2011, 2012). Some have argued that misconceptions may also arise among practitioners, such as those in education, because neuroscience articles are not easily understood by lay people (Howard-Jones, 2014). Meanwhile, concern regarding the readability of journal articles is potentially a universal issue, as surveys have revealed that the health literacy skills, including print literacy (McCray, 2005), of the general public proficient in English were at or below the eighth grade level (Herndon et al., 2011). Therefore, healthcare journalists should convey biomedical research findings to the lay public with accurate interpretation and an appropriate language level (Angell and Kassirer, 1994; Eggener, 1998). However, the majority of healthcare journalists hold bachelor’s degrees unrelated to healthcare and have no additional postgraduate degrees (Viswanath et al., 2008; Friedman et al., 2014). Moreover, they often lack the time and knowledge to improve the informative value of their reports (Larsson et al., 2003), sometimes have difficulties understanding the original papers (Friedman et al., 2014), and their news coverage often emphasizes the beneficial effects of experimental treatments stated in article abstracts from journals with high impact factors (Yavchitz et al., 2012). These reports indicate that an evaluation of the readability of neuroimaging articles and their abstracts is needed, which, to the best of the authors’ knowledge, has not been documented in previously published studies. To begin with, it would be beneficial to evaluate the readability of the 100 neuroimaging articles that have received the most all-time attention in terms of scientific citations (Kim et al., 2016).
The rationale for choosing this collection of articles was that a collection of “100 most cited articles” is often perceived as a representative sample of the most influential works within the field of interest, and this sampling strategy has been used repeatedly in the literature (Joyce et al., 2014; Powell et al., 2016; Wrafter et al., 2016). Hence, the 100 most-cited articles are considered drivers of fundamental knowledge development, knowledge application and evidence-based practice in their fields of interest. Moreover, research articles with high impact often receive increased media attention (Gonon et al., 2011, 2012; Yavchitz et al., 2012), which underlines the importance of their readability.

Readability of academic articles may be associated with author, article and journal factors, such as the number of authors (Plavén-Sigray et al., 2017), first language of the principal author (Hayden, 2008), country of institutional affiliation of the first author (Weeks and Wallace, 2002), citation count (Gazni, 2011), publication year (Plavén-Sigray et al., 2017), article type (Hayden, 2008), and choice of journal (Shelley and Schuh, 2001; Weeks and Wallace, 2002). The editorial and peer-review process can also improve the readability of manuscripts (Roberts et al., 1994; Rochon et al., 2002; Hayden, 2008). Based on these considerations, the aim of this study was to assess the readability level of the 100 most-cited neuroimaging articles. We hypothesized that the abstracts, as summaries of the articles, would be more readable than the full texts. We also hypothesized that, for both abstracts and full texts, the readability of these most-cited articles would be similar across publication years.

Materials and Methods

Data Selection, Retrieval and Processing

The 100 most-cited articles in neuroimaging identified in a recent study (Kim et al., 2016) were evaluated. Fifty of these 100 articles were published in journals classified by Journal Citation Reports as specialized in neuroimaging, namely NeuroImage (n = 38) and Human Brain Mapping (n = 12). The other fifty articles were published in other journals, led by Magnetic Resonance in Medicine (n = 10). The full articles were individually copied into Microsoft Word and trimmed – headings/subheadings, mathematical equations, tables, figures, footnotes, appendices, and reference lists were removed, as described earlier (Sawyer et al., 2008; Jayaratne et al., 2014). To account for the differences in styles of in-text citations (e.g., superscript numbers versus lists of authors with publication year), the in-text citations were removed. Finally, there were two copies of processed text prepared for each article: abstract and full text.

As reviewed in the introduction, nine factors can potentially influence the readability of an article. Since we did not have access to the manuscripts to compare their contents before, during and after the editorial and peer-review process, we excluded this factor from the current investigation. Moreover, the authors considered the numbers of tables, figures and references as potential factors to be evaluated, but dropped them because a previous study showed that these factors were not associated with readability (Weeks and Wallace, 2002). Therefore, the following eight potential influencing factors (“Factors”) were recorded for each article:

1. Number of authors
2. Country of the institution of the corresponding author
3. Total citation count
4. Normalized citation count (i.e., total count divided by years since publication)
5. Publication year
6. Article type (i.e., experimental article, methodology paper or review/meta-analysis)
7. Impact factor of the journal in the publication year
8. Type of journal (i.e., journals specialized in neuroimaging or other journals)

Regarding authors, Hayden (2008) binarized the first language of the principal author into English and non-English, whereas Weeks and Wallace (2002) compared first authors working in the United States and the United Kingdom. We believed that, for the current study, it would be more appropriate to evaluate the country of the institutional affiliation of the corresponding author, because the corresponding author should be responsible for overseeing the work, and it is sometimes difficult to determine the first language of the authors, including the principal author.

The list of the 100 articles with their information on these eight Factors can be found in Supplementary Table S1.

Readability Assessment

The readability statistics were computed with the Readability Calculator (see footnote 1), a free online program used in previous studies to assess the readability of Web materials (Mcinnes and Haglund, 2011; Jayaratne et al., 2014). The edited text of each article was processed by this website to count the numbers of characters, words and sentences. The website also computed five readability indices that indicate the school grade appropriate for a given reading difficulty: the Automated Readability Index, Coleman-Liau Index, Flesch-Kincaid Grade Level, Gunning Fog index and Simple Measure of Gobbledygook index. All five indices approximate the United States school grade level estimated necessary to understand the text. Each uses a slightly different formula; in particular, the Automated Readability Index and Coleman-Liau Index rely on character counts, whereas the others rely on syllable counts. The Gunning Fog index has been the index most frequently used to evaluate the readability of journal articles (Roberts et al., 1994; Rochon et al., 2002; Weeks and Wallace, 2002), and the others have often been used to check the readability of journal articles as well as of website materials targeting patients (Sawyer et al., 2008; Jayaratne et al., 2014). An average reading grade level (AGL) was calculated by taking the mean of these five indices (Sawyer et al., 2008; Jayaratne et al., 2014). Besides the school grade indices, a Flesch Reading Ease (FRE) score was also computed for each article; this score has also been frequently used to evaluate the readability of journal articles (Roberts et al., 1994; Rochon et al., 2002; Weeks and Wallace, 2002; Plavén-Sigray et al., 2017). In contrast to the school grade indices, a higher FRE score indicates easier reading. The formulas of the abovementioned indices are listed in Table 1.

Table 1.

Formulas of the readability grade level indices and Flesch Reading Ease score.

Index Formula
Automated Readability Index 4.71 × (characters/word) + 0.5 × (words/sentence) – 21.43
Coleman-Liau Index 0.0588 × (characters/100 words) – 0.296 × (sentences/100 words) – 15.8
Flesch-Kincaid Grade Level (0.39 × words/sentence) + (11.8 × syllables/word) – 15.59
Gunning Fog index 0.4 × (words/sentence + percentage of ≥ 3-syllable words)
Simple Measure of Gobbledygook index 3 + √(number of ≥3-syllable words × 30/sentences)
Average reading grade level (AGL) Mean value of the above five indices
Flesch Reading Ease (FRE) score 206.835 – (1.015 × words / sentence) – (84.6 × syllables/word)
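For illustration only, the Table 1 formulas can be computed directly from raw counts. The following Python sketch is not part of the original study (which used the online Readability Calculator); the counts passed in the example are hypothetical.

```python
def readability_indices(chars, words, sentences, syllables, polysyllables):
    """Compute the five grade-level indices, AGL and FRE from raw counts,
    following Table 1. `polysyllables` = number of words with >= 3 syllables."""
    ari = 4.71 * (chars / words) + 0.5 * (words / sentences) - 21.43
    cli = 0.0588 * (100 * chars / words) - 0.296 * (100 * sentences / words) - 15.8
    fkgl = 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59
    fog = 0.4 * ((words / sentences) + 100 * polysyllables / words)
    smog = 3 + (polysyllables * 30 / sentences) ** 0.5
    agl = (ari + cli + fkgl + fog + smog) / 5
    fre = 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)
    return {"ARI": ari, "CLI": cli, "FKGL": fkgl, "Fog": fog,
            "SMOG": smog, "AGL": agl, "FRE": fre}

# Hypothetical counts for a ~200-word abstract:
print(readability_indices(chars=1084, words=194, sentences=9,
                          syllables=380, polysyllables=45))
```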

It should be noted that these formulae assume that readability is related to the number of long words or long sentences, without considering the contextual difficulty of the words, such as the use of jargon. Jargon, however, is difficult to define. The New Dale-Chall readability formula outputs a numerical value representing the comprehension difficulty of the surveyed text (Chall and Dale, 1995): Raw Score = 0.1579 × (% of difficult words) + 0.0496 × (words/sentence). It uses a list of 3,000 common words, and words outside this list are considered difficult. However, no published report has defined a common word list within the context of scientific journal articles, so the effect of jargon could not be assessed and this issue was not pursued for the 100 articles investigated.
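As a sketch only: given some list of familiar words (the 3,000-word Dale-Chall list, or a domain-specific list, which, as noted above, does not yet exist for scientific articles), the raw score could be computed as follows. The word list below is a toy placeholder, not a validated list.

```python
def dale_chall_raw_score(tokens, sentences, common_words):
    """New Dale-Chall raw score (Chall and Dale, 1995).
    `tokens` is a list of words, `sentences` the sentence count, and
    `common_words` a set standing in for the 3,000-word familiar list."""
    difficult = sum(1 for w in tokens if w.lower() not in common_words)
    pct_difficult = 100 * difficult / len(tokens)
    return 0.1579 * pct_difficult + 0.0496 * (len(tokens) / sentences)

# Toy example with a deliberately tiny, hypothetical familiar-word list:
common = {"the", "brain", "of", "and", "a", "to", "in", "is", "we", "study"}
tokens = "we study the resting brain and its connectivity in a scanner".split()
print(dale_chall_raw_score(tokens, sentences=1, common_words=common))
```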

Data Analysis

Cronbach’s alpha was computed for the scores from the five readability grade level indices to evaluate their internal consistency in representing the grade level of the trimmed abstracts and full texts, respectively. A value of 0.7 or above indicated acceptable to excellent internal consistency (Bland and Altman, 1997; Tavakol and Dennick, 2011). Paired t-tests were performed to evaluate if there were significant differences in readability between the trimmed abstracts and full texts of the articles.
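A minimal sketch of this internal-consistency check, assuming the five index scores are arranged in an articles × indices matrix (the study itself ran the analysis in SPSS; the demo values below are hypothetical):

```python
import numpy as np

def cronbach_alpha(scores: np.ndarray) -> float:
    """Cronbach's alpha for an (articles x indices) matrix of scores."""
    k = scores.shape[1]                         # number of indices (items)
    item_vars = scores.var(axis=0, ddof=1)      # variance of each index
    total_var = scores.sum(axis=1).var(ddof=1)  # variance of per-article sums
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

# Toy data: four articles scored by three indices (hypothetical values)
demo = np.array([[16.8, 15.9, 16.9],
                 [14.2, 13.1, 13.8],
                 [18.5, 17.0, 17.9],
                 [15.0, 14.4, 14.9]])
print(round(cronbach_alpha(demo), 2))
```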

Univariate linear regressions were separately performed to evaluate whether the abstract and full text readability (i.e., the AGL and FRE score) of the concerned articles was associated with each continuous independent variable, i.e., (1) the number of authors, (2) total citation count, (3) normalized citation count, (4) publication year and (5) impact factor. One-way ANOVAs were separately performed to evaluate whether there were significant associations between the readability data and the categorical independent variables, i.e., (1) countries of the institutions of the corresponding authors and (2) article types; (3) journal type, having only two levels, was tested by two-sample t-tests. Post hoc Tukey tests were conducted to reveal the significantly different pairs. Paired t-tests (Table 4) were performed to check whether there were significant differences in readability between the abstracts and full texts within each country, journal type, and article type.
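The univariate analyses described above (run in SPSS in the study) map onto standard statistical routines; a sketch with synthetic, hypothetical data:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
agl_abstract = rng.normal(17.2, 2.8, 100)   # hypothetical AGL, abstracts
agl_fulltext = rng.normal(14.2, 1.7, 100)   # hypothetical AGL, full texts
impact_factor = rng.uniform(2, 25, 100)     # hypothetical journal IFs

# Univariate linear regression of AGL on a continuous factor
reg = stats.linregress(impact_factor, agl_fulltext)
print(reg.rvalue**2, reg.pvalue)            # r2 and p-value, as in Table 3

# One-way ANOVA across a categorical factor (three article-type groups)
g1, g2, g3 = agl_fulltext[:37], agl_fulltext[37:85], agl_fulltext[85:]
print(stats.f_oneway(g1, g2, g3))

# Paired t-test: abstracts versus their own full texts
print(stats.ttest_rel(agl_abstract, agl_fulltext))
```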

Table 4.

Results from one-way ANOVA and two-sample t-tests.

Readability assessment (columns 2–3: abstracts; columns 4–5: full texts)

Factors AGLa FRE scoreb AGLa FRE scoreb
Country
United States 17.0 (2.7) 17.0 (12.8) 14.4 (1.6)∗∗ 30.8 (7.9)∗∗
United Kingdom 17.3 (2.3) 14.9 (12.8) 13.9 (1.6)∗∗ 34.3 (8.0)∗∗
Rest of the world 17.5 (3.7) 12.9 (19.4) 14.1 (2.0) 32.7 (10.7)∗∗
p = 0.779 p = 0.535 p = 0.338 p = 0.209
Article type
Experimental articles 16.6 (3.3) 17.8 (16.7) 14.0 (1.8)∗∗ 32.6 (8.7)∗∗
Methodology papers 17.1 (2.3) 16.1 (11.5) 13.8 (1.3)∗∗ 34.5 (7.0)∗∗
Reviews/meta-analyses 18.5 (2.7) 9.1 (13.7) 16.0 (1.4) 23.2 (7.4)
p = 0.085 p = 0.121 p < 0.001 p < 0.001
Journal type
Neuroimaging 17.6 (2.6) 13.7 (13.1) 14.2 (1.6)∗∗ 31.8 (8.9)∗∗
Others 16.7 (3.0) 17.8 (14.9) 14.2 (1.8)∗∗ 32.4 (8.3)∗∗
p = 0.103 p = 0.143 p = 0.864 p = 0.727

Standard deviations in parentheses. Supplementary paired t-tests for full text versus abstracts: p < 0.01, ∗∗p < 0.001. aAGL, Average reading grade level: ≤12 is high school level or below, 13–16 is college level, >16 is college graduate level. bFRE, Flesch Reading Ease scores: <30 is college graduate level, 30–50 is college level, >50 is high school level or below.

Finally, where the above univariate tests revealed significant associations, multi-way ANCOVAs were performed to evaluate whether any of these factors, when considered together, remained associated with the readability scores. All statistical analyses were performed in SPSS 24.0 (IBM, Armonk, New York, United States). Test results were considered significant at p < 0.05.
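An ANCOVA of this form is equivalent to an ordinary least squares model with the categorical factor dummy-coded alongside the continuous covariates. A sketch in Python with statsmodels, using hypothetical column names and synthetic data (the study itself used SPSS):

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(1)
n = 62  # articles with impact factor data available
df = pd.DataFrame({
    "agl": rng.normal(14.2, 1.7, n),
    "pub_year": rng.integers(1997, 2013, n),
    "impact_factor": rng.uniform(2, 25, n),
    "article_type": rng.choice(["experimental", "methodology", "review"], n),
})

# Multi-way ANCOVA: article type as a factor, year and IF as covariates
model = smf.ols("agl ~ C(article_type) + pub_year + impact_factor",
                data=df).fit()
print(model.summary())
```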

Results

The 100 articles were published between 1980 and 2012; 37 were experimental articles, 48 were methodology papers and 15 were reviews/meta-analyses. The number of authors per article ranged from 1 to 21 (mean ± SD: 5.6 ± 3.5), with three-fourths of the articles having two to seven authors. The institutions of the corresponding authors of 55 articles were located in the United States, followed by 27 in the United Kingdom and 18 in the rest of the world (five in Canada, four in France, three in Germany and one each in Australia, Austria, Sweden, Switzerland, Netherlands, and Denmark). Total citation counts ranged from 673 to 4,384 (1,328 ± 722), whereas normalized citation counts ranged from 24.9 to 313.1 (88.2 ± 59.3). For the journal impact factor of the respective publication years, data could be retrieved for only 62 articles (see footnote 2); it ranged from 1.914 to 24.520 (6.577 ± 4.14). The full texts had 5,484–107,338 characters, 1,027–19,795 words, 49–871 sentences and 14.6–28.5 words per sentence (Table 2).

Table 2.

Readability of the abstracts and full texts of the 100 neuroimaging articles.

Assessment item Abstracts, mean (SD) Full texts, mean (SD) p-value
Automated Readability Index 16.77 (3.68) 15.04 (1.37) <0.001
Coleman-Liau Index 15.89 (2.45) 13.24 (1.49) <0.001
Flesch-Kincaid Grade Level 16.92 (3.02) 13.83 (1.70) <0.001
Gunning Fog index 18.98 (3.34) 15.84 (1.85) <0.001
Simple Measure of Gobbledygook index 17.19 (2.34) 15.04 (1.37) <0.001
Average reading grade level (AGL)a 17.15 (2.81) 14.22 (1.66) <0.001
No. of characters 1,084 (382) 30,212 (16,949) N/A
No. of words 194 (69) 5,780 (3,113) N/A
No. of sentences 8.88 (4.86) 287.00 (149.51) N/A
No. of words/sentence 23.56 (5.83) 20.31 (2.74) <0.001
Flesch Reading Ease (FRE) Scoreb 15.70 (14.11) 32.11 (8.56) <0.001

Paired t-tests were performed for all assessment items. aAGL: ≤12 is high school level or below, 13–16 is college level, >16 is college graduate level. bFRE scores: <30 is college graduate level, 30–50 is college level, >50 is high school level or below.

Internal Consistency of the Five Readability Grade Level Indices

The five indices demonstrated excellent internal consistency in assessing the trimmed abstracts (α = 0.96) and full texts (α = 0.98) of the 100 articles. Therefore, the AGL was considered a representative score for the subsequent analyses.

Differences Between Abstracts and Full Texts

The detailed readability assessment results of the trimmed abstracts and full texts are listed in Table 2. The AGL of the trimmed abstracts and full texts corresponded to the college graduate and college sophomore levels, respectively. The difference was significant (t = 11.64, p < 0.001), indicating that the abstracts were less readable.

The AGL of each trimmed abstract and its corresponding full text is plotted in Figure 1: 37 abstracts and 84 full texts had an AGL < 16, i.e., below college graduate level. All but one of these 37 abstracts had a corresponding full text with AGL < 16; the exception had a full text AGL of 17.70. No abstract or full text had an AGL < 8, i.e., general public level.

FIGURE 1. Average reading grade level (AGL) of the 100 abstracts and their corresponding full texts. None of them had AGL < 8 (lower dotted line), the readability level of the general public. Thirty-seven abstracts and 84 full texts had AGL < 16 (upper dotted line), i.e., below college graduate level.

The FRE scores of the trimmed abstracts and full texts (Table 2) were considered “very difficult” and “difficult,” respectively (Flesch, 1948). The difference was significant (t = -13.30, p < 0.001), again indicating that the abstracts were less readable.

Readability of Articles Evaluated by Univariate Analyses

The number of authors (Factor 1), total citation count (Factor 3) and normalized citation count (Factor 4) had no significant association with the readability of the trimmed abstracts and full texts of the concerned articles (Table 3).

Table 3.

Results from univariate linear regressions.

Abstracts (first four data columns); full texts (last four data columns)

Factors AGL (r2) p-value FRE score (r2) p-value AGL (r2) p-value FRE score (r2) p-value
Author number 0.030 0.083 0.029 0.092 0.004 0.529 0.006 0.439
Total citation count 0.007 0.400 0.011 0.289 0.005 0.470 0.008 0.373
Normalized citation count 0.000 0.876 0.002 0.690 0.006 0.447 0.003 0.577
Publication year 0.023 0.131 0.016 0.208 0.066 0.010 0.053 0.021
Impact factor 0.007 0.525 0.003 0.677 0.114 0.007 0.087 0.020

r2, coefficient of determination. AGL, Average reading grade level. FRE, Flesch Reading Ease.

The country of the institution of the corresponding author (Factor 2) had no significant association with the readability of the trimmed abstracts and full texts (Table 4). No statistical test was attempted to compare corresponding authors from English-speaking countries (i.e., United States, United Kingdom, Canada, and Australia, n = 88) versus non-English-speaking countries (n = 12).

Publication year (Factor 5) had no significant association with the AGL or FRE score of the trimmed abstracts (Table 3). For the trimmed full texts, more recent publications had a higher AGL and a lower FRE score, meaning that they were less readable.

Article type (Factor 6) had no significant association with the AGL or FRE score of the trimmed abstracts (Table 4). For the trimmed full texts, reviews/meta-analyses were significantly less readable than experimental articles and methodology papers, with AGL/FRE scores of 16.0/23.2, 14.0/32.6 and 13.8/34.5, respectively (p < 0.001). These results imply that the readability of reviews/meta-analyses was at college graduate level, whereas that of experimental and methodology papers was at college level.

Impact factor (Factor 7) had no significant association with the AGL or FRE score of the trimmed abstracts (Table 3). For the trimmed full texts published after 1996, a higher impact factor was associated with a higher AGL (r2 = 0.114, p = 0.007) and a lower FRE score (r2 = 0.087, p = 0.020). This means that articles with higher impact factors were less readable.

Journal type (Factor 8) had no significant association with the readability of the trimmed abstracts and full texts (Table 4).

Multi-Way ANCOVA Results

Because the above univariate tests demonstrated that the readability of the trimmed full texts was associated with publication year (Factor 5), article type (Factor 6) and impact factor (Factor 7), these three factors were entered into multi-way ANCOVAs for the trimmed full text AGL and FRE score, respectively, to investigate whether these associations remained significant under multivariate analysis. The results indicated that AGL (Table 5) was significantly associated with article type (Factor 6) and impact factor (Factor 7), whereas the FRE score (Table 6) was significantly associated with article type (Factor 6) only. In brief, the readability of experimental articles and methodology papers was about two grade levels easier than that of reviews/meta-analyses. Moreover, the reading grade level increased by 0.100 for every one-point increase in impact factor.

Table 5.

Relationship between full text average reading grade level and the independent variables in the ANCOVA model.

Independent variables Estimates SE p-value Multiple comparison
Publication year -0.037 0.058 0.531
Impact factor 0.100 0.042 0.020
Article type <0.001 (1) = (2) < (3)
    Experimental articles (1) -2.017 0.488
    Methodology papers (2) -2.068 0.444
    Reviews/meta-analyses (3) 0
Intercept 89.179 116.714 0.448

Table 6.

Relationship between full text Flesch Reading Ease score and the independent variables in the final ANCOVA model.

Independent variables Estimates SE p-value Multiple comparison
Publication year 0.212 0.319 0.508
Impact factor -0.438 0.229 0.061
Article type <0.001 (1) = (2) > (3)
    Experimental articles (1) 9.240 2.672
    Methodology papers (2) 10.325 2.434
    Reviews/meta-analyses (3) 0
Intercept -399.926 639.094 0.534

Discussion

To the authors’ knowledge, this was the first study to evaluate the readability of neuroimaging articles; the 100 most-cited neuroimaging articles were collectively assessed for their readability. We confirmed the second hypothesis by showing that publication year had no significant effect on article readability. However, the first hypothesis had to be rejected, because the trimmed full texts were significantly more readable than the abstracts. Owing to the large variation in the backgrounds of the 100 papers of interest, we were able to identify factors that might be associated with readability. However, readers should be aware that the current results indicate associations only; no cause-and-effect relationship could be established.

Experimental articles and methodology papers were more readable than reviews/meta-analyses. Articles published after 1996 in journals with higher impact factors appeared to be less readable in terms of AGL.

Comparison of Readability of Scientific Papers From Other Research Fields

A survey of articles published in five leading peer-reviewed general medical journals reported an average FRE score of 15.4 (college graduate level) (Rochon et al., 2002). Similarly, surveys of articles published in Annals of Internal Medicine and the British Journal of Surgery reported average FRE scores of 29.1 and 23.8, respectively (college graduate level) (Roberts et al., 1994; Hayden, 2008). All these results (FRE score < 30) imply that the surveyed medical articles were as difficult to read as legal contracts (Roberts et al., 1994; Weeks and Wallace, 2002). The average FRE score of the 100 most-cited neuroimaging papers was 32.1 (college level), indicating that the surveyed articles could be considered more readable than medical articles (Table 2, footnote). Notwithstanding, they were still as difficult to read as corporate annual reports (Roberts et al., 1994). Moreover, these 100 neuroimaging papers were less readable than papers in marketing journals (average FRE score of 35.3, college level) (Sawyer et al., 2008), but more readable than papers in PLoS One (FRE scores mostly 20–30, college graduate level) (Plavén-Sigray et al., 2017).

Relevance of Results in Relation to Healthcare Journalists

No abstract or trimmed full text had an AGL < 8, suggesting that none of them could be read directly by the general public. This highlights the relevance of the readability of neuroimaging articles to health journalists. It has been recommended that researchers try to write articles in a more readable way, because the overall accuracy of newspaper reporting of neuroimaging articles is low, with minimal details (Illes et al., 2010; van Atteveldt et al., 2014). For instance, it is common for the public to believe that we use only 10% of the brain (Herculano-Houzel, 2002). Such misconceptions should not be created by the media, especially digital or social media coverage. Because the results of the current study revealed that reviews/meta-analyses were less readable, extra caution should be exercised when these papers are simplified and reported in the mass media. Previous surveys reported that the majority of healthcare journalists had a bachelor’s degree without a postgraduate degree, equivalent to a United States grade level of 16 (Viswanath et al., 2008; Friedman et al., 2014). The results of the current study indicated that the mean AGL of the 100 most-cited neuroimaging articles was 17.15 for abstracts and 14.22 for full texts; specifically, 37 abstracts and 84 full texts had an AGL < 16. In other words, the readability of the majority of the abstracts was at the college graduate level, consistent with a previous large-scale survey of biomedical and life science article abstracts (Plavén-Sigray et al., 2017). There is definitely a need to improve the readability of abstracts. Health journalists should be able to comprehend the average full text, though they might have some difficulty comprehending the average abstract. Perhaps health journalists need to consider reading the full texts to produce more accurate media coverage, unless abstracts published in the future become more readable. This is consistent with the finding that half of the surveyed healthcare journalists wanted closer communication with researchers when reporting their studies (Friedman et al., 2014).

Trimmed Full Texts Were More Readable Than Abstracts

Journal articles are one of the most important knowledge sources for academics and for graduate and university/college students. According to a recent survey, they are the most frequently used primary resource type for research, yet half of the surveyed researchers made annotations on fewer than one-third of the articles in their collections (Hemminger et al., 2007). That phenomenon is yet to be clarified; for instance, was it due to poor readability of the unannotated articles? Future surveys could establish the average reading grade of researchers and scientists working in the neuroimaging field, because this information is currently lacking. Healthcare and life science journals such as the Proceedings of the National Academy of Sciences and PLoS Medicine have adopted the practice of publishing plain-language summaries of research to disseminate findings to a broader audience of the general public, students, researchers and journalists (Shailes, 2017). The booming neuroimaging literature (Yeung et al., 2017a,c,e) certainly requires accurate, sufficient yet understandable accounts of the research performed to translate and disseminate the new knowledge gained (Illes et al., 2010). However, the current study revealed that full texts were significantly more readable than abstracts. This implies that authors of scientific reports should compose more readable abstracts, or that journal editors might demand such abstracts or edit them to make them more readable. It is imperative to make abstracts more readable, as they are often what journalists read and serve as the basis for news coverage (Gonon et al., 2011; Yavchitz et al., 2012). Alternatively, interested researchers and medical journalists might have to read the full text for better comprehension, which is often not practical if time does not allow.

The number of authors was not significantly associated with the readability of the articles. This is reasonable, because we expect the first and/or corresponding authors, rather than all authors, to do most of the writing. Moreover, researchers, including those in the neuroimaging field, should have high literacy levels, and all of these 100 most-cited neuroimaging articles were published in reputable journals. It was therefore not surprising to find that total citation count, normalized citation count, country of the institution of the corresponding author and journal type were not significantly associated with the readability of the articles.

Experimental Articles and Methodology Papers Were More Readable Than Reviews/Meta-Analyses

We were able to replicate the finding in the field of surgery that reviews and meta-analyses were less readable than experimental articles (Hayden, 2008). We are unsure of the reason behind this. However, this is a critical phenomenon: reviews and meta-analyses summarize the evidence from the existing literature, including clinical trials, and are therefore very informative. Referees and editors could effectively guide authors to make such manuscripts more readable (Remus, 1980; Roberts et al., 1994), for instance below the 16th grade level to match the academic background of healthcare journalists (Viswanath et al., 2008; Friedman et al., 2014). Methodology papers, on the other hand, explain procedures and the rationale behind them, and will potentially be followed by a massive number of fellow researchers; it is therefore also important for them to be readable.

Articles Published in Journals With Higher Impact Factors Appeared to Be Less Readable

There has been very limited research on the relationship of article readability to journal impact factor or article type. Previously, it was reported that abstract readability was not affected by journal impact factor within the fields of biology/biochemistry, chemistry, and the social sciences (Didegah and Thelwall, 2013). In the current study, we were able to replicate that finding. However, for the top-cited neuroimaging articles, trimmed full texts in journals with higher impact factors appeared to be less readable in terms of AGL. This was not too surprising, as papers published in prestigious medical journals with high impact factors, such as the British Medical Journal and the Journal of the American Medical Association, were also “extremely difficult to read” as evaluated by similar readability scores (Weeks and Wallace, 2002). Besides, high impact factor journals may have a compact format. For example, one of the surveyed articles was published in Nature Neuroscience (impact factor in 2003 = 15.1), which requires 2,000–4,000 words for the main text with no more than eight figures and/or tables. The implication is that the editorial boards of top journals with high impact factors might consider helping to improve the readability of abstracts and texts, especially of reviews and meta-analyses, for the better understanding of readers. This is important because research reports in high impact journals often receive more media attention (Gonon et al., 2011, 2012; Yavchitz et al., 2012), which highlights the importance of their readability in reducing translational challenges.

Study Limitations

The sample size of the current study was comparable to that of other readability surveys (Roberts et al., 1994; Weeks and Wallace, 2002; Hayden, 2008; Sawyer et al., 2008), but the current sample could only represent articles with considerable citation counts, hence those with the highest academic merit. The reason for choosing this cohort was that these most-cited articles were among the most influential ones and could potentially reach a broad audience. Readers should be aware that the papers evaluated in the current study were published over a wide time period (33 years), were of different types, and varied in number of authors, citation counts and impact factors. However, it is precisely this diverse background that made a correlation analysis possible, allowing us to ask which factors could be associated with better readability. We subsequently performed multi-way ANCOVAs to adjust for all this variability, instead of applying separate statistical analyses to individual factors.

This study provides the first insights into the readability of highly cited neuroimaging articles. Future studies are needed to investigate whether articles that have received less attention exhibit similar properties. It should be noted that the readability assessment utilized linguistic formulas but did not account for readers’ knowledge background. For instance, words with many syllables or long sentences might increase the grade level scores but might not cause as much difficulty for readers as jargon does. Moreover, it could be a limitation that readability was only computed automatically; we did not evaluate the actual depth of understanding, and hence the level of potential misconception, among journalists reading these 100 articles. Future studies should ask a cohort of journalists to rate the readability of a selection of articles, evaluate whether they can properly conceptualize the messages the authors tried to convey, and assess the potential relationship between poor readability and the occurrence of misconceptions. Another limitation was that we were able to retrieve impact factor data for only 62 of the 100 papers. Moreover, we did not evaluate whether the articles drew strong conclusions from weak data, a potential source of misconceptions besides readability.

Conclusion

In conclusion, among the 100 most-cited neuroimaging articles, the trimmed full texts appeared to be more readable than the abstracts. Experimental articles and methodology papers were more readable than reviews/meta-analyses. Articles published after 1996 in journals with higher impact factors appeared less readable in terms of AGL. Neuroimaging researchers may consider writing more readable papers, provided that their papers are intended to reach a broader audience, so that medical journalists, and hence the general public, can better comprehend them.

Author Contributions

AY conceived the work, acquired and analyzed the data, and drafted the work. TG and WL facilitated the acquisition of data and critically revised the work. All authors approved the final content of the manuscript.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors sincerely thank Ms. Samantha Kar Yan Li from the Centralised Research Lab, Faculty of Dentistry, The University of Hong Kong, for her assistance with statistics.

Funding. This work was substantially supported by a grant from the Research Grants Council of the Hong Kong Special Administrative Region, China (HKU 766212M).

Footnote 2: Web of Science (WoS) provides impact factor data from 1997 onward, but 32 articles were published before that year. Furthermore, six articles were published after 1997, but their respective journals did not have impact factor data recorded in WoS by the time of their publication.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnhum.2018.00308/full#supplementary-material

TABLE S1

List of the 100 used articles including the corresponding data collected.

References

1. Angell M., Kassirer J. P. (1994). Clinical research–what should the public believe? N. Engl. J. Med. 331 189–190. 10.1056/NEJM199407213310309
2. Ashburner J., Friston K. J. (2000). Voxel-based morphometry—the methods. Neuroimage 11 805–821. 10.1006/nimg.2000.0582
3. Beck D. M. (2010). The appeal of the brain in the popular press. Perspect. Psychol. Sci. 5 762–766. 10.1177/1745691610388779
4. Biswal B., Zerrin Yetkin F., Haughton V. M., Hyde J. S. (1995). Functional connectivity in the motor cortex of resting human brain using echo-planar mri. Magn. Reson. Med. 34 537–541. 10.1002/mrm.1910340409
5. Bland J. M., Altman D. G. (1997). Cronbach’s alpha. Br. Med. J. 314:572. 10.1136/bmj.314.7080.572
6. Carp J. (2012). The secret lives of experiments: methods reporting in the fMRI literature. Neuroimage 63 289–300. 10.1016/j.neuroimage.2012.07.004
7. Chall J. S., Dale E. (1995). Readability Revisited: The New Dale-Chall Readability Formula. Brookline, MA: Brookline Books.
8. de Vos F., Koini M., Schouten T. M., Seiler S., van der Grond J., Lechner A., et al. (2018). A comprehensive analysis of resting state fMRI measures to classify individual patients with Alzheimer’s disease. Neuroimage 167 62–72. 10.1016/j.neuroimage.2017.11.025
9. Didegah F., Thelwall M. (2013). Which factors help authors produce the highest impact research? Collaboration, journal and document properties. J. Informetr. 7 861–873. 10.1016/j.joi.2013.08.006
10. Eggener S. (1998). The power of the pen: medical journalism and public awareness. J. Am. Med. Assoc. 279 1400–1400. 10.1001/jama.279.17.1400-JMS0506-4-0
11. Flesch R. (1948). A new readability yardstick. J. Appl. Psychol. 32 221–233. 10.1037/h0057532
12. Friedman D. B., Tanner A., Rose I. D. (2014). Health journalists’ perceptions of their communities and implications for the delivery of health information in the news. J. Commun. Health 39 378–385. 10.1007/s10900-013-9774-x
13. Gao Y., Nie K., Huang B., Mei M., Guo M., Xie S., et al. (2017). Changes of brain structure in Parkinson’s disease patients with mild cognitive impairment analyzed via VBM technology. Neurosci. Lett. 658 121–132. 10.1016/j.neulet.2017.08.028
14. Gazni A. (2011). Are the abstracts of high impact articles more readable? Investigating the evidence from top research institutions in the world. J. Inf. Sci. 37 273–281. 10.1177/0165551511401658
15. Gonon F., Bezard E., Boraud T. (2011). Misrepresentation of neuroscience data might give rise to misleading conclusions in the media: the case of attention deficit hyperactivity disorder. PLoS One 6:e14618. 10.1371/journal.pone.0014618
16. Gonon F., Konsman J.-P., Cohen D., Boraud T. (2012). Why most biomedical findings echoed by newspapers turn out to be false: the case of attention deficit hyperactivity disorder. PLoS One 7:e44275. 10.1371/journal.pone.0044275
17. Guo Q., Parlar M., Truong W., Hall G., Thabane L., McKinnon M., et al. (2014). The reporting of observational clinical functional magnetic resonance imaging studies: a systematic review. PLoS One 9:e94412. 10.1371/journal.pone.0094412
18. Hayden J. D. (2008). Readability in the British Journal of Surgery. Br. J. Surg. 95 119–124. 10.1002/bjs.5994
19. Hemminger B. M., Lu D., Vaughan K., Adams S. J. (2007). Information seeking behavior of academic scientists. J. Assoc. Inf. Sci. Technol. 58 2205–2225. 10.1002/asi.20686
20. Herculano-Houzel S. (2002). Do you know your brain? A survey on public neuroscience literacy at the closing of the decade of the brain. Neuroscientist 8 98–110. 10.1177/107385840200800206
21. Herndon J. B., Chaney M., Carden D. (2011). Health literacy and emergency department outcomes: a systematic review. Ann. Emerg. Med. 57 334–345. 10.1016/j.annemergmed.2010.08.035
22. Howard-Jones P. A. (2014). Neuroscience and education: myths and messages. Nat. Rev. Neurosci. 15 817–824. 10.1038/nrn3817
23. Illes J., Moser M. A., McCormick J. B., Racine E., Blakeslee S., Caplan A., et al. (2010). NeuroTalk: improving the communication of neuroscience. Nat. Rev. Neurosci. 11 61–69. 10.1038/nrn2773
24. Jayaratne Y. S., Anderson N. K., Zwahlen R. A. (2014). Readability of websites containing information on dental implants. Clin. Oral Implants Res. 25 1319–1324. 10.1111/clr.12285
25. Joyce C., Kelly J., Sugrue C. (2014). A bibliometric analysis of the 100 most influential papers in burns. Burns 40 30–37. 10.1016/j.burns.2013.10.025
26. Kim H. J., Yoon D. Y., Kim E. S., Lee K., Bae J. S., Lee J.-H. (2016). The 100 most-cited articles in neuroimaging: a bibliometric analysis. Neuroimage 139 149–156. 10.1016/j.neuroimage.2016.06.029
27. Larsson A., Oxman A. D., Carling C., Herrin J. (2003). Medical messages in the media–barriers and solutions to improving medical journalism. Health Expect. 6 323–331. 10.1046/j.1369-7625.2003.00228.x
28. McCray A. T. (2005). Promoting health literacy. J. Am. Med. Inform. Assoc. 12 152–163. 10.1197/jamia.M1687
29. Mcinnes N., Haglund B. J. (2011). Readability of online health information: implications for health literacy. Inform. Health Soc. Care 36 173–189. 10.3109/17538157.2010.542529
30. O’Connor C., Rees G., Joffe H. (2012). Neuroscience in the public sphere. Neuron 74 220–226. 10.1016/j.neuron.2012.04.004
31. Plavén-Sigray P., Matheson G. J., Schiffler B. C., Thompson W. H. (2017). The readability of scientific texts is decreasing over time. eLife 6:e27725. 10.7554/eLife.27725
32. Poldrack R. A., Farah M. J. (2015). Progress and challenges in probing the human brain. Nature 526 371–379. 10.1038/nature15692
33. Powell A. G., Hughes D. L., Wheat J. R., Lewis W. G. (2016). The 100 most influential manuscripts in gastric cancer: a bibliometric analysis. Int. J. Surg. 28 83–90. 10.1016/j.ijsu.2016.02.028
34. Racine E., Bar-Ilan O., Illes J. (2005). fMRI in the public eye. Nat. Rev. Neurosci. 6 159–164. 10.1038/nrn1609
35. Racine E., Waldman S., Rosenberg J., Illes J. (2010). Contemporary neuroscience in the media. Soc. Sci. Med. 71 725–733. 10.1016/j.socscimed.2010.05.017
36. Remus W. (1980). Why academic journals are unreadable: the referees’ crucial role. Interfaces 10 87–90. 10.1287/inte.10.2.87
37. Roberts J. C., Fletcher R. H., Fletcher S. W. (1994). Effects of peer review and editing on the readability of articles published in Annals of Internal Medicine. JAMA 272 119–121. 10.1001/jama.1994.03520020045012
38. Rochon P. A., Bero L. A., Bay A. M., Gold J. L., Dergal J. M., Binns M. A., et al. (2002). Comparison of review articles published in peer-reviewed and throwaway journals. JAMA 287 2853–2856. 10.1001/jama.287.21.2853
39. Sawyer A. G., Laran J., Xu J. (2008). The readability of marketing journals: are award-winning articles better written? J. Mark. 72 108–117. 10.1509/jmkg.72.1.108
40. Shailes S. (2017). Plain-language summaries of research: something for everyone. eLife 6:e25411. 10.7554/eLife.25411
41. Shelley I. I. M., Schuh J. (2001). Are the best higher education journals really the best? A meta-analysis of writing quality and readability. J. Sch. Publ. 33 11–22. 10.3138/jsp.33.1.11
42. Tavakol M., Dennick R. (2011). Making sense of Cronbach’s alpha. Int. J. Med. Educ. 2 53–55. 10.5116/ijme.4dfb.8dfd
43. van Atteveldt N. M., van Aalderen-Smeets S. I., Jacobi C., Ruigrok N. (2014). Media reporting of neuroscience depends on timing, topic and newspaper type. PLoS One 9:e104780. 10.1371/journal.pone.0104780
44. Viswanath K., Blake K. D., Meissner H. I., Saiontz N. G., Mull C., Freeman C. S., et al. (2008). Occupational practices and the making of health news: a national survey of US health and medical science journalists. J. Health Commun. 13 759–777. 10.1080/10810730802487430
45. Weeks W. B., Wallace A. E. (2002). Readability of British and American medical prose at the start of the 21st century. Br. Med. J. 325 1451–1452. 10.1136/bmj.325.7378.1451
46. Wrafter P. F., Connelly T. M., Khan J., Devane L., Kelly J., Joyce W. P. (2016). The 100 most influential manuscripts in colorectal cancer: a bibliometric analysis. Surgeon 14 327–336. 10.1016/j.surge.2016.03.001
47. Yavchitz A., Boutron I., Bafeta A., Marroun I., Charles P., Mantz J., et al. (2012). Misrepresentation of randomized controlled trials in press releases and news coverage: a cohort study. PLoS Med. 9:e1001308. 10.1371/journal.pmed.1001308
48. Yeung A. W. K. (2017a). Do neuroscience journals accept replications? A survey of literature. Front. Hum. Neurosci. 11:468. 10.3389/fnhum.2017.00468
49. Yeung A. W. K. (2017b). Identification of seminal works that built the foundation for functional magnetic resonance imaging studies of taste and food. Curr. Sci. 113 1225–1227.
50. Yeung A. W. K. (2018a). An updated survey on statistical thresholding and sample size of fMRI studies. Front. Hum. Neurosci. 12:16. 10.3389/fnhum.2018.00016
51. Yeung A. W. K. (2018b). Bibliometric study on functional magnetic resonance imaging literature (1995–2017) concerning chemosensory perception. Chemosens. Percept. 11 42–50. 10.1007/s12078-018-9243-0
52. Yeung A. W. K., Goto T. K., Leung W. K. (2017a). A bibliometric review of research trends in neuroimaging. Curr. Sci. 112 725–734. 10.18520/cs/v112/i04/725-734
53. Yeung A. W. K., Goto T. K., Leung W. K. (2017b). At the leading front of neuroscience: a bibliometric study of the 100 most-cited articles. Front. Hum. Neurosci. 11:363. 10.3389/fnhum.2017.00363
54. Yeung A. W. K., Goto T. K., Leung W. K. (2017c). Basic taste processing recruits bilateral anteroventral and middle dorsal insulae: an activation likelihood estimation meta-analysis of fMRI studies. Brain Behav. 7:e00655. 10.1002/brb3.655
55. Yeung A. W. K., Goto T. K., Leung W. K. (2017d). Brain responses to stimuli mimicking dental treatment among non-phobic individuals: a meta-analysis. Oral Dis. 10.1111/odi.12819 [Epub ahead of print].
56. Yeung A. W. K., Goto T. K., Leung W. K. (2017e). The changing landscape of neuroscience research, 2006–2015: a bibliometric study. Front. Neurosci. 11:120. 10.3389/fnins.2017.00120
57. Yeung A. W. K., Goto T. K., Leung W. K. (2018a). Affective value, intensity and quality of liquid tastants/food discernment in the human brain: an activation likelihood estimation meta-analysis. Neuroimage 169 189–199. 10.1016/j.neuroimage.2017.12.034
58. Yeung A. W. K., Heinrich M., Atanasov A. G. (2018b). Ethnopharmacology—a bibliometric analysis of a field of research meandering between medicine and food science? Front. Pharmacol. 9:215. 10.3389/fphar.2018.00215
