“…though a Philosopher need not be sollicitous that his style should delight its Reader with his Floridnesse, yet I think he may very well be allow’d to take a Care that it disgust not his Reader by its Flatness, especially when he does not so much deliver Experiments or explicate them, as make Reflections or Discourses on them; for on such Occasions he may be allow’d the liberty of recreating his Reader and himself, and manifesting that he declin’d the Ornaments of Language, not out of Necessity, but Discretion…”—Robert Boyle, Proëmial Essay [1].
Scientists receive (and offer) much advice on how to write an effective paper that their colleagues will read, cite, and celebrate [2–15]. Fundamentally, the advice is similar to that given to journalists: keep the text short, simple, bold, and easy to understand. Many resources recommend the parsimonious use of adjectives and adverbs, the use of present tense, and a consistent style. Here we put this advice to the test, and measure the impact of certain features of academic writing on success, as proxied by citations.
The abstract epitomizes the scientific writing style, and many journals force their authors to follow a formula—including a very strict word-limit, a specific organization into paragraphs, and even the articulation of particular sentences and claims (e.g., “Here we show that…”).
For our analysis, we collected more than one million abstracts from eight disciplines, spanning 17 years. The disciplines were chosen so that biology was represented by three allied fields (Ecology, Evolution, and Genetics). We drew upon a wide range of comparison disciplines, namely Analytic Chemistry, Condensed Matter Physics, Geology, Mathematics, and Psychology (see table in S1 Text). We measured whether certain features of the abstract consistently led to more (or fewer) citations than expected, after accounting for other factors that certainly influence citations, such as article age (S1 Fig), number of authors and references, and the journal in which it was published.
We organized the most frequent suggestions into “Ten Simple Rules,” and probed them by testing a variety of features from the abstracts. Because the style and requirements for abstracts can vary dramatically between journals (S2 Fig), we normalized all the measures according to their distribution for each journal (S1 Text).
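As an illustration of the per-journal normalization described above, here is a minimal sketch. It assumes that normalization means a z-score within each journal; the exact procedure is given in S1 Text and may differ. The record layout, the field names (`journal`, `n_words`), and the function name `normalize_by_journal` are hypothetical.

```python
from collections import defaultdict
from statistics import mean, pstdev


def normalize_by_journal(records, feature):
    """Return z-scores of `feature`, computed separately within each journal.

    Assumption: "normalizing" is taken to mean subtracting the journal mean
    and dividing by the journal standard deviation.
    """
    by_journal = defaultdict(list)
    for rec in records:
        by_journal[rec["journal"]].append(rec[feature])

    # Mean and (population) standard deviation of the feature in each journal.
    stats = {j: (mean(v), pstdev(v)) for j, v in by_journal.items()}

    z_scores = []
    for rec in records:
        mu, sigma = stats[rec["journal"]]
        # If every abstract in a journal has the same value, report 0.0.
        z_scores.append((rec[feature] - mu) / sigma if sigma > 0 else 0.0)
    return z_scores


# Hypothetical toy data: two journals with very different typical lengths.
records = [
    {"journal": "A", "n_words": 100},
    {"journal": "A", "n_words": 200},
    {"journal": "B", "n_words": 300},
    {"journal": "B", "n_words": 500},
]
z = normalize_by_journal(records, "n_words")
```

After normalization, a 200-word abstract in journal A and a 500-word abstract in journal B receive the same score, since each is equally far above what is typical for its own journal.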
Rule 1: Keep It Short
This is the most universally accepted piece of advice given to writers [3,7,9,11–13]. We tested this by examining the effect of shorter abstracts on citation, measuring the number of words (Rule 1a [R1a]) and number of sentences (R1b) in each abstract.
Rule 2: Keep It Compact
The typical advice is to keep sentences or phrasing short, break compound sentences into simpler sentences, and remove any “unnecessary” words [2–6,9–12,14]. We evaluated this by measuring the effect of having sentences shorter than the mean for the journal where the article was published (R2).
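The raw quantities behind Rules 1 and 2 can be sketched as follows. This is only an illustration: the regular-expression tokenizer and sentence splitter are simplifying assumptions, not the procedure used in the study, and the function name `length_features` is hypothetical.

```python
import re


def length_features(abstract):
    """Count words and sentences, and derive the mean sentence length."""
    # Naive sentence split: break on whitespace that follows ., !, or ?
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", abstract.strip()) if s]
    # Naive word tokenizer: runs of letters, digits, apostrophes, hyphens.
    words = re.findall(r"[A-Za-z0-9'-]+", abstract)
    n_words = len(words)
    n_sentences = len(sentences)
    return {
        "n_words": n_words,          # corresponds to R1a
        "n_sentences": n_sentences,  # corresponds to R1b
        # Raw quantity behind R2, to be compared with the journal mean.
        "words_per_sentence": n_words / n_sentences if n_sentences else 0.0,
    }


features = length_features(
    "We study finches. Beak size varies with diet. Here we show why."
)
```

A splitter this simple will miscount sentences containing abbreviations such as "e.g." or decimal numbers, so a real analysis would need a more careful tokenizer.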
Rule 3: Keep It Simple
Canonical advice includes the prescription to use plain language and avoid jargon and technical terms [2–4,7,10,12,14]. Many of the most prominent journals state that their abstracts should be accessible to scientists working in different disciplines. To test this, we measured the proportion of words in the abstract that are found in a standard English dictionary (R3a) and that are present in a dictionary of “easy words” (R3b).
Rule 4: Use the Present Tense
Stylists recommend the use of the present tense [10,12], as it is more direct and deemed easier to understand for non-native speakers. We assessed this by ascertaining the ratio of (present tense)/(present + past tense) (R4).
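The ratio for R4 can be illustrated with a short sketch. It assumes the abstract has already been part-of-speech tagged with Penn Treebank tags, where `VBP` and `VBZ` mark present-tense verbs and `VBD` marks past-tense verbs; the tagging step itself, and whether the study counted exactly these tags, are not specified here. The function name `present_tense_ratio` is hypothetical.

```python
PRESENT_TAGS = {"VBP", "VBZ"}  # Penn Treebank present-tense verb tags
PAST_TAGS = {"VBD"}            # Penn Treebank past-tense verb tag


def present_tense_ratio(tagged_tokens):
    """Return present / (present + past) over (word, tag) pairs.

    Returns None when the text contains no present- or past-tense verbs,
    because the ratio is undefined in that case.
    """
    present = sum(1 for _, tag in tagged_tokens if tag in PRESENT_TAGS)
    past = sum(1 for _, tag in tagged_tokens if tag in PAST_TAGS)
    total = present + past
    return present / total if total else None


# Hypothetical pre-tagged fragment: two present-tense verbs, one past-tense.
tagged = [
    ("We", "PRP"), ("show", "VBP"), ("that", "IN"), ("beaks", "NNS"),
    ("grew", "VBD"), ("and", "CC"), ("diet", "NN"), ("matters", "VBZ"),
]
ratio = present_tense_ratio(tagged)
```

Here the ratio is two present-tense verbs out of three tensed verbs. Participles and base forms are ignored in this sketch, which is a design choice rather than something stated in the text.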
Rule 5: Avoid Adjectives and Adverbs
Using few adjectives and adverbs avoids fluff and keeps the text short and easy to understand [4,8,9,12]. We measured the effect of having a proportion of adjectives and adverbs smaller than that typical for the journal (R5).
Rule 6: Focus
Many authors suggest sticking to a single point, and reiterating the “take home” message [5,6,11,13,14]. We captured this with the proportion of words in the abstract that were also keywords (R6).
Rule 7: Signal Novelty and Importance
There is conflicting advice on whether to explicitly state the significance of your work. Stressing that the work is novel and solves important problems helps to “sell” the article [12,15]. Opponents of this rule say that all published work should already meet these criteria [8,13]. We examined this by checking whether the abstract contained at least one word signaling novelty (e.g., “novel,” “new,” “innovative” [R7a]) and, separately, a word signaling importance (e.g., “key,” “significant,” “crucial” [R7b]).
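A check of this kind can be sketched as a simple word-list lookup. The word lists below contain only the examples quoted above; the full lists used in the study are presumably longer and are not reproduced here. The function name `signal_flags` is hypothetical.

```python
import re

# Only the example words given in the text; the study's full lists may differ.
NOVELTY_WORDS = {"novel", "new", "innovative"}
IMPORTANCE_WORDS = {"key", "significant", "crucial"}


def signal_flags(abstract):
    """Report whether the abstract has at least one word from each list."""
    # Lowercase and keep alphabetic runs, so punctuation does not block a match.
    tokens = set(re.findall(r"[a-z]+", abstract.lower()))
    return {
        "novelty": bool(tokens & NOVELTY_WORDS),        # corresponds to R7a
        "importance": bool(tokens & IMPORTANCE_WORDS),  # corresponds to R7b
    }


flags = signal_flags("We present a novel method for dating zircons.")
```

Matching is on whole words, so "newly" or "significantly" would not count in this sketch. Whether inflected forms should count is a choice the analyst has to make.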
Rule 8: Be Bold
Many authors suggest “selling” the work forcefully and stressing positive results. We tested this by measuring the ratio (superlatives)/(superlatives + comparatives) (R8).
Rule 9: Show Confidence
Similarly, using too many “hedge words” (e.g., “somewhat,” “speculative,” “appear,” “almost,” “largely”) can signal a lack of confidence in the work. We explored this by measuring the effect of having fewer hedge words in the abstract (R9).
Rule 10: Avoid Evocative Words
A style perceived as too flowery or involving the overuse of highly evocative words is discouraged. We tested whether using words perceived as “pleasant,” “active,” or “easy to imagine” led to more citations than those for abstracts containing “unpleasant,” “passive,” or “hard to imagine” words [16–18] (R10a–c).
Results
In Fig 1, we report the sign of the effect associated with each abstract feature (column) for each discipline (row). Surprisingly, half of the typical suggestions—including those that are most common, about brevity and clarity—are associated with a significant decrease in citations.
We find that shorter abstracts (fewer words [R1a] and fewer sentences [R1b]) consistently lead to fewer citations, with short sentences (R2) being beneficial only in Mathematics and Physics. Similarly, using more (rather than fewer) adjectives and adverbs is beneficial (R5). Also, writing an abstract with fewer common (R3a) or easy (R3b) words results in more citations.
The use of the present tense (R4) is beneficial in Biology and Psychology, while it has a negative impact in Chemistry and Physics, possibly reflecting differences in disciplinary culture.
While matching the keywords (R6) leads to universally negative outcomes, signaling the novelty and importance of the work (R7) has positive effects. The use of superlatives (R8) is also positive, while avoiding “hedge” words (R9) is negative in Biology and Physics, but positive in Chemistry.
Finally, choosing “pleasant,” “active,” and “easy to imagine” words (R10) has positive effects across the board.
When we measured effect sizes (Fig 2), we found that abstract features can have a strong influence on citations. Being one standard deviation above the mean for a given feature (with respect to the mean for the corresponding journal) can increase citations by 4.6% (Mathematics [R7a]), or decrease them by 7.2% (Geology [R1a]). When analyzing each journal separately, we find qualitatively the same results (S3–S10 Figs).
Conclusions
We have found that—when it comes to abstracts—“more is more,” despite clear and abundant advice to the contrary.
This is an interesting and surprising result. An intriguing hypothesis is that scientists have different preferences for what they would like to read versus what they are going to cite. Despite the fact that anybody in their right mind would prefer to read short, simple, and well-written prose with few abstruse terms, when building an argument and writing a paper, the limiting step is the ability to find the right article. For this, scientists rely heavily on search techniques, especially search engines, where longer and more specific abstracts are favored. Longer, more detailed, prolix prose is simply more available for search. This likely explains our results, and suggests the new landscape of linguistic fitness in 21st century science. Future studies could investigate the relationship between stylistic features and retrievability directly, as well as the strength of the relationship between retrievability and citation performance.
Another interesting finding is that there is very little variation across disciplines, with only three out of fifteen features displaying sign changes among the diverse fields we examined.
Scientists are skeptical by disposition, and this exercise shows that, rather than taking advice at face value, they can apply the same machinery they use to interrogate nature to put these recommendations to the test, and then write a lengthy, convoluted, highly indexable, self-describing abstract.
Supporting Information
Acknowledgments
Thanks to G. Barabás, M. Begun, J. Grilli, P. McMahan, E. Sander, M.J. Smith, and M. Teplitskiy for comments and discussion.
Funding Statement
CJW and SA are supported by NSF #1042164, JAE by NSF #1158803. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1. Boyle R. A proemial essay, wherein, with some considerations touching experimental essays in general, is interwoven such an introduction to all those written by the author, as it is necessary to be perus’d for the better understanding of them. In: Certain physiological essays and other tracts written at distant times, and on several occasions by the honourable Robert Boyle; wherein some of the tracts are enlarged by experiments and the work is increased by the addition of a discourse about the absolute rest in bodies. 2nd ed. Henry Herringman, republished by University of Michigan, Digital Library Production Service; 1669. p. 12–13. Available from: http://quod.lib.umich.edu/e/eebo/A28944.0001.001?view=toc.
- 2. Paul JK. Scientific writing. Oral Surgery, Oral Medicine, Oral Pathology. 1970;30(2):185–191.
- 3. Lilleyman J. How to write a scientific paper–a rough guide to getting published. Archives of disease in childhood. 1995;72(3):268.
- 4. Evans M. Writing for publication. British Journal of Oral and Maxillofacial Surgery. 1998;36(3):161–164.
- 5. Alexandrov AV. How to write a research paper. Cerebrovascular diseases. 2004;18(2):135–138.
- 6. Chiswick M. Writing a research paper. Current Paediatrics. 2004;14(6):513–518.
- 7. Cunningham S. How to… write a paper. Journal of Orthodontics. 2004;31(1):47–51.
- 8. Thrower PA. Writing a scientific paper: I. Titles and abstracts. Carbon. 2007;45(11):2143–2144.
- 9. Van Way CW. Writing a scientific paper. Nutrition in Clinical Practice. 2007;22(6):636–640.
- 10. Fahy K. Writing for publication: the basics. Women and Birth. 2008;21(2):86–91. doi:10.1016/j.wombi.2007.12.005
- 11. Christensen NB, Kume H, Autorino R. How to write titles and abstracts for readers. International Journal of Urology. 2009;16(1):2–3. doi:10.1111/j.1442-2042.2008.02228.x
- 12. Davidson A, Delbridge E. How to write a research paper. Paediatrics and Child Health. 2012;22(2):61–65.
- 13. Mack C. How to write a good scientific paper: title, abstract, and keywords. Journal of Micro-Nanolithography MEMS and MOEMS. 2012;11(2):020101.
- 14. Cals JW, Kotz D. Effective writing and publishing scientific papers, part II: title and abstract. Journal of clinical epidemiology. 2013;66:585. doi:10.1016/j.jclinepi.2013.01.005
- 15. Reis SRN, Reis AI. How to write your first scientific paper. In: Interdisciplinary Engineering Design Education Conference (IEDEC), 2013 3rd. IEEE; 2013. p. 181–186.
- 16. Sweeney K, Whissell C. A dictionary of affect in language: I. Establishment and preliminary validation. Perceptual and motor skills. 1984;59(3):695–698.
- 17. Whissell C. The dictionary of affect in language. Emotion: Theory, research, and experience. 1989;4(113–131):94.
- 18. Whissell C. Using the revised dictionary of affect in language to quantify the emotional undertones of samples of natural language. Psychological reports. 2009;105(2):509–521.