ABSTRACT
The field of microbiology has experienced significant growth due to transformative advances in technology and the influx of scientists driven by a curiosity to understand how microbes sustain myriad biochemical processes that maintain Earth. With this explosion in scientific output, a significant bottleneck has been the ability to rapidly disseminate new knowledge to peers and the public. Preprints have emerged as a tool that a growing number of microbiologists are using to overcome this bottleneck. Posting preprints can help to transparently recruit a more diverse pool of reviewers prior to submitting to a journal for formal peer review. Although the use of preprints is still limited in the biological sciences, early indications are that preprints are a robust tool that can complement and enhance peer-reviewed publications. As publishing moves to embrace advances in Internet technology, there are many opportunities for preprints and peer-reviewed journals to coexist in the same ecosystem.
KEYWORDS: microbiology, open science, publishing, peer review
PERSPECTIVE
A preprint is an interim research product that is made publicly available before going through an official peer review process with the goals of soliciting feedback, accelerating dissemination of results, establishing priority, and publicizing negative results (1–5). Authors can post their article to a preprint server for others to read, share, and comment. In the 1960s, information exchange groups were the first formal attempt to broadly disseminate paper-based preprints among physicists and biologists (6, 7). Although the biological community’s commitment to preprints waned by 1967, the physics community persisted and eventually adopted what is now the arXiv (pronounced “archive”) preprint server that was hosted at the Los Alamos National Laboratories from 1991 to 1999 and then at Cornell University (8). For some physicists and mathematicians, posting a preprint to arXiv, optionally followed by submission to a peer-reviewed journal, has become a standard publication pathway. Although arXiv has hosted a number of computational biology preprints, the server has not drawn widespread attention from biologists. Among proponents of arXiv, preprints have aided in the development of research communication by accelerating the release of science and helping authors reach a wider audience for critique and establishment of priority (9). Considering the broadening adoption of preprints among microbiologists, I sought to explore the specific uses of and concerns regarding preprints.
LANDSCAPE OF PREPRINT SERVERS
In 2013, two preprint servers, bioRxiv (pronounced “bio-archive”) and PeerJ PrePrints, were launched as preprint servers for biologists that would parallel arXiv (10). According to information provided on the bioRxiv and PeerJ PrePrints websites and my personal experiences, both platforms offer similar features. Preprint posting is free. Each preprint receives a digital object identifier (doi) that facilitates the ability to cite preprints in other scholarly work. If the preprint is ever published, the preprint is linked to the published version. The submission process for both options is relatively simple, allowing authors to upload a PDF version of their preprint and supplemental materials. Preprints are typically publicly available in about 24 h. The preprint websites have built-in venues for authors to discuss their research with people who leave comments on the article. Preprints undergo a basic screening process to remove submissions with offensive or nonscientific content. The sites provide article-level metrics indicating the number of times an abstract has been accessed or the preprint has been downloaded. There are several important differences between the two options. First, PeerJ PrePrints is a for-profit organization, and bioRxiv is a nonprofit organization sponsored by Cold Spring Harbor Laboratory. This difference can be meaningful to authors, since some journals, including the American Society for Microbiology (ASM) journals, will accept only submissions that have been posted on preprint servers hosted by not-for-profit organizations (11). Second, preprints at PeerJ PrePrints are posted under the Creative Commons Attribution License (CC-BY), and bioRxiv preprints can be posted under one of four CC-BY licenses or with no permission for reuse. This can be relevant for authors hoping to submit their work to Proceedings of the National Academy of Sciences, as the journal will not consider articles posted as preprints under a CC-BY license. The National Institutes of Health (NIH) encourages authors to post preprints under the CC-BY or public domain licenses (5). The flexibility of the bioRxiv licensing empowers authors to choose the model that best suits them, while ensuring the rapid posting of their research results; however, it is important to provide clear information to authors on the legal and practical tradeoffs of each option. A cosmetic, but still relevant, difference is the layout and feel of the two websites. Compared to the bioRxiv site (see Fig. S1 in the supplemental material), the PeerJ PrePrints site is more fluid, gives readers the ability to “follow” a preprint, and provides better access to article keywords and the ability to search preprints (Fig. S2). With broader acceptance of preprints by traditional journals, many journals, including all of the ASM journals, have established mechanisms to directly submit articles that are posted as preprints on bioRxiv. The only direct submission mechanism for papers submitted at PeerJ PrePrints is to the PeerJ journal. In many ways, preprint servers have taken on the feel of a journal. As adoption of this approach expands, it is likely that the features of these sites will continue to improve. It is also worth noting that numerous other opportunities exist for other forms of interim research products (e.g., blog posts, videos, protocols, etc.) to obtain doi’s that make the work citable. As these possibilities increase, the preprint landscape risks becoming fractured.
One solution to the fracturing of the preprint landscape would be the creation of indexing sites that allow a user to easily search for content across multiple preprint servers. Several examples of these efforts already exist, and it is likely that these interfaces and their ability to span the landscape will improve. For example, although Google Scholar includes preprints hosted at bioRxiv and PeerJ PrePrints in their search results, PubMed and Web of Science do not. A relatively new example of what this might look like is PrePubMed (http://www.prepubmed.org), which seeks to index preprints from numerous sources. A more organized effort is being initiated with funding through ASAPbio to create a “Central Service” that would aggregate preprints in the life sciences (12). As preprint servers and other content providers begin to look and act like traditional journals by incorporating features and interfaces, it is important to value the strength of the preprint—that of an interim research product that is nimble and quickly posted. It is therefore essential to balance the requirements placed on authors for features associated with preprints with the efficiency of the preprint format.
SPECIFIC CHALLENGES FOR MICROBIOLOGY
Although preprints offer an efficient and novel venue for disseminating microbiology research, there are several considerations that the scientific community and those that oversee preprint servers must consider. It is critical that assurances be given that policies are in place to address these issues and that these policies are made transparent. First, attention has to be given to dual-use research of concern (DURC), since microbiology-related research could offer insights to individuals seeking to engage in inappropriate activities. Second, for researchers engaging in research that involves human subjects and other vertebrates, it is critical that assurances be made that institutional oversight committees have been consulted and have approved of the research. Third, there is significant concern regarding researchers disclosing potential conflicts of interest that could affect a project’s experimental design, analysis, and interpretation of results. Finally, recent expansions in scientific publishing have revealed numerous cases of plagiarism or misconduct. Again, while hoping to maintain the efficiency of the preprint format, traditional microbiology journals have screening procedures and oversight committees that address these issues. Similar efforts need to be implemented by preprint servers. As preprint usage continues to expand, many of these problems may also grow, similar to the experiences within the traditional publishing industry as it expanded.
ACCEPTANCE OF PREPRINTS BY JOURNALS
An early controversy encountered by researchers interested in posting their work as preprints as a stage in disseminating their research was whether it constituted prior publication (13). The broad consensus of the International Committee of Medical Journal Editors and numerous journals is that preprints do not constitute prior publication (14). This consensus is reflected in the current policies of journals that commonly publish microbiology research, including those published by ASM, the Microbiology Society, International Society for Microbial Ecology, PLOS, the Proceedings of the National Academy of Science, Science, Nature, Journal of Infectious Diseases, and Cell Press. Each takes a generally permissive stance toward posting of preprints prior to submission. Comprehensive lists of journals’ attitudes toward preprints are available online and are regularly updated (15, 16). Considering the relatively fluid nature of many of these policies and the journals’ specific policies, prospective authors should be aware of the positions taken by the journals where they may eventually submit their work.
PREPRINTS AND PEER REVIEW
The use of preprints for citations in other scientific reports and grant proposals has recently been called into question (17). It is important to note that the peer review process was adapted to the technologies and trends that have evolved over the past 100 years. The formal peer review system that most journals currently use was not developed until the end of the 1800s with the advent of typewriters and carbon paper (18). Editorial decisions were typically made by a single person or a committee (i.e., the editorial board) who had expertise that covered the scope of the journal. As science became more specialized, new journals would form to support and provide a source of validation for the new specialty. The growth in science in the mid-1900s resulted in a shift from journals struggling to find sufficient numbers of articles to publish to having too many articles submitted. It has been argued that the widespread adoption of decentralized peer review was due to the increased specialization and to deal with the large number of article submissions (19). Peer review did not achieve widespread use at many journals, including the Journal of Bacteriology, until the 1940s and 1950s. Thus, the “tradition” of peer review is only 70 years old. Given the rapid advances in communication technology and even greater specialization within microbiology, it is worth pondering whether the current scientific publishing system and peer review system, in particular, need to continue to adapt with our science.
Communicating research has traditionally been done within research group meetings, departmental seminars, and conferences and as publications. Along this continuum, there is an assumption that the quality of the science has been improved because it has been vetted by more experts in the field. The public dissemination of one’s research is a critical component of the scientific method. By describing their research, scientists subject their work to formal and informal peer review. Their research is scrutinized, praised, and probed to identify questions that help seed the next iteration of the scientific method. A common critique of more modern approaches to publishing has been an inability to assess the quality of the science without the validation of peer review. Attached to assertions of the validity of the research have been assertions of the impact and robustness of the research. These are all quality assessments that many acknowledge are difficult to assess by the traditional peer review process. This has led to some journals, most notably PLOS ONE, calling for referees to place a reduced emphasis on the perceived impact or significance of the work. It has also led to the call for replacing or complementing prepublication peer review with postpublication peer review using PubMed Commons, PubPeer, journal-based discussion forums, F1000Research, and other mechanisms. Alas, if scientists are going to depend on postpublication peer review or informal methods of peer review for documents like preprints, they must be willing to provide constructive feedback on the work of others.
PREPRINTS HAVE THE POTENTIAL TO CHANGE THE ADVANCEMENT OF SCIENCE
Preprints are often viewed as existing in a state of scientific limbo. As noted above, they represent a formal communication, but an interim one, not officially published. As the use of preprints grows and scientists’ perceptions of preprints matures, there are a number of issues that will need to be addressed.
First, a common concern is that if a researcher posts their work as a preprint, it will be “scooped” by another researcher, and the preprint author will lose their ability to claim primacy or their ability to publish the work in a journal. Considering the preprint as a citable work with a doi, it would, in fact, be the preprint author that scooped the second. Furthermore, a preprint could prevent getting scooped, since a preprint would indicate to others in the field that the work had already been done, which would prevent wasted time and effort. The use of preprints uncouples the communication of the discovery from the relevance of the discovery, which will come later based on peer review, comments from other scientists at meetings or online, and eventually citations. A growing number of scientific societies and journals, including ASM, view preprints as citable and as having a legitimate claim to primacy (1, 20–22); however, it remains to be determined whether the journals will stand by these policies. Some scientists worry that with such protection, a researcher can make a claim without valid data to support their claims (3). This is possible; however, it is also the responsibility of the scientific community to utilize the peer review mechanisms that are available to comment on those preprints, pointing out methodological problems or to indicate that they are speaking beyond the data. As preprints gain broader adoption, the tension between establishing primacy and the completeness of the preprint may test the policies of preprint-friendly journals.
A second area of concern is whether a preprint can be used to support a grant proposal. Given the length limitations placed on grant proposals by funding agencies, there is a push to cite previous work to indicate a research team’s competence in an area or to provide preliminary data. The National Institutes of Health recently released a notice clarifying their position on the use of preprints and synthesizing feedback that they received as part of a request for information (5). In this notice, the NIH indicated that preprints can be cited anywhere that other research is cited, including research plans, bibliographies, biosketches, and progress reports. Some fear that the use of preprints will allow scientists to circumvent page limits by posting preliminary reports (23). One would hope that both consumers of preprints and grant proposal reviewers would be able to differentiate between someone trying to game the system and someone that is using preprints as a mechanism to improve their science. This would be greatly facilitated by following the NIH recommendation of using preprints as evidence for research progress but providing an indication that the preprints are not peer-reviewed publications (5). This would help review panels in rendering their decisions and help authors substantiate their preliminary data.
A third concern is what role preprints should have in assessing a scientist’s productivity. Clearly, use of publication metrics as an indicator of a scientist’s productivity and impact is a contentious topic without even discussing the role of preprints. Regardless, given the propensity for researchers to list papers as being “in preparation” or “in review” on an application or curriculum vitae, listing them instead as preprints that can be reviewed by a committee would significantly enhance an application and a reviewer’s ability to judge the application. In fact, several funding agencies, including the NIH, Wellcome Trust, and UK Medical Research Council, encourage fellowship applicants to include preprints in their materials (5).
Beyond these concerns, preprints are also causing some to change their publication goals. Some authors are explicitly stating that a preprint will not be submitted to a journal (24). Although these authors may be a minority of those who post preprints, such an approach may be attractive to those who need to cite a report of a brief research communication, a critique of another publication, or negative results. It is clear that the adoption of preprints will challenge how scientists interact and evaluate each other’s work. There is great potential to empower researchers by controlling when a citable piece of work is made public.
MICROBIOLOGY ANECDOTES
The peer review editorial process can be lengthy and adversarial. Because preprints are public and freely available, they represent a rapid and potentially collaborative method for disseminating research. Several anecdotes from the microbiology literature are emblematic of benefits of the rapid release cycle that is inherent in the use of preprints.
First, preprints have proven useful for rapidly disseminating results for disease outbreaks and new technologies. Prior to the recent Zika virus outbreak, there were approximately 50 peer-reviewed publications that touched on the biology and epidemiology of the virus; as of April 2017, there were more than 2,300 Zika virus-related peer-reviewed publications. During the recent outbreak, more than 150 Zika virus-related preprints were posted at bioRxiv. Any paper that was formally published went through a delay of several months in releasing information to health care workers, the public, and scientists needing to learn new methods to study a previously obscure virus. In contrast, those that posted their work as a preprint were able to disseminate their methods and results instantly. Another interesting use of preprints to disseminate new information about Zika virus has been the posting of a preprint describing the Zika virus outbreak in the U.S. Virgin Islands that will be continually updated as new data come in and analyses are performed (25). Over the last several years, there have also been rapid advances in DNA sequencing technologies that have fundamentally changed how microbial science is performed. One notable technology, the MinIon sequencing platform from Oxford Nanopore, has received considerable attention from researchers who have posted more than 110 preprints describing new MinIon-based methods and results to preprint servers. For such a rapidly developing technology, the ability to share and consume methods from other scientists has created a feed forward effect where the technology has likely advanced faster than it otherwise would have.
Second, preprints have proven useful for rapidly correcting the scientific literature. On 9 February 2015, Cell Systems published a study that collected and analyzed metagenomic sequence data from the New York City subway system and reported finding Yersinia pestis and Bacillus anthracis (26). Because of the focus on these two bioterrorism agents, this study generated a considerable amount of media attention. On 25 April 2015, Petit et al. (27) posted a preprint to Zenodo demonstrating that there was no evidence for B. anthracis in the data set. On 29 July 2015, a critique was published by Cell Systems along with a response from the original authors offering a correction to their article (28, 29). A second anecdote of using preprints to aid in postpublication peer review surrounds the publishing of a draft tardigrade genome in the Proceedings of the National Academy of Sciences. On 23 November 2015, a study by Boothby et al. (30) was published online. The authors claimed that 17.5% of its genes came from bacteria, archaea, fungi, plants, and viruses. Another group had been analyzing sequence data from a parallel tardigrade genome sequencing project and did not observe the same result. A week later, on 1 December 2015, the second group posted a preprint comparing the two genome sequences and demonstrating that the exciting claims of horizontal gene transfer were really the product of contaminants (31); this analysis would eventually be peer-reviewed and published online by the original journal on 24 March 2016, followed by a rebuttal by the original authors on 31 May 2016 (32, 33). Two other analyses of the original data were peer-reviewed and published in May 2016, and a third was posted as a preprint on 2 February 2016 (34–36). Both of these anecdotes underscore the value of having a rapid posting cycle to correcting errors in the scientific literature and that results posted to preprint servers were able to correct the record within weeks of the initial publication, while the traditional peer review path took 6 months in both cases. A final notable case where preprints have accelerated the correction of the scientific record was a preprint posted by Bik et al. (37) reporting numerous cases of image manipulation in peer-reviewed studies. This was a case where a journal may have been reluctant to publish the findings because it could have put the journal in a bad light. Posting the paper as a preprint removed potential conflicts of interests from journals that could have hindered its ability to be formally published in a journal. After the preprint was posted on 20 April 2016, it was peer-reviewed and published in mBio on 7 June 2016 (38). Instead of using preprints to react to published papers that have been through peer review, it would be interesting to consider how the editorial process for these examples and the infamous “arsenic life” paper (39) would have been different had they initially been posted as preprints.
METRICS FOR MICROBIOLOGY-AFFILIATED PREPRINTS
To analyze the use of preprints, I downloaded bioRxiv preprints on 17 April 2017. I chose to analyze bioRxiv preprints because these preprints are amenable for submission to ASM journals and there were 9,780 bioRxiv preprints compared to the 2,911 preprints that were available at PeerJ PrePrints on the same date. The code used to analyze these preprints and the rest of this article are available as a reproducible GitHub repository at http://www.github.com/SchlossLab/Schloss_PrePrints_mBio_2017. Among the 9,780 preprints on bioRxiv, 483 were assigned by the authors into the Microbiology category. One limitation of the bioRxiv interface is the inability to assign papers to multiple categories or to tag the content of the preprint. For example, this paper could be assigned to either the Microbiology or Scientific Communication and Education category. To counter this limitation, I developed a more permissive approach that classified preprints as being microbiology affiliated if their title or abstract had words containing “yeast,” “fung,” “viral,” “virus,” “archaea,” “bacteri,” “microb,” “microorganism,” “pathogen,” or “protist”. I identified 1,617 additional papers that I considered microbiology affiliated. These microbiology-affiliated preprints were primarily assigned to the Evolutionary Biology (n = 283), Bioinformatics (n = 237), or Genomics (n = 231) category.
As the total number of preprints has grown exponentially since the creation of bioRxiv, submission of microbiology-affiliated preprints has largely followed this growth (Fig. 1A). Although preprints are still relatively new, the collection of microbiology-affiliated preprints indicates widespread experimentation with the format and considerable geographic diversity. Reflecting the relative novelty of preprints, 1,484 (85.5%) corresponding authors who submitted a microbiology-affiliated preprint (n = 1,735 total) have posted a single preprint and 4.6% have posted three or more preprints. Corresponding authors that have posted microbiology-affiliated preprints are from 67 countries and are primarily affiliated with institutions in the United States (46.2% of microbiology-affiliated preprints), United Kingdom (12.9%), and Germany (4.6%). As the preprint format matures, it will be interesting to see whether the proportion of authors that post multiple preprints increases and whether the geographic diversity among those authors is maintained.
As stated above, preprints offer researchers the opportunity to improve the quality of their work by adding a more formal and public step to the scientific process. Among the microbiology-affiliated preprints, 197 (9.3%) had been commented on at least once and only 48 (2.3%) had been commented on more than three times using the bioRxiv-hosted commenting feature. Although the hosted commenting is only one mechanism for peer review, this result was somewhat disturbing since the preprint model implicitly depends on people’s willingness to offer others feedback. In spite of the lack of tradition within the scientific community to comment publicly online about colleagues’ research results, I am optimistic that this will change given the possibilities of new media (e.g., Twitter, Facebook, blogs); the advantage of the centralized commenting is that it is easier for the authors and others to integrate the feedback with the preprint. It is possible that incentives for open commenting and reviewing could shift the trend. Importantly, authors do appear to be incorporating feedback from colleagues or editorial insights from journals, as 545 (25.9%) microbiology-affiliated preprints were revised at least once. Among the preprints posted prior to 1 January 2016, 31.3% of the Microbiology category preprints, 35.6% of the microbiology-affiliated preprints, and 33.6% of all preprints have been published. As noted above, not all authors submit their preprints to journals. This would indicate that the “acceptance rates” are actually higher. Regardless, considering that these acceptance rates are comparable to many peer-reviewed journals (e.g., approximately 40% at ASM Journals), these results dispel the critique that preprints represent overly preliminary research.
Measuring the impact and significance of scientific research is notoriously difficult. Using several metrics, I sought to quantify the effect that broadly defined microbiology-affiliated preprints have had on the work of others. Using the download statistics associated with each preprint, I found that the median number of times an abstract or PDF had been accessed was 922 (interquartile range [IQR], 601 to 1,446) and 301 (IQR, 155 to 549), respectively. These values represent two aspects of posting a preprint. First, they reflect the number of times people were able to access science before it was published. Second, they reflect the number of times people were able to access a version of a paper that is published behind a paywall. To obtain a measure of a preprint’s ability to garner attention and engage the general public, I obtained the Altmetric attention score for each preprint (Fig. 1B). The Altmetric attention score measures the number of times a preprint or paper is mentioned in social media, traditional media, Wikipedia, policy documents, and other sources; it does not include the number of citations (40). A higher score indicates that a preprint received more attention. Microbiology-affiliated preprints have had a median Altmetric attention score of 7.6 (IQR, 3.2 to 16.6), and all preprints hosted at bioRxiv have had a median score of 7.7 (IQR, 3.1 to 16.2). For comparison, the median Altmetric attention score for articles published in mBio published since 2013 was 5.0 (IQR, 1.5 to 14.5). Of all scholarship tracked by Altmetric, the median Altmetric attention score for preprints posted at bioRxiv ranks at the 87 percentile (IQR, 75 to 94). A controversial, yet more traditional metric of impact has been the number of citations an article receives. I obtained the number of citations for the published versions of papers that were initially posted as preprints. To allow for a comparison to traditional journals, I considered the citations for preprints published in 2014 and 2015 as aggregated by the Web of Science (Fig. 1C). Among the preprints that were published and could be found in the Web of Science database, the median number of citations was 9 (IQR, 3 to 19; mean, 17.1). For comparison, among the papers published in mBio in 2014 and 2015, the median number of citations was 6 (IQR, 3 to 11; mean, 8.5). Although it is impossible to quantify the quality or impact of research with individual metrics, it is clear that the science presented in preprints and the publications that result from them are accepted by the microbiology community at a level comparable to more traditionally presented research.
PREPRINTS FROM AN AUTHOR’S PERSPECTIVE
Posting research as a preprint gives an author great control over when their work is made public. Under the traditional peer review model, an author may need to submit and revise their work multiple times to several journals over a long period before it is finally published. In contrast, an author can post the preprint at the start of the process for others to consume and comment on as it works its way through the peer review process. A first example illustrates the utility of preprints for improving access to research and the quality of its reporting. In 2014, my research group posted a preprint to PeerJ PrePrints describing a method of sequencing 16S rRNA gene sequences using the Pacific Biosciences sequencing platform (41). At the same time, we submitted the paper for review at PeerJ. While the article was under review, we received feedback from an academic scientist and from scientists at Pacific Biosciences that the impact of the results could be enhanced by using a recently released version of the sequencing chemistry. Instead of ignoring this feedback and resubmitting the paper to address the reviews, we generated new data and submitted an updated preprint a year later with a simultaneous submission to PeerJ that incorporated the original reviews and the feedback we received from the academic scientist and Pacific Biosciences. It was eventually published by PeerJ (42, 43). Since 2015, we have continued to post papers as preprints at the same time as we have submitted papers. Although the feedback to other articles has not always been as helpful as our initial experience, in each case we were able to publicize our results prior to lengthy peer review processes by immediately making our results available; in one case, our preprint was available 7 months ahead of the final published version (44, 45). As another example, I posted a preprint of the current paper to bioRxiv on 22 February 2017. I then solicited feedback on the article using social media. On 14 March 2017, I incorporated the comments and posted a revised preprint and submitted the article to mBio. During that time, the abstract was accessed 189 times, and the PDF was accessed 107 times. This process engaged 3 commentators on bioRxiv, 61 people either tweeted or retweeted the preprint on Twitter, 2 people on the paper’s GitHub repository, 1 person on a blog, and 2 via e-mail. Compared to the two scientists that eventually reviewed the article, the preliminary round of informal peer review engaged a much larger and more diverse community than had I foregone the posting of a preprint. By the time that the final version of the paper was submitted on 21 April 2017, the preprint version of this article had an Almetric attention score of 58, which placed it in the top 5% of all research scored by Altmetric and the abstract and PDF had been accessed 2,152 and 512 times, respectively. Although there are concerns regarding the quality of the science posted to a preprint server, I contend that responsible use of preprints as a part of the scientific process can significantly enhance the science.
PREPRINTS FROM A PUBLISHER’S PERSPECTIVE
A lingering question is what role traditional journals will have in disseminating research if there is broad adoption of preprints. Edited peer-reviewed journals offer and will continue to offer significant added value to a publication. A scholarly publishing ecosystem in which preprints coexist with journals will allow authors to gain value from the immediate communication of their work associated with preprints and also benefit from the peer-reviewed, professionally edited publication that publishers can provide.
The professional copyediting, layout, and publicity that these publishers offer are also unique features of traditional journals. An alternative perspective is that preprints will eventually replace traditional journals. Certainly, this is a radical perspective, but it does serve to motivate publishers to capture the innovation opportunities offered by preprints. By adopting preprint-friendly policies, journals can create an attractive environment for authors. As discussed above, a growing number of journals have created mechanisms for authors to directly submit preprints to their journals. An example is offered by the ASM, which earlier this year launched a new venture from mSphere. mSphereDirect is a publication track of the journal that capitalizes on the opportunity offered to couple preprints with rigorous peer review. mSphereDirect actively encourages authors to post their papers as preprints as part of an author-driven editorial process where an editorial decision is rendered within 5 days and publication in mSphere within a month (46). As the mSphereDirect mechanism evolves and is perhaps adopted by other journals, it will be interesting to see whether public feedback on preprints will be used to further streamline the editorial process. ASM is developing a new platform, MicroNow, which will help coalesce specific communities within the microbial sciences, further enhancing the use of preprints and published articles (Stefano Bertuzzi, personal communication). In addition to integrating preprints into the traditional editorial process, several professional societies have also explicitly supported citation of preprints in their other publications and recognize the priority of preprints in the literature (20–22). These policies empower authors and make specific journals more attractive. Other practices have great potential to improve the reputation of journals. As measured above, preprints are able to garner attention on par with papers published in highly selective microbiology journals. Thus, it is in a journal’s best interest to recruit these preprints to their journals. Several journals, including PLoS Genetics and Genome Biology, have publicly stated that they scout preprints for this purpose (47, 48). Preprints can also be viewed as a lost opportunity to journals. A preprint that garners significant attention may be ignored when it is finally published, bringing little additional attention to the journal. Going forward, there will likely be many innovative approaches that publishers develop to benefit from incorporating preprints into their process, and whether publishers’ influence is reduced by the widespread adoption of preprints remains to be seen.
CONCLUSIONS
Since the first microbiology-affiliated preprint was posted on bioRxiv in November 2013 (49), an increasing number of microbiologists are posting their unpublished work to preprint severs as an efficient method for disseminating their research prior to peer review. A number of critical concerns remain about how widespread their adoption will be, how they will be perceived by traditional journals and other scientists, and whether traditional peer review will adapt to the new scientific trends and technologies. Regardless, preprints should offer a great opportunity for both scientists and journals to publish high-quality science.
ACKNOWLEDGMENTS
P.D.S. is an editor for Applied and Environmental Microbiology, which is another ASM Journal, and will be the Chair of the ASM Journals Board starting July 1, 2017.
I am grateful to Stefano Bertuzzi and Lynn Enquist for their helpful comments on earlier versions of this article and to the numerous individuals who provided feedback on the preprint version of this article.
This work was supported in part by funding from the National Institutes of Health (P30DK034933). I appreciate the support of Altmetric, Inc., and Thompson Reuters who provided advanced programming interface (API) access to their databases. The workflow utilized commands in GNU make (v.3.81), GNU bash (v.4.1.2), and R (v.3.3.3). Within R, I utilized the cowplot (v.0.7.0), dplyr (v.0.5.0), ggplot2 (v.2.2.1), httr (v.1.2.1), RCurl (v.1.95-4.8), rentrez (v.1.0.4), RJSONIO (v.1.3-0), rvest (v.0.3.2), sportcolors (v.0.0.1), and tidyr (v.0.6.1) packages.
All journal policies and the information cited using webpage links were current on 21 April 2017.
The views expressed in this Perspective do not necessarily reflect the views of this journal or of ASM.
Footnotes
Citation Schloss PD. 2017. Preprinting microbiology. mBio 8:e00438-17. https://doi.org/10.1128/mBio.00438-17.
REFERENCES
- 1.Vale RD. 2015. Accelerating scientific publication in biology. Proc Natl Acad Sci USA 112:13439–13446. doi: 10.1073/pnas.1511912112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Desjardins-Proulx P, White EP, Adamson JJ, Ram K, Poisot T, Gravel D. 2013. The case for open preprints in biology. PLoS Biol 11:e1001563. doi: 10.1371/journal.pbio.1001563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Berg JM, Bhalla N, Bourne PE, Chalfie M, Drubin DG, Fraser JS, Greider CW, Hendricks M, Jones C, Kiley R, King S, Kirschner MW, Krumholz HM, Lehmann R, Leptin M, Pulverer B, Rosenzweig B, Spiro JE, Stebbins M, Strasser C, Swaminathan S, Turner P, Vale RD, VijayRaghavan K, Wolberger C. 2016. Preprints for the life sciences. Science 352:899–901. doi: 10.1126/science.aaf9133. [DOI] [PubMed] [Google Scholar]
- 4.Bhalla N. 2016. Has the time come for preprints in biology? Mol Biol Cell 27:1185–1187. doi: 10.1091/mbc.E16-02-0123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.National Institutes of Health 2017. Reporting preprints and other interim research products. Notice no. NOT-OD-17-050 National Institutes of Health, Bethesda, MD: https://grants.nih.gov/grants/guide/notice-files/NOT-OD-17-050.html. [Google Scholar]
- 6.Nature Publishing Group 1966. Preprints galore. Nature 211:897–898. doi: 10.1038/211897a0. [DOI] [PubMed] [Google Scholar]
- 7.Till JE. 2001. Predecessors of preprint servers. Learn Publ 14:7–13. doi: 10.1087/09531510125100214. [DOI] [Google Scholar]
- 8.Ginsparg P. 2011. ArXiv at 20. Nature 476:145–147. doi: 10.1038/476145a. [DOI] [PubMed] [Google Scholar]
- 9.Shuai X, Pepe A, Bollen J. 2012. How the scientific community reacts to newly submitted preprints: article downloads, twitter mentions, and citations. PLoS One 7:e47523. doi: 10.1371/journal.pone.0047523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Callaway E. 2013. Biomedical journal and publisher hope to bring preprints to life. Nat Med 19:512. doi: 10.1038/nm0513-512. [DOI] [PubMed] [Google Scholar]
- 11.American Society for Microbiology 2016. mBio instructions to authors. American Society for Microbiology, Washington, DC: http://mbio.asm.org/site/misc/journal-ita_edi.xhtml. [Google Scholar]
- 12.ASAPbio 23 February 2017. ASAPbio awarded $1 million from Helmsley Charitable Trust for next-generation life sciences preprint infrastructure. ASAPbio, San Francisco, CA: http://asapbio.org/feb23. [Google Scholar]
- 13.Angell M, Kassirer JP. 1991. The Ingelfinger Rule revisited. N Engl J Med 325:1371–1373. doi: 10.1056/NEJM199111073251910. [DOI] [PubMed] [Google Scholar]
- 14.International Committee of Medical Journal Editors 2017. Recommendations for the conduct, reporting, editing, and publication of scholarly work in medical journals. International Committee of Medical Journal Editors http://www.icmje.org/news-and-editorials/icmje-recommendations_annotated_dec16.pdf. [PubMed]
- 15.Wikipedia contributors 21 April 2017. List of academic journals by preprint policy, on Wikipedia, The Free Encyclopedia. https://en.wikipedia.org/wiki/List_of_academic_journals_by_preprint_policy. Accessed 21 April 2017.
- 16.SHERPA/RoMEO 21 April 2017. Publisher copyright policies and self-archiving. SHERPA/RoMEO, London, United Kingdom: http://www.sherpa.ac.uk/romeo/index.php. [Google Scholar]
- 17.Drubin DG. 2016. The mismeasure of scientific research articles and why MBoC quickly embraced preprints. Mol Biol Cell 27:3181–3182. doi: 10.1091/mbc.E16-09-0651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Spier R. 2002. The history of the peer-review process. Trends Biotechnol 20:357–358. doi: 10.1016/s0167-7799(02)01985-6. [DOI] [PubMed] [Google Scholar]
- 19.Burnham JC. 1990. The evolution of editorial peer review. JAMA 263:1323–1329. doi: 10.1001/jama.1990.03440100023003. [DOI] [PubMed] [Google Scholar]
- 20.Pulverer B. 2016. Preparing for preprints. EMBO J 35:2617–2619. doi: 10.15252/embj.201670030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Loew LM. 2016. Peer review and bioRxiv. Biophys J 111:E01–E02. doi: 10.1016/j.bpj.2016.06.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Vale RD, Hyman AA. 2016. Priority of discovery in the life sciences. eLife 5:e16931. doi: 10.7554/eLife.16931. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Federation of American Societies for Experimental Biology 6 December 2016. Interim research product RFI response from FASEB. Federation of American Societies for Experimental Biology, Bethesda, MD: http://www.faseb.org/Portals/2/PDFs/opa/2016/Interim%20Research%20Product%20RFI.pdf. [Google Scholar]
- 24.Chawla DS. 20 January 2017. When a preprint becomes the final paper. Nature http://www.nature.com/news/when-a-preprint-becomes-the-final-paper-1.21333.
- 25.Black A, Potter B, Dudas G, Feldstein L, Grubaugh ND, Andersen KG, Ellis BR, Ellis EM, Bedford T. 2017. Genetic characterization of the Zika virus epidemic in the US Virgin Islands. bioRxiv doi: 10.1101/113100. [DOI] [Google Scholar]
- 26.Afshinnekoo E, Meydan C, Chowdhury S, Jaroudi D, Boyer C, Bernstein N, Maritz JM, Reeves D, Gandara J, Chhangawala S, Ahsanuddin S, Simmons A, Nessel T, Sundaresh B, Pereira E, Jorgensen E, Kolokotronis S-O, Kirchberger N, Garcia I, Gandara D, Dhanraj S, Nawrin T, Saletore Y, Alexander N, Vijay P, Hénaff EM, Zumbo P, Walsh M, O’Mullan GD, Tighe S, Dudley JT, Dunaif A, Ennis S, O’Halloran E, Magalhaes TR, Boone B, Jones AL, Muth TR, Paolantonio KS, Alter E, Schadt EE, Garbarino J, Prill RJ, Carlton JM, Levy S, Mason CE. 2015. Geospatial resolution of human and bacterial diversity with city-scale metagenomics. Cell Syst 1:72–87. doi: 10.1016/j.cels.2015.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Petit RA III, Ezewudo M, Joseph SJ, Read TD. 2015. Searching for anthrax in the New York City Subway metagenome. Zenodo doi: 10.5281/zenodo.17158. [DOI] [Google Scholar]
- 28.Ackelsberg J, Rakeman J, Hughes S, Petersen J, Mead P, Schriefer M, Kingry L, Hoffmaster A, Gee JE. 2015. Lack of evidence for plague or anthrax on the New York city subway. Cell Syst 1:4–5. doi: 10.1016/j.cels.2015.07.008. [DOI] [PubMed] [Google Scholar]
- 29.Afshinnekoo E, Meydan C, Chowdhury S, Jaroudi D, Boyer C, Bernstein N, Maritz JM, Reeves D, Gandara J, Chhangawala S, Ahsanuddin S, Simmons A, Nessel T, Sundaresh B, Pereira E, Jorgensen E, Kolokotronis S-O, Kirchberger N, Garcia I, Gandara D, Dhanraj S, Nawrin T, Saletore Y, Alexander N, Vijay P, Hénaff EM, Zumbo P, Walsh M, O’Mullan GD, Tighe S, Dudley JT, Dunaif A, Ennis S, O’Halloran E, Magalhaes TR, Boone B, Jones AL, Muth TR, Paolantonio KS, Alter E, Schadt EE, Garbarino J, Prill RJ, Carlton JM, Levy S, Mason CE. 2015. Modern methods for delineating metagenomic complexity. Cell Syst 1:6–7. doi: 10.1016/j.cels.2015.07.007. [DOI] [PubMed] [Google Scholar]
- 30.Boothby TC, Tenlen JR, Smith FW, Wang JR, Patanella KA, Nishimura EO, Tintori SC, Li Q, Jones CD, Yandell M, Messina DN, Glasscock J, Goldstein B. 2015. Evidence for extensive horizontal gene transfer from the draft genome of a tardigrade. Proc Natl Acad Sci USA 112:15976–15981. doi: 10.1073/pnas.1510461112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Koutsovoulos G, Kumar S, Laetsch DR, Stevens L, Daub J, Conlon C, Maroon H, Thomas F, Aboobaker AA, Blaxter M. 2016. No evidence for extensive horizontal gene transfer in the genome of the tardigrade Hypsibius dujardini. bioRxiv doi: 10.1101/033464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Boothby TC, Goldstein B. 2016. Reply to Bemm et al. and Arakawa: identifying foreign genes in independent Hypsibius dujardini genome assemblies. Proc Natl Acad Sci USA 113:E3058–E3061. doi: 10.1073/pnas.1601149113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Koutsovoulos G, Kumar S, Laetsch DR, Stevens L, Daub J, Conlon C, Maroon H, Thomas F, Aboobaker AA, Blaxter M. 2016. No evidence for extensive horizontal gene transfer in the genome of the tardigrada Hypsibius dujardini. Proc Natl Acad Sci USA 113:5053–5058. doi: 10.1073/pnas.1600338113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bemm F, Weiss CL, Schultz J, Förster F. 2016. Genome of a tardigrade: horizontal gene transfer or bacterial contamination? Proc Natl Acad Sci 113:E3054–E3056. doi: 10.1073/pnas.1525116113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Arakawa K. 2016. No evidence for extensive horizontal gene transfer from the draft genome of a tardigrade. Proc Natl Acad Sci USA 113:E3057. doi: 10.1073/pnas.1602711113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Delmont TO, Eren AM. 2016. Identifying contamination with advanced visualization and analysis practices: metagenomic approaches for eukaryotic genome assemblies. PeerJ 4:e1839. doi: 10.7717/peerj.1839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Bik EM, Casadevall A, Fang FC. 2016. The prevalence of inappropriate image duplication in biomedical research publications. bioRxiv doi: 10.1101/049452. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Bik EM, Casadevall A, Fang FC. 2016. The prevalence of inappropriate image duplication in biomedical research publications. mBio 7:e00809-16. doi: 10.1128/mBio.00809-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wolfe-Simon F, Switzer Blum J, Kulp TR, Gordon GW, Hoeft SE, Pett-Ridge J, Stolz JF, Webb SM, Weber PK, Davies PC, Anbar AD, Oremland RS. 2011. A bacterium that can grow by using arsenic instead of phosphorus. Science 332:1163–1166. doi: 10.1126/science.1197258 [DOI] [PubMed] [Google Scholar]
- 40.Altmetric, Inc 24 November 2016. How is the Altmetric Attention Score calculated? Almetric, Inc, London, United Kingdom: https://help.altmetric.com/support/solutions/articles/6000060969-how-is-the-altmetric-attention-score-calculated-. [Google Scholar]
- 41.Schloss PD, Westcott SL, Jenior ML, Highlander SK. 2015. Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system. PeerJ PrePrints 3:e778v1. doi: 10.7287/peerj.preprints.778v1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Schloss PD, Jenior ML, Koumpouras CC, Westcott SL, Highlander SK. 2016. Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system. PeerJ PrePrints 4:e778v2. doi: 10.7287/peerj.preprints.778v2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Schloss PD, Jenior ML, Koumpouras CC, Westcott SL, Highlander SK. 2016. Sequencing 16S rRNA gene fragments using the PacBio SMRT DNA sequencing system. PeerJ 4:e1869. doi: 10.7717/peerj.1869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Baxter NT, Koumpouras CC, Rogers MA, Ruffin MT, Schloss P. 2016. DNA from fecal immunochemical test can replace stool for microbiota-based colorectal cancer screening. bioRxiv doi: 10.1101/048389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Baxter NT, Koumpouras CC, Rogers MAM, Ruffin MT, Schloss PD. 2016. DNA from fecal immunochemical test can replace stool for detection of colonic lesions using a microbiota-based model. Microbiome 4:59. doi: 10.1186/s40168-016-0205-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Imperiale MJ, Shenk T, Bertuzzi S. 2016. mSphereDirect: author-initiated peer review of manuscripts. mSphere 1:e00307-16. doi: 10.1128/msphere.00307-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Vence T. 18 January 2017. Journals seek out preprints. TheScientist http://www.the-scientist.com/?articles.view/articleNo/48068/title/Journals-Seek-Out-Preprints/.
- 48.Barsh GS, Bergman CM, Brown CD, Singh ND, Copenhaver GP. 2016. Bringing PLOS Genetics editors to preprint servers. PLoS Genet 12:e1006448. doi: 10.1371/journal.pgen.1006448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Hockett KL, Nishimura MT, Karlsrud E, Dougherty K, Baltrus DA. 2013. Interactions between genome architecture and virulence genes in Pseudomonas syringae, strain CC1557 as a model. bioRxiv doi: 10.1101/000869. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.