Version Changes
Revised. Amendments from Version 1
The revised version includes discussions on data ownership and the permanence of open lab notebooks. In particular, we have now implemented a robust archival system in GitHub and https://archive.org for data entered at https://openlabnotebooks.org. As suggested, we also provide a brief history of the open lab notebook community. Finally, we discuss the risk that dubious experiments find their way on open platforms, get amplified over the internet and mislead scientists, patient groups or other communities.
Abstract
The fundamental goal of the growing open science movement is to increase the efficiency of the global scientific community and accelerate progress and discoveries for the common good. Central to this principle is the rapid disclosure of research outputs in open-access peer-reviewed journals and on pre-print servers. The next bold step in this direction is open laboratory notebooks, where research scientists share their research — including detailed protocols, negative and positive results — online and in near-real-time to synergize with their peers. Here, we highlight the benefits of open lab notebooks to science, society and scientists, and discuss the challenges that this nascent movement is facing. We also present the implementation and progress of our own initiative at openlabnotebooks.org, with more than 20 active contributors after one year of operation.
Keywords: open lab notebooks, open science, peer-review, preprints, publishing, science communication
Introduction
The function of the scientific peer-reviewed system is to provide greater confidence that published research is scientifically sound. This system is widely accepted as the best available, although imperfect (as peer reviewers may miss technical flaws or be biased) 1, to guide the global scientific community towards progress. Peer-reviewed publishing is also used by research scientists, funders and institutions as a mechanism to claim ownership of their discoveries. As a result, the community widely believes that findings should be kept secret until they are published in a peer-reviewed journal. This tradition of secrecy, which protects the scientist as opposed to the science, has been transmitted from mentor to trainee for centuries (Galileo kept his discoveries to himself until they were published). In the life sciences, this belief can reach near-mystical levels 2, and can be compounded by constraints associated with patent protection procedures or the absence of clear mechanism to make one’s data publicly available. The peer-review and publication process grew in an era where communication was largely in paper format. Today, in the age of instant communication, one would imagine there should be more efficient ways to operate.
Open lab notebooks: good for science and society
We believe that open laboratory notebooks, where research scientists record their work online and in near-real time, are an efficient way to disseminate data before it is published in peer-reviewed journals, and has several advantages over the traditional “release after publication” system 3. First, making the data accessible within weeks rather than keeping it hidden for years means that others will be able to build upon the research, and avoid spending time and resources on redundant experiments 4. Second, open lab notebooks should include detailed protocols that can be reproduced, which is often not the case in peer-reviewed publications 5, 6. Third, negative data, which are almost never disclosed in the current publishing system but are provided in open lab notebooks, can sometime provide important insight 7, 8. Fourth, open lab notebooks offer a space for anyone to comment on experimental records. This allows experts to provide insight, but also to flag technically unsound experiments, thereby reducing the potential for flawed science to appear in peer-reviewed journals and in pre-print media. Open lab notebooks can therefore help save time, resources, and knowledge. If adopted by many, they should lead to a more synergistic way to do science and to more efficient use of public funds.
Good for scientists
Many believe that the chances of getting scooped before one publishes their work in a peer-reviewed journal increase when openly sharing their work online 9. We argue that open lab notebooks have compensating advantages that are good for scientists. To succeed in academia, one must get funding, assert primacy over discoveries, be known in a field of research and be able to present work and ideas clearly and convincingly. Open lab notebooks can help in all aspects.
First, funding agencies are seeing the open science movement as a long lasting and far-reaching shift for the best, and are increasingly supportive of efforts to embrace open science principles. For instance, the symposium set to launch openlabnotebooks.org was entirely sponsored by the Wellcome Trust and the Canadian Institute of Health Research, and senior representatives from the Gates Foundation and the Chan-Zuckerberg Initiative were also in attendance ( https://www.thesgc.org/open-lab-notebooks-2018). The NIH’s National Institute on Aging dedicated an entire session to open science at their 2018 Alzheimer’s research summit ( https://www.nia.nih.gov/research/nih-ad-summit-2018-program-agenda), as did the 2018 Enroll-HD congress of the CHDI Huntington’s Disease Foundation ( https://www.enroll-hd.org/enroll-hd-congress-2018/). The Wellcome Trust has recently launched the Wellcome Open Research publishing platform and Open Research Fund. Our personal observations seem to indicate that grant applications highlighting the use of open lab notebooks are being viewed positively. For example, Huntington’s disease (HD) research funders such as the CHDI Foundation, the Huntington Society of Canada and the Huntington Society of America, have all generously funded studies of HD biochemistry at the SGC Toronto.
Second, results in open lab notebook are date-stamped, thus claiming temporal priority of the data. Indeed, public repositories such as Zenodo 10 add a date-stamp to depositions, and assign a citable DOI to open lab notebook records (detailed below): once a record has been published, it can no longer be modified, but revised versions can be appended if necessary.
Third, early career scientists can use their open notebooks to connect with their peers and with experts in the field, start new collaborations and build their own network. Fourth, the use of open lab notebooks provides opportunity to present work clearly and concisely to both experts and non-experts. This is an important skill to master in order to write convincing grant applications. Fifth, junior scientists will also find their open lab notebook a good medium to showcase their technical skills and scientific insight, and may find it useful to add a link in their resume when applying for their next position. Finally, many will find a personal satisfaction in embracing open science and FAIR data principles 11.
Implementation of an open lab notebook platform
Open lab notebooks have been pioneered and championed by a number of practitioners but remain a niche activity in the scientific community. Jean-Claude Bradley first coined the term “open notebook science” in 2006 and his definition of this method of scholarly communication have laid the foundations for our own efforts 12. In addition to the notebooks of individual researchers following Bradley’s template, open notebook examples now include collective efforts from the Open Lab Notebook Network ( http://onsnetwork.org) and Open Source Malaria ( http://opensourcemalaria.org). However, the open lab notebook community remains small, the practice is not consistently defined or implemented and the impact of these efforts in the field have not been systematically evaluated.
Following our prediction that open lab notebooks should be good for science and good for scientists, and after a 2-year pilot study where Rachel Harding, a post-doctoral fellow at the Structural Genomics Consortium (SGC) shared her work on Huntington’s disease at labscribbles.com 13, we launched openlabnotebooks.org in January 2018, where 12 scientists from the SGC started reporting their work live, online 14, 15. Each post is composed of two documents. (1) A detailed and rigorous experimental record, including all data and protocols, which experts can evaluate, comment on or build upon ( Figure 1); (2) a blog, aimed at the non-specialist that explains in simple terms the motivation and rational for the experiment, summarizes results – positive and negative – and outlines next steps ( Figure 2). The blogs, posted at openlabnotebooks.org, are managed by a webserver downloaded from wordpress.org (the open-source online system LabTrove would be a valid alternative 16), archived weekly to GitHub (repository https://github.com/thesgc/static-openlabnotebooks), quarterly to archive.org ( https://wayback.archive-it.org/6473/*/https:/opennotebook.thesgc.org/), and link to the experimental records, which are deposited at Zenodo (zenodo.org), but can also be made available from other public repositories, such as GitHub (github.com) or Figshare (figshare.com). While the experimental details posted at Zenodo are important scientifically, the blog written in layman’s term can be used to engage with scientists that may have a complementary set of expertise for future collaborations as well as other stakeholders in the research process, including patient groups, a dimension that most in academia are missing.
The Zenodo repository enables sharing research outputs from across all fields of research, creation and curation of complete digital repositories, flexible licensing with controlled degree of openness and safe storage of the data for the future in the same cloud infrastructure as CERN’s own LHC research data. Open laboratory notebooks need to guarantee that the data will remain accessible, in order to avoid the fate suffered by the pioneer open notebook of Jean-Claude Bradley, which is still accessible while its associated raw data wiki is not. Zenodo is strongly committed to preserving the data it archives. CERN has existed since 1954 and has an experimental program defined for the next 20+ years. Each file has two replicas located on different disk servers. In the highly unlikely event that Zenodo closes operations, they guarantee migration of all content to other suitable repositories, and since all uploads have DOIs, citations and links to Zenodo resources (including data) will not be affected.
The ultimate goal of this open lab notebook initiative is not only to increase the impact of our work but also, along with precursors in the field such as Open Source Malaria ( http://opensourcemalaria.org/) and other isolated open lab notebook efforts, to inspire others to follow, and contribute to the creation of a new open science movement in the life sciences. While it is too early to judge the success of this initiative, the number of contributing scientists and institutions is steadily increasing ( Figure 3). While only one scientist was contributing in November 2017, 23 scientists from six institutions (University of Toronto, University of Oxford, University of North Carolina, University of Leicester, the Karolinska Institute in Sweden and University of Montpellier in France) are recording their work at openlabnotebooks.org as of December 2018.
As importantly, impact is also increasing, judging by the average number of views per experimental record calculated from statistical data available at Zenodo.org ( Figure 3). Some reports raised a considerable interest. For instance, the crystal structure of USP5 in complex with small molecule fragments has 821 unique views and 324 unique downloads as of December 2018 17. If the initiative is successful, we anticipate that within three to five years, usage metrics are comparable at openlabnotebooks.org and bioRxiv, the preprint server for biology.
Data posted at openlabnotebooks.org are raising interest in academic groups, but also in the industry. For instance, a notebook contributor was directly contacted by a big pharmaceutical company to further discuss the results that he had shared online, and a big biotech company asked permission to another contributor to include their data in a presentation at a public scientific meeting. Some of the research reported at openlabnotebooks.org is of direct relevance to patient groups. For instance, four scientists record their results on testing chemical inhibitors of the kinase ALK2, a potential therapeutic target for the treatment of the pediatric brain tumor diffuse intrinsic pontine glioma (DIPG), and the heterotopic ossification disorder fibrodysplasia ossificans progressive (FOP) 18, 19. The compounds, developed by the open science biotech company M4KPharma, are still in pre-clinical phase of development but should ultimately lead to clinical trials for these incurable diseases 20. Scientists working on projects with a clear path to the clinic are eager to share their enthusiasm and commitment with patient groups (sometimes using social media to announce their latest open notebook post) who, in turn, follow their work.
The challenges of open lab notebooks
Three antagonizing points that inhibit scientists from starting their own open lab notebook are the fear of being scooped, the inability to report collaborative work when collaborators want to keep data secret, and the concern that an open notebook will take time away from an already overburdened schedule 21. The language barrier for non-native English speakers, and the availability of open lab notebook solutions can also be challenging. It is indeed likely that maintaining an open lab notebook increases the chances of being scooped, but it is too early at this point to know whether this effect is minor or significant. Paradoxically, and given the territorial nature of the current frameworks for funding and managing scientific research, entries in one’s open lab notebook may mark one’s area very effectively, especially in a conceivable future when funding trusts and councils start looking into them. We would argue that most, if not all, scientists get scooped during their career, and that open lab notebooks serve as a safety net for early career scientists who have a citable record of their work if they ever get scooped. Obtaining permission from collaborators to report collaborative work in open lab notebooks can be challenging. We believe that the best way to avoid such a situation is to clearly state at the outset of a collaboration the intention to adopt open science principles 22. Scientists are more likely to agree if presented with the idea well in advance. The time invested in practicing clear, concise and engaging scientific writing is not lost on one’s career. After some practice, maintaining an open lab notebook should not take more time than using a regular lab notebook.
Open notebooks being published before peer-review, there is a risk that dubious experiments, erroneous analysis or misinterpretations find their way on open platforms, get amplified over the internet and mislead colleague scientists, patient groups or other communities. Once they become indexed by popular search engines, open lab notebooks could become a source of pollution of the scientific (and non-scientific) literature. This risk, which is not limited to open notebooks but extends to the increasing number of Journals that adopt a post-publication peer-review mechanism, is real, serious, and should be monitored. We believe that the best way to mitigate this risk is for open notebooks to provide a platform for open comments. In principle, this could be an even stronger quality control than the current peer-review system in place in most scientific journals, as the number of “open reviewers” for any given report is limitless. At the moment, we find that very few comments are posted at openlabnotebooks.org, a platform that is only a year old, but we see that comments are mainstream, and sometimes turn into healthy discussions at Open Source Malaria, a pioneer in the field 23.
Future directions and conclusion
Open lab notebooks represent a major departure from current practices in science (especially biomedical sciences) and hold a mix of promises and risks. As the community producing these lab notebooks is increasing, there is an opportunity to move beyond ideology and anecdotal data to evidence-based policy design. In the spirit of openness, we call on colleagues from both the life science and the social sciences communities to conduct systematic evaluation of the benefits and downsides of open lab notebooks. It will be important to compare several parameters on a yearly basis. These may include the frequency of research being scooped among scientists disclosing their work in open lab notebooks versus a less open reference group; the frequency of new collaborations; the frequency of comments and ideas received by the authors of open notebooks; and instances where open lab notebooks were essential for compliance with funder or institutional requirements. More difficult to assess will be issues such as recognition, career progression, speeding up research, and impact on reproducibility, but they could all be addressed with appropriate questionnaires and data analytics.
Our goal is to see the number of open lab notebooks increase exponentially over the coming years. Future implementation of novel features, such as the ability to search for experiments containing compounds with specific chemical templates, is expected to extend the reach of the platform to medicinal and computational chemists. Indexing of open lab notebooks by popular search engines such as Google Scholar (which already indexes pre-prints and other non-peer-reviewed documents) would increase the visibility and impact of open notebooks. Importantly, open lab notebook data deposited at Zenodo.org is already searchable with Google’s Dataset search engine. To further encourage scientists to break free from the tradition of secrecy that has been passed on for generations, a cultural change needs to be supported at institutional and governmental levels. Funding bodies are starting to define and enforce open science publication practices 24. Similarly, universities could take a more proactive role, for instance by including adhesion to open-access principles as an evaluation criteria for career advancement 25. Indeed, while strong incentives described above already exist for junior scientists to start their own open lab notebook, the benefit to their PIs who already have established a professional network and don’t need to showcase their skills is not always as clear. As long as scientists are not convinced that open science is good for them, Science 2.0 will have to wait.
Data availability
No data are associated with this article.
Acknowledgments
We thank the following principal investigators whose group members are contributing to openlabnotebooks.org: Cheryl Arrowsmith, Dalia Barsyte, Paul Brennan, Alex Bullock, David Drewry, Susanne Gräslund, Brian Marsden, Dave Morris, Panagiotis Prinos, Frank Von Delft, Tim Willson, and Wyatt Yue. Nicholas Worby at University of Toronto Libraries set-up the quarterly archiving of openlabnotebools.org to https://archive.org/.
As of December 2018, the Open Lab Notebook Consortium is made up of Roslin Adamson, Jose Brandao-Neto, Elizabeth J. Brown, Antoine Claessens, David Damerell, David Dilworth, Thomas Durcan, Benjamin J. Eduful, Aled M. Edwards, Opher Gileadi, Jolene Caifeng Ho, Leonidas Koukouflis, Tobias Krojer, Genna M. Luciani, Sabrina Mackinnon, Mandeep Mann, Carolyn Marks, Sean O’Byrne, Alfredo Picado, Pietro Roversi, Louisa Temme, Eleanor Williams, Jong Fu Wong, Wen Yih Aw.
Funding Statement
The Structural Genomics Consortium (SGC) is a registered charity (number 1097737) that receives funds from AbbVie, Bayer Pharma AG, Boehringer Ingelheim, Canada Foundation for Innovation, Eshelman Institute for Innovation, Genome Canada through Ontario Genomics Institute [OGI-055], Innovative Medicines Initiative (EU/EFPIA) [ULTRA-DD grant no. 115766], Janssen, Merck KGaA, Darmstadt, Germany, MSD, Novartis Pharma AG, Ontario Ministry of Research, Innovation and Science (MRIS), Pfizer, São Paulo Research Foundation-FAPESP, Takeda, and The Wellcome Trust [106169/ZZ14/Z]. RJH is a recipient of the HDSA Berman/Topper HD Career Development Fellowship. PR is funded by the Wellcome Trust Leicester ISSF award reference 204801/Z/16/Z and the Leicester Institute of Chemical and Structural Biology (LISCB).
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
[version 2; peer review: 2 approved
References
- 1. Smith R: Peer review: a flawed process at the heart of science and journals. J R Soc Med. 2006;99(4):178–82. 10.1258/jrsm.99.4.178 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Resnik DB: Openness versus Secrecy in Scientific Research Abstract. Episteme (Edinb). 2006;2(3):135–147. 10.3366/epi.2005.2.3.135 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Woelfle M, Olliaro P, Todd MH: Open science is a research accelerator. Nat Chem. 2011;3(10):745–8. 10.1038/nchem.1149 [DOI] [PubMed] [Google Scholar]
- 4. Powell K: Does it take too long to publish research? Nature. 2016;530(7589):148–51. 10.1038/530148a [DOI] [PubMed] [Google Scholar]
- 5. Wallach JD, Boyack KW, Ioannidis JPA: Reproducible research practices, transparency, and open access data in the biomedical literature, 2015–2017. PLoS Biol. 2018;16(11):e2006930. 10.1371/journal.pbio.2006930 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Reality check on reproducibility. Nature. 2016;533(7604):437. 10.1038/533437a [DOI] [PubMed] [Google Scholar]
- 7. Mlinari A, Horvat M, Šupak Smolčić V: Dealing with the positive publication bias: Why you should really publish your negative results. Biochem Med (Zagreb). 2017;27(3):030201. 10.11613/BM.2017.030201 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Carroll HA, Toumpakari Z, Johnson L, et al. : The perceived feasibility of methods to reduce publication bias. PLoS One. 2017;12(10):e0186472. 10.1371/journal.pone.0186472 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Grubb AM, Easterbrook SM: On the lack of consensus over the meaning of openness: an empirical study. PLoS One. 2011;6(8):e23420. 10.1371/journal.pone.0023420 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Nielsen LH: Sharing your data and software on Zenodo.2017. [Google Scholar]
- 11. Wilkinson MD, Dumontier M, Aalbersberg IJ, et al. : The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016;3:160018. 10.1038/sdata.2016.18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Drexel CoAS E-Learning: Open Notebook Science. Reference Source [Google Scholar]
- 13. Harding RJ: Open notebook science can maximize impact for rare disease projects. PLoS Biol. 2019;17(1):e3000120. 10.1371/journal.pbio.3000120 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Open notebooks galore: The Structural Genomics Consortium. [[Accessed: 24-Dec-2018] ]; eLife. 2018 Reference Source [Google Scholar]
- 15. Schapira M: Open Lab Notebooks to increase impact and accelerate discovery. Research Data at Springer Nature. [Accessed: 24-Dec-2018] 2018. Reference Source [Google Scholar]
- 16. Badiola KA, Bird C, Brocklesby WS, et al. : Experiences with a researcher-centric ELN. Chem Sci. 2015;6(3):1614–1629. 10.1039/c4sc02128b [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Mann M, Harding R, Ravichandran M, et al. : Co-crystal structures of USP5 Zf-UBD and weak binding compounds. Zenodo. 2018. 10.5281/zenodo.1313723 [DOI] [Google Scholar]
- 18. van Dinther M, Visser N, de Gorter DJ, et al. : ALK2 R206H mutation linked to fibrodysplasia ossificans progressiva confers constitutive activity to the BMP type I receptor and sensitizes mesenchymal cells to BMP-induced osteoblast differentiation and bone formation. J Bone Miner Res. 2010;25(6):1208–1215. 10.1359/jbmr.091110 [DOI] [PubMed] [Google Scholar]
- 19. Taylor KR, Vinci M, Bullock AN, et al. : ACVR1 Mutations in DIPG: lessons learned from FOP. Cancer Res. 2014;74(17):4565–4570. 10.1158/0008-5472.CAN-14-1298 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Morgan MR, Roberts OG, Edwards AM: Ideation and implementation of an open science drug discovery business model – M4K Pharma. Wellcome Open Res. 2018;3:154 10.12688/wellcomeopenres.14947.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Robertson MN, Ylioja PM, Williamson AE, et al. : Open source drug discovery - a limited tutorial. Parasitology. 2014;141(1):148–157. 10.1017/S0031182013001121 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Masum H, Rao A, Good BM, et al. : Ten simple rules for cultivating open science and collaborative R&D. PLoS Comput Biol. 2013;9(9):e1003244. 10.1371/journal.pcbi.1003244 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Williamson AE, Ylioja PM, Robertson MN, et al. : Open Source Drug Discovery: Highly Potent Antimalarial Compounds Derived from the Tres Cantos Arylpyrroles. ACS Cent Sci. 2016;2(10):687–701. | 10.1021/acscentsci.6b00086| [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Else H: Radical open-access plan could spell end to journal subscriptions. Nature. 2018;561(7721):17–18. 10.1038/d41586-018-06178-7 [DOI] [PubMed] [Google Scholar]
- 25. Alperin JP, Fischman GE, McKiernan EC, et al. : How significant are the public dimensions of faculty work in review, promotion, and tenure documents? 2018. 10.17613/M6W950N35 [DOI] [PMC free article] [PubMed] [Google Scholar]