Skip to main content
F1000Research logoLink to F1000Research
. 2024 Sep 30;13:ELIXIR-230. Originally published 2024 Mar 27. [Version 2] doi: 10.12688/f1000research.146301.2

A research data management (RDM) community for ELIXIR

Flora D'Anna 1,a,#, Niclas Jareborg 2,b,#, Mijke Jetten 3,c,#, Minna Ahokas 4, Pinar Alper 5, Robert Andrews 6, Korbinian Bösl 7, Teresa D’Altri 8, Daniel Faria 9, Nazeefa Fatima 10, Siiri Fuchs 4, Clare Garrard 11, Wei Gu 5, Katharina F Heil 11, Yvonne Kallberg 12, Flavio Licciulli 13, Nils-Christian Lübke 14, Ana M P Melo 9, Ivan Mičetić 15, Jorge Oliveira 16, Anastasis Oulas 17, Patricia M Palagi 18, Krzysztof Poterlowicz 19, Xenia Perez-Sitja 19, Patrick Ruch 20,21, Susanna-Assunta Sansone 22, Helena Schnitzer 14, Celia van Gelder 3, Thanasis Vergoulis 23, Daniel Wibberg 14, Ulrike Wittig 24, Brane Leskošek 25, Jiri Vondrasek 26, Munazah Andrabi 27,d,#
PMCID: PMC11474151  PMID: 39410979

Version Changes

Revised. Amendments from Version 1

The reviewers comments were addressed in this version which included minor revisions at a few places for clarification. Figure legends have been modified to further explain the data used.

Abstract

Research data management (RDM) is central to the implementation of the FAIR (Findable Accessible, Interoperable, Reusable) and Open Science principles. Recognising the importance of RDM, ELIXIR Platforms and Nodes have invested in RDM and launched various projects and initiatives to ensure good data management practices for scientific excellence. These projects have resulted in a rich set of tools and resources highly valuable for FAIR data management. However, these resources remain scattered across projects and ELIXIR structures, making their dissemination and application challenging. Therefore, it becomes imminent to coordinate these efforts for sustainable and harmonised RDM practices with dedicated forums for RDM professionals to exchange knowledge and share resources.

The proposed ELIXIR RDM Community will bring together RDM experts to develop ELIXIR’s vision and coordinate its activities, taking advantage of the available assets. It aims to coordinate RDM best practices and illustrate how to use the existing ELIXIR RDM services. The Community will be built around three integral pillars, namely, a network of RDM professionals, RDM knowledge management and RDM training expertise and resources. It will also engage with external stakeholders to leverage benefits and provide a forum to RDM professionals for regular knowledge exchange, capacity building and development of harmonised RDM practices, keeping in line with the overall scope of the RDM Community.

In the short term, the Community aims to build upon the existing resources and ensure that the content of these remain up to date and fit for purpose. In the long run, the Community will aim to strengthen the skills and knowledge of its RDM professionals to support the emerging needs of the scientific community. The Community will also devise an effective strategy to engage with other ELIXIR structures and international stakeholders to influence and align with developments and solutions in the RDM field.

Keywords: Data management, Data stewardship, Data management plans, FAIR principles, community standards, Data management training, Research data life cycle, Common best practices

Introduction

In the 2017 position paper on FAIR Data Management in the Life Sciences ( Blomberg, 2017), ELIXIR emphasised Findable, Accessible, Interoperable and Reusable (FAIR) ( Wilkinson et al., 2016) data management as a crucial part of good scientific practice and research excellence, which requires professional skills and adequate resources. ELIXIR’s mission is to implement standards-based, FAIR data stewardship within European life science projects and to help its users to comply with FAIR and Open Science principles via the ELIXIR Nodes that support the FAIR data management needs of national research projects.

The ELIXIR position paper on FAIR Data Management in the life sciences also states that “ELIXIR Nodes are the national implementation of a harmonised FAIR Data Management programme for the life science” ( Blomberg, 2017), relying on the coordinated action of national Nodes to ensure harmonised data management practices. So far, initiatives to promote harmonisation of practices across Nodes have been conducted within the temporal limits of projects or within different ELIXIR structures (Nodes, Platforms, Communities, Focus Groups and the Staff Exchange Programme).

Since research funders started to require data management plans (DMPs) and the implementation of FAIR and Open Science principles, several Nodes have invested in data stewardship ( Carraro et al., 2022). A high return on investment of data stewardship has been identified for research in general ( Mons, 2020), and managing data has been recognized as a matter of infrastructure, institutions, and economics, instead of simply an individual practice ( Borgman & Bourne, 2021). Nevertheless, there is no consensus on the understanding of what a data steward actually does, the context in which they operate, and the position of the data steward within institutions ( Jetten et al., 2021). ELIXIR Nodes have been involved in national initiatives aimed at defining the role and the competences of a data steward. For instance, the National Programme Open Sciences (NPOS) and ELIXIR Netherlands developed a competency framework for data stewards, published in the EBI Competency Hub ( Scholtens et al., 2019). Although this framework has been adopted by various ELIXIR Nodes, a long-lasting context to discuss, harmonise and update data steward job profiles and competences is still missing.

Members from ELIXIR Nodes have been involved in RDM related activities carried out via Staff Exchange Programmes, Focus Groups (e.g. EOSC, RDA Activities, FAIR Training), Platforms, Communities (e.g. Plant Science s) and projects. The ELIXIR Interoperability Platform, in particular, has also operated in the RDM space, focussing on the services and their use in interoperability stories. The Software Development Best Practices Working Group of the ELIXIR Tools Platform developed the Software Management Plan (SMP) template. The ELIXIR-CONVERGE project created a strong network of data stewards and domain and training experts. Knowledge resources for RDM best practices, such as RDMkit and the Data Stewardship Wizard (DSW) ( Pergl et al., 2019), have been built, enriched and integrated with ELIXIR registries and offer an excellent body of knowledge to apply FAIR data management to life sciences research. Moreover, training resources on RDM have been developed and made findable via the ELIXIR Training Portal TeSS. The FAIRplus project connected ELIXIR with industry partners by developing key resources, including the FAIR Cookbook ( Rocca-Serra et al., 2022) to implement FAIR principles and tools to evaluate FAIRness of datasets. While these initiatives have built the foundations of strategic ambitions in data management, there is currently no mechanism in ELIXIR to sustain and strengthen them beyond the stipulated duration of the projects.

In this white paper, we explain how the ELIXIR Research Data Management (RDM) Community is of benefit to the broader ELIXIR landscape, including its Nodes, Platforms, Communities and ultimately scientists (end-users), with a coherent approach to data management. The establishment of the RDM Community aligns tightly with the ELIXIR Scientific Programme 2024-2028, which adopts the outcomes of ELIXIR-CONVERGE, FAIRplus and other projects. The RDM Community further develops and strengthens the networks of data management experts from these two main projects, the training and capacity building efforts, and data management knowledge for the ELIXIR community and beyond. Essential to the ELIXIR RDM Community is the strengthening and consolidation of these and other relevant ELIXIR data management resources. These resources include tools, best practices and expertise within the ELIXIR Platforms and Communities, and the collaboratively developed ELIXIR training and capacity building resources aimed at the broader community for reuse and adoption. The ELIXIR RDM Community as a unified long-term framework for RDM professionals across Europe to come together and exchange knowledge is the key to the implementation of harmonised data management practices across Nodes. The RDM Community is meant to be inclusive to anyone that has a professional interest in RDM for the life sciences in academia and industry both in and outside of ELIXIR.

Definition of terms.

Although FAIR management often refers to data, the same principles apply to other research objects ( Jiménez et al., 2017; Hong et al., 2022; Garcia et al., 2020), such as software and training materials. Thus, when referring to data, in this white paper, we also refer to other research objects. Since data management and data stewardship are newly evolving expertise areas, and there are no universally accepted definitions yet, in this white paper we will use both terms interchangeably. Our working definition is the one put forward by the Dutch Techcentre for Life Sciences (DTL/ELIXIR Netherlands): “Data stewardship is responsible planning and executing of all actions on digital data before, during and after a research project, with the aim of optimising the usability, reusability and reproducibility of the resulting data”.

Landscape of RDM in ELIXIR

RDM professionals

As a consequence of the implementation of DMPs, FAIR and open data policies at the European, national and institutional levels, a portfolio of RDM tools and services in each ELIXIR Node has become inevitable. In fact, based on a survey in March 2022 ( Carraro et al., 2022), at least 17 Nodes offer services related to data management and stewardship. 75% of these Nodes predict that the RDM services they offer are likely to be sustainable over time, indicating a clear ambition to continue providing these services. The type of services offered vary greatly from guidelines and consultancy to IT infrastructures and training. RDM services are requested and used by scientific and technical staff (e.g. researchers, core facilities, data stewards, consortia for projects etc.) from universities, research institutions and industry.

Based on the same survey, with answers from 11 Nodes, it seems that, on average, a Node itself has 2.9 FTE (full-time equivalents) as data stewards, 2.4 FTE as IT technical staff and 0.3 FTE as infrastructure managers ( Figure 1). Despite a considerable variation in the range of the number of FTEs among the Nodes (ranging from 1 to >5 FTE), this data illustrates the diversity of national environments in which ELIXIR RDM services operate. However, it does not seem straightforward to estimate the current number of FTE providing RDM-related services in each Node. The difficulty is mainly due to the lack of a consensus on what RDM services are and which organisation in the Node is providing the service, and the fact that RDM services are often provided as in-kind contributions by professionals in research, such as bioinformaticians, IT experts, software developers or librarians.

Figure 1. Number of full-time equivalents (FTEs) in Nodes providing RDM related services in 2021.

Figure 1.

Colours represent different ELIXIR Nodes. Data derived from the online surveys and interviews performed in the context of ELIXIR-CONVERGE ( Carraro et al., 2022), investigating how services of data management and stewardship are delivered and financially supported in ELIXIR Nodes.

ELIXIR-CONVERGE brought together RDM professionals from 23 Nodes, creating the Data Management (DM) Network, which currently counted 149 members, of which 37 were coordinators as representatives for each Node (1-2 per Node). The DM Network shared experiences and knowledge about implementing RDM governance, policy, technical infrastructures, business models, etc. in the different Nodes. This information has largely been published in RDMkit as tool assemblies for RDM and on the national resources pages so that it can be used as examples by other Nodes. The DM Network organised focus groups around several RDM topics, where experts discussed and generated guidelines for best practices (e.g. how to find data repositories, how to document data) shared via RDMkit. In a survey designed to evaluate the role of the DM Network among the ELIXIR Nodes involved in ELIXIR-CONVERGE ( Jareborg, 2023), 33 participants from 19 Nodes answered that the Network was clearly beneficial to their work and that keeping the network alive was important to them ( Figure 2). The main benefits perceived by the participants were knowledge exchange, information about RDM initiatives within ELIXIR, and finding experts for collaborations and training.

Figure 2. Survey of benefits of a Data Management expert network during the ELIXIR-CONVERGE project.

Figure 2.

(A) Perceived benefit of the Data Management Network, rated on a scale from 1 (not at all) to 5 (crucial for my work); (B) Importance of keeping the DM Network going after ELIXIR-CONVERGE ends, rated on a scale from 1 (not at all) to 5 (crucial); (C) Numbers of respondents that perceive they obtain different types of benefits by participating in the DM Network.

Similarly, in FAIRplus, almost 100 data professionals from academia (including ELIXIR Nodes), pharmaceutical and information service companies were brought together to share their knowledge on FAIR implementations in the FAIR Cookbook. Collaboration between public and private sectors is essential in and around FAIR RDM matters, and such networks should be cultivated and sustained.

ELIXIR guidelines and best practices for RDM

The RDMkit, FAIRCookbook and the Data Stewardship Wizard (DSW) are ELIXIR Recommended Interoperability Resources, that are the three main resources of ELIXIR guidance and best practices for RDM.

RDMkit

Launched in 2021, RDMkit (the Research Data Management toolkit for life sciences) is a knowledge resource reflecting the evolving best practices and guidelines in RDM. RDMkit was implemented as part of ELIXIR-CONVERGE to help with standardising life science data management across Europe. Professionals in RDM and researchers from ELIXIR Nodes, Communities use RDMkit to provide scoped guidelines that place the relevant life science RDM tools and resources in the context of research. RDMkit guidelines identify RDM issues, provide plausible solutions, recommend relevant tools for RDM activity, and illustrate how these tools have been combined to enable data life cycle management in various domains. RDMkit has integrations with other ELIXIR RDM resources, such as the FAIR Cookbook, the DSW and the ELIXIR registries ( FAIRsharing, bio.tools, TeSS). The knowledge contained in RDMkit has been recognised by the European Commission and recommended as a useful resource in the Horizon Europe Programme Guide and the ERC Guideline. Moreover, ELIXIR Belgium, Norway, and UK have added RDMkit as an ELIXIR Node service (see Node Service Selection Process.)

FAIR Cookbook

Launched in 2020, and created by researchers and data management professionals in academia, pharma and information service industries, the FAIR Cookbook covers the key steps in a FAIRification journey, ‘ levels and indicators of FAIRness, a maturity model, the technologies, tools and standards available, as well as the skills required and challenges presented. All with the aim of improving data FAIRness. The recipes are citable, via PIDs, and authors are credited, via their ORCID and the CreDiT ontology. The FAIR Cookbook is cross-referenced with other ELIXIR resources, such as the RDMkit, FAIRsharing, bio.tools, DSW (work in progress) and external resources such as the Pistoia Alliance’s FAIR Toolkit. As described ( Rocca-Serra et al., 2022), the FAIR Cookbook is uniquely positioned not only to serve as practical guidance to improve everyday tasks but is also contributing to a curriculum on FAIR data and informing discussions on the necessary changes to deliver FAIR within organisations. FAIR Cookbook is one of the key outcomes of the FAIRplus project, which connected ELIXIR with industry partners to address hurdles to FAIR implementation by developing practical guidelines, processes, and tools to make data FAIR. The UK, Luxembourg, Swiss and Spain Nodes have now included the FAIR Cookbook in their service delivery plan.

Data Stewardship Wizard (DSW)

DSW has been developed by a collaborative effort between ELIXIR Czech Republic and ELIXIR Netherlands, combining knowledge about data stewardship and a cutting-edge questionnaire using dynamic web forms developed by ELIXIR Czech Republic ( Pergl et al., 2019). DSW’s primary goal is to turn data management planning from an obligation into a benefit for a project by easing the process. Several experts in data stewardship have contributed to the latest versions of the question and answer flows in the system. Questions in DSW can use or link to external resources to provide answers. A prime example of this is the suggestion of databases and standards from FAIRsharing, based on responses to certain questions. RDMkit is frequently linked from DSW and the inverse link from RDMkit to the relevant section of DSW is created automatically; links are also being created with recipes in the FAIR Cookbook. DSW is used and recommended by several Nodes and funders. Specifically, ELIXIR Sweden and Norway have adopted DSW, as well as institutions in the Czech Republic, Netherlands, Spain, Denmark, France and Portugal. Funders such as EU Commission (Horizon EU programme guide), ZonMw (Netherlands), UB-BOTT (Norway), TAČR/OP JAK (Czech Republic) also recommend DSW.

RDM training resources

In 2021, RDM professionals, training experts and trainers from ELIXIR and beyond identified gaps in the Nodes’ RDM training programmes and defined priority topics (DMP, Data Stewardship, FAIR/metadata and Reproducibility) ( Cardona et al., 2021) for training development. Collaborative work from all the ELIXIR Nodes during ELIXIR-CONVERGE resulted in an extensive Data Management/Data Stewardship (DM/DS) course portfolio ( Cardona et al., 2021) 1 which aligns with the Nodes’ RDM training strategies. The portfolio includes both generic DM/DS resources and materials, as well as specific materials related to the priority topics mentioned above, and it also includes training materials for specific domains (e.g. the Plant Demonstrator). The course portfolio is available in TeSS, in which training materials are tagged with RDM terms to increase their findability, and links to relevant training materials were included in RDMkit. FAIRsharing has recently launched its FAIRsharing Educational component, in collaboration with EOSC and RDA data champions, across all disciplines. In collaboration with the ELIXIR Training Platform, the learning paths methodology has been applied to RDM for data stewards and researchers (e.g. with the Plants Sciences and the System Biology Communities), and work is ongoing to extend the ELIXIR-GOBLET Train-the-Trainer programme to tackle the challenges that exist in teaching RDM. As a result of these ELIXIR-CONVERGE activities, training expertise and capacity in RDM has been significantly increased in ELIXIR.

Examples of RDM initiatives in ELIXIR Nodes

Numerous Nodes are involved in or drive national initiatives aimed at capacity building, professionalisation of data stewardship and alignment of RDM practices across universities and institutions. Examples of national RDM initiatives in which ELIXIR Nodes are involved are listed in Table 1.

Table 1. Examples of RDM initiatives in ELIXIR Nodes.

ELIXIR Node Initiative Description
Belgium Flemish Research Data Network (FRDN)
  • Knowledge Hub: a Research Data Management Community of practice.
  • Making collaborative content that supports researchers in making their data open and FAIR.
Cyprus (observer) Cyprus Open Science Initiative
  • Recommendation to the National Open Science Cloud initiative in the EOSC governance to include RDM.
  • Provide technical and policy support for onboarding of service providers into EOSC, including data management.
Finland National Open Science coordination
  • Working group for professionalisation of RDM experts.
France Recherche Data Gouv
  • Ecosystem to share and open research data.
  • Data management clusters.
  • Thematic reference centre for Biology and Health.
Germany National Research Data Infrastructure (NFDI)
  • Systematic management of scientific and research data.
  • Consortia of stakeholders to provide science-driven data services to research communities.
Greece Hellenic Open Science Initiative
  • Recommendations on Open Science practices (including data management) for the national research performing and research funding organisations in Greece.
Italy Italian Computing and Data Infrastructure
  • Open Science Cafe to increase awareness about RDM and Open Science.
Netherlands Data Stewards Interest Group (DSIG) & Health-RI (health data infrastructure)
  • Platform for data professionals (beyond Netherlands) to share experiences.
  • FAIR data stewardship training and capacity building program for life sciences and health research (incl. ZonMw funded projects).
Norway BioMedData
  • Network of data management experts across research infrastructures.
  • Capacity building, identification and adoption of best practices.
Portugal Ready for BioData Management?
  • Capacity building program in data management for researchers and institutions.
  • Offering introductory and advanced courses in DMPs.
Slovenia ELIXIR-SI RDM expert group initiative
  • Capacity building, network of DM/DS experts, best practices.
  • DM/DMP tools and services.
Sweden SciLifeLab & Wallenberg National Program for Data-Driven Life Science
  • Supporting the RDM needs of the initiative.
Switzerland Swiss Open Research Data Strategy
  • Extending the existing nationwide network of data managers to include data stewards.
  • Developing a Certificate of Advanced Studies (CAS) in data management and data stewardship.
United Kingdom ELIXIR-UK DaSH Fellowship
FAIRsharing Educational
  • RDM capacity building and professionalising data stewardship.
  • Producing and delivering training in FAIR data stewardship using ELIXIR United Kingdom knowledge.
  • Working group for professionalisation of RDM experts.

The individuals driving, organising, or delivering these initiatives are part of ELIXIR Nodes. They learn from members of other Nodes about RDM skills, training, and resources, and then contribute to their national initiatives by organising events, managing national communities, providing training, adopting and sharing resources within their institutions, and taking on other leading and practical roles.

Challenges for RDM professionals in ELIXIR

Lack of consensus on data steward profiles

Institutions and universities that are part of ELIXIR and beyond recognise the importance of professional data stewards for RDM support for researchers in all stages of the research life cycle. RDM involves a variety of professionals, including research scientists, IT specialists, policy makers, legal staff, libraries, funding agencies and publishers. This leads to a lack of consensus and clarity on the responsibilities, knowledge and skills required for RDM professionals. This confusion stifles effective definition of the profile of a data steward across ELIXIR Nodes. Profiles of data stewards are often developed locally in member organisations of ELIXIR and vary greatly both within and between these organisations. Moreover, some RDM professionals are centralised within Nodes or national centres, whereas others are embedded into member universities and institutes. This poses the question of how these professionals can be brought together into an effective community that recognises and supports these differences.

A formalised ELIXIR RDM Community provides the forum necessary to overcome this lack of consensus and initiate the harmonisation efforts around professionalisation of data stewardship across Nodes. The members of the RDM Community can focus on sharing experiences and practices about how RDM services are implemented in each Node, taking into account the interactions with member Node, universities and institutions. This could be considered as a continuation of the work started by the DM Network in the ELIXIR-CONVERGE project. This exchange of experiences and practices will help to shape career pathways across the different ELIXIR member states and can function as examples for the sector.

The report of the Dutch National Programme Open Science (NPOS) and ELIXIR Netherlands have provided an excellent framework and a comprehensive list of competencies, skills and qualifications defining and classifying the role of the data stewards ( Jetten et al., 2021), as has the Danish National Forum for Research Data Management ( Wildgaard et al., 2020). The RDM Community aims to generalise these, and similar guidelines generated by other Nodes, in order to promote adoption of these profiles within the organisations hosting ELIXIR Nodes and within the respective national environments. The RDM Community will undertake this effort in alignment with current insights of the RDA Professionalising Data Stewardship Interest Group as well as the EOSC Task Force Data stewardship, curricula and career paths.

Lack of overview of the ELIXIR RDM ecosystem

ELIXIR’s activities impacting the management of research data and software in line with the FAIR (Findable, Accessible, Interoperable Reusable) principles are scattered across projects, Platforms, Communities and Focus Groups. The multitude of ELIXIR’s tools, information resources, registries, databases, training courses and material suitable for the implementation of the FAIR principles in data management (FAIRification journey) are valuable and often recommended by RDM professionals to peers and researchers. However, an overview of how to use the available RDM resources is missing, whether in an integrated fashion (as an ecosystem) or as individual modules. This makes their dissemination, use and application more challenging. Currently, a user of the ELIXIR services (e.g. researchers, RDM professionals, trainers, funders) has to find and explore each resource from its dedicated webpage, and understand when and how each guideline or component relates to another and how they could be used together during the data life cycle, to make data, software or other outputs FAIR. Although this problem has been partially tackled in the ELIXIR Guidelines section, it is by no means a comprehensive representation of how ELIXIR resources could be used as an ecosystem for good RDM and FAIRification throughout a research project.

Therefore, building an ELIXIR RDM Community where its members, who are involved in several ELIXIR initiatives related to RDM, FAIR implementation and assessment, can share their knowledge is essential for the delineation of an overview of the ELIXIR RDM ecosystem. The Community coordinates and advises about RDM initiatives, by highlighting overlaps, identifying synergy and reducing possible duplication of efforts. The RDM Community acts as a single point of reference for RDM knowledge and expertise under the ELIXIR branding.

Lack of an organisational framework to share and exchange RDM expertise

RDM professionals from different Nodes might have different expertise in specific aspects of research data management (e.g. writing DMPs, managing sensitive data, brokering, assessment and evaluation of FAIRness of digital object etc.), due to differences in the contexts in which Nodes operate. Although this diversity in expertise might be a challenge for the harmonisation effort, it also presents an opportunity for peers to learn from each other and complement their knowledge. However, RDM professionals face the difficulty of finding an effective framework that recognises these differences and supports the sharing of knowledge, particularly in small Nodes that do not receive national funding for such initiatives. Until now, the process of sharing expertise and learning from peers about RDM has only happened during time-limited projects, such as ELIXIR-CONVERGE (e.g. best practices working group, data brokering task group), FAIRplus (e.g. Squads and Fellows) Staff Exchange programme, BioHackathon projects and other RDM related initiatives.

The RDM Community is a structured and long-term framework where RDM professionals can learn new skills and strengthen their expertise by sharing their experience with peers on several practical RDM topics, which affect their day-to-day work. The shared knowledge can then be translated into tangible outputs, such as content for the ELIXIR RDM ecosystem and other materials beneficial for the ELIXIR organisation and the research community at large. These activities help to keep the existing ELIXIR RDM resources relevant and up to date. The RDM Community is a unique opportunity for its members to share expertise, increase their confidence and provide better and more harmonised services for researchers.

Lack of a sustainable framework for RDM training expertise and activities

During ELIXIR-CONVERGE and FAIRplus, extensive RDM course portfolios have been built, the RDM training expertise in ELIXIR Nodes has increased significantly and a strong RDM trainer network has been formed. All activities have been executed in close collaboration with the ELIXIR Training Platform and benefited from its resources. However, the work is far from complete, and course materials are in continuous need of updating. There is no clear portfolio owner or a proper framework for embedding RDM training activities and the RDM trainer network in the ELIXIR structure. The main RDM training related actions going forward are: (i) gathering the RDM courses in the portfolio into learning paths for specific audiences, thus shaping the ELIXIR RDM curriculum, (ii) increasing findability and reusability of the developed courses and modules by optimising annotation and linking in TeSS, RDMkit and FAIR Cookbook, (iii) encouraging (re) use of the RDM materials in ELIXIR Nodes, and (iv) increasing RDM training capacity in Nodes by finalising and launching the RDM Train the Trainer programme.

As is already the case for many other ELIXIR Communities (Galaxy, Single Cell Omics, Plants etc.), the RDM Community needs to establish a strong formal presence in the ELIXIR Training Platform. Thus, the consolidation and further expansion of RDM training activities in ELIXIR, according to and implementing the Training Platform best practices, will be safeguarded.

ELIXIR RDM Community

Members identity

The expected members of the Community are life science RDM professionals in academia and industry in ELIXIR and beyond. It is meant to be inclusive to anyone that has a professional interest in RDM for the life sciences. It shall be open to not only data stewards, but also similar roles like data manager, data architect or data engineer, as well as to interested experts from relevant infrastructure providers (data repositories, computing services and resources etc.) and experts in specific aspects of RDM (IT experts, librarians, policy officers, regulators, software developers, data scientist/analyst, bioinformaticians and trainers). The Community includes discipline or subject specific data stewards. Members should have an interest and an active role in implementing, supporting and promoting good RDM practices to make research data, software and other types of output compliant with Open Science and the FAIR principles.

Examples of activities that the intended Community members engage in are not only direct support to researchers, but also the development and implementation of RDM policies in organisations, as well as the establishment of tools, services, practices and training resources to enable proper data management. Any life science RDM professional that wants to join the Community can do so by registering for membership on the RDM Community web page. Association to ELIXIR is not required for membership.

Members of the ELIXIR RDM Community experience a multitude of benefits that enhance their professional capabilities and collective impact. By engaging with peers across Nodes, members gain valuable insights, share best practices, and exchange knowledge, thereby boosting their confidence and efficiency in delivering harmonised research data management (RDM) services to researchers. This collaborative environment not only strengthens internal cohesion but also serves as a central hub for aligning activities across Nodes, Platforms, and external stakeholders. As a result, members play a pivotal role in harmonising RDM practices, fostering collaboration, and driving innovation within and beyond the ELIXIR network.

Scope of the Community

Research data management is a broad discipline that, to different extents, affects the activities of most of the existing structures of the ELIXIR organisation. Therefore, it is necessary to outline the role and scope of the ELIXIR RDM Community given the established and ongoing activities in ELIXIR. The overall purpose of the RDM Community is to bring together experts in RDM to develop ELIXIR’s vision, and coordinate its activities, within the domain of research data management according to the FAIR principles. Peers in the Community will share knowledge, experience and practices in order to increase their confidence, efficiency, and deliver more harmonised RDM services for researchers across Nodes.

The responsibilities that are in scope for the Community can be categorised into three main interacting pillars: (i) the network of RDM professionals across all Nodes (and beyond), (ii) the management of RDM knowledge and know-how in the form of the content of the ELIXIR RDM ecosystem, and (iii) the management of the RDM training resources and expertise. As RDM is high on the agenda in the global research community, an important fourth component that goes across the three pillars is the engagement with other external stakeholders that are active in this area. Bringing these four elements under the roof of one ELIXIR RDM Community creates a focal point to harmonise these activities across Nodes, Platforms and Communities, as well as with key external stakeholders outside ELIXIR ( Figure 3).

Figure 3. The RDM Community: A focal point for RDM practice in ELIXIR.

Figure 3.

The RDM Community will engage and coordinate with ELIXIR Nodes, Platforms, Communities, Focus Groups and external stakeholders to promote and drive harmonisation of RDM activities within ELIXIR and beyond.

The network of RDM professionals

In the ELIXIR-CONVERGE and FAIRplus projects, networks of life science RDM professionals have been established that include named Node representatives from each ELIXIR Node, as well as other RDM staff in those Nodes. Peers in these networks have shared knowledge and generated a tangible output as guidelines for RDM best practices that mainly reside in RDMkit and FAIR Cookbook. The recognition of RDMkit and the FAIR Cookbook as a valuable source for guidance by the Horizon Europe programme guide and the Innovative Medicines Initiative, as well as the positive feedback given by the DM Network in the ELIXIR-CONVERGE project about its utility, has highlighted the need to establish a home for this group of professionals to continue these activities as a formal structure within ELIXIR, beyond time-bound projects. Providing a long-term framework for RDM professionals across Europe to come together and exchange knowledge is at the heart of an ELIXIR RDM Community. The ELIXIR RDM Community thus supersedes the less formal competence networks established in previous projects. One prioritised aspect of this group is to work on facilitating RDM capacity building in all ELIXIR Nodes, so that the Nodes can provide state-of-the-art RDM services for their local communities while aligning practices across countries and using ELIXIR resources.

Management of RDM knowledge

The comprehensive bodies of RDM knowledge that have been gathered by ELIXIR-CONVERGE, FAIRplus and earlier activities are important and valuable resources to guide RDM professionals and researchers in their everyday practice. This pool of knowledge collated by the community for the community, mainly resides in three different knowledge resources with differing scopes and granularity: RDMkit, DSW, and the FAIR Cookbook. Additional guidance on how to use FAIRsharing for RDM has been collected in FAIRsharing Educational. However, the content in these resources is not a static product but needs constant maintenance and expansion over time to remain relevant. It is the responsibility of the RDM Community to provide the maintenance and overall coordination of the content across this RDM ecosystem for FAIR data, as well as disseminating it among Nodes and institutions in the different countries. The activities of the editorial boards of RDMkit and FAIR Cookbook fit well within the remit of the Community, which will bring these resources closer together, for example under a single board, with individual steering groups. The content that the RDM Community shall capture and manage in these resources should comprise those topics that affect the RDM and FAIRness aspects of research outputs, such as data and software.

Management of RDM training resources and expertise

In ELIXIR-CONVERGE, an extensive portfolio of training resources has been built, as a basis for the ELIXIR DM/DS Curriculum. However, continuous work is needed to make them (re) usable and keep them up to date. Moreover, during ELIXIR-CONVERGE, a strong RDM trainers network was formed, establishing solid RDM training expertise within the Nodes. This emerging network raised great interest in the Nodes, showing their willingness to work together on developing training materials, sharing their knowledge, and establishing best practices and standards for RDM training.

The consolidation and expansion of this RDM trainers network is in the remit of the RDM Community. Also, there is still a lack of trainers and training, and a collaboration between the RDM Community trainers and the ELIXIR-GOBLET Train-the-Trainer programme is beneficial towards forecasting any pedagogical challenges that will arise when the training is rolled out. There is a clear need for dissemination of RDM know-how, practices and resources and a demand for RDM training material and courses, for researchers, trainers and RDM professionals. This can be solved by the RDM Community in close collaboration with the Nodes and the ELIXIR Training Platform. It is the responsibility of the RDM Community, and the network of RDM trainers within, to coordinate the activities to fulfil these needs. This will enable the upskilling of RDM trainers and scaling up RDM training capacity in the Nodes, while bridging to RDM professionals outside ELIXIR as well.

External stakeholders

Apart from the three main pillars above, the responsibility to coordinate the interactions with the increasing number of external international stakeholders in this area is in scope for the RDM Community. This is relevant for the activities of all three main pillars. ELIXIR has an RDA Activities Focus Group with the purpose of enabling ELIXIR members to leverage the benefits of the Research Data Alliance and to ensure that the life sciences point of view is represented in the RDA community. ELIXIR has, through participation in projects, collaborations and Platform activities, engaged with various other stakeholders that are working on solving RDM issues, such as EOSC through the ELIXIR EOSC Focus Group, ESFRIs, international organisations, and national infrastructures outside ELIXIR member states. The FAIRsharing Educational is an example of RDM and FAIR-related information material created under the RDA FAIRsharing WG, and with seed funds from EOSC. It is in the remit of the RDM Community to be responsible for how RDM aspects in these stakeholder activities can be leveraged for the Community, as well as how future developments should be influenced. Engagement with these external global stakeholders both through the various fora for knowledge exchange that they provide, as well as through the possibility for RDM professionals to join and contribute to the Community, will offer the possibility for dissemination and use of the open RDM resources developed and maintained by ELIXIR.

Out of scope

To set the boundary against other possible activities and ELIXIR services, the following should be considered to be out of scope for an ELIXIR RDM Community: the technical development and provisioning of services in the ELIXIR RDM ecosystem, providing RDM helpdesk services for researchers and consortia, delivering RDM training programmes, and curation of data already deposited in data repositories and knowledge bases. The technical service provisioning of RDMkit, DSW and the FAIR Cookbook shall lie with other parties within ELIXIR. However, the content management contributions for these resources must be done in close collaboration with the technical service providers, to ensure that they are fit for purpose.

The RDM Community will not provide any RDM helpdesk support services. These must reside in the various Nodes and other local organisations, though the activities in the RDM Community will help these organisations to establish such services for their local needs. Likewise, it is not in the remit of the Community to deliver training programmes. The development of training materials and teaching needs considerable time and effort and would be delivered through future (ELIXIR) projects and by the Nodes themselves, leveraging the training resources and expertise of the Community. The Community should, at least initially, focus on promoting FAIR at source approaches to data management and facilitating the researchers’ data journeys. Thus, curation of data already deposited in Deposition Databases and knowledge-based resources would be out of scope.

Alignment with ELIXIR structures

ELIXIR Platforms

Data Platform

In collaboration with the ELIXIR Data Platform the Community will develop data management best practice guidelines to facilitate the flow of FAIR data from life science research to Core Data Resources (CDR), ELIXIR Deposition Databases (EDD) and ELIXIR Community data resources. It includes establishing guidelines and maturity models for data brokering together with EDDs. Based on the best practice guidelines, the Data Platform and the RDM Community will work together to promote the harmonisation of database formats and standards. The Community will also benefit from the liaison of the Data Platform and the ELIXIR Biocuration Focus Group to exchange knowledge on best practices for data curation and quality assurance in databases and repositories.

Interoperability Platform

The ELIXIR Interoperability Platform develops, maintains and promotes interoperability services to help people and machines to discover, access, integrate and analyse biological data. In particular, the platform has also operated in the RDM space, focusing on the services and their use in interoperability stories, and plans on continuing to build the FAIR-enabling portfolio of products, processes and practices. The RDM Community will collaborate with the Platform to incorporate interoperability best practices and guidelines aimed at researchers, RDM professionals and infrastructure providers, into the RDM ecosystem. The Platform has activities aimed at alignment of FAIR service architecture across all ELIXIR Communities as well as the Data Platform and ELIXIR projects. The results of those activities that are related to RDM will be included in the ELIXIR RDM ecosystem, whose content coordination falls under the scope of the RDM Community. While incentivizing interoperability practices in RDM, the Community will promote key services offered by the Interoperability platform such as the ELIXIR Recommended Interoperability Resources (RIRs) (e.g. OLS, FAIRsharing, RightField, ISA, Identifiers.org, etc.), and the Platform initiatives (e.g. Bioschemas, RO-Crate, RightField, etc.). The RDM Community will contribute to the harmonisation of interoperability standards across domains and data resources and work together with the Platform for wider adoption.

Training Platform

The RDM trainers network is an integral part of the ELIXIR Training community. As such the RDM Community benefits from and contributes to many ELIXIR Training Platform activities. The RDM Community will continue to expose the information about its training events and materials in TeSS. During ELIXIR-CONVERGE, standardised keywords for RDM training courses and material in TeSS were established to improve the visibility and findability; this work will be further improved. RDM trainers also actively contributed to the work of the FAIR Training Focus Group, both on creating content for courses on FAIR data, as well as on the FAIR Training Handbook. This handbook is currently being developed to train instructors on how to make training materials FAIR, and this should be the basis for the FAIRification of RDM training materials. Furthermore, the RDM trainers will continue using the assessment strategy ( Gurwitz et al., 2020) implemented by the Training Platform, which measures the quality and impact of ELIXIR training events in a coordinated and consistent data collection approach. Finally, RDM Train the Trainer activities will be aligned with the ELIXIR-GOBLET Train-the-Trainer programme. New and expert trainers will continue to build their training skills by following the pedagogical best practices taught in the ELIXIR-GOBLET Train-the-Trainer programme ( Morgan et al., 2017) courses. They will also benefit from the materials and expertise developed in ELIXIR-CONVERGE.

Tools Platform

The RDM Community will align with the Tools Platform to promote the use of available registries and tools for research data management by describing it in best practices, and establish cross-fertilization actions to exchange best practices for research data management and research software management, e.g. making FAIR software and workflows. The RDM Community will also be in a position to convey user needs and feedback to the Platform and tool providers. The RDM Community and the Tools Platform could work together on guidelines for writing Data and Software Management Plans (DMP/ SMP), particularly for European funders, for projects involving both data and software development. The aim could be to provide researchers with guidelines for an integrated approach to manage both data and software in the same project.

Compute Platform

The RDM Community will collaborate with the Compute Platform to write and disseminate best practices to make FAIR software, FAIR data analysis workflows, and secure and trustworthy computing environments. The RDM Community will ensure that computational services about authentication and authorisation, storage and data transfer, and cloud and computing resources are included in the RDM ecosystem and that their use is described from the RDM perspective.

ELIXIR Communities

The existing ELIXIR Communities are key interaction partners that deal with real-world RDM problems and challenges for their respective domains. They offer a possibility for knowledge exchange and capacity building that goes both ways. The RDM Community can use the challenges of the different Communities to help develop and improve RDM guidelines, solutions, standards, learning paths and training for the different domains, as well as to learn from, and leverage, developments made in the Communities themselves to improve on general RDM principles and solutions so that they stay fit for purpose over time. This in turn will help to drive standards harmonisation across communities, whenever possible.

Several ELIXIR Communities (e.g. Plants, Microbiome, Microbial Biotechnology, Toxicology, etc.) have already used RDMkit, DSW and FAIR Cookbook for dissemination of domain specific standards and RDM best practices. Domain pages in the RDMkit include many of the existing ELIXIR Communities, as well as descriptions of tool assemblies that serve as inspiration for real-world solutions to RDM challenges for many life science domains. One such example is the RDMkit Marine metagenomics domain page by the Microbiome Community that outlines RDM considerations and standards, as well as pointing to the tool assembly page for the Norwegian marine metagenomics tool assembly. The ELIXIR Communities have also engaged with RDM training experts in ELIXIR-CONVERGE to create learning paths targeted to the RDM training needs in the respective scientific domains. One such example is the ELIXIR Plant Sciences Community that has established and enhanced learning paths for the Community with the input of stakeholders outside of ELIXIR and ELIXIR-CONVERGE. The general aspects of this work also feed back to the RDM Community as a framework that can be reused in other domains. We foresee that individuals in the different ELIXIR Communities with an interest in RDM will also be members of the RDM Community, so that good RDM becomes embedded in the activities of the ELIXIR Communities.

ELIXIR Focus Groups

Two key Focus Groups in ELIXIR with a remit to interact with external stakeholders and that have a prominent focus on research data management are the RDA Activities Focus Group and the EOSC Focus Group. The RDM Community provides a forum for the bidirectional dissemination of the work done in these two Focus Groups. This activity will improve the exchange and uptake of RDM developments made in these external organisations to ensure that ELIXIR aligns with, and makes most use of, state of the art RDM best practice and know-how. This also has the potential to increase the engagement of RDM professionals in these Focus Groups, which will further strengthen the interactions with these important external stakeholders. As part of this, it offers the opportunity to influence even further the RDM developments in the activities of RDA and EOSC. The ELIXIR EOSC Strategy 2022 ( Tedds et al., 2022) by the EOSC Focus Group states that “ELIXIR is a partner for EOSC - in implementation and application”. The RDM Community shall strive to take on this aspiration for ELIXIR in the area of research data management, acting as a communication hub for standards, interoperability and data flow related questions between ELIXIR and EOSC. There is also a clear connection to the FAIR Training Focus Group, with which the Community collaborates via the RDM trainers around best practices for FAIRification of digital objects. Several of the other ELIXIR Focus Groups also deal with different RDM aspects. The Community will reach out to these Focus Groups to address research data management questions for their particular domains, e.g. data readiness with the Machine Learning (AI) Focus Group and standards for particular life science domains, such as for the Pathogen Data Focus Group.

Objectives of the RDM Community

Short-term and longer-term objectives of the RDM Community are listed in Table 2 below. Short-term objectives focus on sustaining and consolidating the outputs of previous initiatives and projects into the unified structure of the RDM Community. Longer-term objectives, instead, aim to tackle the challenges that the Nodes are facing and to work towards a coordinated action of Nodes to ensure a harmonised FAIR Data Management programme for life science.

Table 2. Objectives of the RDM Community.

Area Objective
Short-term (~2 years)
People network Provide a forum that enables regular inter-personal knowledge exchange between life science RDM professionals.
Facilitate ELIXIR Node capacity building in deploying RDM services, e.g. by establishing a knowledge handbook for RDM service providers and developing an RDM services maturity model for ELIXIR Nodes.
RDM knowledge Coordinate and contribute to the RDM ecosystem content by collaborating with the editorial boards of RDMkit and FAIR Cookbook, to ensure that the content is continuously fit for purpose.
Collect requirements that would facilitate flow of FAIR data from researchers to repositories (such as the ELIXIR Deposition Databases) via RDM services, in order to improve the FAIRness of research data made available for life science and society at large.
RDM training Provide a forum that enables knowledge and teaching best practices exchanges among RDM trainers.
Ensure that RDM training materials are being developed and FAIRified, enabling reuse and adoption by the ELIXIR Nodes.
Develop RDM learning paths for researchers, data stewards, tool developers and trainers.
External stakeholders Create and maintain an overview of external stakeholder engagements with RDM aspects across ELIXIR Platforms, Communities, Focus Groups and projects.
Long-term (~3-5 years)
People network Implement initiatives to enable a harmonised FAIR data management programme for life science across Nodes.
Strengthen skills and knowledge of life science RDM professionals in the Nodes.
Create strong links with national RDM communities to leverage relevant resources or initiatives, and to increase the usage of the ELIXIR RDM ecosystem at the national level.
Continue to sharpen the professional profile of RDM personnel and enable attractive career pathways within the national and European research sectors.
RDM knowledge Support ELIXIR structures - Platforms, Communities, and Focus Groups - regarding best practices in their RDM and FAIRification activities.
Harmonise RDM standards across ELIXIR Communities, wherever possible.
RDM training Maintain and continuously develop an up-to-date collection of RDM training resources that supports the delivery of the defined RDM learning paths.
Establish a strong RDM training expertise hub in ELIXIR.
Ensure all ELIXIR Nodes have sufficient RDM training expertise and capacity through Train the Trainer activities.
External stakeholders Devise a clear strategy on how to strengthen the relationships to key external stakeholders and how to coordinate RDM aspects across ELIXIR in engagements with these, to influence and align with international developments and solutions in the RDM field.
Develop and evolve RDM guidance and models with the Life Science RIs and EOSC, acting as a communication hub for standards, interoperability and data flow related questions.
Engage with national funders to optimise RDM guidance and policy in order to maximise public gain through data deposition and reuse.

Conclusions

RDM has become the focus point for many funders and institutions that are part of ELIXIR and for the ELIXIR organisation itself. RDM professionals in ELIXIR have demonstrated their RDM expertise by bringing their knowledge, experience, know-how and skills to successful projects that aimed to support RDM. Therefore, the ELIXIR RDM Community, as defined by the scope and the objectives described in this paper, provides the way required to start tackling RDM challenges for the life sciences and to harmonise the approach to FAIR data management across ELIXIR Nodes and beyond.

Acknowledgements

We would like to thank the many other members of ELIXIR Nodes that have contributed with useful discussions and input.

Funding Statement

ELIXIR-CONVERGE: European Union’s Horizon 2020 programme under grant agreement No 871075. FAIRplus: European Union’s Horizon 2020 programme under grant agreement No 802750.

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 2; peer review: 2 approved]

Footnotes

1

Updated numbers (February 2020 - December 2022): 18 Nodes contributed to 119 training events with 2849 participants.

Data availability

No data are associated with this article.

References

  1. Blomberg N, ELIXIR Consortium : ELIXIR position paper on FAIR data management in the life sciences [version 1; not peer reviewed]. F1000Res. 2017;6(ELIXIR):1857. (document). 10.7490/f1000research.1114985.1 [DOI] [Google Scholar]
  2. Borgman C, Bourne P: Why it takes a village to manage and share data. arXive e-prints. 2021. 10.48550/arXiv.2109.01694 [DOI]
  3. Cardona A, Gelder C, Palagi PM, et al. : ELIXIR-CONVERGE D2.1 Report on the identified training needs and a timeline for training courses. Zenodo. 2021. 10.5281/zenodo.4892863 [DOI]
  4. Carraro M, Tosatto S, Jareborg N, et al. : ELIXIR-CONVERGE D1.2 Survey of business models. Zenodo. 2022. 10.5281/zenodo.6344131 [DOI]
  5. Chue Hong NP, Katz DS, Barker M, et al. : FAIR Principles for Research Software (FAIR4RS Principles) (1.0). Zenodo. 2022. 10.15497/RDA00068 [DOI] [PMC free article] [PubMed]
  6. Garcia L, Batut B, Burke ML, et al. : Ten simple rules for making training materials FAIR. PLoS Comput. Biol. 2020;16(5):e1007854. 10.1371/journal.pcbi.1007854 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Gurwitz KT, Singh GP, Bellis LJ, et al. : A framework to assess the quality and impact of bioinformatics training across ELIXIR. PLoS Comput. Biol. 2020;16(7):e1007976. 10.1371/journal.pcbi.1007976 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Jareborg N: ELIXIR-CONVERGE - Survey of benefits of an Data Management expert network (1.0.0).[Data set]. Zenodo. 2023. 10.5281/zenodo.7565899 [DOI]
  9. Jetten M, Grootveld M, Mordant A, et al. : Professionalising data stewardship in the Netherlands. Competences, training and education. Dutch roadmap towards national implementation of FAIR data stewardship (1.1). Zenodo. 2021. 10.5281/zenodo.4623713 [DOI]
  10. Jiménez RC, Kuzak M, Alhamdoosh M, et al. : Four simple recommendations to encourage best practices in research software [version 1; peer review: 3 approved]. F1000Res. 2017;6:876. 10.12688/f1000research.11407.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Mons B: Invest 5% of research funds in ensuring data are reusable. Nature. 2020;578:491. 10.1038/d41586-020-00505-7 [DOI] [PubMed] [Google Scholar]
  12. Morgan SL, Palagi PM, Fernandes PL, et al. : The ELIXIR-EXCELERATE Train-the-Trainer pilot programme: empower researchers to deliver high-quality training [version 1; peer review: 2 approved]. F1000Res. 2017;6:1557. 10.12688/f1000research.12332.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Pergl R, Hooft R, Suchánek M, et al. : “Data Stewardship Wizard”: A Tool Bringing Together Researchers, Data Stewards, and Data Experts around Data Management Planning. Data Sci. J. 2019;18. 10.5334/dsj-2019-059 [DOI] [Google Scholar]
  14. Rocca-Serra P, Gu W, Ioannidis V, et al. : The FAIR Cookbook - the essential resource for and by FAIR doers (1.0). Zenodo. 2022. (in press). 10.5281/zenodo.7156792 [DOI] [PMC free article] [PubMed]
  15. Scholtens S, Jetten M, Böhmer J, et al. : Final report: Towards FAIR data steward as profession for the lifesciences. Report of a ZonMw funded collaborative approach built on existing expertise. Zenodo. 2019. 10.5281/zenodo.3474789 [DOI]
  16. Tedds J, Capella-Gutierrez S, Clark-Casey J, et al. : ELIXIR EOSC Strategy 2022. Zenodo. 2022. 10.5281/zenodo.7120997 [DOI]
  17. Wildgaard L, Vlachos E, Nondal L, et al. : National Coordination of Data Steward Education in Denmark: Final report to the National Forum for Research Data Management (DM Forum) (Version 1). Zenodo. 2020. 10.5281/zenodo.3609516 [DOI]
  18. Wilkinson M, Dumontier M, Aalbersberg I, et al. : The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data. 2016;3:160018. 10.1038/sdata.2016.18 [DOI] [PMC free article] [PubMed] [Google Scholar]
F1000Res. 2024 Oct 14. doi: 10.5256/f1000research.172134.r328117

Reviewer response for version 2

Sara El Gebali 1

No further comments

Is the topic of the opinion article discussed accurately in the context of the current literature?

Yes

Are arguments sufficiently supported by evidence from the published literature?

Yes

Are all factual statements correct and adequately supported by citations?

Yes

Are the conclusions drawn balanced and justified on the basis of the presented arguments?

Yes

Reviewer Expertise:

Research Data Management, Curation, Semantic web

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

F1000Res. 2024 Oct 3. doi: 10.5256/f1000research.172134.r328116

Reviewer response for version 2

Jeff Christiansen 1

Thankyou to the authors for addressing all of my comments on the first version of the manuscript.

The manuscript now clearly outlines the broad research data management (RDM) landscape across ELIXIR, challenges facing RDM professionals across ELIXIR and beyond; and the objectives and scope of the ELIXIR RDM Community to support life science RDM professionals in Europe and beyond.

Is the topic of the opinion article discussed accurately in the context of the current literature?

Yes

Are arguments sufficiently supported by evidence from the published literature?

Yes

Are all factual statements correct and adequately supported by citations?

Yes

Are the conclusions drawn balanced and justified on the basis of the presented arguments?

Yes

Reviewer Expertise:

Life Science and Bioinformatics Digital Infrastructure to support Research

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

F1000Res. 2024 Jun 6. doi: 10.5256/f1000research.160372.r281726

Reviewer response for version 1

Sara El Gebali 1

The article “A Research Data Management (RDM) Community for ELIXIR” effectively highlights the pressing need for a coordinated approach to RDM within the ELIXIR infrastructure. By proposing the formation of an ELIXIR RDM Community, the article addresses the current inefficiencies caused by scattered resources and aims to align practices with the FAIR (Findable, Accessible, Interoperable, Reusable) and Open Science principles. The vision for this community is well-articulated and focuses on three main pillars: networking, knowledge management, and training.

Minor comments:

  • While the article lays out a clear vision for the ELIXIR RDM Community, it would benefit from more detailed implementation strategies which include potential challenges and mitigation strategies.

  • The interest in making outputs globally available is mentioned but not detailed. The section “Table 2. Objectives of the RDM Community” could include plans for connecting with the wider global community. Efforts in collating available training/RDM material or translating them into other languages like French or Spanish would benefit a wider population, such as those in Africa and Latin America.

  • Engagement with External Stakeholders:  This section could be moved closer to the description of the network of RDM professionals. 

  • The manuscript lists examples of national RDM initiatives involving ELIXIR Nodes but does not explain the nature of their involvement. Highlighting how ELIXIR contributes to these efforts would strengthen the link between ELIXIR nodes and other national initiatives, showcasing the added value of ELIXIR’s participation 

  • Members Benefits: Including a section on member benefits would make the ELIXIR RDM Community more attractive to potential members. This could detail professional development opportunities, access to exclusive resources, and the ability to influence RDM practices on a broader scale.

  • To make the community more inviting and keep potential members interested, the manuscript should introduce the expected members earlier on. When readers know right away that they are welcome, they are more likely to stay engaged. Perhaps consider moving this text earlier in the article. 
    •  “The expected members of the Community are life science RDM professionals in academia and industry in ELIXIR and beyond. It is meant to be inclusive to anyone that has a professional interest in RDM for the life sciences. It shall be open to not only data stewards, but also similar roles like data manager, data architect or data engineer, as well as to interested experts from relevant infrastructure providers (data repositories, computing services and resources etc.) “

Figure Legends:

    •    Figure 1: The colors of the points need to be clearly defined to ensure readers understand what each point represents.

    •    Figure 3: The description should be expanded to clearly outline the role of the RDM Community as a focal point for RDM practices within ELIXIR. This will enhance understanding of its central role in coordinating and harmonizing RDM activities.

Is the topic of the opinion article discussed accurately in the context of the current literature?

Yes

Are arguments sufficiently supported by evidence from the published literature?

Yes

Are all factual statements correct and adequately supported by citations?

Yes

Are the conclusions drawn balanced and justified on the basis of the presented arguments?

Yes

Reviewer Expertise:

Research Data Management, Curation, Semantic web

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

F1000Res. 2024 Sep 18.
Munazah Andrabi 1

We thank the reviewer for the insightful comments. Please, find below our responses to each comment and the references to the revised manuscript.

Major comments:

  • While the article lays out a clear vision for the ELIXIR RDM Community, it would benefit from more detailed implementation strategies which include potential challenges and mitigation strategies.

Response

We appreciate your suggestion; however, we have decided not to include this information directly in the text. Our focus in this white paper is on the current status quo and the ELIXIR Community roadmap. The implementation will follow ELIXIR and community best practices, primarily through a bottom-up approach, with concrete initiatives aimed at identifying potential challenges and solutions. As we work towards building an active and engaged community, activities will be community-driven, with ongoing assessments of challenges. If necessary, mitigation strategies will be developed as work progresses in each area.

 

  • The interest in making outputs globally available is mentioned but not detailed. The section “Table 2. Objectives of the RDM Community” could include plans for connecting with the wider global community. Efforts in collating available training/RDM material or translating them into other languages like French or Spanish would benefit a wider population, such as those in Africa and Latin America.

Response

We appreciate your suggestion; however, we have decided not to include this information directly in the text.

While the interest in making outputs globally available is noted, it is important to clarify our current scope and priorities. Our primary focus at this stage is to establish a strong foundational network within our existing collaborations. We are coordinating the existing RDM community, working closely with other ELIXIR communities and platforms such as the Training Platform, TeSS group, and Interoperability Platforms, which manage and co-develop resources within the RDM ecosystem. We also collaborate with the Focus Groups in Research Data Alliance(RDA), which is our point of contact for global outreach and synergies. Although broader outreach is within our objectives, translating materials into other languages like French or Spanish is not, as such activities can be done better professionally at each Node level. Our current efforts are concentrated on enhancing collaboration and resource sharing within our established network.

  • Engagement with External Stakeholders:  This section could be moved closer to the description of the network of RDM professionals.

Response

While we appreciate the merit of restructuring the text by moving this section, we have decided to keep the External Stakeholders section in its current place in the texts as we think it fits best as part of the description of the scope of the Community.

 

  • The manuscript lists examples of national RDM initiatives involving ELIXIR Nodes but does not explain the nature of their involvement. Highlighting how ELIXIR contributes to these efforts would strengthen the link between ELIXIR nodes and other national initiatives, showcasing the added value of ELIXIR’s participation

Response

The type of contribution to national initiatives by Node members has been now highlighted with the addition of the following text under the “Examples of RDM initiatives in ELIXIR Nodes” paragraph:

The individuals driving, organising, or delivering these initiatives are part of ELIXIR Nodes. They learn from members of other Nodes about RDM skills, training, and resources, and then contribute to their national initiatives by organising events, managing national communities, providing training, adopting and sharing resources within their institutions, and taking on other leading and practical roles.

  • Members Benefits: Including a section on member benefits would make the ELIXIR RDM Community more attractive to potential members. This could detail professional development opportunities, access to exclusive resources, and the ability to influence RDM practices on a broader scale.

Response

The paragraph on benefits for members of the RDM Community is now included in the paper. The following text has been added to the “Members identity” paragraph:

Members of the ELIXIR RDM Community experience a multitude of benefits that enhance their professional capabilities and collective impact. By engaging with peers across Nodes, members gain valuable insights, share best practices, and exchange knowledge, thereby boosting their confidence and efficiency in delivering harmonised research data management (RDM) services to researchers. This collaborative environment not only strengthens internal cohesion but also serves as a central hub for aligning activities across Nodes, Platforms, and external stakeholders. As a result, members play a pivotal role in harmonising RDM practices, fostering collaboration, and driving innovation within and beyond the ELIXIR network.

  • To make the community more inviting and keep potential members interested, the manuscript should introduce the expected members earlier on. When readers know right away that they are welcome, they are more likely to stay engaged. Perhaps consider moving this text earlier in the article. 

Response

The following text has been added to the Introduction section of the manuscript:

“The RDM Community is meant to be inclusive to anyone that has a professional interest in RDM for the life sciences in academia and industry both in and outside of ELIXIR.”

Comment

Figure Legends:

Figure 1: The colors of the points need to be clearly defined to ensure readers understand what each point represents.

Response

The colours represent different ELIXIR Nodes. The following sentence has been added to the figure legend to clarify this.

 

“Colours represent different ELIXIR Nodes .”

Comment

Figure 3: The description should be expanded to clearly outline the role of the RDM Community as a focal point for RDM practices within ELIXIR. This will enhance understanding of its central role in coordinating and harmonising RDM activities.

Response

The following sentence has been added to the Figure legend:

The RDM Community will engage and coordinate with ELIXIR Nodes, Platforms, Communities, Focus Groups and external stakeholders to promote and drive harmonisation of RDM activities within ELIXIR and beyond.

F1000Res. 2024 Jun 3. doi: 10.5256/f1000research.160372.r281723

Reviewer response for version 1

Jeff Christiansen 1

ELIXIR is a European network of life science data related infrastructures, that is structured around the concepts of Nodes (collection of research institutes within member countries), Platforms (centred around Data, Tools, Compute, Interoperability and Training infrastructures), and Communities (groups of experts across the ELIXIR Nodes that represent a scientific or technological theme, and drive the development of standards, services, and training in and across ELIXIR).

In any open science research endeavour, good research data management is central to conducting best practice research, and enabling research data to be made Findable, Accessible, Interoperable and Reusable (FAIR) enables its re-use where possible and appropriate. In recognition of the central importance of Research Data Management (RDM) to research practice, over the past decade there has been considerable investments by ELIXIR and its Nodes into RDM, through various time-boxed projects and initiatives, and these have resulted in the development many tools and resources to assist. There was however (surprisingly) until recently, no dedicated and ongoing channel for RDM professionals across ELIXIR to exchange knowledge and share resources.

Driven by a need for better coordination, and the generation of sustainable and harmonised RDM practices across ELIXIR, a Research Data Management (RDM) Community within ELIXIR (see https://elixir-europe.org/communities/research-data-management) was recently formed.

The white paper by D’Anna and co-authors (from 17 ELIXIR Nodes), describes the broad landscape of RDM across ELIXIR (including previous joint ELIXIR efforts and projects); RDM tools and guidelines produced by ELIXIR to date; examples of RDM initiatives across various ELIXIR nodes; challenges facing the many RDM professionals across ELIXIR; the objectives and scope of the ELIXIR RDM Community; and how the RDM Community is intended to align with other ELIXIR components (Platforms, other Communities and Focus Groups).

Forming the ELIXIR RDM Community represents a very sensible approach to address the ongoing RDM coordination and uplift challenges across the ELIXIR network, as it leverages previous RDM related outputs and many aspects of the existing operational structures of ELIXIR to underpin its operations. It will also essentially to bring together a community of RDM experts to further develop ELIXIR’s vision and coordinate its RDM activities, initially leveraging existing assets, and importantly, to illustrate to researchers and others how to practically use the various ELIXIR RDM resources.

Overall, the paper is a clear and informative review. It is well written, and logically presented, covering various facets, all of which are relevant for a white paper of this type.

Major comment:

The External Stakeholders paragraph briefly describes how the RDM Community and its members may interact with RDM groups beyond ELIXIR through various projects and initiatives. The activities and outputs of this group will however be of great interest outside Europe as improving RDM is a global challenge. Will all outputs/products/guides etc produced by the RDM Community be made available for adoption and adaptation by others outside the ELIXIR network? If so, how will this occur? The manuscript would benefit from some further commentary on the approaches the RDM Community will utilise to enable global shareability/findability of resources they will produce or curate. With respect to this global angle, it would also be useful to understand the membership rules for the ELIXIR RDM Community - i.e. can external stakeholders can join the RDM Community or is membership restricted to those associated ELIXIR Nodes?

The authors describe the ELIXIR Data Management Network https://elixir-europe.org/about-us/how-funded/eu-projects/converge/wp1/dm-network. How does/will the DM network and RDM Community interact? Will the RDM Community supersede the DM network?

Minor comments:

Tense - the ELIXIR RDM Community appears to already exist ( https://elixir-europe.org/communities/research-data-management) but the future tense is often used in the manuscript (e.g. in Abstract (p2), the ‘proposed’ ELIXIR RDM Community ‘will’ bring together; Introduction (p4) the ‘proposed’ ELIXIR RDM Community etc), which is confusing to the reader. If the RDM Community has been established, I would request that the tense of the manuscript is changed to the present tense to reflect this situation.

Figure 1 Legend - It’s not clear what the colours of the points represent. I assume different ELIXIR Nodes, but can this please be defined in the Figure legend?

Figure 2 Legend. A self contained legend would be beneficial, to explain that the results are from the 2023 survey of DM Network members.

P7 “RDMkit has been implemented” > “RDMkit was implemented’

P7 - “ELIXIR Belgium, Norway, Sweden, and UK have added RDMkit as a Node service”. This needs more explanation. What is a “Node service”? This should be defined.

P7 - “FAIR Cookbook is one of the key outcomes of the FAIRplus project” - link/ref missing for FAIRplus project.

P7 - It would be useful to note that the RDMkit, FAIRCookbook and DSW are all ELIXIR recommended interoperability resources https://elixir-europe.org/news/resource-announcement-2023

P7 - “RDM professionals, training experts and trainers from ELIXIR and beyond” > “ In 2021, RDM professionals, training experts and trainers from ELIXIR and beyond”

P9 - “Institutions and universities part of ELIXIR” > “Institutions and universities that are part of ELIXIR”

P10. “Therefore, building an ELIXIR RDM Community where its members, which are involved” > “Therefore, building an ELIXIR RDM Community where its members, who are involved”

P10. “particularly in small Nodes that do not receive national fundings for such initiatives” > “particularly in small Nodes that do not receive national funding for such initiatives”.

Is the topic of the opinion article discussed accurately in the context of the current literature?

Yes

Are arguments sufficiently supported by evidence from the published literature?

Yes

Are all factual statements correct and adequately supported by citations?

Yes

Are the conclusions drawn balanced and justified on the basis of the presented arguments?

Yes

Reviewer Expertise:

Life Science and Bioinformatics Digital Infrastructure to support Research

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

F1000Res. 2024 Sep 18.
Munazah Andrabi 1

We thank the reviewer for the insightful comments. Please, find below our responses to each comments and the references to the revised manuscript. 

Major comment:

The External Stakeholders paragraph briefly describes how the RDM Community and its members may interact with RDM groups beyond ELIXIR through various projects and initiatives. The activities and outputs of this group will however be of great interest outside Europe as improving RDM is a global challenge. Will all outputs/products/guides etc produced by the RDM Community be made available for adoption and adaptation by others outside the ELIXIR network? If so, how will this occur? The manuscript would benefit from some further commentary on the approaches the RDM Community will utilise to enable global shareability/findability of resources they will produce or curate. With respect to this global angle, it would also be useful to understand the membership rules for the ELIXIR RDM Community - i.e. can external stakeholders can join the RDM Community or is membership restricted to those associated ELIXIR Nodes?

Response

Dissemination of outputs to a global audience:

All outcomes from the RDM Community will be open and accessible to all and published in open resource repositories like Zenodo. We will also be actively disseminating content and expertise  via international conferences like the ECCB. In addition, various ELIXIR nodes are part of international projects like BY-COVID, BioIndustry 4.0 and EOSC Focus groups. Outside the EU ELIXIR actively engages with the NIH in USA and Australian BioCommons. Our resources like RDMkit and FAIRCookbook are already in use by these institutions.

The following paragraph has been added to the paper, in the “scope of the community - external stakeholders” section:

“Engagement with these external global stakeholders both through the various fora for knowledge exchange that they provide, as well as through the possibility for RDM professionals to join and contribute to the Community, will offer the possibility for dissemination and use of the open RDM resources developed and maintained by ELIXIR.”

Membership rules:

Anyone interested can register  via the link (community page).

The following sentence with the relevant link has been added to the manuscript, under the “members identity” paragraph:

“Any life science RDM professional that wants to join the Community can do so by registering for membership on the RDM Community web page . Association to ELIXIR is not required for membership.”

Comment

The authors describe the ELIXIR Data Management Network https://elixir-europe.org/about-us/how-funded/eu-projects/converge/wp1/dm-network. How does/will the DM network and RDM Community interact? Will the RDM Community supersede the DM network?

Response

The DM Network was created and active during ELIXIR-CONVERGE as a time-bound project activity. The RDM Community is a continuation of, and supersedes, the DM Network as a formal structure ELIXIR.

The following sentences have been added in the “Scope of the community - The network of RDM professionals” paragraph to clarify that the DM Network was a network during the project, and that the RDM Community supersedes the DM Network.

“The ELIXIR RDM Community thus supersedes the less formal competence networks established in previous projects.”

Minor comments:

Tense - the ELIXIR RDM Community appears to already exist ( https://elixir-europe.org/communities/research-data-management) but the future tense is often used in the manuscript (e.g. in Abstract (p2), the ‘proposed’ ELIXIR RDM Community ‘will’ bring together; Introduction (p4) the ‘proposed’ ELIXIR RDM Community etc), which is confusing to the reader. If the RDM Community has been established, I would request that the tense of the manuscript is changed to the present tense to reflect this situation.

Response

The tense of the text wherever applicable has been changed to reflect the existence of the RDM Community instead of referring to future establishment.

Comment

Figure 1 Legend - It’s not clear what the colours of the points represent. I assume different ELIXIR Nodes, but can this please be defined in the Figure legend?

Response

Yes, colours represent different ELIXIR Nodes. The following sentence has been added to the figure legend to clarify this.

“Colours represent different ELIXIR Nodes .”

Comment

Figure 2 Legend. A self contained legend would be beneficial, to explain that the results are from the 2023 survey of DM Network members.

Response

The following clarifying sentence has been added to the beginning of the figure legend:

“Survey of benefits of a Data Management expert network during the ELIXIR-CONVERGE project.”

Comment

P7 “RDMkit has been implemented” > “RDMkit was implemented’

Response

Changed as suggested.

Comment

P7 - “ELIXIR Belgium, Norway, Sweden, and UK have added RDMkit as a Node service”. This needs more explanation. What is a “Node service”? This should be defined.

The following link to the official definition of Node Service has been included in the manuscript

Node Service Selection Process.

We noticed that Sweden was incorrectly listed as one of the Nodes that included RDMkit as Node service, but it was an oversight. We have now corrected the text in the manuscript by deleting Sweden from the list.

Comment

P7 - “FAIR Cookbook is one of the key outcomes of the FAIRplus project” - link/ref missing for FAIRplus project.

Response

Hyperlinks are provided the first time a resource is mentioned in the document. To maintain readability and avoid redundancy, we do not include hyperlinks each time the resource is referenced again later in the text. FAIRplus and FAIR Cookbook are hyperlinked already in the Introduction.

Comment

P7 - It would be useful to note that the RDMkit, FAIRCookbook and DSW are all ELIXIR recommended interoperability resources https://elixir-europe.org/news/resource-announcement-2023

Response

It has been added to the “ELIXIR guidelines and best practices for the RDM” section.

Comment

P7 - “RDM professionals, training experts and trainers from ELIXIR and beyond” > “ In 2021, RDM professionals, training experts and trainers from ELIXIR and beyond”

Response

Changed as suggested

Comment

P9 - “Institutions and universities part of ELIXIR” > “Institutions and universities that are part of ELIXIR”

Response

Changed as suggested.

Comment

P10. “Therefore, building an ELIXIR RDM Community where its members, which are involved” > “Therefore, building an ELIXIR RDM Community where its members, who are involved”

Response

Changed as suggested.

Comment

P10. “particularly in small Nodes that do not receive national fundings for such initiatives” > “particularly in small Nodes that do not receive national funding for such initiatives”.

Response

Changed as suggested.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Jareborg N: ELIXIR-CONVERGE - Survey of benefits of an Data Management expert network (1.0.0).[Data set]. Zenodo. 2023. 10.5281/zenodo.7565899 [DOI]

    Data Availability Statement

    No data are associated with this article.


    Articles from F1000Research are provided here courtesy of F1000 Research Ltd

    RESOURCES