Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2022 Aug;27(8):2080–2085. doi: 10.1016/j.drudis.2022.05.010

Selection of data sets for FAIRification in drug discovery and development: Which, why, and how?

Ebtisam Alharbi a,b,1, Yojana Gadiya c,d,1, David Henderson e, Andrea Zaliani c,d, Alejandra Delfin-Rossaro f, Anne Cambon-Thomsen f, Manfred Kohler c,d, Gesa Witt c,d, Danielle Welter g, Nick Juty a, Caroline Jay a, Ola Engkvist h, Carole Goble a, Dorothy S Reilly i, Venkata Satagopam g, Vassilios Ioannidis j,, Wei Gu g,, Philip Gribbon c,d,
PMCID: PMC9236643  PMID: 35595012

Highlights

  • Research organisations are focussed on quantifying the costs and benefits of implementing FAIR.

  • Criteria used for the selection of data for FAIRification can be opaque and inconsistent.

  • FAIRification effort depends on individual skills, competencies, resources, and time available.

  • FAIRification should satisfy reuse scenarios, and lead to scientific and economic impacts.

  • Organisational challenges include providing training to individuals and developing a FAIR organisation culture.

Keywords: FAIR, FAIRification, Pharmaceutical R&D, Drug discovery, Cost–benefit

Abbreviations: FAIR, Findable, Accessible, Interoperable, and Reusable; GDPR, General Data Protection Regulation; R&D, Research and Development; DPIA, Data Protection Impact Assessment; RDM, Research Data Management; DMP, Data Management Plan; EFPIA, European Federation of Pharmaceutical Industries and Associations; DTA, Data Transfer Agreement

Abstract

Despite the intuitive value of adopting the Findable, Accessible, Interoperable, and Reusable (FAIR) principles in both academic and industrial sectors, challenges exist in resourcing, balancing long- versus short-term priorities, and achieving technical implementation. This situation is exacerbated by the unclear mechanisms by which costs and benefits can be assessed when decisions on FAIR are made. Scientific and research and development (R&D) leadership need reliable evidence of the potential benefits and information on effective implementation mechanisms and remediating strategies. In this article, we describe procedures for cost–benefit evaluation, and identify best-practice approaches to support the decision-making process involved in FAIR implementation.

Introduction

The FAIR data-guiding principles provide a framework for the management of scientific research data,1 and offer a systematic approach to tackling the challenges of reusing fast-growing, but frequently inaccessible and inconsistently annotated, research data resources. Since their formulation in 2016, these aspirational principles have attracted considerable interest from biomedical researchers in academia and industry, who aim to foster their implementation to help improve the overall efficiency of the research process.2 Furthermore, major funding agencies now expect commitments toward the creation of open and reusable data resources from organisations receiving financial support from public bodies.3., 4. Implementing these principles has the potential to maximise the value of scientific data by enabling advanced analyses, such as machine learning (ML) and artificial intelligence (AI) techniques.5., 6. Recent studies emphasise that the availability of virus, patient, and therapeutic discovery data in a FAIR format could have accelerated the response to the Coronavirus 2019 (COVID-19) pandemic by enabling large-scale analysis.7., 8. Hence, FAIRifiying data and metadata is a prerequisite for attempts to elevate the overall reproducibility of research findings.9.

To realise the potential of FAIR research data management and to drive FAIR data adoption, several prominent collaborations between academia and industry in the life science discipline have been established. An example of FAIR implementation in this setting is the Pistoia Alliance (https://www.pistoiaalliance.org), a precompetitive pharmaceutical industry collaboration that funds activities fostering FAIR adoption.10 A second initiative is the FAIRplus project (https://fairplus-project.eu), an ongoing EU project developing practical guidelines and tools to FAIRify clinical and translational biomedical data. FAIRplus develops FAIRification tools as part of collaborations with Innovative Medicine Initiative (IMI) projects seeking to FAIRify their data resources, with the aim that the tools can be reused by the wider community, therefore increasing their overall impact.11 Both initiatives make a significant contribution to transforming data management and stewardship and drive FAIR implementation in the biomedical field.

Recent studies have described FAIR implementation attempts in the pharmaceutical industry, which primarily focussed on improving the effectiveness of the drug R&D process12., 13. and accelerating innovation within organisations.14 Many pharmaceutical organisations are also understandably focussed on the associated costs and expected benefits of implementing these principles, in particular for retrospective processing of legacy data, where the immediate impact is arguably less clear than for ongoing projects.15 Despite the recognised value of these principles, putting them into practice (the so-called ‘FAIRification process’16) presents significant challenges.

FAIRification challenges

Several hurdles might hinder the effective implementation of FAIR data principles. The most-frequently cited barriers are the financial, technical, legal, and organisational aspects of implementation.13., 14., 15., 17., 18., 19., 20., 21. Financial challenges are related to the costs of the resources required to implement the FAIRification process, beginning with establishing and maintaining physical data infrastructures. They also include the significant costs of employing personnel and providing for the long-term sustainability of the data resources.15., 18. By contrast, technical challenges are associated with the infrastructure, tools, and methodologies that are required to perform FAIRification (with the help of persistent identifier services, metadata registries, ontology services, etc.).19 Legal challenges correspond to requirements that might pertain to the processing and sharing of the data (e.g., accessibility rights and compliance with data protection regulations), both for meeting the ‘accessibility’ and ‘reusability’ criteria as well as for performing the FAIRification process itself.20., 21. Organisational challenges include providing training to the individuals who would implement and maintain the FAIRification processes. Furthermore, these also involve the development and sustaining of an organisational culture that elevates and rewards the practice of FAIR research data management (https://rdmkit.elixir-europe.org/).13 All the above-mentioned challenges must be systematically addressed to implement FAIR effectively and apply equally to both retrospective and prospective processing of data sets. We define ‘data set’ as a collection of related records, of the same type, presented in a structured way. This typically includes individual files, sets of web pages, or spreadsheets. Table 1 lists the challenges that affect FAIRification along with their required expertise.

Table 1.

FAIRification challenges and their required expertise.

Challenges of the FAIRification process Required expertise
Financial investment Establishing and maintaining the physical data structure
Curation costs
Ensure business continuity
Developing a long-term plan of the data strategy
Business lead, strategy lead, associate director
Technical infrastructure Availability of technical tools (persistent identifier services, metadata registry, ontology services, etc.) IT professionals, data stewards, domain experts
Legal compliance Accessibility rights
Data protection regulations
Data protection officers, lawyers, legal consultants
Organisational culture Organisational business goals
Internal data management policies and plans
Education and training of personnel
Data experts, data champions, data owners, IT professionals

Although some criteria used for the selection of data undergoing FAIRification can be opaque and inconsistent, the legal and ethical aspects of decision-making must take into account protection of participants’ data, respecting rights and freedoms as well as ensuring the interests of data owners. For example, Boeckhout et al. stated that ethical and legal aspects significantly affect the implementation of FAIR practices for sensitive human data.17 Similarly, Holub et al. identified guidelines to help compliance with legal requirements for FAIR data in health and medical research.21 At the outset, if personal data are involved, a thorough assessment of the access and reuse conditions of the data should be made and the requirements for compliance with General Data Protection Regulation (GDPR) and/or other applicable data protection legislation should be satisfied to ensure that the FAIRness goals do not contradict data protection principles.22., 23. A general data protection procedure, based upon the requirements of the GDPR framework,24 should be formulated covering aspects such as data usage, storage, and the intended purpose of analysis when personal data are involved. If the data set to be FAIRified contains sensitive personal data, such as health or racial or ethnic information, for which data protection regulations specify a stricter framework for processing,25 then an assessment of the suitability for FAIRification should identify the security and confidentiality requirements that must be fulfilled as part of the FAIRification process.21.

A Data Protection Impact Assessment (DPIA) should also be conducted to evaluate the risks of data processing and define the measures to take to address those risks and demonstrate compliance with data protection regulation.26 In situations in which anonymisation of data is not possible, participants’ consent should be sought and security measures (e.g., authentication procedure, rules for access, tracking of access, data encryption, etc.) should be considered to protect the privacy of individuals. FAIR Research Data Management (FAIR RDM) procedures are aligned with the necessary compliances needed when working with sensitive data, for example, access in FAIR is meant to apply only for appropriate individuals. However, it can be burdensome in some cases and recent calls have been made to refine GDPR regulations to better account for emerging reuse scenarios for sensitive data sets.27.

The status of a data set, including the quality and completeness of metadata, is a crucial factor influencing decision-making on implementing FAIRification, because this defines the intrinsic suitability for reuse. However, in many cases, it is not possible to predict the full range of reuse scenarios for a specific data set while designing the FAIRification objectives.28 It might also be the case that the scale of data involved could still be insufficient or statistically underpowered to answer meaningful scientific questions.29 Therefore, there may be a lower threshold of the size or scope of a FAIR data set, depending on its reusability needs, such as rare diseases research. For example, the requirements to achieve statistical power in an analysis do not change just because the underlying data set has been FAIRified. Similarly, upper thresholds might exist because of the resources needed to perform FAIRification, especially if the FAIRification procedure is not scalable or readily automatable.19.

The tractability of any planned data FAIRification effort depends on the skills, competencies, resources, and time available to address the specific needs of the data resource or workflow. Therefore, the availability of in-house technical data experts or champions is a crucial factor. Data champions are individuals who can provide practical insight into the selection and prioritisation decisions. To minimise the risk of misinterpretation of the data, it is also desirable to have scientific experts with domain-specific knowledge within FAIRification teams.28 These individuals act as a human reference, able to answer questions to provide salient context-relevant information on the data sets and their underlying properties.30 Domain experts collaborating with IT professionals (to provide platforms, tools, and skills to work on data), bioinformaticians, or data curators can help assess the likely impact of a planned FAIRification process in terms of the scientific or organisational advancements enabled by data reuse. Furthermore, it is crucial to clearly define the underlying goal of the FAIRification effort, especially when it relates to ‘nontechnical’ factors, such as meeting contractual obligations to funders or complying with the data management policy of an organisation.31 Recently, extensive guidelines have been established on the implementation of FAIR-based Data Management Plans (DMP) by the European Commission as well as national research funding organisations.32

The benefits that pharmaceutical organisations stand to gain from data-resource FAIRification are linked to the basic business imperative to bring safe and effective products to patients in a timely and cost-efficient manner. For this group, when considering which resources to FAIRify, the greatest potential impact will come from data associated with the complex and expensive clinical trial processes.33., 34. To convince management to invest in relatively complex FAIR working, especially with sensitive patient data, more examples are needed that show clearly how using community-derived FAIR data has an impact on real-world problems, such as selecting the best treatment regimen or trial design for a specific patient cohort.13., 35. In both drug discovery and development, FAIR data management procedures will potentiate the adoption of improved analysis methods, including ML and AI-based tools and workflows.36., 37. Thus, users will be able to generate, test, and validate general prediction models and/or processes in their specific data domain.6 The higher aggregation levels achievable in this way will pave the way to more-precise models of human health and disease at the molecular, cellular, tissue, and organismal levels.

The value and potential of the reuse of a FAIRified data set differ according to the perspective of the individual scientist or organisation. Personal factors are important for individual scientists who naturally seek to elevate the findability of their own research results, thereby increasing their citation rates, bringing benefits in terms of improved scientific recognition and career progression.14 For project teams, FAIR data management might allow for a study to progress more efficiently because fewer experiments might be necessary when FAIR community-sourced data can be substituted for planned experiments, thereby reducing unnecessary repetition of studies.15., 21., 28. The expected cumulative impact of access to FAIRified data is to achieve a higher degree of process reproducibility because of better metadata descriptions, which can reveal the source of inconsistencies between studies.

It is crucial for the success of FAIR implementation to evaluate the above-mentioned challenges, along with the value generated from FAIRification based on the principles of the cost–benefit analysis.38 Such an analysis is a powerful way to support decision-making by evaluating the associated costs and expected benefits in a complex situation.39., 40. A recent study introduced the FAIRification costs and the FAIRification benefits in the pharmaceutical R&D setting.15 It identified a set of factors that affect the costs and values of the FAIRification process to make an effective decision on data prioritisation. The cost factors refer to the set of indicators or aspects that influence the costs associated with the FAIRification process. These factors are: (i) the legal and ethical considerations, which include the access rights and the ethical compliance; (ii) the technical resources, which pertain to the availability of the IT applications needed to perform the FAIRification and the availability of the documentation, such as DMP; and (iii) the human resources, which mean having skilled personnel to carry out the FAIRification. By contrast, value factors can be defined as the value proposition for performing the FAIRification. These factors are: (i) the societal value, which concerns addressing an area of priority need or cross-cutting impact; and (ii) the scientific value, which focusses on the uniqueness and novelty of the data set, its potential synergies, covering of a domain, and the domain expert availability. Table 2 summarises the identified cost and value factors that might influence decision-making on the FAIRification process.

Table 2.

Factors related to the cost and value of the FAIRification process.

FAIRification cost factors FAIRification value factors
Legal and ethical considerations Existing or required data access/data-processing agreements
Compliance with data protection regulations
Societal value Project focus on area of priority
Cross-cutting impact
Technical resources Availability of (software) tools
Quality and completeness of metadata
Availability of DMP
Scientific value Uniqueness and novelty
Scientific champion
Potential synergies
Coverage of a domain
Human resources Availability of internal expertise
Skills required

The absence of a systematic methodology for deciding whether a biomedical data set requires FAIRification led the FAIRplus consortium to develop a series of identification, evaluation, and prioritisation procedures. Selection is achieved by assessing several dimensions: (i) current status of the data and resources needed for FAIR implementation; (ii) ensuring that FAIRification goals meet intended use and reuse scenarios; and (iii) any scientific and socioeconomic impacts arising from FAIRification. The tools for assessment and selection of suitable data sets are complemented by resources supporting the practical implementation of the FAIRification process itself, which will be reported in a future study. We acknowledge that the approach taken has been mainly retrospective; this makes the FAIR process complicated or even impossible, for example because of data loss or poor data management, highlighting the need for prospective FAIR planning and efficient data management.

Methodology for data prioritisation for FAIRification

To prioritise biomedical research data for FAIRification, robust identification and evaluation procedures were developed based on practical experience gained from working across 15 IMI projects covering discovery to clinical stages (Figure 1). Given that investment in FAIRification must also support its own strategic goals, FAIRplus prioritised collaboration opportunities with IMI projects that would lead to a high scientific or societal impact. This approach also enables representation of the challenges faced by the wider community and, hence, these learnings could be codified in the FAIR cookbook and generally applied.

Fig. 1.

Fig. 1

Overview of the workflow for selection of relevant FAIRification projects.

Phase 1: Project and data set information collection

The information collection phase was carried out in two main steps. First, the identification of relevant projects was conducted using high-level public data describing their scientific activities. This initial identification was followed by a detailed analysis of individual project datasets in collaboration with the data owners.

The starting point for the identification of projects with data resources suitable for FAIRification was text mining from websites and catalogues provided by the IMI (https://www.imi.europa.eu/projects-results/project-factsheets). Basic parameters were extracted, including project start and end dates, participating academic and European Federation of Pharmaceutical Industries and Associations (EFPIA) partners, the contact information of individuals, and a summary of the scientific focus and objectives of each consortium. Pilot-stage evaluations showed that having access to the project-associated personnel with knowledge of the data and its scientific context was a key aspect in deciding FAIRification success. Therefore, projects were grouped into ‘ongoing’, ‘close-to-completion’, and ‘completed’. Active projects with engaged counterparts were later identified. FAIRplus sought out projects working on ‘antimicrobial resistance’, ‘neurodegenerative disease’, ‘healthy aging’, and ‘chronic disease’, to align with its own strategic objective to maximise societal impact. Ensuring strategic alignment with organisational objectives is a general requirement of data set prioritisation efforts. We made use of a Python code for this step of the pipeline. This code is available on GitHub at https://github.com/Fraunhofer-ITMP/IMI-Project-Prioritization/tree/v1.0.

Following initial prioritisation based on strategic alignment with FAIRplus objectives and ability to engage data owners, individual IMI project data resources were analysed to establish the technical, methodological, and ethical features of the data for the establishment of a complete picture of the project, starting from the requirements of the project to the product to be disseminated. Information collected included potential impacts of project and outputs, such as the innovation level (methodological), the comprehensiveness (size), the data availability and accessibility, as well as possible legal aspects associated with the data access, transfer, and/or manipulation. A Data Transfer Agreement (DTA) was developed for use in projects processing sensitive data to provide a legal framework for the processing and reuse of data as part of a FAIRification effort. Additional factors quantified included project maturity, whether data had been made public via other mechanisms, such as publications, or any Intellectual Property (IP) requirements that necessitated that data access would be restricted. The overall FAIR awareness of the consortia was established, which was important in terms of understanding the long-term uptake of the FAIRification measures by the team. One crucial factor was to ensure that collection and processing of all data was carried out in accordance with ethical guidelines and the terms of any informed consent requirements.

Phase 2: Data set evaluation and selection

Using the information from the data set analysis, projects were scored across three criteria: societal, scientific, and methodological/technical with a range of 1–4, where 1 was defined as low impact and 4 as high. Societal value was rated according to the research focus of the project and alignment with the FAIRplus strategic priorities. For scientific value, the uniqueness and novelty of the process, the potential synergies across the community, domain coverage, data quality, and the in-house availability of a project or data champions were assessed. Methodological maturity scoring was based on the access model and DMP alongside technical factors, including availability of machine-readable data, presence and status of metadata, the extent of ontology application, data models, workflow documentation, and so on. In situations in which sensitive data were not accessible, their substitution with synthetic data was evaluated. Following the scoring and ranking process by the FAIRplus consortium, 15 of more than 120 IMI projects were prioritised for further engagement and collaborations were initiated around data resource FAIRification. The findings from the ongoing FAIRification work on these projects, including the use of tools for capability assessment and recipes for elevating the FAIRness of specific data resources, are outside the scope of this paper. The template of the scorecard used by FAIRplus during this process can be found on Zenodo.41 The impact of decision-making based on the resources is described in detail in the FAIRplus cookbook recipe at https://w3id.org/faircookbook/FCB055.

Concluding remarks

Previously, studies have outlined the necessity for the adoption of FAIR principles in the pharmaceutical industry, emphasising the high-level financial, technical, and associated cultural challenges.13., 14., 15. Nevertheless, on a practical level, when biomedical-focussed organisations implement FAIR, data owners and managers must determine which resources should be prioritised for FAIRification, as well as the costs versus benefits balance for individuals, institutions, or research communities. Here, we have identified factors that influence the selection and prioritisation of data for FAIR implementation and describe how these can be assessed and used to prioritise projects and data resources, using the experiences of the IMI FAIRplus projects as an example. Given the limited types of exemplar data set that have undergone the FAIRplus FAIRification process, it is envisaged that other types of data set would likely pose a different set of challenges.

Legal and ethical issues have been identified as important factors and, therefore, should be assessed before the FAIRification process because they might require compliance with certain requirements that, if not met or inadequately met, might result in an obstacle for FAIRification. Organisational aspects, including access to domain experts with the technical understanding of the data sets, are also important. Although the methods described mainly come from the translational medicine and drug discovery areas, the challenges, factors, and methodology can be generalised and adapted to other research and development areas; thus, we recommend that researchers and organisations consider the factors suggested here before implementing FAIRification of their data sets.

Acknowledgements

The authors received funding from the Innovative Medicines Initiative Joint Undertaking (FAIRplus project, grant agreement number 802750) and the European Union’s Horizon 2020 Programme (EOSC-LIFE project, grant agreement number 824087).

Contributor Information

Vassilios Ioannidis, Email: Vassilios.Ioannidis@sib.swiss.

Wei Gu, Email: Wei.Gu@uni.lu.

Philip Gribbon, Email: Philip.Gribbon@itmp.fraunhofer.de.

References

  • 1.Wilkinson M., Dumontier M., Aalbersberg I., Appleton G., Axton M., Baak A., et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016;3(1):1–9. doi: 10.1038/sdata.2016.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Jacobsen A., Azevedo R., Juty N., Batista D., Coles S., Cornet R., et al. FAIR principles: interpretations and implementation considerations. Data Intell. 2020;2(1–2):10–29. [Google Scholar]
  • 3.The National Institutes of Health. Public Access Policy Details. https://publicaccess.nih.gov/policy.htm [Accessed May 10, 2022].
  • 4.European Commission. Data Management in the Context of Horizon 2020. http://ec.europa.eu/research/participants/docs/h2020-fundingguide/cross-cutting-issues/open-access-data-management/data-management_en.htm [Accessed May 10, 2022].
  • 5.European Commission. Turning FAIR into Reality. https://ec.europa.eu/info/sites/default/files/turning_fair_into_reality_1.pdf [Accessed May 10, 2022].
  • 6.Makarov V., Stouch T., Allgood B., Willis C., Lynch N. Best practices for artificial intelligence in life sciences research. Drug Discovery Today. 2021;26(5):1107–1110. doi: 10.1016/j.drudis.2021.01.017. [DOI] [PubMed] [Google Scholar]
  • 7.Mons B. The VODAN IN: support of a FAIR-based infrastructure for COVID-19. European Journal of Human Genetics. 2020;28(6):724–727. doi: 10.1038/s41431-020-0635-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Schriml M., Chuvochina M., Davies N., Eloe-Fadrosh E., Finn R., Hugenholtz P., et al. COVID-19 pandemic reveals the peril of ignoring metadata standards. Scientific Data. 2020;7(1):1–4. doi: 10.1038/s41597-020-0524-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Leipzig J., Nüst D., Hoyt C., Ram K., Greenberg J. The role of metadata in reproducible computational research. Patterns. 2021;2(9) doi: 10.1016/j.patter.2021.100322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.The Pistoia Alliance. FAIR Implementation. www.pistoiaalliance.org/projects/current-projects/fair-implementation/ [Accessed May 10, 2022].
  • 11.The FAIRplus Project. https://fairplus-project.eu [Accessed May 10, 2022].
  • 12.Vaudano E. The innovative medicines initiative: a public private partnership model to foster drug discovery. Computational and Structural Biotechnology Journal. 2013;6(7):E201303017. doi: 10.5936/csbj.201303017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wise J., Barron A., Splendiani A., Balali-Mood B., Vasant D., Little E., et al. Implementation and relevance of FAIR data principles in biopharmaceutical R&D. Drug Discovery Today. 2019;24(4):933–938. doi: 10.1016/j.drudis.2019.01.008. [DOI] [PubMed] [Google Scholar]
  • 14.Herman V., Mons A., Waalkens A., Franke W., Baak A., Ruiter G., et al. The need of industry to go FAIR. Data Intelligence. 2020;2(1–2):276–284. [Google Scholar]
  • 15.Alharbi E., Rigina S., Juty N., Jay C., Goble C. Exploring the current practices, costs and benefits of FAIR implementation in pharmaceutical research and development: a qualitative interview study. Data Intelligence. 2021;3(4):507–527. [Google Scholar]
  • 16.Jacobsen A., Kaliyaperumal R., Luiz S., Mons B., Schultes E., Roos M., et al. A generic workflow for the data FAIRification process. Data Intelligence. 2020;2(1–2):56–65. [Google Scholar]
  • 17.Boeckhout M., Zielhuis G., Bredenoord A. The FAIR guiding principles for data stewardship: fair enough? European Journal of Human Genetics. 2018;26(7):931–936. doi: 10.1038/s41431-018-0160-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Slater T. Overcoming the challenges to making data FAIR in pharma. https://pharmafield.co.uk/opinion/overcoming-the-challenges-to-making-data-fair-in-pharma/ [Accessed May 10, 2022].
  • 19.D4 Pharma. Driving FAIR in Biopharma. Report. https://d4-pharma.com/driving-fair-in-biopharma/ [Accessed May 10, 2022].
  • 20.Corpas M., Nadezda V., Amanda M., Fiona N. A FAIR guide for data providers to maximise sharing of human genomic data. PLoS Computational Biology. 2018;14(3) doi: 10.1371/journal.pcbi.1005873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Holub P., Kohlmayer F., Prasser F., Mayrhofer M., Schlünder I., Martin G., et al. Enhancing reuse of data and biological material in medical research: from FAIR to FAIR-health. Biopreservation Biobanking. 2018;16(2):97–105. doi: 10.1089/bio.2017.0110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Mons B., Cameron N., Jan V., Dumontier M., Santos D., Bonino L., et al. Cloudy, increasingly FAIR; revisiting the FAIR data guiding principles for the European Open Science Cloud. Information Services & Use. 2017;37(1):49–56. [Google Scholar]
  • 23.Staunton C., Slokenberga S., Mascalzoni D. The GDPR and the research exemption: considerations on the necessary safeguards for research biobanks. European Journal of Human Genetics. 2019;27(8):1159–1167. doi: 10.1038/s41431-019-0386-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.The European Parliament. Regulation (EU) 2016/679 of the European Parliament and of the Council. https://eur-lex.europa.eu/eli/reg/2016/679/oj [Accessed May 10, 2022].
  • 25.The European Parliament. Art. 9 GDPR (Processing of Special Categories of Personal Data). https://gdpr-info.eu/art-9-gdpr/ [Accessed May 10, 2022].
  • 26.European Commission. Guidelines on Data Protection Impact Assessment (DPIA). https://ec.europa.eu/newsroom/article29/items/611236 [Accessed May 10, 2022].
  • 27.Axel Voss. Fixing the GDPR. www.axel-voss-europa.de/wp-content/uploads/2021/05/GDPR-2.0-ENG.pdf [Accessed May 10, 2022].
  • 28.Arefolov A., Adam L., Brown S., Budovskaya Y., Chen C., Das D., et al. Implementation of The FAIR data principles for exploratory biomarker data from clinical trials. Data Intelligence. 2021;3(4):631–662. [Google Scholar]
  • 29.Gu W., Hasan S., Rocca-Serra P., Satagopam P. Road to effective data curation for translational research. Drug Discovery Today. 2020;26(3):626–630. doi: 10.1016/j.drudis.2020.12.007. [DOI] [PubMed] [Google Scholar]
  • 30.Mons B. Data Stewardship for Open Science: Implementing FAIR Principles. New York; Chapman & Hall/CRC. 2018 [Google Scholar]
  • 31.Bloemers M., Montesanti A. The FAIR funding model: providing a framework for research funders to drive the transition toward FAIR data management and stewardship practices. Data Intelligence. 2020;2(1–2):171–180. [Google Scholar]
  • 32.Blackburn M., Jeffrey A., David L., Diego K. Big data and the future of R&D management: the rise of big data and big data analytics will have significant implications for R&D and innovation management in the next decade. Research-Technology Management. 2017;60(5):43–51. [Google Scholar]
  • 33.Fernández D., Lasierra N., Clement D., Mason H., Robinson I. Enabling FAIR clinical data standards with linked data. Lecture Notes in Computer Science. 2020;12124:303–306. [Google Scholar]
  • 34.Briggs K., Bosc N., Camara T., Diaz C., Drew P., Drewe W., et al. Guidelines for FAIR sharing of preclinical safety and off-target pharmacology data. ALTEX. 2021;38:187–197. doi: 10.14573/altex.2011181. [DOI] [PubMed] [Google Scholar]
  • 35.Wise J., Möller A., Christie D., Kalra D., Brodsky E., Georgieva E., et al. The positive impacts of real-world data on the challenges facing the evolution of biopharma. Drug Discovery Today. 2018;23(4):788–801. doi: 10.1016/j.drudis.2018.01.034. [DOI] [PubMed] [Google Scholar]
  • 36.Fleming N. How artificial intelligence is changing drug discovery. Nature. 2018;557(7706) doi: 10.1038/d41586-018-05267-x. S55–S55. [DOI] [PubMed] [Google Scholar]
  • 37.Vamathevan J., Clark D., Czodrowski P., Dunham I., Ferran E., Lee G., et al. Applications of machine learning in drug discovery and development. Nature Reviews Drug Discovery. 2019;18(6):463–477. doi: 10.1038/s41573-019-0024-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Boardman A.E., Greenburg D.H., Vining A.R., Weimer D.L. Cambridge University Press; Cambridge: 2017. Cost-Benefit Analysis: Concepts and Practice (14th edn) [Google Scholar]
  • 39.Mishan E.J., Quah E. Routledge; Abingdon-on-Thames: 2020. Cost-Benefit Analysis (15th edn) [Google Scholar]
  • 40.Tevfik N. Lexington Books; Lanham: 2016. Cost-Benefit Analysis: Theory and Application (3rd edn) [Google Scholar]
  • 41.Zaliani A, Witt G, Gribbon P, Gadiya Y, Ioannidis V, Kohler M. FAIRplus template for project prioritization scorecard (1.0). https://zenodo.org/record/5782838#.Ynp1b5LMJPc [Accessed May 10, 2022].

RESOURCES