Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2024 Nov 13;14:27894. doi: 10.1038/s41598-024-79161-0

The PERMIT guidelines for designing and implementing all stages of personalised medicine research

Paula Garcia 1,, Rita Banzi 2, Vibeke Fosse 3, Chiara Gerardi 2, Enrico Glaab 4, Josep Maria Haro 5,8, Emanuela Oldoni 6, Raphaël Porcher 7, Judit Subirana-Mirete 5,8, Cecilia Superchi 7, Jacques Demotes 1
PMCID: PMC11560950  PMID: 39537728

Abstract

Personalised medicine (PM) research programmes represent the modern paradigm of complex cross-disciplinary research, integrating innovative methodologies and technologies. Methodological research is required to ensure that these programmes generate robust and reproducible evidence. The PERMIT project developed methodological recommendations for each stage of the PM research pipeline. A common methodology was applied to develop the recommendations in collaboration with relevant stakeholders. Each stage was addressed by a dedicated working group, specializing in the subject matter. A series of scoping reviews that mapped the methods used in PM research and a gap analysis were followed by working sessions and workshops where field experts analyzed the gaps and developed recommendations. Through collaborative writing and consensus building exercises, the final recommendations were defined. They provide guidance for the design, implementation and evaluation of PM research, from patient and omics data collection and sample size calculation to the selection of the most appropriate stratification approach, including machine learning modeling, the development and application of reliable preclinical models, and the selection and implementation of the most appropriate clinical trial design. The dissemination and implementation of these recommendations by all stakeholders can improve the quality of PM research, enhance the robustness of evidence, and improve patient care.

Subject terms: Biomarkers, Experimental models of disease, Clinical trial design, Translational research, Computational models, Machine learning

Introduction

Over the past decade, medical research has undergone major changes due to the convergence of high throughput technologies with the digital revolution1. This novel data-driven approach has resulted in a growing number of complex, cross-disciplinary projects requiring new methodologies, technologies and research infrastructure services. The personalised medicine (PM) research pipeline is a prime example of such complex research programmes, presenting major methodological and regulatory challenges. While investigators have developed and implemented innovative solutions, rigorous methodological research is required to ensure robust evidence is generated. The PERsonalised MedicIne Trials (PERMIT) project was therefore designed to develop a set of methodological recommendations to support investigators, evaluators, peer-reviewers and regulators involved in designing, funding, conducting, publishing and validating PM research.

Initially promoted as a single biomarker-driven treatment decision, PM now refers to a broader range of research paradigms2,3. In particular, novel -omics technologies and AI approaches allow for patient stratification using complex multi-omics signatures and subsequent machine learning (ML) stratification, as well as conventional statistical methods. This technological push makes PM a growing priority in the medical research communities and health industry, as well as among policy makers and funding agencies4.

Although PM research programmes can vary in their structure, PERMIT considered the four steps illustrated in Fig. 1 to address the methodological challenges raised by the PM research pipeline.

Fig. 1.

Fig. 1

The personalised medicine research pipeline: Stage 1 – Collection of patient and omics data: collection of patient data, including –omics data, through retrospective and/or prospective cohorts. Stage 2 – Machine learning (ML) stratification: Use of patient data to stratify the patient population into homogeneous clusters, through either conventional statistical or ML methods. Stage 3 - The translational development: Assessing the efficacy and safety of the treatment options in preclinical models is often required. Stage 4 - Clinical trials - therapeutic solutions are tested in the various patient clusters, and the effectiveness of the personalised vs. non-personalised approach can be assessed.

This article presents the key recommendations for the four steps, providing an ensemble view of the PM research pipeline and guidance from end-to-end.

An operational definition of PM research was applied throughout PERMIT: a set of comprehensive methods (methodology, statistics, validation, technology) to be applied in the different phases of the development of a personalised approach to treatment, diagnosis, prognosis or risk prediction. Ideally, robust and reproducible methods should cover all steps between generating the hypothesis (e.g., a given stratum of patients could better respond to a treatment), its validation and preclinical development, and up to the definition of its value in a clinical setting.

Methods

A common methodological framework was developed for the PERMIT project and followed to develop the recommendations, with some flexibility to account for the specificities of the four stages of the pipeline (Fig. 2). N-of-1 trials were not addressed in PERMIT. Each stage was addressed by a dedicated working group, led by a team specializing in the subject matter. Regular meetings of team leaders ensured alignment across the different sets of recommendations.

Fig. 2.

Fig. 2

Common methodological framework of the PERMIT Project.

The main steps of the methodology were:

Background information: scoping reviews and gap analysis

Each working group carried out a scoping review, following a common protocol5 to map the methodologies applied and identify the existing gaps, from a technical, methodological, and regulatory perspective as well as the existing standardization and harmonization challenges. The results of each scoping review were discussed in a joint gap analysis workshop.

Development phase: working sessions and workshops

Each group organized online working sessions and workshops with project partners and field experts from key stakeholder groups to discuss the gaps and challenges and start defining the recommendations to address them. Furthermore, an interdisciplinary meeting was organized to discuss the regulatory gaps and challenges spanning from designing and building stratification cohorts to the translational step.

Finalization phase: collaborative writing and consensus

The outcomes of the working sessions and workshops were then translated by each group into the four subsets of recommendations. Through collaborative writing in smaller groups and formal consensus exercises, the final version of the recommendations was defined.

Results

The series of scoping reviews and subsequent workshops identified significant weaknesses in the design, conduct and reporting of PM research and the detailed findings are reported elsewhere69. Recommendations were developed for each stage of the PM research pipeline and are the subject of detailed publications10,11. The key recommendations are described in the following four chapters:

Collection of patient cohort and -omics data

Prospective or retrospective collection and consolidation of patient data is the foundation of the PM research pipeline. Typically achieved through observational studies, the collection of omics, imaging, clinical, lifestyle, exposome and other data provides the basis for the stratification of patients sharing similar biomarker profiles into homogenous clusters. These recommendations highlight key aspects that must be considered to ensure sufficient and adequate data is collected.

  • 3.1.1

    Both retrospective and prospective creation of cohorts have advantages and disadvantages12 and these must be thoroughly assessed when selecting an approach for the stratification cohort.

  • 3.1.2

    When integrating data from multiple retrospective cohorts to increase sample size, an ex-ante harmonization approach should be favoured over an ex-post approach whenever possible. When integrating data from different cohorts prospectively, interoperable data standards should be adopted to ensure harmonization and seamless integration. In both cases, interoperable multimodal data management systems should be implemented.

  • 3.1.3

    The most relevant data for the stratification analysis should align with study objectives. Other factors including the cost of data generation, data quality and accuracy, must be taken into consideration.

  • 3.1.4

    Thorough sample size estimation should be included in every study design, and should demonstrate that the number of samples per study group provides sufficient statistical power to detect between-group differences at the expected level of sensitivity and specificity for a given minimum effect size. The sample size for individual discovery and validation studies must be estimated separately. In PM, no unique method is available to calculate adequate sample size(s) for stratification cohort(s). It should be estimated using dedicated statistical software tools, ideally using simulations with real-world data from a pilot study. If no pilot data is available, the most similar public dataset, or a simulation or parametric power estimation with a conservative variance estimate can be applied to avoid a rule-of-thumb approach. Alternatively prior assumptions on key data properties can be used to estimate the sample size.

  • 3.1.5

    For most stratification projects, only one discovery and one validation cohort suffice, as long as they are both representative of the patient population addressed and the sample sizes are deemed sufficient according to the sample size calculation. Otherwise, including further cohorts should be considered.

  • 3.1.6

    Data quality and monitoring requirements depend on the objective of the stratification. Whenever stratification drives treatment or healthcare decisions, monitoring should comply with the ISO 20916:2019 standard and omics/biomarker data generation should be conducted using CE marked in vitro diagnostic (IVD) devices.

  • 3.1.7

    To select the most appropriate stratification approach, pilot data should be used to assess whether classical statistical hypothesis tests provide sufficiently robust and discriminative individual biomarker candidates for stratification, or whether a multi-variable ML approach for biomarker signature can account for a lack of discriminative power for individual markers and a complex correlation structure between multiple relevant markers.

  • 3.1.8

    An external validation on a comparable series of patients must be designed when possible, in accordance with the study characteristics. The validation cohort should ideally be collected prospectively. Otherwise, biomarker robustness should be ensured by doing nested cross-validation. If validation with a different cohort is not possible, it is recommended to perform a three-group split of the internal data into a training set for model building and optimization, a validation set for performance assessment, and a test set for additional confirmation of the reproducibility of the performance estimates.

  • 3.1.9

    A complete analysis of the risk of bias should be conducted and published. For personalized medicine studies, this analysis should go beyond standard tools for randomized controlled trials (e.g. Cochrane RoB 2) to address potential biases specific to stratification cohorts and biomarker studies. Key areas to assess include: 1)Selection bias in cohort recruitment and data collection; 2) Information bias in biomarker measurement and outcome assessment; 3) Confounding factors that may influence stratification or treatment effects; 4) Overfitting or lack of external validation in ML models; 5) Missing data and loss to follow-up; 6) Reporting bias of biomarker analyses.

  • 3.1.10

    An infrastructure for management and integration of patient data compliant with local and international regulation is essential. Regulatory expertise in the research team is equally important. Although exploratory stratification research may aim to merely revisit disease taxonomy or have a better understanding of disease prognosis in patient subgroups, it is recommended that clinical regulation be heeded at an early stage and studies be designed with an outlook on the findings ultimately leading to the use of the stratification assay to drive patient care (see recommendation 3.1.6).

This first series of recommendations addresses gaps identified7 while covering a broader spectrum of issues to be considered while designing discovery and validation cohort studies supporting patient stratification and the identification of complex biomarker signatures.

Machine learning methods for patient stratification

Conventional biomarker discovery approaches for patient stratification have used statistical methods to identify single markers that discriminate between patient sub-groups with distinctive disease-relevant characteristics. However, for many complex and heterogeneous disorders, using more advanced ML analyses to identify multi-variable biomarker signatures can provide more robust and accurate stratification models.

The recommendations for ML stratification methods have been grouped chronologically by the main study phases (planning phase, discovery and modelling phase, validation phase) as follows:

Planning phase

  • 3.2.1

    To avoid inadequate sample sizes and underpowered study designs, (see 3.1.7), a small-scale pilot study should be conducted before the main discovery study, in order to collect prior data as input for a statistical sample size estimation (for relevant methods see13). Researchers can also use algorithmic methods to optimise the matching and selection of biospecimens to be included in an experimental profiling study and integrate complementary public biological data into the analysis14,15.

  • 3.2.2

    Strong imbalances in the number of available biospecimens for the relevant study groups can complicate the identification of robust discriminative patterns by ML. To address this common limitation, a detailed plan for further subject recruitment combined with a prior sample size estimation (see 3.2.1) is recommended. Retrospectively, class imbalances can be addressed in the modelling phase, using dedicated sub-sampling or sample weighting approaches16.

  • 3.2.3

    In longitudinal patient stratification studies, study participants may drop out due to reasons related or unrelated to the study (informative or uninformative censoring). Detailed planning for further subject recruitment is also instrumental here. Furthermore, dropouts require special consideration in the data analysis, by reviewing possible causes of censoring and associated biases, and applying dedicated imputation and censored data analysis methods17,18.

Discovery and modelling phase

  • 3.2.4

    To avoid inadequate data pre-processing, filtering and normalisation methods, comprehensive quality control analyses should be performed before and after data pre-processing19,20. In particular, samples that fail multiple outlier and quality control checks should be filtered out, and distribution assumptions should be reviewed to select pre-processing techniques specifically tailored to the observed data distributions. It is also advisable to compare multiple pre-processing approaches by means of quality control or benchmark analyses.

  • 3.2.5

    Without prior knowledge on which types of modelling approaches may best fit the available data, researchers may want to compare multiple representative types of modelling approaches (e.g. linear modelling approaches, nearest neighbour-based methods, neural networks, rule-based and decision-tree-based approaches) using a nested cross-validation with suitable parameter optimization approaches21. As an alternative to single ML algorithms, integrating multiple approaches via ensemble learning methods should be considered22.

  • 3.2.6

    Avoiding overfitting or underfitting of a model to its training data, i.e. finding a suitable balance between the specificity of the model to the training data and its expected ability to generalise to other unseen data sets, is essential. Several methods have been proposed to detect and prevent over- and underfitting. Researchers can apply dedicated regularization methods and optimize model complexity parameters using nested cross-validation21,23. Combining ML methods with prior feature selection and dimension reduction methods can help to avoid building overly complex overfiting models24.

Validation phase

  • 3.2.7

    ML models require appropriate validation using cross-validation and external testing analyses. To ensure the robustness of validation studies and the resulting model performance statistics, the sizes and composition of the training and test sets should be chosen adequately in the prior study design; and robust bootstrapping or cross-validation techniques combined with relevant performance metrics should be applied24.

  • 3.2.8

    While single-cohort analyses can provide initial performance estimates for ML stratification, external validation analyses on independent cohorts representing different ethnic and geographic backgrounds are required to ensure that the model is applicable and accurate across different populations. Meta-analyses of datasets from multiple cohorts can increase the robustness of feature selection and model building analyses, as can validations on distinct cohorts representing different populations.

  • 3.2.9

    Explainability and interpretability of ML models for stratification can be a key requirement for assessing their trustworthiness and biological plausibility. In these settings, so-called “white-box” ML algorithms25 should be used to ensure models are sufficiently explainable, both in terms of biological interpretability and the provision of comprehensive data-driven justifications for the model’s prediction. This is particularly important for regulators.

ML stratification appears as a new and promising instrument in PM. However, care should be taken to carefully plan, implement and validate ML models to ensure reliable evidence is generated.

Translational methods for personalised medicine

Undeniable progress has been made in developing innovative and complex preclinical model systems that hold promise for recapitulating patient clustering. Despite these efforts, to date, fundamental deficits in translational methods prevent the further development of PM. The scoping review identified and highlighted these significant gaps and challenges6.

To address them, the recommendations for the translational step of PM research focus on five areas: clinically relevant translational research; robust model development; transparency and education; revised regulation; and interaction with clinical research and patient engagement.

Clinically relevant translational research

Increased emphasis on using clinically relevant research models is imperative, and to achieve this there should be standards for model relevance. Selection of preclinical models must be evidence-based, and researchers should demonstrate awareness of the model’s limitations when interpreting results. Diverse biological mechanisms and patient heterogeneity usually cannot be represented by one single model system, therefore a combination of models should be considered for complex diseases.

Robust model development

Research models must be robust to be predictive, and there should be a common implementation framework for rigorous research, to provide reliable preclinical data for clinical trials. In addition, further efforts should be made to validate, qualify and adopt innovative preclinical methods. To facilitate systematic validation processes, key stakeholders must support and promote robust model development through specific funding and policies and encourage joint efforts between pharmaceutical industries and academic institutions. In particular for PM, further developing and validating the capacity of preclinical models such as cellular, organoid, patient-derived xenograft and in silico models to recapitulate patient strata will be paramount. Additionally, further resources should be made available for the maintenance of newly developed models.

Transparency, education and stakeholders’ engagement in preclinical research in PM

Transparency is essential, and reliable reporting and data sharing must be a requirement for both academia and industry. Open science is advocated for by many universities, organisations and policy makers26,27. Pre-registering preclinical study protocols in open access databases, is another step to implement transparency. Pre-registration can support researchers during the study design phase, it can reduce the use of animals for unnecessary duplications, as well as facilitate the generation of systematic reviews and meta-analyses of preclinical studies, all contributing to increasing the quality of research28. Education and training of researchers is needed to promote high quality and reproducible translational research, to facilitate a cultural change.

Revised regulation

Preclinical studies are subjected to regulation for good laboratory practice and laws for animal protection, however, these legislations reflect minimum requirements. Additional guidelines are needed, to ensure preclinical evidence is clinically relevant and encourages the adoption of patient-derived models. In addition, regulators and ethics committees reviewing and approving clinical trials should have harmonised guidelines and standards for evaluating the relevance and robustness of preclinical models. Regulatory frameworks should facilitate the incorporation of novel and innovative patient-based models in the existing drug-development pipeline.

Relationship with clinical research and patient engagement

Interdisciplinary interactions between clinical and preclinical stakeholders are vital to address the causes of translational failure and enhance efforts to develop robust research models. Patient engagement is a key aspect of PM, including preclinical research activities, and should be facilitated and incentivised by all stakeholders with the aim to co-create research relevant to its end-users. Targeted public funding is required to support the creation of collaborative and multidisciplinary translational ecosystems that connect resources and services, to translate scientific discoveries into benefits for patients.

Development and validation of robust and predictive preclinical models that capture clinical phenotypes and enable patient stratification for complex diseases, is fundamental for further development of personalised approaches. Implementing these recommendations is ambitious, and can only be achieved by united efforts from research institutions and universities, industry, regulatory agencies and funders.

Clinical trials for personalised medicine

As more complex designs are implemented in PM research, it is essential to ensure that trial designs truly correspond to the research questions, and lead to robust and reproducible evidence. Four types of recommendations (general, personalised vs. non-personalised strategies, design-specific, topic-specific) for PM clinical trials were developed.

General

These recommendations emphasise the need to adhere to high standards of evidence generation in clinical research. It is important to ensure the balance between early access to innovative treatments in case of unmet medical need, and larger randomised trials required by health technology assessment (HTA) agencies. The risks and costs of recommending poorly evaluated treatment strategies should not be underestimated. Conducting larger randomised controlled trials (RCT) on a wisely chosen target can reduce those risks, even if they are lengthier and costlier29.

  • 3.4.1

    Fundamental principles of clinical evidence generation equally apply to PM, which includes principles of good clinical practice, notably the quality by design/proportionality frameworks30, and the statistical principles of clinical trials31,32. Of particular importance is the need for a control group, with random allocation as much as possible and concurrent controls. Randomisation is fundamental to eliminate biases, and separate the prognostic value of a biomarker from its ability to predict the treatment effect, as single-arm enrichment trials with only biomarker-positive participants cannot overcome those issues33,34.

  • 3.4.2

    PM trials should be designed for demonstrating a differential treatment effect according to the phenotype or genotype of participants. They should aim at demonstrating that the effect of an intervention depends on how individuals are characterised.

  • 3.4.3

    Biomarkers and algorithms used in PM trials should have demonstrated external validity. Techniques employed to measure a biomarker in a trial, or algorithms used for stratification must be validated and CE-marked in the context where they would be routinely used.

  • 3.4.4

    Routine care settings should be captured in trials as much as possible. Since the population treated through routine care is often broader than the trials’ population, and trial protocols may also modify management in the control group, it is important to capture routine care settings in trials as much as possible, and increase generalisability35.

Personalised vs. non-personalised medicine strategies

Trials comparing a PM to a non-PM strategy are relevant for many stakeholders, in particular HTA bodies. No specific design was recommended for these trials, acknowledging that the study design should depend on the study objective(s) and knowledge about the biomarker(s). Also, we recognize that in certain situations such trials would be difficult, if not impossible to conduct, such as very rare subpopulations, or when the standard of care (SOC) is already personalised.

  • 3.4.5

    A PM strategy should be compared to the SOC (including a non-personalised one when relevant) using RCTs, as much as possible.

  • 3.4.6

    Trials comparing a personalised to a non-personalised strategy should account for the whole PM pipeline, including sample collection and analysis. This is particularly important for evaluating cost-effectiveness.

Design-specific

These recommendations are directed to the so-called master protocols (basket, umbrella and platform trials). These trial designs, commonly associated with PM8, raise several challenges both in terms of designs and evaluation by health authorities36,37. While many issues have been covered in previous reviews, these are not specific to PM.

  • 3.4.7

    For basket and umbrella trials, randomisation should be preferred for confirmatory trials. It is also important to use clinically relevant and patient-important outcomes. Of note, most basket and umbrella trials conducted until recently were non-controlled trials38. Given the benefits of a control arm and randomisation in eliminating biases, these should be favoured.

  • 3.4.8

    For platform trials, an intervention should be compared to concurrent controls; adding a new stratum when a new arm is added, which may raise issues to maintaining blinding in the case of multiple dummies36. Also, at least one research question for biomarker-negative patients should be considered.

Topic-specific

These recommendations match different trial designs to the typology of PM, acknowledging that the choice of a particular design should depend on prior knowledge on the biomarker, status of development of the intervention, the context of use and the intended purpose of personalisation. Therefore, four typologies of PM have been distinguished, and matched to adequate trial designs in Table 1:

Table 1.

Clinical trial designs that are best suited for each typology of personalised medicine and relevant references.

Type of personalised medicine Most suitable designs
Targeted medicine / trials Biomarker strategy designs, adaptive biomarker designs, tandem two stage design, basket trial, umbrella trial33,34,38,5052
Stratified medicine / trials Classical designs separating within- and between-patient variability, marker stratified designs, hybrid designs, adaptive signature designs, multi-arm multi-stage design, platform trial, umbrella trial50,52
Individualised treatment / medicine Classical designs separating within-and between-patient variability5355
Individualised dynamic regimes Sequential Multiple Assignment Randomized Trial (SMART) designs56
  • Targeted medicine (precision medicine): treatment is adapted to the understanding of the pathophysiology of the disease, or one of its aspects, generally by targeting a gene or protein;

  • Stratified medicine: patients are stratified into different clusters based on the collection of data characterising their phenotype or genotype, but with less emphasis on a precise molecular matching of the treatment;

  • Individualised medicine: treatment is tailored to each patient rather than a group of patients (e.g. chimeric antigen receptor [CAR] T-cell therapies);

  • Individualised medicine with a dynamic regime: treatment is adjusted over time depending on the patient’s response in addition to their individual characteristics (e.g. dose adaptation over time).

While designs used in PM trials vary according to the research question, stringent methodological standards should ensure robustness and reproducibility of trials and reduce bias. In addition, adoption of the PM strategy requires medico-economic assessment of the PM pipeline as compared to the non-personalised SOC.

There have been recent proposals for innovative trial designs, including the use of external or hybrid control arms (the latter referring to augmenting the control arm with external controls), the use of in silico trials and even the numerical generation of artificial controls using artificial intelligence algorithms trained on existing data3944. While in silico trials, modelling and simulation can be useful to optimise the design of clinical trials45, the role of such innovative approaches in replacing the designs we have considered in this article and their acceptability by regulators and HTA bodies are still unclear and a matter of controversy46,47. Those approaches are also not specific to PM, although they may also be valuable for PM. Of note, one study investigated how a hybrid control arm could have improved the estimation of the treatment effect in an umbrella early phase trial48. No specific recommendation on the use of those innovative approaches for PM were developed, but they may be considered, and their role in the development of PM would deserve further investigation.

No specific recommendations were developed for the use of real-world evidence (RWE) but the role it can play in all stages of the PM pipeline must be recognized. In clinical trials they can be used to find external controls, or to train algorithms to generate virtual controls, or to transpose or project the treatment effect as estimated in a RCT in the target population. RWE extracted from electronic health records, clinical databases, patient registries, and even from wearables or patient-reported outcomes can inform the very first stage of the PM research pipeline. RWE may be particularly valuable for biomarker discovery in large diverse populations, assessing effectiveness of personalized interventions in routine practice, enabling long-term follow-up, and informing trial designs. However, careful consideration must be given to data quality, potential biases, and appropriate analytical methods when using RWE for PM. Researchers should consult established frameworks, such as the NICE RWE Framework, when incorporating RWE into PM studies.

Conclusion

The PERMIT methodological recommendations cover the full PM research pipeline to help PM stakeholders when designing, funding, conducting, publishing and validating complex and multistep PM research programmes. As with any new scientific field, PM still requires methodological research for instance, to improve sample size evaluation or to define standards for HTA evaluation of PM strategies. Such methodological development should contribute to the dialogue with regulators to optimise the regulatory ecosystem. Similarly, regulatory science research must address these arising challenges to make appropriate decisions on the validation of AI methods, on authorisation of complex clinical trials, on market access and reimbursement, and more. Furthermore, the ethical implications and challenges regarding data protection of PM research participants should be addressed in depth49. As the field continues to evolve, the PERMIT recommendations should be broadly disseminated among the involved stakeholder communities – public and private investigators, industry, sponsors, regulators, HTAs, ethics committees, patient associations, healthcare providers, payers, funding bodies and journal editors - to strengthen the robustness of evidence, the impact of PM research and the quality of clinical care.

Acknowledgements

We would like to acknowledge all experts who contributed to the development of the recommendations. Recommendations on the collection of patient cohort and -omics data: Albert Sanchez-Niubo (University of Barcelona), Teresa Torres Moral (Institut d’Investigacions Biomèdiques August Pi i Sunyer - IDIBAPS), Fabio Pellegrini (BioGen Digital Health), Changyu Shen (BioGen Digital Health), Bernard Sebastien (Sanofi), Natalie Romanov (Merck KGaA). Recommendations on machine learning methods for patient stratification: Armin Rauschenberger (University of Luxembourg), Ramón Díaz-Uriarte (Universidad Autónoma de Madrid), Holger Fröhlich (Fraunhofer Institute for Algorithms and Scientific Computing), Isabel A. Nepomuceno Chamorro (University of Seville), Petr Nazarov (Luxembourg Institute of Health), Paolo Frasconi (University of Florence), Rosalba Giugno (University of Verona), Anne-Laure Boulesteix (University of Munich), Francisco Azuaje (Genomics England). Recommendations on translational methods for personalised medicine: Antonio L Andreu (EATRIS), Christina Barrias (University of Porto), Florence Bietrix (EATRIS), Monica Binaschi (Menarini Silicon Biosystem), Alfredo Budillon (Istituto Nazionale Tumori IRCCS), Evangelos P. Daskalopoulos (European Commission, Joint Research Centre), Maddalena Fratelli (Istituto di Ricerche Farmacologiche Mario Negri), Hani Gabra (BerGenBio), Björn Gerlach (PAASP GmbH), Liesbet Geris (KU Leuven), Anna Golebiewska (Luxembourg Institute of Health), Peter M.A. Groenen (Idorsia Pharmaceuticals Ltd), Ulrich Guertler (Boehringer-Ingelheim), Sampsa Hautaniemi (University of Helsinki) Sabine M. Hölter (Hemholtz Munich), Peter King (Immuneering) Hans Lehrach (Max Plank Institute), Peter Loskill (University of Tübingen), Frank Luyten (KU Leuven), Malcolm Macleod (University of Edinburgh), Emmet Mc Cormack (University of Bergen), Julia M. L. Menon (Preclinicaltrials.eu, Netherlands Heart Institue), Ali Mobasheri (University of Oulu, State Research Institue Centre for Innovative Medicine, Vilnius, Sun Yet-sen University, University Medical Centre Utrecht, University of Liège), Nikki Osborne (Responsible Researh in Practice), Christine Parker (GlaxoSmithKline), Merel Ritskes-Hoitinga (Utrecht University, Aarhus University), Bettina Ryll (Melanoma Patient Network Europe), Debbie Stanton (Certis Oncology), Elmar Schmitt (Merck Healthcare), Anton Ussi (EATRIS), Mira van der Naald (University Medical Centre Utrecht), Peter van Meer (Utrecht Institute of Pharmaceutical Sciences), Marco Viceconti (University of Bologna) and Emile Voust (National Cancer Institue, Netherlands). Recomendations on clinical trials for personalised medicine: Montserrat Carmona (Instituto de Salud Carlos III), Frank Hulstaert (Belgian Health Care Knowledge Centre), Lorena San Miguel (Belgian Health Care Knowledge Centre)Elena Biagioli (Istituto di Ricerche Farmacologiche Mario Negri), Jan Bogaerts (European Organisation for Research and Treatment of Cancer), Patrick M. Bossuyt (Amsterdam Academic Medical Center), Frank Bretz (Novartis), Louise Brown (MCR Clinical Trial, University College London), Luciano Castiello (ISS), Olivier Collignon (GSK), Laura de la Cruz (DLR), Roberto D’Amico (Università di Modena e Reggio Emilia, Cochrane Italia), Iñaki Imaz Iglesia (Instituto de Salud Carlos III), Ann Marie Janson Lang (CTCG), Olga Kholmanskikh Van Criekingen (CTCG), Franz König (Medical University Vienna), Stephan Lehr (AGES-EMA Scientific Advice Working Party), Balázs Nagy (Semmelweis University), Maureen Rutten-van Mölken (Erasmus University Rotterdam), Frank Petavy (EMA Biostatistics Working Group), Jorg Rahnenfuehrer (University of Dortmund), Luis Maria Sanchez Gomez (Instituto de Salud Carlos III), Stephen Senn (Independent Statistical Consultant), Maria Pia Sormani (Università di Genova), Andrew Thomson (EMA), Valter Torri (Istituto di Ricerche Farmacologiche Mario Negri). Scientific and ethics advisory boards: Virginie Pirard (University Leuven), Annalisa Barla (Università di Genova), Martin Posch (Medical University of Viena), Lars Hulstaert (Johnsson & Johnsson), Tamas Bereczky (independent patient advocate and consultant), David Perol (Léon Berard Research Center), Simone Niclou (Luxembourg Institute of Health). Figure 1 has been drawn by PG.

Author contributions

The manuscript was written through contributions of all authors. PG and JD coordinated the PERMIT project. JD obtained project funding. VF, EO, EG, RP, CS, JMH and JSM coordinated the dedicated working groups for the design and revision of the recommendations for each of the four stages of the pipeline. RB and CG coordinated the methodology. PG prepared the original draft and revisions. All authors have read and approved the final version of the manuscript.

Funding

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No. 874825.

Data availability

The data that support the findings of this study are available on Zenodo (https://zenodo.org/communities/permit-project/?page=1&size=20) and further data can be made available by the corresponding author upon request.

Declarations

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.ERIC Forum. ERIC Forum Policy Brief - Scaling-up Research Projects Through ERICS: Impact of Big Science on the Research Ecosystem. (2022).
  • 2.Chakraborty, B. & Murphy, S. A. Dynamic treatment regimes. Annu. Rev. Stat. Its Appl.1, 447–464 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Müller, A. M., Blandford, A. & Yardley, L. The conceptualization of a Just-In-Time adaptive intervention (JITAI) for the reduction of sedentary behavior in older adults. mHealth. 3, 37–37 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Vicente, A. M. et al. The ICPerMed Vision Paper for 2030. (2019).
  • 5.Banzi, R. et al. Methodological approaches for personalised medicine: protocol for a series of scoping reviews. doi: (2020). 10.5281/ZENODO.3770937
  • 6.Fosse, V. et al. Evaluating translational methods for Personalized Medicine—A Scoping Review. J. Pers. Med.12, 1177 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Torres Moral, T. et al. Methods for stratification and validation cohorts: a scoping review. J. Pers. Med.12, 688 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Superchi, C. et al. Study designs for clinical trials applied to personalised medicine: a scoping review. BMJ Open.12, e052926 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Glaab, E. et al. Biomarker discovery studies for patient stratification using machine learning analysis of omics data: a scoping review. BMJ Open.11, e053674 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Diaz-Uriarte, R. et al. Ten quick tips for biomarker discovery and validation analyses using machine learning. PLOS Comput. Biol.18, e1010357 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Fosse, V. et al. Recommendations for robust and reproducible preclinical research in personalised medicine. BMC Med.21, 14 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wang, X. & Kattan, M. W. Cohort Stud. Chest158, S72–S78 (2020). [DOI] [PubMed] [Google Scholar]
  • 13.Donner, A. Approaches to sample size estimation in the design of clinical trials—a review. Stat. Med.3, 199–214 (1984). [DOI] [PubMed] [Google Scholar]
  • 14.Glaab, E. Using prior knowledge from cellular pathways and molecular networks for diagnostic specimen classification. Brief. Bioinform. 17, 440–452 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Lottaz, C., Toedling, J. & Spang, R. Annotation-based distance measures for patient subgroup discovery in clinical microarray studies. Bioinformatics. 23, 2256–2264 (2007). [DOI] [PubMed] [Google Scholar]
  • 16.Elrahman, S. M. A. & Abraham, A. A. Review of Class Imbalance Problem. 9.
  • 17.Song, Q. & Shepperd, M. Missing Data Imputation techniques. Int. J. Bus. Intell. Data Min.2, 261 (2007). [Google Scholar]
  • 18.Wang, P., Li, Y. & Reddy, C. K. Machine Learning for Survival Analysis: a Survey. ACM Comput. Surv.51, 1–36 (2019). [Google Scholar]
  • 19.Brown, J., Pirrung, M. & McCue, L. A. FQC Dashboard: integrates FastQC results into a web-based, interactive, and extensible FASTQ quality control tool. Bioinformatics. 33, 3137–3139 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wang, S. & Yang, H. pseudoQC: a regression-based Simulation Software for correction and Normalization of Complex Metabolomics and Proteomics datasets. PROTEOMICS. 19, 1900264 (2019). [DOI] [PubMed] [Google Scholar]
  • 21.Wood, I. A., Visscher, P. M. & Mengersen, K. L. Classification based upon gene expression data: bias and precision of error rates. Bioinformatics. 23, 1363–1370 (2007). [DOI] [PubMed] [Google Scholar]
  • 22.Dietterich, T. G. Ensemble learning. in The Handbook of Brain Theory and Neural Networks. 110–125 (MIT Press, (2002).
  • 23.Bickel, P. J. et al. Regularization in statistics. Test. 15, 271–344 (2006). [Google Scholar]
  • 24.Rohart, F., Gautier, B., Singh, A. & Lê Cao, K. A. mixOmics: an R package for ‘omics feature selection and multiple data integration. PLOS Comput. Biol.13, e1005752 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Loyola-Gonzalez, O. Black-Box vs. White-Box: understanding their advantages and weaknesses from a practical point of View. IEEE Access.7, 154096–154113 (2019). [Google Scholar]
  • 26.Raff, J. W. The San Francisco Declaration on Research Assessment. Biol. Open.2, 533–534 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Wilkinson, M. D. et al. The FAIR Guiding principles for scientific data management and stewardship. Sci. Data. 3, 160018 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.van der Naald, M., Wenker, S., Doevendans, P. A., Wever, K. E. & Chamuleau, S. A. J. Publication rate in preclinical research: a plea for preregistration. BMJ Open. Sci.4, e100051 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ioannidis, J. P. & Khoury, M. J. Are randomized trials obsolete or more important than ever in the genomic era? Genome Med.5, 32 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.ICH E6 (R3) Guideline on good clinical practice (GCP). (2023).
  • 31.Topic, I. C. H. E 9 Statistical Principles for Clinical Trials. (1998). [DOI] [PubMed]
  • 32.ICH E11(R1) guideline on clinical investigation of medicinal products in the pediatric population. (2017).
  • 33.Tajik, P., Zwinderman, A. H., Mol, B. W. & Bossuyt, P. M. Trial designs for personalizing Cancer Care: a systematic review and classification. Clin. Cancer Res.19, 4578–4588 (2013). [DOI] [PubMed] [Google Scholar]
  • 34.Vivot, A. et al. Pharmacogenomic biomarkers as inclusion criteria in clinical trials of oncology-targeted drugs: a mapping of ClinicalTrials.gov. Genet. Med.18, 796–805 (2016). [DOI] [PubMed] [Google Scholar]
  • 35.Hulstaert Frank, Pouppez Céline, Primus-de Jong Célia, Harkin Kathleen, & Neyt Mattias. Evidence Gaps for Drugs and Medical Devices at Market Entry in Europe and Potential Solutions. (2021). [DOI] [PMC free article] [PubMed]
  • 36.Collignon, O. et al. Current statistical considerations and Regulatory perspectives on the planning of Confirmatory Basket, Umbrella, and platform trials. Clin. Pharmacol. Ther.107, 1059–1067 (2020). [DOI] [PubMed] [Google Scholar]
  • 37.Sudhop, T. et al. Master protocols in clinical trials: a universal Swiss Army knife? Lancet Oncol.20, e336–e342 (2019). [DOI] [PubMed] [Google Scholar]
  • 38.Park, J. J. H. et al. Systematic review of basket trials, umbrella trials, and platform trials: a landscape analysis of master protocols. Trials. 20, 572 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Davi, R., Mahendraratnam, N., Chatterjee, A., Dawson, C. J. & Sherman, R. Informing single-arm clinical trials with external controls. Nat. Rev. Drug Discov. 19, 821–822 (2020). [DOI] [PubMed] [Google Scholar]
  • 40.Creemers, J. H. A. et al. In silico cancer immunotherapy trials uncover the consequences of therapy-specific response patterns for clinical trial design and outcome. Nat. Commun.14, 2348 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Lambert, J. et al. Enriching single-arm clinical trials with external controls: possibilities and pitfalls. Blood Adv.7, 5680–5690 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Wedlund, L. & Kvedar, J. Simulated trials: in silico approach adds depth and nuance to the RCT gold-standard. Npj Digit. Med.4 (121), s41746–s41021 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Lee, C. S. & Lee, A. Y. How Artificial Intelligence can transform Randomized controlled trials. Transl Vis. Sci. Technol.9, 9 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Thangaraj, P. M., Shankar, S. V., Oikonomou, E. K. & Khera, R. RCT-Twin-GAN Generates Digital Twins of Randomized Control Trials Adapted to Real-world Patients to Enhance their Inference and Application. Preprint at (2023). 10.1101/2023.12.06.23299464
  • 45.Bajard, A. et al. An in silico approach helped to identify the best experimental design, population, and outcome for future randomized clinical trials. J. Clin. Epidemiol.69, 125–136 (2016). [DOI] [PubMed] [Google Scholar]
  • 46.Musuamba, F. T. et al. Scientific and regulatory evaluation of mechanistic in silico drug and disease models in drug development: building model credibility. CPT Pharmacomet. Syst. Pharmacol.10, 804–825 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Vanier, A. et al. Rapid access to innovative medicinal products while ensuring relevant health technology assessment. Position of the French National Authority for Health. BMJ Evid. -Based Med.29, 1–5 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Li, C. et al. Hybrid-control arm construction using historical trial data for an early-phase, randomized controlled trial in metastatic colorectal cancer. Commun. Med.2, 90 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Dupras, C. & Bunnik, E. M. Toward a Framework for assessing privacy risks in Multi-omic Research and databases. Am. J. Bioeth.21, 46–64 (2021). [DOI] [PubMed] [Google Scholar]
  • 50.Antoniou, M., Kolamunnage-Dona, R. & Jorgensen, A. Biomarker-guided non-adaptive trial designs in phase II and phase III: a methodological review. J. Pers. Med.7, 1 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Simon, R. Clinical trial designs for evaluating the Medical Utility of Prognostic and predictive biomarkers in Oncology. Pers. Med.7, 33–47 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Antoniou, M., Jorgensen, A. L. & Kolamunnage-Dona, R. Biomarker-guided adaptive trial designs in phase II and phase III: a methodological review. PLOS ONE. 11, e0149803 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Senn, S. Cross-over Trials in Clinical Research (Wiley, 2002). 10.1002/0470854596
  • 54.Senn, S. & Ezzet, F. Clinical cross-over trials in phase I. Stat. Methods Med. Res.8, 263–278 (1999). [DOI] [PubMed] [Google Scholar]
  • 55.Lillie, E. O. et al. The N-Of-1 clinical trial: the Ultimate Strategy for Individualizing Medicine? Pers. Med.8, 161–173 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Kidwell, K. M. & Wahed, A. S. Weighted log-rank statistic to compare shared-path adaptive treatment strategies. Biostatistics. 14, 299–312 (2013). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data that support the findings of this study are available on Zenodo (https://zenodo.org/communities/permit-project/?page=1&size=20) and further data can be made available by the corresponding author upon request.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES