Abstract
Observational studies are notoriously susceptible to bias, and parallel-group randomized trials are important to identify the best overall treatment for eligible patients. Yet, such trials can be expected to be a misleading indicator of the best treatment for some subgroups or individual patients. In selected circumstances, patients can be treated in n-of-1 trials to address the inherent heterogeneity of treatment response in clinical populations. Such trials help to accomplish the ultimate goal of all biomedical research, to optimize the care of individual patients.
Keywords: Personalized medicine, precision medicine, randomized trials, n-of-1 trials
Introduction
The ultimate goal of clinical trials is to improve patient outcomes by better informing clinical practice and policy decisions. However, a growing number of critics note that many trials fail to accomplish this task [1,2]. The root of this challenge is multifactorial and often attributed to shortcomings in study design, result reporting, and limited generalizability. Here we present and discuss an underutilized approach, n-of-1 trials, that can provide tailored information to directly inform patient care [3].
Five conditions have been proposed that would increase the likelihood that a study will be informative in guiding clinical or policy decisions: (1) the study should address a clinically important and unresolved question; (2) a rigorous study design should be utilized that can provide an answer to the question; (3) enrollment goals and primary outcome completion should be feasible; (4) analysis should lend itself to valid interpretation; and (5) methods and results must be reported in a timely fashion [4]. Hutchinson and colleagues created surrogate measures for these conditions of informativeness and conducted a longitudinal cohort study, querying a clinical trials registry to quantify the proportion of randomized trials in specific disease areas that met these criteria [5]. Only 26% of 125 clinical trials met all conditions for informativeness. The authors suggested that improved scientific oversight might help address the methodologic weaknesses.
Even when the above criteria are met, obstacles remain in the direct application of trial results to patient care. Traditional parallel-group randomized trials generate estimates of average treatment effects among an often highly selected population. These results are then applied to individual patients who are judged to be sufficiently similar to the reference population in the study [6].
A problem with this practice is that universal administration of a therapy shown to be safe and effective on average may inadvertently harm some patients or be less beneficial than a generally less effective therapy. Subgroup analyses are often employed to identify which groups of patients may respond differently than other groups. However, subgroup analyses are often underpowered to identify a clinically meaningful difference between subgroups. Negative results may be misleading and require careful evaluation of their credibility [7,8].
Personalized medicine: Shortcomings and an alternative approach
Personalized medicine has developed in response to the observation of significant variability in disease expression and treatment response across diseased individuals. The terms “personalized” or “precision” medicine are commonly used to refer to the application of pharmacogenomics to tailor treatment decisions based on an individual’s genetic profile. Leveraging our understanding of how specific genetic variants might affect the body’s response to a drug can, in theory, allow treatment choices to be personalized for the individual patient. Significant innovation in pharmacogenetics has been made in several clinical fields, including cardiovascular disease and oncology [9,10].
Despite major advances over the last two decades in human genome sequencing, mapping of interindividual genetic variations, and identification in gene-drug interactions, this field has yet to produce the widespread benefits and broad transformation of clinical medicine that have been predicted [11]. Beyond genetic factors, variation in treatment response is also influenced by unique physiological and biochemical characteristics as well as environmental exposures. This complex interplay leads to heterogenous treatment effects that generally will not be predictable based on any single clinical characteristic, genetic variation, or biomarker.
Under certain circumstances, this heterogeneity can be addressed in treating the individual patients using the therapy shown to be best for them in an n-of-1 clinical trial. When there is clinical equipoise on which treatment (or whether any treatment) is optimal for the individual patient, an n-of-1 clinical trial may be employed under certain conditions to determine the preferred treatment for that individual.
N-of-1 trials, also called “personalized trials,” employ a repeated crossover design in a single patient to allow direct comparison of treatment effects in the participant. Treatment order is preferably randomized and the repetition of treatment periods improves the ability to distinguish more precise differences in the estimates of treatment effects within the participant. Many design elements, including blinding, protocolized and objective outcome assessment, and washout periods between treatments, may be borrowed from traditional parallel group trials to minimize bias. Bayesian analyses may also be used to identify the probability of benefit, which is not directly assessed from conventional frequentist analyses [12]. As interventions are alternated, data are systematically collected on the individual’s response to the interventions, allowing for an unbiased comparison of treatment effects that can directly inform treatment decisions for the individual. When a series of these trials are conducted in similar patients, the results can be aggregated across the participants to produce population-level estimates of treatment effects [13].
The interested reader is referred to various examples of n-of-1 trials, including ours in treating children and adolescents with hypertension and other circumstances beyond assessing treatment effectiveness, modification of personal health behaviors, assessing behavioral interventions, and evaluating treatment harms [14–20]. The evidence base on n-of-1 trial methodology has grown rapidly in the last decade and addresses advanced issues in the design [21,22], analysis [23], and reporting quality [24,25].
Indications for n-of-1 trials
N-of-1 trials are appropriate when there is clinical equipoise, and the comparative effectiveness of the treatment options is in doubt. This occurs commonly in clinical practice, particularly when well-performed comparative effectiveness studies are not feasible (as for rare diseases), or their results have limited generalizability. N-of-1 trials may be particularly important when the treatment risks and benefits are most uncertain, as for populations that are often excluded from large randomized trials, such as vulnerable populations (ex. children and the elderly) or those with complicated comorbidities or rare diseases. For these patients, trial data may not be available, and decisions would otherwise be based on biological plausibility, observational studies, or anecdotal experience.
N-of-1 trials are most worthwhile for chronic conditions, which will require long-term maintenance therapy. Examples of previous use cases include hypertension, attention-deficit hyperactivity disorder, chronic pain, osteoarthritis, and atrial fibrillation [26,27]. A self-limited disease or one with infrequent clinical manifestations would not be appropriate for this study design. Participation in n-of-1 trials has the potential to promote self-efficacy and health literacy, especially when they are used to evaluate health behaviors including physical activity, stress reduction, and dietary changes. Additionally, by directly involving patients in the decision-making process, adherence to the selected treatment may be improved. Examples of various uses of n-of-1 trials are shown in the Table 1.
Table 1.
Examples of n-of-1 trials for various use cases
| Clinical question | Design features | Treatment periods | Use of washout | Analytic approach | Digital tools | Outcome assessment | |
|---|---|---|---|---|---|---|---|
| Pharmacologic treatment effectiveness | Evaluate individual responses to stimulants among children with attention-deficit/hyperactivity disorder [35] | Double-blind, placebo-controlled design: stimulant vs placebo or alternative stimulant | One week per condition, 2 conditions tested (each 3x) over total 6 weeks | No washout periods | None described Responder status categorized based on symptom scores |
None | Parent, teacher, and self-rating of ADHD symptoms: assessed during and at the end of each treatment period |
| Assess harms | Verify perceived side effects of statin tablets [36] | Double-blind, placebo-controlled design: statin vs placebo vs no tablet | One month per condition, 3 conditions tested (each 4x) over total 12 months | No washout periods | Mixed (multilevel) linear models, multiple imputation for missing data | Smartphone application to collect daily symptom scores | Self-report of side effects: assessed daily |
| Rare disease | Assess effects of dietary cholesterol supplementation on behavior among children with Smith-Lemli-Opitz syndrome [37] | Double-blind, placebo-controlled design: liquid egg yolks vs egg substitute (placebo) | Two weeks per condition, 2 conditions tested (each 1x) over total 10 weeks including washout periods | 2-week washout with baseline cholesterol therapy between each treatment period | Paired t-test | None | Parent report of hyperactivity: assessed at the end of each 2-week treatment period |
| Nonpharmacological intervention | Assess the effect of bright white light therapy for depressive symptoms in cancer survivors [38] | Single-blind, sham-controlled design: bright white light vs dim red (sham) | Three weeks per condition, 2 conditions tested (each 2x) over total 12 weeks | No washout periods | Autoregressive model: treatment arm as main exposure, adjusted for time and accounting for autocorrelations of the order 1 | Smartphone application to collect daily symptom scores, aggregate data by treatment period, and deliver trial results to participants | Self-reported pain, physical performance score, body weight, step count: assessed daily |
| Lifestyle changes | Evaluate presumed patient-selected triggers for atrial fibrillation [15] | No blinding Six 1-week periods of trigger exposure or non-exposure. Trigger examples: alcohol, caffeine, lack of sleep |
One week per condition, 2 conditions tested (each 3x) over total 6 weeks | No washout periods | Bayesian generalized linear model | Smartphone application to deliver reminders on triggers, collect daily AF self-report, and deliver trial results to participants Mobile electrocardiogram device which paired with smartphone |
Self-report of AF episode assessed daily, instructed to use mobile ECG device to record when they suspected an abnormal rhythm |
The approach requires a patient who is eager to collaborate in an n-of-1 trial. As most patients will be unfamiliar with the concept, caregivers should be forthcoming about the notion of therapeutic uncertainty, and should review the risks and benefits of rotating between treatment options to repeatedly assess treatment response. Patient perspectives on participation may be influenced by cultural factors, and understanding these may help improve health equity in n-of-1 trials. Some barriers to participation identified in qualitative studies among racial and ethnic minority patients include mistrust in healthcare systems, language barriers, and a reluctance to undergo blinded treatment [28].
This trial design is most feasible when the treatment effects have a rapid onset and offset of action. As in traditional crossover trials, carryover effects must be considered, and are more likely to occur when the outcome is assessed before the effects of the previous treatment have sufficiently diminished. This can be addressed with the use of washout periods between treatments. The treatment periods should be long enough to capture a difference in the outcome, but short enough to be practical for the clinician and the patient. The number of times a given treatment is repeated will affect the precision (or power) in differentiating the relative effects of each treatment. Finally, this approach requires the use of measurable and clinically relevant treatment targets. The measurement chosen to assess treatment effects should not only be valid and reliable but also convenient enough that patients would be willing to undergo the assessment repeatedly.
Improving the usefulness of n-of-1 trials
N-of-1 trials are not immune to many of the same issues that plague some randomized trials and limit their informativeness to clinical practice. If the study designs are susceptible to bias or the results are poorly reported, their usefulness to all stakeholders, including patients, will be hindered. The Consolidated Standards of Reporting Trials extension for n-of-1 trials provides guidance on the preparation and appraisal of reports of n-of-1 trials in the medical literature [24]. Trial design should be reported explicitly, including the planned number of periods, the duration of each period, whether a run-in or washout period is used, and whether the design was individualized to each participant. The outcome measurement tools should be comprehensively described, including measurement properties such as validity and reliability. If the order of treatments is randomized, the method used to generate the allocation sequence should be described. The statistical analysis should include a description of how carryover effects, period effects, and intra-subject correlation are handled. Changes from the original plan for treatment periods, including the number and sequence of periods completed, should be explained. Comprehensive reporting on the design and execution of n-of-1 trials will improve their applicability to clinicians who are interested in employing these trials to inform the care of their patients.
Increasing uptake
Several barriers to widespread implementation in clinical practice should be addressed, including the doubt among clinicians and patients about whether the costs and burdens of n-of-1 trials outweigh the benefits. Very few randomized trials have been conducted to assess whether n-of-1 trials, when compared to usual care, produce improved clinical outcomes [27]. While most studies were equivocal for the primary outcome, several studies identified significant differences in secondary outcomes, including satisfaction with care and increased shared decision-making. However, most of the studies did not reach their predefined sample size targets, and insufficient sample sizes may have reduced their power to identify the benefit of n-of-1 trials.
Digitizing n-of-1 trials
Another major constraint to the widespread application of this approach in routine clinical practice is the lack of tools and infrastructure that would allow a clinician to conduct n-of-1 trials in a busy clinical setting. For this reason, most n-of-1 trial activity has been limited to funded clinical research conducted in academic settings. The momentum to incorporate this approach into everyday clinical practice is usually stalled by loss of research funding or diminished interest among clinicians without a research background. Open-access digital platforms and mobile applications to support the design, conduct, and analysis of n-of-1 trials would reduce logistical barriers and might increase use. Integration of such digital platforms into the electronic health record would facilitate the implementation of n-of-1 trials into clinical practice. Additionally, online platforms would promote sharing of resources and knowledge, allowing protocols to be utilized across multiple institutions and populations. N-of-1 trial apps allow patients to self-track symptoms and may also integrate with other mobile health technologies to allow for passive outcome data collection. Further development of these technologies is critical to advancing the integration of n-of-1 trials into clinical practice [29–31].
Ethical and regulatory oversight
Additional considerations in the implementation of n-of-1 trials include the issue of the ethical and regulatory oversight requirements. Currently, there are no national guidelines that address how institutional review boards should approach consent for n-of-1 trials, but some consensus is forming [32,33]. As we have previously outlined [34], if certain conditions are met, a less formal consent process could be sufficient. These criteria include 1) a patient’s well-being remains the primary motivation, 2) there is no incremental increase in risk relative to usual care, and 3) the patient understands the process and agrees to participate. Some n-of-1 trials may qualify as quality improvement whereas others may only require a streamlined consent approach (verbal discussion accompanied by an information sheet without the need for signature).
Conclusions
In the era of precision medicine, n-of-1 trials offer a pathway to provide patient-centered care based on evidence that is generated directly from the individual patient. Embedding these trials into clinical practice may improve patient outcomes and reduce wasted time and resources on therapies that don't work best for the patient. Given the current emphasis on customized, patient-centered care, the n-of-1 trial approach has the potential to be the epitome of personalized medicine.
Funding statement
This work was supported by the Clinical and Translational Sciences Award (1UL1TR003167-01) from the NIH National Center for Advancing Translational Sciences awarded to the University of Texas Health Science Center at Houston (JPS and JET); and from internal funds from the University of Texas Health Science Center at Houston.
Competing interests
The authors have no conflicts of interest to declare.
References
- 1. Ioannidis JP, Greenland S, Hlatky MA, et al. Increasing value and reducing waste in research design, conduct, and analysis. Lancet. 2014;383(9912):166–175. doi: 10.1016/S0140-6736(13)62227-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. London AJ, Kimmelman J. Against pandemic research exceptionalism. Science. 2020;368(6490):476–477. doi: 10.1126/science.abc1731. [DOI] [PubMed] [Google Scholar]
- 3. Guyatt G, Sackett D, Adachi J, et al. A clinician’s guide for conducting randomized trials in individual patients. CMAJ. 1988;139(6):497–503. [PMC free article] [PubMed] [Google Scholar]
- 4. Zarin DA, Goodman SN, Kimmelman J. Harms from uninformative clinical trials. JAMA. 2019;322(9):813–814. doi: 10.1001/jama.2019.9892. [DOI] [PubMed] [Google Scholar]
- 5. Hutchinson N, Moyer H, Zarin DA, Kimmelman J. The proportion of randomized controlled trials that inform clinical practice. Elife. 2022;11. doi: 10.7554/eLife.79491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Kent DM, Hayward RA. Limitations of applying summary results of clinical trials to individual patients: The need for risk stratification. JAMA. 2007;298(10):1209–1212. doi: 10.1001/jama.298.10.1209. [DOI] [PubMed] [Google Scholar]
- 7. Burke JF, Sussman JB, Kent DM, Hayward RA. Three simple rules to ensure reasonably credible subgroup analyses. BMJ Nov. 2015;351:h5651. doi: 10.1136/bmj.h5651. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Sun X, Ioannidis JP, Agoritsas T, Alba AC, Guyatt G. How to use a subgroup analysis: Users' guide to the medical literature. JAMA. 2014;311(4):405–411. doi: 10.1001/jama.2013.285063. [DOI] [PubMed] [Google Scholar]
- 9. Ross S, Krebs K, Paré G, Milani L. Pharmacogenomics in stroke and cardiovascular disease: State of the art. Stroke. 2023;54(1):270–278. doi: 10.1161/STROKEAHA.122.037717. [DOI] [PubMed] [Google Scholar]
- 10. Miteva-Marcheva NN, Ivanov HY, Dimitrov DK, Stoyanova VK. Application of pharmacogenetics in oncology. Biomark Res. 2020;8(1):32. doi: 10.1186/s40364-020-00213-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Joyner MJ, Paneth N. Promises, promises, and precision medicine. J Clin Invest. 2019;01(3):946–948. doi: 10.1172/JCI126119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Wijeysundera DN, Austin PC, Hux JE, Beattie WS, Laupacis A. Bayesian statistical inference enhances the interpretation of contemporary randomized controlled trials. J Clin Epidemiol. 2009;62(1):13–21.e5. doi: 10.1016/j.jclinepi.2008.07.006. [DOI] [PubMed] [Google Scholar]
- 13. Punja S, Xu D, Schmid CH, et al. N-of-1 trials can be aggregated to generate group mean treatment effects: A systematic review and meta-analysis. J Clin Epidemiol. 2016;76:65–75. doi: 10.1016/j.jclinepi.2016.03.026. [DOI] [PubMed] [Google Scholar]
- 14. Samuel JP, Bell CS, Samuels JA, et al. N-of-1 trials versus usual care in children with hypertension: A pilot randomized clinical trial. Am J Hypertens. 2022;36(2):126–132. doi: 10.1093/ajh/hpac117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Marcus GM, Modrow MF, Schmid CH, et al. Individualized studies of triggers of paroxysmal atrial fibrillation: The I-STOP-AFib randomized clinical trial. JAMA Cardiol. 2022;7(2):167–174. doi: 10.1001/jamacardio.2021.5010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Kravitz RL, Schmid CH, Marois M, et al. Effect of mobile device-supported single-patient multi-crossover trials on treatment of chronic musculoskeletal pain: A randomized clinical trial. JAMA Intern Med. 2018;178(10):1368–1377. doi: 10.1001/jamainternmed.2018.3981. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Perski O, Naughton F, Garnett C, et al. Do daily fluctuations in psychological and app-related variables predict engagement with an alcohol reduction app? A series of N-of-1 studies. JMIR Mhealth Uhealth Oct. 2019;7(10):e14098. doi: 10.2196/14098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Daza EJ, Wac K, Oppezzo M. Effects of sleep deprivation on blood glucose, food cravings, and affect in a non-diabetic: An N-of-1 randomized pilot study. Healthcare (Basel). 2019;8(1):6. doi: 10.3390/healthcare8010006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Stevens I, Gilbert F. N-of-1 trials for closed-loop deep brain stimulation devices. Ethics Hum Res. 2020;42(2):28–33. doi: 10.1002/eahr.500045. [DOI] [PubMed] [Google Scholar]
- 20. Shaffer JA, Kronish IM, Falzon L, Cheung YK, Davidson KW. N-of-1 randomized intervention trials in health psychology: A systematic review and methodology critique. Ann Behav Med. 2018;08(9):731–742. doi: 10.1093/abm/kax026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Kravitz R, Duan N, DEcIDE Methods Center N-of-1 Guidance Panel. Design and Implementation of N-of-1 Trials: A User’s Guide. Agency for Healthcare Research and Quality Publication No. 13(14)-EHC122-EF. 2014. [Google Scholar]
- 22. Moise N, Wood D, Cheung YKK, et al. Patient preferences for personalized (N-of-1) trials: A conjoint analysis. J Clin Epidemiol. 2018;102:12–22. doi: 10.1016/j.jclinepi.2018.05.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Schmid C, Yang J. Bayesian models for N-of-1 trials. Harvard Data Science Review (Special Issue 3). 2022. doi: 10.1162/99608f92.3f1772ce. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Shamseer L, Sampson M, Bukutu C, et al. CONSORT extension for reporting N-of-1 trials (CENT) 2015: Explanation and elaboration. J Clin Epidemiol. 2016;76:18–46. doi: 10.1016/j.jclinepi.2015.05.018 . [DOI] [PubMed] [Google Scholar]
- 25. Li J, Gao W, Punja S, et al. Reporting quality of N-of-1 trials published between 1985 and 2013: A systematic review. J Clin Epidemiol. 2016;76:57–64. doi: 10.1016/j.jclinepi.2015.11.016. [DOI] [PubMed] [Google Scholar]
- 26. Gabler NB, Duan N, Vohra S, Kravitz RL. N-of-1 trials in the medical literature: A systematic review. Med Care. 2011;49(8):761–768. doi: 10.1097/MLR.0b013e318215d90d. [DOI] [PubMed] [Google Scholar]
- 27. Samuel JP, Wootton SH, Holder T, Molony D. A scoping review of randomized trials assessing the impact of n-of-1 trials on clinical outcomes. PLoS One. 2022;17(6):e0269387. doi: 10.1371/journal.pone.0269387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Marrast L, Conigliaro J, Chan C, et al. Racial and ethnic minority patient participation in N-of-1 trials: Perspectives of healthcare providers and patients. Per Med. 2021;18(4):347–359. doi: 10.2217/pme-2020-0166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Davidson K, Cheung Y, McGinn T, Wang Y. Expanding the Role of N-of-1 Trials in the Precision Medicine Era: Action Priorities and Practical Considerations. Washington, DC: NAM Perspectives, National Academy of Medicine; 2018. [Google Scholar]
- 30. Konigorski S, Wernicke S, Slosarek T, et al. StudyU: A platform for designing and conducting innovative digital N-of-1 trials. J Med Internet Res. 2022;24(7):e35884. doi: 10.2196/35884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Chandereng T. An R shiny app for a chronic lower back pain study, personalized N-of-1 trial. Harvard Data Science Review (Special Issue 3). 2022. doi: 10.1162/99608f92.6c21dab7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Cen R, Hussain A, Pak KJ, et al. Do N-of-1 trials need IRB review? J Empir Res Hum Res Ethics. 2016;11(3):250–255. doi: 10.1177/1556264616662560. [DOI] [PubMed] [Google Scholar]
- 33. Stunnenberg BC, Deinum J, Nijenhuis T, et al. N-of-1 trials: Evidence-based clinical care or medical research that requires IRB approval? A practical flowchart based on an ethical framework. Healthcare (Basel). 2020;8(1):49. doi: 10.3390/healthcare8010049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Samuel JP, Wootton SH. Personalized Trial Ethics and Institutional Review Board Submissions. Harvard Data Science Review; 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Nikles CJ, Mitchell GK, Del Mar CB, Clavarino A, McNairn N. An n-of-1 trial service in clinical practice: Testing the effectiveness of stimulants for attention-deficit/hyperactivity disorder. Pediatrics. 2006;117(6):2040–2046. doi: 10.1542/peds.2005-1328. [DOI] [PubMed] [Google Scholar]
- 36. Howard JP, Wood FA, Finegold JA, et al. Side effect patterns in a crossover trial of statin, placebo, and no treatment. J Am Coll Cardiol. 2021;78:1210–1222. doi: 10.1016/j.jacc.2021.07.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Tierney E, Conley SK, Goodwin H, Porter FD. Analysis of short-term behavioral effects of dietary cholesterol supplementation in Smith-Lemli-Opitz syndrome. Am J Med Genet A. 2010;152A(1):91–95. doi: 10.1002/ajmg.a.33148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Kronish IM, Cheung YK, Julian J, et al. Clinical usefulness of bright white light therapy for depressive symptoms in cancer survivors: Results from a series of personalized (N-of-1) trials. Healthcare. 2019;8(1). doi: 10.3390/healthcare8010010. [DOI] [PMC free article] [PubMed] [Google Scholar]
