Dear editor
We appreciate the interest and comments provided by Chaurasiya et al regarding our recent paper, “A Randomized, Controlled Trial Comparing Tearcare® and Cyclosporine Ophthalmic Emulsion for the Treatment of Dry Eye Disease (SAHARA)”. Most of the comments seem to be criticism of our statistical design and analysis; these are addressed in the order raised below.
The first point concerns the hierarchical testing of the two primary endpoints, tear break-up time (TBUT), and Ocular Surface Disease Index (OSDI). Hierarchical testing using the fixed sequence method (ie, testing the second null hypothesis or endpoint is dependent on rejection of the first null hypothesis) is an accepted and standard method to control for multiplicity, the inflation of a Type 1 error rate due to multiple hypotheses.1 Chaurasiya et al suggest that this methodology could “introduce bias and limit the interpretation of results”. While they do not suggest a specific alternative, they do cite a paper by Huque and Alosh describing a flexible fixed sequence method, the chief advantage being that it allows testing of an endpoint even when a previously tested endpoint was not statistically significant by taking into account correlation between the endpoints to minimize the impact of multiplicity adjustments.2 We appreciate this suggestion, however if we had included this method a priori in our statistical analysis plan, there would be no difference in our reported results or conclusions for our primary outcomes. The first primary outcome variable tested, the difference between groups in TBUT, was in fact, highly significant (p=0.0006), and it is worth noting that the difference between groups for the second primary outcome, OSDI, did not approach statistical significance (p=0.9843).
The second point raises concerns about the adequateness of the sample size and the fact that the SAHARA sample size was based on TBUT and not also considering OSDI. This is a reasonable point although ICH E9 states that sample size is usually determined by the primary objective of the trial.3 As TBUT was our first primary endpoint, and we did not have a priori knowledge of the relative degree of improvement in OSDI for patients treated with TearCare or Restasis, it seemed (and remains so) a reasonable approach. Moreover, while the improvement in OSDI was numerically better for TearCare patients at the Month 6 endpoint, the small difference (2.35) and the large standard deviations (>17) suggest that, even if this difference was real, it would have required an impossibly large sample size to achieve statistical significance and would remain of questionable clinical significance.
The third point notes that our analysis for the primary endpoints utilized a linear mixed effects model for TBUT and analysis of covariance for OSDI. Chaurasiya et al apparently have no issue with these methods except to caution that there are underlying assumptions regarding the data that must be met.
Chaurasiya et al also question our exclusion of patients on “various topical and systemic medications” and that this could “limit the generalizability of the study findings to a broader population”. The concomitant medication exclusions in the SAHARA eligibility criteria were carefully thought out. As stated in our paper, these exclusions were not made “because they were contraindications to TearCare therapy but because use of these drugs could interfere with or confound response to cyclosporine”. Two points are important here. First exclusion of these medications generally favored the cyclosporine arm of the study. Second, a randomized controlled clinical trial (RCT) is designed to answer a specific question in a specific, well defined population, so that variability is minimized and the treatment effect of a therapy can be estimated with some precision; RCTs have strong intrinsic validity. Real world, “all-comers” studies are, in contrast, designed to show how a therapeutic performs in the broader population. Both designs provide important, but different, information.
Chaurasiya et al emphasize the importance of reporting safety results. We agree which is why we provided the adverse event, visual acuity, and intraocular pressure results. Finally, Chaurasiya et al suggest that the results for our secondary endpoints be interpreted cautiously. Of course, the results of any published study should be reviewed with a critical eye and in light of the strengths and weaknesses of the study design and execution. That said, we believe SAHARA to be an unusually robust RCT in the therapeutic area of dry eye disease and that the findings are credible and compelling. We do not agree with the assertion that there is a significant “gap” or flaw in the SAHARA study design.
Disclosure
BA, MB, JL, TC, BS, JE, and SK were Investigators and received research support for the study. BA is a consultant for Allergan and Sight Sciences. MB is a consultant for and reports honoraria from Allergan and Sight Sciences. JL is a consultant for Allergan and Sight Sciences and reports personal fees from Sun Ophthalmics, Alcon, and Bausch and Lomb, outside the submitted work. TC reports grant support, consulting, and speaking for Sight Sciences. BS is a consultant and speaker for Sight Sciences and Allergan. SK is a consultant and speaker for Allergan and a consultant for Sight Sciences. VP reports consulting fees from Sight Sciences, grants from Alcon; stock options from Claris; grants, personal fees from Novartis; personal fees from Dompe, Oyster Point, Oculis, Novaliq, Sun Pharma, Quidel, BrightStar and Kala; stock options from Trefoil; personal fees, stock options from Kuria, outside the submitted work. JD and TR are employees of Sight Sciences. There are no other relevant conflicts of interest in this communication.
References
- 1.U. S. Food and Drug Administration. Center for Drug Evaluation and Research (CDER), Center for Biologics Evaluation and Research (CBER). In: Multiple Endpoints in Clinical Trials. Washington, DC: Guidance for Industry; 2022. [Google Scholar]
- 2.Huque MF, Alosh M. A flexible fixed-sequence testing method for hierarchically ordered correlated multiple endpoints in clinical trials. J Stat Plan Inf. 2008;138:321–335. doi: 10.1016/j.jspi.2007.06.009 [DOI] [Google Scholar]
- 3.U. S. Food and Drug Administration. Center for Drug Evaluation and Research (CDER), Center for Biologics Evaluation and Research (CBER). In: E9 Statistical Principles for Clinical Trials. Washington, DC: Guidance for Industry; 1998. [Google Scholar]