Skip to main content
PLOS One logoLink to PLOS One
. 2024 Feb 13;19(2):e0297235. doi: 10.1371/journal.pone.0297235

Potential efficacy of caffeine ingestion on balance and mobility in patients with multiple sclerosis: Preliminary evidence from a single-arm pilot clinical trial

Afsoon Dadvar 1,#, Melika Jameie 2,3,*,#, Mehdi Azizmohammad Looha 4,#, Mohammadamin Parsaei 5, Meysam Zeynali Bujani 1, Mobina Amanollahi 5, Mahsa Babaei 6, Alireza Khosravi 7, Hamed Amirifard 2,*
Editor: Antony Bayer8
PMCID: PMC10863863  PMID: 38349929

Abstract

Objectives

Caffeine’s potential benefits on multiple sclerosis (MS), as well as on the ambulatory performance of non-MS populations, prompted us to evaluate its potential effects on balance, mobility, and health-related quality of life (HR-QoL) of persons with MS (PwMS).

Methods

This single-arm pilot clinical trial consisted of a 2-week placebo run-in and a 12-week caffeine treatment (200 mg/day) stage. The changes in outcome measures during the study period (weeks 0, 2, 4, 8, and 12) were evaluated using the Generalized Estimation Equation (GEE). The outcome measures were the 12-item Multiple Sclerosis Walking Scale (MSWS-12) for self-reported ambulatory disability, Berg Balance Scale (BBS) for static and dynamic balance, Timed Up and Go (TUG) for dynamic balance and functional mobility, Multiple Sclerosis Impact Scale (MSIS-29) for patient’s perspective on MS-related QoL (MS-QoL), and Patients’ Global Impression of Change (PGIC) for subjective assessment of treatment efficacy. GEE was also used to evaluate age and sex effect on the outcome measures over time. (Iranian Registry of Clinical Trials, IRCT2017012332142N1).

Results

Thirty PwMS were included (age: 38.89 ± 9.85, female: 76.7%). Daily caffeine consumption significantly improved the objective measures of balance and functional mobility (BBS; P-value<0.001, and TUG; P-value = 0.002) at each study time point, and the subjective measure of MS-related QoL (MSIS-29; P-value = 0.005) two weeks after the intervention. Subjective measures of ambulatory disability (MSWS-12) and treatment efficacy (PGIC) did not significantly change. The effect of age and sex on the outcome measures were also assessed; significant sex-time interaction effects were found for MSWS-12 (P-value = 0.001) and PGIC (P-value<0.001). The impact of age on BBS scores increased as time progressed (P-value = 0.006).

Conclusions

Caffeine may enhance balance, functional mobility, and QoL in PwMS. Being male was associated with a sharper increase in self-reported ambulatory disability over time. The effects of aging on balance get more pronounced over time.

Trial registration

This study was registered with the Iranian Registry of Clinical Trials (Registration number: IRCT2017012332142N1), a Primary Registry in the WHO Registry Network.

1. Introduction

Around the world, 2.8 million people are living with multiple sclerosis (MS) [1]. The majority of persons with MS (PwMS) experience balance and gait dysfunction (ambulatory disabilities) even early in the disease course [2]. A variety of MS-related symptoms (e.g., weakness, spasticity, fatigue, and coordination alterations) can affect balance, postural control, gait, and risk of falling in PwMS [3]. Ambulatory dysfunction can remarkably influence patients’ quality of life by increasing personal dependency, limiting daily activities (going to the bathroom on time, crossing the street safely, shopping properly, and working efficiently), instilling fear of falling, and falling-related injuries [2].

Various approaches have been utilized to improve different aspects of functioning, including cognitive and motor function, in patients with neurological diseases [4,5]. Interventions, including physical rehabilitation [6], exercise, non-invasive brain stimulation [7,8], and medications such as dalfampridine (4-aminopyridine) [911], nabiximols [12], polyunsaturated fatty acids, omega-3, omega-6 [13], and lipoic acid [14], have been evaluated for balance and gait improvement in PwMS [2,615]. Notably, dalfampridine is the only U.S. Food and Drug Administration (FDA)-approved medication for improving the balance and walking abilities of PwMS (Ampyra (dalfampridine) Information | FDA) [911,15]. However, it should be prescribed with caution, as it may cause serious side effects [16], including severe allergic reactions, seizures, and triggering/exacerbating trigeminal neuralgia (medication guide available at label (fda.gov)). Additionally, it seems that dalfampridine may help only a subset of PwMS, with one-quarter and one-third of patients experiencing faster walking speed and enhanced walking ability, respectively [16].

Caffeine, a natural compound, is the most widely consumed psychoactive agent in the world [17]. Studies have shown caffeine’s potential benefits on various neurological disorders, including seizure, Alzheimer’s disease, Parkinson’s disease, stroke, and MS [1825], possibly by reducing neuroinflammation and oxidative stress and increasing neurogenesis [26,27]. Specifically, research has highlighted the positive effects of caffeine on the ambulatory performance of non-MS populations [2830], as well as on selected aspects of MS, including attention and disease progression [1824]. Aligned with caffeine’s impact on the central nervous system, skeletal muscles [31], the ambulatory performance of non-MS populations [2830], and specific aspects of MS [1820,23,24], potentially favorable effects on balance and mobility in PwMS could also be anticipated. Consequently, we hypothesized that caffeine might have the potential to improve balance and mobility impairments as debilitating aspects of MS.

Despite supportive evidence [1820,23,24,2830], to our knowledge, the potential effect of caffeine on balance and mobility in PwMS has not yet been studied. In this pilot single-arm phase II clinical trial we evaluated the potential effectiveness of caffeine ingestion on balance and mobility (i.e., static and dynamic balance, functional mobility, and patient’s reported ambulatory disability) among PwMS. Additionally, we investigated caffeine’s potential effect on patients’ health-related quality of life (HR-QoL).

2. Materials and methods

2.1. Trial design and any changes after trial commencement

This was a pilot single-arm phase II clinical trial. Notably, the initial study protocol involved a double-armed design, with one group receiving caffeine and another group receiving a placebo. However, due to financial constraints encountered during the study, it became necessary to modify the protocol and proceed as a single-armed study. To mitigate potential bias, a two-week placebo run-in stage was incorporated before the investigation. Furthermore, although the initial protocol was designed for a caffeine dosage of 2.5 mg/kg/day, we were compelled to utilize available caffeine tablets (200 mg/tablet) in the trial due to practical constraints and the unavailability of an oral caffeine solution. The selection of a 200 mg dosage was rationalized on two grounds: Firstly, 200 mg of caffeine corresponds to 2.5–3 mg/kg for an adult weighing between 60–70 kg [31]. Secondly, according to the European Food Safety Authority (EFSA), caffeine doses of up to 200 mg do not elicit safety concerns in non-pregnant adults [32]. Deviations from the protocol were implemented without adversely affecting the rights and well-being of the participants, and all changes were made in consultation with the university’s IRB to ensure the upholding of ethical standards.

2.2. Participants

Patients who were referred to our specialized MS clinic during the study period were assessed for eligibility. Men and women who met the following criteria were included: (a) aged between 20–55 years, (b) weight > 40 kg, (c) diagnosed with MS according to the McDonald criteria [33], which were confirmed by an expert neurologist regardless of the disease course, (d) having the ability to stand upright for ≥ 180 seconds without any support and to still ambulant, with an Expanded Disability Status Scale (EDSS) < 6.0, (e) without clinical relapse/disease progression in the previous three months, and (f) without coexisting conditions or with stable and well-controlled coexisting conditions (stable condition was defined as no change in the types and dosage of medication or disease severity over 3 months before the enrolment). EDSS is the most common tool for measuring the disability level in PwMS, scoring from 0 to 10, with higher scores indicating a higher disability level [34]. Patients with an EDSS score < 6 can walk without aid for at least 100 meters [34].

Main exclusion criteria included: (a) pregnancy or lactation, (b) MS relapse, corticosteroid treatment, or disease progression three months before the study, during the investigation, or the follow-ups, (c) other neurological, psychiatric, or systemic disorders affecting motor function (i.e., seizure, tremor, insomnia, depression with at least moderate severity assessed by Beck Depression Inventory-II (BDI-II) [35], moderate or severe anxiety assessed by The Persian-validated translation of Hospital Anxiety and Depression Scale-Anxiety subscale (HADS-A≥ 11) [3638], significant cardiovascular disorders, orthopedic problems, respiratory failure, myopathy, or vestibular disorders (any evidence of peripheral vestibulopathy determined through cerebellar examinations, nystagmus examination, and head impulse test), (c) significant cognitive impairment (defined as taking more than 90 seconds to complete the Trail-Making Test-B) [3941], (d) medical therapy alterations in the previous 3 months or during the investigation, (e) using concurrent medications that can affect balance (i.e., selective serotonin reuptake inhibitors [SSRIs] and serotonin-norepinephrine reuptake inhibitors [SNRIs] [42]), (f) use of medications with major interactions with caffeine (i.e., dipyridamole and isocarboxazid, linezolid, etc. [43]), (g) hypersensitivity to caffeine, herbal extracts or dietary supplements, and (h) history of hypertension, cardiovascular disease, migraine headaches, renal or liver impairments, peptic ulcer disease, and drug/alcohol abuse [43]. Eventually, (i) patients with an inability or lack of interest to comply with the study procedures (taking medications, responding to phone calls, attending the clinic for revisits) were excluded.

2.3. Study settings and ethics statement

This study was conducted at an academic hospital complex affiliated with Zahedan University of Medical Sciences, Zahedan, Iran, and followed the Consolidated Standards of Reporting Trials (CONSORT) extension to pilot and feasibility trials [44] (S1 Table in S1 File). The trial was approved by the university’s Institutional Review Board (IRB, IR.ZAUMS.REC.1395.236) and registered with the Iranian Registry of Clinical Trials (IRCT) (IRCT2017012332142N1). The first patient received treatment on March 9, 2017, and the trial ended on January 2, 2018. Fig 1 illustrates the study design and schedule. Patients’ eligibility was assessed during a screening visit (visit 0), and those who qualified scheduled a follow-up visit a week later (visit 1) to receive placebo medication for 2 weeks (the 2-week pre-intervention placebo run-in stage). After the placebo run-in stage, patients started taking experimental caffeine tablets (200 m/day) (visit 2, week 0) for a 12-week period, during which four in-person follow-up visits (visits 3–6) were scheduled (the 12-week post-intervention treatment stage). During the study period, patients were instructed to stop consuming caffeinated products, such as coffee, tea, cola drinks, and cocoa. Patients were allowed to continue using concomitant MS therapies according to their neurologist’s prescription.

Fig 1. Study schedule and design.

Fig 1

After a screening visit (visit 0) for eligibility evaluation, eligible patients returned 1 week later (visit 1) and received placebo medication for 2 weeks (the 2-week placebo run-in stage). At visit 2, patients started taking a 200 mg daily dosage of caffeine tablet and returned for four follow-up visits (visits 3–6) over a 12-week period. Abbreviations: MSWS-12: 12-item Multiple Sclerosis Walking Scale, BBS: Berg Balance Scale, TUG: Timed Up-and-Go, MSIS-29: Multiple Sclerosis Impact Scale, PGIC: Patients’ Global Impression of Change.

Patients’ anonymity was protected, and verbal and written informed consents were properly obtained from participants for participation and publication under the Declaration of Helsinki [45] and Good Clinical Practice [46]. The research team had access to identifiable participant data during the data collection phase. However, steps were taken to ensure that any identifying information was securely stored and separated from the main dataset. After data collection, all personally identifiable information was removed and replaced with unique identifiers. This anonymized dataset was used for analysis and reporting. The research team adhered to strict protocols to protect participant privacy and complied with all applicable data protection regulations. Patients were given the option to drop out of the study, and if they chose to continue, giving an informed re-consent was necessary. An independent neurology specialist performed and scored the tests. All tests were performed by the same neurologist during the first visit, as well as in each follow-up revisit.

2.4. Intervention

A 200 mg/day orally administered caffeine tablet (Karen Pharma & Food Supplement Co.®) was assessed for its potential efficacy on balance, mobility, and MS-related QoL in PwMS over 12 weeks.

2.5. Outcomes

The primary objective of this study was to determine the potential efficacy of an experimental 200 mg/day orally available caffeine tablet (Karen Pharma & Food Supplement Co.®) on balance, mobility, and MS-related QoL of PwMS. The outcome measures included (a) the 12-item Multiple Sclerosis Walking Scale (MSWS-12) for patient’s reported ambulatory disability [47], (b) the Berg Balance Scale (BBS) for static and dynamic balance [48], (c) the Timed Up-and-Go (TUG) for functional mobility and dynamic balance [49], (d) the Multiple Sclerosis Impact Scale (MSIS-29) for patients’ perspective on MS impact on their QoL [50], and (e) the Patients’ Global Impression of Changes (PGIC) to evaluate patients’ own opinions on the treatment efficacy [51]. The outcome measures were assessed at baseline and after 2, 4, 8, and 12 weeks.

As the secondary objectives, we also examined the effects of sex and age on patients’ balance, mobility, and MS-related QoL over time. Finally, the correlations between balance and mobility-related measures (MSWS-12, BBS, and TUG), MS-related QoL (MSIS-29), and the patients’ opinions on treatment efficacy (PGIC) were studied.

2.6. Study measures

Patients’ baseline characteristics (age, sex, past medical history, habitual history, education, and job status), MS-related characteristics (MS phenotype, age of disease onset, disease duration, EDSS, and using disease-modifying drugs [DMDs]), balance and mobility-related measures (MSWS-12, BBS, TUG), and QoL measure (MSIS-29) were documented in week 0 (visit 2). During the following three post-intervention months, MSWS-12, BBS, TUG, MSIS-29, and PGIC were reassessed in each follow-up visit at week 2 (visit 3), week 4 (visit 4), week 8 (visit 5), and week 12 (visit 6) (Fig 1). Herein, we briefly describe the outcome measures:

The 12-item Multiple Sclerosis Walking Scale (MSWS-12): a self-report questionnaire with good test-retest reliability [52], consisting of 12 items that indicate the patient’s reported ambulatory disability caused by MS [47]. Each question earns a score of 1–5, with a total score ranging from 12–60. Higher scores indicate poorer performance [47] (S2 Table in S1 File). A valid and reliable Persian translation was used in this study [53].

The Berg Balance Scale (BBS): an objective 14-item scale, ranging from 0–56, which is used to determine the patient’s ability to safely balance during a series of predetermined tasks [48]. BBS assesses both static and dynamic balance with high reliability [48,54,55]. Each test item consists of a five-point ordinal scale ranging from zero to four, with zero indicating the lowest and four indicating the highest level of function [48] (S3 Table in S1 File). A validated Persian translation of the test was used in this study [56].

The Timed Up-and-Go (TUG): an objective test with high test-retest reliability [52], measuring dynamic balance and functional mobility [49]. The test requires that a patient stands from a chair, walks a distance of 3 meters, 180° turns around a cone, and then returns to a seated position as quickly as possible (TUG_test-print.pdf (cdc.gov); S1 Fig in S1 File) [49]. The faster the TUG speed, the better the mobility and dynamic balance [49].

The Multiple Sclerosis Impact Scale (MSIS-29): a self-report disease-specific health-related QoL (HR-QoL) measure with a high test-retest reliability [50], explaining the patient’s perspective on the disease’s impact on their QoL through the last two weeks. This 29-item questionnaire consists of 20 items related to the physical scale and 9 items related to the psychological scale [50]. Each item has five response options from one (not at all; the best) to five (extremely; the worst) (S4 Table in S1 File).

The Patient’s Global Impression of Change (PGIC): a seven-point self-report scale evaluating the patient’s overall health status changes after taking a specific treatment course [51]. This scale ranges from one to seven, with higher scores indicating a more considerable improvement (Official PGI-C, PGI-I, PGI-S distributed by Mapi Research Trust | ePROVIDE (mapi-trust.org)) [51] (S5 Table in S1 File).

2.7. Sample size

According to the sample size rule of thumb for pilot trials suggested by Browne et al., a sample size of 30 patients was initially calculated [57,58]. Additionally, the determination of sample size for this study involved the utilization of the GPower software. A deliberate underestimation was integrated, employing a conservative approach with an effect size established at 0.25, intentionally lower than the actual expected difference and effect size in our investigation. The analysis was conducted with a significance level of 0.05, and a range of power values from 0.8 to 0.95 was examined, incorporating considerations for five measurements and an assumed correlation of 0.5 between repeated measurements, which underestimated the true effect size and correlation observed in the study. Despite the conservative nature of these choices, the resulting sample size was computed as a minimum of 21 for a power of 0.8 and 30 for a power of 0.95.

2.8. Randomization

Not applicable.

2.9. Blinding

We described the nature of the study and the possibility of receiving either the active drug or a placebo without revealing the specific timing or details of when each would be administered. Participants were informed that at no point during the trial would they know which treatment they were getting, ensuring blinding during each stage, including both the 2-week pre-intervention placebo run-in stage and the 12-week post-intervention treatment stage. To reduce bias, outcome assessors (neurology specialists) were also blinded to the research hypothesis and medication codes. Both caffeine and placebo tablets were similar (in terms of materials, shape, color, taste, and fragrance), and they were packaged in the same bottle type to enable blinding.

2.10. Adherence monitoring

Notably, throughout the course of the 14-week trial, medication compliance and avoidance of caffeinated products were carefully assessed, using several approaches [59,60]. First, before obtaining informed consent, patients were educated about the study’s objectives, the significance of adhering to study requirements, the need to strictly follow the protocol, and detailed guidance on avoiding caffeinated products. Second, trained study personnel dispensed a certain amount of placebo or active medication during scheduled in-person visits (visits 1–6), ensuring that patients received only the required amount until their next visit. Additionally, at each visit, patients were reminded to bring the medication package from their previous visit and advised to abstain from caffeine-containing items. Third, paper-based patient diaries were employed to meticulously document all information related to medication intake, adverse events, and caffeine product consumption. Fourth, between visits, patients were contacted daily to confirm medication compliance, monitor potential adverse events, evaluate caffeine consumption, and address any other concerns. Eventually, participants were encouraged to reach out to the study team in case of difficulties, side effects affecting their compliance, or any violations involving the consumption of caffeinated products.

2.11. Safety monitoring

Regarding safety concerns, available evidence suggests that caffeine consumption is not only not associated with an increased risk of MS development or exacerbation, but it may also potentially reduce susceptibility to MS or ameliorate the disease progression [21,61]. An independent safety monitoring board comprised of three expert neurologists was responsible for assessing any treatment-related adverse events (trAEs). To classify an adverse event as treatment-related, the neurologists took into account the logical temporal association with the treatment administration, the expected patterns of response, and the exclusion of other factors, according to the NIA Adverse Event and Serious Adverse Event Guidelines (available online at NIA Adverse Event and Serious Adverse Event Guidelines (nih.gov)). Solicited trAEs evaluated in this study included CNS symptoms (headaches, lightheadedness, anxiety or agitation, tremor, restlessness, insomnia, seizure, etc.), cardiovascular symptoms (tachycardia, dysrhythmia, hypotension), and gastrointestinal symptoms (nausea, vomiting, abdominal cramping, dyspepsia, diarrhea, anorexia, etc.) [43].

2.12. Statistical analysis

Descriptive statistics were presented using mean ± standard deviation (SD) for numeric variables and frequency (%) for categorical variables. Wilcoxon test with Bonferroni adjustment was used to compare changes in scores between pairs of time using bar plots. The generalized estimation equation (GEE) was performed to examine the change in scores during the study period. In addition, GEE was used to evaluate the impact of age and sex on the scores at different times. Analyses were performed using SPSS (version 26) and R (version 4.2.1). Significant results were determined by P-values less than 0.05.

3. Results

Fig 2 indicates the CONSORT flow diagram of participants. A total of 79 patients were screened for eligibility, of whom 49 individuals were excluded. Finally, 30 PwMS were included for further evaluation. Within the three-month post-intervention period, at each study time point, eight, three, five, and two patients dropped out of the study due to a lack of willingness. No serious trAEs occurred during the study period.

Fig 2. The CONSORT Flow diagram of the participant disposition.

Fig 2

3.1. Descriptive statistics of demographic and MS-related variables

Table 1 shows the demographics and MS-related characteristics of patients. A total of 30 PwMS with a mean EDSS of 4.2 ± 1.3 were evaluated (mean age: 38.89 ± 9.85, female: 76.7%). Most of the patients (76.7%) had a relapsing-remitting course of the disease and were receiving DMD (86.7%). The mean age at MS diagnosis was 29.73 ± 8.82, with a mean disease duration of 8.57 ± 7.13 years.

Table 1. Descriptive statistics of demographic and MS-related variables.

Variable Mean ± SD/Frequency (%)
Baseline and demographic characteristics
Age, years old 38.89 ± 9.85
Sex Female 23 (76.7)
Male 7 (23.3)
PMH Negative 23 (76.7)
Thyroid disorders 2 (6.7)
Diabetes 2 (6.7)
Other 3 (10.0)
Habitual history Negative 26 (86.7)
Smoking 4 (13.3)
Alcohol 1 (3.3)
Hookah 1 (3.3)
Education, years ≤ 12 14 (46.7)
> 12 16 (53.3)
Job status Unemployed 16 (53.3)
Employed 9 (30.0)
Retired 2 (6.7)
MS-related characteristics
MS phenotype RRMS 23 (76.7)
SPMS 4 (13.3)
PPMS 3 (10.0)
Age of disease onset, years old§ 29.73 ± 8.82
Disease duration, years§ 8.57 ± 7.13
EDSS score 4.2 ± 1.3
DMD use Not receiving DMD 4 (13.3)
Receiving DMD 26 (86.7)

Numeric data are described using mean ± SD. Categorial data are presented as numbers (%). Abbreviations: SD: Standard deviation, PMH: Past medical history, MS: Multiple sclerosis, EDSS: Expanded disability status scale, DMD: Disease-modifying drugs.

Past medical history other than MS.

The patients’ age at evaluation were obtained from their medical records, according to their birth certificate.

§ The patients’ age at MS diagnosis and the disease duration were obtained from in-person interviews.

3.2. The trend of criterion scores

Table 2 shows the changes in scores over time for each criterion. The MSWS-12 did not significantly change over time (P-value = 0.583). Nevertheless, the BBS score significantly increased (P-value<0.001), and the time to complete the TUG task significantly decreased (P-value = 0.002) over time. A significant decline was observed in the MSIS-29 score (P-value = 0.005), especially two weeks after the intervention.

Table 2. The score trend of each criterion by times.

Criteria Before After 2 weeks After 4 weeks After 8 weeks After 12 weeks P
MSWS-12 score 36.29 ± 13.55 31.47 ± 12.38 35.58 ± 11.29 36.46 ± 14.36 33.67 ± 13.92 0.583
BBS score 37.56 ± 13.03 42.77 ± 10.65 47.65 ± 8.09 46.42 ± 11.03 48.64 ± 11.60 <0.001
TUG (second) 31.25 ± 20.94 25.24 ± 16.20 23.88 ± 13.35 21.58 ± 15.75 19.30 ± 8.17 0.002
MSIS-29 score 84.38 ± 21.75 74.20 ± 24.60 73.95 ± 17.33 76.14 ± 23.45 76.00 ± 22.60 0.005
PGIC score N/A 3.45 ± 1.68 3.72 ± 1.74 4.92 ± 1.51 4.10 ± 1.85 0.213

Mean ± standard deviation (SD) was reported for all variables. Abbreviations: P: Probability value, SD: Standard deviation, MSWS-12: 12-item Multiple Sclerosis Walking Scale, BBS: Berg Balance Scale, TUG: Timed Up-and-Go, MSIS-29: Multiple Sclerosis Impact Scale, PGIC: Patients’ Global Impression of Change, NA: Not applicable.

Fig 3 illustrates the changes in MSWS-12, BBS, TUG, MSIS-29, and PGIC scores. It was evident that TUG was trending downward in a significant and nearly constant manner. The BBS scores trended in the opposite direction. Furthermore, the MSIS-29 score showed a significant decline 2 weeks after the intervention (P-value = 0.005). However, MSWS-12 and PGIC did not show significant changes over time (P-value = 0.583 and 0.213).

Fig 3. The mean score of each criterion over the study period.

Fig 3

Abbreviations: MSWS-12: 12-item Multiple Sclerosis Walking Scale, BBS: Berg Balance Scale, TUG: Timed Up-and-Go, MSIS-29: Multiple Sclerosis Impact Scale, PGIC: Patients’ Global Impression of Change.

3.3. Impact of sex on the scores

In the next step, as shown in Table 3 and S2 Fig in S1 File, the impact of sex on scores over time was examined. Accordingly, PGIC was significantly lower for males than for females at the end of the second week (β: -3.12, 95% CI: -4.42, -1.83, P-value<0.001). The interaction effect of sex and time on both the MSWS-12 and PGIC was significant (S2 Fig in S1 File). This effect resulted in a significant increase in the MSWS-12 score for males compared to females for every unit increase in time (β: 7.55, 95% CI: 2.95, 12.16, P-value = 0.001). Also, the PGIC slope for males was higher than that for females over time (β: 0.92, 95% CI: 0.66, 1.19, P-value<0.001). No significant interaction effect for sex and time was seen on BBS (P-value = 0.940), TUG (P-value = 0.351), and MSIS-29 (P-value = 0.955) measures.

Table 3. The impact of sex on each criterion during the study period.

Criteria β (95% CI) P-value
MSWS-12 M vs. F 3.00 (-6.47,12.64) 0.543
Time -4.12 (-6.25, -1.98) <0.001
(M vs. F) * Time 7.55 (2.95, 12.16) 0.001
BBS M vs. F -8.79 (-18.90, 1.31) 0.088
Time 2.19 (0.97, 3.41) <0.001
(M vs. F) * Time -0.05 (-1.34, 1.24) 0.940
TUG M vs. F 13.43 (-12.35, 39.21) 0.307
Time -2.47 (-4.24, -0.69) 0.006
(M vs. F) * Time -1.87 (-5.81, 2.06) 0.351
MSIS-29 M vs. F 0.86 (-24.81, 26.52) 0.948
Time -4.21 (-7.37, -1.05) 0.009
(M vs. F) * Time -0.13 (-4.62, 4.36) 0.955
PGIC M vs. F -3.12 (-4.42, -1.83) <0.001
Time -0.08 (-0.24, 0.08) 0.318
(M vs. F) * Time 0.92 (0.66, 1.19) <0.001

The generalized estimation equation (GEE) was used to evaluate the impact of sex on each criterion during the study period. Abbreviations: M: Male, F: Female, CI: Confidence interval, MSWS-12: 12-item Multiple Sclerosis Walking Scale, BBS: Berg Balance Scale, TUG: Timed Up-and-Go, MSIS-29: Multiple Sclerosis Impact Scale, PGIC: Patients’ Global Impression of Change.

3.4. Impact of age on the scores

Table 4 shows the impact of age on scores over time. Older age was associated with higher scores of MSIS-29 (β: 1.80, 95% CI: 0.88, 2.73, P-value<0.001) and lower scores of BBS (β: -0.39, 95% CI: -0.77, -0.02, P-value = 0.039) at baseline. The impact of age on the BBS increased significantly for every unit increase in time (β: 0.09, 95% CI: 0.03, 0.15, P-value = 0.006).

Table 4. The impact of age on each criterion during the study period.

Variable β (95% CI) P-value
MSWS-12 Age 0.32 (-0.15, 0.80) 0.180
Time -0.23 (-5.41, 4.95) 0.930
Age * Time 0.01 (-0.12, 0.13) 0.924
BBS Age -0.39 (-0.77, -0.02) 0.039
Time -1.89 (-4.45, 0.68) 0.149
Age * Time 0.09 (0.03, 0.15) 0.006
TUG Age 0.42 (-0.48, 1.32) 0.362
Time 0.42 (-5.08, 5.92) 0.882
Age * Time -0.07 (-0.23, 0.10) 0.419
MSIS-29 Age 1.80 (0.88, 2.73) <0.001
Time -1.10 (-6.96, 4.76) 0.712
Age * Time -0.07 (-0.22, 0.09) 0.404
PGIC Age 0.03 (-0.06, 0.13) 0.528
Time 0.08 (-0.59, 0.74) 0.825
Age * Time 0.00 (-0.01, 0.02) 0.891

The generalized estimation equation (GEE) was used to evaluate the impact of age on each criterion during the study period. Abbreviations: CI: Confidence interval, MSWS-12: 12-item Multiple Sclerosis Walking Scale, BBS: Berg Balance Scale, TUG: Timed Up-and-Go, MSIS-29: Multiple Sclerosis Impact Scale, PGIC: Patients’ Global Impression of Change.

3.5. The Spearman correlation coefficient

Eventually, we assessed the correlations between outcome measures, including the MSWS-12, BBS, TUG, MSIS-29, and PGIC (Fig 4). The results showed significant positive correlations between MSIS-29 and MSWS-12 (r = 0.82, P-value<0.001), MSIS-29 with TUG (r = 0.73, P-value<0.001), and MSWS-12 with TUG (r = 0.64, P-value<0.001) prior to the intervention. In addition, negative correlation coefficients were observed between BBS and MSIS-29 (r = -0.50, P-value = 0.011), MSWS-12 (r = -0.44, P-value = 0.022), and TUG (r = -0.82, P-value<0.001). The Spearman correlation coefficients for other study time points are shown in Fig 4. S6 and S7 Tables in S1 File present comprehensive details regarding the Spearman correlation coefficients along with confidence intervals for the total scores at each time point throughout the study.

Fig 4. The Spearman correlation coefficient between total scores.

Fig 4

Abbreviations: MSWS-12: 12-item Multiple Sclerosis Walking Scale, BBS: Berg Balance Scale, TUG: Timed Up-and-Go, MSIS-29: Multiple Sclerosis Impact Scale, PGIC: Patients’ Global Impression of Change.

4. Discussion

Our results demonstrated that daily caffeine consumption might enhance balance and functional mobility in PwMS (assessed by BBS and TUG). These improvements were noticed as soon as two weeks and persisted throughout the experiment (3 months). In addition, caffeine ingestion resulted in a considerable improvement in patients’ reported MS-related QoL (assessed by MSIS-29) after two weeks. Despite improvement in the objective measurements of balance and functional mobility, as well as the subjective measurement of MS-related QoL, the patient’s self-reported ambulatory disability (evaluated by MSWS-12) and the patient’s impression of overall change after the treatment (assessed by PGIC) did not considerably improve (Fig 5). To evaluate other factors associated with the outcome measures, we also examined the effect of sex and age on scores over time. Accordingly, being male was associated with a sharper increase in self-reported ambulatory disability (MSWS-12) over time than women. Additionally, while men tend to report lower perceived improvement in overall health status (PGIC) at the end of the second week, the PGIC scores improved more prominently over time in men compared to women. Older age was associated with a worse balance status at baseline (lower BBS score). The impact of age on BBS scores increased as time progressed, suggesting that the effects of aging on balance became more pronounced over longer periods. Finally, we evaluated the correlations between outcome measures, indicating meaningful correlations between MSWS-12, BBS, TUG, and MSIS-29. In other words, worse performance in each of these outcome measures was associated with a worse performance in all other three outcome measures, which is in line with expectations.

Fig 5. Research summary.

Fig 5

Abbreviations: MSWS-12: 12-item Multiple Sclerosis Walking Scale, BBS: Berg Balance Scale, TUG: Timed Up-and-Go, MSIS-29: Multiple Sclerosis Impact Scale, PGIC: Patients’ Global Impression of Change, QoL: Quality of life, PwMS: Persons with multiple sclerosis.

4.1 Preclinical and clinical evidence of caffeine’s benefits in multiple sclerosis

According to an in vivo study on experimental autoimmune encephalomyelitis (EAE), an MS animal model, caffeine administration (0.2–2.0 mg/kg) reduced the neuroinflammatory process and demyelination, through increasing the A1 adenosine receptor (A1AR) expression on macrophage/microglia [23]. Consistent with preclinical findings [23,24], clinical studies also have suggested possible beneficial effects of caffeine in various aspects of MS, including MS susceptibility, disease progression, reduced concentration, reduced attention, and cognitive dysfunction [1820]. Reviewing the literature has suggested positive effects of caffeine on lowering the risk of MS susceptibility or ameliorating the clinical course of the disease [19]. Consistently, a large study on two population-representative case-control studies of PwMS found that high caffeine consumption (> 900 mL daily) has a protective role against MS development [21]. Likely, a cross-sectional study of 1372 PwMS discovered a possible positive effect of caffeine on the MS course and progression in patients with relapsing-remitting MS (RRMS), with a dose-effect relationship [22]. Another questionnaire-based study revealed that caffeine ingestion improved concentration, attention span, and performing structured daily routines in PwMS [18]. Consistently, a recent preliminary study suggested caffeine as a cognitive enhancer in PwMS, although this effect may be limited to tasks requiring a high level of attention [20].

4.2 Previous studies on caffeine’s effects on balance and mobility in Non-MS populations

Although, according to our knowledge, there has not yet been a study evaluating caffeine effects on the balance and ambulatory performance of PwMS, previous studies have revealed caffeine’s potential benefits on balance and some aspects of gait performance in populations other than PwMS [2830,62,63]. Notably, the results of these studies considerably vary by the study population’s age and sex. A recent randomized clinical trial in 30 young adults (20–35 years, female: 50%) indicated that caffeine ingestion (300–350 mg) resulted in improvements in postural balance and motor control [28]. Another study on 25 healthy middle-aged women (50–60 years) reported a positive effect of caffeine ingestion (100 mg) on postural balance during different situations (i.e., with closed eyes on the foam surface) [29]. Consistently, a trial on 20 middle-aged women (mean age: 52 years) recommended that caffeine consumption (100 mg) improved both cognitive and motor functions during static and dynamic dual-task conditions [30]. On the other hand, a clinical trial of 12 healthy older adults (aged >65 years, female: 66.7%) found a negative effect of caffeine ingestion (3 mg/kg) on the bipedal standing balance and no effect on dynamic balance [62]. Another randomized clinical trial investigating the effects of taking caffeine (6 mg/kg) on the static balance and walking speed of 30 elderly patients (age ≥ 70 years, female: 50%) revealed that although caffeine improved postural stability with eyes open, it failed to improve walking speed [63]. However, it is worth mentioning that MS is frequently diagnosed in younger individuals [1] (our participants’ mean age and mean age at MS diagnosis were nearly 39 and 30 years, respectively). Furthermore, available trials are limited by sample size, necessitating larger randomized clinical trials in this area.

4.3 Consideration of age and sex in balance and ambulatory performance studies

We also examined the effect of sex and age on outcome measures over the study period. In line with previous studies, our findings suggested the importance of taking age and sex into consideration while studying balance and ambulatory performance [28,62]. This is in corroboration with a systematic review, demonstrating that BBS scores worsen and become more unpredictable with age [64]. Additionally, our findings demonstrated that men reported more rapid deterioration of ambulatory abilities over time. Consistently, previous research has shown that sex hormones can influence disability progression, with men progressing earlier [65] and women progressing later during the perimenopausal period [66]. Notably, the male sex has been introduced as a negative prognostic factor of MS progression in several studies [65].

4.4 Limitations and strengths of the study

This study has several limitations, the most important of which are the relatively small sample size and a single-arm trial design. Although we designed a 2-week placebo run-in stage to reduce the placebo effect, lacking a control group limits the ability to determine whether the observed improvements were due to caffeine or to other factors, such as natural disease progression, placebo effect, or changes in other aspects of participants’ lives during the study. Therefore, the results should be interpreted with caution, as they only suggest the “potential” efficacy of caffeine in improving balance and mobility in PwMS rather than a “confirmation of efficacy.” However, single-arm trials are commonly used in phase II testing to collect preliminary evidence on potential treatment efficacy, obtain additional safety data, and assess whether a new treatment needs further investigation in a randomized phase III trial [67]. Another study limitation is attributed to the progressive increase in the number of dropouts. To address the issue of dropouts, we incorporated the GEE into our statistical analysis. The GEE is a robust method that accounts for missing data and provides unbiased estimates under the assumption of missing completely at random (MCAR) or missing at random (MAR) [68]. This approach enabled us to analyze the available data for each participant at different time points, accommodating the variability in the number of participants across assessments. Hence, despite the reduction in the number of participants over time, the GEE methodology empowered us to make valid inferences about the trends and changes observed in the outcome measures. Our study also lacked long-term follow-up (after 3 months) to evaluate the sustained effects of caffeine on balance and mobility in PwMS. Furthermore, our results may not be generalizable to all PwMS, as the potential effects of caffeine on patients with higher disability levels (EDSS ≥ 6) were not explored. Although, according to expert neurologists, no serious complications following caffeine consumption were observed in this study, the frequency of complications was not investigated. Another potential study limitation was that while we applied objective metrics to evaluate balance and functional mobility (BBS and TUG), MSWS-12 provided a subjective assessment of the patients’ walking capacity rather than an objective evaluation provided by tests such as the clinician-observed Timed 25-Foot Walk (T25FW). However, there is no clinician-administered scale evaluating all walking aspects in PwMS [69]. Additionally, while self-report questionnaires reflect the patients’ performance during the day or week in their own environment, direct observation is usually performed over a brief period in the clinic and might not reflect patients’ real-world experience [69]. MSWS-12 has also been shown to be more responsive than other walking-based measures, such as T25FW [69]. Notwithstanding these limitations, this is the first study so far documenting preliminary evidence of potential caffeine efficacy in improving balance and mobility in PwMS. This pilot study can serve as a foundation for more rigorous investigations, offering an initial estimate of the effect size to facilitate the planning of future controlled studies.

5. Conclusion and further direction

Although preliminary, our findings showed that daily consumption of 200 mg caffeine resulted in early and sustained improvement in functional mobility, static balance, and dynamic balance in PwMS. According to the patients, caffeine also reduced the impact of MS on their QoL. Hence, it can be considered a promising supplementation for enhancing balance and mobility and, subsequently, QoL in PwMS. The advantages of caffeine include low adverse effects, low cost, and the potential to improve other aspects of MS disease (such as cognitive aspects). Nevertheless, several questions remain to be answered. These results solely stem from this single-arm initial trial, which involved an Iranian population. They hint at the need for future definitive randomized placebo-controlled trials with larger sample sizes and longer follow-ups, involving diverse populations while taking age and sex into account. This approach is necessary to enhance our understanding of the efficacy of caffeine consumption and confirm its impact on the balance and mobility performance of PwMS. We suggest patients with higher disabilities also be included in future studies. We also suggest evaluating the effect of different caffeine doses on the balance and mobility of PwMS to determine the optimal treatment dosage. This would also be a fruitful area for further work to assess by affecting which MS-related symptoms (weakness, spasticity, fatigue, coordination alterations, cognitive dysfunction) can caffeine exert beneficial effects on patients’ balance and mobility. It is also necessary to investigate the possible adverse effects of caffeine ingestion in PwMS, especially considering the long-term use of caffeine (> 12 weeks) and higher doses.

Supporting information

S1 File

The supplementary material file includes Table S1: The CONSORT checklist of information to include when reporting a pilot trial, Table S2: The 12-item Multiple Sclerosis Walking Scale (MSWS-12), Table S3: The Berg Balance Scale (BBS), Figure S1: The Timed Up-and-Go (TUG), Table S4: The Multiple Sclerosis Impact Scale (MSIS-29), Table S5: The Patient’s Global Impression of Changes (PGIC), and Figure S2: The impact of sex on each criterion during the study period. Supplementary Materials (containing the CONSORT checklist): Supplementary Materials

(DOCX)

S2 File. IRCT registry, Original Language: Clinical Trial Protocol Registry (Persian).

(PDF)

S3 File. IRCT registry, English Language: Clinical Trial Protocol Registry (English).

(PDF)

S4 File. Study Protocol, English Language: Study Protocol (Persian).

(PDF)

S5 File. Study Protocol, English Language: Study Protocol (English).

(PDF)

Acknowledgments

We thank all the patients and clinicians who made this study possible.

Data Availability

The ethics committee of the Zahedan University of Medical Sciences, Zahedan, Iran has imposed restrictions on sharing any dataset from this study. The university and the affiliated hospitals have strict policies that prohibit the sharing of such data to protect patient confidentiality and comply with legal and ethical regulations. Data requests may be sent to the ethics committee. Sincerely, for additional information or clarification regarding our data sharing policies and asking for the dataset, here is the contact information for our ethics committee: • Email: zaums.research@gmail.com • The University Central Headquarters Call Center: (+98) 54 33372116 • Address: Zahedan University of Medical Sciences, Khalij -e- Fars Blvd, Zahedan, Sistan & Balouchestan Province, Islamic Republic of Iran. • Website: https://enresearch.zaums.ac.ir/.

Funding Statement

The authors received no specific funding for this work.

References

  • 1.Walton C, King R, Rechtman L, Kaye W, Leray E, Marrie RA, et al. Rising prevalence of multiple sclerosis worldwide: Insights from the Atlas of MS. Multiple Sclerosis Journal. 2020;26(14):1816–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Cameron MH, Nilsagard Y. Balance, gait, and falls in multiple sclerosis. Handbook of clinical neurology. 2018;159:237–50. doi: 10.1016/B978-0-444-63916-5.00015-X [DOI] [PubMed] [Google Scholar]
  • 3.Abdel-Aziz K, Schneider T, Solanky BS, Yiannakas MC, Altmann DR, Wheeler-Kingshott CA, et al. Evidence for early neurodegeneration in the cervical cord of patients with primary progressive multiple sclerosis. Brain. 2015;138(6):1568–82. doi: 10.1093/brain/awv086 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Khan F, Amatya B, Galea MP, Gonzenbach R, Kesselring J. Neurorehabilitation: applied neuroplasticity. J Neurol. 2017;264(3):603–15. doi: 10.1007/s00415-016-8307-9 [DOI] [PubMed] [Google Scholar]
  • 5.Amanollahi M, Amanollahi S, Anjomshoa A, Dolatshahi M. Mitigating the negative impacts of aging on cognitive function; modifiable factors associated with increasing cognitive reserve. Eur J Neurosci. 2021;53(9):3109–24. doi: 10.1111/ejn.15183 [DOI] [PubMed] [Google Scholar]
  • 6.Iyigün G, Aksu S, Snowdon N. Is Physiotherapy Effective in Improving Balance and Gait in Patients with Multiple Sclerosis?: A Systematic Review. Turkiye Klinikleri Journal of Medical Sciences. 2010;30:482–93. [Google Scholar]
  • 7.Springer S, Khamis S. Effects of functional electrical stimulation on gait in people with multiple sclerosis—A systematic review. Mult Scler Relat Disord. 2017;13:4–12. doi: 10.1016/j.msard.2017.01.010 [DOI] [PubMed] [Google Scholar]
  • 8.Pilloni G, Choi C, Coghe G, Cocco E, Krupp LB, Pau M, et al. Gait and Functional Mobility in Multiple Sclerosis: Immediate Effects of Transcranial Direct Current Stimulation (tDCS) Paired With Aerobic Exercise. Frontiers in Neurology. 2020;11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Goodman AD, Brown TR, Krupp LB, Schapiro RT, Schwid SR, Cohen R, et al. Sustained-release oral fampridine in multiple sclerosis: a randomised, double-blind, controlled trial. The Lancet. 2009;373(9665):732–8. doi: 10.1016/S0140-6736(09)60442-6 [DOI] [PubMed] [Google Scholar]
  • 10.Hupperts R, Lycke J, Short C, Gasperini C, McNeill M, Medori R, et al. Prolonged-release fampridine and walking and balance in MS: randomised controlled MOBILE trial. Mult Scler. 2016;22(2):212–21. doi: 10.1177/1352458515581436 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Goodman AD, Brown TR, Edwards KR, Krupp LB, Schapiro RT, Cohen R, et al. A phase 3 trial of extended release oral dalfampridine in multiple sclerosis. Annals of neurology. 2010;68(4):494–502. doi: 10.1002/ana.22240 [DOI] [PubMed] [Google Scholar]
  • 12.De Blasiis P, Siani MF, Fullin A, Sansone M, Melone MAB, Sampaolo S, et al. Short and long term effects of Nabiximols on balance and walking assessed by 3D-gait analysis in people with Multiple Sclerosis and spasticity. Mult Scler Relat Disord. 2021;51:102805. doi: 10.1016/j.msard.2021.102805 [DOI] [PubMed] [Google Scholar]
  • 13.Aristotelous P, Stefanakis M, Pantzaris M, Pattichis CS, Calder PC, Patrikios IS, et al. The Effects of Specific Omega-3 and Omega-6 Polyunsaturated Fatty Acids and Antioxidant Vitamins on Gait and Functional Capacity Parameters in Patients with Relapsing-Remitting Multiple Sclerosis. Nutrients. 2021;13(10). doi: 10.3390/nu13103661 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Loy BD, Fling BW, Horak FB, Bourdette DN, Spain RI. Effects of lipoic acid on walking performance, gait, and balance in secondary progressive multiple sclerosis. Complement Ther Med. 2018;41:169–74. doi: 10.1016/j.ctim.2018.09.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Baird JF, Sandroff BM, Motl RW. Therapies for mobility disability in persons with multiple sclerosis. Expert Rev Neurother. 2018;18(6):493–502. doi: 10.1080/14737175.2018.1478289 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.UpToDate. Symptom management of multiple sclerosis in adults UpToDate: UpToDate; 2023 [updated Feb 01, 2022; cited 2023 June 6]. Available from: https://www.uptodate.com/contents/symptom-management-of-multiple-sclerosis-in-adults?search=Symptom%20management%20of%20multiple%20sclerosisin%20adults&source=search_result&selectedTitle=1~150&usage_type=default&display_rank=1.
  • 17.Alsabri SG, Mari WO, Younes S, Elsadawi MA, Oroszi TL. Kinetic and dynamic description of caffeine. Journal of Caffeine and Adenosine Research. 2018;8(1):3–9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Herden L, Weissert R. The effect of coffee and caffeine consumption on patients with multiple sclerosis-related fatigue. Nutrients. 2020;12(8):2262. doi: 10.3390/nu12082262 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Herden L, Weissert R. The Impact of Coffee and Caffeine on Multiple Sclerosis Compared to Other Neurodegenerative Diseases. Front Nutr. 2018;5:133. doi: 10.3389/fnut.2018.00133 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Sharma K, Fallon SJ, Davis T, Ankrett S, Munro G, Christopher G, et al. Preliminary evidence that caffeine improves attention in multiple sclerosis. Mult Scler Relat Disord. 2022;64:103924. doi: 10.1016/j.msard.2022.103924 [DOI] [PubMed] [Google Scholar]
  • 21.Hedström A, Mowry EM, Gianfrancesco M, Shao X, Schaefer C, Shen L, et al. High consumption of coffee is associated with decreased multiple sclerosis risk; results from two independent studies. Journal of Neurology, Neurosurgery & Psychiatry. 2016:jnnp-2015-312176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.D’Hooghe M B, Haentjens P, Nagels G, De Keyser J. Alcohol, coffee, fish, smoking and disease progression in multiple sclerosis. Eur J Neurol. 2012;19(4):616–24. doi: 10.1111/j.1468-1331.2011.03596.x [DOI] [PubMed] [Google Scholar]
  • 23.Tsutsui S, Schnermann J, Noorbakhsh F, Henry S, Yong VW, Winston BW, et al. A1 adenosine receptor upregulation and activation attenuates neuroinflammation and demyelination in a model of multiple sclerosis. Journal of Neuroscience. 2004;24(6):1521–9. doi: 10.1523/JNEUROSCI.4271-03.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wang T, Xi NN, Chen Y, Shang XF, Hu Q, Chen JF, et al. Chronic caffeine treatment protects against experimental autoimmune encephalomyelitis in mice: therapeutic window and receptor subtype mechanism. Neuropharmacology. 2014;86:203–11. doi: 10.1016/j.neuropharm.2014.06.029 [DOI] [PubMed] [Google Scholar]
  • 25.Kim WS, Choi CK, Yoon SH, Kwon JY. Usual dose of caffeine has a positive effect on somatosensory related postural stability in hemiparetic stroke patients. Ann Rehabil Med. 2014;38(6):775–83. doi: 10.5535/arm.2014.38.6.775 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Ikram M, Park TJ, Ali T, Kim MO. Antioxidant and Neuroprotective Effects of Caffeine against Alzheimer’s and Parkinson’s Disease: Insight into the Role of Nrf-2 and A2AR Signaling. Antioxidants (Basel). 2020;9(9). doi: 10.3390/antiox9090902 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Amanollahi M, Jameie M, Heidari A, Rezaei N. The Dialogue Between Neuroinflammation and Adult Neurogenesis: Mechanisms Involved and Alterations in Neurological Diseases. Mol Neurobiol. 2023;60(2):923–59. doi: 10.1007/s12035-022-03102-z [DOI] [PubMed] [Google Scholar]
  • 28.Çıldır B, Altın B, Aksoy S. Caffeine Enhances the Balance System and Postural Balance in Short Time in Healthy Individuals. Turk Arch Otorhinolaryngol. 2021;59(4):253–60. doi: 10.4274/tao.2021.2021-4-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ben Waer F, Laatar R, Srihi S, Jouira G, Rebai H, Sahli S. Acute effects of low versus high caffeine dose consumption on postural balance in middle-aged women. J Women Aging. 2021;33(6):620–34. doi: 10.1080/08952841.2020.1735288 [DOI] [PubMed] [Google Scholar]
  • 30.Laatar R, Ben Waer F, Rebai H, Sahli S. Caffeine consumption improves motor and cognitive performances during dual tasking in middle-aged women. Behav Brain Res. 2021;412:113437. doi: 10.1016/j.bbr.2021.113437 [DOI] [PubMed] [Google Scholar]
  • 31.McLellan TM, Caldwell JA, Lieberman HR. A review of caffeine’s effects on cognitive, physical and occupational performance. Neuroscience & Biobehavioral Reviews. 2016;71:294–312. [DOI] [PubMed] [Google Scholar]
  • 32.EFSA Panel on Dietetic Products N, Allergies. Scientific Opinion on the safety of caffeine. EFSA Journal. 2015;13(5):4102. [Google Scholar]
  • 33.Thompson AJ, Banwell BL, Barkhof F, Carroll WM, Coetzee T, Comi G, et al. Diagnosis of multiple sclerosis: 2017 revisions of the McDonald criteria. The Lancet Neurology. 2018;17(2):162–73. doi: 10.1016/S1474-4422(17)30470-2 [DOI] [PubMed] [Google Scholar]
  • 34.Kurtzke JF. Rating neurologic impairment in multiple sclerosis: an expanded disability status scale (EDSS). Neurology. 1983;33(11):1444–. doi: 10.1212/wnl.33.11.1444 [DOI] [PubMed] [Google Scholar]
  • 35.Beck AT, Steer RA, Ball R, Ranieri WF. Comparison of Beck Depression Inventories-IA and-II in psychiatric outpatients. Journal of personality assessment. 1996;67(3):588–97. doi: 10.1207/s15327752jpa6703_13 [DOI] [PubMed] [Google Scholar]
  • 36.Zigmond AS, Snaith RP. The hospital anxiety and depression scale. Acta psychiatrica scandinavica. 1983;67(6):361–70. doi: 10.1111/j.1600-0447.1983.tb09716.x [DOI] [PubMed] [Google Scholar]
  • 37.Montazeri A, Vahdaninia M, Ebrahimi M, Jarvandi S. The Hospital Anxiety and Depression Scale (HADS): translation and validation study of the Iranian version. Health Qual Life Outcomes. 2003;1:14. doi: 10.1186/1477-7525-1-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Stern AF. The Hospital Anxiety and Depression Scale. Occupational Medicine. 2014;64(5):393–4. doi: 10.1093/occmed/kqu024 [DOI] [PubMed] [Google Scholar]
  • 39.Kabiri S, Jameie M, Balali P, Adib Moradi S, Sanjari Moghaddam H, Aghamollaii V, et al. Trail Making Test Could Predict Impairment in Cognitive Domains in Patients with Multiple Sclerosis: A Study of Diagnostic Accuracy. Archives of Clinical Neuropsychology. 2023;38(1):37–48. doi: 10.1093/arclin/acac059 [DOI] [PubMed] [Google Scholar]
  • 40.Tombaugh TN. Trail Making Test A and B: normative data stratified by age and education. Archives of clinical neuropsychology. 2004;19(2):203–14. doi: 10.1016/S0887-6177(03)00039-8 [DOI] [PubMed] [Google Scholar]
  • 41.Harirchian M, Aghamollaii V, Kabiri S, Jameie M, Balali P, Moradi SA, editors. Trail making test could predict impairment in cognitive domains in patients with multiple sclerosis: a study of diagnostic accuracy. MULTIPLE SCLEROSIS JOURNAL; 2022: SAGE PUBLICATIONS LTD 1 OLIVERS YARD, 55 CITY ROAD, LONDON EC1Y 1SP, ENGLAND. [DOI] [PubMed] [Google Scholar]
  • 42.Cameron MH, Karstens L, Hoang P, Bourdette D, Lord S. Medications are associated with falls in people with multiple sclerosis: a prospective cohort study. International journal of MS care. 2015;17(5):207–14. doi: 10.7224/1537-2073.2014-076 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.caffeine (Rx, OTC)2022 2022. Available from: https://reference.medscape.com/drug/cafcit-nodoz-caffeine-342995#3.
  • 44.Eldridge SM, Chan CL, Campbell MJ, Bond CM, Hopewell S, Thabane L, et al. CONSORT 2010 statement: extension to randomised pilot and feasibility trials. Bmj. 2016;355:i5239. doi: 10.1136/bmj.i5239 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Association WM. World Medical Association Declaration of Helsinki: ethical principles for medical research involving human subjects. Jama. 2013;310(20):2191–4. doi: 10.1001/jama.2013.281053 [DOI] [PubMed] [Google Scholar]
  • 46.Guideline IHT. Guideline for good clinical practice. J Postgrad Med. 2001;47(3):199–203. [PubMed] [Google Scholar]
  • 47.Hobart J, Riazi A, Lamping D, Fitzpatrick R, Thompson A. Measuring the impact of MS on walking ability: the 12-Item MS Walking Scale (MSWS-12). Neurology. 2003;60(1):31–6. doi: 10.1212/wnl.60.1.31 [DOI] [PubMed] [Google Scholar]
  • 48.Berg K, Wood-Dauphine S, Williams J, Gayton D. Measuring balance in the elderly: preliminary development of an instrument. Physiotherapy Canada. 1989;41(6):304–11. [Google Scholar]
  • 49.Podsiadlo D, Richardson S. The timed “Up & Go”: a test of basic functional mobility for frail elderly persons. Journal of the American geriatrics Society. 1991;39(2):142–8. [DOI] [PubMed] [Google Scholar]
  • 50.Hobart J, Lamping D, Fitzpatrick R, Riazi A, Thompson A. The Multiple Sclerosis Impact Scale (MSIS-29): a new patient-based outcome measure. Brain. 2001;124(Pt 5):962–73. doi: 10.1093/brain/124.5.962 [DOI] [PubMed] [Google Scholar]
  • 51.Gnanasakthy A, Barrett A, Norcross L, D’Alessio D, Romano C. Use of Patient and Investigator Global Impression Scales: A Review of Food and Drug Administration–Approved Labeling, 2009 to 2019. Value in Health. 2021;24(7):1016–23. doi: 10.1016/j.jval.2021.01.005 [DOI] [PubMed] [Google Scholar]
  • 52.Bennett SE, Bromley LE, Fisher NM, Tomita MR, Niewczyk P. Validity and reliability of four clinical gait measures in patients with multiple sclerosis. International journal of MS care. 2017;19(5):247–52. doi: 10.7224/1537-2073.2015-006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Nakhostin Ansari N, Naghdi S, Mohammadi R, Hasson S. Multiple Sclerosis Walking Scale-12, translation, adaptation and validation for the Persian language population. Gait Posture. 2015;41(2):420–4. doi: 10.1016/j.gaitpost.2014.11.003 [DOI] [PubMed] [Google Scholar]
  • 54.Berg K, Wood-Dauphinee S, Williams J. The Balance Scale: reliability assessment with elderly residents and patients with an acute stroke. Scandinavian journal of rehabilitation medicine. 1995;27(1):27–36. [PubMed] [Google Scholar]
  • 55.Berg K. Measuring balance in the elderly: Development and validation of an instrument: McGill University; 1992. [PubMed] [Google Scholar]
  • 56.Salavati M, Negahban H, Mazaheri M, Soleimanifar M, Hadadi M, Sefiddashti L, et al. The Persian version of the Berg Balance Scale: inter and intra-rater reliability and construct validity in elderly adults. Disabil Rehabil. 2012;34(20):1695–8. doi: 10.3109/09638288.2012.660604 [DOI] [PubMed] [Google Scholar]
  • 57.Whitehead AL, Julious SA, Cooper CL, Campbell MJ. Estimating the sample size for a pilot randomised trial to minimise the overall trial sample size for the external pilot and main trial for a continuous outcome variable. Statistical Methods in Medical Research. 2015;25(3):1057–73. doi: 10.1177/0962280215588241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Browne RH. On the use of a pilot sample for sample size determination. Stat Med. 1995;14(17):1933–40. doi: 10.1002/sim.4780141709 [DOI] [PubMed] [Google Scholar]
  • 59.Matsui D. Strategies to Measure and Improve Patient Adherence in Clinical Trials. Pharmaceutical Medicine. 2009;23(5):289–97. [Google Scholar]
  • 60.Robiner WN. Enhancing adherence in clinical research. Contemporary clinical trials. 2005;26(1):59–77. doi: 10.1016/j.cct.2004.11.015 [DOI] [PubMed] [Google Scholar]
  • 61.Massa J O ’Reilly EJ, Munger KL, Ascherio A. Caffeine and alcohol intakes have no association with risk of multiple sclerosis. Mult Scler. 2013;19(1):53–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Tallis J, Bradford C, Duncan MJ, Leddington-Wright S, Higgins MF, Hill M. The Effect of Acute Caffeine Ingestion on Cognitive Dual Task Performance during Assessment of Static and Dynamic Balance in Older Adults. Nutrients. 2020;12(12). doi: 10.3390/nu12123653 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Norager CB, Jensen MB, Madsen MR, Laurberg S. Caffeine improves endurance in 75-yr-old citizens: a randomized, double-blind, placebo-controlled, crossover study. J Appl Physiol (1985). 2005;99(6):2302–6. doi: 10.1152/japplphysiol.00309.2005 [DOI] [PubMed] [Google Scholar]
  • 64.Downs S, Marquez J, Chiarelli P. Normative scores on the Berg Balance Scale decline after age 70 years in healthy community-dwelling people: a systematic review. Journal of Physiotherapy. 2014;60(2):85–9. doi: 10.1016/j.jphys.2014.01.002 [DOI] [PubMed] [Google Scholar]
  • 65.Dunn SE, Gunde E, Lee H. Sex-Based Differences in Multiple Sclerosis (MS): Part II: Rising Incidence of Multiple Sclerosis in Women and the Vulnerability of Men to Progression of this Disease. Curr Top Behav Neurosci. 2015;26:57–86. doi: 10.1007/7854_2015_370 [DOI] [PubMed] [Google Scholar]
  • 66.Krysko KM, Graves JS, Dobson R, Altintas A, Amato MP, Bernard J, et al. Sex effects across the lifespan in women with multiple sclerosis. Ther Adv Neurol Disord. 2020;13:1756286420936166. doi: 10.1177/1756286420936166 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Evans SR. Clinical trial structures. J Exp Stroke Transl Med. 2010;3(1):8–18. doi: 10.6030/1939-067x-3.1.8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Fitzmaurice GM, Laird NM. Generalized linear mixture models for handling nonignorable dropouts in longitudinal studies. Biostatistics. 2000;1(2):141–56. doi: 10.1093/biostatistics/1.2.141 [DOI] [PubMed] [Google Scholar]
  • 69.Bethoux F, Bennett S. Evaluating walking in patients with multiple sclerosis: which assessment tools are useful in clinical practice? Int J MS Care. 2011;13(1):4–14. doi: 10.7224/1537-2073-13.1.4 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Antony Bayer

4 Oct 2023

PONE-D-23-18917Efficacy of caffeine ingestion on the balance and gait in patients with multiple sclerosis: Preliminary evidence from a single-arm pilot clinical trialPLOS ONE

Dear Dr. Amirifard,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that the topic is of interest but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript if you feel you are able to address the points raised during the review process.

Please submit any revised manuscript by Nov 18 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Antony Bayer

Academic Editor

PLOS ONE

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. We note that you have selected “Clinical Trial” as your article type. PLOS ONE requires that all clinical trials are registered in an appropriate registry (the WHO list of approved registries is at

"https://www.who.int/clinical-trials-registry-platform/network/primary-registries"" https://www.who.int/clinical-trials-registry-platform/network/primary-registries and more information on trial registration is at http://www.icmje.org/about-icmje/faqs/clinical-trials-registration/).

Please state the name of the registry and the registration number (e.g. ISRCTN or ClinicalTrials.gov) in the submission data and on the title page of your manuscript. a) Please provide the complete date range for participant recruitment and follow-up in the methods section of your manuscript. b) If you have not yet registered your trial in an appropriate registry, we now require you to do so and will need confirmation of the trial registry number before we can pass your paper to the next stage of review. Please include in the Methods section of your paper your reasons for not registering this study before enrolment of participants started. Please confirm that all related trials are registered by stating: “The authors confirm that all ongoing and related trials for this drug/intervention are registered”. Please see http://journals.plos.org/plosone/s/submission-guidelines#loc-clinical-trials for our policies on clinical trials.

3. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions.

In your revised cover letter, please address the following prompts:

a) If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data contain potentially identifying or sensitive patient information) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b) If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. Please see http://www.bmj.com/content/340/bmj.c181.long for guidelines on how to de-identify and prepare clinical data for publication. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

4. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. Please see the following video for instructions on linking an ORCID iD to your Editorial Manager account: https://www.youtube.com/watch?v=_xcclfuvtxQ

5. We note that Figure S1 in your submission contain copyrighted images. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (1) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (2) remove the figures from your submission:

1. You may seek permission from the original copyright holder of Figure S1 to publish the content specifically under the CC BY 4.0 license.

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission.

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

2. If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

6. We note that the original protocol that you have uploaded as a Supporting Information file contains an institutional logo. As this logo is likely copyrighted, we ask that you please remove it from this file and upload an updated version upon resubmission.

Additional Editor Comments:

1. The authors aim to “evaluated the effects on ambulatory performance and HRQoL” - how can effectiveness of an intervention be evaluated without any control?

2. The discussion should centre on the results and their implications and the background information from previous studies on hypothesis, benefits etc. should be in the Introduction.

3. The participants gave “fully informed consent” yet participants were “blind to the medication they received”? What were they told?

4.How was adherence assured (both for taking the study medication and for excluding other caffeinated products over the 14 weeks)?

5.The authors justify their sample size based on a reference to numbers suitable for a Phase 2 randomised trial (15 in each group, as originally planned in the placebo controlled study). How was this impacted by the eventual study just having a single arm (and by 18 not completing)? What effect size on the outcome measures would a sample size of 30 (and 12) have been able to demonstrate?

6.How did the analysis deal with the 18 dropouts? It seems surprising that the mean values and standard deviations for each assessment were broadly similar at each time point, despite progressively fewer people included.

7.What was the justification for choosing the 200mg dose of caffeine (and why is this different from the 2.5mg/Kg body weight) in the trial registry?

8.How were adverse events decided to be “treatment-related” and how did the 3 expert neurologists assess them?

9.What “deviations from the protocol” (p4) were there? Please explain all the differences from the protocol and the trial registry (age 20-55 included, weight >40Kg excluded, EDSS as primary outcome etc.)

10.What cut-off was used on trail-making task to exclude cognitive impairment?

11.For the PGIC, did a score of 4 indicate no change and so 1-3 a worsening, or did 1 to 7 indicate progressive improvement?

12.Mean age at MS diagnosis=29.73 and at evaluation=38.89 (difference 9.16 years), so how can mean disease duration be only 8.57 years?

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: No

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This study was aimed to evaluate its effects on ambulatory performance and health-related quality of life (HR-QoL) of patients with MS (PwMS). Authors concluded that Caffeine may enhance balance, functional mobility, and QoL in PwMS. Being male was associated with a sharper increase in self-reported ambulatory disability over time. The effects of aging on balance get more pronounced over time Overall, the study is interesting, however there are some clarifications needed.

Comment#1

Keywords: Please add quality of life to Keywords section.

Comment#2

Introduction, Line 87-90: Authors stated that this pilot single-arm phase II clinical trial set out to evaluate the potential effectiveness of caffeine ingestion on ambulatory performance (i.e., static and dynamic balance, functional mobility, and patient’s reported ambulatory disability) among PwMS. Ambulatory performance is not appropriate for static balance. Ambulatory performance is mostly related with dynamic balance rather than static balance.

Comment#3

Materials & Methods, Line 152. How do you assess the presence vestibular disorder or cognitive impairment?

Comment#4

Materials & Methods, Line 151-152. What cut-off point did you consider for the presence of moderate or severe anxiety assessed by The Hospital Anxiety and Depression Scale-Anxiety subscale?

Comment#5

Results, the authors should calculate effect size for each variable.

Reviewer #2: This manuscript presents data analysis from a non-randomized, Phase-II, pilot study on evaluating the effectiveness of caffeine ingestion on the balance and gait in MS patients. The topic is of importance, the study was registered as a RCT within the Iranian system, was approved by the respective IRB/Ethics Committee. While the study objectives sound interesting, is important, and on target, some shortcomings were observed, in regards to abiding by the CONSORT guidelines for conducting and reporting results of high-quality randomized controlled trials (RCTs). Some other (statistical) comments were also provided.

1. Methods:

Methods reporting need some work. An orderly manner is suggested, following CONSORT guidelines, without repeating information, such as Trial Design, Participant Eligibility Crtieria and settings, Interventions, Outcomes, sample size/power considerations, Interim analysis and stopping rules, etc. The authors are advised to create separate subsections for each of the possible topics (whichever necessary), and that way produce a very clear writeup. I see the Authors indeed made an attempt; however, they are advised to write it carefully, following nice examples in the manuscript below:

https://www.sciencedirect.com/science/article/pii/S0889540619300010

Specific comments:

(a) I am somewhat confused with the design! This is a single-arm, Phase-II, but I do not understand (justification not given clearly) behind the administration of placebo initially. Popular Phase-II designs, such as Simon's Phase-II, are often 2-staged. On the contrary, it would have been perfectly OK if a randomized design was considered (which often is much clear!). Why was that not conducted? Any water tight justification?

(b) Sample size/power: The sample size/power statement should reflect the statistical test used (one-sided/two-sided), the significance level (5%?), the corresponding effect size, etc. Even the trial is not randomized, one may compute using the "desired" change one wants to attain at the end of the study. Even pilot trials need to be conducted with some ballpark number. It should also be described in a separate sub-section.

(c) Statistical Analysis: Based on the (longitudinal) design of the study, the authors justifiably conducted a GEE analysis. Any thoughts, why a mixed linear model analysis was not conducted (I am not asking authors to do it)?

2. Results & Conclusions:

(a) The authors should check that any statement of significance should be followed by a p-value in the entire Results section. Otherwise, the Results section look OK.

(b) Conclusions should stress that findings are only based on this pilot trial (using an Iranian population), and allude to future larger trials/studies, combining other populations, to understand the effectiveness of caffeine intake.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Razieh Mofateh

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2024 Feb 13;19(2):e0297235. doi: 10.1371/journal.pone.0297235.r002

Author response to Decision Letter 0


7 Dec 2023

* Note: Although the responses are also written here, please see the attached file (response to reviewers), as the figures and tables might not be displayed here correctly.

Best wishes,

PONE-D-23-18917

Efficacy of caffeine ingestion on the balance and gait in patients with multiple sclerosis: Preliminary evidence from a single-arm pilot clinical trial

PLOS ONE

Dear Dr. Amirifard,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that the topic is of interest but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript if you feel you are able to address the points raised during the review process.

We look forward to receiving your revised manuscript.

Kind regards,

Antony Bayer

Academic Editor

PLOS ONE

Dear Professor Antony Bayer,

Thank you very much for providing us with the opportunity to strengthen our research. We sincerely appreciate all the precious comments from you and the respected reviewers. Having carefully considered the comments and suggestions, we have made all the relevant changes to our manuscript as outlined below in an itemized, point-by-point manner. We sincerely hope that these changes meet the approval criteria of the esteemed reviewers and the editorial board.

Best Regards,

- Hamed Amirifard; MD, Assistant Professor of Neurology, Iranian Center of Neurological Research, Neuroscience Institute, Tehran University of Medical Sciences, Tehran, Iran. E-mail: Dr.amirifard@gmail.com, ORCID: 0000-0001-7675-5328, Phone: (+98) 912 6493820

- Melika Jameie; MD, MBA, Post-doctoral research fellow, Neuroscience Research Center, Iran University of Medical Sciences, Tehran, Iran; Iranian Center of Neurological Research, Neuroscience Institute, Tehran University of Medical Sciences, Tehran, Iran. ORCID: 0000-0002-2028-9935. Email: Jameiemelika@gmail.com; Ms-jameie@farabi.tums.ac.ir; Jameiemelika@sbmu.ac.ir

Journal requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming.

- Reply: Thank you very much. We tried our best to meet the journal’s style requirements. We sincerely hope that these changes meet the approval criteria of the PLOS ONE journal.

2. We note that you have selected “Clinical Trial” as your article type. PLOS ONE requires that all clinical trials are registered in an appropriate registry (the WHO list of approved registries). Please state the name of the registry and the registration number (e.g., ISRCTN or ClinicalTrials.gov) in the submission data and on the title page of your manuscript.

- Reply: Thank you very much for your comment. This study was registered with the Iranian Registry of Clinical Trials (Registration number: IRCT2017012332142N1), a Primary Registry in the WHO Registry Network. Sincerely, we mentioned this in the Registration section. We also added the statement in the Title page, as your precious comment.

a. Please provide the complete date range for participant recruitment and follow-up in the methods section of your manuscript.

- Reply: Thank you very much for your mention. According to your comment. in the Methods section, we mentioned this issue: “The first patient received treatment on March 9, 2017, and the trial ended on January 2, 2018.”

b. If you have not yet registered your trial in an appropriate registry, we now require you to do so and will need confirmation of the trial registry number before we can pass your paper to the next stage of review. Please include in the Methods section of your paper your reasons for not registering for this study before the enrolment of participants started.

- Reply: Thank you for your mention. This study was registered with the Iranian Registry of Clinical Trials (Registration number: IRCT2017012332142N1), a Primary Registry in the WHO Registry Network. This information is available in the Registration section and the Title page.

c. Please confirm that all related trials are registered by stating: “The authors confirm that all ongoing and related trials for this drug/intervention are registered”.

- Reply: Thank you very much. The authors confirm that all ongoing and related trials for this drug/intervention are registered with the Iranian Registry of Clinical Trials (Registration number: IRCT2017012332142N1).

3. We note that you have indicated that data from this study are available upon request. PLOS only allows data to be available upon request if there are legal or ethical restrictions on sharing data publicly. For information on unacceptable data access restrictions, please see http://journals.plos.org/plosone/s/data-availability#loc-unacceptable-data-access-restrictions. In your revised cover letter, please address the following prompts:

a. If there are ethical or legal restrictions on sharing a de-identified data set, please explain them in detail (e.g., data containing potentially identifying or sensitive patient information) and who has imposed them (e.g., an ethics committee). Please also provide contact information for a data access committee, ethics committee, or other institutional body to which data requests may be sent.

b. If there are no restrictions, please upload the minimal anonymized data set necessary to replicate your study findings as either Supporting Information files or to a stable, public repository and provide us with the relevant URLs, DOIs, or accession numbers. Please see http://www.bmj.com/content/340/bmj.c181.long for guidelines on how to de-identify and prepare clinical data for publication. For a list of acceptable repositories, please see http://journals.plos.org/plosone/s/data-availability#loc-recommended-repositories.

We will update your Data Availability statement on your behalf to reflect the information you provide.

- Reply: Thank you very much for your comment. The ethics committee of the Zahedan University of Medical Sciences, Zahedan, Iran has imposed restrictions on sharing any dataset from this study. The university and the affiliated hospitals have strict policies that prohibit the sharing of such data to protect patient confidentiality and comply with legal and ethical regulations. Sincerely, for additional information or clarification regarding our data sharing policies and asking for the dataset, here is the contact information for our ethics committee:

o Email: zaums.research@gmail.com

o The University Central Headquarters Call Center: (+98) 54 33372116

o Address: Zahedan University of Medical Sciences, Khalij -e- Fars Blvd, Zahedan, Sistan & Balouchestan Province, Islamic Republic of Iran.

o Website: https://enresearch.zaums.ac.ir/

We apologize for any inconvenience this may cause and appreciate your understanding of the legal and ethical constraints that prevent us from sharing the data. If you have any further questions or need additional information, please feel free to contact us or our ethics committee for further guidance. This information was also added in the “Revised cover letter”.

4. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new ID or authenticate a pre-existing iD in Editorial Manager. Please see the following video for instructions on linking an ORCID iD to your Editorial Manager account: https://www.youtube.com/watch?v=_xcclfuvtxQ

- Reply: Thank you very much. The ORCID IDs for the two corresponding authors are provided in the Title page:

o Hamed Amirifard, ORCID: 0000-0001-7675-5328

o Melika Jameie, ORCID: 0000-0002-2028-9935

5. We note that Figure S1 in your submission contains copyrighted images. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (1) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (2) remove the figures from your submission:

1. You may seek permission from the original copyright holder of Figure S1 to publish the content specifically under the CC BY 4.0 license. We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text: “I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission. In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

2. If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check the copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

- Reply: Thank you very much for your consideration, and we sincerely apologize for inadvertently using this figure. We omitted the copyrighted figure and created a new figure. In the figure caption, we mentioned that “The figure is similar but not identical to the original image from TUG_test-print.pdf (cdc.gov) and is therefore for illustrative purposes only”.

Figure S1. The figure is similar but not identical to the original image from TUG_test-print.pdf (cdc.gov) and is therefore for illustrative purposes only.

6. We note that the original protocol that you have uploaded as a Supporting Information file contains an institutional logo. As this logo is likely copyrighted, we ask that you please remove it from this file and upload an updated version upon resubmission.

- Reply: Thank you very much. We omitted the logo and the revised files are uploaded as “Revised Study Protocol, Original Language” and “Revised Study Protocol, English”.

Additional Editor Comments:

1. The authors aim to “evaluate the effects on ambulatory performance and HRQoL” - how can the effectiveness of an intervention be evaluated without any control?

- Reply: Thank you very much for your meticulous comment. As with your precious comment and due to the fact that it was a pilot exploratory clinical trial with no control group, we revised all the “efficacy”-related terms to “potential efficacy” within the whole manuscript, including the Title, Abstract, and Introduction, reflecting the exploratory nature of our study. “Our trial's goal was to give an initial estimate of the effect size so that future controlled studies could be better planned”, even if it was not intended to prove caffeine's efficacy. This is especially crucial in the early phases of research when little is known about the possible effects of the intervention.

- As per your valuable comment, in the Limitation section we also mentioned this important issue to acknowledge this study's limitation and the necessity to interpret the results cautiously: “Although we designed a 2-week placebo run-in stage to reduce the placebo effect, lacking a control group limits the ability to determine whether the observed improvements were due to caffeine or to other factors, such as natural disease progression, placebo effect, or changes in other aspects of participants' lives during the study”.

- We also noted that: “Therefore, the results should be interpreted with caution, as they only suggest the “potential” efficacy of caffeine in improving the balance and mobility in PwMS rather than a “confirmation of efficacy.”

- Although this trial lacked a control group, it is worth noting that “single-arm trials are commonly used in phase II testing to collect preliminary evidence on potential treatment efficacy, obtain additional safety data, and assess whether a new treatment needs further investigation in a randomized phase III trial” (Clinical trial structures - PMC (nih.gov)). Therefore, by mentioning this point in the limitation section, we tried to aware the readers that this study only provides “preliminary evidence on potential treatment efficacy”.

- In the Conclusion section, we also noted that “These results solely stem from this single-arm initial trial, which involved an Iranian population. They hint at the need for future definitive randomized placebo-controlled trials with larger sample sizes and longer follow-ups, involving diverse populations while taking age and sex into account. This approach is necessary to enhance our understanding of the efficacy of caffeine consumption and confirm its impact on balance and mobility performance of PwMS.”

- At the end we again stress the need for further research to confirm the findings: “This pilot study can serve as a foundation for more rigorous investigations, offering an initial estimate of the effect size to facilitate the planning of future controlled studies”.

- In summery, we revised the whole manuscript from Title to Conclusion to convey this concept to the readers that these findings do not “confirm” the efficacy, but rather suggest a “potential” efficacy.

2. The discussion should center on the results and their implications and the background information from previous studies on hypothesis, benefits, etc. should be in the Introduction.

- Reply: Thank you very much for your valuable feedback. This pilot study suggested the possible potential benefits of caffeine on balance and mobility in PwMS. To our knowledge, this is the first study on this subject, limiting our ability to draw direct comparisons with existing literature. The scarcity of related articles compelled us to focus on two closely related areas: (1) studies related to the caffeine effect on other aspects of the lives of PwMS and (2) studies related to the caffeine effect on balance and mobility in populations other than PwMS. According to your valuable comment, we have streamlined the Discussion by integrating the subsection titled "The Hypothesis of Caffeine's Potential Benefits in Improving Balance and Mobility in PwMS" into the Introduction. The repeated parts were also omitted to increase readability.

- Therefore, the study hypothesis and the reasons behind that are provided in the revised manuscript in the Introduction section: “Interventions, including physical rehabilitation (6), exercise, non-invasive brain stimulation (7, 8), and medications such as dalfampridine (4-aminopyridine) (9-11), nabiximols (12), polyunsaturated fatty acids, omega-3, omega-6 (13), and lipoic acid (14), have been evaluated for balance and gait improvement in PwMS (2, 6-15). Notably, dalfampridine is the only U.S. Food and Drug Administration (FDA)-approved medication for improving the balance and walking abilities of PwMS (Ampyra (dalfampridine) Information | FDA) (9-11, 15). However, it should be prescribed with caution, as it may cause serious side effects (16), including severe allergic reactions, seizures, and triggering/exacerbating trigeminal neuralgia (medication guide available at label (fda.gov)). Additionally, it seems that dalfampridine may help only a subset of PwMS, with one-quarter and one-third of patients experiencing faster walking speed and enhanced walking ability, respectively (16).

Caffeine, a natural compound, is the most widely consumed psychoactive agent in the world (17). Studies have shown caffeine's potential benefits on various neurological disorders, including seizure, Alzheimer’s disease, Parkinson’s disease, stroke, and MS (18-25), possibly by reducing neuroinflammation and oxidative stress and increasing neurogenesis (26, 27). Specifically, research has highlighted the positive effects of caffeine on the ambulatory performance of non-MS populations (28-30), as well as on selected aspects of MS, including attention and disease progression (18-24). Aligned with caffeine's impact on the central nervous system, skeletal muscles (31), the ambulatory performance of non-MS populations (28-30), and specific aspects of MS (18-20, 23, 24), potentially favorable effects on balance and mobility in PwMS could also be anticipated. Consequently, we hypothesized that caffeine might have the potential to improve balance and mobility impairments as debilitating aspects of MS.”

3. The participants gave “fully informed consent” yet participants were “blind to the medication they received”? What were they told?

- Reply: Thank you very much for your valuable comment. The patients were told that they might receive either an active drug or a placebo during the study period. Indeed, they consented to receive either a placebo or medication; however, they did not know what they were receiving at each stage of the study. According to your precious comment, we revised the Methods> Blinding section and noted that “We described the nature of the study and the possibility of receiving either the active drug or a placebo without revealing the specific timing or details of when each would be administered. Participants were informed that at no point during the trial would they know which treatment they were getting, ensuring blinding during each stage, including both the 2-week pre-intervention placebo run-in stage and the 12-week post-intervention treatment stage”.

4. How was adherence assured (both for taking the study medication and for excluding other caffeinated products over the 14 weeks)?

- Reply: Thank you very much for your valuable comment. Since we had not mentioned this issue before, we revised the manuscript and provided the related information by adding a new section (Methods> Adherence monitoring): “Notably, throughout the course of the 14-week trial, medication compliance and avoidance of caffeinated products were carefully assessed, using several approaches (48, 49). First, before obtaining informed consent, patients were educated about the study's objectives, the significance of adhering to study requirements, the need to strictly follow the protocol, and detailed guidance on avoiding caffeinated products. Second, trained study personnel dispensed a certain amount of placebo or active medication during scheduled in-person visits (visits 1-6), ensuring that patients received only the required amount until their next visit. Additionally, at each visit, patients were reminded to bring the medication package from their previous visit and advised to abstain from caffeine-containing items. Third, paper-based patient diaries were employed to meticulously document all information related to medication intake, adverse events, and caffeine product consumption. Fourth, between visits, patients were contacted daily to confirm medication compliance, monitor potential adverse events, evaluate caffeine consumption, and address any other concerns. Eventually, participants were encouraged to reach out to the study team in case of difficulties, side effects affecting their compliance, or any violations involving the consumption of caffeinated products.”

5. The authors justify their sample size based on a reference to numbers suitable for a Phase 2 randomized trial (15 in each group, as originally planned in the placebo-controlled study). How was this impacted by the eventual study just having a single arm (and by 18 not completing)? What effect size on the outcome measures would a sample size of 30 (and 12) have been able to demonstrate?

- Reply: Thank you for your constructive input regarding the justification of our sample size, particularly in the context of the initially planned Phase 2 randomized trial and the subsequent decision to conduct a single-arm study, where 18 participants did not complete the study. We appreciate your consideration of the impact of these changes on our study design. In response to your query, the initial reference to a Phase 2 randomized trial with 15 participants in each group was made when planning a placebo-controlled study. Due to practical considerations (mentioned in the Methods > Trial design and any changes after trial commencement), we transitioned to a single-arm design during the course of the study, and unfortunately, 18 participants did not complete the trial. We acknowledge that this change has implications for the interpretation of our findings.

- Regarding the effect size, we employed GPower software with a conservative approach, selecting an effect size of 0.25 for the power analysis. This choice was intentionally lower than the actual observed effect size in our study. The resulting sample size calculation indicated a minimum of 21 for a power of 0.8 and 30 for a power of 0.95. Despite the conservative nature of these choices, we recognize the importance of providing a more detailed explanation of the impact of these decisions on our ability to detect meaningful differences in outcome measures.

- In light of your valuable feedback, we will further elaborate on the implications of our study design changes in the revised manuscript, addressing the specific effect size that a sample size of 30 (and 12, considering the 18 participants who did not complete the study) would have been able to demonstrate. We added this information in the sample size sub-section of the Methods, enhancing the transparency and completeness of our study design rationale: “The determination of sample size for this study involved the utilization of the GPower software. A deliberate underestimation was integrated, employing a conservative approach with an effect size established at 0.25, intentionally lower than the actual expected difference and effect size in our investigation. The analysis was conducted with a significance level of 0.05, and a range of power values from 0.8 to 0.95 was examined, incorporating considerations for five measurements and an assumed correlation of 0.5 between repeated measurements, which underestimated the true effect size and correlation observed in the study. Despite the conservative nature of these choices, the resulting sample size was computed as a minimum of 21 for a power of 0.8 and 30 for a power of 0.95.” We appreciate your guidance and remain committed to ensuring the thoroughness and clarity of our manuscript.

- The change in sample size can be elucidated as follows:

6. How did the analysis deal with the 18 dropouts? It seems surprising that the mean values and standard deviations for each assessment were broadly similar at each time point, despite progressively fewer people included.

- Reply: Thank you for your insightful observation regarding the analysis of dropouts in our study. We appreciate your attention to detail and the opportunity to clarify this aspect of our study. The apparent similarity in mean values and standard deviations for each assessment at different time points despite the progressive reduction in the number of participants is indeed a noteworthy point. We would like to assure you that we have considered this issue in our analysis.

- The handling of dropouts was addressed by utilizing the generalized estimation equation (GEE) in our statistical analysis. GEE is a robust method that accounts for missing data and provides unbiased estimates under the assumption of missing completely at random (MCAR) or missing at random (MAR) (1). In our case, the GEE approach allows us to analyze the available data for each participant at different time points, accommodating the variability in the number of participants across assessments.

- Despite the reduction in the number of participants over time, the GEE methodology enables us to make valid inferences about the trends and changes observed in the outcome measures. We acknowledge that the progressive decrease in the sample size is a limitation of our study, and we emphasized this point in the limitations section of the manuscript, as with your valuable comment: “Another study limitation is attributed to the progressive increase in the number of dropouts. To address the issue of dropouts, we incorporated the GEE into our statistical analysis. The GEE is a robust method that accounts for missing data and provides unbiased estimates under the assumption of missing completely at random (MCAR) or missing at random (MAR) (62). This approach enabled us to analyze the available data for each participant at different time points, accommodating the variability in the number of participants across assessments. Hence, despite the reduction in the number of participants over time, the GEE methodology empowered us to make valid inferences about the trends and changes observed in the outcome measures.”

7. What was the justification for choosing the 200 mg dose of caffeine (and why is this different from the 2.5mg/Kg body weight) in the trial registry?

- Reply: Thank you very much for your precious comment. According to your valuable comment, we revised the manuscript for two issues:

o Justification for deviation from the protocol: The choice of a fixed 200 mg/day dose of caffeine in our clinical trial was primarily driven by practical considerations. At the time of running the study, oral solutions of caffeine were not readily accessible in our country; hence, we decided to use accessible caffeine tablets to proceed with the trial. The use of caffeine tablets, rather than an oral solution, restricted our ability to administer a dose of 2.5 mg/kg. According to your valuable comment, this issue was added to the manuscript to assure transparency and describe any deviations from the protocol: “Furthermore, although the initial protocol was designed for a caffeine dosage of 2.5 mg/kg/day, we were compelled to utilize available caffeine tablets (200 mg/tablet) in the trial due to practical constraints and the unavailability of an oral caffeine solution”.

o Justification for choosing a 200-mg dosage: Notably, 200 mg of caffeine is approximately equivalent to 2.5-3 mg/kg for a 60-70-kg adult. (https://www.sciencedirect.com/science/article/pii/S0149763416300690). Additionally, according to the European Food Safety Authority (EFSA), caffeine dosages up to 200 mg do not raise any safety concerns for non-pregnant adults (https://efsa.onlinelibrary.wiley.com/doi/abs/10.2903/j.efsa.2015.4102). Therefore, we mentioned that in the revised manuscript: “The selection of a 200 mg dosage was rationalized on two grounds: Firstly, 200 mg of caffeine corresponds to 2.5-3 mg/kg for an adult weighing between 60-70 kg. Secondly, according to the European Food Safety Authority (EFSA), caffeine doses of up to 200 mg do not elicit safety concerns in non-pregnant adults.”

8. How were adverse events decided to be “treatment-related” and how did the 3 expert neurologists assess them?

- Reply: Thank you very much for your valuable comment. For an adverse event to be considered treatment-related, we used the “NIA Adverse Event and Serious Adverse Event Guidelines” (https://www.nia.nih.gov/sites/default/files/2018-09/nia-ae-and-sae-guidelines-2018.pdf). According to this guideline, “to classify an adverse event as treatment-related, the neurologists took into account the logical temporal association with the treatment administration, the expected patterns of response, and the exclusion of other factors.” Sincerely, we added this information in the Methods> Safety monitoring section.

9. What “deviations from the protocol” (p4) were there? Please explain all the differences between the protocol and the trial registry (age 20-55 included, weight > 40 kg excluded, EDSS as a primary outcome, etc.)

- Reply: Thank you very much for your valuable comments. We understand the importance of consistency between the study protocol and the trial registry, and we sincerely apologize for any inconvenience this may have caused.

o With regards to the weight criterion, patients weighing more than 40 kg were indeed included in the study. While this criterion was inadvertently forgotten to be written in the previous manuscript version, we have revised the manuscript to specify 'weight > 40 kg' as part of the inclusion criteria.

o In terms of the EDSS, although it was initially chosen as a primary outcome, we did not use it as such due to certain limitations in employing the EDSS as an outcome measure within a short duration, such as 3 months. First, while the EDSS is generally reliable for assessing patients over an extended duration (2), employing 3–6-month confirmed disability progression (sustained increase in EDSS) as a measure of disability might lead to an overestimation of permanent disability levels. This can be problematic for the interpretation of short-term trial results, potentially yielding misleading outcomes (3). Second, studies have suggested that the EDSS may not be as effective in consistently capturing subtle changes in disability for patients with mild or moderate disabilities, primarily due to its limited reliability, especially for lower EDSS scores (2). Additionally, the EDSS has demonstrated limited responsiveness in detecting clinically meaningful changes in disability and has shown limited predictive value in certain contexts (4). In summary, we did not use the EDSS as the primary outcome due to the challenges associated with its application, especially when attempting to measure short-term and minor changes in patients with lower levels of disability, as was the case in our study.

o Regarding the age criteria, our study adhered to an age range of 20-55, which was in accordance with the protocol (age range in our study: 23-55). We appreciate your attention to this discrepancy, and we apologize for any confusion it may have caused. We have since updated the manuscript to accurately reflect the age inclusion criterion.

10. What cut-off was used on trail-making task to exclude cognitive impairment?

- Reply: Thank you very much for your precious comment. We used the Trail Making Test to evaluate cognitive impairment. TMT is a neuropsychological test that has been shown useful in predicting cognitive impairment (5). The TMT consists of two components: TMT-A and TMT-B. The task for each exam is for the individual to draw a line between consecutive circles that are set at random on a page. The TMT-A employs only numbers, while the TMT-B alternates between numbers and letters, forcing the patient to switch between them in a sequential sequence (5). Controversy exists about the best cut-off to determine impaired test results. Ideally, impairment in this test should be defined according to the normative data established based on age, education, ethnicity, and health status (6). Usually, impairment in each cognitive test is defined as “less than 1.5 z scores from the average of the normative population” (7). However, there is no normative data on the TMT test among the Iranian population, and the cutoff (TMT-B> 90 seconds) that we used was based on the previously published normative data (6, 8) and the clinical judgment of an MS specialist with more than 10 years experience. According to your valuable comment, the cut-off used to indicate significant cognitive impairment was added to the manuscript.

11. For the PGIC, did a score of 4 indicate no change and so 1-3 a worsening, or did 1 to 7 indicate progressive improvement?

- Reply: Thank you very much for your precious comment. We acknowledge that different versions of PGIC have been used in the literature. The one we used was similar to that in some previous studies, including https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4623367/, Patients_Global_Impression_of_Change.pdf (chiro.org), https://www.health.mil/Reference-Center/Forms/2015/05/01/Patient-Global-Impression-Change-Scale, Assessing the clinical significance of change scores following carpal tunnel surgery - PMC (nih.gov), and Assessing the clinical significance of change scores recorded on subjective outcome measures - PubMed (nih.gov). In this scoring system, 1 represents “no change” and 7 represents “a great deal better”.

12. The mean age at MS diagnosis = 29.73 and at evaluation = 38.89 (difference 9.16 years), so how can the mean disease duration be only 8.57 years?

- Reply: Thank you very much for your meticulous feedback. The observed difference between the mean age at MS diagnosis (29.73 years) and the mean age at evaluation (38.89 years) compared to the reported mean disease duration (8.57 years) arises from differences in data collection methods. In our study, patients’ ages were obtained from their medical records, which were based on the date of birth recorded on their birth certificates. On the other hand, the age at disease onset and the disease duration were collected through patients’ interviews. While the difference (38.3 years old [29.73+8.57] vs. 38.9 years old) is relatively small (only six months), we fully acknowledge your concern and appreciate your attention to detail. We will consider this feedback for future studies and make efforts to minimize any recording disparities to enhance the uniformity of data collection. Your input is valuable in helping us improve the quality of our research.

- To increase transparency in reporting, we added this issue to the Table caption: “‡ The patients’ age at evaluation were obtained from their medical records, according to their birth certificate. §The patients’ age at MS diagnosis and the disease duration were obtained from in-person interviews.” 

Reviewers' comments:

Reviewer 1:

This study aimed to evaluate its effects on ambulatory performance and health-related quality of life (HR-QoL) of patients with MS (PwMS). The authors concluded that Caffeine may enhance balance, functional mobility, and QoL in PwMS. Being male was associated with a sharper increase in self-reported ambulatory disability over time. The effects of aging on balance get more pronounced over time. Overall, the study is interesting, however, there are some clarifications needed.

- Reply: Dear reviewer,

Thank you very much for your kind words and for providing us with the opportunity to strengthen our research. We sincerely appreciate your precious comments. Having carefully considered the comments and suggestions, we have made all the relevant changes to our manuscript as outlined below in an itemized, point-by-point manner. The revisions requested have been track-changed and highlighted within the manuscript in response.

Keywords:

1. Please add quality of life to the Keywords section.

- Reply: Thank you very much for your comment. We added this keyword in the Keywords section.

Introduction:

2. Line 87-90: Authors stated that this pilot single-arm phase II clinical trial set out to evaluate the potential effectiveness of caffeine ingestion on ambulatory performance (i.e., static and dynamic balance, functional mobility, and patient’s reported ambulatory disability) among PwMS. Ambulatory performance is not appropriate for static balance. Ambulatory performance is mostly related to dynamic balance rather than static balance.

- Reply: Thank you very much for your precious comment and for mentioning this issue. According to your valuable comment, we used the general term "Balance and Mobility" to address all of the outcome measures, including static balance, dynamic balance, ambulatory disability, functional mobility, and gait. We revised the whole manuscript from Title to Conclusion with respect to this issue and changed the term “ambulatory performance” to “balance and mobility”.

Materials & Methods:

3. Line 152. How do you assess the presence of vestibular disorder or cognitive impairment?

- Reply: Thank you very much for your valuable comment.

o For cognitive impairment: We used the Trail Making Test to evaluate cognitive impairment. TMT is a neuropsychological test that has been shown useful in predicting cognitive impairment (5). The TMT consists of two components: TMT-A and TMT-B. The task for each exam is for the individual to draw a line between consecutive circles that are set at random on a page. The TMT-A employs only numbers, while the TMT-B alternates between numbers and letters, forcing the patient to switch between them in a sequential sequence (5). Controversy exists about the best cut-off to determine impaired test results. Ideally, impairment in this test should be defined according to the normative data established based on age, education, ethnicity, and health status (6). Usually, impairment in each cognitive test is defined as “less than 1.5 z scores from the average of the normative population” (7). However, there is no normative data on the TMT test among the Iranian population, and the cutoff (TMT-B> 90 seconds) we used was based on the previously published normative data (6, 8) and the clinical judgment of an MS specialist with more than 10 years experience. According to your valuable comment, the definition used to indicate significant cognitive impairment was added to the manuscript.

o For vestibular disorder: Vestibular disorders were ruled out of in the patient's examinations (cerebellar examinations, nystagmus examination, and head impulse test) no evidence was found in favor of peripheral vestibulopathy. According to your precious comment, we added this to the manuscript.

4. Line 151-152. What cut-off point did you consider for the presence of moderate or severe anxiety assessed by The Hospital Anxiety and Depression Scale-Anxiety subscale?

- Reply: Thank you very much for your valuable comment. The Hospital Anxiety and Depression Scale-Anxiety subscale has a maximum score of 21. We used the Persian-validated translation of this scale (9). This scale is scored as below (10):

o Scores < 7: non-cases

o Scores of 8-10: mild anxiety

o Scores of 11-14: moderate anxiety

o Scores of 15-21: severe anxiety

- Therefore score ≥ 11 was considered moderate to severe anxiety. According to your valuable comment, we added the cut-off in the revised manuscript.

Results:

5. The authors should calculate the effect size for each variable.

- Reply: Thank you very much for your valuable comment. We appreciate your careful review of our manuscript and your insightful comment regarding the calculation of effect sizes for each variable. We would like to clarify that the effect sizes in our study are interpreted differently based on the nature of the data presented in Tables 2, 3, and 4.

- In Table 2, the reported values represent mean differences (Before-After) for each criterion. These differences serve as indicators of the magnitude of change over time within each group, providing a measure of the effectiveness of the intervention.

- Conversely, Tables 3 and 4 present coefficients (β) derived from the generalized estimation equation (GEE) analysis, reflecting the impact of sex (Table 3) and age (Table 4) on the outcome measures during the study period. These coefficients can be considered analogous to effect sizes in the context of regression analysis, representing the estimated change in the criterion for a one-unit change in the predictor variable. Thank you again for your valuable feedback and please let us know if there is anything else we could provide.

Reviewer 2:

This manuscript presents data analysis from a non-randomized, Phase-II, pilot study on evaluating the effectiveness of caffeine ingestion on the balance and gait in MS patients. The topic is of importance, the study was registered as an RCT within the Iranian system and was approved by the respective IRB/Ethics Committee. While the study objectives sound interesting, are important, and are on target, some shortcomings were observed, in regard to abiding by the CONSORT guidelines for conducting and reporting results of high-quality randomized controlled trials (RCTs). Some other (statistical) comments were also provided.

- Reply: Dear reviewer,

Thank you very much for your kind words and for providing us with the opportunity to strengthen our research. We sincerely appreciate your precious comments. Having carefully considered the comments and suggestions, we have made all the relevant changes to our manuscript as outlined below in an itemized, point-by-point manner. The revisions requested have been track-changed and highlighted within the manuscript in response.

Methods:

1. Methods reporting needs some work. An orderly manner is suggested, following CONSORT guidelines, without repeating information, such as Trial Design, Participant Eligibility Criteria and settings, Interventions, Outcomes, sample size/power considerations, Interim analysis, stopping rules, etc. The authors are advised to create separate subsections for each of the possible topics (whichever is necessary), and that way produce a very clear writeup. I see the Authors indeed made an attempt; however, they are advised to write it carefully, following nice examples in the manuscript below: https://www.sciencedirect.com/science/article/pii/S0889540619300010

- Reply: Thank you very much for your precious comment. According to your valuable comment, we divided the method into the following subsections based on the CONSORT guideline and the study you provided:

o Trial design and any changes after trial commencement

o Participants

o Study settings and Ethics statement (sincerely, although this section could be merged with the “Participants” section as per the CONSORT guideline as well as the article you mentioned, it has been divided into two subsections owing to the length of the text.)

o Intervention (this section is newly added).

o Outcomes

o Study measures (Although not explicitly outlined in the CONSORT guidelines, this section is included to offer detailed information on outcome measures that may not be familiar to all readers of the journal.)

o Sample size

o Randomization, sequence generation, allocation concealment mechanism, implementation: not applicable

o Blinding

o Adherence monitoring (Although not explicitly outlined in the CONSORT guidelines, this section is included to offer detailed information on how we assessed patients’ medication compliance and avoidance of caffeinated products.)

o Safety monitoring (While not explicitly specified in the CONSORT guidelines, this section is incorporated to provide comprehensive details on how our safety monitoring team evaluated patients and ensured the identification of any treatment-related adverse events.)

o Statistical analysis

- We omitted the repeated information according to your valuable comment. Additionally, based on the updates in the revised manuscript, the CONSORT checklist (Table S1) was also updated.

2. I am somewhat confused with the design! This is a single-arm, Phase II, but I do not understand (justification not given clearly) behind the administration of placebo initially. Popular Phase-II designs, such as Simon's Phase-II, are often 2-staged. On the contrary, it would have been perfectly OK if a randomized design was considered (which often is much clearer!). Why was that not conducted? Any watertight justification?

- Reply: Thank you for your valuable comment, and we sincerely apologize for any confusion. The decision to transition from a double-armed design to a single-armed study with a placebo run-in was driven by financial constraints encountered during the study. The initial plan involved two arms, with one group receiving caffeine and the other a placebo. However, the projected cost of 2520 placebos (30 patients*84 days) proved financially unfeasible for our university, especially given Zahedan's status as one of the low-income cities in Iran. To address this challenge while mitigating potential bias, we opted for a single-armed trial with a two-week placebo run-in. This decision aimed to balance financial constraints and the need to account for the placebo effect. While not exactly the same, the placebo run-in approach has been utilized by other researchers (e.g., Sustained-release oral fampridine in multiple sclerosis: a randomised, double-blind, controlled trial - PubMed (nih.gov)). In this two-week placebo run-in, only 420 placebos (30 patients*14 days) were required.

- We acknowledge this study's limitation and, in the interest of transparency, have explained the change in the Methods section: “Notably, the initial study protocol involved a double-armed design, with one group receiving caffeine and another group receiving a placebo. However, due to financial constraints encountered during the study, it became necessary to modify the protocol and proceed as a single-armed study. To mitigate potential bias, a two-week placebo run-in stage was incorporated before the investigation.” Furthermore, in the limitations section, we explicitly stated “Although we designed a 2-week placebo run-in stage to reduce the placebo effect, lacking a control group limits the ability to determine whether the observed improvements were due to caffeine or to other factors, such as natural disease progression, placebo effect, or changes in other aspects of participants' lives during the study. Therefore, the results should be interpreted with caution, as they only suggest the “potential” efficacy of caffeine in improving balance and mobility in PwMS rather than a “confirmation of efficacy.”

- Furthermore, recognizing that the study design may be confusing to readers, we have visualized the design in Figure 2 to enhance comprehension.

- Thank you again and please let us know if we can provide any other explanation to increase the study transparency.

3. Sample size/power: The sample size/power statement should reflect the statistical test used (one-sided/two-sided), the significance level (5%?), the corresponding effect size, etc. Even if the trial is not randomized, one may compute using the "desired" change one wants to attain at the end of the study. Even pilot trials need to be conducted with some ballpark number. It should also be described in a separate sub-section.

- Reply: Thank you for your thoughtful and constructive comments on our manuscript. We appreciate your thorough review and have carefully considered your suggestions regarding the sample size and power analysis.

- In response to your comment, we used GPower software to determine the sample size for our study. We employed a conservative approach by selecting an effect size of 0.25, which is lower than the actual difference and effect size observed in our study. Additionally, we set a significance level of 0.05 and explored a range of power values from 0.8 to 0.95. Our analysis included five measurements and assumed a correlation of 0.5 between repeated measurements, which is lower than the actual correlation observed in our study. Despite these conservative choices, the calculated sample size was found to be a minimum of 21 for a power of 0.8 and 30 for a power of 0.95. It's important to note that our effect size and correlation values in the actual study are higher than those used in the power analysis, which would, in turn, require a smaller sample size for equivalent statistical power.

- The change in sample size can be elucidated as follows:

- We understand the importance of providing a comprehensive description of the sample size determination process, including details about the statistical test used, the significance level, and the effect size. Hence, the following paragraph was added to the sample size section of the Methods: “The determination of sample size for this study involved the utilization of the GPower software. A deliberate underestimation was integrated, employing a conservative approach with an effect size established at 0.25, intentionally lower than the actual expected difference and effect size in our investigation. The analysis was conducted with a significance level of 0.05, and a range of power values from 0.8 to 0.95 was examined, incorporating considerations for five measurements and an assumed correlation of 0.5 between repeated measurements, which underestimated the true effect size and correlation observed in the study. Despite the conservative nature of these choices, the resulting sample size was computed as a minimum of 21 for a power of 0.8 and 30 for a power of 0.95.”

4. Statistical Analysis: Based on the (longitudinal) design of the study, the authors justifiably conducted a GEE analysis. Any thoughts, on why a mixed linear model analysis was not conducted (I am not asking authors to do it)?

- Reply: Thank you for your comment regarding the statistical analysis methods employed in our study. We appreciate the opportunity to address your question regarding the choice of the generalized estimation equation (GEE) over a mixed linear model (MLM) analysis in our longitudinal study design. The decision to use GEE instead of MLM was carefully considered based on the following factors:

o Distributional Assumptions: GEE is a robust method for analyzing longitudinal data, especially when the distributional assumptions of MLM, such as normality and homoscedasticity, may not be met. GEE makes fewer assumptions about the distribution of the outcome variables, making it more suitable for non-normally distributed data or when the variance may be non-constant across time points (In our study, some of variables were not normal)

o Population-Averaged Estimates: GEE provides population-averaged estimates, which are often more relevant in epidemiological and clinical research where the focus is on the average response across the population rather than individual-specific changes.

o Model Interpretability: GEE allows for a straightforward interpretation of the regression coefficients, especially when focusing on population-averaged effects. This can enhance the practical significance and communication of the results in the context of our study.

- While MLM could also be a valid approach, the above considerations led us to choose GEE as the more appropriate method for our specific research context.

Results & Conclusions:

5. The authors should check that any statement of significance should be followed by a p-value in the entire Results section. Otherwise, the Results section looks OK.

- Reply: Thank you very much for your valuable comment. In response to your comment, we have carefully examined the Results section and ensured that each statement of significance is accompanied by the corresponding p-value.:

o “Accordingly, PGIC was significantly lower for males than for females at the end of the second week (β: -3.12, 95% CI: -4.42, -1.83, P-value<0.001).”

- We also present the updated Results section with the inclusion of p-values for the Spearman correlation coefficients, as with your precious comment: “Eventually, we assessed the correlations between outcome measures, including the MSWS-12, BBS, TUG, MSIS-29, and PGIC (Figure 4). The results showed significant positive correlations between MSIS-29 and MSWS-12 (r=0.82, p<0.001), MSIS-29 with TUG (r=0.73, p<0.001), and MSWS-12 with TUG (r=0.64, p<0.001) prior to the intervention. In addition, negative correlation coefficients were observed between BBS and MSIS-29 (r=-0.50, p=0.011), MSWS-12 (r=-0.44, p=0.022), and TUG (r=-0.82, p<0.001). The Spearman correlation coefficients for other study time points are shown in Figure 4.”

- Additionally, we added the following tables regarding the “Spearman correlation coefficients along with confidence intervals for the total scores at each timepoint throughout the study” to the supplementary materials (Tables S6 and S7): “Tables S6 and S7 present comprehensive details regarding the Spearman correlation coefficients along with confidence intervals for the total scores at each timepoint throughout the study.”

Correlations

Time BBS TUG PGIC MSWS MSIS

Before Spearman's rho BBS Correlation Coefficient 1.000 -.820** . -.440* -.498*

Sig. (2-tailed) . .000 . .022 .011

N 27 25 0 27 25

TUG Correlation Coefficient -.820** 1.000 . .642** .732**

Sig. (2-tailed) .000 . . .000 .000

N 25 26 0 26 24

PGIC Correlation Coefficient . . . . .

Sig. (2-tailed) . . . . .

N 0 0 0 0 0

MSWS Correlation Coefficient -.440* .642** . 1.000 .822**

Sig. (2-tailed) .022 .000 . . .000

N 27 26 0 28 26

MSIS Correlation Coefficient -.498* .732** . .822** 1.000

Sig. (2-tailed) .011 .000 . .000 .

N 25 24 0 26 26

After 2 weeks Spearman's rho BBS Correlation Coefficient 1.000 -.944** .219 -.748** -.265

Sig. (2-tailed) . .000 .368 .000 .287

N 22 16 19 18 18

TUG Correlation Coefficient -.944** 1.000 -.360 .576* .241

Sig. (2-tailed) .000 . .155 .016 .352

N 16 17 17 17 17

PGIC Correlation Coefficient .219 -.360 1.000 -.353 -.620**

Sig. (2-tailed) .368 .155 . .151 .005

N 19 17 22 18 19

MSWS Correlation Coefficient -.748** .576* -.353 1.000 .595**

Sig. (2-tailed) .000 .016 .151 . .007

N 18 17 18 19 19

MSIS Correlation Coefficient -.265 .241 -.620** .595** 1.000

Sig. (2-tailed) .287 .352 .005 .007 .

N 18 17 19 19 20

After 4 weeks Spearman's rho BBS Correlation Coefficient 1.000 -.860** .448 -.757** -.551*

Sig. (2-tailed) . .000 .082 .001 .022

N 17 13 16 16 17

TUG Correlation Coefficient -.860** 1.000 -.210 .682** .744**

Sig. (2-tailed) .000 . .452 .005 .001

N 13 15 15 15 15

PGIC Correlation Coefficient .448 -.210 1.000 -.416 -.181

Sig. (2-tailed) .082 .452 . .097 .473

N 16 15 18 17 18

MSWS Correlation Coefficient -.757** .682** -.416 1.000 .746**

Sig. (2-tailed) .001 .005 .097 . .000

N 16 15 17 19 18

MSIS Correlation Coefficient -.551* .744** -.181 .746** 1.000

Sig. (2-tailed) .022 .001 .473 .000 .

N 17 15 18 18 19

After 8 weeks Spearman's rho BBS Correlation Coefficient 1.000 -.757* .372 -.826** -.557

Sig. (2-tailed) . .049 .324 .002 .060

N 12 7 9 11 12

TUG Correlation Coefficient -.757* 1.000 .057 .778* .633

Sig. (2-tailed) .049 . .875 .014 .067

N 7 11 10 9 9

PGIC Correlation Coefficient .372 .057 1.000 -.215 -.005

Sig. (2-tailed) .324 .875 . .525 .989

N 9 10 12 11 11

MSWS Correlation Coefficient -.826** .778* -.215 1.000 .691**

Sig. (2-tailed) .002 .014 .525 . .009

N 11 9 11 13 13

MSIS Correlation Coefficient -.557 .633 -.005 .691** 1.000

Sig. (2-tailed) .060 .067 .989 .009 .

N 12 9 11 13 14

After 12 weeks Spearman's rho BBS Correlation Coefficient 1.000 -.690* -.099 -.478 -.185

Sig. (2-tailed) . .040 .800 .162 .610

N 11 9 9 10 10

TUG Correlation Coefficient -.690* 1.000 .017 .610 .600

Sig. (2-tailed) .040 . .965 .081 .088

N 9 11 9 9 9

PGIC Correlation Coefficient -.099 .017 1.000 -.512 -.641

Sig. (2-tailed) .800 .965 . .159 .063

N 9 9 10 9 9

MSWS Correlation Coefficient -.478 .610 -.512 1.000 .781**

Sig. (2-tailed) .162 .081 .159 . .005

N 10 9 9 12 11

MSIS Correlation Coefficient -.185 .600 -.641 .781** 1.000

Sig. (2-tailed) .610 .088 .063 .005 .

N 10 9 9 11 11

**. Correlation is significant at the 0.01 level (2-tailed).

*. Correlation is significant at the 0.05 level (2-tailed).

Confidence Intervals of Spearman's rho

Time Spearman's rho Significance(2-tailed) 95% Confidence Intervals (2-tailed)a,b

Lower Upper

Before BBS - TUG -.820 .000 -.920 -.621

BBS - PGIC .c . . .

BBS - MSWS -.440 .022 -.708 -.060

BBS - MSIS -.498 .011 -.752 -.116

TUG - PGIC .c . . .

TUG - MSWS .642 .000 .328 .828

TUG - MSIS .732 .000 .456 .879

PGIC - MSWS .c . . .

PGIC - MSIS .c . . .

MSWS - MSIS .822 .000 .630 .919

After 2 weeks BBS - TUG -.944 .000 -.981 -.838

BBS - PGIC .219 .368 -.275 .621

BBS - MSWS -.748 .000 -.903 -.419

BBS - MSIS -.265 .287 -.660 .244

TUG - PGIC -.360 .155 -.724 .160

TUG - MSWS .576 .016 .116 .832

TUG - MSIS .241 .352 -.286 .656

PGIC - MSWS -.353 .151 -.711 .151

PGIC - MSIS -.620 .005 -.843 -.217

MSWS - MSIS .595 .007 .179 .830

After 4 weeks BBS - TUG -.860 .000 -.959 -.574

BBS - PGIC .448 .082 -.077 .779

BBS - MSWS -.757 .001 -.913 -.404

BBS - MSIS -.551 .022 -.821 -.080

TUG - PGIC -.210 .452 -.662 .353

TUG - MSWS .682 .005 .246 .889

TUG - MSIS .744 .001 .359 .912

PGIC - MSWS -.416 .097 -.754 .097

PGIC - MSIS -.181 .473 -.607 .326

MSWS - MSIS .746 .000 .416 .902

After 8 weeks BBS - TUG -.757 .049 -.964 .020

BBS - PGIC .372 .324 -.407 .838

BBS - MSWS -.826 .002 -.955 -.433

BBS - MSIS -.557 .060 -.862 .044

TUG - PGIC .057 .875 -.608 .675

TUG - MSWS .778 .014 .214 .953

TUG - MSIS .633 .067 -.077 .917

PGIC - MSWS -.215 .525 -.732 .458

PGIC - MSIS -.005 .989 -.616 .610

MSWS - MSIS .691 .009 .209 .903

After 12 weeks BBS - TUG -.690 .040 -.932 -.023

BBS - PGIC -.099 .800 -.727 .620

BBS - MSWS -.478 .162 -.857 .237

BBS - MSIS -.185 .610 -.740 .520

TUG - PGIC .017 .965 -.668 .686

TUG - MSWS .610 .081 -.114 .911

TUG - MSIS .600 .088 -.130 .908

PGIC - MSWS -.512 .159 -.883 .253

PGIC - MSIS -.641 .063 -.919 .063

MSWS - MSIS .781 .005 .323 .943

a. Estimation is based on Fisher's r-to-z transformation.

b. Estimation of standard error is based on the formula proposed by Fieller, Hartley, and Pearson.

c. Cannot be computed because at least one of the variables is constant.

Thank you again for guiding us to enhance the quality of our manuscript.

6. Conclusions should stress that findings are only based on this pilot trial (using an Iranian population), and allude to future larger trials/studies, combining other populations, to understand the effectiveness of caffeine intake.

- Reply: Thank you very much for your precious comment. According to your valuable comment in the Conclusion section, we emphasized that: “These results solely stem from this single-arm initial trial, which involved an Iranian population. They hint at the need for future definitive randomized placebo-controlled trials with larger sample sizes and longer follow-ups, involving diverse populations while taking age and sex into account. This approach is necessary to enhance our understanding of the efficacy of caffeine consumption and confirm its impact on balance and mobility performance of PwMS.”

References:

1. Fitzmaurice GM, Laird NM. Generalized linear mixture models for handling nonignorable dropouts in longitudinal studies. Biostatistics. 2000;1(2):141-56.

2. Meyer-Moock S, Feng YS, Maeurer M, Dippel FW, Kohlmann T. Systematic literature review and validity evaluation of the Expanded Disability Status Scale (EDSS) and the Multiple Sclerosis Functional Composite (MSFC) in patients with multiple sclerosis. BMC Neurol. 2014;14:58.

3. Kalincik T, Cutter G, Spelman T, Jokubaitis V, Havrdova E, Horakova D, et al. Defining reliable disability outcomes in multiple sclerosis. Brain. 2015;138(11):3287-98.

4. Kragt JJ, Thompson AJ, Montalban X, Tintoré M, Río J, Polman CH, et al. Responsiveness and predictive value of EDSS and MSFC in primary progressive MS. Neurology. 2008;70(13 Pt 2):1084-91.

5. Stebbins GT. Chapter 27 - Neuropsychological Testing. In: Goetz CG, editor. Textbook of Clinical Neurology (Third Edition). Philadelphia: W.B. Saunders; 2007. p. 539-57.

6. Tombaugh TN. Trail Making Test A and B: normative data stratified by age and education. Archives of clinical neuropsychology. 2004;19(2):203-14.

7. Eshaghi A, Riyahi-Alam S, Roostaei T, Haeri G, Aghsaei A, Aidi MR, et al. Validity and reliability of a Persian translation of the Minimal Assessment of Cognitive Function in Multiple Sclerosis (MACFIMS). The Clinical neuropsychologist. 2012;26(6):975-84.

8. Giovagnoli AR, Del Pesce M, Mascheroni S, Simoncelli M, Laiacona M, Capitani E. Trail making test: normative values from 287 normal adult controls. Ital J Neurol Sci. 1996;17(4):305-9.

9. Montazeri A, Vahdaninia M, Ebrahimi M, Jarvandi S. The Hospital Anxiety and Depression Scale (HADS): translation and validation study of the Iranian version. Health Qual Life Outcomes. 2003;1:14.

10. Stern AF. The Hospital Anxiety and Depression Scale. Occupational Medicine. 2014;64(5):393-4.

Attachment

Submitted filename: Response to Reviewers.pdf

Decision Letter 1

Antony Bayer

2 Jan 2024

Potential efficacy of caffeine ingestion on balance and mobility in patients with multiple sclerosis: Preliminary evidence from a single-arm pilot clinical trial

PONE-D-23-18917R1

Dear Dr. Amirifard,

Thank you for your detailed and considered responses to the review comments following your initial submission of this paper. We’re pleased to inform you that your revised manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Antony Bayer

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: (No Response)

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: (No Response)

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: (No Response)

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: (No Response)

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #2: (No Response)

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

Acceptance letter

Antony Bayer

2 Feb 2024

PONE-D-23-18917R1

PLOS ONE

Dear Dr. Amirifard,

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now being handed over to our production team.

At this stage, our production department will prepare your paper for publication. This includes ensuring the following:

* All references, tables, and figures are properly cited

* All relevant supporting information is included in the manuscript submission,

* There are no issues that prevent the paper from being properly typeset

If revisions are needed, the production department will contact you directly to resolve them. If no revisions are needed, you will receive an email when the publication date has been set. At this time, we do not offer pre-publication proofs to authors during production of the accepted work. Please keep in mind that we are working through a large volume of accepted articles, so please give us a few weeks to review your paper and let you know the next and final steps.

Lastly, if your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Professor Antony Bayer

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File

    The supplementary material file includes Table S1: The CONSORT checklist of information to include when reporting a pilot trial, Table S2: The 12-item Multiple Sclerosis Walking Scale (MSWS-12), Table S3: The Berg Balance Scale (BBS), Figure S1: The Timed Up-and-Go (TUG), Table S4: The Multiple Sclerosis Impact Scale (MSIS-29), Table S5: The Patient’s Global Impression of Changes (PGIC), and Figure S2: The impact of sex on each criterion during the study period. Supplementary Materials (containing the CONSORT checklist): Supplementary Materials

    (DOCX)

    S2 File. IRCT registry, Original Language: Clinical Trial Protocol Registry (Persian).

    (PDF)

    S3 File. IRCT registry, English Language: Clinical Trial Protocol Registry (English).

    (PDF)

    S4 File. Study Protocol, English Language: Study Protocol (Persian).

    (PDF)

    S5 File. Study Protocol, English Language: Study Protocol (English).

    (PDF)

    Attachment

    Submitted filename: Response to Reviewers.pdf

    Data Availability Statement

    The ethics committee of the Zahedan University of Medical Sciences, Zahedan, Iran has imposed restrictions on sharing any dataset from this study. The university and the affiliated hospitals have strict policies that prohibit the sharing of such data to protect patient confidentiality and comply with legal and ethical regulations. Data requests may be sent to the ethics committee. Sincerely, for additional information or clarification regarding our data sharing policies and asking for the dataset, here is the contact information for our ethics committee: • Email: zaums.research@gmail.com • The University Central Headquarters Call Center: (+98) 54 33372116 • Address: Zahedan University of Medical Sciences, Khalij -e- Fars Blvd, Zahedan, Sistan & Balouchestan Province, Islamic Republic of Iran. • Website: https://enresearch.zaums.ac.ir/.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES