Abstract
Time-to-event analysis is frequently used in medical research to investigate potential diseasemodifying treatments in neurodegenerative diseases. Potential treatment effects are generally evaluated using the logrank test, which has optimal power and sensitivity when the treatment effect (hazard ratio) is constant over time. However, there is generally no prior information as to how the hazard ratio for the event of interest actually evolves. In these cases, the logrank test is not necessarily the most appropriate to use. When the hazard ratio is expected to decrease or increase over time, alternative statistical tests such as the Fleming-Harrington test, provide a better sensitivity. An example of this comes from a large, five-year randomised, placebo-controlled prevention trial (GuidAge) in 2854 community-based subjects making spontaneous memory complaints to their family physicians, which evaluated whether treatment with EGb761® can modify the risk of developing AD. The primary outcome measure was the time to conversion from memory complaint to Alzheimer's type dementia. Although there was no significant difference in the hazard function of conversion between the two treatment groups according to the preplanned logrank test, a significant treatment-by-time interaction for the incidence of AD was observed in a protocol-specified subgroup analysis, suggesting that the hazard ratio is not constant over time. For this reason, additional post hoc analyses were performed using the Fleming-Harrington test to evaluate whether there was a signal of a late effect of EGb761®. Applying the Fleming-Harrington test, the hazard function for conversion to dementia in the placebo group was significantly different from that in the EGb761® treatment group (p = 0.0054), suggesting a late effect of EGb761®. Since this was a post hoc analysis, no definitive conclusions can be drawn as to the effectiveness of the treatment. This post hoc analysis illustrates the interest of performing another randomised clinical trial of EGb761® explicitly testing the hypothesis of a late treatment effect, as well as of using of better adapted statistical approaches for long term preventive trials when it is expected that prevention cannot have an immediate effect but rather a delayed effect that increases over time.
Key words: Alzheimer disease, dementia, prevention, logrank test, Fleming-Harrington test
Introduction
Time-to-event analysis is frequently used in medical research to investigate the influence of potential risk factors or protective factors, and notably treatments, on the occurrence of binary clinical outcomes. One area of medicine where this type of analysis is widely used is for evaluating potential disease-modifying treatments in neurodegenerative diseases, where the therapeutic goal is to slow down the progression of loss of function. Examples include evaluating treatments that improve survival in amyotrophic lateral sclerosis (1, 2), treatments that delay initiation of L-DOPA therapy in Parkinson’s disease (3) and treatments that delay the diagnosis of Alzheimer’s disease (AD) in patients with age-related cognitive impairment (4., 5., 6.).
In these clinical trials, potential treatment effects are generally evaluated using the logrank test. However, there is generally no prior information as to how the hazard ratio for the event of interest actually evolves over time, and it is possible that the treatment effect is not constant over time, either waning as the disease progresses or increasing over time and often appearing after a certain lag time (delayed onset of action). In these cases, the logrank test has a reduced sensitivity to detect treatment effects and is not necessarily the most appropriate to use. When a decreasing, transient, increasing or delayed effect is expected or suspected alternative statistical tests are justified (7). These include the family of Fleming-Harrington tests, where the contribution of each event of interest to the overall treatment effect may vary over time according to the value of two parameters p and q. The logrank test is equivalent to a Fleming-Harrington test with p = 0 and q = 0. A value of p larger than 1 makes the test more sensitive to early events (more weight given to early difference between treatments) and a value of q larger than 1 makes it more sensitive to late events.
An example of a treatment that may have a delayed effect on cognitive decline in dementia is EGb761®, a standardised extract produced from the dried leaves of the maidenhair tree (Ginkgo biloba), for which there is an indication of a beneficial effect on cognitive symptoms in elderly subjects with and without dementia (8., 9., 10., 11., 12., 13.). A study of the elderly women included in the EPIDOS registry (14) suggested that the risk of development of AD after seven years was reduced in patients receiving EGb761® for at least two years. More recently, twenty-year follow-up data from the PAQUID cohort, a large community based cohort study of cognitive decline, also suggested that long-term exposure to EGb761® is associated with a slower rate of cognitive decline measured with the MMSE (15).
We have recently completed a large, five-year randomised, placebo-controlled secondary prevention trial (GuidAge) in 2854 community-based subjects with spontaneous memory complaints to their family physicians to evaluate whether treatment with EGb761® can modify the risk of developing AD (16). This was a multicentre, double blind, randomised, parallel group, placebo-controlled study performed in a community setting in France between 2002 and 2009. The trial was registered on ClinicalTrials.gov (identifier: NCT00276510) and the methodology of the study has been described in detail elsewhere (17). The study enrolled 2 854 community-dwelling subjects aged >70 years without dementia at inclusion, who spontaneously consulted their family physician for a memory complaint. Subjects were randomised to treatment with EGb761® 120 mg bid or matching placebo for a period of up to 5 years. Participants underwent annual cognitive, functional and dementia evaluations at which diagnostic criteria for AD were assessed systematically. The primary efficacy endpoint was the time to conversion from memory complaint to Alzheimer’s type dementia, assessed on the basis of the DSM-IV (18) and NINCDS-ADRDA (19) diagnostic criteria.
During the study, 134 subjects developed AD over five years of treatment, 61/1406 in the EGb761® group and 73/1414 in the placebo group. According to the preplanned logrank test, there was no significant difference in the hazard function of conversion between the two treatment groups (p = 0.3058) (16). It should be noted that the conversion rate observed in the study was unexpectedly low, raising the question as to whether the GuidAge study should be considered an inconclusive study or a negative one. The a priori power calculations of the study were based on an anticipated total of 338 events over five years with a hazard ratio that was constant over time. This would have provided a power of 80% to detect a 25% difference in conversion rate between the placebo and EGb761® treatment arms using the logrank test. In fact, only 134 events occurred, reducing the actual power of the study to 42%. However, in a protocol-specified subgroup analysis in which conversion rates were compared for each year of the study, a significant treatment-by-time interaction for the incidence of AD was observed (p = 0.043), suggesting that the hazard ratio was not constant over time (Figure 1). Interestingly, the evolution over time of the hazard ratio for the treatment effect observed in the GuidAge study is similar to that previously reported in the EPIDOS study (14).
Figure 1.

Rates of conversion to dementia in the GuidAge study as a function of treatment duration. Hazard ratios are presented with their 95% confidence intervals. Grey columns: placebo group; black columns: EGb761® group
Given the significant interaction and similarities of evolution across studies it was hypothesized that if EGb761 has an effect it is a late one. To verify this working hypothesis, a post hoc Fleming-Harrington test was performed on the entire set of data. For this analysis, the value of the q parameter was set a priori to three and that of the p parameter to 0. Applying the Fleming-Harrington test, the hazard function for conversion to dementia in the placebo group was significantly different from that in the EGb761® treatment group (p = 0.0054), suggesting a late effect of EGb761®. A similar significant difference was observed in the per protocol population (p = 0.0137), as well as in sensitivity analyses in which the event definition was enlarged to include retrieved drop outs (p = 0.0029) or conversion to mixed dementia (p = 0.0012). Moreover, significant between-group differences were also observed using other parameterisations of the Fleming-Harrington test, where q is set to other values >1 and p maintained at 0 (Table 1).
Table 1.
Influence of the value of the q parameter on the p value of the Fleming Harrington test in the GuidAge study. The larger the value of q the larger the weight assigned to late events
| p and q parameters | 0 and 0 (Logrank) | 0 and 1 | 0 and 2 | 0 and 3 | 0 and 4 | 0 and 5 |
|---|---|---|---|---|---|---|
| Statistic | 1.030 | 1.964 | 2.562 | 2.814 | 2.882 | 2.858 |
| p value | 0.304 | 0.049 | 0.010 | 0.004 | 0.003 | 0.002 |
Since this was a post hoc analysis, no definitive conclusions can be drawn as to the effectiveness of the treatment, but the findings do at least challenge the null hypothesis of no difference between treatment groups and provide a signal for a delayed effect of EGb761® on cognitive decline. This signal would need to be confirmed in another prospective study specifically testing the hypothesis of a late treatment effect using an appropriate statistical test.
Effective prevention strategies for AD, which is the most frequent neurodegenerative disease affecting the elderly, currently affecting 35 million people worldwide (20), would represent a powerful public health strategy for reducing the burden of disease (21). However, to date, no prevention strategy has been unequivocally demonstrated to be successful (22). Clinical trials to assess such strategies are costly and time-consuming, and for this reason, appropriate design, including the determination of the sample size and the statistical analysis to be employed, is critical.
In clinical trials using time-to-event analysis, there is generally no prior information as to how the hazard ratio for the event of interest may be expected to evolve over time. However, the statistical testing needs to be specified before the start of the trial. The Fleming-Harrington test allows appropriate testing of different patterns of hazard, but the appropriate parameterisation of the test (setting the values of p and q) will depend upon the expected evolution of the hazard ratio over time (high p parameter and low q value for an early effect and the opposite for a late effect). We have recently proposed a two-step strategy for testing treatment effects in which the best matching value of q can first be determined and the appropriate Fleming-Harrington test then implemented (23, 24). This strategy does not entail an unreasonable inflation of the sample size.
In conclusion, in clinical trials using time-to-event analysis in neurodegenerative diseases, the hazard ratio may evolve over time and this needs to be taken into account in the design of the study. This is illustrated by a post hoc analysis of data from the GuidAge study that provides a signal of the efficacy of long-term treatment with EGb761® in delaying the onset of AD. Potential late effects should be taken into consideration in the planned duration and sample size of future prevention studies and in the choice of statistical approach.
Acknowledgments
This study has been funded by IPSEN.
Conflict of Interest Disclosures
BS reports personal fees from Ipsen, during the conduct of the study and personal fees from Ipsen, outside the submitted work. SA has received consultancy fees, speaker’s fees or research funding from Beaufour-Ipsen, Eisai Inc., Eli Lilly, Lundbeck A/S, Nestle, Novartis, Pierre Fabre, and Servier. JFD reports grants from Ipsen, during the conduct of the study and grants from Ipsen, outside the submitted work. BD reports personal fees from Eli Lilly, grants from Pfizer, outside the submitted work. PR has received consultancy fees from Lundbeck and Ipsen. PJO, GB, FP, FP and JT received consultancy fees from Ipsen Pharma for their participation in the Guidage study’s Scientific Committee. PG and HMF are employees of Ipsen Pharma who funded the GuidAge Study. BV has received research grants or consultancy fees from Lilly, Lundbeck, Medivation, MSD, Nestle, Roche, Sanofi, Servier, Otsuka, Pierre Fabre, Regeneron, Biogen, and Taurx Therapeutics.
Ethical standards
Each subject was informed (both verbally and by receipt of written information) of the aims, methods, anticipated benefits, potential hazards and the discomfort the study may entail, and their right to abstain from participating in the study and to withdraw their consent at any time without affecting their medical care. Written consent was obtained at the time of screening for the study and before to perform all additional tests planned in the protocol amendments. Consent forms were personally signed and dated, by both the subject and the investigator, as a confirmation of consent. Subject information sheets, informed consent forms and protocol were approved by the Independent Ethics Committee (IEC) of Toulouse, France (CCPPRB Toulouse I - Comité Consultatif de Protection des Personnes se Prêtant à des Recherche Biomédicales de Toulouse I).
References
- 1.Miller RG, Munsat TL, Swash M, Brooks BR. Consensus guidelines for the design and implementation of clinical trials in ALS. World Federation of Neurology committee on Research. J Neurol Sci. 1999;169:2–12. doi: 10.1016/s0022-510x(99)00209-9. PubMed PMID: 10540001. [DOI] [PubMed] [Google Scholar]
- 2.Lacomblez L, Bensimon G, Leigh PN, Guillet P, Meininger V. Dose-ranging study of riluzole in amyotrophic lateral sclerosis. Amyotrophic Lateral Sclerosis/Riluzole Study Group II. Lancet. 1996;347:1425–1431. doi: 10.1016/s0140-6736(96)91680-3. [DOI] [PubMed] [Google Scholar]
- 3.Connolly BS, Lang AE. Pharmacological treatment of Parkinson disease: a review. Jama. 2014;311:1670–1683. doi: 10.1001/jama.2014.3654. 10.1001/jama.2014.3654 PubMed PMID: 24756517. [DOI] [PubMed] [Google Scholar]
- 4.Dodge HH, Zitzelberger T, Oken BS, Howieson D, Kaye J. A randomized placebocontrolled trial of Ginkgo biloba for the prevention of cognitive decline. Neurology. 2008;70:1809–1817. doi: 10.1212/01.wnl.0000303814.13509.db. 10.1212/01.wnl.0000303814.13509.db PubMed PMID: 18305231, PMCID 2639649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Shumaker SA, Legault C, Kuller L, Rapp SR, Thal L, et al. Conjugated equine estrogens and incidence of probable dementia and mild cognitive impairment in postmenopausal women: Women’s Health Initiative Memory Study. Jama. 2004;291:2947–2958. doi: 10.1001/jama.291.24.2947. 10.1001/jama.291.24.2947 PubMed PMID: 15213206. [DOI] [PubMed] [Google Scholar]
- 6.Lyketsos CG, Breitner JC, Green RC, Martin BK, Meinert C, et al. Naproxen and celecoxib do not prevent AD in early results from a randomized controlled trial. Neurology. 2007;68:1800–1808. doi: 10.1212/01.wnl.0000260269.93245.d2. 10.1212/01.wnl.0000260269.93245.d2 PubMed PMID: 17460158. [DOI] [PubMed] [Google Scholar]
- 7.Garès V, Andrieu S, Dupuy JF, Savy N (in press) On the use of Fleming- Harrington’s test in prevention trials. Biometrika.
- 8.Wang BS, Wang H, Song YY, Qi H, Rong ZX, et al. Effectiveness of standardized ginkgo biloba extract on cognitive symptoms of dementia with a six-month treatment: a bivariate random effect meta-analysis. Pharmacopsychiatry. 2010;43:86. doi: 10.1055/s-0029-1242817. 10.1055/s-0029-1242817 PubMed PMID: 20104449. [DOI] [PubMed] [Google Scholar]
- 9.Weinmann S, Roll S, Schwarzbach C, Vauth C, Willich SN, Effects of Ginkgo biloba in dementia: systematic review and meta-analysis. BMC Geriatr, 2010, 14 [DOI] [PMC free article] [PubMed]
- 10.Yang Z, Li WJ, Huang T, Chen JM, Zhang X. Meta-analysis of Ginkgo biloba extract for the treatment of Alzheimer’s disease. Neural Regen Res. 2011;6:1125–1129. [Google Scholar]
- 11.Janssen IM, Sturtz S, Skipka G, Zentner A V, Garrido M, et al. Ginkgo biloba in Alzheimer’s disease: a systematic review. Wien Med Wochenschr. 2011;160:539–546. doi: 10.1007/s10354-010-0844-8. 10.1007/s10354-010-0844-8 [DOI] [PubMed] [Google Scholar]
- 12.Grassel E. [Effect of Ginkgo-biloba extract on mental performance. Doubleblind study using computerized measurement conditions in patients with cerebral insufficiency]. Fortschr Med. 1992;110:73–76. [PubMed] [Google Scholar]
- 13.Grass-Kapanke B, Busmane A, Lasmanis A, Hoerr R, Kaschel R. Effects of Ginkgo Biloba special extract EGb 761® in very mild cognitive impairment (vMCI) Neuroscience & Medicine. 2011;2:48–56. 10.4236/nm.2011.21007 [Google Scholar]
- 14.Andrieu S, Gillette S, Amouyal K, Nourhashemi F, Reynish E, et al. Association of Alzheimer’s disease onset with ginkgo biloba and other symptomatic cognitive treatments in a population of women aged 75 years and older from the EPIDOS study. J Gerontol A Biol Sci Med Sci. 2003;58:372–377. doi: 10.1093/gerona/58.4.m372. 10.1093/gerona/58.4.M372 PubMed PMID: 12663701. [DOI] [PubMed] [Google Scholar]
- 15.Amieva H, Meillon C, Helmer C, Barberger-Gateau P, Dartigues JF. Ginkgo biloba extract and long-term cognitive decline: a 20-year follow-up population-based study. Plos One. 2013;8:e52755. doi: 10.1371/journal.pone.0052755. 10.1371/journal.pone.0052755 PubMed PMID: 23326356, PMCID 3543404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Vellas B, Coley N, Ousset PJ, Berrut G, Dartigues JF, et al. Long-term use of standardised ginkgo biloba extract for the prevention of Alzheimer’s disease (GuidAge): a randomised placebo-controlled trial. Lancet Neurol. 2012;11:851–859. doi: 10.1016/S1474-4422(12)70206-5. 10.1016/S1474-4422(12)70206-5 PubMed PMID: 22959217. [DOI] [PubMed] [Google Scholar]
- 17.Vellas B, Andrieu S, Ousset PJ, Ouzid M, Mathiex-Fortunet H. The GuidAge study: methodological issues. A 5-year double-blind randomized trial of the efficacy of EGb 761 for prevention of Alzheimer disease in patients over 70 with a memory complaint. Neurology. 2006;67:S6–11. doi: 10.1212/wnl.67.9_suppl_3.s6. PubMed PMID: 17101932. [DOI] [PubMed] [Google Scholar]
- 18.American Psychiatric Association. Diagnostic and Statistical Manual for Mental Disorders. Washington DC, USA: American Psychiatric Association, 1994.
- 19.McKhann G, Drachman D, Folstein M, Katzman R, Price D, et al. Clinical diagnosis of Alzheimer’s disease: report of the NINCDS-ADRDA Work Group under the auspices of Department of Health and Human Services Task Force on Alzheimer’s Disease. Neurology. 1984;34:939–944. doi: 10.1212/wnl.34.7.939. 10.1212/WNL.34.7.939 PubMed PMID: 6610841. [DOI] [PubMed] [Google Scholar]
- 20.Alzheimer’s Disease International. World Alzheimer Report, 2009.
- 21.Doraiswamy PM, Xiong GL. Pharmacological strategies for the prevention of Alzheimer’s disease. Expert Opin Pharmacother. 2006;7:1–10. doi: 10.1517/14656566.7.1.1. 10.1517/14656566.7.1.1 PubMed PMID: 16370917. [DOI] [PubMed] [Google Scholar]
- 22.Williams JW, Plassman BL, Burke J, Benjamin S, Preventing Alzheimer’s disease and cognitive decline. Evid Rep Technol Assess (Full Rep), 2010, 1–727 [PMC free article] [PubMed]
- 23.Garès V, Andrieu S, Dupuy J-F, Savy N. A comparison of the constant piecewise weighted logrank and Fleming-Harrington tests. Electronic J Stat. 2015;8:841–860. 10.1214/14-EJS911 [Google Scholar]
- 24.Gares V, Andrieu S, Dupuy JF, Savy N.An omnibus test for several hazard alternatives in prevention randomized controlled clinical trials. Stat Med 2015;34:541–557. [DOI] [PubMed]
