Skip to main content
Frontiers in Oncology logoLink to Frontiers in Oncology
. 2019 Jun 4;9:453. doi: 10.3389/fonc.2019.00453

Analysis of Survival Curves: Statistical Methods Accounting for the Presence of Long-Term Survivors

Vera Damuzzo 1, Laura Agnoletto 2, Luca Leonardi 3, Marco Chiumente 4, Daniele Mengato 5, Andrea Messori 6,*
PMCID: PMC6558210  PMID: 31231609

Abstract

Some anti-cancer treatments (e. g., immunotherapies) determine, on the long term, a durable survival in a small percentage of treated patients; in graphical terms, long-term survivors typically give rise to a plateau in the right tail of the survival curve. In analysing these datasets, medians are unable to recognize the presence of this plateau. To account for long-term survivors, both value-frameworks of ASCO and ESMO have incorporated post-hoc corrections that upgrade the framework scores when a survival plateau is present. However, the empiric nature of these post-hoc corrections is self-evident. To capture the presence of a survival plateau by quantitative methods, two approaches have thus far been proposed: the milestone method and the area-under-the-curve (AUC) method. The first approach identifies a long-term time-point in the follow-up (“milestone”) at which survival percentages are extracted. The second approach, which is based on the measurement of AUC of survival curves, essentially is the rearrangement of previous methods determining mean lifetime survival; similarly to the milestone method, the application of AUC can be “restricted” to a pre-specified time-point of the follow-up. This Mini-Review examines the literature published on this topic. The main characteristics of these two methods are highlighted along with their advantages and disadvantages. The conclusion is that both the milestone method and the AUC method are able to capture the presence of a survival plateau.

Keywords: survival, adult, Kaplan-Meier estimator, survival plateau, area under the curve, median survival

Introduction

In the past 5 years, a renewed interest has been focused on qualitative methods that grade the clinical value of anti-cancer treatments. These methods, that are frequently denoted as value frameworks, have been widely debated in the oncologists' community, and their advantages and disadvantages have been highlighted (13). Two main frameworks have emerged in this context: the ASCO value framework (4) developed by the American Society of Clinical Oncology and the ESMO Magnitude of Clinical Benefit scale (5), developed by the European Society of Medical Oncology. Both methods generate a qualitative score: the ASCO score ranges from 0 to 100 while the ESMO one ranges from 1 to 5 (and from C to A for curative treatments); the higher the score, the better the clinical value of the treatment concerned. The ASCO score should be seen as a qualitative method, not a quantitative one, because it synthetizes a series of qualitative assessments concerning incremental benefit and toxicity and also because it is not based on any units of measurements.

More recently, the demonstration that some anti-cancer treatments (e.g., immunotherapies) provide a durable survival in a small percentage of treated patients (6, 7) has raised a difficult methodological question: in fact, these long-term survivors determine a survival plateau in the right tail of the Kaplan-Meier curve, but in analysing these datasets medians are unable to recognize the presence of this plateau. In both approaches of ASCO and ESMO, post-hoc corrections of the scores (810) have been proposed whereby scores are upgraded when a survival plateau is present. However, the empiric nature of these post-hoc corrections is self-evident.

In this context, the need to develop quantitative methods that capture the presence of a survival plateau has emerged. Thus, far, two quantitative methods have been proposed: the milestone method (1113) and the AUC method (14). The first method requires to identify a “long-term” time-point in the survival curve (called “milestone”) that must be longer than median survival; then, the quantitative index of the analysis is directly represented by the survival percentage at the pre-determined milestone. The second method, which has only been reported in abstract form (14), is the rearrangement of a well-known approach based on mean lifetime survival that has been used for many decades (15, 16). The main advantage of these two methods lies in their ability to capture the presence of a survival plateau and to consequently quantify the “weight” of the plateau within the whole shape of the survival curve.

Table 1 presents a glossary that explains the main parameters employed in the analysis of survival curves.

Table 1.

Glossary of the main technical parameters.

Parameter Abbreviation Units Meaning
Hazard ratio HR Adimensional The event risk in the comparison between the experimental group and the control group
Mean survival time MST Time The survival estimate per patient from time 0 to infinity; MST is calculated from the area under the survival curve, using a model-based estimation method (e.g., proportional hazard model, models of Weibull, Gomperz, etc.)
Restricted mean survival time RMST Time The survival estimate per patient calculated from time 0 to a pre-determined time-point in the follow-up; calculations are the same as those employed for MST
Mean lifetime survival MLS Time Synonymous for MST
Area under the survival curve AUC Time The same as MST, but has a model independent nature because of its estimation according to the trapezoidal rule
Restricted area under the survival curve rAUC Time The same as AUC, but is calculated from time 0 to a pre-determined time-point
Milestone survival Percentage (from 0 to 100%) or rate (from 0 to 1) The survival value calculated at a predetermined time-point according to the Kaplan-Meier curve

The Milestone Method

Evaluations of survival at “milestones” is the analysis of survival rates at a fixed time point of follow-up. This approach has been proposed as a means of capturing the right tail of long-term survivors (11, 13) even though, in some applications, also short-term milestones have been employed.

In more detail, milestone survival is defined as the Kaplan-Meier survival probability at a time point defined a priori (e.g., 60 months). Milestone survival analysis is a cross-sectional assessment of the survival data at the prespecified time point using Kaplan-Meier probabilities. The choice of the milestone requires careful consideration because it often represents a clinically meaningful benchmark. It is important to stress that the milestone does not necessarily represent long-term survival. It may represent a time point beyond which the treatment benefit is thought to remain stable. To determine the time point of interest, sufficient follow-up duration is generally required to contribute enough information to the milestone analysis.

An advantage of the milestone method lies in its intuitive ability of easily describing the presence of durable survival. However, milestones have an intrinsic limitation because they depend on the specific time point chosen for the analysis. Hence, unlike medians, milestones cannot be generalized when different treatments aimed at different disease conditions are best described by different milestone time points.

The milestone method has two different purposes: (a) in the analysis of randomized controlled trials, the ratio between the milestone rate in the treatment group and the milestone rate in the controls captures the incremental effect of the experimental treatment (particularly when treated patients have durable survival unlike the controls, but medians are similar across the two patient groups); (b) in the analysis of one-arm trials, the milestone rate compares different treatments across different trials studying the same disease condition and is typically focused on long term effects especially when medians are similar to one another and do not capture the tail of the survival curves.

The AUC Method

The ratio between the area under the survival curve (AUC) and median survival has been proposed as a new parameter to capture the presence of a plateau in survival curves (14). The AUC (as well as its proxy represented by the mean lifetime survival) has already been described many years ago (15, 16). Nowadays, its practical calculation is facilitated by the availability of websites that handle the mathematics of graphical curves.

Mean lifetime survival differs from restricted AUC (rAUC) because rAUC is calculated from time zero until the last time-point in the follow-up, whereas mean lifetime survival includes the extrapolation of survival from the last time-point to infinity, e.g., according to the equations of Gompertz or Weibull (15, 16). One advantage of both AUC and rAUC is that, like in traditional pharmacokinetic analyses (17, 18), the trapezoidal rule permits to reliably estimate their values without any mathematical complexity. Despite the different purposes, survival analysis (handled through AUC) and pharmacokinetic analysis share exactly the same theory and the same computational tools.

Like the milestone method, also the AUC method has two different purposes: (a) in the analysis of randomized controlled trials, the ratio between the AUC values in the treatment group and in the controls resembles the hazard ratio, and therefore tends to reassess the information provided by the hazard ratio; (b) in the analysis of one-arm trials, the ratio between AUC and median is calculated; a long-term survival plateau is shown by the finding that the AUC is greater than the median (ratio > 1); the more this ratio is greater than 1, the greater the “impact” that long-term survivors have on the entire survival pattern. This numerical property of the ratio AUC/median reflects an intuitive concept because a survival plateau is always a prolongation of survival in comparison with the absence of a plateau.

One advantage of AUC lies in its computational simplicity owing to the model-independent nature [as opposed to other methods, e.g., those of Gompertz and Weibull, that are model-dependent and require a complex mathematical analysis (15, 16)]. Another strength is in operational terms because specific references are available about the software tools that apply this method (see Appendix).

On the other hand, the ratio AUC/median retains a limitation of median in that this new parameter cannot be calculated when residual survival is more than 50%; in these cases, however, determining the presence of a plateau in the right tail of the curve makes little sense.

Finally, it should be noted that, in analysing a single survival curve, the AUC is equivalent to the parameter called mean survival time (MST); in more detail, AUC0−>last time point is identical to restricted MST (RMST), while AUC0−>infinity is identical to (unrestricted) MST, otherwise denoted as mean lifetime survival (MLS). In the past, model-dependent methods have typically been used for the mathematical computation of MST, RMST, MLS from the curve; in the present paper, AUC is intended to be estimated by model-independent methods.

Comparative Performance Between the Milestone Method and the AUC Method

We have re-analyzed the two data sets presented by Hellman et al. (12) to describe their approach based on milestones: (1) Comparison of progression-free survival between gefitinib vs. paclitaxel+carboplatin in 1,217 patients with advanced pulmonary adenocarcinoma; (2) Comparison of overall survival between ipilimumab+dacarbazine vs. placebo+dacarbazine in 60 patients with advanced melanoma. The first dataset was proposed by Hellman et al. as a typical example where “the median is the message” (12); the second was as an example where “the milestone is the message” (12).

Table 2 summarizes the results obtained by analysing these two data sets according to both the milestone method and the AUC method. Figure 1 shows how the 4 curves were digitalized for the calculation of AUC.

Table 2.

Results of the comparison of performance between the milestone method and the AUC method.

Trial End-point Median (months) Milestone survival rate (%) AUC (months)
Experimental arm Control arm Experimental arm Control arm Experimental arm Control arm
Gefitinib vs. paclitaxel + carboplatin in patients with metastatic non-small-cell lung cancer Progression-free survival 9.1 5.4 65.8% at 8 months 25.9% at 8 months 11.3* 6.86*
Ipilimumab + dacarbazine vs. placebo + dacarbazine in patients with metastatic melanoma Overall survival 11.2 9.1 18.2% at 5 years 8.8% at 5 years 23.1** 16.1**
*

The ratio of the two values of AUC (0.61) is not similar to the hazard ratio reported in the original trial (0.38; 95% CI, 0.26–0.56).

**

The ratio of the two values of AUC (0.70) is similar to the hazard ratio reported in the original trial (0.69; 95% CI, 0.57–0.84).

All survival information was drawn from the article by Hellman et al. (12); the values of median survival were those of the original trials.

Figure 1.

Figure 1

(A) Progression-free survival curves reported by Fukuoka et al. (21) in patients treated with gefinitib (upper curve) or chemotherapy (lower curve); the series of red circles shows how the automated digitalizer identified the points that define the upper boundary of the area under the curve comprised between 0 and 23 months (area = 11.31 months). (B) Overall survival curves reported by Maio et al. (22) in patients treated with ipilimumab (upper curve) or chemotherapy (lower curve); also these two curves were analyzed by the automated analyser to determine the two values of area under the curve. Survival expressed in percentage and time in months.

Although these two methods pursue a similar objective, their results are clearly not comparable with one another. This is because the milestone method and the AUC methods are based on two different perspectives of analysis that are incompatible.

In the milestone analysis of the first data set, placing the milestone between the values of the two medians (i.e., at 8 months) maximized the magnitude of the survival difference expressed in percentage points; in this case “the median is the message,” and in fact medians adequately represented the better result obtained in the experimental arm. As indicated by the shape of the two curves, placing the milestone at a longer follow-up in this dataset would not provide any advantage (data not shown).

In the AUC analysis of the first dataset, the most interesting finding is that the ratio AUC/median was 1.24 in the gefitinib arm vs. 1.27 in the paclitaxel+carboplatin arm. Although, as expected, AUC was longer than median in both arms, the former exceeded the latter by <30% (relative difference). This finding supports the conclusion that the median is the message. In this dataset, the ratio of the two AUC was not similar to the hazard ratio reported in the trial owing to reasons that are difficult to explain; compared with the shape of the two curves, the hazard ratio of 0.38 seems in fact to be excessively favorable.

In the milestone analysis of the second data set, placing the milestone on the long term (i.e., at 60 months) maximized the magnitude of the survival difference expressed in percentage points; in this case “the milestone is the message,” and in fact milestones adequately represented the better long-term result obtained in the experimental arm (survival plateau) compared with the controls. As indicated by the shape of the two curves, placing the milestone at a long follow-up produced a more meaningful clinical result than that indicated by medians.

In the AUC analysis of the second dataset, the most interesting finding is that the ratio AUC/median was 2.06 in the ipilimumab+dacarbazine arm vs. 1.77 in the placebo+dacarbazine arm. These findings, where AUC tends to be approximately twice the median, are in keeping with the conclusion that the median is not the message. In this dataset, the ratio of the two AUC was similar to the hazard ratio.

Discussion

Capturing a survival plateau can be useful for a variety of reasons. For many decades, median has been the standard parameter for summarizing outcomes in oncology, and its role is undisputed owing to the long-standing scientific reputation and ubiquitous use. Despite this, the availability of additional parameters for analysing survival curves (such as the milestone or the ratio AUC/median) can be justified to fill the gap represented by the inability of medians to account for the final portion of survival curves.

A preliminary experience has already accumulated on the milestone method whereas less experience is available with the AUC method. In particular, the ratio between AUC and median seems to be a parameter of remarkable interest, even though a more thorough confirmation will be required based on an adequate number of survival curves.

Finally, considerable literature has accumulated in the past years about other innovative statistical methods that can improve the interpretation of survival curves (2325). Since the approach based on RMST is the one most commonly employed (23, 25), some specific comments on this technique are warranted. In comparing two survival curves (experimental arm vs. control arm), the RMST, which can be considered an area under the survival curve, can be used either as the ratio of two RMSTs (i.e., RMSTexperimental/RMSTcontrol) or their difference (i.e., RMSTexperimental - RMSTcontrol). In the first case (23, 24), the ratio is adimensional (like the HR) and in fact can essentially be interpreted as an HR (without any substantial difference). In the second case (23), the RMST difference has the units of time (like the difference of the two medians) and can be seen as an improved estimate of the survival gain. This is because the RMST is influenced by the entire shape of the survival curve whereas the median has the drawback of being a punctiform parameter that does not reflect the whole survival curve.

One important point is that the RMST has been developed as a tool that quantitatively improves the representation of survival curves but not as a parameter that captures the presence of long-term survivors. In contrast, both the milestones method (12) and our method based on the ratio AUC/median (14) have been specifically designed to capture the presence of a long-term survival plateau in the right tail of the curve. This is the reason why the present review has been focused more on milestones and on the ratio AUC/median than on the RMST.

The recent paper published by Wang et al. (25) has combined the advantages of milestones and of RMST, but unfortunately the ratio of the two RMSTs has been expressed in an atypical reciprocal form, that prevents a sound interpretation of the findings published by these authors (26, 27).

Author Contributions

All authors listed have made a substantial, direct and intellectual contribution to the work, and approved it for publication.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix

In the application of the AUC method, the survival curves can be digitalized using a web-based program (19); in this phase of the analysis, the above software identifies (for each survival curve) the y-vs.-x data pairs describing the Kaplan-Meier graph. Thereafter, the AUC can be calculated for each curve using the trapezoidal rule, as in pharmacokinetics (17). A subroutine written in Microsoft Excel is available for this purpose (20).

References

  • 1.Cherny NI, de Vries EGE, Dafni U, Garrett-Mayer E, McKernin SE, Piccart M, et al. Comparative assessment of clinical benefit using the ESMO-magnitude of clinical benefit scale version 1.1 and the ASCO value framework net health benefit score. J Clin Oncol. (2019) 37:336–49. 10.1200/JCO.18.00729 [DOI] [PubMed] [Google Scholar]
  • 2.Cheng S, McDonald EJ, Cheung MC, Arciero VS, Qureshi M, Jiang D, et al. Do the American Society of Clinical Oncology value framework and the European society of Medical Oncology magnitude of clinical benefit scale measure the same construct of clinical benefit? J Clin Oncol. (2017) 35:2764–71. 10.1200/JCO.2016.71.6894 [DOI] [PubMed] [Google Scholar]
  • 3.Slomiany M, Madhavan P, Kuehn M, Richardson S. Value frameworks in oncology: comparative analysis and implications to the pharmaceutical industry. Am Health Drug Benefits. (2017) 10:253–60. [PMC free article] [PubMed] [Google Scholar]
  • 4.ASCO Value Framework Available online at: https://www.asco.org/practice-guidelines/cancer-care-initiatives/value-cancer-care (accessed December 17, 2018).
  • 5.Cherny NI, Dafni U, Bogaerts J, Latino NJ, Pentheroudakis G, Douillard JY, et al. ESMO-magnitude of clinical benefit scale version 1.1. Ann Oncol. (2017) 28:2340–66. 10.1093/annonc/mdx310 [DOI] [PubMed] [Google Scholar]
  • 6.Schnipper LE, Schilsky RL. Are value frameworks missing the mark when considering long-term benefits from immuno-oncology drugs? JAMA Oncol. (2018) 4:333–4. 10.1001/jamaoncol.2017.4339 [DOI] [PubMed] [Google Scholar]
  • 7.Ben-Aharon O, Magnezi R, Leshno M, Goldstein DA. Association of immunotherapy with durable survival as defined by value frameworks for cancer care. JAMA Oncol. (2018) 4:326–32. 10.1001/jamaoncol.2017.4445 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Cherny NI, de Vries EGE; European society for medical oncology magnitude of clinical benefit scale working group. Improving on tail-of-the-curve evaluation with the American Society of Clinical Oncology value framework. JAMA Oncol. (2018) 4:1437 10.1001/jamaoncol.2018.3264 [DOI] [PubMed] [Google Scholar]
  • 9.Vivot A, Créquit P, Porcher R. Improving on tail-of-the-curve evaluation with the American Society of Clinical Oncology value framework. JAMA Oncol. (2018) 4:1437–8. 10.1001/jamaoncol.2018.3289 [DOI] [PubMed] [Google Scholar]
  • 10.Ben-Aharon O, Goldstein DA. Improving on tail-of-the-curve evaluation with the American Society of Clinical Oncology value framework-reply. JAMA Oncol. (2018) 4:1438–9. 10.1001/jamaoncol.2018.3295 [DOI] [PubMed] [Google Scholar]
  • 11.Chen TT. Milestone survival: a potential intermediate endpoint for immune checkpoint inhibitors. J Natl Cancer Inst. (2015) 107:djv156. 10.1093/jnci/djv156 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Hellmann MD, Kris MG, Rudin CM. Medians and milestones in describing the path to cancer cures: telling “Tails.” JAMA Oncol. (2016) 2:167–8. 10.1001/jamaoncol.2015.4345 [DOI] [PubMed] [Google Scholar]
  • 13.BlumenthaL GM, Zhang L, Zhang H, Kazandjian D, Khozin S, Tang S, et al. Milestone analyses of immune checkpoint inhibitors, targeted therapy, and conventional therapy in metastatic non-small cell lung cancer trials a meta-analysis. JAMA Oncol. (2017) 3:e171029. 10.1001/jamaoncol.2017.1029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Damuzzo V, Agnoletto L, Leonardi L, Chiumente M, Mengato D, Messori A. Area under the survival curve: a novel parameter to account for the presence of long-term survivors (Abstract 03). In Italian Society of Clinical Pharmacy and Therapeutics. “AVVICINARE” Project, Firenze. Available online at: http://www.sifact.info/joomla30/index.php/la-formazione/presentazioni-relazioni-ad-eventi (accessed September 15, 2018).
  • 15.Messori A, Becagli P, Trippoli S. Median versus mean lifetime survival in the analysis of survival data. Haematologica. (1997) 82:730. [PubMed] [Google Scholar]
  • 16.Messori A, Trippoli S, Becagli P, Tendi E. Pharmacoeconomic profile of paclitaxel as a first-line treatment for patients with advanced ovarian carcinoma. A lifetime cost-effectiveness analysis. Cancer. (1996) 78:2366–73. [DOI] [PubMed] [Google Scholar]
  • 17.Gibaldi M, Perrier D. Pharmacokinetics. 2nd ed. New York, NY: Marcel Dekker; (1982). [Google Scholar]
  • 18.Matucci M, Messori A, Donati-Cori G, Longo G, Vannini S, Morfini M, et al. Kinetic evaluation of four factor VIII concentrates by model-independent methods. Scand J Haematol. (1985) 34:22–8. 10.1111/j.1600-0609.1985.tb00739.x [DOI] [PubMed] [Google Scholar]
  • 19.Rohatgi A. WebPlotDigitizer. Available online at: https://automeris.io/WebPlotDigitizer (accessed December 30, 2018).
  • 20.Anonymous Find the Area Under a Curve in Excel. https://www.youtube.com/watch?v=U6EWnEsdR5A (accessed December 31, 2018).
  • 21.Fukuoka M, Wu YL, Thongprasert S, Sunpaweravong P, Leong SS, Sriuranpong V, et al. Biomarker analyses and final overall survival results from a phase III, randomized, open-label, first-line study of gefitinib versus carboplatin/paclitaxel in clinically selected patients with advanced non-small-cell lung cancer in Asia (IPASS). J Clin Oncol. (2011) 29:2866–74. 10.1200/JCO.2010.33.4235 [DOI] [PubMed] [Google Scholar]
  • 22.Maio M, Grob JJ, Aamdal S, Bondarenko I, Robert C, Thomas L, et al. Five-year survival rates for treatment-naive patients with advanced melanoma who received ipilimumab plus dacarbazine in a phase III trial. J Clin Oncol. (2015) 33:1191–6. 10.1200/JCO.2014.56.6018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Trinquart L, Jacot J, Conner SC, Porcher R. Comparison of treatment effects measured by the hazard ratio and by the ratio of restricted mean survival times in oncology randomized controlled trials. J Clin Oncol. (2016) 34:1813–9. 10.1200/JCO.2015.64.2488 [DOI] [PubMed] [Google Scholar]
  • 24.Huang B, Kuan PF. Comparison of the restricted mean survival time with the hazard ratio in superiority trials with a time-to-event end point. Pharm Stat. (2018) 17:202–13. 10.1002/pst.1846 [DOI] [PubMed] [Google Scholar]
  • 25.Wang ZX, Wu HX, Xie L, Wang YN, Yang LP, He MM, et al. Correlation of milestone restricted mean survival time ratio with overall survival hazard ratio in randomized clinical trials of immune checkpoint inhibitors - a systematic review and meta-analysis. JAMA Netw Open. (2019) 2:e193433. 10.1001/jamanetworkopen.2019.3433 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Messori A. Correlation between hazard ratio and ratio of mean survival times (Comment, May 4, 2019). JAMA Netw Open. Available online at https://jamanetwork.com/journals/jamanetworkopen/fullarticle/2732339?resultClick=3#comment-wrapper (accessed May 9, 2019).
  • 27.Damuzzo V. Direct and inverse correlation between restricted mean survival times and hazard ratio (Comment, May 10, 2019). JAMA Netw Open. Available online at https://jamanetwork.com/journals/jamanetworkopen/fullarticle/2732339?resultClick=3#comment-wrapper (accessed May 10, 2019).

Articles from Frontiers in Oncology are provided here courtesy of Frontiers Media SA

RESOURCES