Skip to main content
British Journal of Cancer logoLink to British Journal of Cancer
. 2013 Jan 29;108(3):699–707. doi: 10.1038/bjc.2013.14

Study populations for period analyses of cancer survival

L Jansen 1,*, T Hakulinen 2, H Brenner 1
PMCID: PMC3593560  PMID: 23361050

Abstract

Background:

Period analysis is increasingly used to compute long-term cancer survival, as it provides better prediction of survival of newly diagnosed patients than traditional cohort analysis. However, the patient population to which period survival estimates best pertain to and which should be described in a study is less obvious.

Methods:

Using Finnish Cancer Registry data on 23 common cancer sites, age-standardized period estimates of 5-, 10-, 15-, and 20-year relative survival were computed for each 2-, 5-, and 10-year calendar period in 1954–2003 and compared with survival estimates for two cohorts by means of mean, mean absolute and mean squared differences: a full cohort of all patients potentially contributing some data to the survival analysis and a restricted cohort of patients diagnosed in the period of interest.

Results:

In most computations, survival estimates for the full cohorts were on average closer to the period estimates for the majority of cancer sites. For 10-year survival, results were less obvious with respect to the mean difference. However, mean squared and mean absolute differences were smaller for the majority of cancers when using the full cohort.

Conclusion:

Our results suggest that the full cohort should be described in reports of period survival analysis.

Keywords: epidemiologic methods, cancer registries, survival analysis, period analysis, study population


Analyses of long-term survival are a key component of monitoring progress against cancer. Such monitoring should be as up-to-date as possible. However, with traditional cohort-based survival analyses, even the most up-to-date long-term cancer survival estimates pertain to patients diagnosed many years ago and followed up for many years since then. Period analysis, first introduced in 1996 (Brenner and Gefeller, 1996), enables in overcoming this problem by restricting the analysis to the survival experience in the most recent years for which the data of patients diagnosed in various calendar years are available. For example, a cancer registry that has, in 2012, complete data on cancer incidence and mortality follow-up up to the year 2010 might derive a cohort estimate of 5-year survival for patients diagnosed in 2001–2005. However, this estimate would essentially reflect the level of cancer care of ∼10 years ago. A better prediction of the survival of newly diagnosed cancer patients can be obtained by a period analysis for the 2006–2010 period by restricting the analysis to the survival experience (of patients diagnosed in 2001–2010) in 2006–2010. It has been demonstrated by extensive empirical investigation that period analysis provides better predictions of the survival of newly diagnosed patients than traditional cohort analysis (Brenner and Hakulinen, 2002a; Brenner et al, 2002b; Talback et al, 2004; Ellison, 2006). The method has therefore been increasingly used in cancer survival studies in recent years (e.g. Verdecchia et al, 2007; Sankaranarayanan et al, 2010; Coleman et al, 2011; van de Schans et al, 2011).

A question that requires specific attention in the reporting of results from the period analysis of cancer survival is the study population to which the survival estimates pertain to. This study population is commonly described with respect to sociodemographic and clinical characteristics in one or more introductory tables. For example, in a comparison of survival estimates from two countries, the study populations of the countries should be compared in an introductory table with respect to factors like age, sex, cancer type, and stage. Such a table will help in interpreting the findings of a study and enable a more detailed comparison of results across articles. In studies using cohort analysis, which refers to a specific cohort of patients, there is no doubt that this very cohort is the study population to be described. In studies using period analysis, the relevant study population is less obvious.

There appear to be at least two ‘natural candidates' that have both been reported in studies including period analysis (Ellison et al, 2007; Gondos et al, 2007; Verdecchia et al, 2007; Quaglia et al, 2009; Redaniel et al, 2009; Pollack et al, 2011; Chen et al, 2012), showing that the investigation of the appropriate study population is a relevant issue in enhancing comparability across studies. The first one consists of all patients who potentially contribute some data to the survival analysis (‘full cohort'). For illustration, consider a period analysis for the period 1999–2003, as illustrated in Figure 1 (bold frame). In this example, the full cohort would include all patients diagnosed in 1994–2003, even though some of them make only very limited or no contributions to the survival analysis. For example, patients diagnosed in 1994 would only contribute some survival experience in the 5th year of follow-up if they were still alive in 1999. In case of improvements in survival over time, survival of the 1994–2003 cohort might be substantially lower than survival of the 1999–2003 cohort, whose survival expectation is aimed to be approximated by period analysis. The second ‘natural candidates' for the relevant study population to be described would include patients diagnosed in the period of interest (here: 1999–2003) only (‘restricted cohort'). Although survival of the cohort diagnosed in a certain period is often well approximated by period analysis for this period, survival of this cohort may typically end up being somewhat better than that estimated by period analysis in case of ongoing improvement in survival (Brenner and Hakulinen, 2002a; Brenner et al, 2002b). Therefore, period estimates of survival may often lie between the survival expectations of the full and the restricted cohort.

Figure 1.

Figure 1

Years of diagnosis and years of follow-up included in various types of 5-year survival analyses. (i) period analyses for 1999–2003: data in bold frame. (ii) full cohort 1994–2003: all grey-shaded cells. (iii) restricted cohort 1999–2003: dark grey-shaded cells only. The numbers within cells indicate years since diagnosis.

In this study, we aim to assess which study population should typically be described along with the presentation of period survival estimates by investigating whether the full or restricted cohort has a survival experience that is closer to the period survival estimate.

Materials and methods

Our analysis is based on data from the Finnish Cancer Registry that is well known for its very long time series of high quality cancer registration. In Finland, cancer registration is mandatory by law, and both registration and mortality follow-up are virtually complete. For this analysis, patients registered in 1954–2003, that is, within half a century, with 1 of the 23 most common forms of cancer (those with on average >100 cases per year) and followed with respect to mortality by the end of 2008 were included. For each type of cancer, we first derived a period estimate of 5-year survival for the time period 1999–2003 (from the data included in the bold frame in Figure 1) and compared it with 5-year survival of the corresponding full cohort (diagnosed in 1994–2003, derived from data included in all grey-shaded cells in Figure 1) and of the restricted cohort (diagnosed in 1999–2003, derived from data included in dark grey-shaded cells only in Figure 1). We then extended this type of comparison to each of the nine 5-year time windows from 1959–1963 to 1999–2003, and compared the period estimates of 5-year survival for those periods with 5-year survival for the corresponding full cohorts (1954–1963 to 1994–2003) and for the corresponding restricted cohorts (1959–1963 to 1999–2003). We calculated mean differences and mean squared differences of 5-year period survival estimates from 5-year survival of both cohorts. Mean differences quantify to what extent the cohorts on average have lower or higher survival compared with the period survival estimates. Mean squared differences reflect average absolute differences and also take differences in random variation into account. Random variation should typically be smaller (due to larger numbers of patients) for survival estimates for the full cohorts than for survival estimates for the restricted cohorts. In additional analysis, we also computed the mean absolute differences to estimate absolute differences without taking random variation into account.

With improving prognosis over time for many forms of cancer, survival rates over longer time periods, such as 10 years, become of increasing interest, and the advantages of period analysis over traditional cohort-based survival analysis in terms of better predictions of survival are even larger for such longer follow-up times (Brenner and Hakulinen, 2002b; Brenner et al, 2002b). We therefore repeated the analyses for 10-year survival, starting with the period 1994–1998 as outlined in Figure 2, and proceeding with all seven 5-year periods from 1964–1968 to 1994–1998 (with full and restricted cohorts from 1954–1968 to 1984–1998 and 1964–1968 to 1994–1998, respectively). The more restricted range of time periods included for the analyses of 10-year survival is due to the fact that the 1964–1968 period is the first time period for which a 10-year period survival estimate and a full cohort estimate of 10-year survival can be calculated with the database including patients from 1954 on. Likewise, the 1994–1998 period is the most recent period for which 10-year survival can be calculated for the corresponding full and restricted cohorts with the database including follow-up until the end of 2008.

Figure 2.

Figure 2

Years of diagnosis and years of follow-up included in various types of 10-year survival analyses. (i) period analyses for 1994–1998: data in bold frame. (ii) full cohort 1984–1998: all grey-shaded cells. (iii) restricted cohort 1994–1998: dark grey-shaded cells only. The numbers within cells indicate years since diagnosis.

To investigate whether the appropriate study population for introductory tables may be different in case of even longer survival times, the analysis was repeated for 15- and 20-year survival computed on five 5-year periods from 1969–1973 to 1989–1993 and on three 5-year periods from 1974–1978 to 1984–1988, respectively. To investigate whether results change when longer or short calendar periods of interest are used, the analysis on 5- and 10-year survival was repeated on 2-year and 10-year calendar periods. For the 2-year calendar period, 5-year relative survival was estimated from 22 periods (1960–1961 to 2002–2003) and 10-year relative survival was estimates from 17 periods (1965–1966 to 1997–1998). For 10-year calendar periods, computations were based on four (1964–1973 to 1994–2003) and three (1969–1978 to 1989–1998) periods.

According to standard practice in population-based cancer registration, relative rather than observed survival was assessed. Relative survival was calculated as the ratio of observed survival of cancer patients divided by expected survival in the general population (Henson and Ries, 1995). The latter was calculated from period-specific population life tables according to the so-called Ederer II method (Ederer and Heise, 1959; Hakulinen et al, 2011). Owing to major changes in the age distribution of cancer cases over time, all survival analyses were age-adjusted to the International Standard Cancer Populations proposed by Corazziari et al (2004). All analyses were performed by the statistical software system SAS, version 9.2, using publicly available macros for cohort and period analysis of relative survival (Brenner et al, 2002a).

Results

Table 1 shows the development of 5-year relative survival for the 23 most common forms of cancer within half a century from 1954–2003. For all included cancers, a total number of more than 5000 cases were diagnosed, that is, the mean annual number of diagnoses was >100. With >90 000 cases each, the most common cancers were cancers of the lung and breast, followed by colorectal, prostate and stomach cancer with ∼60 000 cases each. Case numbers were ∼20 000 for cancers of the pancreas, bladder, corpus uteri, kidney, oral cavity, and ovaries as well as for non-Hodgkin lymphoma, leukaemia, and skin melanoma, and between 5000 and 15 000 cases for all other cancers.

Table 1. The 23 most common forms of cancer in Finland in 1954–2003: total number of cases (N) and age-adjusted 5-year relative survival for the earliest (1954–1958) and the most recently diagnosed cohort (1999–2003) of patients.

 
 
 
5-year relative survivala
 
 
 
19542003
19541958
19992003
 
Cancer ICD 10 N PE (SE) PE (SE) Difference
Oral cavity
0–14
17215
66.3 (2.4)
64.4 (1.4)
−1.9
Oesophagus
15
11306
2.1 (0.4)
10.8 (1.2)
8.7
Stomach
16
59808
6.0 (0.3)
26.1 (0.8)
20.1
Colorectal
18–21
62094
21.6 (1.1)
59.8 (0.6)
38.2
Liver
22
6711
0.4 (0.4)
7.8 (1.0)
7.4
Gallbladder
23
5981
3.5 (1.3)
10.4 (1.8)
6.9
Pancreas
25
24482
1.9 (0.5)
4.2 (0.5)
2.3
Larynx
32
6587
37.8 (2.8)
59.3 (2.9)
21.5
Trachea, lung , and bronchus
33–34
92293
3.8 (0.3)
11.2 (0.4)
7.4
Skin melanoma
43
16241
36.2 (5.2)
82.0 (1.1)
45.8
Female breast
50
91048
48.1 (1.7)
86.3 (0.5)
38.2
Cervix
53
12586
38.4 (2.7)
58.2 (2.4)
19.8
Corpus uteri
54
21483
47.0 (2.9)
83.3 (0.9)
36.3
Ovarian
56
16552
19.4 (2.3)
41.6 (1.1)
22.2
Prostate
61
61374
23.5 (1.9)
87.8 (0.7)
64.3
Kidney
64
18388
14.4 (2.1)
58.6 (1.0)
44.2
Bladder
67
22203
26.7 (2.4)
75.9 (1.0)
49.2
Brain and nervous system
71–72
10281
3.1 (0.5)
16.2 (0.9)
13.1
Thyroid
73
9667
34.8 (3.3)
78.3 (1.8)
43.5
Hodgkin lymphoma
81
5389
11.0 (2.0)
66.4 (3.8)
55.4
Non-Hodgkin lymphoma
82–85
21099
18.2 (2.3)
56.5 (0.8)
38.3
Multiple myeloma
90
8714
6.9 (1.7)
34.7 (1.4)
27.8
Leukaemias 91–96 16989 8.0 (1.4) 42.8 (1.1) 34.8

Abbreviations: PE=point estimate; SE=standard error.

a

All survival estimates were age-adjusted to the International Standard Cancer Populations proposed by Corazziari et al (2004).

For all cancers except cancer of the oral cavity, 5-year relative survival was <50% among patients diagnosed in 1954–1958. For the majority of cancers, substantial increases in age-adjusted 5-year relative survival were observed during this 50-year time interval. With increases of 64.3, 55.4, 49.2, 45.8, 44.2 and 43.5 per cent units, increases were strongest and exceeded 0.8 per cent units per year on average for prostate cancer, Hodgkin lymphoma, bladder, skin melanoma, kidney, and thyroid cancer. By contrast, no or only very minor increases (<10 % units overall) were seen for cancers of the oral cavity, pancreas, gallbladder, liver, lung, and oesophagus. For the latter five cancers, age-adjusted 5-year relative survival remained <12% even for cases diagnosed in 1999–2003. Average differences between the most recent and earliest period across cancer sites showed that 1-year survival increased by 21.5 % units (data not shown). With respect to temporal trends in conditional survival, 4-year survival for patients who have already survived 1 year increased on average by a comparable amount (21.8 % units). One-year survival conditional on 1-, 2-, 3-, and 4-year survival increased by 12.8, 12.7, 6.3, and 6.8 % units.

Table 2 shows 5-year relative survival for the 1999–2003 period compared with 5-year relative survival of the corresponding ‘full cohorts' and ‘restricted cohorts' of patients diagnosed in 1994–2003 and 1999–2003, respectively. With only one exception (ovarian cancer), 5-year relative survival of the full cohorts was very close to the period estimate of 5-year relative survival. Differences were <2 % units for 22 of 23 cancers and <1 % unit for 16 cancers. For a majority of 17 cancers, larger differences were seen between 5-year relative survival of restricted cohorts and the period estimates. Latter differences exceeded 2 % units for five cancers, and were particularly large for prostate cancer, Hodgkin and non-Hodgkin lymphoma and breast cancer (+6.21, 4.47, 3.78, and 3.03 % units), respectively, that is, cancers for which major recent increases in survival were observed. For all but six cancers, 5-year relative survival of the restricted cohorts of patients diagnosed in 1999–2003 exceeded period estimates of 5-year relative survival for the 1999–2003 period.

Table 2. Five-year relative survival for the 1999–2003 period compared to 5-year relative survival of patients diagnosed in 1994–2003 (‘full cohort') and patients diagnosed in 1999–2003 (‘restricted cohort').

 
5-year relative survivala
Change from
Difference from period 19992003
Cancer Period 19992003 PE (SE) Full cohort 19942003 PE (SE) Restricted cohort 19992003 PE (SE) 19941998 to 1999–2003 Full cohort 19942003 Restricted cohort 19992003
Stomach
27.7 (0.8)
26.3 (0.6)
26.1 (0.8)
−0.5
−1.39
−1.61
Oral cavity
63.3 (1.4)
64.4 (1.0)
64.4 (1.4)
0.3
1.09
1.16
Oesophagus
10.9 (1.3)
10.7 (0.8)
10.8 (1.2)
0.2
−0.18
−0.10
Gallbladder
11.8 (1.7)
10.3 (1.2)
10.4 (1.8)
0.3
−1.53
−1.35
Trachea, lung and bronchus
11.3 (0.4)
11 (0.3)
11.2 (0.4)
0.4
−0.36
−0.12
Larynx
57.6 (2.8)
59.3 (2.0)
59.3 (2.9)
0.5
1.72
1.65
Pancreas
3.4 (0.4)
3.7 (0.3)
4.2 (0.5)
1.1
0.32
0.83
Multiple myeloma
34.7 (1.4)
34.0 (1.0)
34.7 (1.4)
1.3
−0.69
0.04
Liver
7.0 (1.0)
7.1 (0.7)
7.8 (1.0)
1.4
0.05
0.80
Brain and nervous system
15.2 (0.8)
15.4 (0.6)
16.2 (0.9)
1.7
0.18
1.03
Cervix
56.9 (2.5)
57.2 (1.7)
58.2 (2.4)
1.9
0.30
1.32
Bladder
73.4 (1.0)
74.9 (0.7)
75.9 (1.0)
2.2
1.53
2.54
Thyroid
77.1 (1.8)
77.2 (1.3)
78.3 (1.8)
2.2
0.07
1.16
Kidney
57.9 (1.0)
57.5 (0.7)
58.6 (1.0)
2.3
−0.38
0.72
Leukaemias
40.8 (1.2)
41.5 (0.8)
42.8 (1.1)
2.7
0.65
1.93
Skin melanoma
80.0 (1.2)
80.5 (0.8)
82.0 (1.1)
3.3
0.49
1.94
Colorectal
58.2 (0.6)
58.2 (0.4)
59.8 (0.6)
3.4
0.00
1.60
Corpus uteri
81.4 (0.9)
81.6 (0.7)
83.3 (0.9)
3.6
0.19
1.93
Ovarian
43.6 (1.2)
39.7 (0.8)
41.6 (1.1)
3.7
−3.88
−1.93
Female breast
83.2 (0.5)
84.1 (0.3)
86.3 (0.5)
4.7
0.89
3.03
Hodgkin lymphoma
61.9 (3.9)
63.1 (2.7)
66.4 (3.8)
6.1
1.21
4.47
Non-Hodgkin lymphoma
52.7 (0.9)
52.7 (0.6)
56.5 (0.8)
8.0
0.07
3.78
Prostate 81.6 (1.0) 82.4 (0.7) 87.8 (0.7) 16.2 0.84 6.21

Abbreviations: PE=point estimate; SE=standard error.

Finnish Cancer Registry, 23 most common forms of cancer.

a

All survival estimates were age-adjusted to the International Standard Cancer Populations proposed by Corazziari et al (2004).

Regarding period estimates of 10-year relative survival in 1994–1998, differences from survival of both the full (diagnosed in 1984–1998) and restricted cohorts (diagnosed in 1994–1998) were somewhat larger and exceeded 2 % units for 9 out of 23 cancers in each case (Table 3). For a majority of 13 cancers, the 10-year relative survival of the restricted cohort exceeded the period estimates. By far, the largest difference (+17.3% units) was seen for prostate cancer, with 10-year relative survival of 62.9% for the restricted cohort, compared with the period estimate of 45.6%. By contrast, 10-year relative survival of the full cohorts was lower than the period estimates for a clear majority of 19 out of 23 cancers. The full and restricted cohort estimates were closer to the period estimates for 10 and 13 cancers, respectively. Differences between the period estimate and the full and restricted cohort estimate, respectively, were not related to the change in survival within the investigated period from 1984 to 1998.

Table 3. Ten-year relative survival for the 1994–1998 period compared to 10-year relative survival of patients diagnosed in 1984–1998 (‘full cohort') and patients diagnosed in 1994–1998 (‘restricted cohort').

 
10-year relative survivala
Change from
Difference from period 19941998
Cancer Period 19941998 PE (SE) Full cohort 19841998 PE (SE) Restricted cohort 19941998 PE (SE) 19841988 to 1994–1998 Full cohort 19841998 Restricted cohort 19941998
Oral cavity
56.1 (1.9)
55.3 (1.1)
53.4 (1.8)
−2.7
−0.75
−2.73
Multiple myeloma
16.8 (1.3)
15.6 (0.7)
16.2 (1.2)
−1.5
−1.25
−0.57
Pancreas
2.6 (0.4)
2.2 (0.2)
1.8 (0.3)
−0.5
−0.45
−0.79
Trachea, lung and bronchus
7.2 (0.4)
7.1 (0.2)
7.2 (0.4)
−0.4
−0.1
−0.05
Oesophagus
8.7 (1.2)
7.4 (0.6)
8.4 (1.1)
1.4
−1.24
−0.31
Gallbladder
6.7 (1.4)
7.3 (0.8)
8.0 (1.3)
1.4
0.61
1.31
Liver
3.2 (0.8)
2.8 (0.4)
3.0 (0.7)
1.5
−0.43
−0.23
Larynx
49.8 (5.0)
50.1 (2.4)
55.3 (4.3)
1.5
0.21
5.43
Brain and nervous system
9.8 (1.0)
10.4 (0.6)
12.1 (0.9)
2.2
0.54
2.26
Skin melanoma
75.2 (2.4)
72.9 (1.2)
73.9 (1.9)
3.1
−2.31
−1.27
Leukaemias
25.9 (1.4)
23.1 (0.7)
25.5 (1.2)
4.3
−2.81
−0.42
Cervix
48.8 (2.9)
45.0 (1.5)
47.5 (2.7)
4.8
−3.8
−1.33
Bladder
66.2 (1.6)
61.9 (0.8)
65.3 (1.3)
5.9
−4.25
−0.87
Stomach
22.0 (0.9)
20 (0.5)
22.8 (0.9)
6.6
−1.98
0.82
Female breast
68.7 (0.9)
67.9 (0.5)
71.8 (0.7)
6.6
−0.76
3.13
Corpus uteri
74.3 (1.6)
73.0 (0.9)
77.2 (1.4)
6.9
−1.35
2.9
Non-Hodgkin lymphoma
37.5 (1.3)
36.0 (0.7)
38.8 (1.0)
7.1
−1.43
1.31
Colorectal
49.6 (0.9)
47.6 (0.5)
52.2 (0.8)
7.7
−2.03
2.56
Ovarian
29.1 (1.2)
27.2 (0.7)
29.4 (1.2)
7.8
−1.85
0.31
Thyroid
69.9 (2.9)
67.4 (1.6)
72.8 (2.6)
10.7
−2.56
2.86
Kidney
45.1 (1.5)
42.2 (0.8)
47.1 (1.3)
12.7
−2.89
2.05
Hodgkin lymphoma
47.5 (4.5)
42.6 (2.4)
48.4 (5.1)
12.9
−4.82
0.97
Prostate 45.6 (1.2) 52.1 (0.9) 62.9 (1.3) 17.4 6.48 17.34

PE=point estimate; SE=standard error.

Finnish Cancer Registry, 23 most common forms of cancer.

a

All survival estimates were age-adjusted to the International Standard Cancer Populations proposed by Corazziari et al (2004).

Summary results of the comparisons of the period estimates of 5- and 10-year relative survival with survival of the corresponding full and restricted cohorts computed on 5-year calendar periods during the 50 calendar years (1954–2003) included in this analysis are shown in Table 4. Supplementary Figure 1 shows the mean and mean squared differences between the survival estimates according to the improvements in survival over time. Five-year relative survival of full cohorts was on average closer to the period estimates than 5-year relative survival of the restricted cohorts for a majority of 16 out of 23 cancers. The full cohort was especially closer for cancer sites showing strong increases in survival between 1954 and 2003. For 10 out of 11 cancer sites with increases >30 % units, the full cohort estimate was closer. Absolute values of mean differences were <1 % unit in 19 out of 23 cancers. Whereas 5-year relative survival of full cohorts was on average lower than the period 5-year relative survival estimate for all cancers, 5-year relative survival of the restricted cohorts was on average higher than the period estimates for all but two cancers (oral and gallbladder cancer), which showed minor or no improvements in survival. Mean squared differences in 5-year relative survival were lower for the full cohorts than for the restricted cohorts for 19 out of 23 cancers. Again, this relationship was stronger for cancer sites with increases >30 % units between 1954 and 2003.

Table 4. Mean difference and mean squared difference of full cohort and restricted cohort 5- and 10-year relative survival from period 5- and 10-year relative survival computed on 5-year calendar periods, respectively.

  5-year relative survivala 10-year relative survivala
 
Change from
Full cohort
Restricted Cohort
Change from
Full cohort
Restricted Cohort
Cancer 195458 to 199903 MD MSD MD MSD 195458 to 199498 MD MSD MD MSD
Oral cavity
−1.9
−0.32
3.45
−0.41
4.34
−7.9
−0.68
2.98
−0.79
5.29
Pancreas
2.3
−0.07
0.08
0.06
0.21
0.2
−0.12
0.22
0.03
0.32
Gallbladder
6.9
−0.41
0.8
−0.05
2.41
5.9
0.17
0.88
0.81
2.91
Liver
7.4
−0.03
0.15
0.38
0.49
2.6
−0.01
0.19
0.29
0.41
Trachea, lung and bronchus
7.4
−0.16
0.23
0.24
0.18
4.7
−0.38
0.38
0.17
0.3
Oesophagus
8.7
−0.27
0.20
0.23
0.41
7.0
−0.45
0.80
0.29
1.44
Brain & nervous system
13.1
−0.23
1.44
0.44
1.78
10.3
0.57
1.18
1.64
3.86
Cervix
19.8
−0.01
4.09
1.03
14.22
9.3
−1.30
18.74
0.40
37.00
Stomach
20.1
−0.63
0.66
0.53
1.25
17.8
−1.64
3.1
0.87
1.04
Larynx
21.5
−0.46
6.08
0.60
9.29
17.2
−1.65
14.31
0.73
17.69
Ovarian
22.2
−1.06
2.61
0.09
2.02
10.1
−1.18
2.17
0.02
3.65
Multiple myeloma
27.8
−1.17
2.77
0.15
2.34
12.1
−1.20
11.89
0.21
9.3
Leukaemias
34.8
−0.27
1.24
1.49
4.26
23.0
−1.33
7.25
1.26
10.51
Corpus uteri
36.3
−0.46
1.57
1.31
3.78
37.8
−1.52
13.38
2.03
24.71
Colorectal
38.2
−0.81
1.09
1.10
1.61
33.8
−2.16
5.17
1.51
2.89
Female breast
38.2
−0.32
1.33
1.56
5.64
37.8
−0.52
4.17
3.24
18.71
Non-Hodgkin lymphoma
38.3
−1.13
10.18
0.69
8.89
27.6
−1.40
8.66
0.81
9.02
Thyroid
43.5
−0.95
2.87
1.24
5.47
48.9
−1.01
10.89
4.15
38.73
Kidney
44.2
−0.78
1.04
1.40
2.68
33.4
−2.10
5.67
1.83
7.08
Skin melanoma
45.8
−0.00
5.48
2.09
11.48
34.4
−0.73
47.49
3.52
61.67
Bladder
49.2
−0.63
3.69
1.78
6.50
46.6
−2.73
22.56
2.12
21.86
Hodgkin lymphoma
55.4
−1.07
4.9
1.96
6.64
38.9
−2.63
13.47
2.53
19.89
Prostate 64.3 −0.55 5.3 2.31 16.90 48.7 0.28 17.66 4.44 80.07

Abbreviations: MD=mean difference; MSD=mean squared difference.

Finnish Cancer Registry, 23 most common forms of cancer in 1954–2003.

a

All survival estimates were age-adjusted to the International Standard Cancer Populations proposed by Corazziari et al (2004).

Regarding 10-year relative survival, differences between period estimates and full cohort and restricted cohort estimates were on average somewhat larger. With three exceptions (gallbladder, prostate, and brain and nervous system cancer), the full cohort estimates were on average lower than the period estimates, whereas the restricted cohort estimates were on average higher than the period estimates for all cancers except oral cavity cancer. Absolute values of mean differences from period estimates were of similar magnitude for full and restricted cohort estimates and for the restricted cohort on average larger for cancer sites with stronger improvements in survival over time. However, mean squared differences were lower for full cohort estimates for a majority of 18 out of 23 cancers.

Comparisons of the period estimates of 15- and 20-year relative survival with survival of the corresponding full and restricted cohorts computed on 5-year calendar periods during the 50 calendar years are shown in Supplementary Table S1. Like for 5-year survival, the absolute mean differences from period estimates were on average lower for full than restricted cohort estimates for a majority of cancer sites (15-years: 16 cancer sites; 20-years: 20 cancer sites). Mean squared differences were lower for the full than for the restricted cohort for 19 out of 23 cancers for both 15- and 20-year survival. No strong relationship between the improvements in survival during the period of interest and the difference between the cohort estimates and the period estimates were observed.

Summary results of the comparison of 5- and 10-year period survival estimates with full and restricted cohort estimates computed on 2- and 10-year calendar periods are shown in Supplementary Tables S2 and S3. For 2-year calendar periods (Supplementary Table S2), 5-year and 10-year period estimates were on average closer to full than restricted cohort estimates for 17 and 14 out of 23 cancer sites, respectively. Absolute mean squared differences were on average lower for the full than for the restricted cohort for 22 cancers for 5-year survival and 20 cancers for 10-year survival. For 10-year calendar periods (Supplementary Table S3), 5-year period estimates were again on average closer to full than restricted cohort estimates (17 out of 23 cancer sites). In contrast, 10-year period estimates were on average closer to restricted cohort estimates for 14 of 23 cancer sites. However, mean squared differences were on average lower for full cohort estimates for 5-year (16 out of 23 cancers) and 10-year (15 out of 23 cancers) survival. In general, 5-year period survival estimates were especially closer to full than restricted cohort estimates in case of strong improvements in survival during the period of interest, irrespective of the length of the period. This pattern was not observed for 10-year survival.

We repeated all analyses computing mean absolute differences between the period survival estimates and the full and restricted cohort estimates. In all scenarios, the mean absolute differences between the full cohort and the period survival estimate were smaller than the absolute differences between the restricted and the period survival estimate for the majority of cancer sites (data not shown).

Finally, we investigated whether the magnitude of the difference between the period survival estimates and the full and restricted cohort estimates depended on the size of the period estimate (data not shown). The differences between period survival estimates and restricted cohort estimates were larger for higher period survival estimates. As a consequence, in particular for cancer sites with longer survival, the full cohort estimates were closer to the period estimate than the restricted cohort estimates.

Discussion

As has been shown by extensive empirical investigations, period analysis provides better predictions of survival of recently diagnosed patients than traditional cohort analysis (Brenner and Hakulinen, 2002a; Brenner et al, 2002b; Talback et al, 2004; Ellison, 2006) and, therefore, has been increasingly used in cancer survival studies in recent years (e.g. Verdecchia et al, 2007; Sankaranarayanan et al, 2010; Coleman et al, 2011; van de Schans et al, 2011). Until now, it has not yet been investigated which study population pertains best to the period survival estimates and, thus, should be described with respect to the sociodemographic and clinical characteristics in an introductory table. As a consequence, reporting of the underlying population in period analyses has not been standardized, which hampers comparability across studies.

We investigated the use of two ‘natural candidates' of study populations for period analysis: the full cohort, including all patients who potentially contributed some data to the survival analysis, and the restricted cohort, including only patients diagnosed in the period of interest. When survival is constant over time, survival of both cohorts is the same. Fortunately, however (as also observed for many cancers in Finland), survival is increasing over time for many cancers (Gondos et al, 2009; Verdecchia et al, 2009; Storm et al, 2010), in which case, survival of the full cohorts is typically slightly lower, and survival of the restricted cohort is typically slightly higher than the survival estimated by period analysis. This pattern was observed in our results when information from various periods was collapsed. However, for 5- and 10-year survival in the most recent time period, this relationship was only observed for some cancer sites, which might be explained by smaller improvements in cancer survival in recent years and by random errors in the survival estimates.

In published articles, the full (Ellison et al, 2007; Verdecchia et al, 2007; Redaniel et al, 2009; Chen et al, 2012) as well as the restricted cohort (Gondos et al, 2007; Quaglia et al, 2009; Pollack et al, 2011) have often been described showing that the investigation of the appropriate study population is a relevant issue. Our results show that in most of the investigated situations (5-, 10-, 15-, and 20-year relative survival computed on 5-year calendar periods, 5-year survival on 2-year and 10-year calendar periods and 10-year survival on 2-year calendar periods) survival estimates for the full cohort were mostly closer to the period estimates than the survival estimates for the restricted cohort. Results for 10-year survival estimates on 5-year and 10-year calendar periods were less obvious with respect to the mean difference. However, in these situations, the mean squared differences, which quantify not only over- or underestimations but also the degree of random variation, were mostly smaller when comparing the period survival estimates with the survival estimates obtained from the full cohorts. The mean squared difference can be expected to be smaller for the full cohort, even in case of comparable differences, as the restricted cohort is a subset of the full cohort. However, as also the mean absolute difference was smaller for the full than the restricted cohort in these settings, our results nonetheless suggest that the survival estimate for the full cohort, consisting of all patients who potentially contributed some data to the survival analysis, is overall closer to the period survival estimate than the restricted cohort estimate and, thus, this cohort should be described as study population in an introductory table when period analysis is used.

Agreement of the period survival estimate with the survival of the various types of cohorts is of course just one of several possible criteria for selecting the cohort of patients to be described. We believe, however, that it is a particularly relevant criterion, especially when the description of the patient cohort includes distributions of factors closely related to survival, such as stage, cancer subsite, or histopathology, or indicators of data quality of potential relevance to survival estimates, such as the proportion of cases notified by death certificate only (Brenner and Holleczek, 2011). We used the mean, mean squared, and absolute difference as criteria to measure the similarity of survival estimates. Evaluation of other criteria, like the frequency of the direction of deviation, could be helpful to get further insights in the relationship between period survival and full cohort and restricted cohort estimates. A limitation of the mean squared difference as similarity measure is that it might be confounded by the heterogeneity across calendar years.

The finding that the full cohorts typically had somewhat lower survival, whereas the restricted cohorts typically had somewhat higher survival than the period survival estimates might suggest that cohorts that include some, but not all of the years of diagnosis prior to the period of interest included in the full cohorts might even be preferable over the full cohorts. An alternative and theoretically plausible cohort would consist of all patients who contribute data to the survival analysis. This would correspond to all patients of the full cohort who do not experience the event and are not censored before the start of the period window. An advantage of using this cohort instead of the full or restricted cohort is that the person-time and number of events in this cohort would be directly related to the data used in the period analysis. However, we feel that communicating the rationale for the use of such cohorts would be much less straightforward than for the full cohorts, and that this disadvantage would outweigh any possible minor benefits in the fit of the survival expectations.

A disadvantage of the recommendation to describe the full cohort instead of the restricted cohort is that the definition of the cohort, which should be described, depends on the length of the survival estimate, which is investigated. For example, if 5- and 10-year period survival estimates for the period 1999–2003 are computed in the same study, the underlying population will include all patients diagnosed in 1994–2003 and 1989–2003, respectively. In these situations, it might be preferable to describe the restricted cohort (1999–2003), as this cohort is commonly defined for both analyses. On the other hand, description of the full cohort is also common standard in other studies even if certain parts of the analysis pertain to defined subcohorts only. Furthermore, if an investigator also wants to include the number of person-years and cases included in the period analysis, these will also be contributed by the full cohort.

Our study has specific strengths and limitations. Strengths include the use of high-quality data from the Finnish Cancer Registry with a very long time series, including cancer diagnosis over half a century and the comprehensive investigation of the appropriate study population for various cancer sites, survival times, and lengths of calendar periods. The use of an empirical instead of a theoretical approach for the investigation of the appropriate study population might be considered to be a limitation of this study. Although an empirical investigation closely resembles later analysis, on which the results should be applied on, results on the difference between period and full and restricted cohort estimates might be affected by chance. We used various cancer sites and computed the difference for each possible 5-year calendar period to reduce such biases. However, results on mean differences for 10-year survival computed on 10-year periods and 20-year survival computed on 5-year periods were based on three calendar periods only. Nonetheless, our results showed a consistent pattern across various analysis situations.

In conclusion, due to the increasing use of period analysis it has become important to investigate which study population should be reported in an introductory table to standardise the reporting of results and, thus, enhance comparability across studies. Our analysis suggests that generally all patients who potentially contributed some data to the survival analysis should be included in the study base for the description of patients' characteristics.

Acknowledgments

This study was supported by the German Cancer Aid (Deutsche Krebshilfe) grant no. 108257.

Footnotes

Supplementary Information accompanies this paper on British Journal of Cancer website (http://www.nature.com/bjc)

This work is published under the standard license to publish agreement. After 12 months the work will become freely available and the license terms will switch to a Creative Commons Attribution-NonCommercial-Share Alike 3.0 Unported License.

Supplementary Material

Supplementary Information

References

  1. Brenner H, Gefeller O. An alternative approach to monitoring cancer patient survival. Cancer. 1996;78 (9:2004–2010. [PubMed] [Google Scholar]
  2. Brenner H, Gefeller O, Hakulinen T. A computer program for period analysis of cancer patient survival. Eur J Cancer. 2002a;38 (5:690–695. doi: 10.1016/s0959-8049(02)00003-5. [DOI] [PubMed] [Google Scholar]
  3. Brenner H, Hakulinen T. Up-to-date long-term survival curves of patients with cancer by period analysis. J Clin Oncol. 2002a;20 (3:826–832. doi: 10.1200/JCO.2002.20.3.826. [DOI] [PubMed] [Google Scholar]
  4. Brenner H, Hakulinen T. Very-long-term survival rates of patients with cancer. J Clin Oncol. 2002b;20 (21:4405–4409. doi: 10.1200/JCO.2002.99.060. [DOI] [PubMed] [Google Scholar]
  5. Brenner H, Holleczek B. Deriving valid population-based cancer survival estimates in the presence of nonnegligible proportions of cancers notified by death certificates only. Cancer Epidemiol Biomarkers Prev. 2011;20 (12:2480–2486. doi: 10.1158/1055-9965.EPI-11-0697. [DOI] [PubMed] [Google Scholar]
  6. Brenner H, Soderman B, Hakulinen T. Use of period analysis for providing more up-to-date estimates of long-term survival rates: empirical evaluation among 370 000 cancer patients in Finland. Int J Epidemiol. 2002b;31 (2:456–462. [PubMed] [Google Scholar]
  7. Chen T, Jansen L, Gondos A, Ressing M, Holleczek B, Katalinic A, Brenner H, GEKID Cancer Survival Working Group Survival of endometrial cancer patients in Germany in the early 21st century: a period analysis by age, histology, and stage. BMC Cancer. 2012;12 (1:128. doi: 10.1186/1471-2407-12-128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Coleman MP, Forman D, Bryant H, Butler J, Rachet B, Maringe C, Nur U, Tracey E, Coory M, Hatcher J, McGahan CE, Turner D, Marrett L, Gjerstorff ML, Johannesen TB, Adolfsson J, Lambe M, Lawrence G, Meechan D, Morris EJ, Middleton R, Steward J, Richards MA. Cancer survival in Australia, Canada, Denmark, Norway, Sweden, and the UK, 1995–2007 (the International Cancer Benchmarking Partnership): an analysis of population-based cancer registry data. Lancet. 2011;377 (9760:127–138. doi: 10.1016/S0140-6736(10)62231-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Corazziari I, Quinn M, Capocaccia R. Standard cancer patient population for age standardising survival ratios. Eur J Cancer. 2004;40 (15:2307–2316. doi: 10.1016/j.ejca.2004.07.002. [DOI] [PubMed] [Google Scholar]
  10. Ederer F, Heise H. Instructions to IBM 650 programmers in processing survival computations. National Cancer Institute: Bethesda (MD); 1959. [Google Scholar]
  11. Ellison LF. An empirical evaluation of period survival analysis using data from the Canadian Cancer Registry. Ann Epidemiol. 2006;16 (3:191–196. doi: 10.1016/j.annepidem.2005.02.017. [DOI] [PubMed] [Google Scholar]
  12. Ellison LF, Pogany L, Mery LS. Childhood and adolescent cancer survival: a period analysis of data from the Canadian Cancer Registry. Eur J Cancer. 2007;43 (13:1967–1975. doi: 10.1016/j.ejca.2007.05.014. [DOI] [PubMed] [Google Scholar]
  13. Gondos A, Arndt V, Holleczek B, Stegmaier C, Ziegler H, Brenner H. Cancer survival in Germany and the United States at the beginning of the 21st century: an up-to-date comparison by period analysis. Int J Cancer. 2007;121 (2:395–400. doi: 10.1002/ijc.22683. [DOI] [PubMed] [Google Scholar]
  14. Gondos A, Bray F, Hakulinen T, Brenner H. Trends in cancer survival in 11 European populations from 1990 to 2009: a model-based analysis. Ann Oncol. 2009;20 (3:564–573. doi: 10.1093/annonc/mdn639. [DOI] [PubMed] [Google Scholar]
  15. Hakulinen T, Seppa K, Lambert PC. Choosing the relative survival method for cancer survival estimation. Eur J Cancer. 2011;47 (14:2202–2210. doi: 10.1016/j.ejca.2011.03.011. [DOI] [PubMed] [Google Scholar]
  16. Henson DE, Ries LA. The relative survival rate. Cancer. 1995;76 (10:1687–1688. doi: 10.1002/1097-0142(19951115)76:10<1687::aid-cncr2820761002>3.0.co;2-i. [DOI] [PubMed] [Google Scholar]
  17. Pollack LA, Li J, Berkowitz Z, Weir HK, Wu XC, Ajani UA, Ekwueme DU, Li C, Pollack BP. Melanoma survival in the United States, 1992 to 2005. J Am Acad Dermatol. 2011;65 (5 Suppl 1:S78–S86. doi: 10.1016/j.jaad.2011.05.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Quaglia A, Tavilla A, Shack L, Brenner H, Janssen-Heijnen M, Allemani C, Colonna M, Grande E, Grosclaude P, Vercelli M. The cancer survival gap between elderly and middle-aged patients in Europe is widening. Eur J Cancer. 2009;45 (6:1006–1016. doi: 10.1016/j.ejca.2008.11.028. [DOI] [PubMed] [Google Scholar]
  19. Redaniel MT, Laudico A, Mirasol-Lumague MR, Gondos A, Uy GL, Toral JA, Benavides D, Brenner H. Ovarian cancer survival population differences: a ‘high resolution study' comparing Philippine residents, and Filipino-Americans and Caucasians living in the US. BMC Cancer. 2009;9:340. doi: 10.1186/1471-2407-9-340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Sankaranarayanan R, Swaminathan R, Brenner H, Chen K, Chia KS, Chen JG, Law SC, Ahn YO, Xiang YB, Yeole BB, Shin HR, Shanta V, Woo ZH, Martin N, Sumitsawan Y, Sriplung H, Barboza AO, Eser S, Nene BM, Suwanrungruang K, Jayalekshmi P, Dikshit R, Wabinga H, Esteban DB, Laudico A, Bhurgri Y, Bah E, Al-Hamdan N. Cancer survival in Africa, Asia, and Central America: a population-based study. Lancet Oncol. 2010;11 (2:165–173. doi: 10.1016/S1470-2045(09)70335-3. [DOI] [PubMed] [Google Scholar]
  21. Storm HH, Engholm G, Hakulinen T, Tryggvadottir L, Klint A, Gislum M, Kejs AM, Bray F. Survival of patients diagnosed with cancer in the Nordic countries up to 1999–2003 followed to the end of 2006. A critical overview of the results. Acta Oncol. 2010;49 (5:532–544. doi: 10.3109/02841861003801148. [DOI] [PubMed] [Google Scholar]
  22. Talback M, Stenbeck M, Rosen M. Up-to-date long-term survival of cancer patients: an evaluation of period analysis on Swedish Cancer Registry data. Eur J Cancer. 2004;40 (9:1361–1372. doi: 10.1016/j.ejca.2004.02.004. [DOI] [PubMed] [Google Scholar]
  23. van de Schans SA, Gondos A, van Spronsen DJ, Rachtan J, Holleczek B, Zanetti R, Coebergh JW, Janssen-Heijnen ML, Brenner H. Improving relative survival, but large remaining differences in survival for non-Hodgkin's lymphoma across Europe and the United States from 1990 to 2004. J Clin Oncol. 2011;29 (2:192–199. doi: 10.1200/JCO.2010.28.6377. [DOI] [PubMed] [Google Scholar]
  24. Verdecchia A, Francisci S, Brenner H, Gatta G, Micheli A, Mangone L, Kunkler I. Recent cancer survival in Europe: a 2000–02 period analysis of EUROCARE-4 data. Lancet Oncol. 2007;8 (9:784–796. doi: 10.1016/S1470-2045(07)70246-2. [DOI] [PubMed] [Google Scholar]
  25. Verdecchia A, Guzzinati S, Francisci S, De Angelis R, Bray F, Allemani C, Tavilla A, Santaquilani M, Sant M. Survival trends in European cancer patients diagnosed from 1988 to 1999. Eur J Cancer. 2009;45 (6:1042–1066. doi: 10.1016/j.ejca.2008.11.029. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information

Articles from British Journal of Cancer are provided here courtesy of Cancer Research UK

RESOURCES