Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2012 Feb 26.
Published in final edited form as: J Data Sci. 2011 Oct 1;8(4):631–644.

A Weighted-Least-Squares Estimation Approach to Comparing Trends in Age-Adjusted Cancer Rates Across Overlapping Regions

Kimberly A Walters 1, Yi Li 2, Ram C Tiwari 3, Zhaohui Zou 4
PMCID: PMC3286621  NIHMSID: NIHMS353971  PMID: 22375146

Abstract

Li and Tiwari (2008) recently developed a corrected Z-test statistic for comparing the trends in cancer age-adjusted mortality and incidence rates across overlapping geographic regions, by properly adjusting for the correlation between the slopes of the fitted simple linear regression equations. One of their key assumptions is that the error variances have unknown but common variance. However, since the age-adjusted rates are linear combinations of mortality or incidence counts, arising naturally from an underlying Poisson process, this constant variance assumption may be violated. This paper develops a weighted-least-squares based test that incorporates heteroscedastic error variances, and thus significantly extends the work of Li and Tiwari. The proposed test generally outperforms the aforementioned test through simulations and through application to the age-adjusted mortality data from the Surveillance, Epidemiology, and End Results (SEER) Program of the National Cancer Institute.

Keywords: Age-adjusted cancer rates, annual percent change (APC), cancer surveillance, trends, weighted-Least-Squares estimation, hypothesis testing

1. Introduction

Cancer has been a major epidemic concern in the industralized nations, contributing, for example, 570,280 deaths each year in the United States (American Cancer Society 2005). Many public and private agencies dealing with cancer and related problems depend on the rates of cancer deaths or new cases as an estimate of cancer burden for planning and resource allocation. Among these agencies, the Surveillance, Epidemiology, and End Results (SEER) Program of the National Cancer Institute (NCI) is the most authoritative and comprehensive source of information on cancer incidence and deaths in the United States, which currently collects and publishes cancer incidence and survival data from population-based cancer registries covering approximately over a quarter of the entire US population.

One main task of the SEER program is to routinely monitor and compare trends in cancer mortality and incidence rates across geographic regions or over different time periods. The data are analyzed by SEER*STAT software, which is maintained by the NCI, with the results periodically published in SEER Cancer Statistics Review; see Ries et al. (2001). In this annual report (available at http://seer.cancer.gov/csr), the estimated annual percent change (APC) for over 80 cancer sites are presented across geographic regions (e.g. counties or states) for different specified periods. As the APC measures the trend in cancer mortality and incidence rates, its comparison across various regions has important social and economic ramifications, ranging from deciding which cancer programs get funded to deciding how the funds are allocated among various regions.

However, a fundamental statistical difficulty arises when such comparisons, largely for policy making purposes, have to be made for regions or time intervals that overlap, e.g. comparing the most recent changes in trends of cancer rates in a local area (e.g. the mortality rate of breast cancer in California) with a more global level (i.e. the national mortality rate) over two overlapping time periods, because of availability of the data. For example, as detailed in the data analysis section, it is of substantial interest to compare the changes in California cancer mortality rates with the national cancer mortality rates in the last 15 years. However, for a 15-year block, the California cancer rates were available for 1990–2004, while the national data were available for 1988–2002.

In the current SEER*STAT software, the two-sample pooled t-test (Kleinbaum et al., 1988) is available to compare two APC values from two non-overlapping regions or non-overlapping time intervals, based on two independent linear models with a common variance. But, when one wishes to compare APCs for two overlapping regions or time intervals, the samples are no longer independent, invalidating the two sample t-test. Recently, Li and Tiwari (2008) developed a corrected Z-test that properly accounts for the overlapping. However, their derivation relied on a common time-independent variance assumption. Indeed, as the age-adjusted rates are linear combinations of mortality or incidence counts, arising from an underlying Poisson process (Brillinger, 1986), such a constant variance assumption may be dubious. In this paper, we relax such an unrealistic assumption and derive a Z-test using weighted least squares (WLS) for comparing two APCs when the (transformed) cancer rates have heteroscedastic variances.

The rest of the paper is organized as follows. Section 2 gives the definition of the annual percent change (APC) and introduces the problem at hand of comparing two APCs. This section also briefly reviews the t-test of Kleinbaum et al. (1988) and the corrected Z-test of Li and Tiwari (2008). In Section 3, the new WLS Z-test is developed and, in Section 4, its performance with respect to the previous corrected Z-test is considered via a simulation study and application to SEER cancer mortality data. The conclusions are summarized in Section 5.

2. Annual Percent Change (APC) and Tests for Comparing Two APCs

Let nkji and dkji denote the mid-year population and counts for region k, age-group j and time ti, and let wj denote the standard for the age-group j standardized so that j=1Jwj=1, i = 1, …, Ik, j = 1, …, J, k = 1, 2. The age-adjusted rate is defined as

rki=j=1Jwjdkjinkji, (2.1)

where wj > 0, j = 1, …, J, are the known standards for the age group j so that j=1Jwj=1. For the SEER analysis, there are J = 19 standard age-groups consisting of 0–1, 1–4, 5–9, …, 85+, and wj are chosen to be the year 2000 population standards (Fay et al. 2006).

To describe the change in cancer trend, we work with the logarithmic transformation of ki, and fit a linear regression of ki on calendar time ti. However, since ki may be 0 for some rare cancer sites, we consider a discrete correction of ki as

rki=j=1Jwjdkji+1JZjinkji=rki+w¯ki. (2.2)

Here the random purtabation Zji=l=1JI(Xli=j), where Xli, l = 1, …, J, i = 1, …, Ik are iid random variables, each of which takes values 1, …, J with equal probability 1/J. Note that Σj Zji = J with E(Zji) = 1. This amounts to distributing a count with mean 1 over all J age-groups at each time ti, and hence avoids the singular situation. It is notable that this correction, specifically designed to accommodate the discrete nature of the counts, differs slightly from the continuous correction proposed in Tiwari et al. (2006), by introducing a correction factor, w¯ki=1Jj=1JwjZjinkji.

Consider a simple linear regression of logarithm yki = log(rki) on calendar time ti, given by

yki=βk0+βk1ti+eki,i=1,,Ik;k=1,2, (2.3)

where eki are independent random errors with E(eki) = 0. For the variance of eki, we note that (dkji) behaves as independent realizations of Poisson random variables, with means equal to their variances. We further note that the random perturbation Zji follows Binomial(J, 1/J), cov(Zji, Zji) = −1/J, cov(Zji, Zji) = 0 if ii′, and also Zji and dkji are independent. Hence, using the delta method, we obtain the heterogeneous error variances of yki as

νki2vki2rki2, (2.4)

where

vki2=j=1Jwj2Var(dkji)nkji2+Var(j=1Jwj1JZjinkji)j=1Jwj2dkjinkji2+1J2[j=1Jwj2nkji21J(j=1Jwjnkji)2]

is the estimated variance of rki. Note that vki2 is smaller than the Var(rki) given in Tiwari et al. (2006) by a term 1J(11J)j=1Jwj2nkji2+1J3(j=1Jwjnkji)20, a negligible constant.

With eki having a heteroscedastic variance structure, the weighted least squares estimates or the maximum likelihood estimates of (βk0, βk1) are given by (β̃k0, β̃k1), where

βk0=ykβk1tk;βk1=i=1Ik(ykiyk)(titk)/νki2i=1Ik(titk)2/νki2,

with

tk=i=1Ikti/νki2i=1Ik1/νki2,yk=i=1Ikyki/νki2i=1Ik1/νki2. (2.5)

As a special case when Var(eki)=σk2, k = 1, 2, which are invariant of i, the estimates of βk1 its variance, and σk are given by

βk1=i=1Ik(ykiy¯k)(tit¯k)i=1Ik(tit¯k)2;σ^β^k12=σ^k2i=1Ik(tit¯k)2;σ^k2=i=1Ik(ykiy^ki)2Ik2,

with ŷki = β̂k0 + β̂k1ti, y¯k=1Iki=1Ikyki,t¯k=1Iki=1Ikti.

The annual percent change (APC), defined as APCk = 100(eβk1 − 1) for each region, describes the change in trend of cancer mortality or incidence. When comparing the change trends of a cancer across two regions, it is often of interest to test the null hypothesis H0: APC1 = APC2 versus the alternative hypothesis H1 : APC1APC2, or equivalently to test H0:β11=β21 versus H1:β11β21. Under a further assumption that σ12=σ22(=σ2), the two-sample pooled t-test is given by (Kleinbaum et al., 1988)

t=β^11β^21[σ^2(1i=1I1(t1it¯1)2+1i=1I2(t2it¯2)2)]1/2t(I1+I24), (2.6)

where the “pooled” estimate of σ2 is given by

σ^2=i=1I1(y1iy^1i)2+i=1I2(y2iy^2i)2I1+I24. (2.7)

This is the test that is implemented in SEER*STAT software. However, when there is an overlap between the two regions or in the two time intervals, the two samples are not independent, and there is a need to adjust for the covariance between β̂11 and β̂21. Li and Tiwari (2008) proposed a corrected Z-test procedure that includeds such an adjustment.

Specifically, they considered the following models

y1i=β10+β11ti+e1i,i=1,,m, (2.8)
y2i=β20+β21ti+e2i,i=s+1,,s+I, (2.9)

respectively for overlapping Regions 1 and 2. Region 1 was observed for the time points of {t1, …, tm}, while Region 2 was observed for the time points of {ts+1, …, ts+I}. When t1ts+1 < tmts+I (e.g. the two time periods are overlapping), these two regressions are not independent.

Further introduce nk=i=s+1mj=1Jnkji for k = 1, 2, n(O)=i=s+1mj=1Jnji(O), where the superscript ‘O’ is used to denote the intersection of Regions 1 and 2, and denoted by nkji and nji(O) the numbers of underlying population at risk for age group j at time ti in Region k(k = 1, 2), and in the overlapping subregion, respectively.

Li and Tiwari (2008) showed

β^11β^21N(β11β21,σ2(1σ1t2+1σ2t22σ12tσ1t2σ2t2(n(O))2n1n2)), (2.10)

where σ12t=i=s+1m(tit¯1)(tit¯2), based on which, a corrected Z-test was proposed as

ZCT=β^11β^21σ^2(1σ1t2+1σ2t22σ12tσ1t2σ2t2(n(O))2n1n2),

which reject the null hypothesis for large absolute values of ZCT.

However, Li and Tiwari’s derivation hinged upon the common variance assumption of Var(e1i) ≡ Var(e2j) ≡ σ2, which seems rather stringent. In the next section, we relax such an assumption and propose a weighted-least-squares (WLS) based Z-test, which accommodate Li and Tiwari’s test as a special case.

3. Proposed Test

Our proposed WLS Z-test stems from the assumption that the observed counts dkji follow Poisson distributions, and from the transformed linear regression models (2.8) and (2.9) with the errors eki having heteroscedastic variances. The standard statistical theory reveals that the WLS estimators β̃11, β̃21 follow

β11β21N(β11β21,1σ1t2+1σ2t22Cov(β11,β21)).

It turns out, however, that the derivation of Cov(β̃11, β̃21), when the two time intervals [t1, tm] and [ts+1, ts+I] under consideration are overlapping, is nontrivial as it requires a careful consideration of the overlapping of two regions. The detailed derivation is given in the Appendix, which shows

Cov(β11,β21)σ12tσ1t2σ2t2(n(O))2n1n2, (3.1)

where nk=i=s+1mj=1Jnkji for k = 1, 2, n(O)=i=s+1mj=1Jnji(O),σ1t2=i=1m(tit1)2/ν1i2,σ2t2=i=s+1s+I(tit2)2/ν2i2 and

σ12t=i+s+1mν12i(O)ν1i2ν2i2(tit1)(tit2),

where 1 and 2 are defined in (2.5), ν1i2 and ν2i2 are as defined in (2.4), ν12i(O)=(vi(O))2r1ir2i with (vi(O))2=j=1Jwj2dji(O)(nji(O))2+1J2[j=1Jwj2(nji(O))21J(j=1Jwjnji(O))2].

Hence, we have that

β11β21N(β11β21,1σ1t2+1σ2t22σ12tσ1t2σ2t2(n(O))2n1n2) (3.2)

as the basis for the WLS Z-test statistic, defined as

ZWLS=β11β211σ1t2+1σ2t22σ12tσ1t2σ2t2(n(O))2n1n2,

which would reject the null hypothesis for the large absolute value of ZWLS.

To compare the efficiency of ZWLS and ZCT, we compute the ratio of the variances (RoV) in (2.10) and (3.2) as,

RoV=σ2(1σ1t2+1σ2t22σ12tσ1t2σ2t2(n(O))2n1n2)1σ1t2+1σ2t22σ12tσ1t2σ2t2(n(O))2n1n2. (3.3)

Several points are worthy of noting. First, the RoV is essentially the Pitman asymptotic relative efficiency (ARE) under the assumption of common variance, when both tests are valid and maintain the nominal type I error. In particular, as a special case of νki2σ^2 for all (k, i), ARE is 1 and further ZCTZWLS, hence ZCT is a special case of ZWLS. In the violation of such common variance assumption, the RoV is no longer the ARE, but provides an approximate assessment of efficacies of these two tests. Secondly, the signs of σ12t and σ̃12t will determine, respectively, whether Cov(β̂11, β̂21) and Cov(β̃11, β̃21) are positive or negative, and their signs are often but not necessarily the same (as shown in simulations).

We will conduct simulation studies in the next section to evaluate (3.3), and to assess the performance of the proposed WLS Z-test.

4. Simulation and Application to SEER Data

To evaluate the finite sample performance of the proposed test under various scenarios, we conducted the following simulations to compare the APCs for two regions. We mimicked the comparision between, say, the Southern Region (Region 1) consisting of Georgia (GA), South Carolina (SC) and North Carolina (NC), and the Eastern Region (Region 2) consisting of NC, Virginia (VA) and Maryland (MD), with NC the overlapping state. The three different time periods, with varying degree of overlap in the intervals, are taken to be : (a) [1980,1989] for Region 1, and [1990,1999] for Region 2 so that there is no overlap between the two time intervals and σ12t = 0, (b) [1980,1989] for Region 1, and [1983,1992] for Region 2 so that there a considerable overlap of six years between the two intervals and σ12t = 12.25, and (c) [1980,1989] for Region 1, and [1987,1996] for Region 2 so that there is a little overlap of three years between the two intervals and σ12t = −34.75.

For generating the counts, dkji, we assume that dkjiindPoisson(nkjiλkji), where log(λkji) = βkj,0+βk1ti, with ti taking values in the intervals corresponding to the two regions stated above. Note that this specification of for λkji leads to

E(rki)=exp(βk1ti)j=1Jwjexp(βkj,0)=exp(βk1ti)Bk,0

so that log(E(rki)) = log(Bk,0) + βk1ti where APCk = 100(eβk1 − 1).

Now to specify the regression for λkji, we take βk1 = log(100−1 APCk + 1), based on specified values of APCk ranging from −0.3% to 3.0%, and assume that βkj,0=log(dkj,0nkj,0βk1tk,0) where dkj,0 and nkj,0 are, respectively, the observed number of deaths and the number of person-years at risk at tk,0, the beginning of the time interval considered for Region k. The age-specific counts for the overlapping state, NC, are generated from Poisson distributions with means 13min{n1jiλ1ji,n2jiλ2ji}.

The results of the simulation study for the three cases of overlapping time intervals, based on 1000 simulations per cancer site and (APC1, APC2) combination, are obtained. To save the space, we only report those for case (a) in Table 1, while the results for the other two cases are available upon request. Several points are worth mentioning.

Table 1.

No Overlap (σ12t = 0): Simulation Results for GA,SC,NC [1980–1989] and NC,VA,MD [1990–1999]; Comparison of changes in age-adjusted cancer mortality rates in males; APC1 and APC2 are the annual percent changes specified for the respective regions

Cancer site APC1 APC2 (a) Average RoV (b) (c)
All Malignant Cancers −0.3 −0.3 0.0 1.0193 0.0720 0.0490
0.1 0.1 0.0 1.0258 0.0700 0.0590
0.5 0.5 0.0 1.0266 0.0690 0.0600
1.0 1.0 0.0 1.0187 0.0630 0.0480
3.0 3.0 0.0 1.0327 0.0590 0.0470
0.1 0.5 0.4 1.0261 0.8590 0.8770
−0.3 0.3 0.6 1.0199 0.9930 0.9980
1.0 2.0 1.0 1.0202 1.0000 1.0000
1.0 3.0 2.0 1.0218 1.0000 1.0000

Esophagus −0.3 −0.3 0.0 1.0058 0.0690 0.0510
0.1 0.1 0.0 1.0034 0.0630 0.0490
0.5 0.5 0.0 1.0065 0.0620 0.0520
1.0 1.0 0.0 1.0075 0.0730 0.0580
3.0 3.0 0.0 1.0173 0.0650 0.0410
0.1 0.5 0.4 1.0062 0.1080 0.0870
−0.3 0.3 0.6 1.0070 0.1420 0.1250
1.0 2.0 1.0 1.0091 0.2930 0.2920
1.0 3.0 2.0 1.0096 0.7900 0.7840

Lip −0.3 −0.3 0.0 0.8318 0.0710 0.0160
0.1 0.1 0.0 0.8324 0.0720 0.0160
0.5 0.5 0.0 0.8383 0.0720 0.0160
1.0 1.0 0.0 0.8465 0.0710 0.0130
3.0 3.0 0.0 0.8692 0.0810 0.0160
0.1 0.5 0.4 0.8394 0.0700 0.0160
−0.3 0.3 0.6 0.8384 0.0670 0.0170
1.0 2.0 1.0 0.8571 0.0660 0.0130
1.0 3.0 2.0 0.8758 0.0720 0.0130

Prostate −0.3 −0.3 0.0 1.0180 0.0660 0.0410
0.1 0.1 0.0 1.0180 0.0660 0.0430
0.5 0.5 0.0 0.9924 0.0580 0.0510
1.0 1.0 0.0 1.0215 0.0650 0.0460
3.0 3.0 0.0 1.0184 0.0700 0.0450
0.1 0.5 0.4 1.0183 0.2120 0.1820
−0.3 0.3 0.6 1.0185 0.3750 0.3570
1.0 2.0 1.0 1.0222 0.7590 0.7730
1.0 3.0 2.0 1.0232 1.0000 1.0000

Note: (a) = |APC2APC1|, (b) = P {ZCT rejects H0}, (c) = P {ZWLS rejects H0}.

  • the table showa that, in general, Li and Tiwari’s corrected Z-test (referred to as ZCT ) is aggressive in rejecting the null hypothesis, and has higher Type I error probabilities, whereas the proposed WLS Z-test (referred to as ZWLS) is conservative and retains the Type I error probabilities, when the null hypothesis is true (or close to being true).

  • for common cancer sites, as the absolute difference between the two APC values or the amount of the overlap between the comparison intervals increase, the power of the ZWLS test gets better than that of the ZCT test. The average RoV of the ZWLS test is close to 1 and increases as we move from the case of σ12t < 0, to σ12t = 0, and to σ12t > 0. When σ12t > 0, the average RoV is greater than 1 for all the choices of APC values.

  • for the rare cancer sites, such as the lip cancer, as there is higher variability in the observed counts, both tests show that there is not enough evidence to reject the null hypothesis. The ZCT test, however, incurs higher-than-nominal Type I error probabilities, while the ZWLS retains the nominal level.

  • the simulation runs were used to reveal the relationship between σ̃12t and σ12t. We found that the sign of σ̃12t followed that of σ12t in most cases, though there were a few exceptions. Because of space, the results were omitted.

It is of substantial interest to compare the changes in cancer mortality rates in California with the national levels as a California law (Health and Safety Code, Section 103885) was passed in late 1980’s that mandated the reporting of malignancies diagnosed throughout the state. In particular, we applied the proposed methodology to compare the annual percent change (APC) in the age-adjusted mortality rate in Breast Cancer of California (CA) during the period from 1990 to 2004 to that of the United States (US) during the period from 1988 to 2002, for which the national mortality data were available. The mortality data for the United States are compiled by the National Center for Health Statistics (NCHS) of the Centers for Disease Control and Prevention (www.cdc.gov/nchs) and are available from the National Cancer Institute’s Surveillance, Epidemiology, and End Results (SEER) Program (http://www.seer.cancer.gov). The ratio of the total population for all age-groups combined for CA to that for the US for the overlapping years (i.e. n1/n2) was around 11% for females. The observed log-transformed annual age-adjusted rates and the fitted regression lines from the ZWLS test procedure are shown in Figure 1, and the test results are summarized in Table 2. The calculated RoV is 4.42426. Both tests reject the null hypothesis of equal APCs, and suggest that the drop in the mortality rate of Breast cancer is greater in California than at the national level. But the ZWLS test is much more powerful with a much smaller p-value, i.e., pWLS = 0.000000757 while pCT = 0.017372. Thus the ZWLS test gives a much stronger evidence for the conclusion.

Figure 1.

Figure 1

Observed and fitted log-transformed age-adjusted breast cancer mortality rates in CA [1989–2004] and US [1987–2002]

Table 2.

Results of Comparison of CA [1989–2004] with the US [1987–2002] (σ12t = 213.5) in annual percent changes (APC) of age-adjusted breast cancer mortality rates; APC^k=100(eβk11) and APCk=100(eβk11).

California United States
APC^(SE)
−2.33 (0.127) −1.93 (0.127)
APC(SE)
−2.33 (0.084) −1.94 (0.027)
ZWLS −4.94 p-value=0.000000757
ZCT −2.37 p-value= 0.017372

5. Conclusion

In this paper, we have considered an important problem where comparisons have to be made for two correlated linear regressions. Previous work, e.g., Li and Tiwari (2008), relied on constant residual variance assumption for the linear regressions, which is likely to be violated. Viewing the cancer rates as the linear combinations of mortality or incidence counts, which arise naturally from an underlying Poisson process, we have developed a weighted-least-squares based test that incorporates heteroscedastic error variances, and thus significantly extends the work of Li and Tiwari. The simulation results, along with the application to the SEER data, confirmed that our proposed method outperformed that proposed in Li and Tiwari.

One possible limitation of this study is the confinement of the local linearity for the cancer rates when the time periods of consideration is of short or moderate length. Indeed, linearity assumption for the cancer rates is debatable in cancer surveillance, which is likely to be violated over a longer period (e.g. ≥ 30 years). A detailed discussion on this issue has been made in Fay et al. (2006), which proposed a joinpoint linear regression for long-term cancer rate analysis. In a similar context, we plan to pursue APC comparisons for longer periods by considering joinpoint linear regressions, and will report the results in a subsequent communication.

Appendix: Derivation of (3.1)

For t1ts+1 < tmts+I,

Cov(β11,β21)=1σ1t2σ2t2Cov(i=1m1ν1i2(tit1)y1i,i=s+1s+I1ν2i2(tit2)y2i)=1σ1t2σ2t2i=s+1m1ν1i2ν2i2(tit1)(tit2)Cov(y1i,y2i),

with σ1t2=i=1m(tit1)2/ν1i2,σ2t2=i=s+1s+I(tit2)2/ν2i2.

Now, let dkji, dji(O),dkji(NO) denote the number of events (e.g. deaths or cancer cases) and let nkji, nji(O),nkji(NO) denote the population at risk for Region k, age-group j, and at time ti, where the subscript “O” stands for the overlapping region and “NO” stands for the nonoverlapping region, and where we have dropped the subscript k in dji(O) and nji(O) as they are same for the two regions. Let nki=j=1Jnkji,ni(O)=j=1Jnji(O),nki(NO)=j=1Jnkji(NO), and similarly define dki, di(O),dki(NO).

Assuming that in both the overlapping and nonoverlapping regions, the distribution of the population across different age-groups is same; that is (Pickle and White, 1995),

n1i(O)nk1i==nJi(O)nkJi=pki(O),andnk1i(NO)nk1i==nkJi(NO)nkJi=pki(NO). (5.1)

We can express rki as

rki=pki(O)ri(O)+pki(NO)rki(NO), (5.2)

where

ri(O)=j=1Jwjdji(O)+1JZjinji(O),rki(NO)=j=1Jwjdkji(NO)nkji(NO).

Hence, using delta method,

Cov(y1i,y2i)=Cov(log(r1i),log(r2i))1E(r1i)E(r2i)Cov(r1i,r2i)=1E(r1i)E(r2i)p1i(O)p2i(O)Var(ri(O)).

We can now estimate pki(O) by p^ki(O)=ni(O)nki. However, for the US population, we have noticed that p^ki(O) is approximately constant over years (i.e. over index i), and hence, we replace p^ki(O) by p^k(O)=n(O)nk, where nk=i=s+1mj=1Jnkji and n(O)=i=s+1mj=1Jnji(O). So that using the delta method,

Cov^(y1i,y2i)=1E^(r1i)E^(r2i)p^1i(O)p^2i(O)Var^(ri(O))=1r1ir2i(n(O))2n1n2(vi(O))2=(n(O))2n1n2ν12i(O),

where ν1i2 and ν2i2 are as defined in (2.4), ν12i(O)=(vi(O))2/r1ir2i with

(vi(O))2=j=1Jwj2dji(O)(nji(O))2+1J2[j=1Jwj2(nji(O))21J(j=1Jwjnji(O))2].

References

  1. Cancer Facts & Figures. American Cancer Society; Atlanta, Georgia: 2007. [Google Scholar]
  2. Fay M, Tiwari R, Feuer E, Zou Z. Estimating average annual percent change for disease rates without assuming constant change. Biometrics. 2006:62847–854. doi: 10.1111/j.1541-0420.2006.00528.x. [DOI] [PubMed] [Google Scholar]
  3. Ries LAG, Eisner MP, Kosary CL, Hankey BF, Miller BA, Clegg L, Mariotto A, Feuer EJ, Edwards BK, editors. SEER Cancer Statistics Review, 1975–2002. National Cancer Institute; Bethesda, MD: 2003. http://seer.cancer.gov/csr/1975-2002/ [Google Scholar]
  4. Kleinbaum D, Kupper, Muller P. Applied Regression Analysis and Other Multivariable Methods. 2. PWS-Kent; 1988. [Google Scholar]
  5. Li Y, Tiwari R. Comparing trends in age-adjusted cancer rates across overlapping regions. Biometrics. 2008;64:1280–1286. doi: 10.1111/j.1541-0420.2008.01002.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Brillinger DR. The natural variability of vital rates and associated statistics (with discussion) Biometrics. 1986;42:693–734. [PubMed] [Google Scholar]
  7. Tiwari R, Zou Z. Efficient interval estimation for age-adjusted cancer rates. Statistical Methods in Medical Research. 2006;15:547–569. doi: 10.1177/0962280206070621. [DOI] [PubMed] [Google Scholar]
  8. Pickle LW, White AA. Effects of the choice of age-adjustment method on maps of death rates. Statistics in Medicine. 1995;14:615–627. doi: 10.1002/sim.4780140519. [DOI] [PubMed] [Google Scholar]

RESOURCES