An Age-Stratified Poisson Model for Comparing Trends in Cancer Rates Across Overlapping Regions

Yi Li; Ram C Tiwari; Zhaohui Zou

doi:10.1002/bimj.200710430

. Author manuscript; available in PMC: 2008 Sep 16.

Published in final edited form as: Biom J. 2008 Aug;50(4):608–619. doi: 10.1002/bimj.200710430

An Age-Stratified Poisson Model for Comparing Trends in Cancer Rates Across Overlapping Regions

Yi Li ^1,^*, Ram C Tiwari ², Zhaohui Zou ³

PMCID: PMC2536754 NIHMSID: NIHMS50273 PMID: 18615411

Summary

The annual percent change (APC) has been used as a measure to describe the trend in the age-adjusted cancer incidence or mortality rate over relatively short time intervals. The yearly data on these age-adjusted rates are available from the Surveillance, Epidemiology, and End Results (SEER) Program of the National Cancer Institute. The traditional methods to estimate the APC is to fit a linear regression of logarithm of age-adjusted rates on time using the least squares method or the weighted least squares method, and use the estimate of the slope parameter to define the APC as the percent change in the rates between two consecutive years. For comparing the APC for two regions, one uses a t-test which assumes that the two datasets on the logarithm of the age-adjusted rates are independent and normally distributed with a common variance. Two modifications of this test, when there is an overlap between the two regions or between the time intervals for the two datasets have been recently developed. The first modification relaxes the assumption of the independence of the two datasets but still assumes the common variance. The second modification relaxes the assumption of the common variance also, but assumes that the variances of the age-adjusted rates are obtained using Poisson distributions for the mortality or incidence counts. In this paper, a unified approach to the problem of estimating the APC is undertaken by modeling the counts to follow an age-stratified Poisson regression model, and by deriving a corrected Z-test for testing the equality of two APCs. A simulation study is carried out to assess the performance of the test and an application of the test to compare the trends, for a selected number of cancer sites, for two overlapping regions and with varied degree of overlapping time intervals is presented.

Keywords: Age, adjusted incidence/mortality rates, age, stratified Poisson Regression, annual percent change (APC), surveillance, trends, hypothesis testing

1 Introduction

The American Cancer Society (ACS) in its annual publication Cancer ACS 2007 (http://www.cancer.org/) reports that in 2007 about 1.5 million new cancer cases are expected to be diagnosed, and approximately 560,000 Americans are expected to die of cancer. Cancer is the most common cause of death in US, exceeded only by heart disease, and accounts for 1 of every 4 deaths. The same report also reveals that, for a number of cancer sites (such as breast, stomach, colon and rectum, lung and bronchus and leukemia), the age-adjusted cancer mortality rates have been steadly decreasing in recent years. In addition, the National Surveillance, Epidemiology, and End Results (SEER) Program of the National Cancer Institute (NCI) periodically publishes similar reports on trends of cancer incidence at http://seer.cancer.gov/csr; see Ries et al. (2003) So much has been at stake in terms of human life and cost - for example, the government agencies such as the National Health of Institutes (NIH), and many private sectors spend billions of dollars every year on cancer research, health insurance and medical and other costs - that there is an urgent need for new methods that produce more accurate and reliable estimates of measures of cancer trends.

The annual percent change (APC) has been used as a measure of cancer trends over short time periods, and to compare the recent cancer trends by gender or by geographic regions, one compares their APC values using the two-sample pooled t-test ( Kleinbaum et al., 1988) that assumes that the datasets on age-adjusted rates under the study are independent. However, a fundamental statistical difficulty arises when such comparisons, largely for policy making purposes, have to be made for regions or time intervals that overlap, e.g. comparing the most recent changes in trends of cancer rates in a local area (e.g. the mortality rate of breast cancer in California) with a more global level (i.e. the national mortality rate) over two overlapping time periods. For example, as detailed in the data analysis section, it is of substantial interest to compare the changes in California cancer mortality rates with the national cancer mortality rates in the last 15 years.

Recently, Li and Tiwari (2007) and Li et al. (2007) developed Z-tests which adjust for the dependence between the two APCs, and are more efficient than the naive test which assumes independence. However, these tests are based on the logarithmic transformation of the age-adjusted rates, and fits a simple linear regression model of the transformed data on time using either the ordinary least squares (OLS) or the other weighted least squares (WLS) procedures. The proposed test procedure is based on the natural assumption that the age-specific mortality or incidence counts are results of underlying Poisson processes (Brillinger, 1986), and hence are realizations of independent Poisson random variables. The age-specific instantaneous hazards are modeled by a log-link function, thus leading to an age-stratified Poisson model. The estimation of the parameters is then carried out using a likelihood-based approach.

The rest of the paper is organized as follows. In Section 2, we briefly review the existing tests, and derive the new test in Section 3. To compare the performance of the proposed test with respect to the above mentioned tests, a simulation study is carried out in Section 4. In this section, we also give application to breast cancer mortality data from California (CA) and the US extracted from the SEER*STAT software of the SEER Program. Section 5 ends this paper with a short discussion.

2 A Brief Review of Existing Tests

Consider two regions, and let d_kji denote the number of counts (deaths or new cancer cases) from the population at risk n_kji observed in Region k (k = 1, 2) in age-group j (j = 1, …, J) and at times T₁, …, T_m for Region 1 and T_s₊₁, …, T_s₊_n for Region 2, where T₁ ≤ T_s₊₁ < T_m ≤ T_s₊_n, with 0 ≤ s < m leading to overlapping time intervals. Note that this formulation is general and allows one region to have fewer time points than the other. In the SEER program, it is common to choose n_kji (at year T_i) to be the mid-year population representing the total person-years in one year, with the assumption of “drop-outs” being uniform over the unit-intervals. The age-adjusted rates are defined as

r_{k i} = \sum_{j = 1}^{J} w_{j} \frac{d_{kji}}{n_{kji}}

where w_j > 0, j = 1, …, J, are the known standards for the age group j so that $\sum_{j = 1}^{J} w_{j} = 1$ . For the SEER analysis, there are J = 19 standard age-groups consisting of 0–1, 1–4, 5–9, …, 85+, and w_j are chosen to be the year 2000 population standards (Fay et al. 2006).

Let y_ki = log(r_ki), be the logarithmic transformations of the age-adjusted rates. Consider the linear regression models

y_{k i} = β_{0 k} + β_{1 k} t_{k i} + e_{k i}, i = 1, \dots, I_{k},

(1)

for k = 1, 2, flagging Regions 1 and 2, respectively. Here e_ki are random errors with mean 0, and t_ki corresponds to the calendar times of data collection in region k with I₁ = m and I₂ = n. More specifically, (t₁₁, …, t₁_I₁) = (T₁, …, T_m), while (t₂₁, …, t₂_I₂) = (T_s₊₁, …, T_s₊_n). For the two regions, the annual percent change (APC) are defined as APC_k = 100(e^β¹^k − 1)= 100β₁_k, for a small β₁_k, e.g. in the order of 10⁻² (Kim et al., 2000; Fay et al., 2006; Tiwari et al., 2006).

Then under the assumptions that e_ki are independent and have common variance σ², the two-sample pooled t-test (Kleinbaum et al., 1988) for testing the null hypothesis H₀: APC₁ = APC₂ versus the alternative H_a: APC₁ ≠ APC₂ is given by

T_{t} = \frac{{\hat{β}}_{11} - {\hat{β}}_{12}}{\sqrt{{\hat{σ}}^{2} ({(\sum_{i = 1}^{I_{1}} (t_{1 i} - {\bar{t}}_{1})}^{2})^{- 1} + {(\sum_{i = 1}^{I_{2}} (t_{2 i} - {\bar{t}}_{2})}^{2})^{- 1})}} \sim t_{(I_{1} + I_{2} - 4)},

(2)

where ${\bar{t}}_{k} = \sum_{i = 1}^{I_{k}} t_{k i} / I_{k}$ for k = 1, 2, and σ̂² is the “pooled” unbiased estimate of σ² given by

{\hat{σ}}^{2} = \frac{\sum_{i = 1}^{I_{1}} (y_{1 i} - {\hat{y}}_{1 i})^{2} + \sum_{i = 1}^{I_{2}} {(y_{2 i} - {\hat{y}}_{2 i})}^{2}}{I_{1} + I_{2} - 4},

where ŷ_ki = β̂₀_k + β̂₁_kt_ki are the predictions for k = 1, 2. Here, β̂₀_k and β̂₁_k are obtained from the least squares estimation. That is,

{\hat{β}}_{1 k} = \frac{\sum_{i = 1}^{I_{k}} (t_{k i} - {\bar{t}}_{k}) (y_{k i} - {\bar{y}}_{k})}{\sum_{i = 1}^{I_{k}} {(t_{k i} - {\bar{t}}_{k})}^{2}}, {\hat{β}}_{0 k} = {\bar{y}}_{k} - β_{1 k} {\bar{t}}_{k}

where ${\bar{y}}_{k} = \sum_{i = 1}^{I_{k}} y_{k i} / I_{k} .$

The above test is not appropriate, however, when there is an overlap between the two regions or the two time periods. For this case, Li and Tiwari (2007) proposed the following corrected Z-test

Z_{C T} = \frac{{\hat{β}}_{11} - {\hat{β}}_{12}}{{{\hat{σ}}^{2} (σ_{1}^{- 2} + σ_{2}^{- 2} - 2 σ_{12} σ_{1}^{- 2} σ_{2}^{- 2} \frac{{(n^{(O)})}^{2}}{n_{1} n_{2}})}^{1 / 2}},

(3)

where $σ_{k}^{2} = \sum_{i = 1} {(t_{k i} - {\bar{t}}_{k})}^{2}$ , $σ_{12} = \sum_{s + 1}^{m} (T_{i} - {\bar{t}}_{1}) (T_{i} - {\bar{t}}_{2})$ , $n_{k} = \sum_{i = s + 1}^{m} \sum_{j = 1}^{J} n_{kji}$ for k = 1, 2, $n^{(O)} = \sum_{i = s + 1}^{m} \sum_{j = 1}^{J} n_{j i}^{(O)}$ and and $n_{j i}^{(O)}$ are the numbers of at-risk population in the overlapping region. Note that there is no suffix k in $n_{j i}^{(O)}$ . The sign of σ₁₂ determines, whether the covariance between β̂₁₁ and β̂₁₂ is positive or negative, and when there is no overlap in time intervals, σ₁₂ = 0. Under the log-normal model, the corrected Z_CT test was shown to follow a standard normal distribution under the null hypothesis, and to be more efficient than the pooled t-test; see Li and Tiwari (2007).

However, one assumption in Li and Tiwari (2007) is the equal variance in both regression models, which may not be realistic, especially for rare cancers. A further refinement has been made to derive the variance of y_ki by using the Poisson assumptions on the first two moments of the counts d_kji, i.e. E(d_kji) = var(d_kji). Under these assumptions, the consistent estimate of the error variance of e_ki is given by $v_{k i}^{2} = \frac{1}{r_{k i}^{2}} \sum_{j = 1}^{J} w_{j}^{2} \frac{d_{kji}}{n_{kji}^{2}}$ , leading to the following weighted least squares test proposed by Li et al. (2007), referred to as Z_WLS:

Z_{W L S} = \frac{{\tilde{β}}_{11} - {\tilde{β}}_{12}}{{{\tilde{σ}}_{1}^{- 2} + {\tilde{σ}}_{2}^{- 2} - 2 {\tilde{σ}}_{12} {\tilde{σ}}_{1}^{- 2} {\tilde{σ}}_{2}^{- 2} \frac{{(n^{(O)})}^{2}}{n_{1} n_{2}}}^{1 / 2}} .

(4)

with ${\tilde{σ}}_{1}^{2} = \sum_{i = 1}^{m} {(T_{i} - {\tilde{t}}_{1})}^{2} / v_{1 i}^{2}$ and ${\tilde{σ}}_{2}^{2} = \sum_{i = s + 1}^{s + n} (T_{i} - {\tilde{t}}_{2})^{2} / v_{2 i}^{2}$ ,

{\tilde{σ}}_{12} = \sum_{i = s + 1}^{m} (T_{i} - {\tilde{t}}_{1}) (T_{i} - {\tilde{t}}_{2}) \frac{v_{12 i}^{(o)}}{v_{1 i}^{2} v_{2 i}^{2}},

where

v_{12 i}^{(o)} = \frac{1}{r_{1 i} r_{2 i}} \sum_{j = 1}^{J} w_{j}^{2} \frac{d_{j i}^{(o)}}{{(n_{j i}^{(o)})}^{2}},

and $d_{j i}^{(o)}$ are the counts in the overlapping region and during the overlapping period. Here,

{\tilde{t}}_{1} = \frac{\sum_{i = 1}^{m} T_{i} / v_{1 i}^{2}}{\sum_{i = 1}^{m} 1 / v_{1 i}^{2}}, {\tilde{t}}_{2} = \frac{\sum_{i = s + 1}^{s + n} T_{i} / v_{2 i}^{2}}{\sum_{i = s + 1}^{s + n} 1 / v_{2 i}^{2}},

and β̃₁₁, β̃₁₂ are weighted least square estimates of β₁₁, β₁₂.

Under the null hypothesis, because of the normal approximation, Z_WLS approximately follows a standard normal distribution. The Z_WLS has been shown to be more conservative than Z_CT in retaining the size of the test, but is more powerful for the common cancer sites; see Li et al. (2007). However, there are several disadvantages of the existing methods. First, one key step of Li et al. (2007) is the normal approximation of the age-adjusted rates. Secondly, both Li et al. (2007) and Li and Tiwari (2007) need adjustments for zero counts.

3 Age-stratified Poisson Regression Model

As the existing approaches to dealing with age-adjusted cancer rates were all based on the normal approximation, we take a more natural route in the sequel by considering the Poisson nature of the underlying count data and propose an age-stratified Poisson regression to describe the change trend of incident (or death) counts on time. Based on this model, a proper test that accounts for overlapping is proposed.

Specifically, since d_kji, the number counts (deaths or new cancer cases) observed in Region k (k = 1, 2) in age-group j, is a count, we assume that

d_{kji}^{\underset{\sim}{ind}} Pois (n_{kji} λ_{kji}),

with

log λ_{kji} = β_{0 k j} + β_{1 k} t_{k i},

(5)

which is referred to as the Age-stratified Poisson Regression Model as the age-specific intercept β₀_kj is assumed for age-group j. The common slope β₁_k is of particular importance as it transcribes the trends of mortality or incidence and, in particular, determines the APC value.

Again let APC₁ and APC₂ be the corresponding APC values for these two Poisson regressions. A natural test for the null hypothesis H₀: APC₁ = APC₂ versus the alternative hypothesis H₁: APC₁ ≠ APC₂ would be

Z_{POIS} = \frac{{\hat{β}}_{11} - {\hat{β}}_{12}}{\sqrt{Var {{\hat{β}}_{11} - {\hat{β}}_{12}}}},

where β̂₁₁ and β̂₁₂ are the maximum likelihood estimates of β₁₁ and β₁₂ derived in the Appendix.

Because of the possible overlapping of Regions 1 and 2, β̂₁₁ and β̂₁₂ may be correlated. Thus the key to the derivation of the test lies in a correct evaluation of Cov(β̂₁₁, β̂₁₂).

3.1 Derivation of the Test

To proceed, we let β_k = (β ₁_k, β ₀_k₁, …, β ₀_kJ)′, k = 1, 2, whose estimates β̂_k can be btained by solving the score equations [based on (5)]

U_{k} (β_{k}) = 0,

for k = 1, 2, where U_k = (U₁_k_,1, U₀_k_,1, …, U₀_k_,_J)′ and

\begin{array}{l} U_{1 k, 1} = \sum_{i = 1}^{I_{k}} \sum_{j = 1}^{J} n_{kji} t_{k i} exp (β_{0 k j} + β_{1 k} t_{k i}) - \sum_{i = 1}^{I_{k}} \sum_{j = 1}^{J} d_{kji} t_{k i}, \\ U_{0 k, j} = \sum_{i = 1}^{I_{k}} n_{kji} exp (β_{0 k j} + β_{1 k} t_{k i}) - \sum_{i = 1}^{I_{k}} d_{kji}, \end{array}

for j = 1, …, J.

As U_k(β̂_k) = 0, expanding it around the true value β_k, and ignoring the higher order terms yields

0 \equiv \frac{1}{\sqrt{I_{k}}} U_{k} ({\hat{β}}_{k}) = \frac{1}{\sqrt{I_{k}}} U_{k} (β_{k}) + \frac{1}{I_{k}} U_{k}^{(1)} (β_{k}) {\sqrt{I_{k}} ({\hat{β}}_{k} - β_{k})} + o_{p} (1),

where $U_{k}^{(1)} = \partial U_{k} (β_{k}) / \partial β_{k}$ are (J+1) × (J+1) matrices with its 1^st row as $(U_{1 k, 11}^{(1)}, U_{1 k, 01}^{(1)}, \dots, U_{1 k, 0 J}^{(1)})$ and the (j + 1)^th row (j = 1, …, J) as $(U_{0 k, j 1}^{(1)}, U_{0 k, 0 j 1}^{(1)}, \dots, U_{0 k, 0 j J}^{(1)})$ , for k = 1, 2. Here

\begin{array}{l} U_{1 k, 11}^{(1)} = \frac{\partial U_{1 k, 1}}{\partial β_{1 k}} = \sum_{i = 1}^{I_{k}} \sum_{j = 1}^{J} n_{kji} t_{k i}^{2} exp (β_{0 k j} + β_{1 k} t_{k i}), \\ U_{1 k, 0 j}^{(1)} = \frac{\partial U_{1 k, 1}}{\partial β_{0 k j}} = \sum_{i = 1}^{I_{k}} n_{kji} t_{k i} exp (β_{0 k j} + β_{1 k} t_{k i}), \\ U_{0 k, j 1}^{(1)} = \frac{\partial U_{0 k, j}}{\partial β_{1 k}} = \sum_{i = 1}^{I_{k}} n_{kji} t_{k i} exp (β_{0 k j} + β_{1 k} t_{k i}), \\ U_{0 k, 0 j j^{'}}^{(1)} = \frac{\partial U_{0 k, j}}{\partial β_{0 k j^{'}}} = δ_{j j^{'}} \sum_{i = 1}^{I_{k}} n_{kji} exp (β_{0 k j} + β_{1 k} t_{k i}), \end{array}

and δ_jj_′= 1 if j = j′ and 0 otherwise.

Denote by $A_{k} = pli m_{I_{k} \to \infty} - U_{k}^{(1)} (β_{k}) / I_{k}$ for k=1, 2, where plim denotes the limit in probability. Then for large I₁(≡m) and I₂(≡n), standard probabilistic arguments yield

{\sqrt{I_{1}} ({\hat{β}}_{1} - β_{1}), \sqrt{I_{2}} ({\hat{β}}_{2} - β_{2})} \underset{\sim}{d} {A_{1}^{- 1} \frac{1}{\sqrt{I_{1}}} U_{1} (β_{1}), A_{2}^{- 1} \frac{1}{\sqrt{I_{2}}} U_{2} (β_{2})} .

Here, $\underset{\sim}{d}$ denotes approximate equivalence in joint distribution functions. Hence,

\begin{array}{l} Var (\sqrt{m} ({\hat{β}}_{1} - β_{1}), \sqrt{m} ({\hat{β}}_{1} - β_{1})) ≐ A_{1}^{- 1} \frac{1}{m} Cov (U_{1} (β_{1}), U_{1} (β_{1})) A_{1}^{- T}, \\ Var (\sqrt{n} ({\hat{β}}_{2} - β_{2}), \sqrt{n} ({\hat{β}}_{2} - β_{2})^{'}) ≐ A_{2}^{- 1} \frac{1}{n} Cov (U_{2} (β_{2}), U_{2} (β_{2})) A_{2}^{- T}, \\ Cov (\sqrt{m} ({\hat{β}}_{1} - β_{1}), \sqrt{n} ({\hat{β}}_{2} - β_{2})^{'}) ≐ A_{1}^{- 1} \frac{1}{\sqrt{m n}} Cov (U_{1} (β_{1}), U_{2} (β_{2})) A_{2}^{- T}, \end{array}

Let $V = \frac{1}{m} Cov (U_{1} (β_{1}), U_{1} (β_{1}))$ , $W = \frac{1}{n} Cov (U_{2} (β_{2}), U_{2} (β_{2}))$ , $\sum = \frac{1}{\sqrt{m n}} Cov (U_{1} (β_{1}), U_{2} (β_{2}))$ , and V̂, Ŵ, Σ̂ be the corresponding estimates, whose derivations are given in the Appendix.

Then,

\begin{array}{l} \hat{C} o v ({\hat{β}}_{1}, {\hat{β}}_{1}) = m {U_{1}^{(1)} ({\hat{β}}_{1})}^{- 1} \hat{V} {U_{1}^{(1)} ({\hat{β}}_{1})}^{- T}; \\ \hat{C} o v ({\hat{β}}_{2}, {\hat{β}}_{2}) = n {U_{2}^{(1)} ({\hat{β}}_{2})}^{- 1} \hat{W} {U_{2}^{(1)} ({\hat{β}}_{2})}^{- T}; \\ \hat{C} o v ({\hat{β}}_{1}, {\hat{β}}_{2}) = \sqrt{m n} {U_{1}^{(1)} ({\hat{β}}_{1})}^{- 1} \sum^{^} {U_{2}^{(1)} ({\hat{β}}_{2})}^{- T}; \end{array}

These are three (J +1) × (J +1) matrices, the (1, 1) entries of which are ${\hat{σ}}_{1}^{2} = \hat{V} a r ({\hat{β}}_{11})$ , ${\hat{σ}}_{2}^{2} = \hat{V} a r ({\hat{β}}_{12})$ and σ̂₁₂ = Ĉov(β̂₁₁, β̂₁₂), respectively. From this we compute V̂ar (β̂₁₁ − β̂₁₂) = V̂ar (β̂₁₁)+V̂ar (β̂₁₂) − 2Ĉov(β̂₁₁, β̂₁₂). Hence, the Z-test for comparing APC values is given by

Z_{POIS} = \frac{{\hat{β}}_{11} - {\hat{β}}_{12}}{{({\hat{σ}}_{1}^{2} + {\hat{σ}}_{2}^{2} - 2 {\hat{σ}}_{12})}^{1 / 2}},

(6)

which follows the standard normal distribution under H₀: β₁₁ = β₁₂. The computation of β̂₁₁, β̂₁₂ is given in the Appendix.

3.2 ARE Comparison with the WLS test

It is of substantial interest to evaluate the gains in efficiency of the proposed test compared with the WLS test. First note (5) implies that $E (r_{k i}) \equiv E (\sum_{j} w_{j} \frac{d_{kji}}{n_{kji}}) = (\sum_{j} w_{j} e^{β_{0 k j}}) e^{β_{1 k} T_{i}}$ . This in turn implies that

E (log r_{k i}) ≐ log E (r_{k i}) = log (\sum_{j} w_{j} e^{β_{0 k j}}) + β_{1 k} t_{k i},

(7)

when Var(r_ki) is small, which is often the case for the cancer incidence and mortality data (Kim et al., 2000).

A comparison between (1) and (7) reveals that models (1) and (5) approximately specify the same first moment of the age-adjusted cancer rates, making it possible to compare the efficiency of the tests based on these two models via the measure of the Pitman Asymptotic Relative Efficiency. Specifically, standard asymptotic analysis will yield

{\hat{β}}_{11} - {\hat{β}}_{12} \sim N (β_{11} - β_{12}, {\hat{σ}}_{1}^{2} + {\hat{σ}}_{2}^{2} - 2 {\hat{σ}}_{12}),

while, for the WLS estimates,

{\tilde{β}}_{11} - {\tilde{β}}_{12} \sim N (β_{11} - β_{12}, {\tilde{σ}}_{1}^{- 2} + {\tilde{σ}}_{2}^{- 2} - 2 {\tilde{σ}}_{12} {\tilde{σ}}_{1}^{- 2} {\tilde{σ}}_{2}^{- 2} \frac{{(n^{(O)})}^{2}}{n_{1} n_{2}}) .

Hence, the Pitman Asymptotic Relative Efficiency (ARE) comparing tests (6) and (4), which is the ratio of the noncentralities of the above two normal distributions, is given by

ARE = \frac{{\tilde{σ}}_{1}^{- 2} + {\tilde{σ}}_{2}^{- 2} - 2 {\tilde{σ}}_{12} {\tilde{σ}}_{1}^{- 2} {\tilde{σ}}_{2}^{- 2} \frac{{(n^{(O)})}^{2}}{n_{1} n_{2}}}{{\hat{σ}}_{1}^{2} + {\hat{σ}}_{2}^{2} - 2 {\hat{σ}}_{12}}

(8)

The evaluation of (8) typically involves numerical computations.

4 SEER mortality data analysis and Simulations

Li et al. (2007) demonstrated that Z_WLS performs better than Z_CT, via the calculation of ARE, as Z_CT relies on the common variance assumption, which may not be realistic. Hence, we focus this paper on comparing Z_POIS with Z_WLS. To comprehensively evaluate these tests, we consider several scenarios of overlap in two regions and in two different time intervals. Specifically, we assume that Region 1 consists of Georgia (GA), South Carolina (SC), and North Carolina (NC), and that Region 2 consists of NC, Virginia (VA) and Maryland (MD); with NC as the overlapping state between the two regions. The three different time intervals, with varying degree of overlap in the intervals, are taken to be: (a) [1980,1989] for Region 1, and [1990,1999] for Region 2 so that there is no overlap between the two time intervals and, hence, σ₁₂ = 0; (b) [1980,1989] for Region 1, and [1983,1992] for Region 2 so that there a considerable overlap of six years between the two intervals and σ₁₂ = 12.25; (c) [1980,1989] for Region 1, and [1987, 1996] for Region 2 so that there is a little overlap of three years’ between the two intervals and σ₁₂ = −34.75.

The counts, d_kji were generated based on model (5) with t_ki taking values in the intervals corresponding to the two regions stated above. More specifically, the t₁_i take values of {0, 1, …, 9}, while the t₂_i take values of {10, …, 19}, {3, …, 12}, and {7, …, 16}, respectively for cases (a)–(c).

In order to fully specify λ_kji in (5), we assume that β₀_kj = log(d_kj₁/n_kj₁) − β₁_kt_k₁, where d_kj₁ and n_kj₁ are respectively the observed number of deaths and the number of at-risk population at the beginning of the intervals considered, and take β₁_k = log(APC_k/100 + 1), based on the specified values of APC_k. The age-specific counts for the overlapping state, NC, are generated from Poisson distributions with means, $n_{j i}^{(o)} \times \frac{1}{2} (λ_{1 j i} + λ_{2 j i})$ , where $n_{j i}^{(o)}$ denotes the at-risk population in the overlapping region. When λ₁_ji =λ₂_ji, this reduces to the situation specified by the null hypothesis. The number of at risk population and the observed number of deaths were obtained from the SEER database for all malignant male cancers and prostate cancer. The values of APC were assumed to range from −0.3% to 3.0%. For each parameter configuration, a total of 1000 simulated data were obtained. The results for the three time-overlapping cases are summarized in Tables 1–3.

Table 1.

Comparison of the power functions under various hypotheses between the Poisson-based test Z_POIS and the weighted-least-squares based test Z_WLS for two overlapping regions over disjoint time intervals, Region 1 (1980–1989) vs Region 2 (1990–1999). APC1 and APC2 are the annual percent changes in Regions 1 and 2, respectively.

Cancer Sites	APC₁	APC₂	ARE	Z_WLS	Z_POIS
All Malignant	0.100	0.100	1.1490	0.059	0.050
	−0.300	−0.300	1.1491	0.049	0.045
	0.500	0.500	1.1492	0.060	0.049
	1.000	1.000	1.1493	0.048	0.053
	3.000	3.000	1.1504	0.047	0.057
	0.100	0.500	1.1507	0.877	0.907
	−0.300	0.300	1.1509	0.998	1.000
	1.000	2.000	1.1515	1.000	1.000
	1.000	3.000	1.1518	1.000	1.000
Prostate	0.100	0.100	1.1610	0.043	0.045
	−0.300	−0.300	1.1611	0.041	0.047
	0.500	0.500	1.1612	0.051	0.041
	1.000	1.000	1.1613	0.046	0.051
	3.000	3.000	1.1614	0.045	0.051
	0.100	0.500	1.1617	0.182	0.212
	−0.300	0.300	1.1619	0.357	0.40
	1.000	2.000	1.1622	0.773	0.830
	1.000	3.000	1.1634	1.000	1.000

Open in a new tab

Table 3.

Comparison of the power functions between the Poisson based Z_POIS and the weighted-least-squares based Z_WLS for two overlapping regions over slightly overlapping time intervals, Region 1 (1980–1989) vs Region 2 (1987–1996). APC1 and APC2 are the annual percent changes in Regions 1 and 2, respectively.

Cancer Sites	APC₁	APC₂	ARE	Z_WLS	Z_POIS
All Malignant	0.100	0.100	1.1350	0.044	0.044
	−0.300	−0.300	1.1350	0.044	0.046
	0.500	0.500	1.1351	0.046	0.051
	1.000	1.000	1.1351	0.045	0.051
	3.000	3.000	1.1352	0.043	0.047
	0.100	0.500	1.1354	0.834	0.891
	−0.300	0.300	1.1355	0.994	0.994
	1.000	2.000	1.1362	1.000	1.000
	1.000	3.000	1.1367	1.000	1.000
Prostate	0.100	0.100	1.1410	0.045	0.051
	−0.300	−0.300	1.1410	0.043	0.049
	0.500	0.500	1.1412	0.060	0.051
	1.000	1.000	1.1423	0.061	0.055
	3.000	3.000	1.1426	0.052	0.052
	0.100	0.500	1.1428	0.170	0.184
	−0.300	0.300	1.1431	0.319	0.361
	1.000	2.000	1.1436	0.706	0.759
	1.000	3.000	1.1439	0.998	1.000

Open in a new tab

We remark that that, even though both Z_WLS and Z_POIS are derived under different model assumptions, they are both valid tests for testing the equality of two APCs and hence the ARE defined in (8) is valid. The tables show that the ARE of Z_POIS with respect to Z_WLS is greater than 1 for all the three cases, meaning that Z_POIS would be more powerful than Z_WLS when the alternative hypothesis is true. The tables also show that in most situations Z_POIS outperforms the Z_WLS in retaining the Type I error probabilities and, hence, yields a more valid test. Also the powers of both WLS and Poisson-based tests are sensitive to the delta values (the differences of APC values). The larger the delta values are, the more powerful the tests are. The larger delta values also lead to slightly larger AREs, though the differences are not so obvious.

It is of substantial interest to compare the changes in cancer mortality rates in California with the national levels starting late 1980’s as a California law (Health and Safety Code, Section 103885) was passed then, which mandated the reporting of malignancies diagnosed throughout the state. For this purpose, we applied the proposed methodology to compare the annual percent change (APC) in the age-adjusted mortality rates for the United States (US) for the period from 1988–2002 to that of California (CA) for the period from 1990 to 2004. We fitted the weighted linear models as well as the age-stratified Poisson model, and applied both tests to compare the age-adjusted mortality rates of female breast cancer in CA for the 16-year period from 1989–2004 to that of US for the 16-year period from 1987–2002, for which the national mortality data were available. The observed values of the log-transformed annual age-adjusted rates and fitted regression lines from the Z_POIS test procedure are plotted in Figure 1. The parameter estimates and the values of the test statistics are summarized in Table 4. The results indicated the mortality rates of Breast cancer for California and the US have decreased. Both tests reject the null hypothesis of equality of the two APCs, indicating that the annual percent change (APC) of California, is significantly different from the national level. However, the p-value for Z_POIS is much smaller than that of Z_WLS, rendering more evidence again the null hypothesis.

Fig. 1 — Observed and fitted log-transformed age-adjusted breast cancer mortality rates in CA [1989–2004] and US [1987–2002]

Table 4.

Comparing APC of Breast Cancer Mortality Between CA and the US

	CA	US

APC_WLS	−2.33	−1.94
SE_WLS	0.084	0.027
Z_WLS	−4.95	(p=0.000000757)

APC_POIS	−2.29	−1.84
SE_POIS	0.083	0.026
Z_POIS	−5.62	(p= 0.000000019593)

Open in a new tab

5 Discussion

In this paper, we have considered an important problem where comparisons have to be made for regions or time intervals that overlap. As opposed to the existing work, e.g Li and Tiwari (2007) and Li et al. (2007), this project advances this area in two distinct ways. First, the developed test does not rely on the normal approximation of the cancer rate, but directly model the counts to follow a Poisson regression model. The parameters are then estimated using their maximum likelihood estimates, and the Z-test is derived for testing the equality of two APCs. Secondly, the developed Poisson regression model can easily accommodate 0 count data (for the rare cancers), as opposed to the log normal model (Li and Tiwari, 2007; Li et al., 2007) which needs to involve extra zero-corrected adjustment. We have applied the developed methodology to the analysis of the major cancer sites from the SEER Program and have found that the corrected Z-test renders more power than the existing tests. A Bayesian Poisson regression would be a useful approach. However, choice of priors is always difficult and computation may not be so straightforward compared to this current work, wherein analytical solutions have been derived. Hence, we envision that the proposed method would be preferable because of simple interpretation of the model parameters, natural choice of the model and computational readiness.

In our technical development, we have modelled the logarithm transformation of the age-adjusted rates as a linear regression on time in (5) and have indeed explicitly assumed parallelism across age groups. That is, the growth curves of the cancer rates for various age groups share the same slope, which carries the information for the APC. Indeed, linearity parallelism for the cancer rates could be a debatable issue in cancer surveillance, which is likely to be violated for some cancers. One alternative, along the line of generalized mixed models, is to assume a random slope (as opposed to a constant slope) across age groups. This ongoing work will be reported in a subsequent communication.

Table 2.

Comparison of the power functions between the Poisson based Z_POIS and the weighted-least-squares based Z_WLS for two overlapping regions over roughly the same time intervals, Region 1 (1980–1989) vs Region 2 (1983–1992). APC1 and APC2 are the annual percent changes in Regions 1 and 2, respectively.

Cancer Sites	APC₁	APC₂	ARE	Z_WLS	Z_POIS
All Malignant	0.100	0.100	1.1710	0.055	0.057
	−0.300	−0.300	1.1710	0.056	0.057
	0.500	0.500	1.1711	0.048	0.050
	1.000	1.000	1.1712	0.047	0.049
	3.000	3.000	1.1723	0.051	0.056
	0.100	0.500	1.1726	0.872	0.908
	−0.300	0.300	1.1729	0.994	0.997
	1.000	2.000	1.1733	1.000	1.000
	1.000	3.000	1.1737	1.000	1.000
Prostate	0.100	0.100	1.1820	0.043	0.047
	−0.300	−0.300	1.1821	0.043	0.043
	0.500	0.500	1.1822	0.046	0.041
	1.000	1.000	1.1823	0.035	0.053
	3.000	3.000	1.1823	0.034	0.055
	0.100	0.500	1.1825	0.170	0.197
	−0.300	0.300	1.1827	0.341	0.393
	1.000	2.000	1.1832	0.742	0.802
	1.000	3.000	1.1835	1.000	1.000

Open in a new tab

Acknowledgments

The authors thank the editor, an AE and two referees for their insightful suggestion, which led to a better version of this manuscript.

Appendix A: Derivation of V̂, Ŵ, Σ̂

Write

V = {(\begin{matrix} V_{11} & V_{12} \\ V_{12}^{'} & V_{22} \end{matrix})}_{(J + 1) \times (J + 1)}; W = {(\begin{matrix} W_{11} & W_{12} \\ W_{12}^{'} & W_{22} \end{matrix})}_{(J + 1) \times (J + 1)}; \sum = {(\begin{matrix} \sum_{11} & \sum_{12} \\ \sum_{12}^{'} & \sum_{22} \end{matrix})}_{(J + 1) \times (J + 1)}

where

V_{11} = \frac{1}{m} Var (U_{11, 1}) = \frac{1}{m} \sum_{i = 1}^{m} \sum_{j = 1}^{J} T_{i}^{2} Var (d_{1 j i}) = \frac{1}{m} \sum_{i = 1}^{m} \sum_{j = 1}^{J} T_{i}^{2} E (d_{1 j i}) .

Hence a consistent estimate is ${\hat{V}}_{11} = \frac{1}{m} \sum_{i = 1}^{m} \sum_{j = 1}^{J} T_{i}^{2} d_{1 j i} .$

Similarly,

{\hat{V}}_{12} = ({\hat{V}}_{12, 1, \dots,} {\hat{V}}_{12, J})

where

{\hat{V}}_{12, j} = \hat{C} o v (U_{11, 1,} U_{01, j}) = \frac{1}{m} \sum_{i = 1}^{m} T_{i} d_{1 j i} .

Also, V₂₂ = ((V₂₂_,jj_′)) with V̂₂₂_,jj_′= Ĉov (U_01,_j; U_01,_j_′) = 0; j ≠ j′; $\frac{1}{m} \sum_{i = 1}^{m} d_{1 j i,} j = j^{'} .$

Next compute the estimate of W:

W_{11} = \frac{1}{n} Var (U_{12, 1}) = \frac{1}{n} \sum_{i = s + 1}^{s + n} \sum_{j = 1}^{J} T_{i}^{2} Var (d_{2 j i}) = \frac{1}{n} \sum_{i = s + 1}^{s + n} \sum_{j = 1}^{J} T_{i}^{2} E (d_{2 j i})

so that ${\hat{W}}_{11} = \frac{1}{n} \sum_{i = s + 1}^{s + n} \sum_{j = 1}^{J} T_{i}^{2} d_{2 j i}$ Similarly, Ŵ₁₂ = (Ŵ₁₂_,₁, …, Ŵ_12,_J) where ${\hat{W}}_{12, j} = \hat{C} o v (U_{12, 1}, U_{02, j}) = \frac{1}{n} \sum_{i = s + 1}^{n} T_{i} d_{2 j i}$ . Also, Ŵ₂₂ = ((Ŵ₂₂_,jj_′)) with ${\hat{W}}_{22, j j^{'}} = 0, j \neq j^{'}; = \frac{1}{n} \sum_{i = s + 1}^{s + n} d_{2 j i,} j = j^{'}$ . Finally, the estimate of Σ is computed as follows:

\begin{array}{l} \sum_{11} = \frac{1}{\sqrt{m n}} Cov (U_{11, 1}, U_{12, 1}) \\ = \frac{1}{\sqrt{m n}} Cov (\sum_{i = 1}^{m} \sum_{j = 1}^{J} T_{i} d_{1 j i}, \sum_{i = s + 1}^{s + n} \sum_{j = 1}^{J} T_{i} d_{2 i j}) \\ = \frac{1}{\sqrt{m n}} Cov (\sum_{i = 1}^{m} \sum_{j = 1}^{J} T_{i} (d_{1 j i}^{(N O)} + d_{j i}^{(O)}), \sum_{i = s + 1}^{s + n} \sum_{j = 1}^{J} T_{i} (d_{2 j i}^{(N O)} + d_{j i}^{(O)})) \end{array}

where $d_{kji} = d_{jki}^{(N O)} + d_{j i}^{(O)}$ and the superscripts “NO” and “O” denote the non-overlapping and overlapping regions, respectively. Thus, ${\sum^{^}}_{11} = \frac{1}{\sqrt{m n}} \sum_{i = s + 1}^{m} \sum_{j = 1}^{J} T_{i}^{2} d_{j i}^{(O)}$ . Let Σ̂₁₂ = (Σ₁₂_,₁, …, Σ̂₁₂_,J ) where ${\sum^{^}}_{12, j} = \frac{1}{\sqrt{m n}} \sum_{i = s + 1}^{m} T_{i} d_{j i}^{(O)}$ . Also, Σ̂₂₂ = ((Σ̂_22,_jj_′)) with ${\sum^{^}}_{22, j, j^{'}} = 0, j \neq j^{'}; \frac{1}{\sqrt{m n}} \sum_{i = s + 1}^{m} d_{j i}^{(O)}, j = j^{'}$ .

Appendix B: Computation of the MLEs

Note that the MLEs of β₀_jk and β₁_k satisfy:

\begin{array}{l} e^{β_{0 j k}} = \frac{\sum_{i} d_{kji}}{\sum_{i} e^{β_{1 k} T_{i}} n_{kji}}; \\ \tilde{U} (β_{1 k}) = \sum_{j, i} n_{kji} T_{i} (\frac{\sum_{i} d_{jki}}{\sum_{i} e^{β_{1 k} T_{i}} n_{kji}}) e^{β_{1 k} T_{i}} - \sum_{j, i} d_{kji} T_{i} \\ = \sum_{j} d_{k j .} (\frac{\sum_{i} n_{kji} T_{i} e^{β_{1 k} T_{i}}}{\sum_{i} e^{β_{1 k} T_{i}} n_{kji}}) - \sum_{i} d_{k . i} T_{i} \\ = \sum_{j} d_{k j .} [A_{k j} (1) A_{k j}^{- 1} (0)] - \sum_{i} d_{k . i} T_{i} = 0; k = 1, 2, \end{array}

where d_kj_· =Σ_i d_kji, d_k_·_i =Σ_j d_kji, and $A_{k j} (a) = \sum_{i} n_{kji} T_{i}^{a} e^{β_{1 k} T_{i}}$ for a = 0, 1, 2.

Since Ũ(β₁_k) is a monotonic function of β₁_k, there is a unique solution to this equation. Let β̂₁_k be the solution. This is the MLE of β₁_k. We can use a Newton-Raphson method to obtain β̂₁_k as follows. Let β̂₁_k(l) be the estimate at the l^th iteration, then the estimate at the (l + 1)^th iteration is given by β̂₁_k(l + 1) = β̂₁_k(l) − [Ũ⁽¹⁾(β̂₁_k(l))]^{− 1}Ũ(β̂₁_k(l)), where Ũ⁽¹⁾(β) is the first (partial) derivative of Ũ(b) with respect to b evaluated at b = β. We stop iterating wh en |β̂₁_k(l + 1) − β̂₁_k(l)| < ε for some pre-specified value of ε. Note that

\begin{array}{l} {\tilde{U}}^{(1)} (β_{1 k}) = \sum_{j, i} n_{jki} T_{i}^{2} e^{β_{1 k} T_{i}} (\frac{\sum_{i} d_{kji}}{\sum_{i} e^{β_{1 k} T_{i}} n_{kji}}) - \sum_{j, i} n_{jki} t_{i} e^{β_{1 k} t_{i}} \frac{\sum_{i} n_{kji} t_{i} e^{β_{1 k} t_{i}}}{{(\sum_{i} e^{β_{1 k} t_{i}} n_{kji})}^{2}} \\ = \sum_{j} (\frac{(\sum_{i} n_{kji} t_{i}^{2} e^{β_{1 k} t_{i}}) (\sum_{i} d_{kji})}{\sum_{i} e^{β_{1 k} t_{i}} n_{kji}}) - \sum_{j} \frac{{(\sum_{i} n_{kji} t_{i} e^{β_{1 k} t_{i}})}^{2}}{{(\sum_{i} e^{β_{1 k} t_{i}} n_{kji})}^{2}} \\ = \sum_{j} d_{k j .} (\frac{\sum_{i} n_{kji} t_{i}^{2} e^{β_{1 k} t_{i}}}{\sum_{i} e^{β_{1 k} t_{i}} n_{kji}}) - \sum_{j} \frac{{(\sum_{i} n_{kji} t_{i} e^{β_{1 k} t_{i}})}^{2}}{{(\sum_{i} e^{β_{1 k} t_{i}} n_{kji})}^{2}} \\ = \sum_{j} dkj . [A_{k j} (2) {(A_{k j} (0))}^{- 1}] - \sum_{j} [{(A_{k j} (1))}^{2} {(A_{k j} (0))}^{- 1}], k = 1, 2. \end{array}

Substituting the MLE β̂₁_k in place for β₁_k in $e^{β_{0 k j}} = \frac{\sum_{i} d_{kji}}{\sum_{i} e^{β_{1 k} t_{i}} n_{kji}}$ gives the MLE ${\hat{β}}_{0 k j} = log (\sum_{i} d_{kji}) - log (\sum_{i} e^{{\hat{β}}_{1 k} t_{i}} n_{kji}) .$

References

American Cancer Society. Cancer Facts & Figures. Atlanta: Georgia; 2007. [Google Scholar]
Brillinger DR. The natural variability of vital rates and associated statistics (with discussion) Biometrics. 1986;42:693–734. [PubMed] [Google Scholar]
Fay M, Tiwari R, Feuer E, Zou Z. Estimating Average Annual Percent Change for Disease Rates without Assuming Constant Change. Biometrics. 2006;62:847–854. doi: 10.1111/j.1541-0420.2006.00528.x. [DOI] [PubMed] [Google Scholar]
Kim H, Fay M, Feuer E, Midthune D. Permutation tests for joinpoint regression with applications to cancer rates. Statistics in Medicine. 2000;19:335–351. doi: 10.1002/(sici)1097-0258(20000215)19:3<335::aid-sim336>3.0.co;2-z. [DOI] [PubMed] [Google Scholar]
Kleinbaum D, Kupper, Muller P. Applied Regression Analysis and Other Multivariable Methods. 2 PWS-Kent; Boston, Mass: 1988. [Google Scholar]
Li Y, Tiwari R. Comparing Trends in Age-Adjusted Cancer Rates Across Overlapping Regions or Time Intervals for the NCI SEER Program. Technical Report. 2007 http://www.bepress.com/harvardbiostat/paper71.
Li Y, Tiwari R, Walters K, Zou Z. A Weighted-Least-Squares Estimation Approach to Comparing Trends in Age-Adjusted Cancer Rates Across Overlapping Regions. Technical Report. 2007 http://biowww.dfci.harvard.edu/~yili/apc2.pdf. [PMC free article] [PubMed]
Ries L, Eisner M, Kosary C, Hankey B, Miller B, Clegg L, Mariotto A, Feuer E, Edwards B. SEER Cancer Statistics Review, 1975–2002. National Cancer Institute; Bethesda, MD: 2003. http://seer.cancer.gov/csr/1975-2002/ [Google Scholar]
Tiwari R, Clegg L, Zou Z. Efficient interval estimation for age-adjusted cancer rates. Statistical Methods in Medical Research. 2006;15:547–569. doi: 10.1177/0962280206070621. [DOI] [PubMed] [Google Scholar]

[R1] American Cancer Society. Cancer Facts & Figures. Atlanta: Georgia; 2007. [Google Scholar]

[R2] Brillinger DR. The natural variability of vital rates and associated statistics (with discussion) Biometrics. 1986;42:693–734. [PubMed] [Google Scholar]

[R3] Fay M, Tiwari R, Feuer E, Zou Z. Estimating Average Annual Percent Change for Disease Rates without Assuming Constant Change. Biometrics. 2006;62:847–854. doi: 10.1111/j.1541-0420.2006.00528.x. [DOI] [PubMed] [Google Scholar]

[R4] Kim H, Fay M, Feuer E, Midthune D. Permutation tests for joinpoint regression with applications to cancer rates. Statistics in Medicine. 2000;19:335–351. doi: 10.1002/(sici)1097-0258(20000215)19:3<335::aid-sim336>3.0.co;2-z. [DOI] [PubMed] [Google Scholar]

[R5] Kleinbaum D, Kupper, Muller P. Applied Regression Analysis and Other Multivariable Methods. 2 PWS-Kent; Boston, Mass: 1988. [Google Scholar]

[R6] Li Y, Tiwari R. Comparing Trends in Age-Adjusted Cancer Rates Across Overlapping Regions or Time Intervals for the NCI SEER Program. Technical Report. 2007 http://www.bepress.com/harvardbiostat/paper71.

[R7] Li Y, Tiwari R, Walters K, Zou Z. A Weighted-Least-Squares Estimation Approach to Comparing Trends in Age-Adjusted Cancer Rates Across Overlapping Regions. Technical Report. 2007 http://biowww.dfci.harvard.edu/~yili/apc2.pdf. [PMC free article] [PubMed]

[R8] Ries L, Eisner M, Kosary C, Hankey B, Miller B, Clegg L, Mariotto A, Feuer E, Edwards B. SEER Cancer Statistics Review, 1975–2002. National Cancer Institute; Bethesda, MD: 2003. http://seer.cancer.gov/csr/1975-2002/ [Google Scholar]

[R9] Tiwari R, Clegg L, Zou Z. Efficient interval estimation for age-adjusted cancer rates. Statistical Methods in Medical Research. 2006;15:547–569. doi: 10.1177/0962280206070621. [DOI] [PubMed] [Google Scholar]

PERMALINK

An Age-Stratified Poisson Model for Comparing Trends in Cancer Rates Across Overlapping Regions

Yi Li

Ram C Tiwari

Zhaohui Zou

Summary

1 Introduction

2 A Brief Review of Existing Tests

3 Age-stratified Poisson Regression Model

3.1 Derivation of the Test

3.2 ARE Comparison with the WLS test

4 SEER mortality data analysis and Simulations

Table 1.

Table 3.

Fig. 1.

Table 4.

5 Discussion

Table 2.

Acknowledgments

Appendix A: Derivation of V̂, Ŵ, Σ̂

Appendix B: Computation of the MLEs

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

An Age-Stratified Poisson Model for Comparing Trends in Cancer Rates Across Overlapping Regions

Yi Li

Ram C Tiwari

Zhaohui Zou

Summary

1 Introduction

2 A Brief Review of Existing Tests

3 Age-stratified Poisson Regression Model

3.1 Derivation of the Test

3.2 ARE Comparison with the WLS test

4 SEER mortality data analysis and Simulations

Table 1.

Table 3.

Fig. 1.

Table 4.

5 Discussion

Table 2.

Acknowledgments

Appendix A: Derivation of V̂, Ŵ, Σ̂

Appendix B: Computation of the MLEs

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases