Bayesian Parametric Accelerated Failure Time Spatial Model and its Application to Prostate Cancer

Jiajia Zhang; Andrew B Lawson

doi:10.1080/02664760903521476

Author manuscript; available in PMC: 2012 Mar 1.

Published in final edited form as: J Appl Stat. 2011 Mar;38(2):591–603. doi: 10.1080/02664760903521476

PMCID: PMC3070364 NIHMSID: NIHMS204764 PMID: 21475617

Abstract

Prostate cancer is the most common cancer diagnosed in American men and the second leading cause of death from malignancies. There are large geographical variations and racial disparities in the survival rate of prostate cancer. Much work on spatial survival models is based on the proportional hazards model, but little has focused on the accelerated failure time model. In this paper, we investigate the prostate cancer data of Louisiana from the SEER program; the violation of the proportional hazards assumption suggests that a spatial survival model based on the accelerated failure time model is more appropriate for this data set. To account for possible extra-variation, we consider spatially referenced independent or dependent spatial structures. The deviance information criterion (DIC) is used to select the best-fitting model within the Bayesian framework. The results from our study indicate that age, race, stage and geographical distribution are significant in evaluating prostate cancer survival.

Keywords: Spatial, Accelerated failure time model, Deviance information criterion (DIC), Bayesian, Likelihood

1. Introduction

In public health and population-based biomedical studies, data are often collected by geographic region, such as the district or postal code of the residence of individuals. Adjacent neighborhoods are often more alike than distant regions because of similar environmental and social factors. Failing to account for the correlation within neighborhoods may lead to biased statistical inference.

Recently, considerable attention has been paid to the analysis of geographical patterns of survival times, in addition to the impact of other covariates. For example, Henderson et al. [10] modeled spatial variation in survival of acute myeloid leukemia patients in northwest England; Banerjee et al. [2] applied a spatial frailty model to infant mortality in Minnesota by assuming a parametric Weibull baseline hazard and geostatistical or Gaussian Markov random field priors for the spatial component; Li and Ryan [17] analyzed the effect of risk factors on the onset of childhood asthma with spatial data from the East Boston Asthma Study; and Hennerfeind et al. [12] applied a geoadditive survival model to data on waiting times for coronary artery bypass grafting. However, all of these examples focus on the proportional hazards (PH) model or its extensions, which measure the spatial effects on the hazard scale. For example, Henderson et al. [10] used the PH model because their initial survival analysis indicated that the PH assumption was satisfied in their data set. There is some discussion of spatial survival models for non-proportional cases, such as the multivariate adaptive regression spline (MARS) model [19], the additive hazard survival model with frailty [20], dynamic survival models with spatial frailty [3], and the normal transformation model for spatially correlated data [16]. In the normal transformation model, the survival outcome marginally follows a PH model, and in their discussion the authors proposed an extension to the accelerated failure time (AFT) model. The AFT model is widely accepted as an alternative approach when the PH assumption does not hold, but there are few studies using semiparametric Bayesian analysis in the AFT model [1, 8, 21]. A semi-Bayesian analysis of the AFT model is given by Christensen and Johnson [5], who utilized a Dirichlet process to estimate the survival distribution. Walker and Mallick [24] proposed a fully Bayesian approach for the median regression model using the Polya tree prior. Recently, Komárek and his colleagues [14, 15] proposed a normal mixture as the error distribution in the AFT model.

In this paper, we propose an AFT spatial model by adding a random effect to the AFT model for investigating risk effects of prostate cancer (PrCA). Prostate cancer diagnosis data from the state of Louisiana from the Surveillance, Epidemiology, and End Results (SEER) program [11] of the National Cancer Institute (NCI) is used as an example. The purpose of this application is to investigate whether PrCA is much more aggressive in African-Americans than in Whites and whether there exists regional environmental difference. The estimation procedure is developed from a Bayesian perspective with parametric assumption. A semiparametric AFT estimation approach [5, 7, 14, 15, 24] could be considered for this study, which requires more effort on programming than the parametric AFT model. In order to provide a clear picture of the AFT spatial model and its application, we will analyze the PrCA data using WinBUGS code [18]. Then, the DIC [22] is applied to choose the best fit parametric model. In the appendix, the WinBUGS code for the parametric AFT spatial model or the parametric AFT frailty model is provided.

The remainder of this paper is organized as follows: Section 2 describes the data that motivate this study. The standard survival analysis and possible issues are described in Section 3. Section 4 outlines the AFT spatial model. The estimation procedure is discussed in Section 5. Section 6 illustrates the application of the proposed approach to the PrCA data from Louisiana. Finally, Section 7 summarizes and discusses the results.

2. Motivating Data

Prostate cancer (PrCA) is a major public health problem, which over a lifetime will affect an estimated one in five American men. Since PrCA is the number one incident cancer and the number two cause of cancer deaths among US men, the PrCA data from the SEER program are particularly important for researchers, clinicians, policy makers, and citizens in understanding this disease. The SEER program has 17 registries, which include San Francisco-Oakland, Connecticut, Detroit, Hawaii, Iowa, New Mexico and Utah for the period 1973-2004; Seattle for the period 1974-2004; Atlanta for the period 1975-2004; Alaska, San Jose-Monterey, Los Angeles and Rural Georgia for the period 1992-2004; and Greater California, Kentucky, Louisiana and New Jersey for the period 2000-2004. We extract the PrCA data from the SEER cancer incidence public-use database. Observations with missing values on race, age, county of residence, stage and marital status at diagnosis are excluded from this analysis. According to the patient’s medical records, race includes White, Black, or Other, with Black being the designator for African American. In this study we are only interested in the disparities between White and Black patients, so we remove other races. Stage of cancer has four categories: local, regional, distant and unstaged. Unstaged means that the information is not sufficient to assign a stage to the cancer, so we exclude the unstaged cases.

In order to investigate large geographical variation and racial disparities in the survival rate of PrCA, we need to select a registry with a relatively large proportion of African-American males. After checking all registries, we focused our study on the SEER data set from Louisiana, which has 64 counties and in which the proportion of black men is 29.34%. Note that the data from Louisiana cannot represent the whole population because of the limited observation period, but they do represent the status of the incidence for the five-year period 2000-2004 in Louisiana.

The individual-specific information for a patient that is used in this study is: age (age of the patient at diagnosis in complete years), race (White and Black), county (patient’s county of residence at the time of diagnosis), stage (SEER summary stage, localized/regional and distant), marital status at diagnosis (single, married and other), and survival time after diagnosis (including censoring time). It is worthwhile pointing out that in the definition of the stage, localized tumors are confined to the prostate gland, regional tumors have spread to contiguous organs or lymph nodes, and distant tumors have spread to remote organs. Clinically localized tumors are frequently upstaged to regional stage after surgery, so there is an extra category (localized/regional) only for PrCA in SEER data. We have 446 observations from the localized category, 103 observations from the regional category, and 15 132 observations from the localized/regional category, so we combined both localized and regional staged cancers into the localized/regional category in our data analysis. Table 1 provides a summary of the characteristics of the PrCA patients included in this study.

Table 1.

Summary characteristics of prostate cancer patients: Louisiana, 2000-2004

Covariate	N	Patients(%)
Race
Black	3 006	29.34
White	7 240	70.66
Marital Status
Single (Never married)	939	9.16
Married	7 752	75.66
Other (Separated, divorced)	1 555	15.18
Cancer Stage
Localized/regional	9 870	96.33
Distant	376	3.67
Vital status
Alive	9 274	90.51
Dead	972	9.49


3. Modeling Issues

Commonly, the PH model and the AFT model are the most popular survival models and the nonparametric Kaplan-Meier (KM) survival curve is used as a rule of thumb to choose between them [13]. After visual inspection, the test based on the Schoenfeld residuals [23] is applied. For illustration purpose, we select nine different counties from the different parts of Louisiana in order to detect the difference between locations. Among these, Caddo, Bossier and Webster are in the northwest; Sabine, Grant and Avoyelles are in the midwest; Calcasieu, Vermilion and Acadia are in the southwest. In each county, we fitted the KM survival curves for white and black respectively (Figure 1). In each plot, the y-axis presents the survival probability for white or black and x-axis denotes the time period after the diagnosis of PrCA.

Figure 1. KM survival curves according to race for PrCA in nine different counties, Louisiana.

From Figure 1, we find that the survival rate does change markedly with the location and it appears that the middle west part of the state tends to have higher survival rates than the other two parts. So, considering spatial effects in survival models should improve the estimates of risk effects.
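As a minimal sketch of how each KM curve in Figure 1 is obtained, the product-limit estimate can be written in a few lines of Python. The function name `kaplan_meier` and the toy follow-up data are illustrative assumptions; they are not the SEER records or the code used for the paper.

```python
def kaplan_meier(times, events):
    """Product-limit estimate: return [(t, S(t))] at each distinct death time.

    times  : follow-up times (death or censoring)
    events : 1 if death was observed, 0 if the time is censored
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all subjects that share the same follow-up time t.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths > 0:
            surv *= 1.0 - deaths / n_at_risk
            curve.append((t, surv))
        # Deaths and censored subjects both leave the risk set after t.
        n_at_risk -= removed
    return curve


# Hypothetical toy data: months of follow-up and death indicator.
toy_times = [5, 8, 8, 12, 20, 20, 31]
toy_events = [1, 0, 1, 1, 0, 1, 0]
km_curve = kaplan_meier(toy_times, toy_events)
```

Applying the function separately to each race within a county would give the pair of step curves shown in one panel of the figure.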

The Schoenfeld residual test is used to check the PH assumption. A P-value below the significance level (such as 0.05) indicates that the PH assumption is not satisfied. Investigating the survival curves with respect to race, we find that some of the KM survival curves in Figure 1 cross, such as in Webster, Avoyelles and Acadia counties. The Schoenfeld residual test gives P-values below 0.05 in Bossier (P-value = 0.0261), Avoyelles (P-value = 0.043), Vermilion (P-value = 0.00892) and Acadia (P-value = 0.0428) counties. The P-value does not reach significance in Webster county (P-value = 0.103). Thus, we doubt that the PH assumption holds and consider the AFT spatial model for this data set.

4. Accelerated Failure Time Spatial Model

Let T_ij denote the survival time after diagnosis for patient j in county i, and x_ij denote possible risk effects corresponding to T_ij, where j = 1, … , n_i, i = 1, … , n. The AFT model can be expressed as:

\log (T_{i j}) = μ + β x_{i j} + σ ε_{i j},

where β is the unknown coefficient, ε_ij’s are independent random errors, μ and σ are the shape parameter and scale parameter. Letting n_i = 1, we obtain the regular AFT model.

The spatial structure can be considered by adding a random effect to the AFT model and the AFT spatial model is specified as:

\log (T_{i j}) = μ + β x_{i j} + W_{i} + σ ε_{i j},

(1)

where W_i’s are spatial random effects. An advantage of the AFT spatial model is that the risk and spatial effects on the failure time are easy to interpret, since the model simply regresses the logarithm of the survival time on covariates and random spatial effects. In this paper, we consider county-specific random effects.
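To make model (1) concrete, the following sketch simulates survival times from it with standard normal errors. All numerical values (county effects, μ, β, σ) and the function name `simulate_aft_spatial` are hypothetical choices for illustration, not estimates from the paper.

```python
import math
import random


def simulate_aft_spatial(x, county, w, mu, beta, sigma, rng):
    """Draw T_ij from log(T_ij) = mu + beta * x_ij + W_i + sigma * eps_ij,
    with eps_ij ~ N(0, 1), for a single scalar covariate x_ij."""
    times = []
    for x_ij, i in zip(x, county):
        eps = rng.gauss(0.0, 1.0)
        log_t = mu + beta * x_ij + w[i] + sigma * eps
        times.append(math.exp(log_t))
    return times


rng = random.Random(0)
w = [0.3, -0.3]          # hypothetical county effects W_1, W_2
x = [0, 1, 0, 1]         # hypothetical binary covariate
county = [0, 0, 1, 1]    # county index of each patient
t = simulate_aft_spatial(x, county, w, mu=4.0, beta=-0.5, sigma=0.8, rng=rng)

# On the time scale, W_i multiplies every survival time in county i by exp(W_i),
# so two otherwise identical patients differ by the factor exp(W_1 - W_2).
time_ratio_between_counties = math.exp(w[0] - w[1])
```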

Let f(·) denote the density function of T and f₀(·) denote the density function of ε. S(·) and S₀(·) denote the survival functions, and h(·) and h₀(·) represent the hazard functions corresponding to f(·) and f₀(·). Then, we have

f (t_{i j} ∣ W_{i}) = \frac{1}{σ t_{i j}} f_{0} (\frac{\log (t_{i j}) - λ (x_{i j})}{σ}),

S (t_{i j} ∣ W_{i}) = S_{0} (\frac{\log (t_{i j}) - λ (x_{i j})}{σ}),

h (t_{i j} ∣ W_{i}) = \frac{1}{σ t_{i j}} h_{0} (\frac{\log (t_{i j}) - λ (x_{i j})}{σ}),

where λ(x_ij) = μ + βx_ij + W_i. From the relationship between the survival functions, we can see that the spatial random effects have a direct effect on the survival probability. Note that in the AFT spatial model the hazard rate keeps changing over time even when the spatial random effect is fixed, whereas in the PH spatial model the hazard ratio between regions stays constant over time. For some data sets, we believe it is more reasonable to assume that the hazard rate changes over time even in the same location.
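The density and hazard expressions above follow from the survival function by differentiation. A short derivation, writing z_ij (our shorthand, not used elsewhere in the paper) for the standardized log time:

```latex
\begin{aligned}
z_{ij} &= \frac{\log(t_{ij}) - \lambda(x_{ij})}{\sigma},
\qquad \frac{\partial z_{ij}}{\partial t_{ij}} = \frac{1}{\sigma t_{ij}}, \\
f(t_{ij} \mid W_i) &= -\frac{\partial}{\partial t_{ij}} S_0(z_{ij})
  = \frac{1}{\sigma t_{ij}} f_0(z_{ij}), \\
h(t_{ij} \mid W_i) &= \frac{f(t_{ij} \mid W_i)}{S(t_{ij} \mid W_i)}
  = \frac{1}{\sigma t_{ij}} \frac{f_0(z_{ij})}{S_0(z_{ij})}
  = \frac{1}{\sigma t_{ij}} h_0(z_{ij}).
\end{aligned}
```

Because both the factor 1/(σ t_ij) and z_ij depend on t_ij, the hazard generally varies with time even when W_i is held fixed.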

It is common to assume that S₀(·) comes from the standard normal distribution, the standard extreme value distribution, or the standard logistic distribution. The S₀(·) expressions and their corresponding S(·)’s are summarized in Table 2, where Φ(·) denotes the cumulative distribution function of the standard normal distribution. Corresponding to the distribution of ε, the survival time T follows the lognormal, Weibull or loglogistic distribution, respectively.

Table 2.

Common distributions in the AFT spatial model, where λ(x_ij) = μ + βx_ij + W_i

Distribution	S₀(·)	S(·)
Normal	1 – Φ(ε_ij)	$1 - Φ (\frac{\log (t_{i j}) - λ (x_{i j})}{σ})$
Extreme value	exp(–exp(ε_ij))	$\exp \{- {(\exp (- λ (x_{i j})) t_{i j})}^{1 ∕ σ}\}$
Logistic	$\frac{1}{1 + \exp (ε_{i j})}$	$\frac{1}{1 + {(\exp (- λ (x_{i j})) t_{i j})}^{1 ∕ σ}}$

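The three baseline choices in Table 2 can be sketched as one function that evaluates S₀ at the standardized log time. The function name `aft_survival` and the numerical inputs are hypothetical; the last two lines compare against the closed forms on the time scale (Weibull and loglogistic).

```python
import math


def aft_survival(t, lam, sigma, dist):
    """S(t) = S0((log t - lam) / sigma) for the three baselines in Table 2."""
    z = (math.log(t) - lam) / sigma
    if dist == "normal":      # T is lognormal
        return 0.5 * math.erfc(z / math.sqrt(2.0))  # equals 1 - Phi(z)
    if dist == "extreme":     # T is Weibull
        return math.exp(-math.exp(z))
    if dist == "logistic":    # T is loglogistic
        return 1.0 / (1.0 + math.exp(z))
    raise ValueError("dist must be 'normal', 'extreme' or 'logistic'")


# Hypothetical linear predictor, scale and time point.
lam, sigma, t = 4.0, 0.8, 30.0
s_all = {d: aft_survival(t, lam, sigma, d)
         for d in ("normal", "extreme", "logistic")}

# Closed forms on the time scale, for comparison.
weibull_closed = math.exp(-((math.exp(-lam) * t) ** (1.0 / sigma)))
loglogistic_closed = 1.0 / (1.0 + (math.exp(-lam) * t) ** (1.0 / sigma))
```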

We consider survival data (t_ij, δ_ij, x_ij), where δ_ij is the censoring indicator. We assume that the censoring is independent and noninformative. Let W = (W₁, … , W_n), O denote the observations and ϕ = {μ, σ, β} denote the parameters to be estimated. Given the spatial random effect, the likelihood function can be written as:

\begin{aligned} L (t ∣ ϕ, W) & = \prod_{i = 1}^{n} \prod_{j = 1}^{n_{i}} f {(t_{i j} ∣ W_{i})}^{δ_{i j}} S {(t_{i j} ∣ W_{i})}^{1 - δ_{i j}} \\ & = \prod_{i = 1}^{n} \prod_{j = 1}^{n_{i}} {[\frac{1}{σ t_{i j}} f_{0} (\frac{\log (t_{i j}) - λ (x_{i j})}{σ})]}^{δ_{i j}} S_{0} {(\frac{\log (t_{i j}) - λ (x_{i j})}{σ})}^{1 - δ_{i j}} . \end{aligned}

(2)
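For the normal baseline, the logarithm of likelihood (2) can be sketched as follows: an observed death contributes the log density and a censored time contributes the log survival probability. The function name `lognormal_aft_loglik` is a hypothetical label, and `lam` is assumed to already hold λ(x_ij) = μ + βx_ij + W_i for each patient.

```python
import math


def lognormal_aft_loglik(t, delta, lam, sigma):
    """Log of likelihood (2) with a standard normal error distribution.

    t     : observed times t_ij
    delta : 1 if death observed, 0 if censored
    lam   : lambda(x_ij) = mu + beta * x_ij + W_i for each observation
    sigma : scale parameter
    """
    total = 0.0
    for t_ij, d_ij, lam_ij in zip(t, delta, lam):
        z = (math.log(t_ij) - lam_ij) / sigma
        if d_ij == 1:
            # log[(1 / (sigma * t)) * f0(z)] with f0 the N(0, 1) density
            log_f0 = -0.5 * z * z - 0.5 * math.log(2.0 * math.pi)
            total += log_f0 - math.log(sigma * t_ij)
        else:
            # log S0(z) = log(1 - Phi(z)), computed via erfc for stability
            total += math.log(0.5 * math.erfc(z / math.sqrt(2.0)))
    return total


# Two hypothetical observations at t = 1 with lambda = 0: one death, one censored.
ll = lognormal_aft_loglik(t=[1.0, 1.0], delta=[1, 0], lam=[0.0, 0.0], sigma=1.0)
```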

The spatial random effects may or may not be correlated among counties. We refer to mutually uncorrelated county-specific effects as spatially uncorrelated heterogeneity and model this situation with independent Gaussian distributions defined as $W_{i} \sim Normal (0, v_{s}^{2})$ , where $v_{s}^{2}$ denotes the variance of the spatial random effect. It is worthwhile pointing out that the AFT spatial model under the independence assumption is similar to the AFT frailty model with normal random effects, albeit with frailty effects at the county level rather than the individual level. In the correlated situation, we can consider the conditional autoregressive (CAR) model. The CAR model, first introduced by Besag et al. [4], is widely used not only for smoothing in image processing but also in disease mapping. This formulation permits correlation among the random effects according to a neighborhood structure:

W_{i} ∣ (W_{k}, v_{s}^{2}) \sim Normal (\bar{W}_{i}, σ_{i}^{2}),

where

\bar{W}_{i} = \frac{\sum_{k} W_{k} g_{i k}}{\sum_{k} g_{i k}},

σ_{i}^{2} = v_{s}^{2} ∕ \sum_{k} g_{i k},

g_{i k} = 1 (if regions i and k are adjacent, i \neq k); 0 (otherwise).

For the model’s identifiability, it is common to assume that $\sum_{i} W_{i} = 0$ . From the specification of the CAR distribution, it can be seen that W_i in the ith region depends on the corresponding values in its neighboring regions and on the number of neighbors of the ith region, hence exhibiting spatial correlation. We call W_i with the CAR prior specification the spatially correlated heterogeneity.
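The conditional mean and variance of the CAR prior reduce to simple neighbor averages when the weights g_ik are binary. A minimal sketch, assuming a hypothetical four-region map laid out in a line and an arbitrary value of v_s²:

```python
def car_conditional(i, w, neighbors, v2):
    """Conditional mean and variance of W_i given its neighbors under the CAR prior.

    With binary adjacency weights g_ik, the mean is the average of the
    neighboring effects and the variance is v2 divided by the neighbor count.
    """
    nbrs = neighbors[i]
    m_i = len(nbrs)                       # sum_k g_ik
    mean = sum(w[k] for k in nbrs) / m_i  # sum_k W_k g_ik / sum_k g_ik
    var = v2 / m_i
    return mean, var


# Hypothetical map: regions 0-1-2-3 in a line; effects chosen to sum to zero.
neighbors = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
w = [0.2, -0.1, 0.4, -0.5]
mean_1, var_1 = car_conditional(1, w, neighbors, v2=0.5)
```

Regions with more neighbors have a smaller conditional variance, so their effects are smoothed more strongly toward the local average.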

We also include both spatially correlated and uncorrelated random effects in a single model to permit a trade-off between independence and a purely local spatially structured dependence of the random effects [4], that is W_i = W_i1 + W_i2, where

W_{i 1} ∣ (W_{k 1}, v_{s 1}^{2}) \sim Normal (\bar{W}_{i 1}, σ_{i}^{2}),

where

\bar{W}_{i 1} = \frac{\sum_{k} W_{k 1} g_{i k}}{\sum_{k} g_{i k}},

σ_{i}^{2} = v_{s 1}^{2} ∕ \sum_{k} g_{i k},

g_{i k} = 1 (if regions i and k are adjacent, i \neq k); 0 (otherwise),

and

W_{i 2} \sim Normal (0, v_{s 2}^{2}) .

This combined spatially correlated and uncorrelated heterogeneity is called a convolution prior, and $v_{s 1}^{2}$ and $v_{s 2}^{2}$ are used to control the variability of the correlated and uncorrelated heterogeneity separately. Apparently, the convolution prior maintains the correlation between adjacent counties, but the correlation is weakened by the uncorrelated structure.

5. Estimation Procedure

The parameter estimates for the AFT spatial model in this study are obtained by posterior sampling based on a Markov chain Monte Carlo (McMC) simulation method. Let p(ϕ) denote the prior distribution for ϕ and p(v_s) denote the prior for the variance of the spatial random effects. The posterior distribution can be expressed as

p (ϕ, W, v_{s} ∣ t) \propto L (t ∣ ϕ, W) p (W ∣ v_{s}) p (ϕ) p (v_{s}) .

(3)

To conduct data analysis from the Bayesian perspective, we must specify the prior distribution for each parameter in the model. Because we have little prior information about the parameters to be estimated, we want the data to dominate the prior by assuming reasonably non-informative priors for all parameters in this model. For all regression coefficients β and the shape parameter μ, we assume independent vague normal priors with mean 0 and variance 1 × 10⁶. The scale parameter σ is given a non-informative gamma prior with shape parameter 1 and rate parameter 0.001 (mean 1000, variance 1 × 10⁶). For the parameters $v_{s}^{2}$ , $v_{s 1}^{2}$ and $v_{s 2}^{2}$ , which control the variability of the spatial random effects, we assign a vague proper gamma prior with shape parameter 0.001 and rate parameter 0.001 to their reciprocals (the precision parameters of the random effects). Posterior sampling of the AFT spatial model can proceed from the definition of the posterior distribution in (3).
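The mean and variance quoted for the prior on σ can be checked directly if 0.001 is read as the rate of the gamma distribution, which is how WinBUGS parameterizes `dgamma`; this reading is our assumption about the intended parameterization:

```latex
\sigma \sim \mathrm{Gamma}(a, b), \quad a = 1, \; b = 0.001 \ (\text{rate}): \qquad
E(\sigma) = \frac{a}{b} = \frac{1}{0.001} = 1000, \qquad
\mathrm{Var}(\sigma) = \frac{a}{b^{2}} = \frac{1}{0.001^{2}} = 1 \times 10^{6}.
```

Under a scale parameterization with the same numbers the mean would instead be 0.001, which does not match the stated values.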

Given the likelihood function Eq (2), the posterior distribution can be factored into different components:

for $σ ∣ W, β : \propto \prod_{i = 1}^{n} \prod_{j = 1}^{n_{i}} f {(t_{i j} ∣ W, ϕ, v_{s})}^{δ_{i j}} S {(t_{i j} ∣ W, ϕ, v_{s})}^{1 - δ_{i j}} P (σ)$
for $W ∣ σ, β, v_{s} : \propto \prod_{i = 1}^{n} \prod_{j = 1}^{n_{i}} f {(t_{i j} ∣ W, ϕ, v_{s})}^{δ_{i j}} S {(t_{i j} ∣ W, ϕ, v_{s})}^{1 - δ_{i j}} P (W ∣ v_{s})$
for $β ∣ W, μ, σ : \propto \prod_{i = 1}^{n} \prod_{j = 1}^{n_{i}} f {(t_{i j} ∣ W, ϕ, v_{s})}^{δ_{i j}} S {(t_{i j} ∣ W, ϕ, v_{s})}^{1 - δ_{i j}} P (β)$
for $μ ∣ β, W, σ : \propto \prod_{i = 1}^{n} \prod_{j = 1}^{n_{i}} f {(t_{i j} ∣ W, ϕ, v_{s})}^{δ_{i j}} S {(t_{i j} ∣ W, ϕ, v_{s})}^{1 - δ_{i j}} P (μ)$

where P(σ) ~ Gamma(a, b); P(W|v_s) is defined conditionally through $W_{i} ∣ v_{s}^{2} \sim Normal (\bar{W}_{i}, σ_{i}^{2})$ ; P(β_j) ~ Normal(0, c); and P(μ) ~ Normal(0, c), with a = b = 0.001 and c = 10⁶. To evaluate competing models, we ran each model using multiple chains with overdispersed starting points. Trace plots, the Brooks-Gelman-Rubin diagnostic [6], and autocorrelations within chains are used to assess the convergence of the iterations based on the multiple chains.

Remark: WinBUGS is used to run the whole program, with the “zeros trick” employed since the joint likelihood function cannot be expressed directly by a standard density function. This trick allows arbitrary sampling distributions to be used, and is particularly suitable when, say, dealing with truncated distributions.
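The idea behind the trick can be sketched outside WinBUGS: a Poisson variable with mean φ takes the value 0 with probability exp(−φ), so observing a pseudo-datum of 0 with φ_ij = C − log L_ij contributes log L_ij − C to the log likelihood. The Python below only illustrates this identity; it is not the WinBUGS code from the appendix, and the per-observation terms and the constant `c` are hypothetical.

```python
def zeros_trick_loglik(loglik_terms, c):
    """Total log likelihood implied by Poisson(phi) pseudo-observations of zero,
    where phi = c - log L for each observation."""
    total = 0.0
    for ll in loglik_terms:
        phi = c - ll
        if phi <= 0:
            raise ValueError("constant c is too small: Poisson mean must be positive")
        total += -phi  # log P(Z = 0) for Z ~ Poisson(phi)
    return total


# Hypothetical per-observation log-likelihood contributions.
terms = [-1.2, -0.3, -4.0]
c = 10.0
trick = zeros_trick_loglik(terms, c)

# Adding back the constant n * c recovers the original log likelihood,
# so the posterior is unchanged up to a normalizing constant.
recovered = trick + len(terms) * c
```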

6. Real Data Analysis

The risk effects we consider in the AFT or AFT spatial model include the effect of race, marital status, age and stage, which are very common in PrCA analysis.

We will assume ε follows the standard normal distribution, the standard extreme value distribution and the logistic distribution in both the AFT model and the AFT spatial models. In the AFT spatial model, we will consider three different cases according to different spatial correlations, which are summarized as:

Case 1: W_i is spatially uncorrelated. That is W_i follows an independent normal distribution.
Case 2: W_i is spatially correlated. That is W_i follows the CAR model.
Case 3: W_i = W_i1 + W_i2, where W_i1 is the spatially correlated random effect and W_i2 denotes the spatially uncorrelated random effect. This case considers both spatially correlated and uncorrelated effects.

In our algorithm we ran two initially overdispersed, parallel McMC chains for 20 000 iterations each. Then, we discarded the first 10 000 iterations as pre-convergence burn-in and retained the remaining 10 000 for the posterior analysis. In the Bayesian framework, model assessment and the choice of the best-fitting model can be performed using the DIC [22], which is a Bayesian analog of Akaike’s information criterion (AIC). pD represents the effective number of parameters, which reflects the model complexity or degrees of freedom. If a Bayesian hierarchical model has negligible prior information, pD will approximate the actual number of parameters, and DIC will approximate AIC. Lower values of DIC indicate a better-fitting model. Spiegelhalter et al. [22] suggest that models with DIC values within 1 or 2 units of the “best” model deserve consideration, those with values within 3-7 units of the “best” are only weakly supported, and models with a DIC value more than 7 units higher than the “best” model are substantially inferior. The DIC and pD are listed in Table 3.

Table 3.

A comparison of goodness-of-fit (DIC, pD) for the AFT model and the three cases of the AFT spatial model between the three baseline survival distributions (normal, extreme value and logistic)

Distribution of S₀(·)	DIC	pD
Normal AFT	11 950.0	6.133
Normal+Case 1	11 960.0	65.42
Normal+Case 2	11 930.0	17.59
Normal+Case 3	12 000.0	85.01
Extreme value AFT	12 090.0	12.05
Extreme+Case 1	12 090.0	66.51
Extreme+Case 2	12 060.0	19.25
Extreme+Case 3	12 200.0	117.60
Logistic AFT	12 000.0	5.768
Logistic+Case 1	12 020.0	68.93
Logistic+Case 2	11 990.0	17.27
Logistic+Case 3	12 090.0	99.14


From Table 3, we can see that the model with spatially correlated random effects under the normal baseline is the best among these cases, which has the smallest DIC value (11 930). Therefore, we believe that the survival probability is affected by the geographical region. The pD value for the normal distribution with spatial correlation (Normal+Case 2) is 17.59, which indicates the complexity of the model.
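The comparison can be reproduced from the reported values. The sketch below first states the usual definition of DIC from the posterior mean deviance and the deviance at the posterior mean (standard definitions from [22], not code from the paper), then ranks the twelve fits by their difference from the smallest DIC in Table 3. The function and variable names are hypothetical.

```python
def dic(mean_deviance, deviance_at_posterior_mean):
    """Return (DIC, pD), with pD = Dbar - D(theta_bar) and DIC = Dbar + pD."""
    p_d = mean_deviance - deviance_at_posterior_mean
    return mean_deviance + p_d, p_d


# DIC values as reported in Table 3.
table3_dic = {
    "Normal AFT": 11950.0, "Normal+Case 1": 11960.0,
    "Normal+Case 2": 11930.0, "Normal+Case 3": 12000.0,
    "Extreme value AFT": 12090.0, "Extreme+Case 1": 12090.0,
    "Extreme+Case 2": 12060.0, "Extreme+Case 3": 12200.0,
    "Logistic AFT": 12000.0, "Logistic+Case 1": 12020.0,
    "Logistic+Case 2": 11990.0, "Logistic+Case 3": 12090.0,
}

best = min(table3_dic, key=table3_dic.get)
delta_dic = {name: value - table3_dic[best] for name, value in table3_dic.items()}
ranked = sorted(delta_dic.items(), key=lambda item: item[1])
```

With these inputs every competing model lies at least 20 DIC units above the best one, well beyond the 7-unit threshold quoted above.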

The estimated parameters for the normal distribution with spatial correlation are summarized in Table 4.

Table 4.

The best fitting AFT spatial model (Normal+Case 2): mean parameter estimates, sample standard deviations, and quantiles

	mean	sd	2.5%	50%	97.5%
age	−0.0336^*	0.00184	−0.0374	−0.0335	−0.0301
marital	−0.0096	0.0299	−0.0688	−0.0100	0.0494
race	0.1843^*	0.0360	0.1171	0.1835	0.2553
stage	−0.9643^*	0.0569	−1.080	−0.9617	−0.8578


* denotes that the coefficient is significant at the 0.95 level.

The exponential of a coefficient gives the multiplicative effect of the covariate on survival time. For example, a one-unit increase in age multiplies the survival time by e^−0.0336 = 0.967, a decrease of about 3.3%. From the table we can see that age, race and stage have a significant influence on the survival probability of PrCA (as indicated by * in the table). Marital status does not display significance. More than 75% of patients in this study are married, so there may not be enough evidence to show the effect of marital status.
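The multiplicative effects can be computed directly from the posterior means in Table 4. This is only an arithmetic sketch; the dictionary keys follow the table's row labels, and the coding of the categorical covariates is not stated in the text, so no direction is attached to them here.

```python
import math

# Posterior means from Table 4.
coef = {"age": -0.0336, "marital": -0.0096, "race": 0.1843, "stage": -0.9643}

# exp(coefficient) is the factor applied to survival time per unit change.
time_ratio = {name: math.exp(b) for name, b in coef.items()}

# Effects compound multiplicatively: a ten-year age difference.
ten_year_age_ratio = math.exp(10 * coef["age"])
```

Under the fitted model, a ten-year difference in age at diagnosis corresponds to a survival-time factor of roughly 0.71.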

In order to show the spatial effect, we present the median of the posterior spatial random effect in Figure 2, which displays considerable spatial structure in the middle eastern area of the state. It is worthwhile pointing out that the survival time is affected by the exponential of the spatial random effects: larger spatial random effects indicate longer survival times. Note that the cut points in this figure are generated automatically by the WinBUGS default, which is based on the absolute value of the variable to be mapped and chosen to give equally spaced intervals.

Figure 2. Posterior mean of spatial random effects from the best fitted model, Louisiana counties.

For illustration purposes, we compare the estimated survival curves between the five regions indicated in Figure 2, since the survival curve for each region will be affected by the spatial random effects. The estimated survival curves for the different races based on the AFT spatial model and the KM approach according to the different regions are displayed in Figure 3. In the KM approach, we only consider the race effect. In the AFT spatial model, we consider the median value of the age, marital status and stage for each race and the median of the estimated spatial random effects in each region.

Figure 3. Fitted survival curves from the KM approach and the AFT spatial model, Region 1-5, Louisiana counties. The step line represents the estimated survival curve from the KM approach and the smoothed line represents the estimated curve from the normal AFT spatial model.

We can see that the survival curves from the AFT spatial model are similar to those from the KM approach, which indicates that the AFT spatial model fits the data set well. As pointed out by a referee, the survival curve corresponding to “Black” for the AFT spatial model does not fit the KM curve very well when the time is around 20 to 30 in Region 1, 2, or 3. To address this issue, we may relax the parametric assumption. For example, we may assume that the survival time follows the generalized gamma distribution or a piecewise exponential distribution. A nonparametric approach can also be considered. The survival probability is different in each region, which may indicate a geographical effect on PrCA. For example, the survival rate for black men at 60 months in region 1 is around 0.6 from the AFT spatial model and 0.67 from the KM approach, while in region 5 it is around 0.8 from the AFT spatial model and 0.83 from the KM approach. Similar effects can be found in other regions.

7. Discussion and Conclusion

In this paper, we investigate the PrCA data of Louisiana from the SEER program by the Bayesian parametric AFT spatial model. The spatially correlated and un-correlated heterogeneity were considered to characterize the spatial distribution pattern for better understanding the geographic features of survival probability of PrCA. WinBUGS is used to analyze the PrCA data via the parametric AFT spatial model and the DIC criterion is used to select the appropriate assumption for the error term distribution and correlation structure. The estimated survival curve from the parametric AFT spatial model is compared to that from the nonparametric approach. Finally, we concluded that the normal AFT spatial model with correlated spatial random effects is the best fitting model to analyze the PrCA data of Louisiana. The results from our study indicate that the age, race, stage and geographical distribution have significant impact in evaluating PrCA survival in Louisiana.

However, we have focused on a parametric specification of the AFT spatial model, so the DIC criterion is applied to check the model fit. The model can be made more flexible if we relax the parametric assumption, which will increase the computational difficulties [5, 7, 24]. For the spatial component a CAR model is a common choice [9]. We demonstrate that different forms of spatial model have variable success in describing the Louisiana data, but a CAR model yields the best fit. Even though the semiparametric structure is more flexible than the parametric model, the parametric spatial survival model is recommended in this paper because it can be implemented easily in WinBUGS in practice. In addition, we fit this data set with the PH spatial model with the lognormal distribution. Under the PH spatial model with the lognormal distribution, the DIC and pD are 12 010 and 19.40 for the spatially uncorrelated case, 12 006 and 19.98 for the spatially correlated case, and 12 007 and 20.14 for the convolution prior. Thus the lognormal PH spatial model has a better fit under the spatial correlation structure. However, comparing it with the AFT spatial model, we find that the AFT spatial model with the normal distribution (DIC = 11 930; pD = 17.59) is still a better fit than the PH spatial model with the lognormal distribution (DIC = 12 006; pD = 19.98).

It is also worthwhile pointing out that this model could be extended in a number of ways. First, more complex spatial structures could be included. Second, smoothing terms rather than linear terms could be allowed for covariate modeling, such as log(T_ij) = μ + ∑f_k(x_k) + W_i + σε_ij, where f_k(·) is an unknown function. Finally, latent spatial structure could be important, and this might be a fruitful path to pursue in application to such survival data.

8. Acknowledgement

We sincerely thank the referees for the valuable comments that led to a greatly improved presentation of our work. The project described was supported by Award Number R03CA139538 from the National Cancer Institute. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Cancer Institute or the National Institutes of Health.

Appendix

References

[1] Banerjee S, Dey DK. Semiparametric proportional odds models for spatially correlated survival data. Lifetime Data Anal. 2005;11:175–191. doi: 10.1007/s10985-004-0382-z.
[2] Banerjee S, Wall MM, Carlin BP. Frailty modeling for spatially correlated survival data, with application to infant mortality in Minnesota. Biostatistics. 2003;4:123–142. doi: 10.1093/biostatistics/4.1.123.
[3] Bastos LS, Gamerman D. Dynamic survival models with spatial frailty. Lifetime Data Anal. 2006;12:441–460. doi: 10.1007/s10985-006-9020-2.
[4] Besag J, York J, Mollié A. Bayesian image restoration, with two applications in spatial statistics (with discussion). Ann. Inst. Stat. Math. 1991;43:1–59.
[5] Christensen R, Johnson W. Modelling accelerated failure time with a Dirichlet process. Biometrika. 1988;75:693–704.
[6] Gelman A, Rubin DB. Inference from iterative simulation using multiple sequences (with discussion). Statist. Sci. 1992;7:457–511.
[7] Hanson T, Johnson WO. A Bayesian semiparametric AFT model for interval-censored data. J. Comput. Graph. Statist. 2004;13:341–361.
[8] Hanson T, Yang M. Bayesian semiparametric proportional odds models. Biometrics. 2007;63:88–95. doi: 10.1111/j.1541-0420.2006.00671.x.
[9] Henderson R, Diggle P, Dobson A. Identification and efficacy of longitudinal markers for survival. Biostatistics. 2002;3:33–50. doi: 10.1093/biostatistics/3.1.33.
[10] Henderson R, Shimakura S, Gorst D. Modeling spatial variation in leukemia survival data. J. Amer. Statist. Assoc. 2002;97:965–972.
[11] National Cancer Institute. Surveillance, Epidemiology, and End Results (SEER) Program. www.seer.cancer.gov. Limited-Use Data. DCCPS, Surveillance Research Program, Cancer Statistics Branch, released April 2008, based on the November 2007 submission, 1973–2005.
[12] Hennerfeind A, Brezger A, Fahrmeir L. Geoadditive survival models. J. Amer. Statist. Assoc. 2006;101:1065–1075.
[13] Klein JP, Moeschberger ML. Survival Analysis: Techniques for Censored and Truncated Data. Springer-Verlag Inc; New York: 1997.
[14] Komárek A, Lesaffre E. Bayesian accelerated failure time model for correlated interval-censored data with a normal mixture as error distribution. Statist. Sinica. 2007;17:549–569.
[15] Komárek A, Lesaffre E, Hilton JF. Accelerated failure time model for arbitrarily censored data with smoothed error distribution. J. Comput. Graph. Statist. 2005;14:726–745.
[16] Li Y, Lin X. Semiparametric normal transformation models for spatially correlated survival data. J. Amer. Statist. Assoc. 2006;101:591–603.
[17] Li Y, Ryan L. Modeling spatial survival data using semiparametric frailty models. Biometrics. 2002;58:287–297. doi: 10.1111/j.0006-341x.2002.00287.x.
[18] Lunn D, Thomas A, Best N, Spiegelhalter D. WinBUGS - a Bayesian modelling framework: concepts, structure, and extensibility. Statist. Comput. 2000;10:325–337.
[19] Mallick BK, Denison DGT, Smith AFM. Bayesian survival analysis using a MARS model. Biometrics. 1999;55:1071–1077. doi: 10.1111/j.0006-341x.1999.01071.x.
[20] Silva GL, Amaral-Turkman MA. Bayesian analysis of an additive survival model with frailty. Comm. Statist. A. 2004;33:2517–2533.
[21] Sinha D, Dey DK. Semiparametric Bayesian analysis of survival data. J. Amer. Statist. Assoc. 1997;92:1195–1212.
[22] Spiegelhalter DJ, Best NG, Carlin BP, van der Linde A. Bayesian measures of model complexity and fit (with discussion, pp. 583–639). J. Roy. Statist. Soc. Ser. B. 2002;64:583–616.
[23] Therneau TM, Grambsch PM. Modeling Survival Data: Extending the Cox Model. Springer-Verlag Inc; New York: 2000.
[24] Walker S, Mallick BK. A Bayesian semiparametric accelerated failure time model. Biometrics. 1999;55:477–483. doi: 10.1111/j.0006-341x.1999.00477.x.

