Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Sep 1.
Published in final edited form as: Ann Appl Stat. 2015 Nov 2;9(3):1549–1570. doi: 10.1214/15-AOAS840

COMBINING NONEXCHANGEABLE FUNCTIONAL OR SURVIVAL DATA SOURCES IN ONCOLOGY USING GENERALIZED MIXTURE COMMENSURATE PRIORS

Thomas A Murray ‡,*, Brian P Hobbs ‡,*,, Bradley P Carlin §,*
PMCID: PMC4635972  NIHMSID: NIHMS704244  PMID: 26557211

Abstract

Conventional approaches to statistical inference preclude structures that facilitate incorporation of supplemental information acquired from similar circumstances. For example, the analysis of data obtained using perfusion computed tomography to characterize functional imaging biomarkers in cancerous regions of the liver can benefit from partially informative data collected concurrently in non-cancerous regions. This paper presents a hierarchical model structure that leverages all available information about a curve, using penalized splines, while accommodating important between-source features. Our proposed methods flexibly borrow strength from the supplemental data to a degree that reflects the commensurability of the supplemental curve with the primary curve. We investigate our method’s properties for nonparametric regression via simulation, and apply it to a set of liver cancer data. We also apply our method for a semiparametric hazard model to data from a clinical trial that compares time to disease progression for three colorectal cancer treatments, while supplementing inference with information from a previous trial that tested the current standard of care.

Keywords and phrases: Bayesian hierarchical model, clinical trials, colorectal cancer, commensurate prior, computed tomographic imaging, evidence synthesis, mixture priors, penalized splines, proportional-hazards, semiparametric methods

1. Introduction

Statistical investigations begin by determining which sources(s) of information will be used to answer the motivating questions and generate hypotheses for future exploration. Conventional approaches to statistical inference preclude structures that facilitate incorporation of partially informative data, imposing polarity on the data selection process. Putatively relevant supplemental data acquired from broadly similar therapeutic interventions, patient cohorts, previous investigations, or biological processes are often either excluded from statistical analysis or treated as exchangeable with the primary data. In oncology, prospective studies designed to evaluate the performance of an experimental therapy usually ignore historical information about the control therapy, and limit enrollment to patients presenting lesions with a particular histological subtype, grade, or performance status, or who are naïve to prior therapy. In contrast, studies with more liberal information inclusion rules, such as intention-to-treat designs, often pool information from potentially heterogeneous sources, or adjust for this potential heterogeneity using simple linear regressors.

Ignoring relevant, supplemental sources of information reduces the reproducibility and scope of the primary study. On the other hand, using the supplemental information while neglecting to account for heterogeneity between the sources of information obscures understanding of the complex underlying mechanisms that produced the primary data and may lead to severely biased inference. Relaxing this dichotomy would improve the efficiency of the experimental process and enable investigators to implement statistical models that use all available information while accommodating important between-source features. Several models have been proposed for incorporating partially informative supplemental data. Pocock (1976) used generalized linear models with static, data-independent borrowing using a pre-specified amount of between-source variability. Ibrahim and Chen (2000) proposed data-independent or dynamic non-hierarchical methods for partially weighting likelihoods. Bayesian (Smith, Spiegelhalter and Thomas, 1995) and frequentist (Doi, Barendregt and Mozurkewich, 2011) methods using hierarchical modeling have been developed for estimating between-source variability for univariate observables or repeated measures with generalized linear relations among covariates.

Bayesian hierarchical models facilitate dynamic partial pooling of between-source information and flexibly estimate the extent of borrowing. Here the goal is limiting bias for estimating primary effects when between-source heterogeneity occurs, while improving efficiency when approximate coherence, or commensurability, occurs; see Hobbs, Sargent and Carlin (2012) for generalized linear mixed models, and Hobbs, Carlin and Sargent (2013) and Murray et al. (2014) for piecewise-exponential time-to-event models. In Section 2 we discuss these methods, which can be used to borrow strength from supplemental data in parametric models for regression coefficients and other univariate parameters. We then develop a Bayesian hierarchical model structure to leverage supplemental information more generally, including for both nonparametric regression and semiparametric hazard models with penalized splines. The literature appears devoid of general methods for flexibly borrowing strength from supplemental data in semi- and non-parametric models for a group of related parameters that characterize a complex object (e.g., a curve or surface).

Two oncological applications motivate our methodological developments. The first, from diagnostic radiology, involves estimating prognostic functional imaging biomarkers acquired using perfusion computed tomography (CT) in cancerous and cancer-free liver tissue. Perfusion CT is an emerging technology that enables observation and quantification of characteristics pertaining to the passage of fluid through blood vessels. Researchers have developed physiological models to quantify a variety of perfusion characteristics derived from analysis of the distribution of contrast enhancement in tissue acquired using repeated CT scans during intravenous administration of contrast medium. Investigators have used the technology in a number of organs and tumors, including prostate, colorectal, head and neck, lung, and liver. Miles and Griffiths (2003) review the clinical relevance of perfusion CT.

Our application considers three characteristics: permeability-surface area product (PS), blood volume (BV), and blood flow (BF). Each characteristic is measured at 7 to 13 acquisition times between 11 and 95 seconds following contrast injection. Data from 16 individuals comprise a total of 25 regions containing pathology-verified metastases to the liver from neuroendocrine tumors (i.e., cancerous liver tissue), and a total of 27 regions consisting of non-cancerous liver tissue. PS and BF are rates measured as milliliters per minute per 100 grams of liver tissue (mL/min per 100g), whereas BV is a volume measured as milliliters per 100 grams of liver tissue (mL per 100g). Figure 1 displays the observed perfusion CT (CTp) curves along with Loess estimates of the average CTp curve for each characteristic in each tissue region. Characterization of the perfusion characteristics in cancerous tissue has implications for constructing biomarkers to assist in treatment monitoring, prognostication, and pathophysiological understanding of metastatic tumor vasculature. It is important that the acquisition duration cover the range of CTp curve instability; but once the curve has stabilized, the scan can be terminated. To limit radiation exposure and cost, the acquisition duration should be minimized (Ng et al., 2013). Critically, because the tissue type is unknown prior to diagnosis, any proposed acquisition period must ensure stable quantification of CTp characteristics for both types of tissue before CTp can be used for detection of metastatic sites. Though nonexchangeable, the perfusion CT data obtained in cancer-free liver tissue may inform the shape and stabilization time of the corresponding CTp curve in cancerous tissue. Figure 1 shows substantial heterogeneity in the shapes of the average CTp curves by tissue region for PS, whereas for BV and BF, the average CTp curves are more similar.

Fig 1.

Fig 1

Individual-by-region perfusion CT (CTp) curves are displayed by tissue region (“Individual CTp curves”) for permeability-surface area product (PS), blood volume (BV) and blood flow (BF), over acquisition durations between 11 and 95 seconds after contrast injection. The dots mark actual observations. The thick “Average CTp curve” is a Loess estimate that ignores potential clustering.

Our second application, from colorectal cancer, involves estimating progression-free survival (PFS) with data from two consecutive randomized phase III colorectal cancer trials, reported by Saltz et al. (2000) and Goldberg et al. (2004). Both trials used PFS to assess the efficacy of various treatment regimens for patients with previously untreated metastatic colorectal cancer; disease progression was defined as a 25% increase in measurable tumor size, presence of a new lesion, or death. The initial trial (Saltz et al., 2000) compared three treatment regimens: 5-Fluorouracil and Leucovorin, Irinotecan alone, and Irinotecan and bolus Fluorouracil plus Leucovorin (IFL). The results indicated that the IFL regimen was significantly more efficacious than the other two regimens, and IFL became the “standard of care” leading into the subsequent trial (Goldberg et al., 2004), which then compared an identical IFL regimen with two novel regimens: Oxaliplatin with infused Fluorouracil plus Leucovorin (FOLFOX), and Irinotecan with Oxaliplatin (IROX).

Figure 2 shows that the PFS curves for the IFL regime are commensurate in the Saltz and Goldberg trials, with the PFS curve in the Goldberg trial tracking just above that of the Saltz trial, though within the 95% CIs for nearly all of follow-up. Moreover, Figure 2 suggests that in the Goldberg trial FOLFOX is superior to both IROX (log-rank test p-value of 0.006) and IFL (log-rank test p-value <0.001), and that IFL and IROX perform similarly (log-rank test p-value of 0.404). The efficiency for estimating the PFS curve of the IFL regimen in the Goldberg trial may be improved by using nonexchangeable, yet relevant data on the IFL regimen in the Saltz trial.

Fig 2.

Fig 2

Kaplan-Meier estimates of PFS for the IFL regimen in the Saltz trial (along with 95% CIs), and the IFL, FOLFOX, and IROX regimes in the Goldberg trial.

The commonality between our two motivating examples is the availability of supplemental information about unknown curves that may aid our inference in the primary investigation (i.e., CTp curves among tissue types, and PFS curves of IFL among clinical trials). In Section 3 we investigate the borrowing properties via simulation of the proposed hierarchical model structure for nonparametric regression using penalized splines, and then we analyze the perfusion CT data. In Section 4 we analyze data from the colorectal cancer clinical trials using the proposed hierarchical structure for a semi-parametric hazard model. In Section 5 we close with a discussion and propose directions for future work.

2. Leveraging Supplemental Information

We restrict our attention to two Bayesian methods for leveraging supplemental information, power priors (Ibrahim and Chen, 2000) and commensurate priors (Hobbs et al., 2011). In general, Bayesian models consist of a likelihood for the data and prior specifications for the parameters in the likelihood (see, e.g., Carlin and Louis, 2009). For both these modeling approaches, the primary and supplemental likelihoods are assumed to have the same structure (e.g., both Gaussian), and we denote them by L(θ|D) and L(θ0|D0), respectively. As a result, the supplemental parameter θ0 is analogous to the primary parameter θ, and information about θ0 can be leveraged to aid posterior inference about θ.

2.1. Existing Methods

Power priors assume the sources of information are exchangeable (i.e., θθ0) and downweight the supplemental likelihood by raising it to a prespecified power a0 ∈ [0, 1]. This strategy works because raising the supplemental likelihood to the power a0 diffuses it. In the extreme case where a0 = 0, the downweighted supplemental likelihood is a noninformative constant. Formally, the posterior arises as

p(θD,D0)L(θD)L(θD0)a0π(θ). (1)

Alternatively, a0 can be modeled as another unknown parameter in the model (Ibrahim and Chen, 2000). Power priors are extremely general, so nothing explicitly prevents their use in any modeling context. However, they employ a single parameter (a0) to control between-source borrowing, thereby making differential borrowing among components of θ infeasible unless L(θ|D) has a convenient factorization, as it generally does not. This is undesirable because the supplemental data may provide relevant information for only a subset of θ. Furthermore, when a0 is assumed unknown, power priors tend to excessively downweight the supplemental likelihood, even when D0 and D are identical (Neelon and O’Malley, 2010).

Commensurate priors do not assume the sources of information are exchangeable (i.e., θθ0), and instead specify a hierarchical model where the posterior arises as

p(θ,θ0,ηD,D0)L(θD)L(θ0D0)π(θθ0,η)π(η)π(θ0). (2)

The commensurate prior, π(θ|θ0, η), “centers” θ about θ0, and η controls between-source borrowing to reflect the commensurability of the two parameters. In practice, π(θ|θ0, η) is defined so that E[θ|θ0, η] = θ0 and V ar[θ|θ0, η] is decreasing in η.

For a unidimensional real-valued parameter θ, a useful approach takes θ|θ0, η ~ Inline graphic (θ0, η−1), a Gaussian distribution with mean θ0 and precision η. Estimation of η is inherently difficult, but feasible by inducing sparsity over the precision domain using a “spike-and-slab” prior for η (Hobbs, Sargent and Carlin, 2012). The spike-and-slab prior has a mixture density

π(η)(1-p0)U(ηsl,su)+p0δR(η), (3)

where 0 ≤ sl < suInline graphic and p0 ∈ [0, 1] are pre-specified, and Inline graphic(η) is one at Inline graphic and zero otherwise. The distribution in (3) is locally uniform on the “slab,” (sl, su), with probability (1 − p0), and places probability mass p0 at the “spike,” Inline graphic, otherwise.

We prefer to modify the commensurate prior construction suggested by Hobbs, Sargent and Carlin (2012) as

θθ0,τ,ι~[N(θ0,τ-1)](1-ι)[N(θ0,R-1)]ι,ι~Bern(p0),andτ~U(sl,su), (4)

where Bern(p0) denotes a Bernoulli distribution with P r(ι = 1) = p0. Therefore, the commensurate prior in (3) induces a two-part mixture prior distribution on θ that consists of a highly concentrated component (i.e., Inline graphic (θ0, Inline graphic)) and a relatively diffuse component (i.e., Inline graphic (θ0, τ−1)), both centered at θ0. The model in (4) has parallels to Bayesian variable selection methods, in which the two-part mixture prior for a regression coefficient has one component heavily concentrated about zero and the other vague (see, e.g., George and McCulloch, 1997). Hence, the commensurate prior facilitates selective borrowing by inheriting the selective shrinkage property of the spike-and-slab distributions used in Bayesian variable selection. In particular, the commensurate prior strongly shrinks θ to θ0 when evidence indicates that this difference is small, thereby improving efficiency, and minimally affects θ otherwise, thereby limiting bias.

We specify the spike so that if θ| θ0 ~ Inline graphic(θ0, Inline graphic), θ deviates negligibly from θ0. On the other hand, we specify the slab to contain small values that correspond to modest shrinkage of θ toward θ0. Hence, a N(θ0,su-1) prior for θ will be weakly informative. The tails of the prior distribution in (4) are jointly controlled by sl and su: when sl ≈ 0, smaller values of su provide a prior with heavier tails. In our experience, posterior inference on θ is insensitive to modest shifts in these hyperparameters, so we suggest specifying sl near zero, with su small and Inline graphic large relative to the magnitude of a meaningful difference between θ and θ0. Posterior inference can be sensitive to the fourth hyperparameter, p0, the prior probability of effective equality between θ and θ0. We recommend choosing the value for p0 to deliver satisfactory operating characteristics at the anticipated sample size, such as mean-squared-error properties over a range of likely true differences between θ and θ0; see Murray et al. (2014).

Specifying a value for su that is too large can result in a model that is not robust to between-source heterogeneity, whereas specifying a value for Inline graphic that is too small will result in a model that gains little posterior precision for θ even when D0 and D are equivalent. Specifying too small a value for p0 may also result in a model that borrows little when D0D, whereas specifying too large a value for p0 may result in a model that borrows too much (resulting in unacceptable bias) in the presence of substantial between-source heterogeneity. Of course, p0’s influence on posterior inference diminishes as the primary sample size increases.

2.2. Generalized Mixture Commensurate Priors

The general model structure described in (2) can accommodate a latent precision parameter (component of η) for each component of θ. Therefore, the extent to which the supplemental data influence estimation of the primary effects can be allowed to differ among the components of θ, yielding flexibility and reducing bias. However, assuming mutual independence between the components of η allows strength to be borrowed from the supplemental source(s) differentially among the components of θ, reducing efficiency. Though this may be sensible, differential borrowing among the components of θ may not always be appropriate.

Suppose β = (β1,, βK) is a subset of θ that characterizes a feature in the primary data (e.g., the shape of a CTp curve), with an analogous definition for β0. We may wish to borrow from the supplemental data for β0 to a degree that reflects its coherence with β in its entirety, rather than case-by-case for each βk. Moreover, β (and β0) may conventionally be assigned a prior that induces smoothness. Ideally, (2) will inherit the smoothness properties of the conventional prior and simultaneously borrow similar amounts for related subgroups of θ (i.e., β). We propose to model

βkιk,βk,0~[π(βkβk,0)]1-ιk[π(βkβk,0)]ιk,ιkν~Bern(ν),ν~B(a1,a2),β0,k~π(β0,k) (5)

where Inline graphic(a1, a2) denotes a beta distribution with mean a1a1+a2 and the specifications for π* mirror a conventional analysis of these data. To facilitate borrowing of strength, we take π(βk0,k) in (5) to be heavily concentrated about β0,k; we provide exact specifications for each application later. To borrow similar amounts of strength for each component of β, we assume the ιk are identically distributed with P r(ιk = 1) = ν. Thus, the resulting prior is composed of K two-piece mixtures and generalizes the commensurate prior defined in (4). Hereafter, we refer to the general prior structure defined in (5) as a generalized mixture commensurate (GMC) prior.

3. GMC Priors in Nonparametric Regression Analysis

Suppose y = (y1,, yN) is a real-valued variable (e.g., log blood flow, as in Figure 1) that depends on a continuous covariate t = (t1,, tN) (e.g., time, as in Figure 1), and that we model yi ~ Inline graphic (ϕ(ti), σ2). Often primary interest lies in the shape, derivatives, or some other function of ϕ, and ϕ requires a smooth but flexible nonparametric specification, as opposed to a parametric linear specification ϕ(t) = β0 + β1t. Penalized splines are a practical choice for modeling ϕ, see Ruppert, Wand and Carroll (2003, Ch. 3 and Ch. 14). There are many ways to formulate a penalized spline. Low-rank thin-plate (LRTP) splines are appealing since they are defined by tractable radial basis functions, and tend to exhibit fast Markov chain Monte Carlo (MCMC) convergence properties in a Bayesian context relative to truncated basis splines (Crainiceanu, Ruppert and Wand, 2005). B-splines are another reasonable Bayesian option because they too tend to exhibit fast MCMC convergence, but they rely on a recursive algorithm to define the basis functions, making them less tractable than LRTP splines (Eilers and Marx, 1996). Our method will work with either LRTP or B-spline formulations for ϕ, and we do not anticipate substantial differences in the behavior of our method under either formulation. Hereafter, we focus on a LRTP cubic spline specification for ϕ.

Without loss of generality, we take t ∈ [0, 1] and model

ϕ(t;β)=β0+β1t+k=2Kβk(t-tk-13-tk-13), (6)

where = (0 = 0 < t̃1 << t̃K−1 < t̃K = 1) is a generic partition with K intervals. We discuss this choice later. Typically, a LRTP spline model is formulated as

ϕ(t;β)=β0+β1t+k=2Kβkt-tk-13; (7)

however, in (7), β0 is not an intercept because ϕ(0;β)=β0+k=2Kβktk-13. We prefer the modified LRTP (mLRTP) model defined in (6), because it simplifies interpretation and prior elicitation when the regression includes an intercept, which will be the case for the CTp data, and it retains an equivalent interpretation of (β1, …, βK) and ( β1,,βK). In the sequel, we denote β(−0) = (β1, …, βK), i.e., the vector β with β0 omitted.

A penalized spline requires pre-specification of a fine partition , and smooths via the prior whereas, a free-knot spline uses complex sampling algorithms (e.g., reversible-jump MCMC) to sample over all possible partitions. We prefer penalized splines, since they are computationally much simpler and have proven to be competitive with free-knot splines when the temporal variation of ϕ is smooth (Ruppert, Wand and Carroll, 2003, Section 3.16). Wand (2000) reviews methods for free-knot spline estimation; we do not consider this approach further. To select for a penalized spline, we consider a set of values for K with either equally-spaced or quantile-spaced k’s, and select the partition that has low posterior mean deviance and deviance information criterion (DIC) relative to the other partitions considered (Spiegelhalter et al., 2002).

Following Crainiceanu, Ruppert and Wand (2005), without supplemental data, we complete the Bayesian specification of (6) by placing vague priors on β0 and β1, and a N(0,σβ2Ω-) prior on (β2, …, βK)′, where the (j, k)th entry of Ω is defined as |j−1k−1|3, for j, k = 2, …, K. In practice, we apply the transformations b0 = β0, b1 = β1, (b2, …, bK)′ = Ω1/2(β2, …, βK)′, and σb2=σβ2, and then complete the model specification by assuming

π(bk)N(bk0,104),fork=0,1,π(bkσb)N(bk0,σb2)fork=2,,K,andπ(σb)U(σb0.01,100). (8)

The prior in (8) smooths ϕ by preferring values for (b2, …, bK) near zero a priori.

3.1. GMC Prior Specification

With supplemental data (y0 = (y0,1, …, y0,n0), t0 = (t0,1, …, t0,n0)), we also assume y0,i0t0,i0~N(ϕ0(t0,i0),σ02), for i0 = 1, …, n0. To that ensure ϕ and ϕ0 are analogous, we use the same partition for ϕ0, and model ϕ0(t; β0) using (6) by replacing β with β0. We apply the transformation b0,k = β0,k, for k = 0, 1, and (b0,2, …, b0,K)′= Ω1/2(β0,2, …, β0,K)′, and use the prior specifications π*(b0,k) ≡ Inline graphic (b0,k|0, 104), for k = 0, 1, π(b0,kσb0)N(b0,k0,σb02) for k = 2, …, K, and π*(σb0) ≡ Inline graphic(σb0|0.01, 100).

To borrow flexibly for the shape of ϕ from the supplemental data for ϕ0, we apply the GMC prior specification defined in (5) to b(−0) = (b1, …, bK) given b0,(−0) = (b0,1, …, b0,K). We assume

π(b(-0)b0,(-0),ι(-0),σb)=[π(b1)](1-ι1)[π(b1b0,1)]ι1k=2K[π(bkσb)](1-ιk)[π(bkb0,k)]ιk,

where ι(−0) = (ι1, …, ιK), and π*(b1), π*(bk|σb) and π*(σb) are defined in (8). We set π(bkb0,k)N(bkb0,k,Rb-1), for k = 1, …, K, where Inline graphic is a prespecified large value. We then specify ιkνiidBern(ν) and ν ~ Inline graphic(a1, a2) with fixed, known hyperparameters a1 and a2 that reflect our prior opinion about the relevance of the supplemental data for the shape of ϕ.

To allow differential borrowing for the intercept versus the shape of ϕ, we specify an independent commensurate prior distribution for b0 given b0,0 following (4). We define π(b0|b0,0, ι0, τ) ≡ [π*(b0|b0,0, τ)](1−ι0)[π(b0|b0,0)]ι0, where π*(b0|b0,0, τ) ≡ Inline graphic(b0|b0,0, τ−1) and π(b0|b0,0) = Inline graphic(b0|b0,0, Inline graphic). We then specify τ ~ Inline graphic(sl, su) and ι0 ~ Bern(p0), where sl, su, Inline graphic and p0 are prespecified following the guidance in Section 2. Taken together, the full posterior arises as

p(b,b0,σb,σb0,ι,ν,τD,D0)L(bD)L(b0D0)π(b0b0,0,ι0,τ)π(ι0)π(τ)π(b0,0)×π(b(-0)b0,(-0),ι(-0),σb)π(ι(-0)ν)π(ν)π(σb)π(b0,(-0)σb0)π(σb0). (9)

3.2. Simulation Assessment

We now investigate via simulation the borrowing properties of the GMC prior model in (9) for nonparametric regression. To do so, we sample yi|ti ~ Inline graphic{μ(ti), σ2}, i = 1, …, N, where μ(t) = 5t sin{5t}, and sample y0,i0|t0,i0, d~N{μ0(t0,i0d),σ02}, i0 = 1, …, N0, where μ0(t|d) = (5 + d)t sin((5 + d)t) and d ∈ [0, 5]. Hence, the primary data always have the same true mean structure, and the supplemental data have a mean structure that deviates from that of the primary according to the value of d, the discordance parameter. When d = 0, the two curves are the same; and as d increases, the supplemental curve has increasingly greater curvature than the primary curve. Figure 3 shows curves for selected values of d.

Fig 3.

Fig 3

Mean curves for the supplemental data for a subset of discordance parameter (d) values. The mean curve for the primary data is denoted by the solid line (i.e., d = 0).

In each run, we sample d uniformly from the set {0, 0.05, 0.10, 0.20, 0.35, 0.50, 0.75, 1, 1.50, 2, 3, 4, 5}. Given d, we then generate primary and supplemental datasets with N = N0 = 50, error variances σ2=σ02=1, and equally spaced t and t0 values over [0, 1]. For each dataset pair, (D = (y, t), D0 = (y0, t0)), we fit the GMC prior model developed in Section 3.1. We use the mLRTP model defined in (6) with K = 10 and spaced equally over [0, 1] for both ϕ0(t0; β0) and ϕ(t; β). After preliminary investigation, we set sl = 0, su = 2, Inline graphic = 2000, p0 = 0.50, and π(ν) = Inline graphic(ν|0.50, 0.50). The latter two choices represent an indifferent prior opinion about the relevance of the supplemental data for both the shape and the intercept of ϕ. We choose to place an extremely vague yet indifferent prior on ν to allow the primary and supplemental data to have substantial influence on whether to borrow for curve shape.

We used the R2jags package to call JAGS from R, and ran two chains for 20,000 iterations of burn-in, followed by 200,000 iterations for posterior estimation. We also fit the mLRTP model defined in (6) with prior specifications described in (8) to the primary data alone, as well as to the dataset obtained by simply pooling the primary and supplemental data. These models feature lower MCMC autocorrelation, and thus required only two chains with 2,000 iterations of burn-in, followed by 20,000 iterations for posterior estimation. These choices reflect preliminary investigations to ensure acceptable MCMC convergence and relatively small MC standard errors for the intercept and functional effect coefficients. Estimation of the GMC model took about 45 seconds, whereas estimation of each comparison model took about 5 seconds.

To evaluate the three models, we calculated four criteria at each run: mean error (ME), root-mean-square error (RMSE), mean pointwise credible interval width (CrIW), and mean pointwise coverage probability (CP). Alternatively, we could calculate simultaneous confidence bands following Krivobokova, Kneib and Claeskens (2010, Section 3). We define the four criteria as

ME(d(m))=N-1i=1N[ϕ^(tid(m))-μ(tid(m))],RMSE(d(m))={N-1i=1N[ϕ^(tid(m))-μ(tid(m))]2}12,CrIW(d(m))=N-1i=1N[ϕ0.975^(tid(m))-ϕ0.025^(tid(m))],CP(d(m))=N-1i=1NI{μ(tid(m))[ϕ0.025^(tid(m)),ϕ0.975^(tid(m))]},

for m = 1,, M, where ϕ̂(t|d(m)) denotes the posterior mean estimate for ϕ(t|d(m)), and ϕq^(td(m)) denotes the qth quantile posterior estimate of ϕ(t|d(m)) in the mth run. We then compare the sampling average of each criterion over the M simulated dataset pairs as a function of d, the discordance parameter. Specifically, we visually compare Loess estimates of each evaluation criteria as a function of the discordance parameter. One simulation iteration takes about 60 seconds, which entails generating a pair of datasets, fitting the three models, and then calculating and saving the evaluation criteria, along with the sampled value of d in that run. We reduced overall computation time using the snowfall package for R to conduct the simulation runs in parallel. A R program to implement this simulation is available through the third author’s software page http://www.biostat.umn.edu/~brad/software.html.

The results of our simulation investigation are illustrated in Figure 4. Each panel shows the Loess estimates of the sampling average of the corresponding evaluation criteria as a function of d for three models: the GMC prior model (solid), the conventional model fit to the primary data alone (dashed), and the conventional model fit to the pooled data (dot-dashed). As expected, inferential properties for the GMC prior model reflect those for the pooled approach under true concordance (i.e., d = 0), and approach primary data alone for increasing degrees of discordance. The primary-alone approach has the largest average CrIW and RMSE under true concordance, but its properties do not deteriorate for increasing degrees of discordance (because it does not acknowledge the supplemental data). By contrast, the pooling approach has the smallest CrIW and RMSE under true concordance, but its performance on all four evaluation criteria deteriorates substantially as discordance increases.

Fig 4.

Fig 4

Simulation assessment of the GMC prior spline model (solid) versus a conventional spline model fit to the primary data alone (dashed) and to the dataset obtained by pooling the supplemental and primary data (dot-dashed). These methods are compared on sampling averages of mean error (top left), root-mean-square error (top right), mean credible interval width (bottom right), and mean pointwise coverage probability (bottom left) as a function of d, the discordance parameter. All results are based on M = 2,000 runs.

The ME plot (top left panel) in Figure 4 demonstrates that the GMC prior model has bias properties that interpolate the two conventional approaches for near concordance, but near d = 1 it learns to effectively ignore the supplemental information. Furthermore, the RMSE (top right panel) shows that the GMC prior model learns to borrow strongly from the supplemental information when the sources are commensurate (i.e., slight discordance), achieving RMSE similar to that of the pooling approach. Similarly, the CrIW plot (bottom right panel) shows for slight discordance that the credible intervals from the GMC prior model are nearly as tight as those for the pooling approach. Conversely, when the supplemental curve has greater oscillation than the primary curve, the GMC prior approach has credible interval width similar to those of the primary-alone approach. Lastly, the CP plot (bottom left panel) shows that the mean pointwise coverage probabilities for the GMC prior model are similar to those of the conventional spline model fit to the primary data alone.

3.3. Application: Liver Imaging Study

We now apply our GMC prior model structure to estimate the temporal features of each perfusion characteristic in cancerous liver tissue. Recall that similar information derived from non-cancerous tissue is potentially valuable supplemental information because the shape of the average CTp curve in non-cancerous regions may provide relevant information about the shape of the corresponding average CTp curve in cancerous regions. The data consist of 7 to 13 readings acquired at times between 11 and 100 seconds after contrast injection in 0 to 2 cancerous and non-cancerous regions of interest (ROIs) among 16 individuals. For each perfusion characteristic, there are 687 total readings from fifteen individuals who each contribute Ni = 11–13 readings in MT,i = 1–2 cancerous and MN,i = 1–2 non-cancerous ROIs, and one individual who contributes 7 readings in 1 non-cancerous ROI.

Individual acquisition times ti,, ℓ = 1,, Ni, are necessarily identical for all ROIs and all perfusion characteristics. Thus, we let yr,i,j,, for j = 1,, Mr,i, denote a reading at time ti, . We denote the average CTp curve for a given perfusion characteristic in cancerous tissue and non-cancerous tissue by ϕT(t) and ϕN (t), respectively. We assume each individual’s CTp curve deviates from the average CTp curve smoothly over time, and we denote these deviation curves by ψr,i(t), r = T, N and i = 1,, 16. We then model

yr,i,j,~N{ϕr(ti,)+ψr,i(ti,),σe,r2}. (10)

We use the mLRTP model defined in (6) for ϕr and ψr,i, parametrized by βr and αr,i, respectively. Therefore, the first derivative of the average CTp curve is

ϕr(t;βr)=βr,1+k=2Ksign(t-tk-1)3βr,k(t-tk-1)2,r=N,T. (11)

To shrink the individual deviation curves ψr,i(t; αr,i) toward the average CTp curve ϕr(t; βr), we assume αr,i,k~N(0,σa,r,i2), for k = 0,…, K, ℓ = N, T and i = 1,…, 16.

For our conventional model, we use the prior specifications on βN and βT described in (8), and specify vague Inline graphic(0, 104) priors for bN,0 and bT,0. For the GMC prior model, we apply the prior developed in Section 3.1 for βT given βN. That is, we specify a GMC prior for βT,(0) = (βT,1,, βT,K) given βN,(0) = (βN,1,, βN,K) and an independent commensurate prior for bT,0 given bN,0. This choice enables differential borrowing from the non-cancerous tissue data for the intercept versus the curve shape parameters, and simultaneously enables borrowing similar amounts among curve shape parameters. Thus, if the average CTp curve in non-cancerous regions differs from the average CTp curve in cancerous regions only for the intercept, the model still permits borrowing for the CTp curve shape. Lastly, we place vague Inline graphic(0.01, 100) priors on each of the standard deviation parameters (σe,r, σb,r, σa,r,i).

We let t ∈ (0, 1] by taking t=t-tmintmax-tmin, where t* is the original timescale. Preliminary analysis of the non-cancerous regions was used to specify the hyperparameters Inline graphic = 500, sl = 0.01, su = 0.50, p0 = 0.10, a1 = 0.10 and a2 = 0.90. To select a partition, we fit our conventional model (i.e., independent spline models for the average CTp curve in each tissue type) for each perfusion parameter using K = 5, 10, 15, and 25 with equally spaced over [0, 1], and using K = 5, 10, and 15 with placed at equally spaced quantiles of the observed acquisition times. The model using K = 10 with quantile-spaced knots resulted in low deviance and low DIC relative to models with the other partitions for all three perfusion characteristics, so we chose to conduct our analysis using this partition. We estimated the conventional model using 40,000 posterior samples after 2,000 burn-in samples from two MCMC chains. We estimated the GMC model posterior using 200,000 posterior samples after 20,000 burn-in samples from two MCMC chains; there was greater autocorrelation for the basis coefficient parameters than the conventional model.

Table 1 reports the posterior mean borrowing parameters (i.e., ιk’s) from the GMC model for the three perfusion characteristics. Values near 1 indicate strong borrowing for the corresponding basis parameter. The first column reports the posterior mean for ι0, which corresponds to the intercept. PS and BF show virtually zero borrowing for the intercept, whereas BV shows little borrowing. The remaining columns illustrate borrowing for the basis parameters that control the shape (and thus, first derivative) of the CTp curve, which we have restricted to have similar magnitude by assuming ιk|ν ~ Bern(ν), for k = 1,, 10, a priori. For PS, these parameters all have posterior means less than 0.43, and many less than 0.25, which indicates little borrowing for CTp curve shape. In contrast, these parameters all exceed 0.82 for BV, and 0.92 for BF. Thus, for BF and BV the GMC model borrows substantially from the supplemental information in non-cancerous tissue for CTp curve shape in cancerous tissue.

Table 1.

Posterior borrowing parameter estimates (i.e., ιk’s). ι0 corresponds to the CTp curve intercept, and the remaining ιk’s correspond to CTp curve shape. Values near one indicate strong borrowing.

Perfusion Characteristic ι0 ι1 ι2 ι3 ι4 ι5 ι6 ι7 ι8 ι9 ι10
PS 0.00 0.23 0.00 0.00 0.24 0.43 0.24 0.05 0.15 0.03 0.34
BV 0.20 0.92 0.95 0.92 0.82 0.86 0.90 0.82 0.88 0.96 0.96
BF 0.02 0.96 0.98 0.92 0.92 0.93 0.94 0.92 0.96 0.98 0.99

The first row in Figure 5 indicates that both the shape and intercept of the PS curve differ substantially by tissue type and that the results of the conventional model analysis (first column) and the GMC prior model analysis (second column) are virtually indistinguishable. Consequently, the first derivative of the CTp curve in cancerous tissue (third column) is estimated with similar precision in either the conventional or GMC model analysis. In fact, the GMC prior model results in 95% pointwise credible intervals (CrIs) that are 4% wider on average than those of the conventional model analysis. Regardless, the PS CTp curve in cancerous tissue has not stabilized after 95 seconds, so longer acquisition durations for PS appear necessary, which is consistent with the findings by Ng et al. (2013).

Fig 5.

Fig 5

The first column displays posterior mean CTp curves from the conventional model analysis for cancerous (solid black) and non-cancerous regions (dotted grey), along with 95% pointwise CrIs. The second column displays the same results for the GMC model analysis. The third column compares the posterior mean first derivative of the CTp curve in cancerous tissue from the conventional model (solid black) and the GMC prior model (dotted grey), along with 95% pointwise CrIs.

The second row in Figure 5 corresponds to BV. Here the GMC prior model, relative to the conventional model, results in a non-cancerous CTp curve that is more similar for both the intercept and the shape to that of the cancerous tissue CTp curve. Moreover, the GMC model gains precision over the conventional model for the first derivative estimate of the cancerous tissue CTp curve via borrowing from the non-cancerous tissue CTp curve shape. The average 95% pointwise CrIs for the GMC model are 18% tighter than those of the conventional model. The third row corresponds to BF and illustrates that the posterior CTp curve estimates are similar for the two modeling approaches. However, the first derivative of the cancerous tissue CTp curve is estimated more precisely for the GMC model than the conventional model, resulting in 27% tighter 95% pointwise CrIs on average. For both BF and BV, we can infer a shorter acquisition duration using the GMC prior model than using the conventional model, although we did not impose precise stability criteria here.

4. GMC Priors in Semiparametric Survival Analysis

We now describe a flexible semiparametric survival model and then apply our GMC prior technology to the colorectal cancer clinical trials data described in Section 1. For simplicity, assume each observation consists of a possibly right-censored time t ∈ (0, 1], a binary event indicator c, and a binary treatment indicator z.

4.1. Piecewise-Exponential Proportional-Hazards Model

A flexible survival model commonly favored by Bayesians is the piecewise-exponential proportional-hazards model, which is constructed by partitioning the time-axis into K intervals = (0 = 0 < 1 <… < K−1 < K = 1) and assuming the baseline hazard is constant in each interval (cf. Ibrahim, Chen and Sinha, 2001, Section 3.1). Under this model, the likelihood for an observation (t, c, z) is given by h(tz;γ,ρ)cexp{-0th(sz;γ,ρ)ds}, where

h(tz;γ,ρ)=exp(γk+ρz)fortIk=(tk-1,tk],k=1,,K. (12)

Thus, h(t|z; γ, ρ) is assumed to be piecewise-constant, where γk denotes the log-hazard in the kth interval of the time-axis partition for treatment assignment z = 0, and ρ denotes the log-hazard-ratio for treatment assignment z = 1 relative to z = 0. Following Ibrahim, Chen and Sinha (2001), we select K using a DIC comparison over a small set of partitions. To resist overfitting, we specify a correlated prior process for π*(γ). We focus on the random-walk prior process for π*(γ) introduced by Fahrmeir and Lang (2001). Formally, we specify

π(γ1)N(γ10,104),π(γkγk-1,σγ)N(γkγk-1,σγ2),fork=2,,K,andπ(σγ)U(σγ0.01,100). (13)

The prior process in (13) smooths adjacent γk’s toward each other by assuming their first differences are exchangeable. We also assume ρ is independent of γ a priori, and specify a vague Inline graphic(0, 104) prior. The model defined in (12) is useful in the absence of supplemental data, and the parameter space can be decomposed into separable subsets, γ characterizes the baseline hazard and ρ characterizes the treatment effect.

4.2. Application: Colorectal Cancer Trials

Extending Hobbs, Carlin and Sargent (2013), we apply our GMC prior model structure to supplement the inference on the progression-free survival (PFS) curve among all three regimens in the trial reported by Goldberg et al. (2004) using the historical information on the IFL regimen from the trial reported by Saltz et al. (2000). We use the piecewise-exponential proportional-hazards model to estimate the PFS curves for the two trials from these data. During the first two years of follow-up, the primary (Goldberg) dataset contains 197 progression events among 211 persons treated with IFL, 190 events among 216 persons treated with FOLFOX, and 189 events among 206 persons treated with IROX. For the primary data, we have two binary treatment indicators for assignment to the FOLFOX (zF) and IROX (zI) regimens, respectively, so following (12) we model

h(tz;γ,ρ)=exp(γk+ρFzF+ρIzI)fort(tk-1,tk],

where k = 1, …, K. The supplemental (Saltz) data contain 172 progression events during the first two years of follow-up among 224 persons treated with IFL. The model for the supplemental data is defined analogously, though zF = zI = 0 for all the supplemental observations. Therefore, the supplemental hazard model is completely parametrized by γ0, which, as required, is analogous to γ.

We specify a GMC prior to flexibly borrow strength from the supplemental information on the IFL regimen. Namely, we specify the random-walk prior process defined by (13) for π*(γ0), and apply the GMC prior structure on γ. In this setting, the prior specification follows as γ1|γ0,1, ι1~[N(γ10,104)]1-ι1[N(γ1γ0,1,Rγ-1)]ι1, γk|γk−1, γ0,k, ιk, σγ~[N(γkγk-1,σγ2)]1-ιk[N(γkγ0,k,Rγ-1)]ιk, for k = 2, …, K, and ιk|νγ ~ Bern(νγ), for k = 1, …, K, where Inline graphic is pre-specified. Next, we place a Inline graphic(a1, a2) prior on νγ, where a1 and a2 are also pre-specified. Lastly, we place vague Inline graphic(0.01, 100) priors on each of the standard deviation parameters (i.e., σγ and σγ0), and vague Inline graphic(0, 104) priors on the treatment effect parameters (i.e., ρF and ρI).

We transformed the timescale so that t ∈ (0, 1], then selected Inline graphic = 10000 and specified a vague Inline graphic(0.10, 0.90) prior for νγ, which represents a vague, yet skeptical prior opinion about the relevance of the supplemental data. For estimation, we used 200,000 posterior samples for estimation after 50,000 iterations of burn-in from two MCMC chains. For comparison, we also fit conventional piecewise-exponential proportional-hazards models with the same time-axis partition and random-walk prior process to the primary (Goldberg) data alone, supplemental (Saltz) data alone, and the dataset obtained by naively pooling the two sources of information. To estimate these conventional models, we used 20,000 posterior estimation draws after 2,000 burn-in draws from two MCMC chains, again the shorter chain length justified by faster convergence and lower post-convergence autocorrelations.

We begin with a comparison of the estimated PFS curves for the IFL regimen from the four analyses. The left panel in Figure 6 indicates that the estimated PFS curve from the Goldberg data (solid, with shaded grey 95% pointwise CrIs) overlaps substantially with the PFS curve estimated from the Saltz data (dashed, with dotted 95% pointwise CrIs), suggesting the Saltz data provide relevant information. The middle panel in Figure 6 illustrates that the GMC analysis (solid, with shaded grey 95% pointwise CrIs) results in a PFS estimate nearly indistinguishable from that of the conventional analysis of the pooled dataset (dashed, with dotted 95% pointwise CrIs), though shifted very slightly toward the primary information. The GMC prior model achieved a PFS curve estimate with noticeably greater precision than that of the conventional analysis that ignores the supplemental data without requiring bold a priori assumptions regarding the relevance of the Saltz data. Finally, the rightmost panel in Figure 6 displays a posterior histogram of νγ that has much of its mass shifted toward one. In fact, the ιk’s had posterior means of at least 0.87, with an average of 0.94, indicating that the GMC prior model has learned that the supplemental FOLFOX data from the Saltz trial are indeed relevant.

Fig 6.

Fig 6

The left panel displays the posterior mean PFS curve for the IFL regimen along with 95% pointwise CrIs from the conventional piecewise-constant-hazard analysis of the primary (Goldberg) data alone and the supplemental (Saltz) data alone. The middle panel displays the same results from the GMC prior model analysis and the conventional model analysis of the dataset obtained by pooling the two sources of information. The third panel displays a posterior histogram for νγ, which controls borrowing for the baseline hazard among trials. For reference, the prior assigned to νγ is also displayed (grey line).

Turning to the comparative effectiveness of three regimens, Table 2 shows that the hazard-ratios (95% CrIs) from the GMC prior model analysis for FOLFOX and IROX versus IFL are 0.66 (0.56,0.79), and 0.87 (0.73,1.04), respectively. The hazard-ratios from the conventional analysis of the Goldberg data alone are slightly larger with notably wider CrI widths, 0.70 (0.57,0.86) for FOLFOX and 0.92 (0.75,1.12) for IROX. By contrast, the hazard-ratio estimates and CrI widths from the conventional analysis of the pooled dataset are indistinguishable from the GMC prior model analysis. Table 2 also contains the posterior estimates from each analysis of median PFS for the IFL, FOLFOX, and IROX regimens. The conventional analysis of the Saltz data alone estimates median PFS in the IFL regimen to be 200 (168, 236) days, broadly similar to the estimate of 205 (182, 227) days provided by the conventional analysis of the Goldberg data alone. The estimates of median days to disease progression for each regimen from the pooled analysis of these data are also nearly identical to the estimates from the GMC prior analysis, namely, 200, 265 and 221 days for the IFL, FOLFOX and IROX regimens, respectively. The estimates from the GMC prior model are consistent with the results reported by Goldberg et al. (2004), who found FOLFOX to be the superior regimen, and significantly better than IFL. In addition, the GMC prior analysis yields stronger evidence that IROX may be better than IFL for PFS, though the difference remains statistically insignificant.

Table 2.

Hazard-ratio estimates for the FOLFOX and IROX regimens relative to the IFL regimen, and estimates of median days to disease progression for each treatment regimen from the four analyses of the Goldberg data and Saltz data.

Drug Regimen IFL FOLFOX IROX
Analysis Hazard Ratios (95% CrIs)

Goldberg Alone 0.70 (0.57, 0.86) 0.92 (0.75, 1.12)
Pooled Analysis 0.66 (0.56, 0.79) 0.87 (0.73, 1.04)
GMC Prior Analysis 0.66 (0.56, 0.79) 0.87 (0.73, 1.04)

Median Days to Disease Progression (95% CrIs)

Saltz Alone 200 (168, 236)
Goldberg Alone 205 (182, 227) 262 (235, 293) 217 (194, 242)
Pooled Analysis 200 (182, 217) 263 (245, 286) 221 (201, 240)
GMC Prior Analysis 200 (184, 216) 265 (245, 286) 221 (203, 240)

5. Discussion and Future Work

Our proposed methods for prior specification in functional and survival data models with penalized splines facilitate data-dependent borrowing that is robust to biased estimation of primary effects when conflict among information sources occurs. The simulation study illustrates the beneficial flexible borrowing properties the proposed methods offer. The application in perfusion CT illustrates potential gains in CTp curve estimation from using supplemental data collected concurrently. By contrast, the colorectal cancer application illustrates the use of these methods for semiparametric survival modeling, to flexibly borrow from supplemental data on a control therapy collected in a previous trial. A hierarchical model with the proposed structure allows the degree of borrowing to be estimated differentially for each feature (e.g., CTp curve intercept versus shape). The amount of strength being borrowed between sources reflects the evidence of commensurability for that feature. When substantial evidence indicates that sources differ for a feature, the proposed method will learn to effectively ignore the supplemental data for that feature, yet possibly still borrow strength for another feature. For minor discordance or concordance, the proposed method also facilitates partial (rather than full) pooling of information from the supplemental source; hence a modest yet justifiable gain in efficiency. Our general modeling strategy enables data-driven estimation of heterogeneity among information sources for complex functional relationships, and thus provides a sensible and justifiable synthesis of clinical information.

In future work, we plan to extend the commensurate prior approach to settings that have multiple supplemental sources of information. This setting substantially complicates the construction of a hierarchical model that facilitates flexible borrowing for each supplemental source. Ideally, the method will facilitate data-dependent differential borrowing from various sources, learning from which to borrow and which to ignore as primary data accumulate. We also hope to develop concise, interpretable summaries that quantify the amount of strength being borrowed from each information source, thereby allowing the use of these models in an adaptive trial (see Hobbs, Carlin and Sargent, 2013). We will also consider using posterior summaries of ν in (5) to determine whether the curves are commensurate among information sources, e.g., by assessing Pr(ν > 0.80|D, D0) > 0.90. Furthermore, we are currently studying the properties of the piecewise-exponential model used in the colorectal cancer application, which relies on a simple, yet flexible piecewise-constant assumption for the baseline hazard. We are developing extensions that use a piecewise-linear model for the baseline hazard function, relax the proportional-hazards assumption, and allow functional covariate effects with shape constraints (e.g., monotonicity).

Acknowledgments

We thank the Associate Editor and Referees for their well thought-out and detailed comments during revision that substantially improved the clarity of presentation in this manuscript and the model we use in the CTp analysis.

References

  1. Carlin BP, Louis TA. Bayesian Methods for Data Analysis, 3rd edition. Boca-Raton, FL: Chapman & Hall/CRC Press; 2009. [Google Scholar]
  2. Crainiceanu CM, Ruppert D, Wand MP. Bayesian Analysis for Penalized Spline Regression Using WinBUGS. Journal of Statistical Software. 2005;14:1–24. [Google Scholar]
  3. Doi SAR, Barendregt JJ, Mozurkewich EL. Meta-analysis of heterogeneous clinical trials: An empirical example. Contemporary Clinical Trials. 2011;32:288–298. doi: 10.1016/j.cct.2010.12.006. [DOI] [PubMed] [Google Scholar]
  4. Eilers PHC, Marx BD. Flexible smoothing with B-splines and penalties. Statistical Science. 1996;11:89–121. [Google Scholar]
  5. Fahrmeir L, Lang S. Bayesian inference for generalized additive mixed models based on Markov random field priors. Journal of the Royal Statistical Society: Series C (Applied Statistics) 2001;50:201–220. [Google Scholar]
  6. George EI, McCulloch RE. Approaches for Bayesian Variable Selection. Statistica Sinica. 1997;7:339–373. [Google Scholar]
  7. Goldberg RM, Sargent DJ, Morton RF, Fuchs CS, Ramanathan RK, Williamson SK, Findlay BP, Pitot HC, Alberts SR. A randomized controlled trial of fluorouracil plus leucovorin, irinotecan, and oxaliplatin combinations in patients with previously untreated metastatic colorectal cancer. Journal of Clincal Oncology. 2004;22:23–30. doi: 10.1200/JCO.2004.09.046. [DOI] [PubMed] [Google Scholar]
  8. Hobbs BP, Carlin BP, Sargent DJ. Adaptive Adjustment of the Randomization Ratio Using Historical Control Data. Clinical Trials. 2013;10:430–440. doi: 10.1177/1740774513483934. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Hobbs BP, Sargent DJ, Carlin BP. Commensurate Priors for Incorporating Historical Information in Clinical Trials Using General and Generalized Linear Models. Bayesian Analysis. 2012;7:639–674. doi: 10.1214/12-BA722. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Hobbs BP, Carlin BP, Mandrekar SJ, Sargent DJ. Hierarchical Commensurate and Power Prior Models for Adaptive Incorporation of Historical Information in Clinical Trials. Biometrics. 2011;67:1047–1056. doi: 10.1111/j.1541-0420.2011.01564.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Ibrahim JG, Chen M-H. Power prior distributions for regression models. Statistical Science. 2000;15:46–60. [Google Scholar]
  12. Ibrahim JG, Chen M-H, Sinha D. Bayesian Survival Analysis. New York: Springer; 2001. [Google Scholar]
  13. Krivobokova T, Kneib T, Claeskens G. Simultaneous confidence bands for penalized spline estimators. Journal of the American Statistical Association. 2010;105:852–863. [Google Scholar]
  14. Miles KA, Griffiths MR. Perfusion CT: a worthwhile enhancement? The British Journal of Radiology. 2003;76:220–231. doi: 10.1259/bjr/13564625. [DOI] [PubMed] [Google Scholar]
  15. Murray TA, Hobbs BP, Lystig TC, Carlin BP. Semiparametric Bayesian commensurate survival model for post-market medical device surveillance with non-exchangeable historical data. Biometrics. 2014;70:185–191. doi: 10.1111/biom.12115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Neelon B, O’Malley AJ. Bayesian Analysis Using Power Priors with Application to Pediatric Quality of Care. Journal of Biometrics and Biostatistics. 2010;1:103. [Google Scholar]
  17. Ng CS, Hobbs BP, Chandler AG, Anderson EF, Herron DH, Charnsangavej C, Yao J. Metastases to the Liver from Neuroendocrine Tumors: Effect of Duration of Scan Acquisition on CT Perfusion Values. Radiology. 2013;269:758–767. doi: 10.1148/radiol.13122708. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Pocock SJ. The combination of randomized and historical controls in clinical trials. Journal of Chronic Diseases. 1976;29:175–188. doi: 10.1016/0021-9681(76)90044-8. [DOI] [PubMed] [Google Scholar]
  19. Ruppert D, Wand M, Carroll R. Semiparametric Regression. New York: Cambridge University Press; 2003. [Google Scholar]
  20. Saltz LB, Cox JV, Blanke C, Rosen LS, Fehrenbacher L, Moore MJ, Maroun JA, Ack-land SP, Locker PK, Pirotta N, Elfring GL, Miller LL. Irinotecan plus fluorouracil and leucovorin for metastatic colorectal cancer. New England Journal of Medicine. 2000;343:905–914. doi: 10.1056/NEJM200009283431302. [DOI] [PubMed] [Google Scholar]
  21. Smith TC, Spiegelhalter DJ, Thomas A. Bayesian approaches to random-effects meta-analysis: A comparative study. Statistics in Medicine. 1995;14:2685–2699. doi: 10.1002/sim.4780142408. [DOI] [PubMed] [Google Scholar]
  22. Spiegelhalter DJ, Best NG, Carlin BP, Van Der Linde A. Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2002;64:583–639. [Google Scholar]
  23. Wand MP. A comparison of regression spline smoothing procedures. Computational Statistics. 2000;15:443–462. [Google Scholar]

RESOURCES