Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Oct 15.
Published in final edited form as: Stat Med. 2016 Jun 20;35(23):4077–4092. doi: 10.1002/sim.7005

Measuring the Individual Benefit of a Medical or Behavioral Treatment Using Generalized Linear Mixed-Effects Models

Francisco J Diaz 1
PMCID: PMC5012921  NIHMSID: NIHMS791220  PMID: 27323698

Abstract

We propose statistical definitions of the individual benefit of a medical or behavioral treatment and of the severity of a chronic illness. These definitions are used to develop a graphical method that can be used by statisticians and clinicians in the data analysis of clinical trials from the perspective of personalized medicine. The method focuses on assessing and comparing individual effects of treatments rather than average effects, and can be used with continuous and discrete responses, including dichotomous and count responses. The method is based on new developments in generalized linear mixed-effects models, which are introduced in this article. To illustrate, analyses of data from the STAR*D clinical trial of sequences of treatments for depression and data from a clinical trial of respiratory treatments are presented. The estimation of individual benefits is also explained.

Keywords: Chronic diseases, clinical trials, disease severity, empirical Bayesian prediction, generalized linear mixed-effects models, random effects linear models

1. Introduction

Health professionals and statisticians involved in personalized medicine (PM) for chronic disorders face many challenges, including the need to develop methods for assessing individual variations in clinical responses to medical or behavioral treatments (MBTs) [1]. Despite the fact that the individual benefit of a treatment is a central concept in PM, however, the analysis of data from clinical trials (CTs) usually focuses on average treatment effects. Although researchers may conduct subgroup and interaction analyses secondarily, they are usually limited by lack of power and by the difficulty in knowing or measuring all relevant variables. Moreover, current statistical theory and practice lack a statistical or mathematical definition of individual benefit and methods specifically designed for operationally measuring this important concept.

The main objectives of this article are to propose a statistical definition of the individual benefit of an MBT administered to a patient with a chronic disorder and to suggest a graphical method for assisting statisticians and clinicians in the data analysis of CTs from a PM perspective. Our method focuses on the analysis of individual rather than average effects of treatments. The definition and the graphical method that it implies are based on new developments in generalized linear mixed-effects models (GLMMs), which are described here and have not been published before. Our method can be used with continuous and discrete responses, including dichotomous and count responses. The estimation of individual benefits is also described.

The development of personalized medicine (PM) is spurred by two commonly accepted tenets: (1) that some MBTs may be detrimental rather than beneficial to some people, and (2) that the degrees of benefit provided by the same treatment may vary in different people [1]. These tenets are not always demonstrated empirically, however, and in many cases the evidence presented to support them is questionable [2, 3]. This is because researchers often overlook the importance of variance components in the design and interpretation of clinical trials from a PM frame of reference [3]. For instance, researchers frequently are unaware of the impossibility of detecting subject-by-treatment interactions when they do not take subject-level repeated measures within and across treatments or do not analyze them appropriately. This, despite the fact that such interaction is a central concept of PM [2, 3, 4]. As a result, repeated measures are underutilized in clinical research, and parallel-group designs are still the most common ones in placebo-controlled or comparative-effectiveness studies.

Some authors have proposed graphical methods for demonstrating the existence of differential treatment effects not necessarily explained by measured variables. For cross-over designs with two treatments and four periods, Senn [3] plots the difference between the second pair of responses against that of the first pair. A high correlation between differences suggests a strong element of personal response. In the same design, Schall et al. [5] plot studentized residuals based on differences between the averages of measures from the two treatments against a subject identifier. An outlying point indicates a strong subject-by-formulation interaction. In meta-analyses, several authors have graphed mortality or event rates from study arms under treatment against those from control arms [6, 7, 8]. They aver differential treatment effects if the treatment-control relationship is not parallel to the identity line. This latter approach, however, may be hampered by regression to the mean and ecological fallacy since it is based on aggregated data [9]. Thus, conclusions may not hold true at the individual level.

Boissel et al. [8] defines a treatment benefit as the difference in the rate of the disease outcome between untreated and treated subjects. Using this definition and a Cox regression model of time to cardiovascular events with fixed regression coefficients, Li et al. [10] computes confidence intervals for the individual benefit of some drugs based on the subject's prognostic factors. The prediction, though, does not take into account the potential variability of regression coefficients in the subject population.

GLMMs are useful tools for the analysis of repeated measures [11, 12]. Considerable research suggests that regression models with random effects (REs) can be used to establish a solid paradigm for the construction of the mathematics and statistics of PM research and practice, especially in the treatment of chronic diseases [4, 13-18]. The fact that GLMMs have concepts that allow describing patient populations as a whole (the fixed effects) and, simultaneously, concepts that allow describing patients as individuals (the REs) suggests that these models contain the key ideas for providing PM with a rigorous mathematical language [4, 16-19]. Underlying this is the belief that the variability of a random coefficient is not just a mathematical artifact to control for patients' heterogeneity but also the result of real variation in the biological and environmental factors that have made humans develop as individuals [2, 4, 16, 17, 20]. Moreover, solid empirical and theoretical work by the Sheiner School of Pharmacology [13-15] and others [17, 18, 21] shows that a combination of mixed models with empirical Bayesian feedback (EBF) can be employed successfully in pharmacotherapy individualization and that EBF is well anchored in standard decision theory. Thus, both biological and mathematical arguments support the development of methodological instruments for PM based on GLMMs.

In Section 2, we present the overarching statistical models of medical or psychological care for chronic disorders guiding this article. One can view this section as a useful description of GLMMs from a PM viewpoint in the context of clinical trials. Section 3 contains the main ideas of this article and introduces statistical definitions of the benefit function of an MBT and the illness severity function. It also has graphical examples of how these functions represent many clinical phenomena. The benefit function allows modeling the benefit of an MBT as it varies with (as a function of) the patient or subject. To illustrate how to use our graphical tools in practice, we analyze data from clinical trials of treatments for depression (Section 4) and respiratory illnesses (Section 5). Section 6 presents an approach to the estimation of an individual benefit. Stata code is in Appendices 1 and 2 (Tables 1-2) and in the online material (Table 3 and figures).

Table 1.

Random effects linear model of HAM-D17 scores from 170 subjects refractory to Citalopram monotherapy who received either Bupropion or Venlafaxine as alternative treatments. The effects of Bupropion and Venlafaxine are measured with respect to observations obtained before administering these treatments.a

Parameter name Parameter estimate P-value 95% CI
Fixed effects
 Black raceb 3.54 0.001 [1.48, 5.59]
 BUPc −10.0 <0.001 [−11.3, −8.71]
 VENd −9.85 <0.001 [−10.9, −8.82]
 Interaction Black race*BUP 5.24 0.002 [1.89, 8.58]
 Intercept 17.0 <0.001 [16.1, 17.9]
Variances of random effects
 BUP 10.6 -- [3.54, 31.8]
 VEN 8.49 -- [2.77, 26.0]
 Intercept 20.9 -- [14.9, 29.4]
Covariances between random effectse
 Cov(BUP, Intercept) −8.23 0.035 [−15.9, −0.59]
 Cov(VEN, Intercept) −8.06 0.022 [−15.0, −1.15]
Residual variance 19.9 -- [16.7, 23.6]

CI: Confidence interval; BUP: Bupropion; VEN: Venlafaxine.

a

Before fitting the models, a few inconsistencies in the STAR*D data were found and corrected by the author. These inconsistencies and corrections were documented in a text file that is available from the author on request.

b

The dichotomous variable was defined as 1 if the subject was self-declared black or African American, 0 otherwise.

c

The dichotomous variable was defined as 1 if the HAM-D17 score was obtained under BUP treatment, or 0 if it was obtained at baseline or under VEN treatment.

d

The dichotomous variable was defined as 1 if the HAM-D17 score was obtained under VEN treatment, or 0 if it was obtained at baseline or under BUP treatment.

e

Since subjects on BUP and VEN constituted two independent samples, the covariance between the random effects of BUP and VEN was fixed to 0 during parameter estimation.

Table 2.

Random intercept logistic regression model of good respiratory status (N=56). The effects of Drug A and Placebo are measured with respect to baseline respiratory status.a

Parameter name Parameter estimate P-value 95% CI
Fixed effects
 Age −0.073 0.05 [−0.15, 0.0004]
 Drug Ab 1.81 0.002 [0.65, 2.98]
 Placeboc 0.49 0.39 [−0.63, 1.62]
 Intercept 0.65 0.60 [−1.76, 3.05]
Variance of intercept 7.93 [3.55, 17.7]

CI: Confidence interval.

a

The dichotomous respiratory status was defined as 1 if the subject had a good respiratory status (status 3 or 4), 0 otherwise.

b

The dichotomous variable was defined as 1 if the respiratory status was obtained under Drug A, or 0 if it was obtained at baseline or under Placebo.

c

The dichotomous variable was defined as 1 if the respiratory status was obtained under Placebo, or 0 if it was obtained at baseline or under Drug A.

Table 3.

Empirical Bayes predictors and 95% credibility intervals for the benefit of bupropion to specific non-black patients. Estimation of the benefit for a particular patient requires knowing the patient's average baseline HAM-D17 score (Ȳ0,ω), the average response score (ȲQ,ω), and the numbers of repeated measures (k0,ω and k1,ω). Benefits were multiplied by 100.

(A) k0,ω = 1
ȲQ,ω =3 ȲQ,ω =10 ȲQ,ω =20 ȲQ,ω =30
Ȳ0,ω =30 k1,ω = 1 39.6 (7.9, 81.2) 20.4 (2.2, 63.9) 5.2 (0.3, 32.0) 0.8 (0.0, 10.4)
k1,ω = 2 51.8 (16.8, 84.7) 21.8 (3.7, 58.5) 2.4 (0.1, 16.7) 0.1 (0.0, 1.6)
k1,ω = 3 58.4 (25.0, 85.6) 22.4 (4.9, 55.0) 1.5 (0.1, 10.4) 0.0 (0.0, 0.4)
Ȳ0,ω =20 k1,ω = 1 55.4 (15.6, 86.8) 34.3 (5.8, 75.3) 11.4 (0.9, 48.5) 2.3 (0.1, 19.9)
k1,ω = 2 61.8 (25.5, 87.6) 31.5 (7.0, 68.8) 5.0 (0.3, 25.4) 0.2 (−0.0, 3.4)
k1,ω = 3 65.4 (32.4, 87.6) 29.9 (7.7, 63.1) 2.7 (0.2, 15.1) 0.0 (−0.0, 0.7)
Ȳ0,ω =10 k1,ω = 1 59.7 (20.5, 86.1) 44.5 (9.0, 78.7) 19.7 (0.9, 59.4) 4.9 (−1.3, 31.7)
k1,ω = 2 62.0 (24.9, 86.1) 38.7 (8.6, 72.1) 7.8 (−0.8, 34.3) 0.4 (−1.6, 6.0)
k1,ω = 3 63.0 (28.2, 85.7) 34.7 (8.4, 66.6) 4.0 (−1.7, 20.6) 0.0 (−1.1, 1.3)
(B) k0,ω = 2
ȲQ,ω =3 ȲQ,ω =10 ȲQ,ω =20 ȲQ,ω =30
Ȳ0,ω =30 k1,ω = 1 30.8 (5.3, 73.3) 15.3 (1.5, 54.8) 3.9 (0.2, 25.5) 0.6 (0.0, 8.3)
k1,ω = 2 44.1 (12.9, 79.7) 17.6 (2.8, 51.9) 2.0 (0.1, 14.1) 0.1 (0.0, 1.4)
k1,ω = 3 52.0 (20.5, 81.9) 19.0 (4.0, 49.6) 1.3 (0.1, 9.1) 0.0 (0.0, 0.4)
Ȳ0,ω =20 k1,ω = 1 53.6 (15.2, 86.6) 33.5 (6.0, 74.6) 12.1 (1.1, 48.6) 2.8 (0.1, 21.2)
k1,ω = 2 60.8 (24.7, 87.5) 31.1 (7.2, 68.2) 5.5 (0.4, 26.1) 0.3 (−0.0, 4.1)
k1,ω = 3 64.8 (31.7, 87.7) 29.8 (7.9, 62.6) 3.1 (0.2, 15.9) 0.1 (−0.1, 0.9)
Ȳ0,ω =10 k1,ω = 1 59.8 (21.2, 84.9) 46.3 (9.0, 78.7) 22.9 (−1.0, 61.6) 6.5 (−5.5, 37.0)
k1,ω = 2 61.3 (24.9, 84.9) 39.8 (7.6, 72.1) 8.9 (−5.0, 37.5) 0.2 (−6.4, 7.8)
k1,ω = 3 61.9 (28.0, 84.5) 35.3 (6.8, 66.5) 4.1 (−6.7, 22.6) -0.1 (-5.2, 1.5)

2. Personalized medicine models

Let Y be a (possibly transformed) variable measuring or indicating some aspect of the state of a particular chronic disorder of a subject (or patient). Assume the goal of an MBT in a CT or clinical practice is to modify the value of Y. Examples of Y are: 1) a dichotomous indicator of whether or not a child with autism has appropriate expressive language skills; 2) the positive or negative subscales of the PANSS syndrome scale, which assess schizophrenia severity; 3) blood glucose or cholesterol concentrations, which partially assess the metabolic syndrome; 4) the 17-item Hamilton Rating Scale for Depression (HAM-D17); or 5) the total number of abstemious days (NADs) in a month, which assesses chronic alcoholism. For some of these measurements, the MBT's goal is usually to reduce the value of the measurement to the minimum possible value (e.g. PANSS, HAM-D17), or to increase it to the maximum possible value (e.g. NADs). But for some others, the goal is that the value lies inside a prespecified range or set (e.g. glucose, cholesterol concentration, dichotomous language indicator).

Let Q symbolize a particular MBT or sequence of MBTs. Before a particular subject ω is administered Q, a baseline value of Y is measured k0, ω times. A measured baseline value of Y is denoted by Y0,ω,i and assumed to satisfy

E[Y0,ω,i|Λω]=g1(Λω),i=1,,k0,ω, (1)

where Λω is a constant number that characterizes subject ω and g is a known link function. Λω is viewed as a constant number representing the state of the disease of subject ω before the MBT is started. But, Λω varies from subject to subject, that is, Λω is a particular realization of a random variable Λ′ whose distribution is completely determined by variability in the subject population. It is additionally assumed the conditional distribution of Y0,ω,i given Λ′ belongs to an exponential family ℱ. For instance, ℱ may be the family of normal, Poisson or Bernoulli distributions.

After administering Q to subject ω and the subject's response is stabilized, Y is measured k1,ω times. Each measure is denoted by YQ,ω,j and it is assumed there is a number βQ,ω that satisfies

E[YQ,ω,j|Λω,βQ,ω]=g1(Λω+βQ,ω),j=1,,k1,ω. (2)

We stress that both Λω and βQ,ω are constant numbers in subject ω (they do not change during the clinical trial). We also assume

Λω=αω+λTXω, (3)

where Xω is a vector of baseline clinical, environmental, genetic, biological or demographic covariates, λ has the same value for all subjects, but αω is a characteristic constant of subject ω which varies from subject to subject. Thus, αω is a realization of a population-level random variable α.

In some subject populations, a covariate vector Zω moderates the effect of Q. This vector may contain some of the covariates in Xω. In these cases, we additionally assume

βQ,ω=γω+θTZω, (4)

where θ has the same value for all subjects, but γω is a characteristic constant of subject ω which varies from subject to subject. That is, γω is a realization of a population-level random variable γ. It is possible to assume that λ, θ or both vary from subject to subject, but the simpler model that assumes fixed λ and θ will suffice to introduce our ideas.

Definition 2.1. If βQ,ω has the same value for all subjects, that is, if βQ,ω does not depend on ω, then Eqs. (1)-(4) together are called a one-dimensional personalized medicine (1-PM) model of treatment Q. And if βQ,ω varies from subject to subject, then Eqs. (1)-(4) are a two-dimensional personalized medicine (2-PM) model of treatment Q. Here, βQ,ω is viewed as a particular realization of a random variable βQ that reflects variation at the subject population level. In the 1-PM model, βQ is a constant. In both models, we additionally assume the conditional distribution of YQ,ω,j given (Λ′, βQ) belongs to ℱ.

For estimation purposes, we followed the classical formulation of GLMMs and assumed that Λ′ in the 1-PM model, or (Λ′, βQ) in the 2-PM model, are normally distributed given Xω and Zω. This assumption, however, is not essential for our proposed approach and can be relaxed if robust estimation methods are used (for instance, the methods in Wang et al. [22]).

In contrast to the traditional interpretation of random parameters presented in the literature of random-effects linear models (RELM) or GLMMs, the randomness of Λ′ (or α), or that of βQ (or γ) in a 2-PM model, are not viewed here as the result of a random selection of subjects for a CT, or of a randomization of subjects to experimental MBTs, but the result of real-life variation in the biological, environmental and clinical dimensions that shape subjects' individualities [2, 4, 16, 18]. That is, Λ′ (and βQ in the 2-PM model) are unobserved but real dimensions with probability distributions that reflect the underpinnings of the observed clinical phenomenon. The actual values of these dimensions in a particular individual, that is, Λω and βQ,ω can be predicted using empirical Bayes estimators which are also called best linear unbiased predictors (BLUPs) in the context of RELMs [11, 12]. Note also that the conditional expectations E[ · | Λω] and E[ · | Λω, βQ,ω] can be interpreted as expected values “given subject ω”.

3. Severity and benefit functions

This section proposes a statistical definition of the benefit function of an MBT or sequence of MBTs, under the general assumption of a 1-PM or a 2-PM model. This function describes how the benefit varies depending on the subject (or patient) and enables statisticians and clinicians to interpret some results obtained with GLMMs from a PM perspective. Here, Λω and (Λω, βQ,ω) are called the basal trait and trait vector of subject ω. We write Λ and βQ in place of Λω and βQ,ω to refer to a generic subject from the population of subjects. We interpret (Λ, βQ) as “a patient”. In 1-PM models, we also interpret Λ as “a patient” since βQ is a population constant in those models. We write Y0 and YQ in place of Y0,ω,i and YQ,ω,j to refer to generic values of Y measured before or under Q, respectively.

Suppose the therapy's goal is that the subjects achieve the state Y ∈ 𝒯 ⊆ ℝ. The set 𝒯 is the therapeutic target. The basal severity function (SF) s0 is a function with domain ℝ defined as s0(Λ) = P(Y0 ∈ 𝒯C | Λ), Λ ∈ ℝ. For a subject with basal trait Λ, s0(Λ) is called the subject's basal disease severity. When y, y* and δ ≥ 0 are fixed numbers, the targets, [y, ∞), (−∞, y] and [y* −δ, y* + δ] are called an incremental, a decremental, and a closed target, respectively, and y* is a target response.

Now let Q be a particular MBT (or sequence of MBTs). If (Λ, βQ) is the trait vector of a particular subject, the post-treatment severity of the subject's disease is defined as P(YQ ∈ 𝒯C |Λ, βQ). Also, under a 1-PM model, the post-treatment severity function s1 is a function with domain ℝ defined as s1(Λ; Q) = P(YQ ∈ 𝒯C |Λ, βQ), Λ ∈ ℝ. Alternatively, under a 2-PM model, the post-treatment severity function s2 is defined as s2(Λ, βQ) = P(YQ ∈ 𝒯C |Λ, βQ), (Λ, βQ) ∈ ℝ2 (the domain is ℝ2).

For fixed Λ, s0(Λ) is interpreted as the severity of the disease of a subject with basal trait Λ = Λω before the subject is administered any MBT. An incremental target occurs, for instance, in a treatment for chronic alcoholism with the goal that the patient achieves at least y = 26 abstemious days in a month. Closed targets are implemented in treatments for diabetes, at which, for instance, glucose levels between 70-100 mg/dL before meals are the treatment goal. And a decremental target is implemented in a treatment for major depression aiming at reducing the HAM-D17 score; for instance, aiming at a score ≤ y = 7. A closed target with δ = 0 is of interest only with discrete responses. This case occurs particularly with dichotomous responses, for which reaching one of the two values of Y is the therapy goal.

For a treatment Q represented by a 1-PM model, the ideal situation is that s1(Λ; Q) < s0(Λ) for all Λ. That is, Q should reduce all subjects' illness severity. However, it is possible that, for some treatments, s1(Λ; Q) ≥ s0(Λ) for some or all Λ (see Examples 3.1 and 3.2). Thus, our general formulation allows modeling MBTs that may be ineffective or even detrimental for some subjects.

Definition 3.1. Under a 1-PM model of treatment Q, the benefit function (BF) b1 of Q is defined by b1(Λ; Q) = s0(Λ) – s1(Λ; Q), Λ ∈ ℝ. Alternatively, under a 2-PM model, the BF b2 of Q is defined by b2(Λ, βQ) = s0(Λ) – s2(Λ, βQ), (Λ, βQ) ∈ ℝ2.

In practice, BFs are useful in the comparison of the benefits of two or more MBTs to a particular patient. Examples of SFs and BFs for 1-PM models with Gaussian response and identity link are provided below. Sections 4 and 5 present data analyses using a 2-PM model with Gaussian response and identity link and a 1-PM model with Bernoulli response and logit link.

Example 3.1. (Incremental targets in 1-PM models with Gaussian response and identity link). With a Gaussian response and identity link, the equations of a PM model for a subject ω can be written as

Y0,ω,i=Λω+ε0,ω,iandYQ,ω,j=Λω+βQ,ω+εω,j,

where ε0,ω,j is a random variable with mean 0 which represents the variation of Y about Λω that is caused by interoccasion variability in either the subject's body or external environment or by measurement errors. Similarly, εω,j has mean 0 and represents variation of Y about Λω + βQ,ω which may have the same causes as those of the variation of ε0,ω,i. Thus, for an incremental target and fixed y ∈ ℝ,

s0(Λ)=Φ(yΛσε)andb1(Λ;Q)=Φ(yΛσε)Φ(yΛβQσε),ΛR,

where Φ is the standard normal cdf, and σε and σε are the standard deviations of ε0,ω,i and εω,j, respectively. Next we explain the graphical interpretation of these functions from a PM perspective, using y = 1 and σε=σε=1. Figure 1 (A) shows s0. For incremental targets, s0(Λ) is a decreasing function of Λ since, by Eq. (1), untreated subjects with large Λω tend to have higher values of Y0,ω,i which reflect a less severe illness. Figure 1 (B) illustrates function b1(·; Q1) for a treatment Q1 with βQ1 = 3. Here, b1(Λ; Q1) > 0 for all Λ. However, in general, the illness severity is not reduced by Q1 by the same amount in all subjects. For instance, the probability that Y ∈ (−∞, 1] is reduced by 0.5 or more only in subjects with basal trait Λ in [−2, 1]. In contrast, the effectiveness of Q1 in subjects for whom Λ < −5 is very small or unimportant because b1(Λ; Q1) ≈ 0 in these subjects. Figure 1 (C) depicts b1(·; Q2) for a treatment Q2 with βQ2 = −2. It illustrates that, with an incremental target, b1(Λ; Q) ≤ 0 for all Λ if and only if βQ ≤ 0. Therefore, the MBT is detrimental or ineffective for all subjects when βQ ≤ 0; that is, in such cases, the treatment undesirably increases the illness severity rather than decreasing it.

Figure 1.

Figure 1

Illustration of a basal severity function and benefit functions for a 1-PM model with a Gaussian response with identity link, using an incremental therapeutic target with y = 1 and σε=σε=1. (A) Basal severity function. (B) Benefit function for βQ1 = 3. (C) Benefit function for βQ2 = −2. (D) Superimposition of the functions in A and B and another benefit function for which βQ3 = 1.7.

Figure 1 (D) superimposes 3 functions: the functions s0(·) and b1(·; Q1) shown in (A) and (B), and b1(·; Q3) for a treatment Q3 with βQ3 = 1.7. For subjects with basal severities s0(Λ) ≤ 0.6, Q1 and Q3 are both very effective in that subjects' post-treatment severities reach values close to 0, which is indicated by the fact that, for these subjects, s0(Λ) ≈ b1(Λ; Q1) ≈ b1(Λ; Q3). But, for subjects with Λ satisfying 0.6 < s0(Λ) < 0.87, only Q1 is able to produce post-treatment severities close to 0. Finally, for subjects with basal severities > 0.87 but not too close to 1, treatment Q1 may be substantially more beneficial than Q3, although neither treatment will reduce completely these subjects' illness severity.

Figure 1 (D) illustrates the potential of BFs as tools that may help clinicians to compare and personalize MBTs. For instance, if treatment Q3 is cheaper or expected to produce fewer side effects than Q1, then there is no doubt that Q3 is a better choice than Q1 for patients with severities ≤ 0.6. Empirical Bayesian estimation of the value of Λω for a particular patient will help make decisions once 1-PM model parameters are estimated with a training patient sample. This estimation of Λω is made using baseline values Y0,ω,i provided by the patient.

Observe that SFs and BFs allow modeling two well-known facts in clinical practice when incremental or decremental targets are used: (1) that two MBTs may have essentially the same benefits in a large group of patients, even if the treatments' effects (the βQ's) on the outcome variable Y are different; and (2) that even the best treatments may fail with very severe illnesses, that is, in patients for whom s0(Λ) ≈ 1.

BF and basal SF under a decremental target and a 2-PM model are illustrated in Section 4. With incremental or decremental targets, the value of y does not affect the graphs' shapes of BFs and basal SFs. It affects only the graphs' positions on the horizontal axis. So, if the goal is to compare two MBTs in the same patient population, the actual value of y used is immaterial. Any clinical decision regarding therapy personalization is invariant with respect to the value of y, provided patients are identified with their basal illness severities, not with their values of Λ. To illustrate, all the above conclusions about treatments Q1 and Q3 obtained from Figure 1 (D) are valid regardless of the value of y chosen.

Example 3.2. (Closed targets in 1-PM models with Gaussian response and identity link). More complex clinical phenomena occur when the therapeutic target is closed. In this case, for δ > 0,

s0(Λ)=1Φ(y+δΛσε)+Φ(yδΛσε),ΛR,and (5)
b1(Λ;Q)=Φ(yδΛσε)Φ(y+δΛσε)+Φ(y+δΛβQσε)Φ(yδΛβQσε),ΛR. (6)

Figure 2 shows examples of the functions in (5) and (6) for σε=σε=1 and therapeutic goals represented by y* = 1 and δ = 1.64. Figure 2 (A) shows the basal severity function s0. For a closed target, there exists a Λ at which s0(Λ) reaches its minimum value. This Λ is denoted as Λm = argminΛ∈ℝ s0(Λ). In Figure 2 (A), Λm = 1 and s0m) = 0.1. In this case, there are no subjects with a basal severity lower than 0.1. Figure 2 (B) shows the benefit function b1(·; Q1) for a treatment Q1 with βQ1 = 3. Here, Q1 is beneficial for subjects with Λ < −0.5, but detrimental if Λ > −0.5. That is, under treatment Q1, the health of subjects with Λ > −0.5, which is measured by Y, worsens. For subjects with Λ = −0.5, Q1 does not produce any change in their illness severity. A somewhat opposite situation occurs for a treatment Q2 for which βQ2 = −2, whose BF is depicted in Figure 2 (C); in this case, Q2 benefits only subjects with Λ > 2.

Figure 2.

Figure 2

Figure 2

Illustration of a basal severity function and benefit functions for a 1-PM model with a Gaussian response with identity link, using a closed target with y* = 1, δ = 1.64 and σε=σε=1. (A) Basal severity function. (B) Benefit function for βQ1 = 3. (C) Benefit function for βQ2 = −2. (D) Superimposition of the functions in A, B and C.

(E) Superimposition of the functions in A and B and an additional benefit function with βQ3 = 1.7.

Figure 2 (D) superimposes the functions in A, B and C. It shows that Q1 benefits only a subpopulation of subjects for whom Λ < Λm, and Q2 benefits only a subpopulation of subjects for whom Λ > Λm, because −0.5 < Λm = 1 < 2. However, subjects with basal severities very close to 1 obtain meager benefits from both treatments.

Figure 2 (E) superimposes the functions s0(·) in A, b1(·; Q1) in B, and an additional benefit function b1(·; Q3) with βQ3 = 1.7. From this figure, the following can be said about subjects for whom Λ < Λm = 1: 1) subjects with basal severities < 0.22 will obtain undesirable effects (negative benefits) from both Q1 and Q3; 2) those with basal severities between 0.22 and 0.45 will obtain positive benefits from Q3 but not from Q1; 3) those with basal severities between 0.45 and 0.76 will obtain positive benefits from both Q1 and Q3, but Q3 is superior to Q1 (i.e. Q3 produces a larger reduction of the illness severity); and 4) Q1 is more beneficial than Q3 when administered to subjects with severities > 0.76, but the effects of Q1 and Q3 are negligible in subjects with basal severities that are very close to 1.

In general, the shapes of the functions in (5) and (6) do not depend on y*, although they depend on δ. In particular, the lowest basal severity s0m) does not depend on y* but depends on δ. In contrast, Λm depends on y* but does not depend on δ.

Importantly, with a closed target and a 1-PM model with Gaussian response and identity link, if treatments Q1 and Q2 have different average effects on the illness (that is, if βQ1βQ2), there are always subjects for whom treatment Q1 is better than Q2 and subjects for whom Q2 is better than Q1. Also, in general, for any treatment Q, if βQ > 0, then Q is detrimental to subjects with trait Λ > y* − βQ/2 (that is, b1(Λ; Q) < 0) but beneficial to the other subjects, although Q may not have the same detrimental or beneficial effect in two different subjects. And if βQ < 0, then Q is detrimental for subjects with trait Λ < y* −βQ/2 but beneficial otherwise. In particular, if βQ > 0, Q does not benefit subjects with Λ > Λm; and if βQ < 0, Q does not benefit subjects with Λ < Λm.

Thus, many well-known clinical phenomena occurring under a closed target can be modeled using BFs and SFs. For instance, that MBTs with higher statistical effect sizes on Y are not necessarily better for all patients. In general, for two different MBTs Q and Q′, even if βQ > βQ′ > 0, there may be a subset of patients with Λ < Λm for whom Q′ is more effective than Q. Moreover, for any treatment Q, there may be patients with a refractory illness for whom Q is neither beneficial nor detrimental, even if |βQ| is very large. These are the patients with very large values of |Λ|, for whom b1(Λ; Q) is close to 0. (See Figure 2.)

To further illustrate SFs' and BFs' versatility, note that, with a closed target and Gaussian response and identity link, and appropriate values of σε, σε and δ, it is possible to build basal severities for which s0m) is close to 0 in order to model subjects with low basal severities. Also, values of βQ representing MBTs that reduce the illness severity to almost 0 in subjects with low basal severities can easily be exhibited. In summary, SFs and BFs provide a mathematical language that allows describing many phenomena observed in actual clinical settings.

4. Application to clinical trial of antidepressants

The STAR*D study investigated 4,041 adult subjects with non-psychotic major depressive disorder who were treated in outpatient settings [23]. The primary purpose of the study was to determine which alternative treatments work best if Citalopram (CIT) monotherapy does not produce an acceptable response. The study examined a large number of MBTs and MBT sequences and combined both standard randomization procedures and a detailed protocol with a naturalistic approach that attempted to mimic real-life clinical practice. Here, we focus only on 170 subjects who satisfied the following criteria: (1) the subjects did not have an acceptable response to CIT therapy but received either Bupropion (BUP, N=69) or Venlafaxine (VEN, N=101) as alternative treatments in the next study phase; and (2) the subjects were declared to have a successful response to these alternative treatments and were consequently assigned to follow-up, in which the subjects continued under the alternative treatment. Thus, each of these subjects was considered to have benefited from BUP or VEN. We want to compare the amounts of benefit BUP and VEN provided.

Here, Y is the HAM-D17 total depression score. It produced values from 0 to 42 inclusive, and all integer numbers in this range were observed in the patient population. Thus, we treated the HAM-D17 score as a continuous variable, as most statisticians would do in practice. Each subject provided at most two baseline measures: Y0,ω,1, which was obtained right before CIT therapy started, and Y0,ω,2, obtained at the end of CIT therapy. Thus, k0,ω = 0, 1 or 2. Since CIT therapy was not considered successful in these subjects, we assumed Y0,ω,1 and Y0,ω,2 were repeated measures of the same dimension Y0. Similarly, each subject provided at most two measures of Y under the alternative treatment: YQ,ω,1, which was obtained right before the subject passed to follow-up, and YQ,ω,2, obtained after about three months of follow-up, where Q = BUP or VEN. Thus, k1,ω = 0, 1 or 2. In total, the 170 subjects provided 598 HAM-D17 scores for this analysis. Each subject provided one to four scores, and the average number of scores provided was 3.5. An assumption of missingness at random was made to include subjects who did not provide all four values of Y.

2-PM models for BUP and VEN were combined into a GLMM model, assuming a Gaussian response, identity link and independent homoscedastic errors (Table 1). Stata's meglm command was used to fit the model (StataCorp LP, College Station, TX, USA) (Appendix 1). Residual analyses showed both that the model fitted well and that the assumption of normality for the untransformed scores was reasonable. The model included a random intercept, and random effects for BUP and VEN treatments. An unstructured covariance matrix for these 3 random coefficients was estimated, except that the covariance between BUP and VEN was set to 0 because the two treatments were administered to independent samples. The database had one row per each available HAM-D17 score, and two dummy variables representing treatments were computed, namely XBUP and XVEN. The variable XBUP was coded as 1 if the score was obtained under BUP, or 0 if the score was obtained at baseline or under VEN. Analogously, XVEN was coded as 1 if the score was obtained under VEN, or 0 if the score was obtained at baseline or under BUP. In Table 1, baseline is the “reference point” with respect to which the effects of BUP and VEN are assessed. The effects of age, gender, race, smoking, alcohol and use of illegal drugs were examined as possible patient covariates, but the final model included only black race (Table 1). The black race variable was coded as 1 if the subject was self-declared black or African American, 0 otherwise.

There was a significant interaction between BUP treatment and black race (Table 1). Using the estimates in Table 1, we computed the benefit functions in Figure 3. As in the STAR*D protocol, we used HAM-D17 ≤ 7 as an indicator of remission, which is a decremental target with y = 7. With a decremental target, we have 𝒯C = (y, ∞). Then, the formulas for basal SF and for BF under decremental targets in 2-PM models with Gaussian response and identity link are respectively

Figure 3.

Figure 3

Comparison of benefit functions of Bupropion and Venlafaxine in patients refractory to Citalopram treatment. (A) Black patients. (B) Non-black patients.

s0(Λ)=1Φ(yΛσε),ΛR,andb2(Λ,βQ)=Φ(yΛβQσε)Φ(yΛσε),(Λ,βQ)R. (7)

Figure 3 shows s0, and profiles of b2 for different values of βQ, for blacks (A) and non-blacks (B). In Figure 3 (A), b2(Λ, βQ) is plotted as a function of only Λ for four different values of βQ, namely

βQ,1=E^[βBUP],βQ,2=E^[βVEN],βQ,3=E^[βBUP]1×(V^(βBUP))1/2andβQ,4=E^[βVEN]+1×(V^(βBUP))1/2,

where, Ê[βBup] = −10.0 + 5.24, Ê[βVEN] = −9.85, (βBUP) = 10.6, and (βVEN) = 8.49. Each of these four curves represents a subpopulation of patients. For instance, the curve corresponding to βQ,1 represents black subjects for whom BUP had an average effect; the curve for βQ,3 represents black subjects for whom BUP had an effect one standard deviation lower than BUP average effect; and so on.

Similarly, in Figure 3 (B), in addition to the curves for βQ,i, i = 1, …, 4, we also plotted curves with

βQ,5=E^[βBUP]+1.5×(V^(βBUP))1/2andβQ,6=E^[βVEN]1.5×(V^(βVEN))1/2.

However, for non-black subjects, Ê[βBUP] = −10.0.

Figure 3 also shows vertical grid lines representing subjects with different values of the basal trait Λ. For instance, the number −2 in the top horizontal axis represents the value of Λ given by Ê[Λ] − 2 × ((Λ))1/2, where (Λ) = 20.9, and Ê[Λ] = 17.0 + 3.54 for blacks and Ê[Λ] = 17.0 for non-blacks (Table 1). In this case, the vertical line −2 represents subjects for whom Λ is two standard deviations lower than the average Λ of their race group. Similarly, the vertical line with 0 at the top corresponds to the average Λ of the corresponding race group (17.0 + 3.54 for blacks, 17.0 for non-blacks).

Here, the basal severity is interpreted as the probability that a particular subject with basal trait Λ has a HAM-D17 score ≤ 7 at a particular moment before taking BUP or VEN, and the benefit of a treatment for that subject is the reduction of this severity in probability units.

In black subjects, the average effect of BUP (estimated as −10.0 + 5.24) was statistically different from the average effect of VEN (−9.85) (Wald χ2 = 9.7, df = 1, p = 0.002). Comparing the curves for βQ,1 and βQ,2 in blacks (Figure 3A), we observe that, in general, after controlling for basal illness severity, average black subjects benefitted substantially more from VEN than BUP; that is, fixing the value of Λ, the curve for βQ,2 is substantially higher than that for βQ,1, especially in subjects whose Λ was between −2 and −1 standard deviations from the average Λ. However, when comparing the curves for βQ,3 and βQ,4, we observe that there were subpopulations of black subjects who obtained similar or more benefit from BUP than from VEN (the curve for βQ,3 is higher than that for βQ,4).

In non-black subjects, the average effect of BUP was not statistically different from the average effect of VEN (−10 versus −9.85) (p = 0.82). In Figure 3 (B), we observe that, after controlling for basal severity, average non-black subjects obtained the same benefit from BUP and VEN, which can be inferred from the fact that the curves for βQ,1 and βQ,2 are hardly distinguishable. However, we can exhibit subpopulations of non-black subjects for whom BUP provided substantially more benefit than VEN (compare the curves for βQ,3 and βQ,4), as well as subpopulations for whom VEN provided substantially more benefit than BUP (compare the curves for βQ,5 and βQ,6). These subpopulations are predicted to exist because V(βBUP) and V(βVEN) are estimated to be larger than 0 (for BUP, the estimate is 10.6 [3.54, 31.8]; for VEN, 8.49 [2.77, 26.0] ; Table 1).

Interestingly, both black and non-black subjects with Λ higher than the average Λ of their race had a very severe illness before trying BUP or VEN. This can be inferred from the fact that the basal severities in both Figures 3 (A) and (B) are almost flat and close to 1 at the right of the vertical lines with 0 at the top. However, the curves for βQ,2 show that the maximum benefit of VEN was 0.2 for these black subjects (Figure 3A), but 0.44 for the non-black subjects (Figure 3B). In general, VEN benefitted more non-black than black severely ill subjects with comparable values of βVEN. This illustrates the important point that differences in the basal trait Λ may contribute to differences in the benefit of a treatment between two populations, not only to the basal severity.

5. Application to clinical trial of respiratory treatment

Next, we analyze data from a clinical trial comparing the effects of a drug (called Drug A) versus Placebo on a chronic respiratory condition [24]. Here, only data from Center 1 is analyzed. Fifty-six subjects were randomized to either Drug A (N=27) or Placebo (N=29). The dichotomous response Y was coded as 1 if the subject had a good respiratory status (status 3 or 4), or 0 otherwise. Only one baseline (pre-treatment) measure was available from each subject: Y0,ω,1. (Of the 56 subjects, only seven had a good baseline respiratory status.) Four repeated measures were obtained during treatment: YQ,ω,j, j = 1, …, 4, where Q = Drug A or Placebo.

1-PM models for Drug A and Placebo were combined into a random intercept logistic regression model of Y, using Stata's meglm command (Table 2; Appendix 2). Each subject occupied 5 rows of the database, one per each value of Y. Two dummy variables were computed and used as independent variables: XA, which was coded as 1 if Y was obtained under Drug A, or 0 if Y was obtained at baseline or under Placebo; and XP, coded as 1 if Y was obtained under Placebo, or 0 if Y was obtained at baseline or under Drug A.

Since the therapeutic target is to achieve a good respiratory status (i.e. 𝒯 = {1}) and we are modeling a Bernoulli response with a logit link, the basal SF and BF are respectively

s0(Λ)=1(1+exp(Λ))1andb1(Λ;Q)=(1+exp(ΛβQ))1(1+exp(Λ))1,ΛR,

where βQ = 1.81 if Q = Drug A, or βQ = 0.49 if Q = Placebo (Table 2).

Figure 4 depicts s0(·), and b1(·; Q) for both Q = Drug A and Q = Placebo. A particular value on the Λ axis can be viewed as a composite of a particular subject's age and unobserved variables that also shaped the subject's individuality. That is, one value on the Λ axis represents a particular individual. Here, the basal severity for a particular subject is interpreted as the probability that he/she had a poor respiratory status at a particular time point before treatment started.

Figure 4.

Figure 4

Benefit functions for Drug A and Placebo in a clinical trial aiming at improving respiratory status. The vertical lines mark average values of Λ for subjects of particular ages. The illustrated ages are the youngest age in the patient sample (11), the oldest (63), the median age (28), and the 10% and 90% percentiles (15 and 47).

The vertical lines on Figure 4 mark average values of Λ for particular ages observed in the patient sample (the youngest and oldest observed ages were 11 and 63 years old). For instance, the predicted average value of Λ for subjects of age 28 was 0.65 − 0.073 × 28 = −1.39. Thus, for an average subject of age 28, the basal severity was 0.8, and the benefit this subject obtained from Drug A was 0.4. Thus, Drug A reduced by half the probability that this subject exhibited a poor respiratory condition.

Observe that subjects of age 47 or older tended to have a very severe illness, with basal severities > 0.9. For these subjects, Drug A provided a very modest benefit, reducing their illness severity just by 0.2 or less probability units. In contrast, children of age 15 or younger tended to have basal severities < 0.6, and Drug A reduced their illness severity by about 0.4 probability units. Interestingly, Drug A provided a similar benefit for subjects of age between 11 and the observed median age (28). In these particular subjects, Drug A reduced the illness severity by about 0.4. In general, for subjects of age >28, older age was not only associated with increased disease severity, but also with a reduced benefit from Drug A. Note the above conclusions cannot be easily inferred by just browsing the parameter estimates in Table 2. This reinforces our claim that benefit functions help understand the clinical phenomenon and provide valuable additional information. In fact, the above complex interaction between age and Drug A could not be detected by including an interaction term in the model.

6. Estimation of the benefit to an individual patient

To illustrate how our approach translates into clinical practice, we examine how well a non-black patient with depression responded to bupropion using empirical Bayesian estimation of treatment benefit. The patient can be a STAR*D subject or a new patient. The empirical Bayes predictor and the credibility interval for the benefit to a patient ω depend on the averages Y¯0,ω=(i=1k0,ωY0,ω,i)/k0,ω and Y¯Q,ω=(j=1k1,ωYQ,ω,j)/k1,ω, the amount of data collected from the patient (k0,ω and k1,ω), covariate values, and the estimates of population parameters (Appendix 3). For instance, Table 3 shows that if bupropion lowers the patient's HAM-D17 average score from 20 to 10, and k0,ω = k1, ω = 2, then the estimated bupropion benefit is 31.5% [95% credibility interval, (7.0, 68.8)].

In Table 3, estimated benefits increase from right to left. Indeed, larger score reductions from baseline reflect higher bupropion benefits, and the benefit scale is sensitive to score deterioration—the larger the deterioration, the lower our degree of belief that the treatment is beneficial. Also, other things being equal, the greater the k1,ω the shorter the interval length and, therefore, the higher the precision of the benefit estimate. An analogous trend is observed for k0,ω.

7. Discussion

In this paper, we propose definitions for the severity functions of chronic diseases and the benefit functions of MBTs. We also describe a graphical method for examining the severity of a chronic disease and the individual benefit of an MBT. BFs can be a good addition to the statistician's tool box. In fact, rather than just reporting that the effects of two or more MBTs on an outcome variable are significantly different, the statistician may provide a better picture of the clinical reality by reporting the treatments' BFs along the basal SF and by describing how the benefits of the treatments differ in specific patient subpopulations. In particular, reporting that the effects of two MBTs are significantly different is not enough since an average difference does not necessarily imply that the two MBTs will provide substantially dissimilar epidemiological benefits to a specific patient subpopulation although they can provide differing benefits across the patients of other subpopulations.

Our methodology allows demonstrating graphically the existence of subjects with differential responses to a treatment, comparing the benefits of a variety of treatments, and quantifying the effectiveness of treatments in specific patients. It utilizes repeated measures to separate from noise the components of variability that are relevant to PM; i.e., between-subject variability and subject-by formulation interaction variability [2, 3]. Since conclusions based on our methodology are model-dependent, however, we must carefully evaluate the goodness-of-fit of the model before drawing conclusions.

In 1-PM models, the effect βQ of an MBT on Y is the same for all patients; whereas, in 2-PM models, it differs across patients. This may wrongly suggest that only 2-PM models are of interest in PM, whose main tenet is that some MBTs may act differentially across patients. As Figure 1 suggests, however, the fact that an MBT has the same effect in all patients does not imply that the MBT benefits all patients in the same fashion. (Actually, most statisticians seem to believe incorrectly that same effect size implies same MBT benefits.) Furthermore, since the basal trait Λ is not only shaped by covariates but also by other unobserved factors represented by α, the presence of the same effect in all patients does not imply similar benefits for two patients with the same covariate values. Thus, 1-PM models and their associated BFs also allow modeling important clinical phenomena.

Empirical Bayes predictors of Λ and βQ can be used to estimate both the basal severity of a specific patient and the benefit that the patient obtained from a treatment. To estimate parameters of 1-PM and 2-PM models, we can use standard software for fitting GLMMs (or RELMs in the case of Gaussian response with identity link). For example, we can use SAS GLIMMIX or MIXED procedures (SAS Institute Inc, Cary, NC, USA) or Stata's meglm or mixed commands (StataCorp LP, College Station, TX, USA). These computer packages can easily provide empirical Bayes estimators.

The potential for using REs models as tools for separating genetic and environmental causes of human and animal responses was openly recognized by some of the first researchers who explored these models' mathematics and applications [25-27]. In fact, empirical and theoretical studies suggest a working hypothesis that can be named The Mixed-Model Hypothesis: The random coefficients of a mixed-regression model of a pharmacokinetic or pharmacodynamic response incorporate all the variability of the response that patients' uncontrolled genetic heterogeneity causes as well as some inter-individual variability that other factors generate. In contrast, the variability of the error term of the model does not reflect genetic variability across patients [16]. In other words, the total variability of random coefficients is an “upper bound” of the genetic variability that was not controlled for by model covariates [2, 4, 28]. Thus, statisticians and clinicians involved in PM research must not overlook the potential of the concept of REs to develop statistical tools for PM.

Supplementary Material

Supp Fig 3 color
Supp Fig S1
Supp Fig S2
Supp Fig S3
Supp Fig S4
Supp Instruction
Supp Material
Supp Preben
Supp Preben.sthlp

Acknowledgments

The STAR*D study was supported by NIMH Contract #N01MH90003, and has the ClinicalTrials.gov Identifier, NCT00021528. The Author was supported by an Institutional Clinical and Translational Science Award, NIH/NCATS Grant Number UL1TR000001 (awarded to the University of Kansas Medical Center). The author did not receive any salary or payment from the STAR*D grant or its investigators. This manuscript may not reflect the opinions or views of the STAR*D study investigators or the NIH. The author thanks two reviewers whose suggestions led to substantial improvements to the paper. The author also thanks Mr. Douglas Eikermann for editorial assistance.

Appendix 1: Stata code for STAR*D data

First, we created a matrix that set the covariance between the random effects of BUP and VEN to 0:

matrix input ReCovMat=( .,. ,. \0,. ,. \ .,. ,. )
matrix rownames ReCovMat=BUP VEN Intercept
matrix colnames ReCovMat= BUP VEN Intercept

The random-effects linear model was fitted with the following code:

meglm HRSD17_Total black01 BUP_01 VEN_01 interBlackBUP ///
‖ id: BUP_01 VEN_01, covariance(fixed(ReCovMat))

where HRSD17_Total = Y, black01 is the race variable, BUP_01 = XBUP, VEN_01 = XVEN, interBlackBUP is the product of black01 and BUP_01, and id is a variable that identified the subjects.

Appendix 2: Stata code for respiratory data

The following code fitted the random intercept logistic regression model:

meglm GoodStat01 Age Placebo_01 Drug_01 ‖Patient: , ///
family(bernoulli) link(logit)

where GoodStat01 = Y, Placebo_01 = XP, Drug_01 = XA, and Patient is a variable that identified the subjects.

Appendix 3: Formulas for the empirical Bayes predictor and credibility interval for an individual benefit

Consider the 2-PM model in Section 4. Using formulas in Frees [12] (pages 149, 420), it can be shown that, for a particular patient ω, the posterior mean vector and variance-covariance matrix of (Λ,βQ)T given the patient's HAM-D17 scores are respectively

μY=σε2D(I2σε2ZTZ(D1+σε2ZTZ)1)Y+(E[Λ],E[βQ])Tand=DDZTV1ZD,

where D=Var((Λ,βQ)T), V=σε2Ik+ZDZT, k = k0,ω + k1,ω and I2 and Ik are 2 × 2 and k × k identity matrices. Z = (Zi j) is a k × 2 matrix such that Zi 1 = 1 for all 1 ≤ ik; Zi 2 = 0 for 1 ≤ ik0,ω; and Zi 2 = 1 for k0,ω + 1 ≤ ik. Finally, Y* = (b1, b2)T where

b1=k0,ω(Y¯0,ωE[Λ])+k1,ω(Y¯Q,ωE[Λ]E[βQ])andb2=k1,ω(Y¯Q,ωE[Λ]E[βQ]).

We substituted parameters in μY and Σ with estimates obtained with code in Appendix 1. Using the estimates of μY and Σ, we simulated 20,000 pairs of the form (Λ, βQ) from the corresponding bivariate normal distribution. These pairs were replaced into b2 in formula (7). The median of the 20,000 Monte Carlo benefits was the empirical Bayes predictor of the patient's benefit, and the 2.5% and 97.5% percentiles constituted the 95% credibility interval (Table 3). The author wrote a Stata ado program called preben, which does these computations (online material).

Footnotes

Conflicts of interest. The Author declares that there is no conflict of interest.

References

  • 1.Food and Drug Administration. Paving the way for personalized medicine: FDA's role in a new era of medical product development. U.S. Department of Health and Human Services. Food and Drug Administration; Silver Spring, MD: 2013. [Google Scholar]
  • 2.Senn S. Individual therapy: new dawn or false dawn. Drug Information Journal. 2001;35:1479–1494. [Google Scholar]
  • 3.Senn S. Mastering variation: variance components and personalised medicine. Statistics in Medicine. 2015 doi: 10.1002/sim.6739. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Diaz FJ, Berg MJ, Krebill R, Welty T, Gidal BE, Alloway R, Privitera M. Random-effects linear modeling and sample size tables for two special cross-over designs of average bioequivalence studies: the 4-period, 2-sequence, 2-formulation and 6-period, 3-sequence, 3-formulation designs. Clinical Pharmacokinetics. 2013;52:1033–1043. doi: 10.1007/s40262-013-0103-4. [DOI] [PubMed] [Google Scholar]
  • 5.Schall R, Endrenyi L, Ring A. Residuals and outliers in replicate design crossover studies. Journal of Biopharmaceutical Statistics. 2010;20:835–849. doi: 10.1080/10543401003618876. [DOI] [PubMed] [Google Scholar]
  • 6.L'Abbé KA, Detsky AS, O'Rourke K. Meta-analysis in clinical research. Annals of Internal Medicine. 1987;107:224–233. doi: 10.7326/0003-4819-107-2-224. [DOI] [PubMed] [Google Scholar]
  • 7.Lubsen J, Tijssen JG. Large trials with simple protocols: indications and contraindications. Controlled clinical trials. 1989;10:151–160. doi: 10.1016/0197-2456(89)90054-8. [DOI] [PubMed] [Google Scholar]
  • 8.Boissel JP, Kahoul R, Marin D, Boissel FH. Effect model law: An approach for the implementation of personalized medicine. Journal of Personalized Medicine. 2013;3:177–190. doi: 10.3390/jpm3030177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Sharp SJ, Thompson SG, Altman DG. The relation between treatment benefit and underlying risk in meta-analysis. British Medical Journal. 1996;313:735–738. doi: 10.1136/bmj.313.7059.735. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Li W, Girard P, Boissel JP, Gueyffier F. The calculation of a confidence interval on the absolute estimated benefit for an individual patient. Computers and Biomedical Research. 1998;31:244–256. doi: 10.1006/cbmr.1998.1477. [DOI] [PubMed] [Google Scholar]
  • 11.Rabe-Hesketh S, Skrondal A. Generalized linear mixed-effects models. In: Fitzmaurice G, Davidian M, Verbeke G, et al., editors. Longitudinal Data Analysis. London: Chapman & Hall/CRC; 2009. [Google Scholar]
  • 12.Frees EW. Longitudinal and panel data. Cambridge: Cambridge University Press; 2004. [Google Scholar]
  • 13.Sheiner LB, Rosenberg B, Melmon KL. Modelling of individual pharmacokinetics for computer-aided drug dosage. Computers and Biomedical Research. 1972;5:411–459. doi: 10.1016/0010-4809(72)90051-1. [DOI] [PubMed] [Google Scholar]
  • 14.Whiting B, Kelman AW, Grevel J. Population pharmacokinetics. Theory and clinical application. Clinical Pharmacokinetics. 1986;11:387–401. doi: 10.2165/00003088-198611050-00004. [DOI] [PubMed] [Google Scholar]
  • 15.Pillai G, Mentré F, Steimer JL. Non-linear mixed effects modeling – from methodology and software development to driving implementation in drug development science. Journal of Pharmacokinetics and Pharmacodynamics. 2005;32:161–183. doi: 10.1007/s10928-005-0062-y. [DOI] [PubMed] [Google Scholar]
  • 16.Diaz FJ, Yeh HW, de Leon J. Role of statistical random-effects linear models in personalized medicine. Current Pharmacogenomics and Personalized Medicine. 2012;10:22–32. doi: 10.2174/1875692111201010022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Diaz FJ, Cogollo M, Spina E, Santoro V, Rendon DM, de Leon J. Drug dosage individualization based on a random-effects linear model. Journal of Biopharmaceutical Statistics. 2012;22:463–484. doi: 10.1080/10543406.2010.547264. [DOI] [PubMed] [Google Scholar]
  • 18.Diaz FJ, de Leon J. The mathematics of drug dose individualization should be built with random effects linear models. Therapeutic Drug Monitoring. 2013;35:276–277. doi: 10.1097/FTD.0b013e318283e3c6. [DOI] [PubMed] [Google Scholar]
  • 19.Diaz FJ, Eap CB, Ansermot N, Crettol S, Spina E, de Leon J. Can valproic acid be an inducer of clozapine metabolism? Pharmacopsychiatry. 2014;47:89–96. doi: 10.1055/s-0034-1371866. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Botts S, Diaz FJ, Santoro V, Spina E, Muscatello MR, Cogollo M, Castro FE, de Leon J. Estimating the effects of co-medications on plasma olanzapine concentrations by using a mixed model. Progress in Neuro-Psychopharmacology & Biological Psychiatry. 2008;32:1453–1458. doi: 10.1016/j.pnpbp.2008.04.018. [DOI] [PubMed] [Google Scholar]
  • 21.Diaz FJ, Rivera TE, Josiassen RC, de Leon J. Individualizing drug dosage by using a random intercept linear model. Statistics in Medicine. 2007;26:2052–2073. doi: 10.1002/sim.2636. [DOI] [PubMed] [Google Scholar]
  • 22.Wang P, Tsai GF, Qu A. Conditional inference functions for mixed-effects models with unspecified random-effects distribution. Journal of the American Statistical Association. 2012;107:725–736. [Google Scholar]
  • 23.Fava M, Rush AJ, Trivedi MH, Nierenberg AA, et al. Background and rationale for the sequenced treatment alternatives to relieve depression (STAR*D) study. Psychiatric clinics of North America. 2003;26:457–494. doi: 10.1016/s0193-953x(02)00107-7. [DOI] [PubMed] [Google Scholar]
  • 24.Stokes ME, Davis CS, Koch GG. Categorical Data Analysis Using the SAS System. 2nd. Cary, NC: SAS Institute Inc.; 2000. [Google Scholar]
  • 25.Henderson CR. Estimation of variance and covariance components. Biometrics. 1953;9:226–252. [Google Scholar]
  • 26.Kalow W, Endrenyi L, Tang BK. Repeat administration of drugs as a means to assess the genetic component in pharmacological variability. Pharmacology. 1999;58:281–284. doi: 10.1159/000028292. [DOI] [PubMed] [Google Scholar]
  • 27.Neale MC, Cardon LR. Methodology for genetic studies of twins and families. London: Kluwer; 1992. [Google Scholar]
  • 28.Ozdemir V, Kalow W, Tothfalusi L, Bertilsson L, Endrenyi L, Graham JE. Multigenic control of drug response and regulatory decision-making in Pharmacogenomics: The need for an upper-bound estimate of genetic contributions. Current Pharmacogenomics and Personalized Medicine. 2005;3:53–71. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp Fig 3 color
Supp Fig S1
Supp Fig S2
Supp Fig S3
Supp Fig S4
Supp Instruction
Supp Material
Supp Preben
Supp Preben.sthlp

RESOURCES