Abstract
The estimation of carry-over effects is a difficult problem in the design and analysis of clinical trials of treatment sequences including cross-over trials. Except for simple designs, carry-over effects are usually unidentifiable and therefore nonestimable. Solutions such as imposing parameter constraints are often unjustified and produce differing carry-over estimates depending on the constraint imposed. Generalized inverses or treatment-balancing often allow estimating main treatment effects, but the problem of estimating the carry-over contribution of a treatment sequence remains open in these approaches. Moreover, washout periods are not always feasible or ethical. A common feature of designs with unidentifiable parameters is that they do not have design matrices of full rank. Thus, we propose approaches to the construction of design matrices of full rank, without imposing artificial constraints on the carry-over effects. Our approaches are applicable within the framework of generalized linear mixed-effects models. We present a new model for the design and analysis of clinical trials of treatment sequences, called Antichronic System, and introduce some special sequences called Skip Sequences. We show that carry-over effects are identifiable only if appropriate Skip Sequences are used in the design and/or data analysis of the clinical trial. We explain how Skip Sequences can be implemented in practice, and present a method of computing the appropriate Skip Sequences. We show applications to the design of a cross-over study with 3 treatments and 3 periods, and to the data analysis of the STAR*D study of sequences of treatments for depression.
Keywords: Augmented regression, Carry-over effects, Vross-over design, Design matrix, Estimability, Generalized inverses, Generalized least squares, Identifiability, Maximum likelihood, Placebo, Quasi-likelihood, Random effects linear models, Robust fixed-effects estimators
Resumen
La estimación de los efectos de arrastre es un problema difícil en el diseño y análisis de ensayos clínicos de secuencias de tratamientos, incluyendo ensayos cruzados. Excepto por diseños simples, estos efectos son usualmente no identificables y, por lo tanto, no estimables. La imposición de restricciones a los parámetros es a menudo no justificada y produce diferentes estimativos de los efectos de arrastre dependiendo de la restricción impuesta. Las inversas generalizadas o el balance de tratamientos a menudo permiten estimar los efectos principales de tratamiento, pero no resuelven el problema de estimar la contribución de los efectos de arrastre de una sequencia de tratamiento. Además, los períodos de lavado no siempre son factibles o éticos. Los diseños con parámetros no identificables comúnmente tienen matrices de diseño que no son de rango completo. Por lo tanto, proponemos métodos para la construcción de matrices de rango completo, sin imponer restricciones artificiales en los efectos de arrastre. Nuestros métodos son aplicables en un contexto de modelos lineales mixtos generalizados. Presentamos un nuevo modelo para el diseño y análisis de ensayos clínicos de secuencias de tratamientos, llamado Sistema Anticrónico, e introducimos secuencias de tratamiento especiales llamadas Secuencias de Salto. Demostramos que los efectos de arrastre son identificables sólo si se usan Secuencias de Salto apropiadas. Explicamos como implementar en la práctica estas secuencias, y presentamos un método para calcular las secuencias apropiadas. Presentamos aplicaciones al diseño de un estudio cruzado con 3 tratamientos y 3 períodos, y al análisis del estudio STAR*D de secuencias de tratamientos para la depresión.
Keywords: Cuasi-verosimilitud, diseño cruzado, efectos de arrastre, estimabilidad, estimadores robustos de efectos fijos, identificabilidad, inversas generalizadas, matriz de diseño, máxima verosimilitud, mínimos cuadrados generalizados, modelos lineales de efectos aleatorios, placebo
1. Introduction
The estimation of carry-over effects is a challenging problem in the design and analysis of clinical trials (CTs) of treatment sequences. Parametrizations of generalized linear mixed-effects models (GLMMs) that include carry-over effects usually produce design matrices that are not of full rank (Jones & Kenward 2015). When the design matrix is not of full rank, not all linear contrasts of parameters are identifiable (Christensen 2011). In particular, except for very simple designs and parametrizations, carry-over effects are usually not identifiable, which creates difficulties in the quantification of the contribution of carry-over effects to the subjects’ responses. The most common solution when the design matrix is not of full rank is to impose linear constraints that produce full rank. For instance, in cross-over trials, one assumption is that carry-over effects add up to zero (Jones & Kenward 2015). Such constraints are usually not justified, however, in terms of the clinical phenomenon under study, and this limits parameter interpretation (Fleiss 1989). Moreover, as pointed out by Christensen (2011), different computer packages may use different linear constraints, giving the impression that programmers employ some constraints for the sole purpose of delivering a computer output. The problem is that although different linear constraints on unidentifiable parameters produce identical estimates of an identifiable parameter, they lead to different estimates of the unidentifiable parameters (Christensen 2011, p. 97).
Another common approach when the design matrix is not of full rank is to use generalized inverses (Christensen 2011, Bronson 1989). Usually, data analysts can use this approach to estimate overall differences among treatment effects. Only identifiable contrasts can be estimated in this way, however, and further elaborations on how treatments administered in earlier stages of the clinical trial affect the final response are often difficult, except for simple parametrizations that are not always justified (Fleiss 1989, Senn 2002, Jones & Kenward 2015). With this approach, therefore, the problem of examining the contribution of carry-over to the subjects’ responses remains open.
The main objective of this article is to propose an approach to constructing a design matrix of full rank for clinical trials (CTs) of treatment sequences (including cross-over trials) without imposing unjustified constraints on the parameters representing carry-over effects. The goal is to achieve the identifìability of model parameters including the parameters corresponding to carry-over effects. The proposed approach is applicable within the framework of GLMMs and allows assessing the contribution of carry-over effects to the subjects’ responses. With this approach, data analysts can assess not only first-order carry-over effects but also effects from treatments administered two or more stages (or periods) earlier.
Cross-over trials are the most common CTs of treatment sequences (Jones & Kenward 2015, Senn 2002, Diaz, Berg, Krebill, Welty, Gidal, Alloway & Privitera 2013, Berg, Welty, Gidal, Diaz, Krebill, Szaflarski, Dworetzky, Pollard, Elder Jr, Jiang, Jiang, Switzer & Privitera 2017, Grajales & Lopez 2006). In this article, however, the author is not only concerned with cross-over trials but also with CTs of treatment sequences in general. A representative example of a CT of treatment sequences, whose study design is not usually treated in the literature of cross-over trials, is the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) study (Fava, Rush, Trivedi, Nierenberg, Thase, Sackeim, Quitkin, Wisniewski, Lavori, Rosenbaum & Kupfer 2003). In that study, the researchers investigated adult subjects who were treated in outpatient settings for non-psychotic major depressive disorder. The primary purpose of the study was to determine which alternative treatments worked best if Citalopram monotherapy did not produce an acceptable response. The investigators examined a large number of treatments and treatment sequences including antidepressants and/or cognitive therapy and combined standard randomization procedures and a detailed protocol with a naturalistic approach that modeled real-life clinical practice. For instance, if a treatment fails, the subject and physician may choose the next treatment from a prescribed list of alternative treatments that are allowed at that stage, or the subject may be randomly assigned a treatment from a subset of the list determined by the subject and/or the physician. The idea was to repeat this procedure until reaching an acceptable treatment.
The difficulty of modeling carry-over effects is well documented in the crossover trials literature (Senn & Lambrou 1998, Senn, D’Angelo & Potvin 2004, Senn 2002, Abeyasekera & Curnow 1984, Fleiss 1989). For pharmacokinetic studies, the modern consensus is that minimizing carry-over effects during the design and implementation of the study is better than attempting to model these effects during the data-analysis phase (Senn 2002, Privitera, Welty, Gidal, Diaz, Krebill, Szaflarski, Dworetzky, Pollard, Elder, Jiang, Jiang & Berg 2016). In particular, specialists recommend washout periods in order to rationally exclude the possibility of carry-over effects, followed by regression modeling that does not include carry-over effect parameters [Senn (2002), Center for Drug Evaluation and Research(2001, 2003)]. In pharmacodynamic or behavioral studies, however, the situation is more complex. Although washout periods of sufficient length may help eliminate the carry-over effects of some therapies, the permanence of carry-over effects cannot always be ruled out when cognitive or learning processes are involved in the therapy (Hofmann, Wrobel, Kessner & Bingel 2014). Moreover, washout periods may not be ethical or convenient in some cases. For instance, patients may not agree to stop receiving treatment temporarily because they believe that doing this may exacerbate their symptoms.
In some cases, in cross-over trials, a balance of treatments across periods can be used to eliminate the influence of carry-over effects on treatment comparisons (Senn 2002). Even in these cases, however, an evaluation of how one treatment administered in one period affects the responses measured in subsequent periods cannot be conducted if the carry-over effects are not identifiable. When the presence of carry-over effects cannot be rationally ruled-out, another common solution is to include only first-order carry-over effects in the modeling equations. However, this type of carry-over effect, despite its widespread use in data analyses of pharmacological trials, may not be consistent with standard pharmacokinetic theory (Senn 2002).
Mixed-effects regression modeling is a well-established approach in statistical practice (Frees 2004, Diaz et al. 2013, Berg et al. 2017, Diaz 2016). In particular, the approach is frequently used in the analysis of clinical trials collecting repeated measures and/or longitudinal data. This article presents new developments in GLMMs that have not been published before. These developments are applicable to continuous and discrete responses, including dichotomous and count responses. Our approach is applicable to pharmacological and non-pharmacological therapies and combinations of these. It can also accommodate the same treatment being administered at different stages (or periods) of the trial to some subjects as well as different treatment durations at a particular stage. In addition, it allows the examination of whether clinical decisions made at a particular sequence stage affect the subjects’ responses in future stages as well as the investigation of delayed treatment effects. For instance, for the STAR*D study, we can answer the question: Is the benefit of cognitive therapy administered in the first stage still present in the second stage which consisted of only antidepressant drugs?
GLMMs are a combination of two general modeling frameworks, which are very popular in statistical practice: random effects linear regression models (RELMs) and generalized linear models (GLMs) (see the review by Rabe-Hesketh & Skrondal 2009). RELMs (Henderson 1953, Laird & Ware 1982) consist of the classic linear regression model with the additional assumption that some of the regression coefficients are random variables. RELMs have had a dramatic impact on the statistical analysis of repeated measures and longitudinal data (Frees 2004, Fitzmaurice & Molenberghs 2004, Hooks, Marx, Kachman & Pedersen 2009, Diaz et al. 2013, Diaz 2017). A GLM (Nelder & Wedderburn 1972) is essentially a linear regression model whose response has a distribution that belongs to an exponential family of distributions with a possibly transformed mean response that depends linearly on covariates. The richness of GLMs comes of the profound statistical properties of exponential families (see Shao 2003) and the fact that many useful families are exponential (e.g. normal, Poisson and Bernoulli). Popular GLMMs are the Poisson regression model with random effects (Long, Preisser, Herring & Golin 2015), the logistic regression model with random effects (Vermunt 2005), and RELMs.
The most popular estimation methods for GLMMs are maximum likelihood, quasi-likelihood, and generalized least squares (GLS) (Breslow & Clayton 1993, Rabe-Hesketh, Skrondal & Pickles 2005, Frees 2004), but robust methods such as the generalized method of moments (Kim & Frees 2007) and robust fìxed-effects estimation (Hausman 1978, Mundlak 1978, Frees 2001, Kim & Frees 2006, Ebbes, Bockenholt & Wedel 2004, Frees 2004) also exist. These methods can estimate only identifiable model parameters (Christensen 2011). Thus, if the design matrix is not of full rank, users cannot estimate all parameters. Therefore, ideally, the design matrix should be of full rank to ensure the estimability of all parameters and contrasts of interest. As illustrated in this article, the definition of covariates in the context of clinical trials of treatment sequences is not a trivial algebraic problem if we want to obtain a design matrix of full rank that includes all carryover effects of interest. In this article, we propose a new model, called Antichronic System (AS), along with new experimental design and analysis strategies that assist in the construction of design matrices of full rank. Once users implement the proposed tools, they can utilize the usual estimation methods.
In addition to the difficulties of building an appropriate design matrix, we face another challenge in the application of GLMMs to clinical trials of treatment sequences. The treatments administered at some stages may be determined by (unmeasured or measured) subjects’ characteristics and/or by their responses to treatments administered in previous stages. Problems such as self-selection, simultaneity, and omitted variables may occur (Kim & Frees 2006, Ebbes et al. 2004). Such problems can occur, for instance, when the subjects are not randomly assigned to the sequences. They cause correlations between random effects and covariates and, consequently, if data analysts employ inappropriate estimation methods, they bias the estimators of GLMM parameters. Fortunately, the econometrics literature contains well-established solutions for analogous problems. In the case of RELMs, the most popular solutions are robust fìxed-effect estimation, which provides unbiased estimates of regression coefficients for stage dependent covariates (Hausman 1978, Kim & Frees 2006), and augmented regression, which provides unbiased coefficient estimates for time invariant covariates such as demographics (Mundlak 1978, Frees 2001, 2004). These are the estimation methods we will adopt for the analysis of the STAR*D data. On the other hand, if reasons exist to believe that these issues did not occur, standard maximum likelihood, quasi-likelihood or GLS may be appropriate.
Due to the novelty of our approach and to ensure mathematical coherence, rigorous definitions of the new concepts will be necessary. In sections 2–3, we introduce some special sequences of treatments, called skip sequences (SSs), and propose that SSs be incorporated in the experimental design and analysis of trials because they guarantee the existence of design matrices of full rank. Section 4 introduces antichronic systems (ASs), which are models of treatment sequences. Section 5 connects ASs with GLMMs. Section 6 presents our main results, which show the important role of SSs in the construction of design matrices. Theorem 2 shows that the identifìability of all model parameters is guaranteed only if SSs are implemented. Theorems 3–4 establish methods for implementing SSs in practice. Section 7 presents a test to examine whether the responses obtained in a treatment stage (or period) are influenced by a previous stage. Section 8 shows an application to the design of cross-over trials with 3 treatments and 3 periods. Sections 9–10 analyze the STAR*D data using ASs. See proofs of theorems in Appendix A. To understand some of the proofs, the reader must have a familiarity with advanced algebraic treatments of GLMMs (for instance, Frees 2004).
2. Introduction to the Strategies to Obtain a Design Matrix of Full Rank
This and the next section present a convenient notation representing the process of administering a sequence of medical or behavioral treatments (MBTs) to a subject (or a patient) suffering from a chronic disease. Let Y be a (possibly-transformed) variable measuring or indicating some aspect of the state of a particular chronic disorder of a subject (or patient). Assume the goal of a MBT is to modify the value of Y. Examples of Y are: 1) a dichotomous indicator of whether or not a child with autism has appropriate expressive language skills; 2) the positive or negative subscales of the PANSS syndrome scale, which measure schizophrenia severity; 3) blood glucose or cholesterol concentrations, which partially assess the metabolic syndrome; 4) the 17-item Hamilton Rating Scale for Depression (HAM-D17); or 5) the total number of abstemious days (NADs) in a month, which assesses chronic alcoholism.
Assume the CT consists of at most q stages, q ≥ 2. (In a cross-over trial, a stage corresponds to a period.) At Stage i, li different MBTs (or “decisions”) Ai,1, …, Ai,li are available for administration to the subject, where li ≥ 2, i = 1, …, q; the clinician must administer one and only one of these decisions to the subject. The set of all decisions, {A1,1, …, A1,l1, …, Aq,1, …, Aq,lq} is called the treatment pool, and Ai,1, …, Ai,li are called i-stage decisions. Here, for all , but the same decision can be used in two or more different stages, that is, we can have that Ai, j = Ai′, j′ for some i ≠ i′ and some pair (j, j′). In other words, our methods can accommodate the same treatment occurring at different stages for some subjects. Also, in our formulation, the MBTs of a particular stage can have different durations. For instance, A1,1 can represent “administer cognitive therapy for only 1 month”, A1,2 can represent “administer cognitive therapy for 3 months”, and so on.
Let Y0,ω be the baseline value of Y for a subject ω, measured just before the sequence of decisions is administered. Our formulation introduces a new mathematical object, symbolized as (read “koppa”) and called the basal backdrop. As explained in Section 4, acts as a zero vector in computations. The symbol is interpreted as the existing clinical context under which Y0,ω is measured, that is, the context before Stage 1 begins. By definition, for all i and j. That is, no treatment from the treatment pool can be administered at baseline.
allows representing symbolically two proposed alternative strategies for obtaining a design matrix of full rank. In one strategy, represents placebo and it is assumed that placebo effects are negligible. For instance, in a CT with q = 2 stages, the pair (A1,5, A2,3) represents a sequence in which a subject receives treatment A1,5 at Stage 1 and treatment A2,3 at Stage 2. In contrast, the pair (, A2,3) will be used to represent the following sequence of events: the subject is administered placebo at the first stage and A2,3 at the second stage. Similarly, if q = 3, the triplet (, , A2,3) will indicate that the subject is administered placebo in the first two stages and treatment A2,3 at the third stage. This strategy is called real placebo.
In the second strategy, called virtual placebo, symbolizes a virtual absence of treatment at a particular stage, but the first treatment that the patient receives corresponds to a treatment from a subsequent stage. For instance, in a CT with q = 2 stages, the pair (, A2,3) will be used to represent the following strategy: the subject is not administered any 1-stage treatment during Stage 1, but is administered A2,3 at Stage 1 instead. That is, the subject “skips” having a 1-stage treatment, receives the 2-stage treatment A2,3 right after measuring Y0,ω, and completes the trial at the end of Stage 1. Although A2,3 is implemented in this subject as if it were a 1-stage treatment, A2,3 will be treated as a 2-stage treatment in the design matrix. In particular, the response determined by A2,3 will be treated as a response measured at the end of Stage 2. Provided Y0,ω measures a stable illness state, this strategy is methodologically sound and does not compromise the conclusions of data analysis. This is a consequence of Theorem 4. As shown in Theorems 2 and 3, the above strategies help construct design matrices of full rank.
3. Admissible and Skip Sequences
For r = 1, …, q, a treatment sequence up to Stage r is viewed as an element of the Cartesian product . For fix , denote , where , i ≤ r. A sequence is called inadmissible if there exist i and j with 1 ≤ i < j ≤ r such that but , or if for al1 i = 1, …, r. A sequence that is not inadmissible is called admissible. The subset of admissible sequences in is denoted as . An admissible sequence is called a skip sequence (SS) if for some i. An inadmissible sequence can be interpreted as a protocol violation in which a subject was returned at some stage ≥ 2 to baseline conditions, which are not part of the treatment pool.
Thus, any admissible sequence can be written as either
for some r = 1, …, q and some ji’s such that ji ∈ {1, …, li}. Only admissible sequences will be used in our model. Examples of admissible sequences in are (A1,2, A2,2, A3,1), (, A2,2, A3,1), and (, , A3,1). All these except the first are skip. In the second sequence, under a virtual-placebo strategy, the subject “skipped” receiving any 1-stage treatment, but received A2,2 at Stage 1 and A3,1 at Stage 2, completing the trial at the end of Stage 2. In the third sequence, the subject “skipped” receiving 1- and 2-stage treatments, but received A3,1 at Stage 1 and completed the trial at the end of Stage 1. Examples of inadmissible sequences are and .
4. Antichronic Systems
This section presents a new model of sequences of decisions. We assume that there is a clearly defined population of subjects to whom admissible sequences of possibly different lengths will be administered. Suppose a subject experienced (or was administered) a sequence . Under a real placebo strategy, we assume that the subject’s response Y was measured at the end of stages 1, …, r. Alternatively, under a virtual placebo strategy, we assume that the response was measured at the end of stages , where . Measured responses from subject ω are denoted as Yk,ω with k ∈ {0, 1, …, r} under real placebo, or under virtual placebo. Thus, after administering , the subject’s response vector is under real placebo, but it is under virtual placebo. Under virtual placebo, if for some particular i, Yi,ω is undefined. However, Y0,ω must be measured in both approaches.
Importantly, our model identifies decision with a row vector of dimension li which has 1 in its j-th component but 0 elsewhere. (That is, ). Also, if occurs at Stage i, we identify with the zero row vector of dimension li. [That is, .] Thus, a sequence is viewed as a sequence of row vectors such that has dimension li.
Definition 1. A set of admissible sequences is called an antichronic system (AS) if, for subject ω, there exist a unique number αω and unique column vectors βij,ω of dimension li, i = 1, …, j, j = 1, …, q, such that, for all r = 1, …, q, if the subject experiences sequence , then
| (1) |
where k = 1, …, r if real placebo is implemented, or under virtual placebo. We assume includes at least one sequence in .
Here, g is a known link function and Eω[·] represents conditional expectation given subject ω, that is, given both Λω and βij,ω, i = 1, …, j, j = 1, …, q. In addition, Xω is a p × 1 vector of baseline clinical, environmental, genetic, biological or demographic covariates, and . The number αω and the vectors βij,ω are all characteristic constants of subject ω (they do not change over time). However, some of them may vary from subject to subject. Thus, at the subject population level, αω and βij,ω are viewed as particular realizations of a random variable α and a random vector βij, which may possibly take on constant values. It is possible to assume that λ varies from subject to subject, but the simpler model that assumes a fixed λ will suffice to introduce our ideas. For fixed ω, under real placebo, the conditional distributions of Y0,ω, Y1,ω, …, Yr,ω belong to the same exponential family (e.g. normal, Poisson or Bernoulli), although with possibly different parameter values. Similarly, under virtual placebo and fixed ω, the conditional distributions of Y0,ω and belong to the same exponential family. Importantly, observe if , because acts as a zero vector in computations (that is, in that case).
To interpret equation (1), denote with . The effect of subsequence on Yk,ω is equal to the sum: “Effect of 1-stage treatment on Yk,ω (that is, θ1k,ω)” plus “effect of 2-stage treatment on Yk,ω” (θ2k,ω) plus …plus “effect of k-stage treatment on Yk,ω” (θkk,ω). Thus, by denoting , we see that is the effect of decision on response Yk,ω, if decision was actually taken at stage i, i ≤ k. Note that we can easily model delayed treatment effects. For instance, if higher values of Y represent a less severe illness, and if , but for all r with i ≤ r < k, then can be interpreted as a delayed beneficial effect of treatment on response Yk,ω. Finally, note we are measuring all these effects with respect to baseline conditions.
The virtual placebo approach requires an additional assumption. To present the assumption, we need the concept of interchangeable sequences. Let r and t be such that 1 ≤ r < t ≤ q. We say that a sequence is interchangeable with a sequence if . As an interpretation, the skip sequence T may represent the hypothetical situation in which the administration of was postponed t − r stages but the baseline conditions did not change during the first t − r stages.
Interchangeability assumption for virtual placebo:
Suppose and are interchangeable sequences. Then, for a particular subject ω, has the same distribution as . Thus, interchangeable sequences and T will potentially produce the same responses in a subject. Consequently, the effect of will not change in the hypothetical situation that we delay the administration of the sequence t − r stages, provided the subject continues on the baseline clinical backdrop () during the delay.
The practical consequence of the Interchangeability Assumption is that the effect of on a subject ω can be examined by administering right after measuring Y0,ω, and labeling the obtained responses as Yt − r+1,ω, Yt − r+2,ω, …, Yt,ω. That is, we do not need to delay the administration of by administering placebo during the first t − r stages, and we can use the responses to to incorporate T in the design matrix. Thus the name “virtual placebo”. We demonstrate in Section 6 that this approach is valid under the Interchangeability Assumption and that we need sequences like T in order to build a design matrix of full rank. This approach is recommendable when placebo is unfeasible or unethical, but other applications are possible as illustrated in Sections 9–10. Note that the Interchangeability Assumption is an assumption about particular sequences and T. Not all sequences of an AS have to satisfy interchangeability assumptions in order for the virtual placebo approach to be applicable.
Finally, note that both equation (1) and the Interchangeability Assumption implicitly require that Y0,ω measures a stable illness state (relative to sequence duration). That is, they are applicable if the subject’s baseline state will not change rapidly in the hypothetical situation that the subject is not intervened. That subjects must enter the study in a stable condition is an inclusion criterion of many clinical trials.
5. Matrix Representation of Responses
Next we exhibit the connection between ASs and the algebraic machinery that underlies GLMMs. To motivate the connection using q = 2, suppose subject ω was administered with . Then, by equation (1),
| (2) |
where the ’s represent row vectors, not all of them with the same dimension, and m2 = 2l1 + l2. If , we treat as a zero vector; and if a virtual placebo approach is followed we do not use the equation at the second row.
In general, suppose that . If the study follows a real placebo strategy, we define
![]() |
(3) |
where , with ui = l1+ ⋯ + li. If the study implements virtual placebo and is not skip, is defined exactly as in equation (3); but if is skip, V is defined as the matrix obtained after eliminating rows 2 through from the matrix in (3).
The number of rows of V is equal to the dimension of the subject’s response . The first row of V includes only zeros because, by equation (1), Y0,ω does not depend on any investigated MBT. All V matrices need to be put on top of each other to build the study design matrix. Therefore, they must have the same number of columns. For this reason, if r < q, the last mq − mr columns of V must contain only zeros. If r = q, the additional zero columns in equation (3) are not needed. The following theorem gives a matrix representation of responses.
Theorem 1. Let be an antichronic system and suppose that, by the end of Stage r, r ≤ q, a particular subject ω with coυariate vector Xω has been administered sequence . Then,
( and 1 have the same number of rows as V.) For convenience, we denote
Theorem 1 shows that an AS is algebraically analogous to a GLMM of repeated measures of Y. Specifically, g (Eω [Yr,ω]) depends linearly not only on the covariates in Xω but also on stage-dependent covariates whose values for a subject who underwent sequence are given in the first mr columns of . Thus, at least in principle, to estimate AS parameters, we can adapt standard software for fitting GLMMs (or RELMs in the case of Gaussian response with identity link). For example, we can use SAS GLIMMIX or MIXED procedures (SAS Institute Inc, Cary, NC, USA) or Stata’s meglm and mixed commands (StataCorp LP, College Station, TX, USA). Stata’s xtreg command with the option fe provides fixed-effects estimators for RELMs, which are robust to self-selection, simultaneity and omitted variables (Hausman 1978, Ebbes et al. 2004, Kim & Frees 2006). The option vce(robust) additionally provides Huber/White estimators of standard errors, which are robust to covariance matrix misspecifications (Huber et al. 1967, White 1982).
Fitting an AS requires that appropriate (stage-dependent) covariates be created, using matrix as a template. The supplementary material, described in Appendix C, provides a Stata ado program written by the author, called asdesign, which creates the covariates. The help file of asdesign provides additional instructions to fit an AS.
Here, we are interested in estimating the fixed effects vector E[γ], where γ = (α, λT, δ(q))T and
(We interpret γω as a constant vector characterizing one particular subject, which is a realization of the random vector γ.) If the experimental design does not incorporate SSs, however, the study design matrix will not be of full rank and, therefore, E[γ] will not be identifiable. We show this in Theorems 2–3.
Finally, note that the Interchangeability Assumption does not participate in the proof of Theorem 1. This assumption is required to prove Theorem 4, which guarantees that the introduction of SSs for the virtual placebo approach will not affect parameter interpretation.
6. Experimental Designs and Data Analysis Strategies for Antichronic Systems
This section presents the crux of our proposal: to estimate all AS model parameters, CTs of treatment sequences must incorporate SSs in their experimental design and/or data analysis. First, we show we need SSs to render E[γ] identifiable. Then we explain strategies for constructing SSs in practice.
In the context of a CT, let , be distinct admissible sequences. Assume that is an AS, and that we have a sample of N subjects, each of whom experienced one sequence in . Suppose Ni subjects completed sequence , i = 1, …, d, where Ni = N and Ni ≥ 1 for all i. For , if subject ω has covariate vector Xω and underwent sequence , the matrix is the subject’s design matrix (DM), ω = 1, …, N.
Under the usual assumptions of GLMMs, E [γ] will be identifiable if and only if the study design matrix is of full rank, regardless of the method used to estimate E[γ] (for instance, quasi-likelihood, maximum likelihood, generalized method of moments or robust fixed-effect estimation) (see Christensen 2011). The following theorem indicates the importance of SSs for the estimation of E[γ].
Theorem 2. Let , where . A necessary condition for K to be of full rank is that the following two facts simultaneously occur: (1) ρ(K) = ρ(D)+p, and (2) at least one subject experiences a treatment sequence such that is skip. (Here, ρ(A) = rank of A.)
The practical consequence of Theorem 2 is that, to estimate E[γ] through any method implemented in statistical software for GLMMs without imposing additional restrictions on E[γ], the clinical trial must implement SSs. Appendix B illustrates ASs with a DM of full rank. In practice, at the design stage of a CT, we must add SSs to the set of investigated treatment sequences until D is of full rank. This sometimes can also be done at the data analysis stage of existing data, as in Section 9. With existing data, however, it is not always possible to implement an adequate amount of SSs (see below). Condition 1 of Theorem 2 is achieved when the covariates in Xω are not perfectly correlated with each other and the final sequence allocations.
The next theorem will be useful to find the appropriate SSs at the design stage of a CT.
Theorem 3. Suppose are c distinct non-skip sequences in , which will occur with positive probability in the CT. Assume that each 1-stage treatment will be administered in the first stage of at least one of these sequences. Denote
Then, there exist τ skip sequences such that is of full rank, where
The practical implication of Theorem 3 is that E[γ] will be identifiable and therefore estimable if the CT implements all the sequences , . That is, adding to the design will produce a design matrix of full rank. The theorem’s proof actually establishes a procedure to construct starting from . In essence, these SSs are taken from some of the families of SSs,
There are a total of families, but only τ of them need to be used. To illustrate, for q = 4, any member of the family can be written as (, A2,3, B, C), where B and C are 3-stage and 4-stage treatments.
Specifically, the procedure to construct consists of the following easy steps: (1) Using the treatment names, label the columns of Γ. With our notation, the labels are
from the left to the right column of Γ. (2) Convert Γ into an echelon matrix using only elementary row operations, and then identify the columns of the echelon that do not have a pivot. Let A〈1〉, …, A〈τ〉 denote their labels. (3) Lastly, for each k = 1, …, τ, choose from the family . We recommend choosing the sequence in that both can be implemented in practice and will cause the lightest burden for the subjects. For each k, the CT has to implement at least one SS from the family . Otherwise, E[γ] will not be estimable. Section 8 illustrates this procedure for a cross-over design.
As explained in Section 2, two general strategies to implement SSs in practice are possible: real and virtual placebo. In the real placebo strategy, the trialist implements sequence by administering a placebo at each stage k with . The rest of this section elaborates further on virtual placebo. This strategy exploits the following theorem, which is a consequence of the Interchangeability Assumption.
Theorem 4. Let be an antichronic system and . Suppose that is interchangeable with , where r < t. Then, for a specific subject ω,
The practical consequence of Theorem 4 is that we can use W (T, Xω) in place of W (S, Xω) in order to code the covariates of a subject who experienced . The theorem guarantees that doing this will not affect the interpretation of model parameters contained in E[γ].
As suggested by Theorem 4, can be implemented in a CT by administering without delay after measuring Y0,ω, labeling the obtained r responses as Yt − r +1,ω, Yt − r +2,ω, …, Yt,ω. For instance, suppose that q = 3 and that (, , A) is needed, where A is a 3-stage treatment. To implement this sequence, some subjects are administered treatment A right after measuring Y0,ω, without a previous administration of 1-stage or 2-stage treatments. Then, the response produced by A is labeled as Υ3,ω. Responses to 1- or 2-stage treatments will not be needed from these subjects, because the DMs of these subjects will not have rows corresponding to these stages.
Under some circumstances, we can use virtual placebo in the analysis of data from studies that did not implement SSs in their protocol. The method constructs SSs at the data-analysis phase of the study and is also based on Theorem 4. To code the covariates of a subject who experienced , we can use W (T, Xω) in place of if the skip sequence T is needed to construct a study design matrix of full rank and is interchangeable with T. For instance, for q = 2, suppose that A is simultaneously a 1-stage and 2-stage treatment, and that, after providing Y0,ω, a particular subject was administered A at Stage 1, then the subject provided Y1,ω at the end of this stage, but then he/she dropped out of the study right after that. These events can be represented as . Matrix corresponding to can be used to construct K. But if we need SSs to estimate E [γ], we can use matrix V(T) instead, where , and relabel Y1,ω as Y2,ω. We followed this method in the analysis of the STAR*D data in Sections 9–10. Clearly, this approach to data analysis is not always applicable to studies that were not designed with SSs in mind, because it requires that some treatments be simultaneously administered at two or more different stages, and appropriate SSs may not always be available. This also highlights the importance of incorporating SSs at the planning stage of a CT.
7. Effect of a Stage on Future Responses
ASs allow testing naturally the null hypothesis that clinical decisions made at a particular stage do not affect the subjects’ responses in future stages. Note that βik includes all the effects of i-stage treatments on Yk,ω. If βik ≠ 0, we say that Stage i affects Yk,ω. The importance of this concept is that if βik = 0 for i < k, then clinical decisions made at Stage i will not affect the response measured at Stage k, and there are no delayed effects of Stage i on Yk,ω. To examine whether Stage i does not affect Yk,ω we test the null hypothesis, βik = 0. This test can be implemented with standard statistical software for fitting GLMMs, examining the joint significance of the covariates corresponding to βik. These covariates are called i-to-k transition covariates. For a particular subject, the values of these (stage-dependent) covariates are in columns c1 through c2 of matrix V, where and . For convenience, the covariates corresponding to βii are called i-stage covariates; these are also stage-dependent.
For instance, for q = 2, the third terms of the equations in (2) contain the 1-to-2 transition covariates. There are l1 of these covariates, one for each 1-Stage treatment. Similarly, first and second stage covariates are in the second and fourth terms, respectively. Tables 2 and 5 illustrate these types of covariates for the crossover example (Section 8) and the STAR*D data (Section 9), respectively.
8. Application: Improving a Cross-Over Design with 3 Treatments and 3 Periods
This Section illustrates an application of the proposed methodology to the design of a cross-over study of 3 treatments administered in 3 periods. The treatments are denoted A, B and C. Specifically, we show how placebo can be implemented in order to achieve the estimability of carry-over effects. Suppose the trialist wants to implement 6 sequences arranged in 2 Latin squares, as follows (Senn 2002):
We assume a baseline measure is obtained prior to the administration of a sequence. We also assume a normal response Y, independent and identically distributed random errors, and a random intercept representing subjects’s heterogeneity (Jones & Kenward 2015). Cross-over studies often assign the same number of subjects to the investigated sequences, but we do not need to assume this for this illustration.
Table 1 shows the expected responses by sequence and period for the above sequences, assuming the responses satisfy an AS (sequences 1–6). In Table 1, a parameter τAij measures the effect of treatment A on Y, when A is administered in period i and Y is measured at the end of period j, for i ≤ j and i, j = 0, 1, 2, 3. This effect is measured relative to the average baseline response μ. The effects of treatments B and C are defined analogously. Table 1 also shows expected responses for some skip sequences (sequences 7–12). Here, we interpret as placebo and assume placebo has negligible effects on Y.
Table 1:
Expected responses under a cross-over design with 3 treatments, 3 periods and 6 sequences (Sequences 1–6), assuming an antichronic system. Expected responses for some skip sequences are also shown (Sequences 7–12). Placebo is symbolized with .
| Period | ||||
|---|---|---|---|---|
| Sequence | 0 (Baseline) | 1 | 2 | 3 |
| 1 ABC | μ | μ + τA11 | μ + τA12 + τB22 | μ + τA13 + τB23 + τC33 |
| 2 BCA | μ | μ + τB11 | μ + τB12 + τC22 | μ + τB13 + τC23 + τA33 |
| 3 CAB | μ | μ + τC11 | μ + τC12 + τA22 | μ + τC13 + τA23 + τB33 |
| 4 ACB | μ | μ + τA11 | μ + τA12 + τC22 | μ + τA13 + τC23 + τB33 |
| 5 BAC | μ | μ + τB11 | μ + τB12 + τA22 | μ + τB13 + τA23 + τC33 |
| 6 CBA | μ | μ + τC11 | μ + τC12 + τB22 | μ + τC13 + τB23 + τA33 |
| Some skip sequences: | ||||
| 7 A | μ | μ | μ | μ + τA33 |
| 8 B | μ | μ | μ | μ + τB33 |
| 9 C | μ | μ | μ | μ + τC33 |
| 10 CA | μ | μ | μ + τC22 | μ + τC23 + τA33 |
| 11 CB | μ | μ | μ + τC22 | μ + τC23 + τB33 |
| 12 CC | μ | μ | μ + τC22 | μ + τC23 + τC33 |
We built the DM corresponding to sequences 1–6 without assuming any linear constraint on model parameters. For a design with only one subject per sequence, the DM is shown in Table 2. For a study with more than one subject in a sequence, the study DM will include the rows corresponding to the sequence as many times as subjects are assigned to the sequence. As predicted from Theorem 2, the matrix in Table 2 (and the study DM in general) are not of full rank.
Table 2:
Design matrix for sequences 1–6 in Table 1, assuming an antichronic system. The parameter corresponding to a particular column is shown at the top of the column. For a study with Ni subjects on a sequence, the complete study design matrix includes the four rows of the sequence Ni times.
| Period 1 | 1-to-2 transition | Period 2 | 1-to-3 transition | 2-to-3 transition | Period 3 | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sequence | Period | μ | τA11 | τB11 | τC11 | τA12 | τB12 | τC12 | τA22 | τB22 | τC22 | τA13 | τB13 | τC13 | τA23 | τB23 | τC23 | τA33 | τB33 | τC33 |
| 1 ABC | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1 ABC | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1 ABC | 2 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1 ABC | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 |
| 2 BCA | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2 BCA | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2 BCA | 2 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2 BCA | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
| 3 CAB | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3 CAB | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3 CAB | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 3 CAB | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 1 | 0 |
| 4 ACB | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 4 ACB | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 4 ACB | 2 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 4 ACB | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 |
| 5 BAC | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 5 BAC | 1 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 5 BAC | 2 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 5 BAC | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 1 |
| 6 CBA | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 6 CBA | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 6 CBA | 2 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 6 CBA | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 1 | 0 | 0 |
Here, we are interested in estimating the percent contribution of carry-over to the total effect of a treatment sequence. For instance, the percent contribution of carry over to the total effect of sequence ABC is defined as
Similarly, the contribution of carry-over to the total effect of the partial sequence AB is
Carry-over contributions for other sequences are defined analogously.
In general, however, if the cross-over trial includes only sequences 1–6, the numerators of the carry-over contributions will not be estimable and, therefore, we will not be able to estimate the carry-over contributions. To demonstrate this assertion, we used Searle criterion for estimability (Searle 1966). According to this criterion, if τ is the column vector containing the 19 parameters in Table 1 (including μ), L is a row vector, and Lτ is a contrast of these parameters, then Lτ is estimable if and only if
where M is the matrix in Table 2 and (MT M)− is the Moore-Penrose inverse of MTM (Bronson 1989). Using Searle criterion, we found that all parameters in Table 1 are nonestimable, except μ, τA11, τB11 and τC11, if only sequences 1–6 are implemented in the trial. In addition, although the denominators of all percent carry-over contributions are estimable, the numerators are not.
To assess carry-over contributions, the estimability of all model parameters is needed. By Theorem 2, we will be able to achieve estimability if we add SSs to the design. Figure 1 illustrates how to compute appropriate skip sequences in accordance to the procedure explained after Theorem 3. The top portion of the figure shows the matrix Γ, whose six rows represent sequences 1–6. The figure also shows the reduced row echelon form of matrix Γ. Since the echelon has four columns without pivots, at least four skip sequences are necessary to achieve the estimability of all model parameters. The bottom of Figure 1 suggests that we need to add to the design the sequences (, , A), (, , B) and (, , C), and at least one of the sequences (, C, T) with T = A, B, or C. Thus, for instance, if at least one subject is randomized to each of the sequences 1–10 in Table 1, percent carry-over contributions can be estimated. Table 3 shows the DM corresponding to the skip sequences 7–10 in Table 1, assuming only one subject per sequence and without imposing any parameter constraint. The reader can verify that if Tables 2 and 3 are combined, the resultant matrix is of full rank as predicted from Theorem 3. Therefore, the study DM of a cross-over trial including sequences 1–10 will be of full rank, provided that at least one subject undergoes each sequence. We computed the Moore-Penrose inverse with SAS PROC IML (SAS Institute Inc.) and the echelon matrix in Figure 1 with the calculator TI-Nspire CX CAS (Texas Instruments Inc.).
Figure 1:
Illustration of the computation of skip sequences for a cross-over design with 3 treatments, 3 periods and 6 sequences, according to Theorem 3.
Table 3:
Design matrix for the four skip sequences, (, , A), (, , B), (, , C) and (, C, A). Combining this matrix with that in Table 2 produces a matrix of full rank. The rows corresponding to a particular sequence must be included in the study design matrix as many times as subjects are assigned to the sequence.
| Period 1 | 1-to-2 transition | Period 2 | 1-to-3 transition | 2-to-3 transition | Period 3 | |||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sequence | Period | μ | τA11 | τB11 | τC11 | τA12 | τB12 | τC12 | τA22 | τB22 | τC22 | τA13 | τB13 | τC13 | τA23 | τB23 | τC23 | τA33 | τB33 | τC33 |
| 7 A | 0 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 7 A | 1 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 7 A | 2 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 7 A | 3 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
| 8 B | 0 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8 B | 1 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8 B | 2 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 8 B | 3 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| 9 C | 0 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 9 C | 1 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 9 C | 2 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 9 C | 3 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| 10 CA | 0 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 10 CA | 1 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 10 CA | 2 | l | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 10 CA | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 |
9. Application: Fitting an AS to the STAR*D Data
In this section, we use virtual placebo for the analysis of the STAR*D clinical trial database which was provided by the National Institutes of Mental Health. The author did not conduct this clinical trial or participate in it. The STAR*D study investigators have declared in their publications that the study was approved by ethical committees and that patient consents were obtained (Fava et al. 2003). The database provided to the author did not contain information identifying the subjects, and the author maintained the database in a password-protected computer system.
The STAR*D study investigated 4,041 adult subjects with non-psychotic major depressive disorder (Fava et al. 2003). We analyzed the 1,439 subjects who did not have an acceptable response to Citalopram (CIT) monotherapy but continued in the study receiving alternative treatments. Here, Y is the HAM-D17 total depression score and Y0,ω is the score obtained right after CIT monotherapy ended. The HAM-D17 scale produced values from 0 to 42 inclusive, and all integer numbers in this range were observed in the patient population. Thus, we treated the HAM-D17 score as a continuous variable, as most statisticians would do in practice. An AS model with identity link and Gaussian response was fitted. The design matrix K was built using our Stata program asdesign (see Appendix C).
Alternative treatments given right after Citalopram monotherapy failure are named here “1-stage” treatments (Table 4). If a 1-stage treatment was declared to be successful by the clinician, the subject was passed to follow-up (FU). If the 1-stage treatment failed, the subject and clinician were presented with other treatment choices; these other choices, together with FU, are named here “2-stage” treatments. Although additional treatment choices were available if a 2-stage treatment failed, we analyzed here only sequences of at most 2 additional treatments given after CIT monotherapy failure; that is, q = 2 (Table 4).
Table 4:
Sequences of at most two additional treatments administered to 1,439 subjects with major depression from the STAR*D study who were refractory to Citalopram monotherapy.
| Sequence | Stage 1a | Stage 2b | Nc |
|---|---|---|---|
| 1 | BUP | -- | 87d |
| 2 | BUP | BUP+LI | 18 |
| 3 | BUP | BUP+THY | 8 |
| 4 | BUP | FU | 69 |
| 5 | BUP | MIRT | 28 |
| 6 | BUP | NTP | 29 |
| 7 | CIT+BUP | -- | 68d |
| 8 | CIT+BUP | CIT+LI | 14 |
| 9 | CIT+BUP | CIT+THY | 20 |
| 10 | CIT+BUP | FU | 150 |
| 11 | CIT+BUP | MIRT | 10 |
| 12 | CIT+BUP | NTP | 17 |
| 13 | CIT+BUS | -- | 78d |
| 14 | CIT+BUS | CIT+LI | 10 |
| 15 | CIT+BUS | CIT+THY | 17 |
| 16 | CIT+BUS | FU | 137 |
| 17 | CIT+BUS | MIRT | 18 |
| 18 | CIT+BUS | NTP | 26 |
| 19 | CIT+CTH | -- | 18d |
| 20 | CIT+CTH | BUP | 10 |
| 21 | CIT+CTH | FU | 45 |
| 22 | CIT+CTH | VEN | 12 |
| 23 | CTH | -- | 20d |
| 24 | CTH | BUP | 5 |
| 25 | CTH | FU | 33 |
| 26 | CTH | VEN | 4 |
| 27 | SER | -- | 75d |
| 28 | SER | FU | 87 |
| 29 | SER | MIRT | 33 |
| 30 | SER | NTP | 22 |
| 31 | SER | SER+LI | 11 |
| 32 | SER | SER+THY | 10 |
| 33 | VEN | -- | 81d |
| 34 | VEN | FU | 101 |
| 35 | VEN | MIRT | 21 |
| 36 | VEN | NTP | 22 |
| 37 | VEN | VEN+LI | 10 |
| 38 | VEN | VEN+THY | 15 |
FU: follow-up; CTH: Cognitive Therapy; BUP: Bupropion; BUS: Buspirone; CIT: Citalopram; LI: Lithium; MIRT: Mirtazapine; NTP: Nortriptyline; SER: Sertraline; TCP: Tranylcypromine; THY: Thyroid; VEN: Venlafaxine.
Stage 1 treatments are the treatments from Level 2 of the STAR*D study. In Level 1 of the study, only Citalopram monotherapy was given to the subjects.
Stage 2 treatments are the treatments from Levels 2A and 3 of STAR*D study. A follow-up (FU) after Level 2 of the study was treated as a “treatment” choice for Stage 2. The first HAM-D17 score available from follow-up was usually obtained at the end of the 3rd month of follow-up and was used in the fit of the AS model.
Number of subjects who underwent the treatment sequence.
These subjects withdrew from the study after completing Stage 1 (=Level 2 of STAR*D study).
In the STAR*D data, correlated effects were possible because 2-stage decisions may have been correlated with observed or unobserved subjects’ characteristics that determined the responses to Stage-1 decisions. Thus, we assumed a model with correlated effects and used robust fìxed-effects estimators to estimate regression coefficients of first and second stage covariates and transition covariates (Kim & Frees 2006, Ebbes et al. 2004, Frees 2004) (Table 5, footnote g). Additional advantages of this approach are that the actual structure and distribution of random effects do not need to be specified, and it is not necessary to conduct subject-level variable selection because these variables are not used in robust fìxed-estimation. However, since we were also interested in estimating the effects of demographics, we additionally conducted a separate augmented regression to estimate their effects (Mundlak 1978, Frees 2001, 2004) (Table 5, footnote d). In the augmented regression, we treated second-stage covariates as endogenous and used their average to augment the regression equation. Since subjects were initially randomized to first-stage treatments, first-stage and l-to-2 transition covariates were treated as exogenous. Huber/White robust standard errors were computed (Huber et al. 1967, White 1982) (Table 5).
Table 5:
Parameter estimates of a 2-stage antichronic system model of HAM-D17 scores computed with 3,303 scores from 1,434 STAR*D subjects refractory to Citalopram monotherapy.a
| Covariates | GLS estimators (random intercept)b | Robust estimatorsc | ||
|---|---|---|---|---|
| Regression coefficient | 95% Confidence Interval | Regression Coefficient | 95% Confidence interval | |
| Age (y) | 0.060*** | [0.033, 0.087] | 0.055*** | [0.032, 0.078]d |
| Black or African Americane | 2.52*** | [1.61, 3.44] | 2.01*** | [1.16, 2.85]d |
| Cocaine abuser or dependentf | −3.70* | [−7.10, −0.31] | −3.87*** | [−6.15, −1.60]d |
| First stage covariates | ||||
| BUP | −5.09*** | [−6.17, −4.01] | −5.26*** | [−6.45, −4.06]g |
| CIT+BUP | −5.87*** | [−6.75, −4.98] | −4.90*** | [−5.87, −3.93]g |
| CIT+BUS | −4.87*** | [−5.72, −4.02] | −4.16*** | [−5.15, −3.16]g |
| CIT+CTH | −4.67*** | [−6.16, −3.19] | −4.65*** | [−6.50, −2.80]g |
| CTH | −5.58*** | [−7.47, −3.68] | −4.57*** | [−7.03, −2.10]g |
| SER | −4.74*** | [−5.70, −3.77] | −5.56*** | [−6.70, −4.43]g |
| VEN | −5.89*** | [−6.93, −4.85] | −6.01*** | [−7.23, −4.80]g |
| Transition covariatesh | ||||
| BUP | 0.73 | [−2.76, 4.22] | 0.28 | [−3.65, 4.21]g |
| CIT+BUP | 0.58 | [−2.80, 3.95] | 0.66 | [−3.17, 4.48]g |
| CIT+BUS | 0.48 | [−2.90, 3.86] | 0.60 | [−3.22, 4.41]g |
| CIT+CTH | −0.53 | [−3.50, 2.44] | −1.29 | [−4.71, 2.14]g |
| CTH | −0.0021 | [−3.52, 3.51] | −0.35 | [−4.62, 3.91]g |
| SER | −0.86 | [−4.30, 2.59] | −2.05 | [−5.93, 1.83]g |
| VEN | −0.48 | [−3.94, 2.97] | −1.66 | [−5.58, 2.25]g |
| Second stage covariates | ||||
| BUP | −2.53* | [−4.60, −0.46] | −2.85** | [−4.91, −0.79]g |
| BUP+LI | −4.35 | [−8.90, 0.21] | −5.48* | [−9.96, −0.99]g |
| BUP+THY | −10.6*** | [−16.0, −5.17] | −11.2*** | [−17.0, −5.32]g |
| CIT+LI | −3.96 | [−8.29, 0.36] | −5.09* | [−9.29, −0.88]g |
| CIT+THY | −5.81** | [−9.79, −1.83] | −5.58* | [−10.2, −1.01]g |
| FU | −6.69*** | [−9.92, −3.45] | −5.03** | [−8.70, −1.36]g |
| MIRT | −5.17** | [−8.72, −1.62] | −5.95** | [−9.90, −2.01]g |
| NTP | −5.25** | [−8.78, −1.72] | −6.02** | [−10.1, −1.95]g |
| SER+LI | −4.91 | [−10.3, 0.46] | −6.29* | [−11.3, −1.32]g |
| SER+THY | −3.26 | [−8.47, 1.95] | −3.49 | [−8.24, 1.26]g |
| VEN | −4.17*** | [−6.36, −1.97] | −4.68** | [−7.66, −1.70]g |
| VEN+LI | −8.25** | [−13.6, −2.87] | −7.78** | [−13.2, −2.34]g |
| VEN+THY | −5.71* | [−10.5, −0.96] | −5.92* | [−11.1, −0.76]g |
| Constant (μα) | 14.9*** | [13.7, 16.1] | 17.4*** | [17.2, 17.6]g |
p < 0.05,
p < 0.01,
p < 0.001.
Note: To estimate model parameters, skip sequences with BUP and VEN at the second stage were implemented.
FU: follow-up; CTH: Cognitive Therapy; BUP: Bupropion; BUS: Buspirone; CIT: Citalopram; Li: Lithium; MIRT: Mirtazapine; NTP: Nortriptyline; SER: Sertraline; TCP: Tranylcypromine; THY: Thyroid; VEN: Venlafaxine.
The data were obtained from version 4.1 of the limited access datasets distributed from the NIH-supported “Sequenced Treatment Alternatives to Relieve Depression” (STAR*D). Before fitting the models, a few inconsistencies in the STAR*D data were found and corrected by the author. Inconsistencies and corrections were documented in a text file that is available from the author on request.
For the random intercept model estimated with GLS, estimated standard deviations of intercept and error were (95% confidence interval, [5.26, 5.88]) and .
To estimate regression coefficients of demographic variables, augmented regression was used. To estimate regression coefficients of first and second stage covariates and transition covariates, robust fixed-effects estimators were used.
Augmented regression (Mundlak 1978; Frees 2001, 2004) and Huber/White sandwich standard errors (Huber 1967, White 1982) were used. The estimators are robust to correlated effects and covariance matrix misspecification.
The dichotomous covariate was defined as 1 if the subject was self-declared black or African American, 0 otherwise.
The dichotomous covariate was defined as 1 if the subject declared to be cocaine abuser or dependent, 0 otherwise.
Robust fixed effects estimators (Hausman 1978, Kim and Frees 2006) and Huber/White sandwich standard errors (Huber 1967, White 1982) were computed. The estimators are robust to correlated effects and covariance matrix misspecification.
1-to-2 transition covariates. These covariates were jointly significant (robust test of H0: β1,2 = 0, p = 0.0079), suggesting that treatments at Stage 1 affected significantly the responses measured at the end of Stage 2. However, no individual transition covariate reached significance.
Stata’s xtreg and mixed commands were used (StataCorp LP, College Station, TX, USA). For comparison purposes, we also present GLS estimates for a random intercept model with i.i.d errors (Table 5). These estimates would be more appropriate if there were not correlated effects, but this cannot be guaranteed in the STAR*D data. Residual analyses showed both that the model fitted well and that the assumption of normality for the untransformed scores was reasonable.
Data and skip sequences used are described in the rest of this section. Five of the 1,439 subjects did not provide any of the three scores Y0,ω, Y1,ω and Y2,ω. Each of the other 1,434 subjects provided at least one of them. Thus, the 2-stage AS was fitted with these 1,434 subjects (Table 5). An assumption of missingness at random was made to include subjects who did not have all three values of Y. In total, the subjects provided 3,303 HAM-D17 scores for this analysis.
After CIT monotherapy failed, subjects had seven alternative treatment choices for Stage 1; thus, l1 = 7 (Table 4). For Stage 2, subjects had 13 alternative choices, including follow-up (FU) (l2 = 13). Note that the STAR*D protocol did not use the term “Stage” (footnotes a and b of Table 4).
Skip sequences were built as follows. Both Bupropion (BUP) and Venlafaxine (VEN) were simultaneously 1-stage and 2-stage treatments (Table 4). Also, 87 subjects who were on BUP at Stage 1 withdrew from the study after completing this stage (Sequence 1 in Table 4). Applying Theorem 4, these 87 subjects were assigned the skip sequence, . Similarly, the skip sequence, was assigned to each of 81 subjects who were on VEN at Stage 1 but withdrew after completing this stage (Sequence 33 in Table 4).
In summary, we built an AS that included the 38 sequences in Table 4, but Sequences 1 and 33 were redefined to corresponding interchangeable SSs. By following this approach, we obtained a study design matrix K of full rank. Parameter estimates are shown in Table 5. Note that, here, can be interpreted as “CIT monotherapy”. Also, the regression coefficient of follow-up (FU) must be interpreted with caution, because two different subjects on FU may have been under different therapies.
Note that subjects in Sequence 7 of Table 4 took CIT+BUP at Stage 1 but withdrew from the study after completing this stage. These subjects cannot be used to build SSs because CIT+BUP was not a 2-stage treatment. Thus, Sequence 7 was treated as . Similar comments can be made for subjects in Sequences 13, 19, 23 and 27. In contrast, Sequence 35 in Table 4, say, was treated as .
10. Results for STAR*D Data
Hausman test comparing robust fìxed-effects estimators with GLS estimators confirmed that there were correlated effects (χ2 = 346.9, df = 27, p < 0.0001) (Frees 2004). Therefore, robust fixed-effects estimators and augmented regression were necessary.
At α = 0.05, each 1-Stage treatment reduced significantly the average HAM-D17 score during Stage 1 in subjects who received the treatment (Table 5). Using the robust fixed-effects estimates, we found that Stage 1 significantly affect Y2,ω [overall test with robust estimates, F(7,1433) = 2.74, p = 0.0079]. That is, at least one transition covariate was significant according to the overall test. However, none of the individual regression coefficients of transition covariates reached significance. Thus, the evidence is weak that treatments used at Stage 1 had a potential influence on HAM-D17 scores measured at the end of Stage 2. In particular, there did not appear to be appreciable delayed effects. Not even cognitive therapy (CTH), which was a 1-stage treatment, gave these difficult-to-treat subjects a significant added advantage at fighting their illness at the next stage—the regression coefficient for the transition covariate corresponding to CTH was −0.35, 95% CI, (−4.62, 3.91) (Table 5). The other 1-stage treatments included drugs and it is possible that these drugs’ blood levels dropped during Stage 2, which may explain in part why Stage 1 treatments did not affect substantially the 2nd stage response.
Only one 2-stage treatment did not lower significantly the HAM-D17 average score in comparison with the score measured at the end of CIT monotherapy, namely SER+THY (Table 5). Also, according to the robust augmented regression, self-declared blacks or African Americans had significantly higher HAM-D17 average scores after controlling for treatment; cocaine abuser or dependent subjects had significantly lower average scores; and subjects of older age had significantly higher average scores.
The regression coefficients of 1-stage treatments were not significantly different from each other (using robust estimation, F(6,1433) = 1.13, p = 0.3398; Table 5). Therefore, there was no evidence that 1-stage treatments had differential average effects on HAM-D17. Similarly, 2-stage treatments did not have significantly different average effects on HAM-D17 (F(12,1433) = 1.18, p = 0.2927).
We also tested the appropriateness of the Interchangeability Assumption. This assumption implies that the regression coefficients of BUP for the first and second stage covariates are equal and that those of VEN are equal as well. Employing the robust standard errors, a simultaneous Wald test of these two null hypotheses was not significant (F(2,1433) = 2.06, p = 0.1275). This suggests the assumption was appropriate.
With a few additions, our model allows investigating interactions between treatments administered at different stages. Transition covariates are useful in this regard. As an illustration, we examined the interaction between BUP administered at the first stage and MIRT administered at the second stage by computing the product of the transition covariate corresponding to BUP and the second-stage covariate corresponding to MIRT. Using robust estimation, this interaction was not significant (p = 0.70). It is also possible to compute interactions between baseline covariates and stage or transition covariates.
In general, however, some interaction terms may induce a non-full-rank design matrix. Thus, future research must investigate how the presence of interaction terms affects the rank of design matrices, and must give conditions for the identifiability of their regression coefficients. Fortunately, however, the omission of significant interaction terms should not bias robust fixed-effects estimators because they are robust to omitted variables (Kim & Frees 2006).
11. Discussion
The main contribution of this article is the introduction of skip sequences in the design and analysis of clinical trials of treatment sequences. These sequences guarantee the existence of design matrices of full rank and, therefore, guarantee the identifiability (estimability) of carry-over effects. Importantly, we did not need unjustified linear constraints in order to examine carry-over effects. Skip sequences also facilitate the implementation of many estimation methods for GLMMs.
Theorem 2 indicates the importance of SSs in the experimental design and statistical analysis of CTs of treatment sequences, and theorem 3 gives a procedure for selecting appropriate SSs at the planning stage of a particular study. This procedure is very general, being applicable to any number of study stages and treatments, and to continuous or discrete responses, including dichotomous responses.
Real placebo can be used to implement SSs in practice. Alternatively, a virtual placebo approach can be applied to the analysis of data from some CTs such as the STAR*D study which did not specifically implement SSs in their protocol. As illustrated with the STAR*D data, useful insights can also be obtained in such CTs. This approach, however, cannot always be applicable if SSs are not implemented in the study protocol. Theorem 4 supports the use of virtual placebo. A virtual placebo approach can also be applied when real placebo is unethical or unfeasible. In such cases, appropriate SSs must be selected at the design stage of the CT.
ASs are applicable if we can assume the subjects’ health will not improve or deteriorate rapidly should they remain under baseline conditions. In other words, Y0,ω must measure a stable illness state. Having a stable illness is an inclusion criterion of many clinical trials. Our approach is applicable to these trials. This assumption, however, may not be applicable to some conditions such as advanced cancer or mental illness in acute state.
Our model does not include stage-dependent variables other than the investigated MBTs. The reason is that CTs are highly controlled experiments that attempt maintaining other variables affecting the response constant during the study. Therefore, in a well-conducted CT, the response Y should be a function only of both MBTs administered during the CT and baseline covariates. For instance, it is well known that smoking affects the metabolism of some neurotropic drugs (e.g. Botts, Diaz, Santoro, Spina, Muscatello, Cogollo, Castro & de Leon 2008). In a CT investigating these drugs, a subject quitting smoking (or starting smoking) during the CT is usually considered a protocol violation. Therefore, data from this subject is not included in data analyses. Thus, although it would be easy to incorporate in our model stage-dependent covariates other than those proposed in this article, doing so would be just a mere academic exercise that is inconsistent with the principles of CTs.
Note also that variables changing as a result of the investigated MBTs should not be treated as independent variables in ASs, because ASs account for the effects of treatments applied in one stage on subsequent stages. Thus, for instance, if a 1-stage treatment changed the value of a mediating variable that ultimately affected Y2,ω, the corresponding l-to-2 transition covariate will be significant. This means that we do not need to include the mediating variable as independent variable.
Our approach based on SSs poses an interesting question: when Theorem 3 guarantees that two or more alternative sets of SSs will produce a design matrix of full rank, which set will produce the lowest standard errors for the estimates of the regression coefficients? This open problem needs to be investigated by future research.
If the value of Y is missing for some stages, the corresponding rows of V can be eliminated accordingly. In such case, the assumption of missingness at random must be made, as we did to fit an AS to the STAR*D data.
Acknowledgements
The STAR*D study was supported by NIMH Contract #N01MH90003, and has the ClinicalTrials.gov Identifier, NCT00021528. The author was supported by an Institutional Clinical and Translational Science Award, NIH/NCATS Grant Number UL1TR000001 (awarded to the University of Kansas Medical Center). The author was not involved in the planning or conduction of the STAR*D study and has never received any payment or salary from its associated NIMH contract or its investigators. This manuscript may not reflect the opinions or views of the STAR*D study investigators or the NIH. The author thanks Mr. Douglas Eikermann and Guanlin Zhang for editorial assistance and making useful comments. Mr. Jiawei Duan also made useful comments. The author also thanks the two anonymous reviewers, whose comments motivated interesting additions to the manuscript.
Appendix A. Proofs of Theorems
Some proofs utilize the following convenient matrix notation. If ai, i = 1,…, n, is a sequence of scalars or matrices, the matrix [a1,…, an] is denoted with , and the matrix is denoted with . The symbols ∃i and ∃i are read “stack right” and “stack down”, respectively. For simplicity, for a constant number or matrix a, we denote and , in which a is repeated n times.
Proof of Theorem 1. Observe that, in general, if aij and bij are column vectors of the same dimension, then
Then, by Fubini’s theorem,
| (4) |
Fix r and k, where r = 1,…, q, and k ∈ {0,1,…, r} (real placebo) or (virtual placebo). Denote , where I{h=k} = 1 if h = k, or I{h=k} = 0 if h ≠ k. Observe that τik,ω = 0 if i > k and if i ≤ k. Thus,
| (5) |
Now define the row vectors , k = 0,…, r, and observe that the first row of V is , where is a row vector of zeros of dimension mq − mr. And, for j ≥ 2, the j-th row of V is with . Also note that Therefore, by equations (1), (4) and (5),
Definition A.1 To prove Theorem 2, we need to define a new binary matrix operation. For a matrix A with rows Ai, i = 1, …, n, and a matrix B with rows Bj, j = 1, …, k, define the new “product” of A and B as , where [Ai, Bj] is a row vector that contains from left to right the elements of the i-th row of A first, and then those of the j-th row of B. Observe that A * B has n · k rows.
Lemma A.1. (Used in proof of Theorem 2.) Let A be an n × p matrix. Let Ik be the k × k identity matrix. Let Jn be an n × 1 vector with ones in all its entries. If there exists a p × 1 vector c such that Ac = Jn, then ρ (A * Ik) = ρ(A) + k −1 for all k ≥ 1, where * is the product defined in Definition A.1.
Proof of Lemma A.1. Let Ai and Bi denote the i-th rows of A and Ik, respectively. We will reduce A * Ik into echelon form by using both elementary row and column transformations. Let 0k = (0, …, 0) denote a row with k zeros. (Here, the symbol “⇔” is read “has the same rank as”, and numbers above this symbol refer to explanations below.) Then,
![]() |
![]()
|
![]() |
where Δ is a matrix with dimension {k − 1 + n} × {k + p}. Above, is obtained by switching columns; is obtained by subtracting the first row of the block from the other block’s rows; by moving the first row of each block to the upper part of the matrix; by subtracting the second block from the blocks below it; by deleting rows with zeroes; by adding first row to rows in second block; and by putting first row at top of second block.
Now observe that the last p column of Δ constitute the matrix
By hypothesis, . Also, the first column of Δ is Jk−1+n. By replacing the first column of Δ with the column Jk−1+n − Hc, which can be viewed as a sequence of elementary column transformations on matrix Δ, we obtain that A * Ik has the same rank as
The rank is ρ(A) + k – 1.
Proof of Theorem 2. If subject ω underwent sequence , denote , ω = 1, …, N. Denote Clearly, for fixed i, all subjects who experienced sequence have the same matrix . Note that because, for any ω = 1, …, N, there exists i = 1, …, d such that and, for any i, there exists ω such that , which imply that any row in is equal to some row in D, and any row in D is equal to some row in .
Suppose first that
| (6) |
that is, that all possible decision combinations occur but does not occur at any stage. Then, . Denote , j = 1, …, q; and , where * is the product defined in Definition A.1. Then, the rows of D can be rearranged as
where the ’s are column vectors with only ones, not all of the same dimension. By Lemma A.1, ρ (G0) = 1, ρ (G1) = l1 and , j = 2, … q. Hence, if satisfies equation (6),
(Recall that q ≥ 2.)
Now note that if is not skip for any , and if there exists such that for some r < q, that is, if at least one subject did not receive a q-stage treatment, then the number of rows of D is ≤ d(q + 1); or if for all but the number of sequences in is , then the number of rows of D is also ≤ d(q + 1). In both situations, .
But, K is of full rank if and only if ρ(K) = 1+ p + mq. This implies that K will not be of full rank if but is not skip for any because, in such case, ρ(K) ≤ 1 + p + mq − q(q − 1)/2. In particular, if and satisfies equation (6), then ρ(K) = 1 + p + mq − q(q − 1)/2, which is the maximum possible rank of K when no subject experiences a skip sequence.
Proof of Theorem 3. By reorganizing the rows of D1 and subtracting rows from their duplicates, and then eliminating rows with only zeros, we obtain
![]() |
where is the li × li identity matrix, the ’s represent matrices of zeros, and, as before, the “⇔” is read “has the same rank as”. [We obtained in place of in the above matrix because of the assumption that every 1-stage treatment is implemented in the first stage of at least one of the sequences .]
By applying elementary row operations to , we can obtain an echelon matrix E2 which has the form
where C2 is also a row echelon matrix. Noting that the number of columns of C2 is l2, let τ2 = l2 − ρ (C2), and let be the indices of the columns of C2 that do not have a pivot. Recall that the rank of a matrix is equal to the number of columns minus the number of columns without pivots. Clearly, .
For i = 2, …, q and j = 1, …, li, consider the family of skip sequences,
A particular sequence in will be denoted . If , we denote for 1 ≤ t ≤ q.
Now take τ2 skip sequences from , respectively, and consider the matrix
By using elementary row operations,
because has l’s only in its columns , which are matched with the columns of C2 that do not have a pivot after converting the upper part of M2 into E2. Then, ρ (M2) = l1+l2. Now observe that, by starting with the same elementary row operations used to obtain E2, we can obtain
where C3 is row echelon. Denoting τ3 = l3 − ρ (C3), let be the columns of C3 that do not have a pivot. Now take τ3 sequences from , respectively. By using elementary row operations, we obtain
![]() |
![]() |
where the second equivalence was obtained by using row operations analogous to those used to obtain , and the third equivalence is because the 1’s of are aligned vertically with the columns of C3 that do not have a pivot. Then, ρ (M3) = l1 + l2 + l3.
Thus, working iteratively in this way, we can construct q − 1 sequences of numbers, namely , and matrices of the form
such that ρ (Mk) = l1 + ⋯ + lk. By construction, since , a conversion of Γ into a row echelon matrix shows that , where .
Now observe that, for all i = 2, …, q and j = 1, …, li,
![]() |
where is a row vector of zeros of dimension k, and the dimension of is (q − i + 2) × (mq + 1).
Then, reordering rows,
![]() |
We finally obtain
and conclude that , which is equal to the number of columns of D. Thus, with the selected skip sequences, D is of full rank.
Proof of Theorem 4. By Theorem 1, , and similarly for T. By the Interchangeability Assumption, and have the same distribution. Hence, . Then,
Appendix B. Examples of Antichronic Systems with a Full-Rank Design Matrix
Example B.1. Consider the simplest ASs, which occur when q = 2 and l1 = l2 = 2. In this case, includes only four non-skip sequences, namely: , , , and , for which
Suppose that is an antichronic system. By Theorem 2, an estimator of E[γ] cannot be computed if only the sequences in are implemented in a CT, even if each sequence in is administered to at least one subject, because, in this case, mq = 6, q(q −1)/2 = 1 and ρ(K) = p+6 < p+7 (see proof of Theorem 2); that is, the study design matrix will never be of full rank. Only two SSs are possible, namely and , for which, respectively,
Thus, for q = l1 = l2 = 2, the design matrix will be of full rank for some systems including at least one SS, provided each sequence of the system is administered to at least one subject. Examples of such systems are
To show that the design matrix is of full rank for , say, just observe that the rank of is 7. Mathematical software such as Mathematica (Wolfram Research, Inc.), or the calculator TI-Nspire CX CAS (Texas Instruments Inc.) are helpful in these rank computations.
Example B.2. Now suppose that q = 3 and l1 = l2 = l3 = 2. Here, includes 8 possible non-skip sequences (see Figure 2). For instance, in the Figure, . Consider the skip sequences and . To illustrate,
If , the design matrix will be of full rank for the systems
provided each sequence in the system is administered to at least one subject. Other systems for which the design matrix is of full rank are possible when both q = 3 and l1 = l2 = l3 = 2. For instance, in the definition of in , A2,1 could be used in place of A2,2.
Figure 2:
Graphical representation of all possible non-skip sequences with q = 3 and l1 = l2 = l3 = 2.
Appendix C. Description of Supplementary Material (Computer Code)
The author wrote a Stata command (asdesign) which creates the design matrix using three input variables: the subject identifier, the stage and the treatment variables. A help file for asdesign is also included, which additionally shows code for fitting antichronic systems.
The included files are:
asdesign.ado: Stata program that runs as a regular command.
asdesign.sthip: Help file for asdesign.
Instructions for asdesign.docx: Word file with instructions to install asdesign.
The above files are available from the author on request.
References
- Abeyasekera S & Curnow RN (1984), ‘The desirability of adjusting for residual effects in a crossover design’, Biometrics 40, 1071–1078. [Google Scholar]
- Berg M, Welty TE, Gidal BE, Diaz FJ, Krebill R, Szaflarski JP, Dworetzky BA, Pollard JR, Elder EJ Jr, Jiang W, Jiang X, Switzer RD & Privitera MD (2017), ‘Bioequivalence between generic and branded lamotrigine in people with epilepsy: the EQUIGEN randomized clinical trial’, JAMA Neurology 74, 919–926. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Botts S, Diaz F, Santoro V, Spina E, Muscatello MR, Cogollo M, Castro FE & de Leon J (2008), ‘Estimating the effects of co-medications on plasma olanzapine concentrations by using a mixed model’, Progress in Neuro-Psychopharmacology & Biological Psychiatry 32, 1453–1458. [DOI] [PubMed] [Google Scholar]
- Breslow NE & Clayton DG (1993), ‘Approximate inference in generalized linear mixed models’, Journal of the American Statistical Association pp. 925. [Google Scholar]
- Bronson R (1989), Matrix Operations, Schaum’s Outline Series, 1 edn, McGraw-Hill, New York. [Google Scholar]
- Center for Drug Evaluation and Research (2001), Us food and drug administration, guidance for industry: Statistical approaches to establishing bioequivalence, https://www.fda.gov/downloads/drugs/guidances/ucm070244.pdf, Accessed February 22, 2018.
- Center for Drug Evaluation and Research (2003), Us food and drug administration. guidance for industry: Bioavailability and bioequivalence studies for orally administered drug products - general considerations, https://www.fda.gov/ohrms/dockets/ac/03/briefing/3995B1_07_GFI-BioAvail-BioEquiv.pdf, Accessed February 22, 2018.
- Christensen R (2011), Plane Answers to Complex Questions: The Theory of Linear Models, Springer, New York. [Google Scholar]
- Diaz FJ (2016), ‘Measuring the Individual Benefit of a Medical or Behavioral Treatment Using Generalized Linear Mixed-Effects Models’, Statistics in Medicine 35, 4077–4092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Diaz FJ (2017), ‘Estimating individual benefits of medical or behavioral treatments in severely ill patients’, Statistical Methods in Medical Research (DOI: 10.1177/0962280217739033). [DOI] [PubMed] [Google Scholar]
- Diaz FJ, Berg MJ, Krebill R, Welty T, Gidal BE, Alloway R & Privitera M (2013), ‘Random-effects linear modeling and sample size tables for two special cross-over designs of average bioequivalence studies: the 4-period, 2-sequence, 2-formulation and 6-period, 3-sequence, 3-formulation designs’, Clinical Pharmacokinetics 52, 1033–1043. [DOI] [PubMed] [Google Scholar]
- Ebbes P, Bockenholt U & Wedel M (2004), ‘Regressor and random-effects dependencies in multilevel models’, Statistica Neerlandica 58, 161–178. [Google Scholar]
- Fava M, Rush AJ, Trivedi M II.. Nierenberg AA, Thase ME, Sackeim HA, Quitkin FM, Wisniewski S, Lavori PW, Rosenbaum JF & Kupfer DJ (2003), ‘Background and rationale for the sequenced treatment alternatives to relieve depression (STAR*D) study’, Psychiatric clinics of North America 26, 457–494. [DOI] [PubMed] [Google Scholar]
- Fitzmaurice G & Molenberghs G (2004), Advances in longitudinal data analysis, in Fitzmaurice G, Molenberghs G, Davidian M & Verbeke G, eds, ‘Longitudinal data analysis’, Chapman and Hall/CRC, London. [Google Scholar]
- Fleiss JL (1989), ‘A critique of recent research on the two-treatment Crossover design’, Controlled Clinical Trials 10, 237–43. [DOI] [PubMed] [Google Scholar]
- Frees EW (2001), Omitted variables in longitudinal data models’, The Canadian Journal of Statistics 29, 573–595. [Google Scholar]
- Frees EW (2004), Longitudinal and Panel Data, Cambridge, Cambridge University Press. [Google Scholar]
- Grajales LF & Lopez LA (2006), ‘Data imputation in switchback designs using a mixed model with correlated errors’, Revista Colombiana de Estadística 29, 221–238. [Google Scholar]
- Hausman JA (1978), ‘Specification tests in econometrics’, Econometrica 46,1251–1271. [Google Scholar]
- Henderson CR (1953), ‘Estimation of variance and covariance components’, Biometrics 9, 226–252. [Google Scholar]
- Hofmann M, Wrobel N, Kessner S & Bingel U (2014), ‘Minimizing carryover effects after treatment failure and maximizing therapeutic outcome. Can changing the route of administration mitigate the influence of treatment history?’, Zeitschrift fur Psychologic 222, 171–178. [Google Scholar]
- Hooks T, Marx D, Kachman S & Pedersen J (2009), Optimality criteria for models with random effects’, Revista Colombiana de Estadística 32, 17–31. [Google Scholar]
- Huber PJ et al. (1967), The behavior of maximum likelihood estimates under nonstandard conditions, in ‘Proceedings of the fifth Berkeley symposium on mathematical statistics and probability’, Vol. 1, University of California Press, pp. 221–233. [Google Scholar]
- Jones B & Kenward MG (2015), Design and Analysis of Cross-over Trials, CRC Press, Boca Raton. [Google Scholar]
- Kim J-S & Frees EW (2006), Omitted variables in multilevel models’, Psy-chometrika 71, 659–690. [Google Scholar]
- Kim J-S & Frees EW (2007), ‘Multilevel modeling with correlated effects’, Psychometrika 72, 505–533. [Google Scholar]
- Laird NM & Ware JH (1982), ‘Random effects models for longitudinal data’, Biometrics 38, 963–974. [PubMed] [Google Scholar]
- Long DL, Preisser JS, Herring AH & Golin CE (2015), ‘A marginalized zero-inflated Poisson regression model with random effects’, Journal of the Royal Statistical Society: Series C 64, 815–830. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mundlak Y (1978), On the pooling of time series and cross-section data’, Econo-metrica 46, 69–85. [Google Scholar]
- Nelder JA & Wedderburn RWM (1972), ‘Generalized linear models’, Journal of the Royal Statistical Society, Series A 135, 370–384. [Google Scholar]
- Privitera MD, Welty TE, Gidal BE, Diaz FJ, Krebill R, Szaflarski JP, Dworetzky BA, Pollard JR, Elder EJ, Jiang W, Jiang X & Berg M (2016), ‘Generic-to-Generic Lamotrigine Switches in People with Epilepsy: A Randomized Controlled Trial’, Lancet Neurology 15, 365–372. [DOI] [PubMed] [Google Scholar]
- Rabe-Hesketh S & Skrondal A (2009), Generalized linear mixed-effects models, in ‘Longitudinal Data Analysis’, Chapman and Hall/CRC, London, pp. 93120. [Google Scholar]
- Rabe-Hesketh S, Skrondal A & Pickles A (2005), ‘Maximum likelihood estimation of limited and discrete dependent variable models with nested random effects’, Journal of Econometrics 128, 301–323. [Google Scholar]
- Searle SR (1966), Matrix Algebra for the Biological Sciences, Wiley, New York. [Google Scholar]
- Senn S (2002), Cross-over Trials in Clinical Research, 2 edn, Wiley, Hoboken. [Google Scholar]
- Senn S, D’Angelo G & Potvin D (2004), ‘Carry-over in cross-over trials in bioequivalence: Theoretical concerns and empirical evidence’, Pharmaceutical Statistics 3, 133–142. [Google Scholar]
- Senn S & Lambrou D (1998), ‘Robust and realistic approaches to carry-over’, Statistics in Medicine 17, 2849–2864. [DOI] [PubMed] [Google Scholar]
- Shao J (2003), Mathematical Statistics, 2 edn, Springer, New York. [Google Scholar]
- Vermunt JK (2005), ‘Mixed-effects logistic regression models for indirectly observed discrete outcome variables’, Multivariate Behavioral Research 40, 281301. [DOI] [PubMed] [Google Scholar]
- White H (1982), ‘Maximum likelihood estimation of misspecifìed models’, Econo-metrica 50, 1–25. [Google Scholar]











