A note on point estimation and interval estimation of the relative treatment effect under a simple crossover design

Chii-Dean Lin; Kung-Jong Lui

doi:10.1002/pst.2176

Author manuscript; available in PMC: 2023 Mar 1.

Published in final edited form as: Pharm Stat. 2021 Nov 9;21(2):386–394. doi: 10.1002/pst.2176

PMCID: PMC9054161 NIHMSID: NIHMS1793756 PMID: 34755464

Abstract

To increase power or reduce the number of patients needed relative to a parallel groups design, the crossover design has often been used to study treatments for noncurable chronic diseases. However, in the presence of carry-over effects caused by treatments, the commonly used estimator, which ignores carry-over effects, is biased for the treatment effect difference. A two-stage test approach proposed to address carry-over effects was found to be potentially misleading. In this paper, we propose a weighted average of the commonly used estimator and an unbiased estimator that uses only the first period of the data. We derive an optimal weight that minimizes the mean squared error (MSE), and we also propose a modified version of the resulting estimator. We use Monte Carlo simulation to evaluate the performance of the proposed estimators in a variety of situations. In the simulations, we examine the estimated MSE (EMSE), the percentile interval length, and the coverage probability calculated from the percentile intervals for the estimators considered. Simulation results show that our proposed weighted average estimator and its modified estimator lead to smaller EMSEs on average compared with the two commonly used estimators. The coverage probabilities using our proposed estimators are reasonably close to the nominal confidence level, and the interval lengths are shorter than those of the unbiased estimator that uses only the first period of the data. We use an example that evaluated the efficacy of two types of bronchodilators for asthma treatment to demonstrate the use of the proposed estimators.

Keywords: carry-over effect, mean squared error, parallel groups design, percentile interval, relative treatment effect, simple crossover design

1 |. INTRODUCTION

To increase power or reduce the number of patients needed relative to a parallel groups design,^1,2,3 the crossover design has often been used to study treatments for noncurable chronic diseases, because each patient serves as his/her own control. Following Senn³ and Fleiss,^2,4 we should not employ the crossover design if we cannot employ an adequate washout period to ensure that patients are weaned off the residual effects of earlier treatments. However, we can never be certain that the washout period has worked. Grizzle⁵ proposed a two-stage test procedure as follows. We first test whether a carry-over effect exists. Because this test is subject to the response variation between patients, its power is generally low, and a high nominal Type I error level (10% or 15%) is therefore usually chosen. If the test for the carry-over effect is nonsignificant, we carry out the analysis based on the within-patient differences between responses, as done for a crossover trial assuming no carry-over effects. Otherwise, the test uses the data from the first period only, as for the parallel groups design, and all data obtained in the second period are excluded from the analysis. Freeman⁶ carried out a thorough investigation of the two-stage test approach and concluded that it could be potentially misleading. This is because the test for detecting the carry-over effect is highly correlated with the test of equality based on the first-period data. Thus, when the test for the carry-over effect is significant, the test based on the first-period data is likely to be significant as well. As a result, the actual Type I error rate of the two-stage test is higher than the nominal α-level. Because of this concern, the two-stage test procedure is not recommended in practice.^3,7,8,9 Other notes on the use of the two-stage test in a crossover trial can be found elsewhere.^10,11

2 |. NOTATION AND METHODS

Consider comparing an experimental treatment B with a standard treatment A (or a placebo) under an AB/BA crossover design. Suppose that we randomly assign n₁ patients to group g = 1 with the treatment-receipt sequence A-then-B, in which patients receive treatment A at period 1 and then crossover to receive treatment B at period 2, and n₂ patients to group g = 2 with the treatment-receipt sequence B-then-A, in which patients receive treatment B at period 1 and then crossover to receive treatment A at period 2. For patient i (i=1, 2, ⋯, n_g) assigned to group g (g = 1, 2), let $Y_{i z}^{(g)}$ denote the patient response at period z (= 1, 2). We assume that $Y_{i z}^{(g)}$ can be expressed by the following random effects linear additive risk model:

Y_{i z}^{(g)} = μ_{i}^{(g)} + η_{B A} X_{i z}^{(g)} + γ Z_{i z}^{(g)} + λ_{A} 1_{{g = 1}} Z_{i z}^{(g)} + λ_{B} (1 - 1_{{g = 1}}) Z_{i z}^{(g)} + ε_{i z}^{(g)},

(1)

where $μ_{i}^{(g)}$ denotes the random effect due to the ith patient in group g, and all $μ_{i}^{(g)}$'s are assumed to be independent and identically distributed with an unspecified probability density function f_g(μ) with variance $σ_{u}^{2}$; $X_{i z}^{(g)}$ denotes the treatment-received covariate for treatment B, with $X_{i z}^{(g)} = 1$ if the ith patient in group g at period z receives treatment B, and = 0 otherwise; $Z_{i z}^{(g)}$ represents the period covariate, with $Z_{i z}^{(g)} = 1$ for period z = 2, and = 0 otherwise; 1_{g=1} is the indicator function of group 1, which equals 1 for group g = 1 and 0 otherwise; and the random errors $ε_{i z}^{(g)}$'s are assumed to be independent and identically distributed with a continuous distribution with mean 0 and variance $σ_{e}^{2}$, and are also assumed to be independent of the $μ_{i}^{(g)}$'s. Parameters η_BA and γ in model (1) denote the difference in effects between treatments B and A, and that between periods 2 and 1, respectively. Furthermore, λ_A and λ_B in model (1) represent the carry-over effects due to treatments A and B. Under model (1), the covariance between $Y_{i 1}^{(g)}$ and $Y_{i 2}^{(g)}$ is $cov (Y_{i 1}^{(g)}, Y_{i 2}^{(g)}) = Var (μ_{i}^{(g)}) = σ_{u}^{2} > 0$, and thereby $Y_{i 1}^{(g)}$ and $Y_{i 2}^{(g)}$ are positively correlated with the intraclass correlation $ρ = σ_{u}^{2} / (σ_{u}^{2} + σ_{e}^{2})$. Thus, the larger the variation $σ_{u}^{2}$ between patients, the higher the intraclass correlation between responses within patients.
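To make model (1) concrete, the following is a minimal simulation sketch. It assumes normally distributed patient effects and errors (the model itself leaves the patient-effect density unspecified), and the function name `simulate_ab_ba` and its arguments are illustrative choices, not part of the original analysis.

```python
import random


def simulate_ab_ba(n1, n2, eta_ba, gamma, lam_a, lam_b,
                   sigma_u, sigma_e, mu=0.0, seed=None):
    """Draw (Y_i1, Y_i2) pairs for both sequence groups under model (1).

    Assumption: patient effects and errors are normal; the source only
    requires i.i.d. patient effects and continuous mean-zero errors.
    """
    rng = random.Random(seed)

    def one_group(n, first_is_b, carry):
        rows = []
        for _ in range(n):
            mu_i = rng.gauss(mu, sigma_u)   # random patient effect
            x1 = 1 if first_is_b else 0     # X = 1 when treatment B is received
            x2 = 1 - x1                     # treatments are swapped in period 2
            y1 = mu_i + eta_ba * x1 + rng.gauss(0.0, sigma_e)
            # Period 2 adds the period effect and the carry-over of the first treatment.
            y2 = mu_i + eta_ba * x2 + gamma + carry + rng.gauss(0.0, sigma_e)
            rows.append((y1, y2))
        return rows

    group1 = one_group(n1, first_is_b=False, carry=lam_a)  # A-then-B
    group2 = one_group(n2, first_is_b=True, carry=lam_b)   # B-then-A
    return group1, group2


# Example: 15 patients per group, rho = 0.5 (sigma_u = sigma_e = 1)
g1, g2 = simulate_ab_ba(15, 15, eta_ba=1.0, gamma=1.0, lam_a=0.5, lam_b=0.0,
                        sigma_u=1.0, sigma_e=1.0, seed=2021)
```

Setting both standard deviations to zero returns the noise-free cell means, which is a quick way to check that the carry-over term enters only in period 2.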

For patient i (= 1, 2, ⋯, n_g) in group g (= 1, 2), we define $d_{i}^{(g)} = Y_{i 2}^{(g)} - Y_{i 1}^{(g)}$ , representing the difference in responses between two periods. We further define ${\bar{d}}^{(g)} = \sum_{i = 1}^{n_{g}} d_{i}^{(g)} / n_{g} = {\bar{Y}}_{+ 2}^{(g)} - {\bar{Y}}_{+ 1}^{(g)}$ , where ${\bar{Y}}_{+ z}^{(g)} = \sum_{i = 1}^{n_{g}} Y_{i z}^{(g)} / n_{g}$ (for z = 1, 2), as the average of response differences for period 2 versus period 1 over patients in group g. When there are no carry-over effects (i.e., λ_A = λ_B), the following estimator is the most commonly-used unbiased estimator for η_BA under model (1):

{\hat{η}}_{B A} = ({\bar{d}}^{(1)} - {\bar{d}}^{(2)}) / 2,

(2)

and its variance is

Var ({\hat{η}}_{B A}) = σ_{d}^{2} (1 / n_{1} + 1 / n_{2}) / 4,

(3)

where $σ_{d}^{2} = Var (d_{i}^{(g)})$ . We can estimate the variance $σ_{d}^{2}$ by the unbiased pooled-sample variance.^4,5

{\hat{σ}}_{d}^{2} = \sum_{g = 1}^{2} \sum_{i = 1}^{n_{g}} {(d_{i}^{(g)} - {\bar{d}}^{(g)})}^{2} / (n_{+} - 2),

(4)

where n₊ = n₁ + n₂ denotes the total number of patients in the trial. Therefore, when substituting ${\hat{σ}}_{d}^{2}$ (4) for $σ_{d}^{2}$ in $Var ({\hat{η}}_{B A})$ (3), we obtain the variance estimator $\hat{Var} ({\hat{η}}_{B A})$. When there are carry-over effects (i.e., λ_A ≠ λ_B), ${\hat{η}}_{B A}$ (2) is known to be a biased estimator of η_BA, with bias given by

E ({\hat{η}}_{B A} - η_{B A}) = λ_{D} / 2,

(5)

where λ_D = λ_A − λ_B. We can estimate λ_D by the unbiased estimator

{\hat{λ}}_{D} = ({\bar{Y}}_{+ 1}^{(1)} + {\bar{Y}}_{+ 2}^{(1)}) - ({\bar{Y}}_{+ 1}^{(2)} + {\bar{Y}}_{+ 2}^{(2)}) .

(6)

The variance of ${\hat{λ}}_{D}$ is given by

Var ({\hat{λ}}_{D}) = (4 σ_{u}^{2} + 2 σ_{e}^{2}) (\frac{1}{n_{1}} + \frac{1}{n_{2}}) .

(7)

Note the variance can be estimated by the unbiased pooled-sample variance^2,3 and we can obtain the variance estimator $\hat{Var} ({\hat{λ}}_{D})$ . When there are carry-over effects, we can use ${\hat{λ}}_{D}$ (6) to adjust the bias in (5) to obtain the unbiased estimator only based on the data at period 1 as done for the parallel groups design.⁵ This leads us to consider the following estimator ${\hat{η}}_{P A R}$ .

{\hat{η}}_{PAR} = {\bar{Y}}_{+ 1}^{(2)} - {\bar{Y}}_{+ 1}^{(1)},

(8)

which is an unbiased estimator for η_BA even in the presence of carry-over effects. The variance for ${\hat{η}}_{P A R}$ (8) is

Var ({\hat{η}}_{PAR}) = σ^{2} (1 / n_{1} + 1 / n_{2}),

(9)

where $σ^{2} = σ_{u}^{2} + σ_{e}^{2}$ . We can estimate σ² by

{\hat{σ}}^{2} = \sum_{g = 1}^{2} {\sum_{i = 1}^{n_{g}} (Y_{i 1}^{(g)} - {\bar{Y}}_{+ 1}^{(g)})}^{2} / (n_{+} - 2),

(10)

and obtain the variance estimator $\hat{Var} ({\hat{η}}_{P A R}) = {\hat{σ}}^{2} (1 / n_{1} + 1 / n_{2})$ . Note that Willan and Pater¹² stated that the mean squared error (MSE) $E {({\hat{η}}_{B A} - η_{B A})}^{2}$ is smaller than $E {({\hat{η}}_{P A R} - η_{B A})}^{2}$ under the balanced case (i.e., n₁ = n₂ = n) if and only if

λ_{D}^{2} / 4 < (1 + ρ) σ^{2} / n .

(11)

However, inequality (11) is generally difficult to apply in practice due to the lack of prior knowledge of the difference λ_D, the intraclass correlation ρ, or the variance σ². Also, the two-stage test procedure¹ has been suggested to determine which estimator, ${\hat{η}}_{B A}$ or ${\hat{η}}_{P A R}$, to use. As noted elsewhere,^2,5,12,13 there are concerns about applying this two-stage test procedure as well, such as the lack of power or the bias of the estimator of the relative treatment effect.
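The estimators in (2), (4), (6), (8) and (10) can be sketched in a few lines. The function names and the toy data below are hypothetical. The estimate of Var(λ̂_D) uses the pooled-sample variance of the per-patient sums Y_i1 + Y_i2, which is our reading of the pooled-sample variance mentioned after (7). The example also checks numerically that subtracting λ̂_D/2 from η̂_BA gives η̂_PAR.

```python
from statistics import mean


def pooled_var(xs1, xs2):
    """Pooled-sample variance with n1 + n2 - 2 degrees of freedom."""
    m1, m2 = mean(xs1), mean(xs2)
    ss = sum((x - m1) ** 2 for x in xs1) + sum((x - m2) ** 2 for x in xs2)
    return ss / (len(xs1) + len(xs2) - 2)


def crossover_estimates(group1, group2):
    """group1: (Y_i1, Y_i2) pairs for A-then-B; group2: pairs for B-then-A."""
    k = 1 / len(group1) + 1 / len(group2)
    d1 = [y2 - y1 for y1, y2 in group1]   # period differences, group 1
    d2 = [y2 - y1 for y1, y2 in group2]   # period differences, group 2
    s1 = [y1 + y2 for y1, y2 in group1]   # per-patient sums, group 1
    s2 = [y1 + y2 for y1, y2 in group2]   # per-patient sums, group 2
    p1 = [y1 for y1, _ in group1]         # first-period responses (A)
    p2 = [y1 for y1, _ in group2]         # first-period responses (B)
    sigma_d2 = pooled_var(d1, d2)         # (4)
    sigma2 = pooled_var(p1, p2)           # (10)
    return {
        "eta_ba": (mean(d1) - mean(d2)) / 2,      # (2)
        "var_eta_ba": sigma_d2 * k / 4,           # (3) with (4) plugged in
        "lambda_d": mean(s1) - mean(s2),          # (6)
        "var_lambda_d": pooled_var(s1, s2) * k,   # (7), estimated (assumption)
        "eta_par": mean(p2) - mean(p1),           # (8)
        "var_eta_par": sigma2 * k,                # (9) with (10) plugged in
        "sigma2": sigma2,
        "sigma_d2": sigma_d2,
    }


# Toy data (illustrative only, not from the paper)
group1 = [(1.0, 3.0), (2.0, 3.0), (3.0, 6.0)]
group2 = [(4.0, 2.0), (5.0, 5.0), (3.0, 2.0)]
est = crossover_estimates(group1, group2)
# Bias correction of (2) by (6) recovers the first-period estimator (8).
assert abs(est["eta_ba"] - est["lambda_d"] / 2 - est["eta_par"]) < 1e-12
```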

Instead of deciding to choose either ${\hat{η}}_{B A}$ or ${\hat{η}}_{P A R}$ for use based on nonexistent prior knowledge or a powerless hypothesis testing procedure, we may consider a weighted average $W {\hat{η}}_{P A R} + (1 - W) {\hat{η}}_{B A}$ (where 0 ≤ W ≤ 1), that includes ${\hat{η}}_{P A R}$ and ${\hat{η}}_{B A}$ as special cases. We can show that the optimal weight W₀ to minimize the MSE $E {(W {\hat{η}}_{P A R} + (1 - W) {\hat{η}}_{B A} - η_{B A})}^{2}$ is given by

W_{0} = \{E [{({\hat{η}}_{B A} - η_{B A})}^{2}] - E [({\hat{η}}_{B A} - η_{B A}) ({\hat{η}}_{P A R} - η_{B A})]\} / \{E [{({\hat{η}}_{B A} - η_{B A})}^{2}] + E [{({\hat{η}}_{P A R} - η_{B A})}^{2}] - 2 E [({\hat{η}}_{B A} - η_{B A}) ({\hat{η}}_{P A R} - η_{B A})]\} .

(12)

We can further show that

E [({\hat{η}}_{B A} - η_{B A}) ({\hat{η}}_{P A R} - η_{B A})] = (σ_{e}^{2} / 2) (1 / n_{1} + 1 / n_{2}) .

(13)

From (3), (5), (9) and (13), we can see that the optimal weight W₀ (12) simplifies to

W_{0} = (λ_{D}^{2} / 4) / [λ_{D}^{2} / 4 + σ^{2} ((1 + ρ) / 2) (1 / n_{1} + 1 / n_{2})] .

(14)

Note that the optimal weight W₀ is a function of unknown parameters, and hence we cannot directly apply $W_{0} {\hat{η}}_{P A R} + (1 - W_{0}) {\hat{η}}_{B A}$ to reduce the MSE relative to ${\hat{η}}_{P A R}$ and ${\hat{η}}_{B A}$.
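For readers who want the intermediate steps between (12) and (14), one way to fill them in is sketched below. The shorthand a, b and k is introduced here for convenience and is not notation from the paper. The identity σ_d² = 2σ_e² follows from model (1), because the patient effect cancels in the period difference.

```latex
\begin{align*}
a &= \hat{\eta}_{PAR} - \eta_{BA}, \qquad b = \hat{\eta}_{BA} - \eta_{BA}, \qquad k = 1/n_1 + 1/n_2, \\
E(a^2) &= \sigma^2 k && \text{by (9), since } \hat{\eta}_{PAR} \text{ is unbiased}, \\
E(b^2) &= \sigma_d^2 k/4 + \lambda_D^2/4 = \sigma_e^2 k/2 + \lambda_D^2/4 && \text{by (3), (5) and } \sigma_d^2 = 2\sigma_e^2, \\
E(ab) &= \sigma_e^2 k/2 && \text{by (13)}, \\
E(b^2) - E(ab) &= \lambda_D^2/4, \\
E(a^2) + E(b^2) - 2E(ab) &= \lambda_D^2/4 + (\sigma^2 - \sigma_e^2/2)\,k
  = \lambda_D^2/4 + \sigma^2 \tfrac{1+\rho}{2}\,k && \text{since } \sigma_e^2 = (1-\rho)\sigma^2 .
\end{align*}
```

Dividing the fifth line by the sixth gives (14).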

We can estimate $λ_{D}^{2}$ by the unbiased estimator

{\hat{λ}}_{D}^{2} - \hat{Var} ({\hat{λ}}_{D}) .

(15)

We can further estimate σ²((1+ρ)/2) by the unbiased estimator

{\hat{σ}}^{2} - {\hat{σ}}_{d}^{2} / 4.

(16)

On the basis of (15)–(16), we can estimate the optimal weight W₀ by

{\hat{W}}_{0} = [({\hat{λ}}_{D}^{2} - \hat{Var} ({\hat{λ}}_{D})) / 4] / [({\hat{λ}}_{D}^{2} - \hat{Var} ({\hat{λ}}_{D})) / 4 + ({\hat{σ}}^{2} - {\hat{σ}}_{d}^{2} / 4) (1 / n_{1} + 1 / n_{2})]

(17)

and define

{\hat{η}}_{O P} = {\hat{W}}_{0} {\hat{η}}_{P A R} + (1 - {\hat{W}}_{0}) {\hat{η}}_{B A} .

(18)
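A minimal sketch of the plug-in weight (17) and the weighted estimator (18) from summary statistics follows. The function names and the numerical inputs are hypothetical. The truncation of the weight to [0, 1] follows the rule described in the Discussion; clipping the raw ratio literally, including when the denominator is negative, is our assumption, as that case is not discussed in the text.

```python
def estimated_weight(lambda_d_hat, var_lambda_d_hat, sigma2_hat, sigma_d2_hat, n1, n2):
    """Plug-in weight (17), truncated to the interval [0, 1]."""
    k = 1 / n1 + 1 / n2
    num = (lambda_d_hat ** 2 - var_lambda_d_hat) / 4     # uses (15)
    den = num + (sigma2_hat - sigma_d2_hat / 4) * k      # adds (16) times k
    if den == 0:
        # Degenerate case not covered by the source; handled here by convention.
        return 0.0 if num <= 0 else 1.0
    return min(1.0, max(0.0, num / den))


def weighted_estimate(weight, eta_par_hat, eta_ba_hat):
    """Weighted average (18) of the first-period and crossover estimators."""
    return weight * eta_par_hat + (1 - weight) * eta_ba_hat


# Illustrative inputs (not from the paper): n1 = n2 = 20
w = estimated_weight(lambda_d_hat=1.0, var_lambda_d_hat=0.2,
                     sigma2_hat=1.0, sigma_d2_hat=1.2, n1=20, n2=20)
eta_op = weighted_estimate(w, eta_par_hat=0.9, eta_ba_hat=1.4)
```

With these inputs the numerator is 0.2 and the denominator is 0.27, so the weight is about 0.74 and the combined estimate sits closer to the first-period estimate.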

To compare the MSE of ${\hat{η}}_{O P}$ (18) with those of ${\hat{η}}_{B A}$ (2) and ${\hat{η}}_{P A R}$ (8), we plot the MSE against λ_D. Figure 1 and Figure 2 show the relationship (with true W₀) between the MSE and λ_D for ${\hat{η}}_{O P}$, ${\hat{η}}_{B A}$, and ${\hat{η}}_{P A R}$ when n = 50, ρ = 0.2, and σ² = 2 (Figure 1) and when n = 50, ρ = 0.4, and σ² = 4 (Figure 2). As expected, both graphs show that the MSE of ${\hat{η}}_{O P}$ never exceeds the MSE of ${\hat{η}}_{P A R}$ or ${\hat{η}}_{B A}$. The value of λ_D does not affect the MSE of ${\hat{η}}_{P A R}$ since ${\hat{η}}_{P A R}$ uses only the first period of the data. When the absolute value of λ_D is large (large carry-over effect difference), the MSE of ${\hat{η}}_{B A}$ increases substantially. When the carry-over effect difference is 0 (λ_D = 0), the MSEs of ${\hat{η}}_{O P}$ and ${\hat{η}}_{B A}$ are the same and attain their smallest value. This can be seen from (14): when λ_D = 0, W₀ = 0 as well, so ${\hat{η}}_{O P}$ and ${\hat{η}}_{B A}$ coincide. We further notice that the reduction in MSE from using ${\hat{η}}_{O P}$ is largest when the MSEs of ${\hat{η}}_{B A}$ and ${\hat{η}}_{P A R}$ are equal (where the two curves cross). In Figure 1, the MSE curves of ${\hat{η}}_{B A}$ and ${\hat{η}}_{P A R}$ cross when λ_D is near 0.4 or −0.4; this is the value of λ_D at which applying ${\hat{η}}_{O P}$ reduces the MSE the most.

FIGURE 1.

The relationship (with true W₀) between λ_D and the MSE of ${\hat{η}}_{O P}$ (18), ${\hat{η}}_{B A}$ (2), and ${\hat{η}}_{P A R}$ (8) when n = 50 per group, intraclass correlation ρ = 0.2, and random error variance σ² = 2

FIGURE 2.

The relationship (with true W₀) between λ_D and the MSE of ${\hat{η}}_{O P}$ (18), ${\hat{η}}_{B A}$ (2), and ${\hat{η}}_{P A R}$ (8) when n = 50 per group, intraclass correlation ρ = 0.4, and random error variance σ² = 4
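The theoretical curves described for Figures 1 and 2 can be reproduced approximately from (3), (5), (9), (13) and (14). The sketch below assumes the balanced case n₁ = n₂ = n and treats σ² as the total variance σ_u² + σ_e² appearing in (9); the function names are illustrative.

```python
import math


def theoretical_mse(lambda_d, n, rho, sigma2):
    """Return (MSE_BA, MSE_PAR, MSE_OP, W0) with the true optimal weight, n1 = n2 = n."""
    k = 2 / n
    sigma_e2 = (1 - rho) * sigma2                 # error variance implied by rho
    mse_par = sigma2 * k                          # (9); no bias
    mse_ba = sigma_e2 * k / 2 + lambda_d ** 2 / 4  # variance (3) plus squared bias (5)
    cross = sigma_e2 * k / 2                      # cross-moment (13)
    w0 = (lambda_d ** 2 / 4) / (lambda_d ** 2 / 4 + sigma2 * (1 + rho) / 2 * k)  # (14)
    # MSE of the weighted average W*PAR + (1 - W)*BA evaluated at W = W0
    mse_op = w0 ** 2 * mse_par + 2 * w0 * (1 - w0) * cross + (1 - w0) ** 2 * mse_ba
    return mse_ba, mse_par, mse_op, w0


def crossing_point(n, rho, sigma2):
    """|lambda_D| at which MSE_BA equals MSE_PAR, from (11) taken as an equality."""
    return 2 * math.sqrt((1 + rho) * sigma2 / n)


# Setting of Figure 1: n = 50, rho = 0.2, sigma^2 = 2
curve = [(lam / 10, *theoretical_mse(lam / 10, 50, 0.2, 2.0)) for lam in range(-15, 16)]
cross_fig1 = crossing_point(50, 0.2, 2.0)   # about 0.44, i.e. near 0.4
```

Because the weighted average with W = 0 or W = 1 reduces to one of the two component estimators, the computed MSE at the optimal weight cannot exceed either component MSE at any λ_D.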

3 |. MONTE CARLO SIMULATION

To compare the performance of ${\hat{η}}_{O P}$ (18) with ${\hat{η}}_{B A}$ (2) and ${\hat{η}}_{P A R}$ (8), we use Monte Carlo simulation. We consider the situations in which the random errors $ε_{i z}^{(g)}$ are assumed to be independent and identically distributed as the normal distribution with mean 0 and variance σ² = 1 to 3 with an increment of 0.5; the effect for treatment A versus treatment B = 0 to 1 with an increment of 0.2; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.1 to 0.8 with an increment of 0.1; the carry-over effect difference λ_D = λ_A − λ_B = 0 to 1.5 with an increment of 0.25; and the number of patients per group n (= n₁ = n₂) = 15, 25, 50, 75, 100, 200. For each configuration determined by a combination of the above parameters, we use SAS 9.4¹⁴ to generate 1000 simulated samples, each consisting of n observations per group g (= 1, 2). We use these settings to compare the estimated MSEs. From the simulation results, we found that the estimated weight ${\hat{W}}_{0}$ tends to underestimate W₀ when the true W₀ is close to 1 and to overestimate it when the true W₀ is close to 0. To reduce the error due to this overestimation (underestimation) near the boundary, we propose a modified estimator of $λ_{D}^{2}$ for use when we estimate W₀. If ${\hat{λ}}_{D} / {\hat{σ}}^{2} < 0.1$, we use ${\hat{λ}}_{D}^{2}$ instead of the unbiased estimator ${\hat{λ}}_{D}^{2} - \hat{Var} ({\hat{λ}}_{D})$ (15) to estimate $λ_{D}^{2}$. On the other hand, if ${\hat{λ}}_{D} / {\hat{σ}}^{2} \geq 0.1$, we adopt the unbiased estimator ${\hat{λ}}_{D}^{2} - \hat{Var} ({\hat{λ}}_{D})$ (15) to estimate $λ_{D}^{2}$. This accounts for potential over-correction when ${\hat{λ}}_{D}$ is small, in which case the unbiased estimator (15) can yield a negative ${\hat{W}}_{0}$. That is,

{\hat{W}}_{01} = \begin{cases} [{\hat{λ}}_{D}^{2} / 4] / [{\hat{λ}}_{D}^{2} / 4 + ({\hat{σ}}^{2} - {\hat{σ}}_{d}^{2} / 4) (1 / n_{1} + 1 / n_{2})], & {\hat{λ}}_{D} / {\hat{σ}}^{2} < 0.1 \\ [({\hat{λ}}_{D}^{2} - \hat{Var} ({\hat{λ}}_{D})) / 4] / [({\hat{λ}}_{D}^{2} - \hat{Var} ({\hat{λ}}_{D})) / 4 + ({\hat{σ}}^{2} - {\hat{σ}}_{d}^{2} / 4) (1 / n_{1} + 1 / n_{2})], & {\hat{λ}}_{D} / {\hat{σ}}^{2} \geq 0.1 \end{cases}

(19)

Based on ${\hat{W}}_{01}$ , we define the modified estimator as

{\hat{η}}_{O P_a} = {\hat{W}}_{01} {\hat{η}}_{P A R} + (1 - {\hat{W}}_{01}) {\hat{η}}_{B A} .

(20)

In addition to comparing the EMSEs of ${\hat{η}}_{P A R}$, ${\hat{η}}_{B A}$, ${\hat{η}}_{O P}$, and ${\hat{η}}_{O P_a}$, we further examine the efficiency of the four estimators. It is known that, in the presence of carry-over effects, ${\hat{η}}_{B A}$ is a biased estimator of η_BA; hence, ${\hat{η}}_{O P}$ and ${\hat{η}}_{O P_a}$, which place weight on ${\hat{η}}_{B A}$, are biased in estimating η_BA as well. We use interval estimation and coverage probability to evaluate the efficiency. Through simulation, we generate bootstrap samples to construct 95% percentile intervals and calculate the coverage probabilities for the treatment effect difference η_BA based on these intervals. We generate 500 bootstrap samples for each simulated sample and calculate the average 95% interval length and the coverage probability over the 1000 simulated samples. We consider the situations in which the random errors $ε_{i z}^{(g)}$ are assumed to be independent and identically distributed as the normal distribution with mean 0 and variance σ² = 0.5 to 1.5 with an increment of 0.5; the effect for treatment A = 1 versus treatment B = 0; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.3, 0.7; the carry-over effect difference λ_D = λ_A − λ_B = 0 to 1 with an increment of 0.25; and the number of patients per group n (= n₁ = n₂) = 10, 20, 30. For each configuration determined by a combination of the above parameters, we use SAS 9.4¹⁴ to generate 1000 simulated samples, each consisting of n observations per group g (= 1, 2).
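A bootstrap percentile interval of the kind described above might be sketched as follows. The resampling scheme (patients drawn with replacement within each sequence group) and the order-statistic rule for the percentiles are assumptions, since the text does not spell them out; the function names and toy data are illustrative. Any of the four estimators can be passed in as `estimator`; the first-period estimator (8) is used here only to keep the example short.

```python
import math
import random
from statistics import mean


def eta_par(group1, group2):
    """First-period estimator (8): mean of B (group 2) minus mean of A (group 1)."""
    return mean(y1 for y1, _ in group2) - mean(y1 for y1, _ in group1)


def percentile_interval(group1, group2, estimator, n_boot=500, level=0.95, seed=None):
    """Percentile bootstrap interval, resampling patients within each group."""
    rng = random.Random(seed)
    stats = []
    for _ in range(n_boot):
        b1 = rng.choices(group1, k=len(group1))   # resample (Y_i1, Y_i2) pairs
        b2 = rng.choices(group2, k=len(group2))
        stats.append(estimator(b1, b2))
    stats.sort()
    alpha = 1 - level
    lo_idx = max(0, math.floor(alpha / 2 * n_boot))
    hi_idx = min(n_boot - 1, math.ceil((1 - alpha / 2) * n_boot) - 1)
    return stats[lo_idx], stats[hi_idx]


# Toy data (illustrative only, not from the paper)
group1 = [(1.0, 3.0), (2.0, 3.0), (3.0, 6.0), (2.5, 4.0)]
group2 = [(4.0, 2.0), (5.0, 5.0), (3.0, 2.0), (4.5, 3.0)]
lower, upper = percentile_interval(group1, group2, eta_par, n_boot=500, seed=0)
covers = lower <= 2.0 <= upper   # coverage indicator for a hypothesized eta_BA = 2.0
```

Averaging `upper - lower` and `covers` over many simulated data sets would give the interval length and coverage probability summaries reported in the tables.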

4 |. RESULTS

To compare simulation results, we summarize in Table 1 the estimated MSE (EMSE) for the four estimators we considered (${\hat{η}}_{B A}$ (2), ${\hat{η}}_{P A R}$ (8), ${\hat{η}}_{O P}$ (18), and ${\hat{η}}_{O P_a}$ (20)), categorized by sample size. Note that ${\hat{η}}_{O P_a}$ has the smallest EMSE at every sample size considered here (tied with ${\hat{η}}_{P A R}$ and ${\hat{η}}_{O P}$ at n = 200). When the sample size is large, the first period of the data contains enough information and thus the advantage of adopting either ${\hat{η}}_{O P}$ or ${\hat{η}}_{O P_a}$ gets smaller. For ${\hat{η}}_{B A}$, the reduction of the EMSE is limited even when n gets larger. This is due to the bias from the carry-over effects, which does not disappear even as n increases. When the sample size is large, the differences in EMSE among ${\hat{η}}_{O P}$, ${\hat{η}}_{O P_a}$, and ${\hat{η}}_{P A R}$ become smaller.

TABLE 1.

The estimated MSE and standard error (in bracket) organized by sample size n for ${\hat{η}}_{P A R}$ , ${\hat{η}}_{B A}$ , ${\hat{η}}_{O P}$ , and ${\hat{η}}_{O P_a}$ in situations in which random errors $ε_{i z}^{(g)}$ are assumed to be independently identically distributed as the normal distribution with mean 0 and variance σ² = 1–3 with an increment of 0.5; the effect for treatment A versus treatment B = 0–1 with an increment of 0.2; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.1 to 0.8 with an increment of 0.1; the carry-over effect difference λ_D = λ_A − λ_B = 0–1.5 with an increment of 0.25; and the number of patients per group n (= n1 = n2) = 15, 25, 50, 75, 100, 200

n	EMSE of ${\hat{η}}_{P A R}$ (SE)	EMSE of ${\hat{η}}_{B A}$ (SE)	EMSE of ${\hat{η}}_{O P}$ (SE)	EMSE of ${\hat{η}}_{O P_a}$ (SE)
15	0.233 (0.115)	0.267 (0.200)	0.197 (0.104)	0.182 (0.090)
25	0.140 (0.069)	0.242 (0.197)	0.131 (0.073)	0.119 (0.062)
50	0.070 (0.034)	0.222 (0.196)	0.072 (0.042)	0.065 (0.035)
75	0.047 (0.023)	0.216 (0.195)	0.049 (0.029)	0.045 (0.025)
100	0.035 (0.017)	0.213 (0.195)	0.037 (0.022)	0.034 (0.019)
200	0.018 (0.009)	0.208 (0.195)	0.018 (0.011)	0.018 (0.010)

Note: Each entry is calculated on the basis of 1000 repeated samples.

Table 2 shows the EMSE of the four estimators grouped by λ_D. When λ_D (the carry-over effect difference) is relatively small (< 0.5), ${\hat{η}}_{B A}$ performs the best. However, the EMSE of ${\hat{η}}_{B A}$ increases rather quickly as λ_D becomes large. On the other hand, the EMSE of ${\hat{η}}_{P A R}$ stays the same. Note that both ${\hat{η}}_{O P_a}$ and ${\hat{η}}_{O P}$ perform moderately well. They can be used to protect against the worst-case scenario when the carry-over effect difference is large, and they are more efficient than ${\hat{η}}_{P A R}$ when the sample size is small.

TABLE 2.

The estimated MSE and standard error (in bracket) organized by λ_D for ${\hat{η}}_{P A R}$ , ${\hat{η}}_{B A}$ , ${\hat{η}}_{O P}$ , and ${\hat{η}}_{O P_a}$ in situations in which random errors $ε_{i z}^{(g)}$ are assumed to be independently identically distributed as the normal distribution with mean 0 and variance σ² = 1–3 with an increment of 0.5; the effect for treatment A versus treatment B = 0 to 1 with an increment of 0.2; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.1–0.8 with an increment of 0.1; the carry-over effect difference λ_D = λ_A − λ_B = 0–1.5 with an increment of 0.25; and the number of patients per group n (= n1 = n2) = 15, 25, 50, 75, 100, 200

λ_D	EMSE of ${\hat{η}}_{P A R}$ (SE)	EMSE of ${\hat{η}}_{B A}$ (SE)	EMSE of ${\hat{η}}_{O P}$ (SE)	EMSE of ${\hat{η}}_{O P_a}$ (SE)
0	0.091 (0.095)	0.025 (0.030)	0.048 (0.051)	0.052 (0.056)
0.25	0.090 (0.094)	0.040 (0.030)	0.056 (0.052)	0.057 (0.055)
0.5	0.090 (0.095)	0.088 (0.031)	0.074 (0.059)	0.069 (0.059)
0.75	0.090 (0.094)	0.165 (0.030)	0.090 (0.072)	0.080 (0.067)
1	0.090 (0.094)	0.275 (0.030)	0.101 (0.088)	0.089 (0.077)
1.25	0.090 (0.095)	0.416 (0.031)	0.108 (0.103)	0.095 (0.088)
1.5	0.090 (0.095)	0.587 (0.031)	0.112 (0.115)	0.099 (0.097)

Note: Each entry is calculated on the basis of 1000 repeated samples.

In Table 3, we summarize the coverage probabilities and 95% percentile interval lengths grouped by sample size. The percentile interval lengths using ${\hat{η}}_{B A}$ are the shortest since ${\hat{η}}_{B A}$ uses all available observations. However, since the simulations include nonzero carry-over effects, the coverage probabilities using ${\hat{η}}_{B A}$ are not close to the nominal 0.95 confidence level. The other three estimators have similar coverage probabilities that are close to the nominal 0.95 level, but ${\hat{η}}_{P A R}$ produces longer intervals. Table 4 shows the coverage probabilities and 95% percentile interval lengths grouped by λ_D. Note that λ_D has little impact on the percentile interval lengths for ${\hat{η}}_{P A R}$ and ${\hat{η}}_{B A}$, which remain about the same across λ_D, whereas the interval lengths for ${\hat{η}}_{O P}$ and ${\hat{η}}_{O P_a}$ increase modestly as λ_D increases. The interval lengths generated using ${\hat{η}}_{B A}$ are still the shortest. However, the coverage probabilities using ${\hat{η}}_{B A}$ are not close to the nominal 0.95 confidence level when λ_D is not 0. Both ${\hat{η}}_{O P}$ and ${\hat{η}}_{O P_a}$ are able to maintain reasonable coverage probabilities even when λ_D is large, while producing shorter intervals than ${\hat{η}}_{P A R}$. The simulation results demonstrate that our proposed estimators ${\hat{η}}_{O P}$ and ${\hat{η}}_{O P_a}$ are able to factor in the existence of carry-over effects and are more efficient than ${\hat{η}}_{P A R}$. Furthermore, since the coverage probabilities using both ${\hat{η}}_{O P}$ and ${\hat{η}}_{O P_a}$ are close to the nominal 95% confidence level, the inflated Type I error rate caused by the two-stage test procedure is not a concern when applying our proposed estimators.

TABLE 3.

The coverage probability based on the 95% percentile intervals and the average interval length (in bracket) organized by sample size n for ${\hat{η}}_{P A R}$ , ${\hat{η}}_{B A}$ , ${\hat{η}}_{O P}$ , and ${\hat{η}}_{O P_a}$ in situations in which random errors $ε_{i z}^{(g)}$ are assumed to be independently identically distributed as the normal distribution with mean 0 and variance σ² = 0.5–1.5 with an increment of 0.5; the effect for treatment A = 1 versus treatment B = 0; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.3, 0.7; the carry-over effect difference λ_D = λ_A − λ_B = 0 to 1 with an increment of 0.25; the number of patients per group n (= n1 = n2) = 10, 20, 30, and 500 bootstrap samples

n	${\hat{η}}_{P A R}$	${\hat{η}}_{B A}$	${\hat{η}}_{O P}$	${\hat{η}}_{O P_a}$
10	0.918 (1.598)	0.644 (0.780)	0.918 (1.425)	0.918 (1.418)
20	0.934 (1.172)	0.535 (0.574)	0.931 (1.059)	0.933 (1.050)
30	0.937 (0.967)	0.448 (0.466)	0.930 (0.890)	0.934 (0.879)

Note: Each entry is calculated on the basis of 1000 repeated samples.

TABLE 4.

The coverage probability based on the 95% percentile intervals and the average interval length (in bracket) organized by λ_D for ${\hat{η}}_{P A R}$ , ${\hat{η}}_{B A}$ , ${\hat{η}}_{O P}$ , and ${\hat{η}}_{O P_a}$ in situations in which random errors $ε_{i z}^{(g)}$ are assumed to be independently identically distributed as the normal distribution with mean 0 and variance σ² = 0.5–1.5 with an increment of 0.5; the effect for treatment A = 1 versus treatment B = 0; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.3, 0.7; the carry-over effect difference λ_D = λ_A − λ_B = 0 to 1 with an increment of 0.25; the number of patients per group n (= n1 = n2) = 10, 20, 30, and 500 bootstrap samples

λ_D	${\hat{η}}_{P A R}$	${\hat{η}}_{B A}$	${\hat{η}}_{O P}$	${\hat{η}}_{O P_a}$
0	0.929 (1.245)	0.930 (0.609)	0.947 (1.065)	0.939 (1.076)
0.25	0.934 (1.246)	0.814 (0.608)	0.936 (1.077)	0.936 (1.080)
0.5	0.930 (1.247)	0.502 (0.598)	0.917 (1.118)	0.923 (1.108)
0.75	0.929 (1.246)	0.303 (0.609)	0.916 (1.158)	0.921 (1.138)
1	0.927 (1.244)	0.162 (0.610)	0.915 (1.207)	0.920 (1.178)

Note: Each entry is calculated on the basis of 1000 repeated samples.

5 |. AN EXAMPLE

To illustrate the use of our proposed estimators, we consider an example from Senn.³ The study evaluated the efficacy of two types of bronchodilators used for asthma treatment, Salbutamol (Sal) and Formoterol (For). A 2×2 crossover design was applied, and a total of 12 patients were randomly assigned to two groups. One group received the treatments in the order For-then-Sal and the other group in the order Sal-then-For. The outcome measure is peak expiratory flow (PEF). The estimated treatment effect (the difference between For and Sal) based on the parallel groups approach (first-period data only), ${\hat{η}}_{P A R}$ (8), is −9.17. If we assume no carry-over effect and apply the most commonly used unbiased estimator for η_BA, ${\hat{η}}_{B A}$ (2), the estimate is −28.67. For the weighted estimator we proposed, the estimated weight ${\hat{W}}_{0}$ is 0.734, which leads to ${\hat{η}}_{O P}$ (18) = −14.35. With the adjusted weight, ${\hat{W}}_{01}$ (19) = 0.781, the estimate ${\hat{η}}_{O P_a}$ (20) is −13.43. Both ${\hat{η}}_{O P}$ and ${\hat{η}}_{O P_a}$ yield treatment effect estimates that differ from those of ${\hat{η}}_{P A R}$ and ${\hat{η}}_{B A}$, lying between the two.
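The weighted estimates in this example can be checked by plugging the reported weights and component estimates into (18) and (20). The short sketch below uses only the rounded values quoted above, so small differences in the last digit from the reported −14.35 and −13.43 are expected; the function name is illustrative.

```python
# Reported component estimates (rounded values from the text)
ETA_PAR = -9.17    # first-period estimator (8)
ETA_BA = -28.67    # crossover estimator (2)


def weighted(weight):
    """Weighted average of the two estimators, as in (18) and (20)."""
    return weight * ETA_PAR + (1 - weight) * ETA_BA


eta_op = weighted(0.734)     # reported estimated weight from (17)
eta_op_a = weighted(0.781)   # reported modified weight from (19)
print(round(eta_op, 2), round(eta_op_a, 2))
```

Both values agree with the reported estimates to within about 0.02, which is consistent with rounding of the weights to three decimals.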

6 |. DISCUSSION

From Figure 1 and Figure 2, we notice that the derived optimal estimator ${\hat{η}}_{O P}$ yields the smallest MSE if W₀ is known. However, Monte Carlo simulations show that, with estimated parameters, the EMSE of ${\hat{η}}_{O P}$ is not uniformly the smallest across the scenarios we considered. This is because, owing to random variation, the estimated optimal weight ${\hat{W}}_{0}$ can be smaller than 0 or greater than 1, while the possible range of W₀ is between 0 and 1. We set ${\hat{W}}_{0}$ to 0 (1) when the estimated ${\hat{W}}_{0}$ is ≤ 0 (≥ 1). This is the reason that most of the estimated ${\hat{W}}_{0}$ values range from 0.25 to 0.75 while the true value of W₀ ranges from 0 to 1. By observing the behavior of the EMSE of ${\hat{η}}_{O P}$ and of other estimated parameters such as ${\hat{σ}}^{2}$ and ${\hat{λ}}_{D}$, we propose a simple modification to the calculation of ${\hat{W}}_{0}$. Simulation results show that the modified ${\hat{η}}_{O P_a}$ performs better than ${\hat{η}}_{O P}$ in most cases. One advantage of our proposed weighted estimator is that there is no need to apply the two-stage test procedure suggested by Grizzle.⁵ We use a weighted estimator and a modified estimator that are based on ${\hat{η}}_{B A}$ and ${\hat{η}}_{P A R}$. Even though the simulation results demonstrate that ${\hat{η}}_{O P}$ and ${\hat{η}}_{O P_a}$ do not uniformly perform the best, they can serve as a safeguard against a very large MSE when the carry-over effect is large.

Simulation results also demonstrate that the coverage probabilities using both ${\hat{η}}_{O P}$ and ${\hat{η}}_{O P_a}$ are reasonably close to the nominal 0.95 level even in the presence of carry-over effects. Furthermore, the percentile interval lengths using ${\hat{η}}_{O P}$ and ${\hat{η}}_{O P_a}$ are shorter than those obtained with ${\hat{η}}_{P A R}$ in the situations we considered. Since the coverage probabilities based on the percentile intervals are close to the nominal 95% confidence level, there is an advantage in using ${\hat{η}}_{O P}$ and ${\hat{η}}_{O P_a}$ rather than conducting a two-stage test procedure, which can cause the actual Type I error rate to exceed the nominal α-level.

In summary, we have developed an optimal estimator that is a weighted average of the two commonly used estimators for AB/BA crossover designs, together with a modified estimator based on the optimal estimator. Simulation results demonstrate that the overall performance of the optimal estimator and its modified estimator is better than that of the two commonly used estimators. When the sample size is large, the parallel groups analysis that uses only the first period avoids any potential carry-over effect difference between the two treatments considered and should be used. The results, findings, and discussion should be of use to clinicians and biostatisticians when they employ simple AB/BA crossover designs in their studies.

ACKNOWLEDGMENTS

The authors wish to thank the two reviewers for many valuable and useful comments that improved the content and clarity of this paper. Dr. Lin’s research is supported in part by the National Institutes of Health under award numbers U54MD012397 and R61MH120236-01A1. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Funding information

National Institute of Mental Health, Grant/Award Numbers: U54MD012397, R61MH120236-01A1

Footnotes

CONFLICT OF INTEREST

The authors declare that they have no conflicts of interest for this work.

DATA AVAILABILITY STATEMENT

The data used in the paper was an example from Senn (2002) “Cross-over Trials In Clinical Research.”

REFERENCES

1. Hills M, Armitage P. The two-period crossover clinical trial. Br J Clin Pharmacol. 1979;8:7–20.
2. Fleiss JL. The Design and Analysis of Clinical Experiments. Wiley; 1986:459.
3. Senn S. Crossover Trials in Clinical Research. 2nd ed. Wiley; 2002:340.
4. Fleiss JL. A critique of recent research on the two-treatment crossover design. Control Clin Trials. 1989;10:237–243.
5. Grizzle JE. The two-period change-over design and its use in clinical trials. Biometrics. 1965;21:467–480.
6. Freeman PR. The performance of the two-stage analysis of two-treatment, two-period crossover trials. Stat Med. 1989;8:1421–1432.
7. Senn SJ. Crossover trials, carry-over effects and the art of self-delusion. Stat Med. 1988;7:1099–1101.
8. Senn SJ. Problems with the two stage analysis of crossover trials. Br J Clin Pharmacol. 1991;32:133–711.
9. Senn SJ. The case for crossover trials in phase III. Stat Med. 1997;16:2021–2022.
10. Jones B, Donev AN. Modelling and design of crossover trials. Stat Med. 1996;16:1435–1446.
11. Senn SJ. The AB/BA crossover: how to perform the two-stage analysis if you can’t be persuaded that you shouldn’t. In: Hansen B, de Ridder M, eds. Liber Amicorum Roel van Strik. Erasmus University; 1996:93–100.
12. Willan AR, Pater JL. Carry-over and the two-period crossover clinical trial. Biometrics. 1986;42:593–599.
13. Senn SJ. Is the simple carry-over model useful? Stat Med. 1992;11:715–726.
14. SAS Institute Inc. Cary. SAS Institute Inc; 2013.

