Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Mar 1.
Published in final edited form as: Pharm Stat. 2021 Nov 9;21(2):386–394. doi: 10.1002/pst.2176

A note on point estimation and interval estimation of the relative treatment effect under a simple crossover design

Chii-Dean Lin 1, Kung-Jong Lui 1
PMCID: PMC9054161  NIHMSID: NIHMS1793756  PMID: 34755464

Abstract

To increase power or reduce the number of patients needed for a parallel groups design, the crossover design has been often used to study treatments for noncurable chronic diseases. However, in the presence of carry-over effect caused by treatments, the commonly-used estimator which ignores the carry-over effect leads to a biased estimator for estimating the treatment effect difference. A two-stage test approach aimed to address carry-over effect proposed was found to be potentially misleading. In this paper, we propose a weighted average of the commonly-used estimator and an unbiased estimator that uses only the first period of the data. We derive an optimal weight that minimizes the mean squared error (MSE) and its modified estimator. We apply Monte Carlo simulation to evaluate the performance of the proposed estimators in a variety of situations. In the simulations, we examine the estimated MSE (EMSE), percentile interval length, and coverage probability calculated from the percentile intervals among considered estimators. Simulation results show that our proposed weighted average estimator and its modified estimator lead to smaller EMSEs on average comparing to the two commonly used estimators. The coverage probabilities using our proposed estimators are reasonably close to the nominal confidence level and the interval lengths are shorter comparing to the use of the unbiased estimator that uses only the first period of the data. We apply an example that was to evaluate the efficacy of two type of bronchodilators for asthma treatment to demonstrate the use of the proposed estimators.

Keywords: carry-over effect, mean squared error, parallel groups design, percentile interval, relative treatment effect, simple crossover design

1 |. INTRODUCTION

To increase power or reduce the number of patients needed for a parallel groups design,1,2,3 the crossover design has been often used to study treatments for noncurable chronic diseases, because each patient serves as his/her own control. Following Senn3 and Fleiss,2,4 we should not employ the crossover design if we cannot employ an adequate washout period to assure that patients are weaned off the residual effects from earlier treatments. However, we can never be certain that the washout period has worked. Grizzle5 proposed a two-stage test procedure as follows. We first test whether the carry-over effect exists. Because the test itself for the carry-over effect is subject to the response variation between patients, the power of this test is generally low and thereby a high-nominal level of Type I error (10% or 15%) is usually chosen. If the test for the carry-over effect is nonsignificant, we will carry on analyses based on the difference between responses within patients as done for the crossover trial with assuming no carry-over effects. Otherwise, the test procedure using the data at the first period only is carried out as for the parallel groups design and all data obtained at the second period are excluded from data analysis. Freeman6 carried out a thorough investigation of the two-stage test approach and concluded that this approach could be potentially misleading. This is because the test for detecting the carry-over effect is highly correlated to that for testing equality based on the data at the first period. Thus, the test based on the data at the first period is likely to be significant as well when the test result for the carry-over effect is significant. This leads that the actual Type I error rate for the two-stage test is higher than the nominal α - level. Due to this concern, we do not want to use the two-stage test procedure in practice.3,7,8,9 Other notes in use of the two-stage test in a crossover trial can be found elsewhere.10,11

2 |. NOTATION AND METHODS

Consider comparing an experimental treatment B with a standard treatment A (or a placebo) under an AB/BA crossover design. Suppose that we randomly assign n1 patients to group g = 1 with the treatment-receipt sequence A-then-B, in which patients receive treatment A at period 1 and then crossover to receive treatment B at period 2, and n2 patients to group g = 2 with the treatment-receipt sequence B-then-A, in which patients receive treatment B at period 1 and then crossover to receive treatment A at period 2. For patient i (i=1, 2, ⋯, ng) assigned to group g (g = 1, 2), let Yiz(g) denote the patient response at period z (= 1, 2). We assume that Yiz(g) can be expressed by the following random effects linear additive risk model:

Yiz(g)=μi(g)+ηBAXiz(g)+γZiz(g)+λA1{g=1}Ziz(g)+λB(11{g=1})Ziz(g)+εiz(g), (1)

where μi(g) denotes the random effect due to the ith patient in group g and all μi(g) 's  are assumed to be independent and identically distributed as an unspecified probability density function fg(μ) with variance σu2;Xiz(g) denotes the treatment-received covariate for treatment B, and Ziz(g)=1 if the ith patient in group g at period z receives treatment B, and = 0, otherwise; Ziz(g) represents the period covariate, and Ziz(g)=1 for period z = 2, and = 0, otherwise; 1{g=1} is the indicator function of group 1, and = 1 for group g = 1 and = 0, otherwise; and the random errors εiz(g)'s are assumed to be independent and identically distributed as a continuous distribution with mean 0 and variance σe2, and are assumed to be also independent of μi(g)'s. Parameters ηBA and γ sin model (1) denote the difference in effects between treatments B and A, and that between periods 2 and 1, respectively. Furthermore, λA and λB in model (1) represent the carry-over effects due to treatments A and B. Under model (1), we can see that the covariance between Yi1(g) and Yi2(g) is cov(Yi1(g),Yi2(g))=Var(μi(g))=σu2>0 and thereby, Yi1(g) and Yi2(g) are positively correlated with the intraclass correlation ρ=σu2/(σu2+σe2). Thus, the larger the variation σu2 between patients, the higher is the value of the intraclass correlation between responses within patients.

For patient i (= 1, 2, ⋯, ng) in group g (= 1, 2), we define di(g)=Yi2(g)Yi1(g), representing the difference in responses between two periods. We further define d¯(g)=i=1ngdi(g)/ng=Y¯+2(g)Y¯+1(g), where Y¯+z(g)=i=1ngYiz(g)/ng (for z = 1, 2), as the average of response differences for period 2 versus period 1 over patients in group g. When there are no carry-over effects (i.e., λA = λB), the following estimator is the most commonly-used unbiased estimator for ηBA under model (1):

η^BA=(d¯(1)d¯(2))/2, (2)

and its variance.

Var(η^BA)=σd2(1/n1+1/n2)/4, (3)

where σd2=Var(di(g)). We can estimate the variance σd2 by the unbiased pooled-sample variance.4,5

σ^d2=g=12i=1ng(di(g)d¯(g))2/(n+2), (4)

where n+ = n1 + n2 denotes the total number of patients in the trial. Therefore, when substituting σ^d2 (4) for σd2 in Var(η^BA) (3), we obtain the variance estimator Var^(η^BA). When there are carry-over effects (i.e., λAλB), η^BA (2) is known to be a biased estimator of ηBA and has the bias given by.

E(η^BAηBA)=λD/2, (5)

where λD = λAλB. We can estimate λD by the unbiased estimator.

λ^D=(Y¯+1(1)+Y¯+2(1))(Y¯+1(2)+Y¯+2(2)). (6)

The variance of λ^D is given by

Var(λ^D)=(4σu2+2σe2)(1n1+1n2). (7)

Note the variance can be estimated by the unbiased pooled-sample variance2,3 and we can obtain the variance estimator Var^(λ^D). When there are carry-over effects, we can use λ^D (6) to adjust the bias in (5) to obtain the unbiased estimator only based on the data at period 1 as done for the parallel groups design.5 This leads us to consider the following estimator η^PAR.

η^PAR=Y¯+1(2)Y¯+1(1), (8)

which is an unbiased estimator for ηBA even in the presence of carry-over effects. The variance for η^PAR (8) is

Var(η^PAR)=σ2(1/n1+1/n2), (9)

where σ2=σu2+σe2. We can estimate σ2 by

σ^2=g=12i=1ng(Yi1(g)Y¯+1(g))2/(n+2), (10)

and obtain the variance estimator Var^(η^PAR)=σ^2(1/n1+1/n2). Note that Willan and Pater12 stated that the mean squared error (MSE) E(η^BAηBA)2 is smaller than E(η^PARηBA)2 under the balanced case (i.e., n1 = n2 = n) if and only if

λD2/4<(1+ρ)σ2/n. (11)

However, this equality (11) is generally difficult to apply in practice due to the lack of prior knowledge of the difference λD, the intraclass correlation ρ, or the variance σ2. Also, the two-stage test procedure1 has been suggested to determine which estimator η^BA or η^PAR for use. As noted elsewhere,2,5,12,13 there are concerns, such as the lack of power or the bias of the estimator for the relative treatment effect, in application of this two-stage test procedure as well.

Instead of deciding to choose either η^BA or η^PAR for use based on nonexistent prior knowledge or a powerless hypothesis testing procedure, we may consider a weighted average Wη^PAR+(1W)η^BA (where 0 ≤ W ≤ 1), that includes η^PAR and η^BA as special cases. We can show that the optimal weight W0 to minimize the MSE E(Wη^PAR+(1W)η^BAηBA)2 is given by

Wo=[E(η^BAηBA)2E(η^BAηBA)(η^PARηBA)]/[E(η^BAηBA)2+E(η^PARηBA)22E(η^BAηBA)(η^PARηBA)]. (12)

We can further show that

E(η^BAηBA)(η^PARηBA)=(σe2/2)(1/n1+1/n2). (13)

From (3), (7), (9) and (13), we can see that the optimal weight W0 (12) simplifies to

W0=(λD2/4)/[λD2/4+σ2((1+ρ)/2)(1/n1+1/n2)]. (14)

Note that the optimal weight W0 is a function of unknown parameters and hence we cannot apply W0η^PAR+(1Wo)η^BA to reduce the MSE of η^PAR and η^BA directly.

We can estimate λD2 by the unbiased estimator

λ^D2Var^(λ^D).. (15)

We can further estimate σ2((1+ρ)/2) by the unbiased estimator

σ^2σ^d2/4. (16)

On the basis of (15)–(16), we can estimate the optimal weight Wo by

W^0=[(λ^D2Var^(λ^D))/4]/[(λ^D2Var^(λ^D))/4+(σ^2σ^d2/4)(1/n1+1/n2)] (17)

and define

η^OP=W^0η^PAR+(1W^0)η^BA. (18)

To compare the MSE of η^OP (18) with η^BA (2) and η^PAR (8), we plot MSE versus λD. Figure 1 and Figure 2 show the relationship (with true W0) between MSE and λD when n = 50, ρ = 0.2, and σ2 = 2 (Figure 1) and n = 50, ρ = 0.4, and σ2 = 4 (Figure 2) for η^OP, η^BA, and η^PAR. As expected, both graphs show that the MSE of η^OP, is smaller than the MSE of η^PAR and η^BA. The value of λD does not affect the MSE of η^PAR since η^PAR uses only the first period of the data. When the absolute value of λD is large (large carry-over effect difference), the MSE of η^BA increases substantially. In a situation that the carry-over effect difference is 0 (λD = 0), the MSE of η^OP and η^BA is the same and the MSE is the smallest. This can be seen from (14) that when λD = 0, the value of W0 = 0 as well. This leads to the equivalence of η^OP and η^BA. We further notice that when the MSEs of η^BA and η^PAR are equal (the crossing of the two curves), the reduction of the MSE using η^OP is the largest. In Figure 1, the two MSE curves of η^BA and η^PAR crossed each other when λD is near 0.4 or −0.4. This is the λD value that the MSE will reduce the most when η^OP is applied.

FIGURE 1.

FIGURE 1

The relationship (with true W0) between MSE of η^OP (20), η^BA (2), and η^PAR (9) and λD when n = 50 per group, intraclass correlation ρ = 0.2, and random error variance σ2 = 2

FIGURE 2.

FIGURE 2

The relationship (with true W0) between MSE of η^OP (20), η^BA (2), and η^PAR (9) and λD when n = 50 per group, intraclass correlation ρ = 0.4, and random error variance σ2 = 4

3 |. MONTE CARLO SIMULATION

To compare the performance of η^OP (18) with η^BA (2) and η^PAR (8), we use Monte Carlo simulation. We consider the situations in which the random errors εiz(g) are assumed to be independently identically distributed as the normal distribution with mean 0 and variance σ2 = 1–3 with an increment of 0.5; the effect for treatment A versus treatment B = 0 to 1 with an increment of 0.2; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.1–0.8 with an increment of 0.1; the carry-over effect difference λD = λAλB = 0–1.5 with an increment of 0.25; and the number of patients per group n (= n1 = n2) = 15, 25, 50, 75, 100, 200. For each configuration determined by a combination of the above parameters, we apply SAS 9.414 to generate 1000 simulated samples, each consisting of n observations per group g (= 1, 2). We use these settings to compare the estimated MSEs. From simulation results, we found that the estimate W0 tends to underestimate when the true W0 is close to 1 and overestimate when the true W0 is close to 0. To reduce the cause due to the overestimate (underestimate) near the boundary, we propose a modified estimator of λD2 when we estimate W0. If λ^Dσ^2<.1, we use λ^D2 instead of the unbiased estimator λ^D2VA^R(λ^D) (15) to estimate λD2. On the other hand, if λ^Dσ^2.1, we adopt the unbiased estimator λ^D2VA^R(λ^D) (15) to estimate λD2. This is to account for potential over-correction (when λ^D is small) and we may observe a negative W^0. That is,

W^01={[(λ^D2)/4]/[(λ^D2)/4+(σ^2σ^d2/4)(1/n1+1/n2),λ^Dσ^2<.1[(λ^D2Var^(λ^D))/4]/[(λ^D2Var^(λ^D))/4+(σ^2σ^d2/4)(1/n1+1/n2)],λ^Dσ^2.1. (19)

Based on W^01, we define the modified estimator as

η^OPa=W^01η^PAR+(1W^01)η^BA. (20)

On top of comparing the performance of EMSE of η^PAR, η^BA, η^OP, and η^OP_a, we further examine the efficiency of the four estimators. It is known that in the presence of the carry-over effects, η^BA is a biased estimator of ηBA and hence, η^OP and η^OP_a are biased in estimating ηBA as well. We use interval estimation and coverage probability to evaluate the efficiency. We generate bootstrap samples to construct 95% percentile intervals and the coverage probabilities of testing a hypothesized treatment effect difference ηBA based on the constructed 95% percentile intervals through simulations. We generate 500 bootstrap samples for each simulation and calculate the average 95% interval length and the coverage probability of the 1000 simulated samples. We consider the situations in which the random errors εiz(g) are assumed to be independently identically distributed as the normal distribution with mean 0 and variance σ2 = 0.5 to 1.5 with an increment of 0.5; the effect for treatment A = 1 versus treatment B = 0; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.3, 0.7; the carry-over effect difference λD = λAλB = 0 to 1 with an increment of 0.25; and the number of patients per group n (= n1 = n2) = 10, 20, 30. For each configuration determined by a combination of the above parameters, we apply SAS 9.414 to generate 1000 simulated samples, each consisting of n observations per group g (= 1, 2).

4 |. RESULTS

To compare simulation results, we summarize in Table 1 the estimated MSE (EMSE) for the four estimators we considered (η^BA (2), η^PAR (8), η^OP (18), and η^OP_a (20)) categorizing by the sample size. Note η^OP_a has the smallest EMSE in all sample sizes we considered here. When sample size is large, the first period of the data contains enough information and thus, the advantage of adopting either η^OP or η^OP_a gets smaller. For η^BA, the reduction of the EMSE is limited even when n gets larger. This is due to the existence of the carry-over effects and it will not disappear even n becomes bigger. When the sample size is large, the difference of the EMSE among η^OP, η^OP_a, and η^PAR becomes smaller.

TABLE 1.

The estimated MSE and standard error (in bracket) organized by sample size n for η^PAR, η^BA, η^OP, and η^OP_a in situations in which random errors εiz(g) are assumed to be independently identically distributed as the normal distribution with mean 0 and variance σ2 = 1–3 with an increment of 0.5; the effect for treatment A versus treatment B = 0–1 with an increment of 0.2; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.1 to 0.8 with an increment of 0.1; the carry-over effect difference λD = λA − λB = 0–1.5 with an increment of 0.25; and the number of patients per group n (= n1 = n2) = 15, 25, 50, 75, 100, 200

N EMSE_npar (SE) EMSE_nBA (SE) EMSE_nOP (SE) EMSE_nOP_a (SE)
15 0.233(0.115) 0.267(0.200) 0.197(0.104) 0.182(0.090)
25 0.140(0.069) 0.242(0.197) 0.131(0.073) 0.119(0.062)
50 0.070(0.034) 0.222(0.196) 0.072(0.042) 0.065(0.035)
75 0.047(0.023) 0.216(0.195) 0.049(0.029) 0.045(0.025)
100 0.035(0.017) 0.213(0.195) 0.037(0.022) 0.034(0.019)
200 0.018(0.009) 0.208(0.195) 0.018(0.011) 0.018(0.010)

Note: Each entry is calculated on the basis of 1000 repeated samples.

Table 2 shows the EMSE of the four estimators grouped by different λD. When the λD (the carry-over effect) is relative small (< 0.5), the estimate η^BA performs the best. However, the EMSE of η^BA increases rather quickly when λD becomes large. On the other hand, the EMSE of η^PAR stays the same. Note both η^OP_a and η^OP perform moderately well. They can be used to protect the worst scenario when the carry-over effect difference is large and they are more efficient than the η^PAR when the sample size is small.

TABLE 2.

The estimated MSE and standard error (in bracket) organized by λD for η^PAR, η^BA, η^OP, and η^OP_a in situations in which random errors εiz(g) are assumed to be independently identically distributed as the normal distribution with mean 0 and variance σ2 = 1–3 with an increment of 0.5; the effect for treatment A versus treatment B = 0 to 1 with an increment of 0.2; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.1–0.8 with an increment of 0.1; the carry-over effect difference λD = λAλB = 0–1.5 with an increment of 0.25; and the number of patients per group n (= n1 = n2) = 15, 25, 50, 75, 100, 200

λD EMSE_npar (SE) EMSE_nBA (SE) EMSE_nOP (SE) EMSE_nOP_a (SE)
0 0.091 (0.095) 0.025 (0.030) 0.048 (0.051) 0.052 (0.056)
0.25 0.090 (0.094) 0.040 (0.030) 0.056 (0.052) 0.057 (0.055)
0.5 0.090 (0.095) 0.088 (0.031) 0.074 (0.059) 0.069 (0.059)
0.75 0.090 (0.094) 0.165 (0.030) 0.090 (0.072) 0.080 (0.067)
1 0.090 (0.094) 0.275 (0.030) 0.101 (0.088) 0.089 (0.077)
1.25 0.090 (0.095) 0.416 (0.031) 0.108 (0.103) 0.095 (0.088)
1.5 0.090 (0.095) 0.587 (0.031) 0.112 (0.115) 0.099 (0.097)

Note: Each entry is calculated on the basis of 1000 repeated samples.

In Table 3, we summarize the coverage probabilities and 95% percentile interval lengths grouped by the sample size. It can be seen that the percentile interval lengths using η^BA are the shortest since η^BA uses all available observations. However, since we consider simulations including nonzero carry-over effects, the coverage probabilities using η^BA do not close to the nominal 0.95 confidence level. The other three estimates have similar coverage probabilities that are close to the nominal 0.95 level, but η^PAR produces larger interval lengths. Table 4 shows the coverage probabilities and 95% percentile interval lengths grouped by different λD. Note that λD does not impact the percentile interval lengths. The interval lengths remain about the same for each estimate. The interval lengths generated using η^BA are still the shortest. However, the coverage probabilities using η^BA do not close to the nominal 0.95 confidence level when λD is not 0. Both η^OP and η^OP_a are able to maintain reasonable coverage probabilities even λD is large while produce shorter interval lengths comparing to the lengths produced by the use of η^PAR. The simulation results demonstrate that our proposed estimators η^OP and η^OP_a are able to factor in the existence of cross-over effects and are more efficient comparing to the use of η^PAR. Furthermore, since the coverage probabilities using both η^OP and η^OP_a are closer to the nominal 95% confidence level, the large Type I error rate issue caused by the two-stage test procedure is not a concern by applying our proposed estimators.

TABLE 3.

The coverage probability based on the 95% percentile intervals and the average interval length (in bracket) organized by sample size n for η^PAR, η^BA, η^OP, and η^OP_a in situations in which random errors εiz(g) are assumed to be independently identically distributed as the normal distribution with mean 0 and variance σ2 = 0.5–1.5 with an increment of 0.5; the effect for treatment A = 1 versus treatment B = 0; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.3, 0.7; the carry-over effect difference λD = λAλB = 0 to 1 with an increment of 0.25; the number of patients per group n (= n1 = n2) = 10, 20, 30, and 500 bootstrap samples

n η^PAR η^BA η^OP η^OP_a
10 0.918 (1.598) 0.644 (0.780) 0.918 (1.425) 0.918 (1.418)
20 0.934 (1.172) 0.535 (0.574) 0.931 (1.059) 0.933 (1.050)
30 0.937 (0.967) 0.448 (0.466) 0.930 (0.890) 0.934 (0.879)

Note: Each entry is calculated on the basis of 1000 repeated samples.

TABLE 4.

The coverage probability based on the 95% percentile intervals and the average interval length (in bracket) organized by λD for η^PAR, η^BA, η^OP, and η^OP_a in situations in which random errors εiz(g) are assumed to be independently identically distributed as the normal distribution with mean 0 and variance σ2 = 0.5–1.5 with an increment of 0.5; the effect for treatment A = 1 versus treatment B = 0; the effect for period 2 versus period 1, γ = 1; the intraclass correlation ρ = 0.3, 0.7; the carry-over effect difference λD = λAλB = 0 to 1 with an increment of 0.25; the number of patients per group n (= n1 = n2) = 10, 20, 30, and 500 bootstrap samples

λ D η^PAR η^BA η^OP η^OP_a
0 0.929 (1.245) 0.930 (0.609) 0.947 (1.065) 0.939 (1.076)
0.25 0.934 (1.246) 0.814 (0.608) 0.936 (1.077) 0.936 (1.080)
0.5 0.930 (1.247) 0.502 (0.598) 0.917 (1.118) 0.923 (1.108)
0.75 0.929 (1.246) 0.303 (0.609) 0.916 (1.158) 0.921 (1.138)
1 0.927 (1.244) 0.162 (0.610) 0.915 (1.207) 0.920 (1.178)

Note: Each entry is calculated on the basis of 1000 repeated samples.

5 |. AN EXAMPLE

To illustrate the use of our proposed estimators, we consider an example from Senn.5 The study was to evaluate the efficacy of two types of bronchodilators that were used for asthma treatment. The two types of treatments are Salbutamol (Sal) and Formoterol (For). A 2×2 crossover design was applied for the study and a total of 12 patients were randomly assigned into two groups. One group used the sequential order of For-then-Sal and another group used the sequential order of Sal-then-For. The outcome measure is peak expiratory flow (PEF). The estimated treatment effect (the difference of For and Sal) using parallel group design (use first period data only) η^PAR (8) is −9.17. If we assume no carry-over effect and apply the most commonly-used unbiased estimator for ηBA, η^BA (2), the estimate is −28.67. When considering the weighted estimator we proposed, the estimated weight, W^0 is 0.734. This leads to η^OP (18) = −14.35. When we consider the adjusted weight, W^01 (19) = 0.781 and the estimate η^OP_a (20) is −13.43. Both η^OP and η^OP_a yield different treatment effect estimates comparing to the estimates of η^PAR and η^BA.

6 |. DISCUSSION

From Figure 1 and Figure 2, we notice the derived optimal estimator ηOP yields the smallest MSE if W0 is known. However, Monte Carlo simulations show that with estimated parameters, the EMSE of η^OP is not uniformly smallest in all scenarios we considered. This is due to the random variations that the estimated optimal weight W^0 can be smaller than 0 or greater than 1 while the possible range of the W0 is between 0 and 1. We assigned W^0 to be 0 (1) when the estimated W^0 is ≤ 0 (≥ 1). This is the reason that most of the estimated W^0 ranges from 0.25 to 0.75 while the true value of W0 ranges from 0 to 1. By observing the behavior of EMSE of η^OP and other estimated parameters such as σ^2 and λ^D, we propose a simple modification to calculate W^0. Simulation results show that the modified η^OP_a performs better than the η^OP in most cases. One advantage of our proposed weighted estimator is that there is no need to apply a two-stage test procedure approach as suggested by Grizzle.5 We use a weighted estimator and a modified estimator that are based on η^BA and η^PAR. Even the simulation results demonstrate that η^OP and η^OP_a do not perform the best uniformly, it can serve as a safeguard to prevent a huge MSE when the carry-over effect is large.

Simulation results also demonstrate that the coverage probabilities using both η^OP and η^OP_a are reasonably close to the nominal 0.95 level even with the presence of the carry-over effect. Furthermore, the percentile interval lengths using η^OP and η^OP_a are shorter than the interval lengths when η^PAR is applied under the situations we considered. Since the coverage probabilities based on the percentile intervals are close to the nominal 95% confidence level, there is an advantage of using η^OP and η^OP_a rather than conducting a two-stage test procedure which can lead the actual Type I error rate to be higher than the nominal α - level.

In summary, we have developed an optimal estimator that is a weighted average of the two commonly used estimators of AB/BA crossover designs and a modified estimator based on the optimal estimator. Simulation results demonstrate that the overall performance of the optimal estimator and its modified estimator are better than the two commonly used estimators. When sample size is large, the parallel design that uses only the first period can prevent any potential carry-over effect difference between the two considered treatments and it shall be used. The results, findings and discussions should be of use for clinicians and biostatisticians when they employ simple AB/BA crossover designs for their studies.

ACKNOWLEDGMENTS

The authors wish to thank the two reviewers for many valuable and useful comments to improve the contents and clarity of this paper. Dr. Lin’s Research is supported in part by the National Institutes of Health under award numbers U54MD012397 and R61MH120236-01A1. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Funding information

National Institute of Mental Health, Grant/Award Numbers: U54MD012397, R61MH120236-01A1

Footnotes

CONFLICT OF INTEREST

The authors declare that they have no conflicts of interest for this work.

DATA AVAILABILITY STATEMENT

The data used in the paper was an example from Senn (2002) “Cross-over Trials In Clinical Research.”

REFERENCES

  • 1.Hills M, Armitage P. The two-period crossover clinical trial. Br J Clinic Pharmacol. 1979;8:7–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Fleiss JL. The Design and Analysis of Clinical Experiments. Wiley; 1986:459. [Google Scholar]
  • 3.Senn S. Crossover Trials in Clinical Research. 2nd ed. Wiley; 2002:340. [Google Scholar]
  • 4.Fleiss JL. A critique of recent research on the two-treatment crossover design. Control Clin Trials. 1989;10:237–243. [DOI] [PubMed] [Google Scholar]
  • 5.Grizzle JE. The two-period change-over design and its use in clinical trials. Biometrics. 1965;21:467–480. [PubMed] [Google Scholar]
  • 6.Freeman PR. The performance of the two-stage analysis of two-treatment, two-period crossover trials. Stat Med. 1989;8:1421–1432. [DOI] [PubMed] [Google Scholar]
  • 7.Senn SJ. Crossover trials, carry-over effects and the art of self-delusion. Stat Med. 1988;7:1099–1101. [DOI] [PubMed] [Google Scholar]
  • 8.Senn SJ. Problems with the two stage analysis of crossover trials. Br J Clin Pharmacol. 1991;32:133–711. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Senn SJ. The case for crossover trials in phase III. Stat Med. 1997;16:2021–2022. [DOI] [PubMed] [Google Scholar]
  • 10.Jones B, Donev AN. Modelling and design of crossover trials. Stat Med. 1996;16:1435–1446. [DOI] [PubMed] [Google Scholar]
  • 11.Senn SJ. The AB/BA crossover: how to perform the two-stage analysis if you can’t be persuaded that you shouldn’t. In: Hansen B, de Ridder M, eds. Liber Amicorum Roel van Strik. Erasmus University; 1996:93–100. [Google Scholar]
  • 12.Willan AR, Pater JL. Carry-over and the two-period crossover clinical trial. Biometrics. 1986;42:593–599. [PubMed] [Google Scholar]
  • 13.Senn SJ. Is the simple carry-over model useful? Stat Med. 1992;11:715–726. [DOI] [PubMed] [Google Scholar]
  • 14.SAS Institute Inc. Cary. SAS Institute Inc; 2013. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The data used in the paper was an example from Senn (2002) “Cross-over Trials In Clinical Research.”

RESOURCES