ABSTRACT
One important property of any drug product is its stability over time. A key objective in drug stability studies is to estimate the shelf‐life of a drug, involving a suitable definition of the true shelf‐life and the construction of an appropriate estimate of the true shelf‐life. Simultaneous confidence bands (SCBs) for percentiles in linear regression are valuable tools for determining the shelf‐life in drug stability studies. In this paper, we propose a novel criterion, the Minimum Area Confidence Set (MACS) criterion, for finding the optimal SCB for percentile regression lines. This criterion focuses on the area of the constrained regions for the newly proposed pivotal quantities, which are generated from the confidence set for the unknown parameters of an SCB. We employ the new pivotal quantities to construct exact SCBs over any finite covariate intervals and use the MACS criterion to compare several SCBs of different forms. The optimal SCB under the MACS criterion can be used to construct the interval estimate of the true shelf‐life. Furthermore, a new computationally efficient method is proposed for calculating the critical constants of exact SCBs for percentile regression lines. A real data example on drug stability is provided for illustration.
Keywords: confidence set, drug stability study, multiple testing, percentile line, shelf‐life, simultaneous confidence band
1. Introduction
In drug development, a key property of any drug product is its stability over time. Drug stability studies are routinely conducted in order to measure the degradation of new drug products or existing drugs that have been reformulated or repackaged, as outlined by International Council of Harmonasation (ICH) guideline on stability testing of new drug substances and products; see ICH [1]. These studies provide patients and consumers with a high degree of confidence that a drug retains its strength, quality, and purity under appropriate storage conditions. Individual dosage units (e.g., tablets, capsules, vials) are sampled at predetermined time points and assayed for their active pharmaceutical ingredient (drug) content. The decline in drug content over time can be modeled based on these observations.
One important objective in drug stability studies is to estimate the true shelf‐life of a drug. The estimated shelf‐life is the storage period during which the potency of a drug product is expected to remain within the approved specification limit with an acceptable probability. The final day of the shelf life is known as the expiry date, which is required to be printed on the package label of the corresponding drug. According to ICH [2], the approved specification limits are no greater than of the labeled content of the drug. Current ICH guidelines, adopted by the U.S. Food and Drug Administration (FDA) and other major national regulatory agencies, recommend that the shelf‐life estimate should be based on an interval estimate of the mean change in drug content over storage time. Specifically, as outlined in ICH [1], the interval estimate of shelf‐life for a single batch is defined as the time interval during which the confidence interval for the mean regression line of the observed drug content () and the time () crosses a pre‐specified level of drug content, say (in percentage).
A commonly used statistical model for this purpose is simple linear regression of drug content () on the covariate time (); see Kiermeier et al. [3], Quinlan et al. [4], and Liu et al. [5] for a review. Throughout this paper, we focus on linear regression for a single batch of drug, assuming that the drug content decreases over time, which results in a negative slope for the regression line. The situation where the drug content increases over time can be treated using a similar approach. To be specific, let the statistical model be , where denotes the observed drug content at time point of the dosage unit, , , the random error follows a normal distribution , and , and are unknown parameters.
One problem in estimating the shelf‐life is deciding whether to construct a confidence interval for the average content of all dosage units, , or for the content of a proportion of all dosage units, , with denoting the percentile of standard normal distribution; see Ruberg and Stegeman [6]. According to the interval estimates of batch shelf‐life defined by ICH [1], the estimation is based on a ()‐level confidence interval for the mean regression line, which implies that only half (i.e., 50%) of all the individual dosage units of the drug should have a drug content of at least by the shelf‐life. However, from the patients' and consumers' point of view, it is expected that a larger proportion (e.g., 90%) of all the individual dosage units will have a drug content above the acceptable limit by the expiry date. Quinlan et al. [4] also note that the ICH method can overestimate the true shelf‐life, making it inappropriate for ensuring drug safety.
Kiermeier et al. [3] consider an approximate ()‐level pointwise confidence band (PCB) for the percentile regression line to provide an accurate estimate of the true shelf‐life. This PCB offers the inference of drug content for of dose units at one specific time point. Liu et al. [5] provide a new method to address this problem by determining the shelf‐life using an exact ()‐level PCB for the percentile line in linear regression.
Note that, the true shelf‐life can occur at any time point during the drug stability study period. Therefore, using simultaneous confidence bands (SCBs) to estimate the shelf‐life is a suitable approach, as the SCBs guarantee the confidence level of estimation at any time point within the covariate range of interest. SCBs for percentiles in linear regression are studied by Steinhorst and Bowden [7], Turner and Bowden [8, 9], and Thomas and Thomas [10] over the whole covariate range . As the expiry date required by the regulatory to be printed on the package label of a drug often has a limit, it is reasonable to consider a finite interval of shelf‐life, that is, . The specific duration of the expiration date varies depending on the drug product, its formulation, and stability testing results, but often and (in years) to justify an expiration dating period of over two years. Han et al. [11] construct exact SCBs over a finite covariate interval and develop asymmetric SCBs for the percentile line, which could be used for more informative interval estimates of shelf‐life than the SCBs over .
Han et al. [11] is the only available method for constructing exact symmetric and asymmetric SCBs. However, the pivotal quantities used in their approach for different types of SCBs vary in dimension, which complicates the implementation of their methods. In addition, they propose two methods for calculating the critical constants of SCBs: a simulation‐based method and a numerical quadrature method, both of which are computationally intensive.
As reviewed in Table 1, there are various forms of SCBs available in the statistical literature, and so it is desirable to identify the most informative SCB among different forms under a suitable criterion. Han et al. [11] determine the optimal SCB under the average width (AW) criterion [12]. The AW of a SCB is the average width between its upper and lower bounds over the covariate interval of interest , and has been widely used as an optimality criterion for comparing different forms of confidence bands; see Liu et al. [13]. It is noteworthy that the AW criterion has one limitation that it cannot directly provide the information of the unknown parameters and when both parameters are of primary interest. Liu and Hayter [14] introduce the minimum area confidence set (MACS) criterion based on the area of the confidence set for only, focusing on the SCBs for the mean regression line , see also Liu et al. [15] and Liu and Ah‐kine [16]. However, when estimating the shelf‐life, the SCBs for percentile lines are required, rather than the mean regression line, in order to assess whether a large proportion of dose units have a drug content above the threshold. In addition, note that making simultaneous inference for a percentile regression line over is equivalent to constructing the joint confidence set for and . Therefore, this paper considers the MACS criterion based on the area of the confidence set for the unknown parameters and . To date, no work in the literature has addressed such a MACS criterion for constructing and comparing SCBs for a percentile regression line.
TABLE 1.
Six simultaneous confidence bands.
In the following, we construct exact ()‐level symmetric and asymmetric two‐sided SCBs for the percentile line over any given covariate interval to provide accurate interval estimates of shelf‐life. Note that both the lower and upper bounds are important for the patients, consumers, and manufacturers. Specifically, the lower bound of the interval estimate of the shelf‐life, based on the two‐sided SCBs, indicates that at least a () proportion of all dosage units maintain a drug content above the limit by this time point. The corresponding upper bound of the interval estimate of the shelf‐life represents the time point beyond which less than () proportion of all dosage units retain a drug content above . Hence, the two‐sided SCBs of our method can be applied to produce both upper or lower bounds for shelf‐life.
This paper makes three main contributions. First, we construct new pivotal quantities for the exact two‐sided ()‐SCBs for the percentile line, , over any given covariate interval . Our newly proposed pivotal quantities link SCBs to the confidence set for the unknown parameters in linear regression. Furthermore, these pivotal quantities provide a more general approach for developing SCBs, making them applicable to all SCB types. Second, we develop the new MACS criterion to determine optimal SCBs that minimize the area of the confidence set for the unknown parameters by minimizing the area of the constrained regions for our new pivotal quantities. Third, we provide an efficient and exact method to compute the critical constants of SCBs for percentile lines. This new method uses numerical quadrature based on the newly proposed pivotal quantities. In comparison with the methods of Han et al. [11], it not only reduces the dimensions of the numerical integration but also eliminates the need for simulations, resulting in a significant reduction in computational costs.
Accordingly, this paper is organized as follows. In Section 2, we provide the details of the newly proposed pivotal quantities for the exact SCBs for percentile lines and the construction of the MACS criterion, including the numerical computation of the critical constants in Section 2.4. In Section 3, we report the results of a simulation study to assess the performances of different forms of SCBs, both symmetric and asymmetric, under the MACS criterion. In Section 4, we illustrate the proposed method with an illustrative example to determine the shelf‐life of a drug by using SCBs for percentile lines. Finally, Section 5 contains conclusions and discussions. The computational costs of our newly proposed method and the previous simulation‐based method, along with additional simulation results, are provided in the Supporting Information to save space.
2. Methodology
In this section, we focus on the construction of the MACS criterion for the exact ()‐level SCBs for the percentile line over any pre‐specified interval of interest .
2.1. Preliminaries
Consider the simple linear regression model
| (1) |
where , , and the random error is assumed to follow a normal distribution . For a given dataset , let denote the design matrix, the row of which is given by , . Denote the least squares estimator of and the unbiased estimator of by and with and . Then, we have , with the degrees of freedom , and and are independent random variables. To facilitate the computation of critical constants in Section 2.2, we consider the mean‐centered design matrix without loss of generality.
In this paper, the true shelf‐life is defined as the time point , based on the percentile line , that satisfies
where is the percentile of the standard normal distribution, and is the pre‐specified acceptable limit of the drug content level. Hence, by the time point , no more than (with ) of all the individual dosage units will have a drug content below the acceptable limit . It is noteworthy that the mean regression line is the 50th percentile line, which is a special case of with .
The definition of the SCBs , for the percentile line , is given as follows.
Definition 1
A confidence band , which satisfies
(2) is called a ‐level simultaneous confidence band for the percentile line , over a finite interval of interest .
Consider two‐sided SCBs of the form
| (3) |
where and are the critical constants that satisfy (2), and the constants and are given and selected for constructing different forms of SCBs. The SCBs of the form (3) can be divided into symmetric and asymmetric based on whether or , respectively, and into Type and Type based on whether or , respectively. For upper one‐sided SCBs, we have , and for lower one‐sided SCBs, we have .
Six different forms of SCBs have been published in the literature, with various ‐values. Table 1 gives the values of and for these six bands: SB, TBU, TBE (Type with ), V, UV and TT (Type with ). The corresponding asymmetric bands are denoted as SBa, TBUa, TBEa, Va, UVa, and TTa.
Han et al. [11] provide a two‐dimensional pivotal quantity for constructing the Type band and a three‐dimensional pivotal quantity for constructing the Type band. In this section, we construct a more general pivotal quantity that can be applied to both Type and Type bands in Table 1. All these six forms in Table 1 are compared under the MACS criterion in Section 3.
2.2. Pivotal Quantities and Confidence Sets
In this section, we construct the new pivotal quantities to provide a general way for developing the SCBs in Table 1, regardless of their types. The constrained regions for the new pivotal quantities and the confidence sets for unknown parameters are also provided based on the corresponding SCBs.
Without loss of generality, we use the mean‐centered covariate values to fit the regression in (1). Then, the covariate in (1) is replaced by , where denotes the mean of ‐values in the training dataset . Accordingly, the vector in (1) becomes , and the row of design matrix becomes . Let , then is a diagonal matrix. Let be the square root matrix of given by
and . From (3), the confidence level of a two‐sided SCB is given by
| (4) |
where , and .
Let . Define the new pivotal quantity
| (5) |
and a region of , denoted as , by
| (6) |
Then, in general, the critical constants for all ()‐level SCBs in Table 1 satisfy that .
Next, we develop the confidence sets for unknown parameters using the region in (6). Let , , and . Then the confidence set for unknown parameters is given by
| (7) |
the detailed derivation is provided in Appendix A. The new criterion is based on the area of the confidence set for . The smaller the area of , the better and more informative the corresponding SCB. Intuitively, each in a confidence set corresponds to one percentile line that lies completely inside the corresponding SCB and vice versa. Note that the pivotal quantity is generated from the parameter via the transformation . It is clear that the distribution of depends on the values of via the shift and scaling . Consequently, when comparing the SCBs in Table 1 with various ‐values, one cannot conclude that a smaller area of necessarily implies a smaller confidence set for unknown parameter .
Next, we develop a new pivotal quantity, defined as , which does not depend on the values of . Also, we define the region by
Then, the confidence set in (7) can further be represented as
Since the transformation is independent of ‐values, then a smaller area of corresponds to a smaller confidence set for unknown parameter . Hence, we measure the volume of the confidence set in terms of the area of , which is given in the theorem below.
Theorem 1
The area of is given in terms of the area of by
(8) where denotes the area of set .
By using matrix scaling and geometric scaling, we have
2.3. MACS Criterion
In this section, we propose and discuss the new optimality criterion, the MACS criterion, based on our newly introduced pivotal quantities and .
Criterion: For given values of , , the design matrix and the covariate interval of interest in (3), the optimal ‐level SCB has the smallest in (8) among all possible regions satisfying .
In order to determine the optimal SCB among all possible forms given in Table 1, the area of , , for each SCB should be computed and compared. Since depends on , as shown in Theorem 1, we next focus on deriving using the polar coordinates of the pivotal quantity .
The constraints in (6) restrict to the lemon‐shaped region in Figure 1c, bounded by two thick curved lines given in Figure 1a and given in Figure 1b. The angle is formed by the boundary vectors of over , where the boundary vectors are given by and . Then, is calculated from
| (9) |
In Figure 1, increases when increases, and the critical constants and can be determined from the equation . Note that the set in Figure 1c is partitioned into four triangles and four fans (); see Figure 1d. Thus, the set can be expressed as
Based on this expression of , we can simplify the calculation of the confidence level for SCBs in Table 1 by using the polar coordinates of as shown next.
FIGURE 1.

(a–c) The construction of ; (d) The partitioned regions of . (a) The right boundary of , denoted by . (b) The left boundary of , denoted by . (c) The area of , denoted by . (d) The partitioned regions of by using polar coordinates of .
Let be the polar coordinates of , i.e., and , where and is the angle between the vectors and . The joint density of is given by
| (10) |
where , , and ; the derivation of in (10) is given in Appendix C.
The theorem below gives an expression of the confidence level of an SCB using (10) and Figure 1d.
Theorem 2
For given and , the confidence level of the SCB can be expressed as:
The proof of Theorem 2 is given in Appendix D. From Theorems 1 and 2, for a given type of SCBs in (3) with given (, ), , design matrix and covariate interval , the critical constants of the optimal SCB under the MACS criterion can be determined by minimizing the in (8) subject to the constraint
2.4. Computation of the Critical Constants and
In this section, we provide a general algorithm for computing the critical constants using the polar coordinates of pivotal quantity and Theorem 2. For a symmetric SCB with , determining the critical constant can be simplified to a root‐finding problem, that is, computing the unique root that satisfies according to Theorem 2. Note that a symmetric SCB is just a special case of the SCB in (3). Hence, the critical constant for the symmetric SCB can be used as the starting point for searching the critical constants of the corresponding optimal MACS asymmetric SCB with . It is noteworthy that if then , and if then . We use Algorithm 1 for computing the of the optimal MACS asymmetric SCBs.
ALGORITHM 1. Compute the of the optimal MACS asymmetric SCB.

Han et al. [11] compare the SCBs in Table 1 under the AW criterion. For given values of , , the design matrix and the covariate interval of interest in (3), the optimal ‐level SCB has the smallest AW, where
| (11) |
Han et al. [11] propose two methods for computing the critical constants : a numerical quadrature method and a simulation‐based method. Next, we illustrate how the computational methods in Han et al. [11] are more time‐consuming than the method of this paper in Algorithm 1.
For the numerical quadrature method of Han et al. [11], the critical constants are computed exactly by solving equations that involve three‐dimensional integrations and indicator functions, given by
where is the joint density function of , and denotes the indicator function of the set . Due to the three‐dimensional numerical quadrature involved, it takes substantially longer computation time than the method of this paper, which uses only two‐dimensional numerical quadrature.
For the simulation‐based method of Han et al. [11], they first generate independent replicates of : for , and then compute the corresponding ‐values. Finally, the pair for an asymmetric SCB is selected such that it not only satisfies the confidence level but also minimizes the AW in (11). The computational costs are reduced considerably by using this simulation‐based method in comparison with their numerical quadrature method. However, iterations are required to ensure the critical constants are accurate to at least two decimal places.
In contrast, the new method does not involve a Monte Carlo simulation and takes only a few seconds to compute the critical constants. This enables the dynamic construction of SCBs to be performed efficiently.
To highlight the improvements in computational efficiency, we conducted a simulation study to compare our new method with the simulation‐based approach. Since the numerical quadrature method is even more time‐consuming than the simulation‐based method, we do not compare our new method with the numerical quadrature method in this simulation study. Our numerical results show that the computation of critical constants using our newly proposed method takes about 20 s for symmetric SCBs and about 200 s for asymmetric SCBs on an ordinary Windows PC (Intel(R) Core(TM) i7‐6700 CPU with 3.40 GHz, 3.41 GHz, RAM 16.0 GB). In contrast, the simulation‐based method in Han et al. [11] takes over s for symmetric SCBs and no less than s for asymmetric SCBs on the same Windows PC, using 1 000 000 simulations. This decrease in time by about one magnitude demonstrates that our newly proposed method is significantly more efficient. Detailed computational costs for various scenarios are provided in the Supporting Information.
3. Simulation Study for Evaluating the Performance of SCBs
In our numerical comparisons, we assess the performance of symmetric and asymmetric SCBs within each type (Type bands with and Type bands with ), as well as compare the performance across Type and Type bands, in various scenarios.
3.1. Design of Simulation Study
Note that the critical constants for SCBs and the areas of regions in (6) depend on the design matrix , the interval , , , and . Based on this, we design the simulation study as follows. Given the data , we have the mean and the sum of squares . Here, we focus on the case where , i.e., the interval is symmetric around . Let . Recall in (6). The then depends on the value of , which lies in the interval and is symmetric about 0. Note that , then the value of increases as increases. Thus, the critical constants depend on the design matrix and the covariate interval via . In this case, the critical constants and the areas of regions depend only on , , , and .
In total, 12 different exact SCBs are considered under 24 different scenarios. The 12 SCBs include both the symmetric and asymmetric bands of the six forms given in Table 1. The 24 scenarios are derived from different combinations of the confidence level , percentile , sample size , and value of . In our numerical comparison, the settings we used are as follows: (i) confidence level: ; (ii) percentile: ; (iii) sample size: ; and (iv) .
3.2. Measuring Performance of SCBs
The main goal of the simulation study is to identify the optimal form of SCBs under the MACS criterion. To make the evaluation more concrete and quantifiable, we assess the performance of different SCBs by comparing the ratios of areas of the regions .
Suppose two bands and have regions and , respectively, for given values of , , , , and . Clearly the ratio
| (12) |
is of interest for comparing bands and under the MACS criterion. When , the area of is smaller and the corresponding confidence set for unknown parameter is smaller, and so is better than ; otherwise, is better than .
3.3. Simulation Results
The results of the simulation study are summarized here using the ‐ratio in (12). According to (9), the ‐values for each SCB depend on and only through . For Type bands, the ‐values are the same because is fixed at . However, for Type bands, the ‐values depend on different ‐values. Even with different ‐values, the ‐values for all Type bands remain the same up to two decimal places for given and . In Table 2, and denote the ‐values for Type and Type bands, respectively.
TABLE 2.
Ratios , relative to UVa, of for SCBs: TBE, TBEa, UV.
|
|
|
|
|
|
|
TBE | TBEa | UV | UVa | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.90 | 0.75 | 10 | 0.1 | 0.613 | 0.543 | 1.003 | 1.003 | 1.020 | 1 | ||||||
| 1 | 2.529 | 2.451 | 1.007 | 1.004 | 1.023 | 1 | |||||||||
| 10 | 3.078 | 3.070 | 1.003 | 0.998 | 1.020 | 1 | |||||||||
| 100 | 0.1 | 1.571 | 1.466 | 1.012 | 1.010 | 1.001 | 1 | ||||||||
| 1 | 2.942 | 2.920 | 1.006 | 1.006 | 1.001 | 1 | |||||||||
| 10 | 3.122 | 3.119 | 1.006 | 1.006 | 1.001 | 1 | |||||||||
| 0.95 | 10 | 0.1 | 0.613 | 0.543 | 1.014 | 1.013 | 1.081 | 1 | |||||||
| 1 | 2.529 | 2.451 | 1.124 | 1.118 | 1.090 | 1 | |||||||||
| 10 | 3.078 | 3.070 | 1.098 | 1.084 | 1.071 | 1 | |||||||||
| 100 | 0.1 | 1.571 | 1.466 | 1.087 | 1.087 | 1.003 | 1 | ||||||||
| 1 | 2.942 | 2.920 | 1.112 | 1.111 | 1.003 | 1 | |||||||||
| 10 | 3.122 | 3.119 | 1.112 | 1.107 | 1.003 | 1 | |||||||||
| 0.99 | 0.75 | 10 | 0.1 | 0.613 | 0.543 | 1.211 | 1.004 | 1.286 | 1 | ||||||
| 1 | 2.529 | 2.451 | 1.155 | 0.992 | 1.172 | 1 | |||||||||
| 10 | 3.078 | 3.070 | 1.126 | 0.968 | 1.143 | 1 | |||||||||
| 100 | 0.1 | 1.571 | 1.466 | 1.033 | 1.019 | 1.021 | 1 | ||||||||
| 1 | 2.942 | 2.920 | 1.023 | 1.007 | 1.015 | 1 | |||||||||
| 10 | 3.122 | 3.119 | 1.022 | 1.005 | 1.015 | 1 | |||||||||
| 0.95 | 10 | 0.1 | 0.613 | 0.543 | 1.581 | 1.017 | 1.731 | 1 | |||||||
| 1 | 2.529 | 2.451 | 1.770 | 1.056 | 1.464 | 1 | |||||||||
| 10 | 3.078 | 3.070 | 1.618 | 0.906 | 1.317 | 1 | |||||||||
| 100 | 0.1 | 1.571 | 1.466 | 1.197 | 1.144 | 1.070 | 1 | ||||||||
| 1 | 2.942 | 2.920 | 1.267 | 1.179 | 1.046 | 1 | |||||||||
| 10 | 3.122 | 3.119 | 1.264 | 1.162 | 1.044 | 1 |
Based on our numerical results within each type, asymmetric SCBs have smaller areas of and are better than symmetric SCBs, for both Type and Type bands. For Type bands, asymmetric SCBs (SBa, TBUa, TBEa) outperform symmetric SCBs (SB, TBU, TBE) by up to about 100% when both and are large. Among the three symmetric Type bands, TBE has the smallest area of , and so TBE is the best in terms of MACS. The difference in the areas of among the three asymmetric Type bands is small. For Type bands, asymmetric SCBs (Va, TTa, UVa) are better than symmetric SCBs (V, TT, UV), by as much as about 80% when is large and is small. Among the three symmetric Type bands, the V band consistently performs slightly worse than the UV band under the MACS criterion, and therefore, the V band is not recommended. The differences in the areas of between UV and TT are small, so either band can be used. Additionally, the differences in the areas of among the three asymmetric Type bands are small. In general, an asymmetric SCB is better than the corresponding symmetric SCB according to the MACS criterion. The detailed numerical results are provided in the Supporting Information.
In the comparative analysis across Type and Type bands, we focus on the performance of the TBE, TBEa, UV, and UVa bands under the MACS criterion, as TBE and UV are the best Type band and Type band, respectively. Table 2 presents the ‐ratios of for TBE, TBEa, and UV relative to UVa. Note that the ‐ratios for symmetric bands are always larger than one, indicating that the asymmetric bands (TBEa and UVa) outperform the symmetric bands (TBE and UV). When , UVa is the best, with the for UVa consistently smaller than that for TBEa. When , UVa is the best in most cases, while TBEa performs best if the ‐values in the training dataset are dispersed (i.e., and ).
Based on the results in Table 2, we conduct a comparison between TBEa and UVa in Figure 2 with , and . The ‐values for Type bands are utilized as ‐axis. Figure 2 reveal that the ‐ratio is below 1 for large and (e.g., and ). Therefore, we can conclude that TBEa is better than UVa when both and are large, and is small. It is also noteworthy that and are larger than 3 when , indicating that the ‐values of the training dataset are dispersed. Hence, asymmetric Type bands should be used only for the training dataset having dispersed ‐values, a relatively large confidence level , and a small sample size . Therefore, asymmetric Type bands, like UVa, are recommended in general.
FIGURE 2.

The ratios of for TBEa relative to UVa, with , , and . (a) , (b) .
4. Illustrative Example
In order to illustrate how to determine the shelf‐life of a drug based on the method of SCBs for a percentile line, we revisit the real data example on drug stability from Ruberg and Hsu [17]. We also provide a visual demonstration of the SCBs and their corresponding constrained regions . For the data from the first batch of Experiment One in Ruberg and Hsu [17], the fitted model is with , and .
For given , , , and , Table 3 shows the ‐ratios of areas of regions relative to the UVa band, including (i) the symmetric SCBs (SB, TBU, TBE, V, TT, UV) and (ii) the asymmetric SCBs (SBa, TBUa, TBEa, Va, TTa, UVa). It is clear that the ‐values for asymmetric SCBs are substantially smaller than those for symmetric SCBs under the MACS criterion. Based on the numerical results, we can conclude that the UVa band is the best under the MACS criterion in this example.
TABLE 3.
Ratios of relative to the UVa band for the drug stability data.
| Bands | Ratio | Bands | Ratio | Bands | Ratio | Bands | Ratio | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
1.579 |
|
1.425 |
|
1.089 |
|
1.002 | ||||
|
|
1.473 |
|
1.334 |
|
1.089 |
|
1.000 | ||||
|
|
1.235 |
|
1.335 |
|
1.091 |
|
1 |
Note: SB, TBU, TBE, V, TT and UV are the symmetric SCBs. SBa, TBUa, TBEa, Va, TTa and UVa are the asymmetric SCBs.
In order to visually demonstrate the disparity in the areas of constrained regions , we consider the for TBE, UV, TBEa, and UVa, since TBE and UV represent the best options among Type and Type bands, respectively. Figure 3a gives the areas of for TBE, UV, TBEa, and UVa using a solid line, dash line, dash‐dotted line, and dotted line, respectively. It is shown in Figure 3a that for UVa is the smallest.
FIGURE 3.

(a) The areas of for TBE, UV, TBEa, and UVa in the coordinates (, ); (b) The 95% asymmetric band UVa for the 5th percentile.
In order to illustrate the estimation of shelf‐life based on SCBs for percentiles in linear regression, the UVa band is used, as it has been identified as the optimal choice from Table 3 and Figure 3a. In Figure 3b, the estimated percentile line with is shown by the solid line, and the UVa band is given by the dotted lines. The critical constants () for UVa are (3.230, 2.016). For the given threshold , which is given by the dash‐dotted line, one can infer from the UVa band that, the percentile line is above before the time point with confidence level, and so at least proportion of all the dosage units have drug content above by this time point. But beyond the time point , the percentile line is below with confidence level, and so less than proportion of all the dosage units have drug content above . It should be noted that the true shelf‐life at which can be anywhere in the interval .
In general, according to the area of in Figure 3a and the numerical results in Table 3, the asymmetric Type bands (Va, UVa and TTa) have smaller volume of confidence sets for unknown parameter and should be used under the MACS criterion.
5. Conclusion
Drug stability studies are an important part of any pharmaceutical drug development programme. The interval estimation of true shelf‐life based on the mean drug content of a selected dosage unit only guarantees that half of all the dosage units have the drug content no less than the pre‐specified level by the estimated shelf‐life. This may not be satisfactory from the patients' point of view unless the therapeutic effect of a dosage unit with drug content below is well understood.
In this paper, the true shelf‐life is set as the time point at which no more than of all the dosage units will have the drug content less than a pre‐specified lowest acceptable limit , where is a pre‐specified number close to 0. Constructing the exact ‐level SCBs for the percentile line in linear regression is the appropriate way for estimating the shelf‐life in this situation.
In this paper, we propose the MACS criterion, allowing for the comparison of different types of SCBs for percentile lines based on the area of the region for our new pivotal quantity . In addition, our newly proposed method of computing the critical constants of SCBs for percentile lines is more efficient than the methods previously available in the statistical literature. Moreover, the optimal SCB under the MACS can be used to construct the interval estimation of shelf‐life in drug stability studies.
Based on our numerical results, we observe that the optimal MACS asymmetric bands are inherently superior to the corresponding symmetric bands under the MACS criterion, as expected, making them the preferred choice. In most cases, the asymmetric Type bands perform better than the asymmetric Type bands. Therefore, asymmetric Type bands, like UVa, are recommended.
Although this paper only focuses on the simple linear regression model, the proposed method can be extended to multiple regression and polynomial regression. For multiple regression involving more than one covariate, the MACS‐based method can be applied, and the polar coordinates of can be directly used to calculate the critical constants. The computational cost remains comparable to that of simple linear regression, taking approximately 10 s for symmetric SCBs and about 200 s for asymmetric SCBs. The computation time does not increase with the number of covariates. For polynomial regression, the critical constants cannot be determined directly using the polar coordinates of the pivotal quantity . In such cases, a simulation‐based approach can be employed instead. The computation time increases due to the optimization involved. With 1 000 000 simulations, the computation time is approximately s for symmetric SCBs and over s for asymmetric SCBs. These times can be substantially reduced through parallel computing with an increased number of cores.
Furthermore, the MACS criterion can be used to select optimal simultaneous tolerance intervals for linear regression, which is currently under research and will be reported separately in the future.
Conflicts of Interest
The authors declare no conflicts of interest.
Supporting information
Data S1. The Supporting Information provides the computational costs of our newly proposed method and the previous simulation‐based method, as well as the additional numerical results in Section 3.
Appendix A. Derivation of the confidence set
To derive the confidence set for unknown parameters according to the region , we need identify the relation between and the pivotal quantity . Recall , , and as defined in (7), we have
Hence, the confidence set can be expressed as
Appendix B. Derivation of
Set and , where and are defined in (4). The in (8) can be calculated for the following situations.
When ,
When ,
- When ,
-
(1)if , ;
-
(2)if , .
-
(1)
Appendix C. Joint density function of
Since , and with in (4) are independent, the joint density function of is
| (A1) |
Now, consider the pdf for with
where , and . Define the random variable , so that , and can be solved in terms of , and as
| (A2) |
This gives the following Jacobian matrix and determinant
| (A3) |
It follows immediately that the joint density of , and is given by
Combining ((A1), (A2), (A3)) gives
Integrating out gives the (marginal) density of and
Appendix D. Proof of Theorem 2
Before we prove the Theorem 2, we need the following two lemmas.
Lemma 1
Let be the pivotal quantity in ( 5 ) and be the region in ( 6 ). For given critical constants and , the confidence level of the SCB in ( 3 ) is
In order to calculate the probability , all three different situations are considered below. The angles and are defined in Figure 1a,b, while the angles , , , , and are defined in Figure 1d.
When , we have , , and .
When , we have , , and .
- When ,
-
(1)if , we have , , and ;
-
(2)if ,
-
(i)suppose , , , , , , and ;
-
(ii)suppose , , , , , , and .
-
(i)
-
(1)
The next lemma provides the derivation of using a method involving partitions and polar coordinates .
Lemma 2
For any SCB in ( 3 ), the corresponding in Figure 1 d is partitioned as:
where
From Lemmas 1 and 2, we can derive Theorem 2.
Following Lemmas 1 and 2, the confidence level of the SCB in (3) given and is
(A4) The Equation (A4) is based on the independence of regions and , .
Wang L., Han Y., Liu W., and Bretz F., “Minimum Area Confidence Set Optimality for Simultaneous Confidence Bands for Percentiles With Applications to Drug Shelf‐Life Estimation,” Statistics in Medicine 44, no. 20‐22 (2025): e70184, 10.1002/sim.70184.
Funding: The authors received no specific funding for this work.
Data Availability Statement
The R code for the simulation studies and the real data example is available at https://github.com/Lingjiao‐WANG/MACS; the results of these analyses are presented in the paper and the Supporting Information.
References
- 1. International Conference on Harmonization , “Topic Q1A (R2): Stability Testing of New Drug Substances and Products,” (2003), https://database.ich.org/sites/default/files/Q1A%28R2%29%20Guideline.pdf.
- 2. International Conference on Harmonization , “Topic Q6A: Specifications: Test Procedures and Acceptance Criteria for New Drug Substances and New Drug Products: Chemical Substances,” (1999), https://database.ich.org/sites/default/files/Q6AGuideline.pdf. [PubMed]
- 3. Kiermeier A., Jarrett R. G., and Verbyla A. P., “A New Approach to Estimating Shelf‐Life,” Pharmaceutical Statistics 3, no. 1 (2004): 3–11. [Google Scholar]
- 4. Quinlan M., Stroup W., Schwenke J., and Christopher D., “Evaluating the Performance of the ICH Guidelines for Shelf Life Estimation,” Journal of Biopharmaceutical Statistics 23, no. 4 (2013): 881–896. [DOI] [PubMed] [Google Scholar]
- 5. Liu W., Hsu J. C., Bretz F., Hayter A. J., and Han Y., “Shelf‐Life and Its Estimation in Drug Stability Studies,” Journal of Applied Statistics 41, no. 9 (2014): 1989–2000. [Google Scholar]
- 6. Ruberg S. J. and Stegeman J. W., “Pooling Data for Stability Studies: Testing the Equality of Batch Degradation Slopes,” Biometrics 47, no. 3 (1991): 1059–1069. [PubMed] [Google Scholar]
- 7. Steinhorst R. K. and Bowden D. C., “Discrimination and Confidence Bands on Percentiles,” Journal of the American Statistical Association 66, no. 336 (1971): 851–854. [Google Scholar]
- 8. Turner D. L. and Bowden D. C., “Simultaneous Confidence Bands for Percentile Lines in the General Linear Model,” Journal of the American Statistical Association 72, no. 360a (1977): 886–889. [Google Scholar]
- 9. Turner D. L. and Bowden D. C., “Sharp Confidence Bands for Percentile Lines and Tolerance Bands for the Simple Linear Model,” Journal of the American Statistical Association 74, no. 368 (1979): 885–888. [Google Scholar]
- 10. Thomas D. L. and Thomas D. R., “Confidence Bands for Percentiles in the Linear Regression Model,” Journal of the American Statistical Association 81, no. 395 (1986): 705–708. [Google Scholar]
- 11. Han Y., Liu W., Bretz F., and Wan F., “Simultaneous Confidence Bands for a Percentile Line in Linear Regression,” Computational Statistics and Data Analysis 81 (2015): 1–9. [Google Scholar]
- 12. Gafarian A. V., “Confidence Bands in Straight Line Regression,” Journal of the American Statistical Association 59, no. 305 (1964): 182–213. [Google Scholar]
- 13. Liu W., Ah‐kine P., and Zhou S., “Simultaneous Confidence Bands for Linear Regression With Covariates Constrained in Intervals,” Scandinavian Journal of Statistics 39, no. 3 (2012): 543–553. [Google Scholar]
- 14. Liu W. and Hayter A. J., “Minimum Area Confidence Set Optimality for Confidence Bands in Simple Linear Regression,” Journal of the American Statistical Association 102, no. 477 (2007): 181–190. [Google Scholar]
- 15. Liu W., Lin S., and Piegorsch W. W., “Construction of Exact Simultaneous Confidence Bands for a Simple Linear Regression Model,” International Statistical Review 76, no. 1 (2008): 39–57. [Google Scholar]
- 16. Liu W. and Ah‐kine P., “Optimal Simultaneous Confidence Bands in Simple Linear Regression,” Journal of Statistical Planning and Inference 140, no. 5 (2010): 1225–1235. [Google Scholar]
- 17. Ruberg S. J. and Hsu J. C., “Multiple Comparison Procedures for Pooling Batches in Stability Studies,” Technometrics 34, no. 4 (1992): 465–472. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data S1. The Supporting Information provides the computational costs of our newly proposed method and the previous simulation‐based method, as well as the additional numerical results in Section 3.
Data Availability Statement
The R code for the simulation studies and the real data example is available at https://github.com/Lingjiao‐WANG/MACS; the results of these analyses are presented in the paper and the Supporting Information.
