Abstract
In this paper, we consider the problem of making statistical inference for a truncated normal distribution under progressive type I interval censoring. We obtain maximum likelihood estimators of unknown parameters using the expectation-maximization algorithm and in sequel, we also compute corresponding midpoint estimates of parameters. Estimation based on the probability plot method is also considered. Asymptotic confidence intervals of unknown parameters are constructed based on the observed Fisher information matrix. We obtain Bayes estimators of parameters with respect to informative and non-informative prior distributions under squared error and linex loss functions. We compute these estimates using the importance sampling procedure. The highest posterior density intervals of unknown parameters are constructed as well. We present a Monte Carlo simulation study to compare the performance of proposed point and interval estimators. Analysis of a real data set is also performed for illustration purposes. Finally, inspection times and optimal censoring plans based on the expected Fisher information matrix are discussed.
Keywords: Bayes estimation, EM algorithm, inspection times, optimal censoring plans, probability plot, truncated normal distribution
2010 Mathematics Subject Classifications: 62N01, 62N02
1. Introduction
The application of life testing experiments is very common in many practical problems arising in multiple branches of science and engineering including mortality analysis, clinical trials, studies related to industry, reliability estimation, etc. In general, such experiments are conducted under various constraints such as cost and time limits, unintentional breakage of units, drop out of subjects from a study and so on. Thus it may not be possible to record exact failure times of all items placed on a test and in sequel, recorded observations appear as censored in nature. Censoring is very common in such studies and indicates situations where failure times of test units are available only for a portion of the total items. In such cases, only partial information in the form of some bound is known on failure times of units that had not failed. In literature, various censoring methodologies have been proposed to obtain censored data. Type I and type II censoring are probably the most commonly used methods in this regard. In type I censoring failure times are recorded up to a specified time point and failures occurring afterward this fixed time are not recorded. On the other hand in type II censoring a test continues until a prespecified number of failure times have been recorded. These two methods share one common feature that live units can be withdrawn only at the termination point of the test. In progressive censoring items put on a test can be withdrawn during the experimentation as well and hence this censoring is more flexible in nature compared to the other two basic schemes. Progressive censoring has been widely studied in lifetime analysis by various researchers. One may refer to Balakrishnan and Cramer [7] for an exhaustive list of references on this topic and also for a detailed discussion on its applications in life testing experiments. Sometimes it is relatively difficult to record exact failure times of items due to lack of continuous monitoring of subjects under study. In such situations, observations are often recorded in intervals and corresponding censoring is referred to as the interval censoring. Aggarwala [1] initially discussed progressive type I interval censoring in literature and studied an exponential distribution using this censoring. Since then this censoring has attracted some attention among researchers. Progressive type I interval censoring can briefly be described as follows. Suppose that a total of n identical units is placed on a life test experiment at time . In this experiment units are inspected at m prespecified time points such that where denotes the time at which experiment terminates. Now let number of failure times are observed within the interval and also number of live units are randomly withdrawn from the experiment at the inspected time , . We further note that the number of surviving units at each inspection time is random in nature and therefore should not be greater than . Usually is obtained using the prescribed percentage of the remaining surviving units at each for with and equivalently, we have where denotes the greatest integer less than or equal to a. In fact, can also be prespecified non-negative integers. In such situations, the actual number of surviving units removed at is given by = min(, number of surviving units at inspection time ), and = number of surviving units at inspection time . In this paper, we denote the observed progressive type I interval censoring scheme as . We denote the corresponding progressive type I interval-censored data as , where . Note that this censoring scheme corresponds to the traditional type I censoring for the case . At recent past this censoring scheme has received some attention among various researchers. Huang and Wu [17] obtained reliability sampling plans for exponential lifetimes under some cost constraint. Amin [4] computed MLEs and Bayes estimates of unknown log-normal parameters under progressive type I interval censoring. Further Lin et al. [22] established optimally spaced inspection times for this distribution. Ng and Wang [23] studied the Weibull distribution using different estimation methods. Chen and Lio [11] analyzed a generalized exponential distribution and computed estimates for unknown parameters using different estimation procedures. Cheng et al. [14] provided a useful algorithm to derive MLEs of model parameters under progressive type I interval censoring. Lin and Lio [21] computed Bayes estimates of parameters of generalized exponential and Weibull distributions. Chen et al. [12] further considered interval estimation for the generalized exponential distribution. Kus et al. [19] studied optimal plans for the Pareto distribution under some budget limit, see also Ismail [18]. Ahmadi and Yousefzadeh [2] studied the problem of estimating unknown parameters of a generalized half-normal distribution. Wu and Lin [33,34] obtained inference for lifetime performance index assuming the exponential and Weibull distributions. Singh and Tripathi [27] studied an inverse Weibull distribution under this censoring scheme. Recently Budhiraja et al. [10] obtained some interesting results for the maximum likelihood estimates under this censoring. Wang et al. [29] analyzed the mixed generalized exponential distribution based on progressive type I interval censoring. Wu et al. [35] and Wu [30] obtained interesting results for lifetime performance index when product lifetime follows Rayleigh and Chen distributions respectively. Budhiraja and Pradhan [8,9] established optimal plans under progressive type I interval censoring based on some cost function. Zhao and Bordes [36] obtained optimal censoring for step-stress models. Arabi Belaghi et al. [5] considered the problem of estimating unknown parameters of a Burr XII distribution from frequentist and Bayesian contexts. Azizi et al. [6] studied a competing risk model under progressive type I interval censoring assuming Weibull distributions. Du et al. [16] derived inference for entropy by considering the log-logistic distribution. Wu and Hsieh [32] obtained inference for the lifetime performance index for the Gompertz distribution. Recently, Malevich and Muller [24] established optimal design plans of inspection times and Roy and Pradhan [26] obtained Bayesian life testing plans under this censoring scheme.
In this paper, we have considered making inference for a truncated normal distribution under progressive type I interval censoring. The density function of this distribution is given by
| (1) |
where μ and τ denote unknown parameters and is the standard normal distribution function. We denote this distribution as . The parameter μ denotes the mode of the distribution. The corresponding cumulative distribution function (CDF) and the survival function (SF) are obtained as, respectively,
| (2) |
and
| (3) |
To our knowledge this distribution has not been studied yet using the progressive type I interval censoring. The purpose of this paper is two-fold. We first obtain some classical and Bayes estimates of unknown parameters μ and τ. Both point and interval estimation are considered. The maximum likelihood estimates (MLEs) of unknown parameters are derived using an expectation-maximization algorithm. We also obtain corresponding estimates using midpoint approximation and probability plot methods. The asymptotic intervals are constructed using the observed Fisher information matrix. Bayes estimates are derived under the squared error and linex loss functions using both proper and improper prior distributions. In sequel, highest posterior density (HPD) intervals of unknown parameters are also constructed. The second aim of the paper is to present inspection times and optimal plans under progressive type I interval censoring. Optimal plans have become extremely popular in reliability and life testing experiments in recent years.
We have organized the rest of this paper as follows. In Section 2, we compute maximum likelihood estimates of unknown parameters μ and τ using an EM algorithm. We further use this method to obtain asymptotic intervals from the observed Fisher information matrix. In this section, we also provide midpoint and probability plot estimates of both parameters under progressively type I interval-censored data. Bayes estimates of unknown parameters are derived with respect to squared error and linex loss functions in Section 3 using the importance sampling procedure. We further obtain highest posterior density (HPD) intervals using the importance sampling. A simulation study is conducted in Section 4 to compare the performance of proposed methods of estimation. A real data set is analyzed in Section 5 for illustration purposes. In Section 6, we discuss inspection times and optimal censoring plans under progressive type I interval censoring. Some concluding remarks are given in Section 7.
2. Likelihood estimation
In this section, we compute maximum likelihood estimates of unknown parameters of a distribution under progressive type I interval censoring. Some more classical estimates are obtained using midpoint approximation and probability plot method of estimation. Asymptotic intervals are obtained from the observed Fisher information matrix.
2.1. Maximum likelihood estimates
Suppose that denotes a progressive type I interval censored sample of size m from a distribution as defined in (1). The likelihood function of μ and τ based on this observed data can be written as
| (4) |
where is the standard normal cumulative distribution function. The log-likelihood function is then given by
| (5) |
and likelihood equations of μ and τ are then obtained as
| (6) |
and
| (7) |
We observe that above likelihood equations have no analytical solutions for the MLEs and of unknown parameters μ and τ, respectively. In fact, these equations are nonlinear in nature and we can solve them numerically by making use of some iterative technique. Here we obtain desired MLEs using an EM algorithm. This method is initially discussed in Dempster et al. [15] and is very useful for computing MLEs particularly in situations where observed data is censored in nature. Suppose that , denotes the lifetimes of jth item failed within the interval and , denotes the lifetimes for those items withdrawn at time for . Then, the log-likelihood function of μ and τ under the complete data set is obtained as
| (8) |
In the E-step of the EM algorithm we compute the pseudo log-likelihood function and this is obtained as
| (9) |
In order to evaluate the above expression we need to compute the involved conditional expectations which are given below. Noticing that and follow the truncated normal distribution with parameters μ and τ, we find that we have
| (10) |
| (11) |
| (12) |
and
| (13) |
Next we maximize the pseudo-log-likelihood function with respect to unknown parameters in the M-step of the EM algorithm. If be an estimate of at the kth stage of iteration then we can compute the th stage estimate of unknown parameters by maximizing as given in (9). We further observe that can be obtained by solving the equation
| (14) |
Then updated estimate of τ can be computed as follows
| (15) |
where , , , and denote conditional expectations at the kth iteration. We perform this iterative process till a desired convergence is achieved. Next we obtain confidence intervals of unknown parameters μ and τ using asymptotic property of the maximum likelihood estimates. In this connection, we first obtain the observed Fisher information matrix of MLEs of μ and τ which is given by
and involved expressions are reported in Appendix. Subsequently asymptotic confidence intervals for μ and τ can be obtained as and , respectively, where is the upper th percentile of the standard normal distribution.
2.2. Midpoint approximation method
In this section, we estimate unknown parameters of a distribution using the mid point approximation method. We assumed that number of failures is observed at the center of ith interval and also number of units are censored at the inspection time , . The likelihood function of μ and τ is then obtained as
The desired estimates of μ and τ can be obtained by maximizing the following log-likelihood function
| (16) |
Subsequently, we need to solve the following system of equations to obtain the midpoint estimates of unknown parameters:
| (17) |
and
| (18) |
Likelihood Equations (17) and (18) cannot be solved analytically due to their nonlinear nature. Here we have used the EM algorithm to obtain the respective estimates of μ and τ.
2.3. Estimation based on the probability plot
Let , with denotes a progressive type I interval censored sample from a distribution. Based on this sample the cumulative distribution function at time can be estimated as
| (19) |
where
We also note that
| (20) |
Now minimizing the expression with respect to μ and τ leads to estimates of unknown parameters of the TN distribution based on probability plot method. We mention that these estimates can be computed numerically using some nonlinear optimization technique.
3. Bayesian estimation
Here we obtain different Bayes estimates of unknown parameters of a distribution under symmetric and asymmetric loss functions. In many Bayes estimation problems squared error loss function is applied which penalizes equally to under- and over-estimation. In many practical studies consequences of over- and under-estimation are not symmetric in nature. For example in reliability and life testing studies over-estimation may be treated more serious than under-estimation. Generally in survival analysis over-dispersed data are studied more carefully than under-dispersed data. In such situations, an asymmetric loss function can be used to derive inference upon unknown quantities. Here we obtain Bayes estimates of μ and τ under squared error and linex loss functions. The squared error loss is given by
where denotes an estimator of , a function of the parameter η. In this case, the posterior mean of denotes its Bayes estimate. In literature, linex is one of the most commonly used asymmetric loss function (see [28]) and it is defined as
where . The corresponding estimate of is given by
We obtain Bayes estimates of both the unknown parameters using informative and non-informative prior distributions. The informative prior distribution π of μ and τ is considered as
where is assumed to follow a distribution and has an inverse gamma distribution. Also a, b, p and q denote hyper-parameters and reflect prior knowledge about the unknown parameters. We see that the joint prior distribution of μ and τ can be written as
| (21) |
Using (5) and (21), the joint posterior distribution of μ and τ turns out to be
| (22) |
where K is the normalizing constant such that
Now Bayes estimate of , a function of μ and τ, under the loss function is obtained as
| (23) |
Similarly for the loss, we have
| (24) |
where .
We note that both Bayes estimators appear as the ratio of two integrals which are difficult to simplify in closed forms. So some approximation technique is required to evaluate such integrals. Here we make use of an importance sampling technique to compute Bayes estimators of unknown parameters of a distribution. The highest posterior density (HPD) intervals of μ and τ are also obtained using progressive type I interval-censored samples. The posterior distribution in Equation (22) is of the form
We re-express this distribution as
| (25) |
where
We use the following steps to generate samples from the posterior distribution .
Generate τ from .
Generate μ from .
Repeat step 1 and step 2, t times to obtain samples , , , .
Bayes estimates of parameters μ and τ under squared loss and linex loss functions are now obtained as, respectively,
| (26) |
and
| (27) |
Next we construct HPD intervals of unknown parameters of the distribution using the method of Chen and Shao [13]. For the sake of completeness, we illustrate HPD interval of the parameter μ using this method. Let and respectively denote posterior density function and posterior distribution function of μ. Also let be such that , 0<p<1. We first obtain a simulation consistent estimate of the pth quantile of the posterior distribution of μ. Assume that
Arrange , as where , and is associated with for . Then a simulation consistent estimate of is where is given by
Now credible intervals of μ are obtained as for . Then HPD interval of μ is given by where is such that for all κ. The HPD interval of the parameter τ can be obtained similarly.
Remark 3.1
We have also obtained Bayes estimates of unknown parameters of the distribution under a non-informative prior distribution of μ and τ given as
The corresponding joint posterior distribution given the progressive type I interval censored data turns out to be
where is the normalizing constant such that
We have used importance sampling to obtain desired estimates of both unknown parameters under squared error and linex loss functions. For the sake of conciseness, we have not presented the detailed calculations which is quite similar to the informative prior case.
4. Simulation study
Here we perform a Monte Carlo simulation study to compare the performance of proposed estimation methods discussed in the previous sections. We assess the behavior of all estimators in terms of their average estimates and mean square error (MSE) values. These values are computed on the basis of various progressive type I interval-censored samples drawn from a distribution when true unknown parameters are and . For a given parent sample of size n we first generate a progressive type I interval-censored sample , of size m using prespecified inspection time and censoring scheme . We have generated required samples using the algorithm as discussed in Aggarwala [1]. Following this method we first generate from a binomial distribution and given observation we calculate as . Further for , we have
with . We have obtained estimates using different inspection times , , , and and censoring schemes , and . Here denotes the type I censoring scheme and other schemes can be interpreted similarly. We compute different estimates of μ and τ under these combinations when n is assigned values as 30 and 50. We obtain MLEs of unknown parameters using the EM algorithm. The true unknown parameters are considered as the initial guess in the EM algorithm. Similarly midpoint estimates are computed. Bayes estimates of μ and τ are computed using the importance sampling procedure. These estimates are obtained under non-informative (NIN) and informative (IN) prior distributions as discussed in Section 3. The average estimates and MSE of MLEs, Midpoint (MP) and probability plot (PP) estimates are reported in Table 1. It is seen that estimates obtained using the MP method perform quite good compared to the respective maximum likelihood estimates of μ and τ as far as average estimates and MSEs are concerned. The performance of probability plot method is also appreciated. In general, maximum likelihood estimates of both the parameters have an advantage over the other two methods. In general, mean square errors of proposed estimates of both the parameters tend to decrease as sample size increases. This holds for almost all the tabulated schemes. The traditional type I censoring scheme provide marginally better estimation results compared to the other two schemes. In Table 2, we present Bayes estimates along with MSEs for both the unknown parameters. These estimates are obtained under squared error and linex loss functions for n = 30 and n = 50 using the MH algorithm. Bayes estimates under linex loss are computed for arbitrarily selected c such as and 0.5 indicating weights to under- and over-estimation, respectively. Bayes estimates under IN prior are computed when hyper-parameters are assigned as , b = 2.5, p = 3 and q = 4. From this table, we observe that estimates obtained under IN prior show superior behavior than estimates obtained under NIN prior in terms of both average estimates and MSEs values. Bayes estimates computed under NIN prior distribution compete quite good with maximum likelihood, MP and PP estimates, see Table 1. We also observe that proper Bayes estimates of both the unknown parameters perform better than all the classical estimates tabulated in Table 1. Here also estimates obtained using the type I censored data produce better estimation results compared to the other two schemes and . Moreover, we tend to get better Bayes estimates of unknown parameters with increasing sample sizes. In Table 3, we have presented the asymptotic confidence and HPD interval estimates of unknown parameters μ and τ for proposed censoring schemes. We have presented coverage percentage and average interval length of both interval estimates. We observe that asymptotic intervals compete good with respective non-informative HPD intervals as far as coverage probability and interval length are concerned. The HPD intervals of both the unknown parameters obtained using the IN prior distribution show superior behavior in this respect. It is also observed that interval length tend to decrease with an increase in sample sizes. Coverage probabilities obtained from the censoring remain smaller than those obtained using the schemes and .
Table 1. Maximum likelihood estimates using EM, MP and PP algorithms.
| n = 30 | n = 50 | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| μ | τ | μ | τ | ||||||||||
| C.S. | EM | MP | PP | EM | MP | PP | EM | MP | PP | EM | MP | PP | |
| Avg | 0.4839 | 0.5294 | 0.5543 | 1.0393 | 0.9167 | 0.8653 | 0.4884 | 0.5332 | 0.5147 | 1.0218 | 0.9366 | 0.9483 | |
| MSE | 0.0377 | 0.0482 | 0.0758 | 0.0847 | 0.0587 | 0.1745 | 0.0382 | 0.0341 | 0.0614 | 0.0581 | 0.0371 | 0.1159 | |
| Avg | 0.4789 | 0.5234 | 0.5152 | 1.0381 | 0.9195 | 0.8987 | 0.4831 | 0.5366 | 0.4959 | 1.0283 | 0.9339 | 0.9460 | |
| MSE | 0.0404 | 0.0442 | 0.0711 | 0.0428 | 0.0572 | 0.1462 | 0.0368 | 0.0345 | 0.0534 | 0.0260 | 0.0409 | 0.1005 | |
| Avg | 0.4853 | 0.5173 | 0.5183 | 1.0388 | 0.9445 | 0.9376 | 0.4892 | 0.5372 | 0.4918 | 1.0337 | 0.9658 | 0.9825 | |
| MSE | 0.0391 | 0.0393 | 0.0704 | 0.0775 | 0.0401 | 0.1222 | 0.0331 | 0.0212 | 0.0555 | 0.0033 | 0.0204 | 0.0721 | |
Table 2. Bayesian estimates of parameters.
| IN Prior | NIN Prior | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| Linex | Linex | ||||||||
| C.S. | Method | SEL | c = −0.5 | c = 0.5 | SEL | c = −0.5 | c = 0.5 | ||
| MH | μ | Avg | 0.48515 | 0.48643 | 0.48433 | 0.58530 | 0.42425 | 0.56615 | |
| MSE | 0.02577 | 0.02508 | 0.02665 | 0.06805 | 0.05029 | 0.04598 | |||
| MH | τ | Avg | 1.01501 | 0.98316 | 0.99848 | 1.04190 | 0.95109 | 1.02778 | |
| MSE | 0.04957 | 0.05131 | 0.04810 | 0.09997 | 0.09050 | 0.09522 | |||
| MH | μ | Avg | 0.49790 | 0.49613 | 0.48004 | 0.56525 | 0.43131 | 0.55941 | |
| MSE | 0.02157 | 0.02177 | 0.02148 | 0.05884 | 0.06117 | 0.05664 | |||
| MH | τ | Avg | 1.06997 | 0.96519 | 1.05565 | 0.90480 | 0.91288 | 1.11733 | |
| MSE | 0.05308 | 0.05419 | 0.05334 | 0.08143 | 0.08236 | 0.08142 | |||
| MH | μ | Avg | 0.51330 | 0.51042 | 0.50648 | 0.63131 | 0.46671 | 0.62616 | |
| MSE | 0.02282 | 0.02335 | 0.02239 | 0.04606 | 0.04780 | 0.04443 | |||
| MH | τ | Avg | 0.99024 | 0.99385 | 1.01769 | 0.99934 | 0.99742 | 0.99292 | |
| MSE | 0.04808 | 0.05078 | 0.04483 | 0.07184 | 0.07352 | 0.07214 | |||
| MH | μ | Avg | 0.49100 | 0.48644 | 0.48577 | 0.54494 | 0.54910 | 0.54090 | |
| MSE | 0.01689 | 0.01694 | 0.01689 | 0.04395 | 0.04543 | 0.04253 | |||
| MH | τ | Avg | 0.99550 | 1.00480 | 0.98708 | 0.83448 | 0.83927 | 0.82953 | |
| MSE | 0.04077 | 0.04285 | 0.03980 | 0.07314 | 0.07282 | 0.07353 | |||
| MH | μ | Avg | 0.52222 | 0.52609 | 0.52845 | 0.71314 | 0.71624 | 0.71011 | |
| MSE | 0.02052 | 0.02127 | 0.01981 | 0.06093 | 0.06242 | 0.05949 | |||
| MH | τ | Avg | 0.97670 | 0.98197 | 0.97128 | 0.80626 | 0.80866 | 0.80376 | |
| MSE | 0.03184 | 0.03242 | 0.03127 | 0.07166 | 0.07112 | 0.07221 | |||
| MH | μ | Avg | 0.50334 | 0.50657 | 0.50021 | 0.63939 | 0.64215 | 0.63670 | |
| MSE | 0.02512 | 0.02593 | 0.02435 | 0.05475 | 0.05621 | 0.05333 | |||
| MH | τ | Avg | 1.00301 | 1.00806 | 0.99785 | 0.88047 | 0.88328 | 0.87759 | |
| MSE | 0.03095 | 0.03158 | 0.03035 | 0.06589 | 0.06555 | 0.06625 | |||
Table 3. Coverage probabilities and average lengths of asymptotic and HPD intervals.
| HPD interval | |||||||
|---|---|---|---|---|---|---|---|
| Asymptotic interval | NIN Prior | IN Prior | |||||
| n | C.S. | μ | τ | μ | τ | μ | τ |
| 30 | 0.8990 | 0.8780 | 0.8738 | 0.8553 | 0.9428 | 0.9362 | |
| 1.01997 | 1.41021 | 0.66736 | 0.87997 | 0.62102 | 0.76932 | ||
| 0.8849 | 0.8637 | 0.8583 | 0.8436 | 0.9210 | 0.9187 | ||
| 1.05020 | 1.42186 | 0.54208 | 0.73005 | 0.53197 | 0.71021 | ||
| 0.8574 | 0.8431 | 0.8328 | 0.8384 | 0.9197 | 0.9084 | ||
| 1.02473 | 1.36021 | 0.51030 | 0.68938 | 0.48957 | 0.57843 | ||
| 50 | 0.9186 | 0.8960 | 0.8977 | 0.8646 | 0.9555 | 0.9577 | |
| 0.98956 | 1.37480 | 0.51849 | 0.62099 | 0.44868 | 0.55834 | ||
| 0.9175 | 0.8673 | 0.8883 | 0.8594 | 0.9436 | 0.9413 | ||
| 1.09532 | 1.25834 | 0.44636 | 0.46539 | 0.43178 | 0.46537 | ||
| 0.8872 | 0.8545 | 0.8664 | 0.8386 | 0.9296 | 0.9055 | ||
| 0.96392 | 1.03141 | 0.36512 | 0.35433 | 0.33315 | 0.30214 | ||
5. Real data analysis
We analyze a real data set in support of proposed estimation methods. This data set is taken from https://openmv.net/info/unlimited-time-test which was uploaded on first March 2013. This data set consists of grades from a midterm exam, as well as the time taken by the student to write the exam. It was an ‘infinite’ time midterm, so there was no time pressure to finish within the allocated period. The exam time taken by all 80 students (after divided by 100) in minutes are reported as follows:
We consider this data set as a progressive type I interval censored with n = 80, m = 4, , , and , , , . We now perform goodness of fit test to verify whether a TN distribution is a suitable model for this data set. For comparison purposes, we take into account half-normal (HN) and folded-normal (FN) distributions as well. The negative log-likelihood (NL) and Kolmogorov–Smirnov (K–S) test statistic criteria are used to assess the goodness of fit for all the competing models. Maximum distance under the K–S test given as
denotes the distance between an empirical distribution under the observed progressive type I censored data and the population distribution . The empirical CDF at each inspection time can be estimated using Equation (19). Based on the observed data we computed MLEs of unknown vector parameter of TN distributions as , the corresponding NL estimate is 17.9530 and K–S statistic value turns out to be 0.1046. For HN distribution MLE is 2.3799, NL estimate is 114.4501, and K–S test statistic value is obtained as 0.5191. Finally for FN distribution MLEs of unknown vector parameter is , NL is and K–S test statistic value is given by 0.56145. Note that smaller value of NL criterion and K–S test statistic indicate better fit to the given data set. From computed estimates we observe that the proposed TN distribution fits the data set reasonably good compare to the other two models. We next computed asymptotic confidence intervals of μ and τ as and respectively under observed data set. In addition, we report non-informative Bayes estimates of these parameters in Table 4. Estimates under the linex loss function are computed for two choices of c such as and 0.5. It is seen that estimates obtained using different methods vary marginally from each other. Finally, the corresponding non-informative HPD credible intervals for μ and τ are obtained as and respectively.
Table 4. Bayes estimates of μ and τ for real data.
| μ | τ | |
|---|---|---|
| SEL | 1.8203 | 0.8888 |
| Linex, c = −0.5 | 1.8226 | 0.8970 |
| Linex, c = 0.5 | 1.8181 | 0.8809 |
6. Inspection times and optimal censoring
Here we provide optimal inspection times when data are observed using progressive type I interval censoring. Optimal design of censoring schemes is quite important in life testing experiments. These types of inference have received considerable attention in the literature. Huang and Wu [17] studied reliability sampling plans for progressive type I interval-censored life tests under the assumption of exponential distribution. Lin et al. [22] and [20] discussed A- and D-optimal schemes for log-normal and Weibull distributions respectively. Akdogan et al. [3] and Kus et. al. [19] obtained the optimal censoring schemes for the Burr XII and Pareto distributions using cost constraints. Wu and Huang [31] also studied optimal plans under competing risks set up. Singh and Tripathi [27] obtained various inspection times and optimal censoring schemes for the inverse Weibull distribution. One can also refer to Roy and Pradhan [25,26], Budhiraja et al. [10], Arabi Belaghi et al. [5], Zhao and Bordes [36], Malevich and Muller [24] for some more useful results on optimal plans under progressive type I interval censoring. Recall that under this scheme inspection times are prescribed before the start of an experiment using a priori information related to the test. Many practical studies of interest including life testing experiments require procedures to derive efficient estimates of unknown quantities of interest. In this regard, we mention that adequate inference on effective inspection times may, in turn, yield better estimation results for unknown parameters. So it is important to study the impact of various inspection times on the effectiveness of different estimation methods. In many situations, inspection times are considered of equal length which may not be appropriate for deriving inference upon data indicating monotonic hazard rate functions. To overcome this problem an equal probability (EP) inspection times can be taken into account where the probability of expected number of failures in each inspection interval remains the same. Notice that probability that a test unit fails in the interval is given by and accordingly the expected number of failures in this duration becomes which is . Recall that per cent of units are removed at time . Thus for a given , the expected number of removed units is . Similarly for we find that we have
| (28) |
and
| (29) |
Thus time with and is the required inspection times where h represents the total degree of censoring. We have computed inspection times for various schemes using the algorithm as discussed in Singh and Tripathi [27]. We have performed these computations for varying choices of h such as . We present these results in Table 5 where a − represents the situation that experiment can be terminated only after the failure of all remaining units. Tabulated results suggest that inspection times under the type I interval censoring scheme tend to remain small which is followed by and censoring schemes respectively. We also observed that in general higher degree of censoring lead to smaller inspection times. These quantities can also be computed based on some optimality criteria where the goal may be to infer the inspection times which optimizes the considered criterion. Techniques such as genetic algorithm or simulated annealing algorithm can be used to obtain the desired inspection times. Now we compute optimal proportion or equivalently among all possible censoring schemes in a similar manner. Generally, the maximum possible censoring satisfying the relation is given by for prescribed n and h where 's are the positive integers. The statistical package ‘partitions’ in R software can be used to generate all solutions of the equation . Computational complexity may occur for relatively large values of n and m. We propose the following algorithm to obtain optimal censoring schemes (see also [27]).
Input and inspection times .
Set deg = floor(); A = rep(deg, m); B = firstblockpart(A, deg); B, ; schemes = choose((deg + m−1),(m−1)); count = 0 and k = 1.
If ( schemes) then set i = 1 and go to Step 4 else go to Step 8.
If () then calculate from the above relation (??), else go to Step 6.
If () then go to step 7 else set i = i + 1 and go to Step 4.
Set count = count + 1; If() then calculate considered criterion value correspond to the values of r, , say .
B = nextblockpart(B, A). Set k = k + 1 and go to Step 3.
Optimal censoring r corresponds to the optimum observation from all the .
Here we note that Steps 5 and 6 optimize the cost of computations as estimate of the proposed criterion can be obtained for only count number of selected censoring schemes satisfying the relation where ε may be smaller than 0.5 just to control the round off error. We have obtained two optimal censoring schemes based on minimizing and maximizing the trace of expected variance-covariance matrix of MLEs and the expected Fisher information matrix, referred to as Criterion I and Criterion II, respectively. We also note that the expected information matrix can be obtained by replacing by and with in the second-order partial derivatives of log-likelihood function with respect to μ and τ as provided in Appendix.
In Table 6, we have reported optimal censoring schemes under progressive type I interval censoring. From this table, we observe that removal pattern of the live units is, in general, not affected by an increase in sample sizes. For instance when n is 30 or 50, and for h except h = 0.20, Criterion I suggests removal of units at the first stage and Criterion II suggests removal of items at the second stage of the experiment. Also optimal censoring schemes suggest removal of many units at the last stage of the test if h = 0.20. If interest lies in computing optimal proportion instead of optimal number of units then Equation (29) can be used to obtain the desired results.
Table 5. EP inspection times.
| h | C.S. | |||||
|---|---|---|---|---|---|---|
| 0.20 | 0.29598 | 0.66793 | 1.05666 | 1.56058 | – | |
| 0.29598 | 0.57478 | 0.85963 | 1.29933 | 2.66053 | ||
| 0.29598 | 0.57478 | 0.85963 | 1.17787 | 1.58802 | ||
| 0.30 | 0.26045 | 0.58637 | 0.99474 | 1.51440 | – | |
| 0.26045 | 0.50538 | 0.75063 | 1.10685 | 1.78224 | ||
| 0.26045 | 0.50538 | 0.75063 | 1.01223 | 1.31534 | ||
| 0.40 | 0.22462 | 0.50538 | 0.88440 | 1.33155 | 2.09535 | |
| 0.22462 | 0.43600 | 0.64454 | 0.93466 | 1.39876 | ||
| 0.22462 | 0.43600 | 0.64454 | 0.85963 | 1.09301 | ||
| 0.50 | 0.18843 | 0.42442 | 0.73476 | 1.07020 | 1.48896 | |
| 0.18843 | 0.36632 | 0.54005 | 0.77455 | 1.11613 | ||
| 0.18843 | 0.36632 | 0.54005 | 0.71501 | 0.89687 |
Table 6. Optimal censoring schemes for m = 5.
| n | h | Two associated optimal censoring schemes | |
|---|---|---|---|
| 30 | 0.20 | (3, 0, 0, 0, 3) | (1, 0, 1, 2, 2) |
| 0.30 | (3, 3, 0, 1, 2) | (1, 4, 2, 0, 2) | |
| 0.40 | (8, 1, 1, 0, 2) | (6, 4, 0, 0, 2) | |
| 0.50 | (10, 2, 1, 1, 1) | (3, 11, 0, 0, 1) | |
| 50 | 0.20 | (1, 4, 0, 1, 4) | (1, 1, 2, 3, 3) |
| 0.30 | (10, 1, 0, 0, 4) | (0, 9, 1, 3, 2) | |
| 0.40 | (15, 0, 2, 0, 3) | (4, 12, 1, 1, 2) | |
| 0.50 | (19, 2, 1, 1, 2) | (17, 5, 0, 1, 2) | |
7. Conclusion
We have considered estimation of the unknown parameters of the truncated normal distribution when data are observed using progressive type I interval censoring. We computed different classical estimates of parameters using various procedures such as maximum likelihood, midpoint and probability plot methods. These methods are quite popular in literature. Recently, Ahmadi and Yousefzadeh [2] applied these procedures for computing estimates of generalized half normal distribution based on progressive type I interval-censored data. We mention that sometimes probability plot method can also be used as initial guess for unknown model parameters. We also computed Bayes estimates of parameters using importance sampling technique with respect to squared error and linex loss functions. We conducted a Monte Carlo simulation study and compared the performance of proposed estimation methods under different censoring schemes. We observed that mid point and probability plot methods provide marginally satisfactory results. Bayes method provides better estimation results under proper prior information. We also proposed two different criteria to obtain inspection times and established optimal censoring plans as well. Finally, we mention that there are some issues we are not able to address in this paper. More work is required to study some decision-theoretic properties of probability plot and midpoint estimation methods. Also we have obtained optimal plans based on maximum likelihood estimates. Various estimates can also be computed using empirical Bayesian methods. Likewise Bayesian optimal plans may also be studied, as suggested by anonymous reviewers. These are some possible future works.
Acknowledgements
Authors are thankful to reviewers for helpful comments which led to significant improvement in both content and presentation of the manuscript. Authors also thank the Editor and an Associate Editor for constructive suggestions.
Appendix.
Funding Statement
Yogesh Mani Tripathi gratefully acknowledges the partial financial support for this research work under a grant EMR/2016/001401 Science and Engineering Research Board (SERB), India.
Disclosure statement
No potential conflict of interest was reported by the authors.AUGRPThe disclosure statement has been inserted. Please correct if this is inaccurate.
References
- 1.Aggarwala R., Progressive interval censoring: some mathematical results with applications to inference, Commun. Stat.-Theory Methods 30 (2001), pp. 1921–1935. doi: 10.1081/STA-100105705 [DOI] [Google Scholar]
- 2.Ahmadi K. and Yousefzadeh F., Estimation for the parameters of generalized half-normal distribution based on progressive type I interval censoring, Commun. Stat.-Simulation Comput. 44 (2015), pp. 2671–2695. doi: 10.1080/03610918.2013.842590 [DOI] [Google Scholar]
- 3.Akdogan Y., Kus C., and Wu S.J., Planning life tests for burr XII distributed products under progressive group-censoring with cost considerations, G U Fen Bilimleri Dergisi(G. U. J. Sci.) 25 (2012), pp. 425–434. [Google Scholar]
- 4.Amin Z.H., A note on the parameter estimation for the lognormal distribution based on progressively type I interval censored samples, Model. Assist. Stat. Appl. 3 (2008), pp. 169–176. [Google Scholar]
- 5.Arabi Belaghi R., Noori Asl M., and Singh S., On estimating the parameters of the Burr XII model under progressive type I interval censoring, J. Stat. Comput. Simul. 87 (2017), pp. 3132–3151. doi: 10.1080/00949655.2017.1359600 [DOI] [Google Scholar]
- 6.Azizi F., Haghighi F., and Tabibi Gilani N., Statistical inference for competing risks model under progressive interval censored Weibull data, Commun. Stat.-Simulation Comput. 7 (2018), pp. 1–14. doi: 10.1080/03610918.2018.1510522 [DOI] [Google Scholar]
- 7.Balakrishnan N. and Cramer E., The Art of Progressive Censoring, Springer, New York, 2014. [Google Scholar]
- 8.Budhiraja S. and Pradhan B., Computing optimum design parameters of a progressive type I interval censored life test from a cost model, Appl. Stoch. Models. Bus. Ind. 33 (2017), pp. 494–506. doi: 10.1002/asmb.2251 [DOI] [Google Scholar]
- 9.Budhiraja S. and Pradhan B., Point and interval estimation under progressive type I interval censoring with random removal, Stat. Papers (2017), pp. 1–33. doi: 10.1007/s00362-017-0948-y. [DOI] [Google Scholar]
- 10.Budhiraja S., Pradhan B., and Sengupta D., Maximum likelihood estimators under progressive type I interval censoring, Stat. Probab. Lett. 123 (2017), pp. 202–209. doi: 10.1016/j.spl.2016.12.013 [DOI] [Google Scholar]
- 11.Chen D.G. and Lio Y.L., Parameter estimations for generalized exponential distribution under progressive type I interval censoring, Comput. Stat. Data Anal. 54 (2010), pp. 1581–1591. doi: 10.1016/j.csda.2010.01.007 [DOI] [Google Scholar]
- 12.Chen D.G., Lio Y.L., and Jiang N., Lower confidence limits on the generalized exponential distribution percentiles under progressive type I interval censoring, Commun. Stat.-Simulation Comput. 42 (2013), pp. 2106–2117. doi: 10.1080/03610918.2012.695842 [DOI] [Google Scholar]
- 13.Chen M.H. and Shao Q.M., Monte Carlo estimation of Bayesian credible and HPD intervals, J. Comput. Graph. Stat. 8 (1999), pp. 69–92. [Google Scholar]
- 14.Cheng C., Chen J., and Li Z., A new algorithm for maximum likelihood estimation with progressive type I interval censored data, Commun. Stat. Simulation Comput. 39 (2010), pp. 750–766. doi: 10.1080/03610911003637406 [DOI] [Google Scholar]
- 15.Dempster A.P., Laird N.M., and Rubin D.B., Maximum likelihood from incomplete data via the EM algorithm, J. Royal Stat. Soc. Ser. B (methodological) 39 (1977), pp. 1–22. [Google Scholar]
- 16.Du Y., Guo Y., and Gui W., Statistical inference for the information entropy of the log-Logistic distribution under progressive type I interval censoring schemes, Symmetry 10 (2018), pp. 445. doi: 10.3390/sym10100445 [DOI] [Google Scholar]
- 17.Huang S.R. and Wu S.J., Reliability sampling plans under progressive type I interval censoring using cost functions, IEEE Trans. Reliab. 57 (2008), pp. 445–451. doi: 10.1109/TR.2008.928239 [DOI] [Google Scholar]
- 18.Ismail A.A., Optimum partially accelerated life test plans with progressively type I interval censored data, Seq. Anal. 34 (2015), pp. 135–147. doi: 10.1080/07474946.2015.1030301 [DOI] [Google Scholar]
- 19.Kus C., Akdogan Y., and Wu S.J., Optimal progressive group censoring scheme under cost considerations for pareto distribution, J. Appl. Stat. 40 (2013), pp. 2437–2450. doi: 10.1080/02664763.2013.818107 [DOI] [Google Scholar]
- 20.Lin C.T., Balakrishnan N., and Wu S.J., Planning life tests based on progressively type I grouped censored data from the Weibull distribution, Commun. Stat.-Simulation Comput. 40 (2011), pp. 574–595. doi: 10.1080/03610918.2010.549278 [DOI] [Google Scholar]
- 21.Lin Y.J. and Lio Y.L., Bayesian inference under progressive type I interval censoring, J. Appl. Stat. 39 (2012), pp. 1811–1824. doi: 10.1080/02664763.2012.683170 [DOI] [Google Scholar]
- 22.Lin C.T., Wu S.J., and Balakrishnan N., Planning life tests with progressively type I interval censored data from the lognormal distribution, J. Stat. Plan. Inference. 139 (2009), pp. 54–61. doi: 10.1016/j.jspi.2008.05.016 [DOI] [Google Scholar]
- 23.Ng H.K.T. and Wang Z., Statistical estimation for the parameters of Weibull distribution based on progressively type I interval censored sample, J. Stat. Comput. Simul. 79 (2009), pp. 145–159. doi: 10.1080/00949650701648822 [DOI] [Google Scholar]
- 24.Malevich N. and Muller C.H., Optimal design of inspection times for interval censoring, Stat. Papers 60 (2019), pp. 1–16. doi: 10.1007/s00362-018-01067-7 [DOI] [Google Scholar]
- 25.Roy S. and Pradhan B., Bayesian optimum life testing plans under progressive type I interval censoring scheme, Quality Reliab. Eng. Int. 33 (2017), pp. 2727–2737. doi: 10.1002/qre.2231 [DOI] [Google Scholar]
- 26.Roy S. and Pradhan B., Bayesian C-optimal life testing plans under progressive type I interval censoring scheme, Appl. Math. Mod. 70 (2019). doi: 10.1016/j.apm.2019.01.023 [DOI] [Google Scholar]
- 27.Singh S. and Tripathi Y.M., Estimating the parameters of an inverse Weibull distribution under progressive type I interval censoring, Stat. Papers 59 (2016), pp. 21–56. doi: 10.1007/s00362-016-0750-2 [DOI] [Google Scholar]
- 28.Varian H.R., A Bayesian approach to real estate assessment, in Studies in Bayesian Econometrics and Statistics in Honor of L. J. Savage, S.E. Feinberge and A. Zellner, eds., North Holland, Amsterdam, 1975, pp. 195–208,
- 29.Wang C., Wang S., Wang D., Li C., and Dong X., Parameter estimations for mixed generalized exponential distribution based on progressive type I interval censoring, Cogent Math. 4 (2017), pp. 1280913. doi: 10.1080/23311835.2017.1280913 [DOI] [Google Scholar]
- 30.Wu S.F., The performance assessment on the lifetime performance index of products following Chen lifetime distribution based on the progressive type I interval censored sample, J. Comput. Appl. Math. 334 (2018), pp. 27–38. doi: 10.1016/j.cam.2017.11.022 [DOI] [Google Scholar]
- 31.Wu S.J. and Huang S.R., Planning progressive type I interval censoring life tests with competing risks, IEEE Trans. Reliab. 63 (2014), pp. 511–522. doi: 10.1109/TR.2014.2313708 [DOI] [Google Scholar]
- 32.Wu S.F. and Hsieh Y.T., The assessment on the lifetime performance index of products with Gompertz distribution based on the progressive type I interval censored sample, J. Comput. Appl. Math. 351 (2019), pp. 66–76. doi: 10.1016/j.cam.2018.10.044 [DOI] [Google Scholar]
- 33.Wu S.F. and Lin Y.P., Computational testing algorithmic procedure of assessment for lifetime performance index of products with one-parameter exponential distribution under progressive type I interval censoring, Math. Comput. Simul. 120 (2016), pp. 79–90. doi: 10.1016/j.matcom.2015.06.013 [DOI] [Google Scholar]
- 34.Wu S.F. and Lin M.J., Computational testing algorithmic procedure of assessment for lifetime performance index of products with Weibull distribution under progressive type I interval censoring, J. Comput. Appl. Math. 311 (2017), pp. 364–374. doi: 10.1016/j.cam.2016.08.005 [DOI] [Google Scholar]
- 35.Wu S.F., Lin Y.T., Chang W.J., Chang C.W., and Lin C., A computational algorithm for the evaluation on the lifetime performance index of products with Rayleigh distribution under progressive type I interval censoring, J. Comput. Appl. Math. 328 (2018), pp. 508–519. doi: 10.1016/j.cam.2017.07.004 [DOI] [Google Scholar]
- 36.Zhao X. and Bordes L., Optimal progressive type I interval censored scheme under step-stress life testing, Stat. Interface. 10 (2017), pp. 521–527. doi: 10.4310/SII.2017.v10.n3.a14 [DOI] [Google Scholar]
