Abstract
Statistical data analysis available in most scientific fields is often recorded with measurement error. The modeling of these statistical data by ignoring the measurement errors, leads to estimators of the parameters of the distributions, whose use does not achieve sufficient accuracy in the goodness of fit. In reliability criteria, one of the important issues is hazard rate function. It prompted us to investigate the hazard rate criterion in the presence of measurement error generated from the normal or logistic distribution. Now, while providing the estimator for the density function using local time polynomial estimator methods, the risk rate function is estimated according to the contamination degree of 15 or 30%. Finally, we present the numerical analysis.
Keywords: Kernel density estimation, Bandwidth, Logistic distribution, Lifetime data, Mean squared error
1. Introduction
In the analysis of statistical data that includes measurement error, the calculation of reliability measures, including the hazard rate function, is very importance. Analyzing and modeling these types of data, by ignoring measurement errors, would increase the bias of estimators. Let T be a lifetime random variable such that then hazard rate function is given by:
| (1.1) |
and whenever . For more details see [1]. One of the important factors in the analysis of statistical data in various fields such as medicine, economics, astronomy, etc. is the measurement error investigation. Ignoring measurement error in this type of data can lead to biased estimates and wrong conclusions (see [2], [3] and [4]). In this paper, the parameter is estimated when the statistical data have measurement error generated from the normal or logistic distribution.
It is important to provide an estimator for the hazard rate function or equation (1.1). For this purpose, a non-parametric framework is considered to estimate the hazard rate function (see [5], [4] and [6]).
One of the suggested estimators is the use of local polynomial regression (see [7], [8], [9], [10] and [11]). In modeling data in the presence of measurement error, the main problem is the unavailability of real observations, such that instead of their observations, independent observations are available. Therefore, the following model is considered.
| (1.2) |
Such that, for simplicity, it is assumed that the random variable ε has a normal distribution with a mean of zero and a variance of . In this collective model, to estimate the density function of the random variable X, according to the distribution of ε and the observed values for the random variable Y, the density function of random variable X can be estimated by the well-known deconvolution estimator as (See [3], [4] and [12]):
| (1.3) |
where and are characteristic functions of K and ε, respectively. It is easy to show that the characteristic function of normal and logistic distribution given by:
respectively.
1.1. Plug-in estimator
As we know observed or recorded values for lifetime data are positive values even with measurement errors. Providing a collective model in this case may produce negative values. To avoid this, it is necessary to provide a multiplicative model. Therefore, to avoid taking negative values of the variables, the multiplicative model
| (1.4) |
is considered. But it can be easily shown that
| (1.5) |
Such that , and . Now to estimation of , we can write:
such that given by:
| (1.6) |
Now, by using the (1.6), the risk rate function estimator can be considering as:
| (1.7) |
Note that in the case where the errors are non-random and systematic, the above problem can be considered as the estimation of the hazard rate function under biased samples which has received attention by some researches (cf. [13]).
1.2. Local polynomial estimator
Let be m selected points from random sample . For selecting these points on a sub set with , we partition the interval into m disjoint subintervals of equal length , and let the be the center of each bin . For the number of observations in the interval the local polynomial estimator of the rate function is given by (See [2], [14] and [15]):
| (1.8) |
Based on [16], the local polynomial estimator of order the hazard rate function approximates given by,
Such that , . For
the estimator of can be obtained by minimization of:
where , K a kernel function and b is the bandwidth. The local polynomial estimator with is:
| (1.9) |
where, .
Hence, the estimator for the of order p, is given by , i.e.:
| (1.10) |
Here according to the type of data and using the multiplicative model and contaminated data, our goal present new estimator by replacing the satisfying (See [5], [17] and [18]):
where . Following this idea and from [19], our new estimator will be as follows:
| (1.11) |
and our new suggested estimator for the of order p, is given by , i.e.:
| (1.12) |
Such that and , are (1.7) estimators related to m selected points.
According to the estimators and , in the rest of this section, the features and asymptotic properties of the estimators are discussed (See [20]).
2. Asymptotic results
In this section, the behavior of the risk rate function estimator is investigated using Theorems 1 and 2 of [21]. In the following, it is shown that the distribution of the hazard rate function estimator is asymptotically normal.
2.1. Asymptotic results for the plug-in estimator
To present the asymptotic properties of the hazard rate function estimator according to the density function f, random variables T and the characteristic distribution of the measurement error
| (2.1) |
For and fixed positive b satisfying
| (2.2) |
2.2. Asymptotic results for the local polynomial estimator
Based on local polynomial estimator of in the equation (1.11) to obtain the we can write:
| (2.3) |
where , and .
These exact bias and variance expressions are not directly usable, since they depend on unknown quantities. Hence, there is a need for approximating bias and variance. The following notation will be used:
According to the notations, the variance and bias of the estimator are respectively equal to:
| (2.4) |
For p odd
| (2.5) |
and for p even
| (2.6) |
Provided that and are continuous in a neighborhood of .
3. Bandwidth selection
One of the methods to reduce the mean square error in the estimation of the usage rate function is to select the bandwidth. So the selection of the bandwidth in deconvolution problems has recently attracted the attention of a number of researchers, for more information, you can refer to articles [22], [23], [24] and [25]. In Section 3.1, a rule of thumb has been used to select the bandwidth when the measurement error has a normal or logistic distribution (see [26] and [27]).
3.1. Rule of thumb
In this section, according to the estimation methods of the risk rate function, the criterion (MISE) is used to determine the bandwidth as
According to and the characteristic function of the measurement error, the optimal value of the bandwidth is given by (See [27])
and based on rule of thumb bandwidth in case of ordinary smooth the value of the bandwidth is given by:
4. Simulation study
In this section, the main goal is to compare the obtained estimators with the estimator [28], that is, the plug-in estimator.
| (4.1) |
where is defined in (1.3) and is the estimator introduced by [29], i.e.
| (4.2) |
Comte et al. [28] discussed the value of λ in (4.2). We can use [30] results in the Theorem 1 and the Theorem 2 for determination of λ, but for convenience here we take from the page 9 of [29].
Now to compare the risk rate function estimation methods according to the amount of contamination produced from the normal or logistic distribution, the results for Weibull, Gamma, and Lindley distributions are evaluated. In the simulation part, we use the second-order kernel function to estimate the density function corresponding to the hazard rate function under contaminated data (see [24], [31] and [32]),
| (4.3) |
with characteristic function
For the local polynomial estimator, we take and according to [33] recommendation, . To determine the quantity m, some approaches can be found in references such as [34], [35] and [16]. For a practical and complete discussion on determining m, refer to section 4.1 of [36]. For selecting points , we let where and , . Such a style selection fixed points for local polynomial estimator has been used in various references such as in [16], [37] and [38].
Here to compare the estimators and according to the parameters of Weibull, Gamma and Lindley distributions with 15 and 30 percent contamination generated from normal or logistic, the simulation results for the size of samples with 1000 repetitions using R software are given in Table 1, Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8, Table 9, Table 10, Table 11, Table 12. Also, to compare the convenience of the behavior of the estimator, the diagram of the mean square error of estimator is given in Figure 1, Figure 2, Figure 3, such that:
Table 1.
Mean and Mean Square Error of Hazard rate function estimators of Weibull distribution with exception %15 contamination Normal error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 0.5494 | 0.0641 | 1.6238 | 2.5925 | 4.4401 | |
| (1.0017) | (1.2724) | (1.6807) | (1.9531) | (1.7560) | ||
| 1.2226 | 1.0735 | 1.6312 | 2.3995 | 4.5371 | ||
| (0.4175) | (0.6166) | (1.3629) | (1.7428) | (1.1562) | ||
| 0.5909 | 1.0705 | 1.6525 | 2.0688 | 2.5194 | ||
| (1.0183) | (1.3766) | (1.7349) | (2.3920) | (3.5748) | ||
| 50 | 0.4788 | 1.0203 | 1.5230 | 2.0462 | 2.4447 | |
| (0.1745) | (0.2346) | (0.3012) | (0.5097) | (0.8055) | ||
| 0.4901 | 1.0119 | 1.5231 | 2.0428 | 2.4601 | ||
| (0.0789) | (0.1061) | (0.1907) | (0.4007) | (0.7450) | ||
| 0.4789 | 1.0197 | 1.5159 | 3.0494 | 2.4878 | ||
| (0.1796) | 0.2360) | (0.3023) | (0.5140) | (0.8263) | ||
| 100 | 0.5079 | 1.022 | 1.4969 | 2.0076 | 2.5387 | |
| (0.0901) | (0.1214) | (0.1452) | (0.2397) | (0.4436) | ||
| 0.5093 | 1.0180 | 1.5025 | 2.0049 | 2.5313 | ||
| (0.0402) | (0.0592) | (0.0884) | (0.1753) | (0.3755) | ||
| 0.5076 | 1.0216 | 1.4939 | 2.0062 | 2.5605 | ||
| (0.0909) | (0.1288) | (0.31452) | (0.2412) | (0.4764) | ||
Table 2.
Mean and Mean Square Error of Hazard rate function estimators of Weibull distribution with exception %30 contamination Normal error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 0.5517 | 1.1122 | 1.6487 | 2.2574 | 3.3497 | |
| (1.6143) | (0.8427) | (1.1671) | (4.6060) | (18.6593) | ||
| 0.5462 | 1.1029 | 1.6423 | 2.2568 | 4.0570 | ||
| (0.1745) | (0.3855) | (0.7714) | (2.3980) | (21.6651) | ||
| 0.5521 | 1.1166 | 1.6441 | 2.1131 | 2.4317 | ||
| (0.6034) | (0.8409) | (1.0929) | (1.2837) | (1.5235) | ||
| 50 | 0.5154 | 1.0227 | 1.4874 | 1.9845 | 2.4582 | |
| (0.1149) | (0.1293) | (0.1603) | (0.3029) | (0.4577) | ||
| 0.5064 | 1.0316 | 1.4887 | 1.9795 | 2.4661 | ||
| (0.0327) | (0.0584) | (0.0668) | (0.2262) | (0.4432) | ||
| 0.5150 | 1.0193 | 1.4772 | 1.9901 | 2.5216 | ||
| (0.1149) | 0.1295) | (0.1589) | (0.3034) | (0.4743) | ||
| 100 | 0.5240 | 1.0219 | 1.5241 | 1.9749 | 2.4268 | |
| (0.0600) | (0.0620) | (0.0966) | (0.1270) | (0.2125) | ||
| 0.5158 | 1.0173 | 1.5132 | 1.9728 | 2.4295 | ||
| (0.0152) | (0.0272) | (0.0609) | (0.0930) | (0.1797) | ||
| 0.5232 | 1.0171 | 1.5156 | 1.9783 | 2.4964 | ||
| (0.0597) | (0.0616) | (0.0971) | (0.1338) | (0.2607) | ||
Table 3.
Mean and Mean Square Error of Hazard rate function estimators of Weibull distribution with exception %15 contamination Logistics error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 1.2475 | 1.6142 | 1.5291 | 1.6120 | 1.7512 | |
| (2.8970) | (54.7680) | (9.1122) | (4.2154) | (3.1250) | ||
| 1.3042 | 1.0813 | 1.9171 | 2.1037 | 2.5712 | ||
| (27.3542) | (14.1097) | (36.0151) | (41.1235) | (43.1253) | ||
| 0.5913 | 1.2351 | 1.2101 | 1.3521 | 1.4125 | ||
| (3.9535) | (5.0403) | (5.1270) | (4.0751) | (2.1253) | ||
| 50 | 1.0611 | 1.0231 | 0.9506 | 1.0235 | 1.1202 | |
| (0.5204) | (0.2806) | (0.2360) | (0.1234) | (2.1350) | ||
| 0.3571 | 0.5323 | 0.7874 | 0.8512 | 2.1025 | ||
| (0.3800) | (0.8197) | (0.9120) | (0.9812) | (0.7230) | ||
| 0.3553 | 0.9566 | 0.9178 | 0.8512 | 0.7123 | ||
| (0.0710) | (0.2607) | (0.2606) | (0.2310) | (1.0123) | ||
| 100 | 1.0920 | 0.9968 | 0.9147 | 0.9012 | 0.8123 | |
| (0.2542) | (0.1266) | (0.1122) | (0.0825) | (0.1631) | ||
| 0.3801 | 0.7344 | 0.4268 | 0.3851 | 0.3120 | ||
| (15.2798) | (35.0121) | (1.1560) | (0.0514) | (0.0412) | ||
| 0.3577 | 0.9351 | 0.9070 | 1.1253 | 1.1012 | ||
| (0.0328) | (0.1147) | (0.0920) | (0.0132) | (0.0123) | ||
Table 4.
Mean and Mean Square Error of Hazard rate function estimators of Weibull distribution with exception %30 contamination Logistics error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 0.9485 | 0.6110 | 0.4774 | 0.4397 | 0.4102 | |
| (0.0622) | (0.0216) | (0.0144) | (0.0122) | (0.0109) | ||
| 0.2517 | 0.2531 | 0.2643 | 0.2654 | 0.2678 | ||
| (0.0087) | (0.0076) | (0.0087) | (0.0078) | (0.0076) | ||
| 0.4512 | 0.7195 | 0.5286 | 0.4997 | 0.4622 | ||
| (2.5360) | (3.5400) | (0.1085) | (0.1022) | (0.0734) | ||
| 50 | 1.0055 | 0.6933 | 0.5504 | 0.5099 | 0.4805 | |
| (0.0210) | (0.0092) | (0.0057) | (0.0053) | (0.0040) | ||
| 0.2580 | 0.2708 | 0.2824 | 0.2864 | 0.2925 | ||
| (0.0028) | (0.0028) | (0.0028) | (0.0029) | (0.0026) | ||
| 0.2994 | 0.6088 | 0.5565 | 0.5176 | 0.4905 | ||
| (0.0093) | (0.0243) | (0.0117) | (0.0087) | (0.0072) | ||
| 100 | 1.0376 | 0.7503 | 0.6156 | 0.5695 | 0.5379 | |
| (0.0156) | (0.0078) | (0.0045) | (0.0044) | (0.0039) | ||
| 0.2619 | 0.2814 | 0.3040 | 0.3053 | 0.3136 | ||
| (0.0019) | (0.0033) | (0.0021) | (0.00219) | (0.0022) | ||
| 0.3016 | 0.6517 | 0.6176 | 0.5717 | 0.5382 | ||
| (0.0038) | (0.0127) | (0.0066) | (0.0057) | (0.0048) | ||
Table 5.
Mean and Mean Square Error of Hazard rate function estimators of Gamma distribution with exception %15 contamination Normal error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 0.1326 | 0.2681 | 0.5218 | 0.6577 | 0.8513 | |
| (0.1697) | (0.1205) | (0.2637) | (0.3183) | (0.3111) | ||
| 0.1352 | 0.2923 | 0.5177 | 0.6617 | 0.7996 | ||
| (0.0850) | (0.1021) | (0.1687) | (0.2667) | (0.2845) | ||
| 0.1237 | 0.2656 | 0.5178 | 0.6582 | 0.8010 | ||
| (0.1709) | (0.1709) | (0.2563) | (0.3234) | (0.3165) | ||
| 50 | 0.1197 | 0.2987 | 0.4772 | 0.6112 | 0.7381 | |
| (0.0301) | (0.0358) | (0.0414) | (0.0500) | (0.0617) | ||
| 0.1242 | 0.2987 | 0.4766 | 0.6128 | 0.7376 | ||
| (0.0151) | (0.0210) | (0.0284) | (0.0378) | (0.0533) | ||
| 0.1197 | 0.2983 | 0.4761 | 0.6889 | 0.7352 | ||
| (0.0305) | (0.0388) | (0.0452) | (0.0502) | (0.0651) | ||
| 100 | 0.1119 | 0.2856 | 0.4645 | 0.6316 | 0.7378 | |
| (0.0138) | (0.0157) | (0.0118) | (0.0214) | (0.0216) | ||
| 0.1168 | 0.2947 | 0.4662 | 0.6282 | 0.7364 | ||
| (0.0069) | (0.0093) | (0.0143) | (0.0180) | (0.0223) | ||
| 0.1119 | 0.2851 | 0.4630 | 0.6296 | 0.7360 | ||
| (0.0138) | (0.0176) | (0.0217) | (0.0243) | (0.0275) | ||
Table 6.
Mean and Mean Square Error of Hazard rate function estimators of Gamma distribution with exception %30 contamination Normal error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 0.1082 | 0.3112 | 0.5191 | 0.6436 | 0.7563 | |
| (0.0687) | (0.0944) | (0.1444) | (0.1394) | (0.1496) | ||
| 0.1240 | 0.3143 | 0.5062 | 0.6372 | 0.7511 | ||
| (0.0229) | (0.0432) | (0.0828) | (0.1036) | (0.1247) | ||
| 0.1089 | 0.3078 | 0.5144 | 0.6408 | 0.7564 | ||
| (0.0692) | (0.0931) | (0.1418) | (0.1365) | (0.1459) | ||
| 50 | 0.1213 | 0.3054 | 0.4833 | 0.6169 | 0.7348 | |
| (0.0185) | (0.0254) | (0.0258) | (0.0266) | (0.0322) | ||
| 0.1263 | 0.3044 | 0.4772 | 0.6143 | 0.7306 | ||
| (0.0058) | (0.0111) | (0.0157) | (0.0200) | (0.0274) | ||
| 0.1213 | 0.3034 | 0.4803 | 0.6115 | 0.7289 | ||
| (0.0185) | (0.0253) | (0.0256) | (0.0265) | (0.0335) | ||
| 100 | 0.1092 | 0.2975 | 0.4737 | 0.6153 | 0.7354 | |
| (0.0087) | (0.0116) | (0.0137) | (0.0166) | (0.0177) | ||
| 0.1168 | 0.2947 | 0.4709 | 0.6135 | 0.7321 | ||
| (0.0026) | (0.0056) | (0.0081) | (0.0124) | (0.0143) | ||
| 0.1091 | 0.2966 | 0.4706 | 0.6105 | 0.7297 | ||
| (0.0086) | (0.0115) | (0.0136) | (0.0163) | (0.0179) | ||
Table 7.
Mean and Mean Square Error of Hazard rate function estimators of Gamma distribution with exception %15 contamination Logistics error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 0.6562 | 0.6746 | 0.6560 | 0.7145 | 0.7652 | |
| (1.9074) | (1.9076) | (0.9189) | (0.8513) | (0.7712) | ||
| 0.3208 | 0.5380 | 0.5451 | 0.5512 | 0.5633 | ||
| (1.6052) | (9.7921) | (4.2944) | (1.2561) | (1.0152) | ||
| 0.1920 | 0.6145 | 0.6375 | 0.4997 | 0.6512 | ||
| (2.2239) | (1.3283) | (0.6280) | (1.1011) | (1.2015) | ||
| 50 | 0.5865 | 0.5795 | 0.5312 | 0.5811 | 0.5211 | |
| (0.0963) | (0.0403) | (0.0285) | (0.0312) | (0.0615) | ||
| 0.4152 | 0.4511 | 0.5121 | 0.5512 | 0.5765 | ||
| (0.1251) | (3.0145) | (2.1561) | (0.7521) | (0.8152) | ||
| 0.1569 | 0.4846 | 0.5466 | 0.5612 | 0.5410 | ||
| (0.0089) | (0.0138) | (0.0284) | (0.0325) | (0.0375) | ||
| 100 | 0.5663 | 0.5612 | 0.5179 | 0.4676 | 0.4713 | |
| (0.0458) | (0.0188) | (0.0127) | (0.0115) | (0.0075) | ||
| 0.1258 | 0.4715 | 0.5217 | 0.5676 | 0.5701 | ||
| (0.0152) | (0.5251) | (1.2561) | (1.0125) | (1.0035) | ||
| 0.1499 | 0.4672 | 0.5345 | 0.5613 | 0.5741 | ||
| (0.0039) | (0.0145) | (0.0129) | (0.0135) | (0.0271) | ||
Table 8.
Mean and Mean Square Error of Hazard rate function estimators of Gamma distribution with exception %30 contamination Logistics error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 0.7009 | 0.4799 | 0.3661 | 0.3453 | 0.3218 | |
| (0.0296) | (0.0156) | (0.0090) | (0.0081) | (0.0057) | ||
| 0.1675 | 0.1786 | 0.1779 | 0.1877 | 0.1895 | ||
| (0.0041) | (0.0044) | (0.0042) | (0.0044) | (0.0036) | ||
| 0.1734 | 0.3725 | 0.3488 | 0.3348 | 0.3213 | ||
| (0.0169) | (0.0363) | (0.0179) | (0.0204) | (0.0175) | ||
| 50 | 0.6995 | 0.5056 | 0.4105 | 0.3903 | 0.3639 | |
| (0.0142) | (0.0056) | (0.0032) | (0.0027) | (0.0025) | ||
| 0.1556 | 0.1675 | 0.1798 | 0.1892 | 0.1911 | ||
| (0.0012) | (0.0013) | (0.0012) | (0.0012) | (0.0013) | ||
| 0.1669 | 0.3800 | 0.3878 | 0.3815 | 0.3572 | ||
| (0.0014) | (0.0058) | (0.0036) | (0.0033) | (0.0030) | ||
| 100 | 0.6844 | 0.5283 | 0.4374 | 0.4190 | 0.3943 | |
| (0.0100) | (0.0039) | (0.0027) | (0.0024) | (0.0027) | ||
| 0.1437 | 0.1603 | 0.1760 | 0.1866 | 0.1918 | ||
| (0.0007) | (0.0007) | (0.0009) | (0.0009) | (0.0009) | ||
| 0.1671 | 0.4206 | 0.4253 | 0.4161 | 0.3950 | ||
| (0.0009) | (0.0033) | (0.0030) | (0.0027) | (0.0022) | ||
Table 9.
Mean and Mean Square Error of Hazard rate function estimators of Lindley distribution with exception %15 contamination Normal error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 2.7783 | 3.2788 | 5.3471 | 6.7835 | 8.7729 | |
| (7.8285) | (3.7071) | (5.5905) | (3.6813) | (4.6441) | ||
| 2.9782 | 4.2607 | 5.1686 | 5.4262 | 5.7512 | ||
| (11.2853) | (12.1825) | (13.7444) | (14.1821) | (15.0125) | ||
| 2.7083 | 2.8210 | 2.8808 | 2.9945 | 3.0728 | ||
| (7.0043) | (8.0775) | (10.2198) | (9.4938) | (3.1137) | ||
| 50 | 2.4261 | 2.5822 | 2.5643 | 2.9193 | 4.2800 | |
| (1.1100) | (1.2292) | (1.4280) | (4.0251) | (7.0151) | ||
| 2.4035 | 2.7237 | 2.5850 | 2.9314 | 3.5931 | ||
| (9.5630) | (9.6505) | (1.0931) | (10.6109) | (25.1754) | ||
| 2.4351 | 2.5812 | 2.9500 | 3.1865 | 3.8599 | ||
| (1.1106) | (1.1987) | (1.4599) | (3.5816) | (5.3615) | ||
| 100 | 2.3806 | 2.4705 | 2.4715 | 2.6567 | 2.9232 | |
| (0.5221) | (0.5607) | (0.6652) | (1.1747) | (3.7315) | ||
| 2.3813 | 2.4759 | 2.5845 | 2.6722 | 3.1407 | ||
| (0.3834) | (0.2709) | (0.4678) | (0.9517) | (1.0325) | ||
| 2.3779 | 2.4776 | 2.6123 | 2.7429 | 4.3145 | ||
| (0.5228) | (0.5724) | (0.7128) | (1.4172) | (1.5102) | ||
Table 10.
Mean and Mean Square Error of Hazard rate function estimators of Lindley distribution with exception %30 contamination Normal error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 2.5526 | 2.9583 | 5.3604 | 7.5059 | 7.8571 | |
| (3.3042) | (8.3465) | (20.1521) | (271526) | (30.1526) | ||
| 2.6554 | 3.2773 | 4.9396 | 3.6457 | 6.2025 | ||
| (2.1424) | (2.9019) | (38.3145) | (43.1197) | (55.1251) | ||
| 2.5398 | 2.8855 | 3.8926 | 3.6145 | 4.0920 | ||
| (3.2184) | (4.2150) | (11.2571) | (23.6092) | (27.1256) | ||
| 50 | 2.4597 | 2.2453 | 2.5706 | 2.6893 | 2.9994 | |
| (0.5757) | (0.4811) | (0.7209) | (1.3146) | (5.1206) | ||
| 2.4291 | 2.4569 | 2.5675 | 2.8085 | 3.1501 | ||
| (0.2721) | (0.2309) | (0.5006) | (6.0962) | (8.8652) | ||
| 2.4523 | 2.4759 | 2.6846 | 3.9638 | 4.9036 | ||
| (0.5785) | (0.5388) | (1.0260) | (4.1859) | (7.2352) | ||
| 100 | 2.2394 | 2.4187 | 2.5700 | 2.5327 | 2.5457 | |
| (0.2629) | (0.2444) | (0.3007) | (0.4846) | (0.6695) | ||
| 2.3839 | 2.4238 | 2.4938 | 2.5471 | 2.5763 | ||
| (0.1173) | (0.1200) | (0.2069) | (0.4465) | (0.6565) | ||
| 2.3805 | 2.4306 | 2.5885 | 2.7934 | 3.7031 | ||
| (0.2663) | (0.2743) | (1.4160) | (1.4025) | (3.5901) | ||
Table 11.
Mean and Mean Square Error of Hazard rate function estimators of Lindley distribution with exception %15 contamination Logistics error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 5.5147 | 4.1164 | 5.3217 | 1.8223 | 3.7503 | |
| (7.5851) | (9.0335) | (12.0327) | (16.8605) | (17.7405) | ||
| 6.6882 | 2.9759 | 3.2995 | 4.9049 | 5.2917 | ||
| (21.0421) | (7.9037) | (18.7152) | (17.3166) | (17.8526) | ||
| 6.3830 | 3.2546 | 1.3838 | 2.2360 | 1.2752 | ||
| (25.1752) | (9.0161) | (13.3125) | (18.1163) | (19.9394) | ||
| 50 | 2.5527 | 2.6978 | 2.3386 | 2.8676 | 3.2916 | |
| (24.6818) | (30.6719) | (53.8453) | (52.0771) | (62.9917) | ||
| 6.8119 | 1.8983 | 3.7467 | 1.8352 | 3.1257 | ||
| (25.5282) | (19.4680) | (21.0940) | (8.7851) | (18.5015) | ||
| 4.6538 | 6.0101 | 7.1852 | 9.6125 | 10.1055 | ||
| (7.8405) | (8.8452) | (8.9165) | (9.1252) | (8.9521) | ||
| 100 | 2.1867 | 2.0712 | 2.1926 | 3.2887 | 2.6668 | |
| (4.2219) | (4.9383) | (7.6555) | (9.6982) | (10.1251) | ||
| 2.1514 | 2.4450 | 2.2235 | 5.0192 | 2.3625 | ||
| (8.0762) | (11.8992) | (7.1313) | (10.1521) | (7.9125) | ||
| 3.1412 | 2.7614 | 1.2032 | 1.0316 | 0.8924 | ||
| (1.4215) | (1.3039) | (1.1253) | (8.0082) | (6.9522) | ||
Table 12.
Mean and Mean Square Error of Hazard rate function estimators of Lindley distribution with exception %30 contamination Logistics error.
| t | ||||||
|---|---|---|---|---|---|---|
| n | Estimator | 0.25 | 0.5 | 0.75 | 1.00 | 1.25 |
| 10 | 1.3312 | 0.8408 | 0.6514 | 0.5856 | 0.5686 | |
| (0.1313) | (0.0614) | (0.0394) | (0.0445) | (0.0424) | ||
| 0.4405 | 0.4519 | 0.4553 | 0.4338 | 0.4442 | ||
| (0.1287) | (0.1728) | (0.2093) | (0.0766) | (0.0721) | ||
| 2.2371 | 3.4598 | 2.2089 | 0.9798 | 3.4464 | ||
| (1.4202) | (1.8752) | (1.0301) | (1.2515) | (1.0418) | ||
| 50 | 1.2713 | 0.8012 | 0.6045 | 0.5225 | 0.5142 | |
| (0.0224) | (0.0088) | (0.0057) | (0.0042) | (0.0041) | ||
| 0.3675 | 0.3787 | 0.3819 | 0.3809 | 0.3828 | ||
| (0.0038) | (0.0037) | (0.0036) | (0.0035) | (0.0033) | ||
| 2.9368 | 1.0193 | 0.7997 | 0.6719 | 0.6201 | ||
| (0.0914) | (1.3206) | (0.9151) | (0.3517) | (0.2512) | ||
| 100 | 1.2799 | 0.7957 | 0.5991 | 0.5501 | 0.5074 | |
| (0.0118) | (0.0042) | (0.0027) | (0.0024) | (0.0022) | ||
| 0.3689 | 0.3734 | 0.3761 | 0.3780 | 0.3774 | ||
| (0.0019) | (0.0019) | (0.0016) | (0.0016) | (0.0017) | ||
| 2.4997 | 1.5626 | 0.7571 | 0.6547 | 0.5941 | ||
| (0.0125) | (1.2562) | (0.0237) | (0.0133) | (0.0116) | ||
Figure 1.
Estimator for Weibull distribution with shape 2 and scale 1.
Figure 2.
Estimator for Gamma distribution with shape 3 and scale .
Figure 3.
Estimator for Lindley distribution with shape 0 and scale 0.5.
Black line: Actual value, Blue line: Estimator (1.12), Red line: Estimator (1.7), and Green line: Estimator (4.1).
As observed from these results, our proposed estimators have generally outperformed the estimator (4.1) in terms of both empirical mean square error and bias, especially for large t. The results revealed that with a large sample size, the improvement of the proposed estimators would be further enhanced. Comparison between our two proposed estimators indicates the local polynomial estimator in terms of empirical mean square error is better than the first plug-in estimator.
5. Conclusion
Considering the importance of the hazard rate function in the evaluation of survival data, its point estimation with the presence of measurement errors generated from normal distribution is considered. In using the deconvolution kernel density function estimator, we proposed a plug-in estimator. The second estimator was constructed using the first estimator and applying local polynomial regression. For measurement errors that have a normal or logistic distribution, we mentioned the conditions under which our proposed plug-in estimator is consistent. For the local polynomial estimator, the asymptotic bias and variance were calculated and noted that this estimator has also an asymptotic normal distribution. Note that the mean square error of the statistical distribution of the survey with the logistic distribution error was lower compared to the mean square of the error with the normal distribution error under the same conditions.
CRediT authorship contribution statement
Parviz Nasiri: Supervision, Methodology. Rougheih Kheirazar: Software, Investigation. Abbas Rasouli: Software. Ali Shadrokh: Resources.
Declaration of Competing Interest
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests:
Parviz Nasiri reports a relationship with Payame Noor University that includes: non-financial support. Parviz Nasiri has patent licensed to –. It is declared that there is no conflict of interest. Parviz Nasiri
If there are other authors, they declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgement
The authors would like to thank anonymous referees for their constructive suggestions and insightful comments, which significantly improved the presentation of this article.
Data availability
Data will be made available on request.
References
- 1.Lai C.D., Xie M. Springer; Berlin, Germany: 2006. Stochastic Ageing and Dependence for Reliability. [DOI] [Google Scholar]
- 2.Buonaccorsi J.P. Chapman and Hall/CRC; 2010. Measurement Error: Models, Methods, and Applications. [DOI] [Google Scholar]
- 3.Meister A. Springer; Berlin, Germany: 2009. Deconvolution Problems in Nonparametric Statistics. [DOI] [Google Scholar]
- 4.Carroll R.J., Ruppert D., Stefanski L.A., Crainiceanu C.M. second edition. Chapman and Hall; London: 2006. Measurement Error in Nonlinear Models: A Modern Perspective. [DOI] [Google Scholar]
- 5.Stefanski L.A., Carroll R.J. Deconvoluting kernel density estimators. Statistics. 1990;21:169–184. doi: 10.1080/02331889008802238. [DOI] [Google Scholar]
- 6.Mammen E., Nielsen J.P. A general approach to the predictability issue in survival analyses. Biometrika. 2007;94:873–892. doi: 10.1093/biomet/asm062. [DOI] [Google Scholar]
- 7.Müller H.G., Wang J.L., Capra W.B. From lifetables to hazard rates: the transformation approach. Biometrika. 1997;84:881–892. [Google Scholar]
- 8.Nielsen J.P. Marker dependent hazard estimation from local linear estimation. Scand. Actuar. J. 1998;2:113–124. doi: 10.1080/03461238.1998.10413997. [DOI] [Google Scholar]
- 9.Nielsen J.P., Tanggaard C. Boundary and bias correction in kernel hazard estimation. Scand. J. Stat. 2001;28:675–698. doi: 10.1111/1467-9469.00262. [DOI] [Google Scholar]
- 10.Nielsen J.P., Tanggaard C., Jones C. Local linear density estimation for filtered survival data. Statistics. 2009;42:167–186. doi: 10.1080/02331880701736648. [DOI] [Google Scholar]
- 11.Wang J.L., Müller H.G., Capra W.B. Analysis of oldest-old mortality: lifetables revisited. Ann. Stat. 1998;28:126–163. [Google Scholar]
- 12.Devroye L. Consistent deconvolution in density estimation. Can. J. Stat. 1989;17(2):235–239. doi: 10.2307/3314852. [DOI] [Google Scholar]
- 13.Fakoor V. On the nonparametric mean residual life estimator in length-biased sampling. Commun. Stat., Theory Methods. 2015;44:512–519. doi: 10.1080/03610926.2012.748328. [DOI] [Google Scholar]
- 14.Pulcini G. Modeling the failure data of a repairable equipment with bathtub type failure intensity. Reliab. Eng. Syst. Saf. 2001;71:209–218. doi: 10.1016/S0951-8320(00)00101-0. [DOI] [Google Scholar]
- 15.Rai B., Singh N. Customer-rush near warranty expiration limit and nonparametric hazard rate estimation from known mileage accumulation rates. IEEE Trans. Reliab. 2006;55:480–489. doi: 10.1109/TR.2006.879648. [DOI] [Google Scholar]
- 16.Bagkavos D., Patil P.N. Local polynomial fitting in failure rate estimation. IEEE Trans. Reliab. 2008;56:126–163. doi: 10.1109/TR.2007.911264. [DOI] [Google Scholar]
- 17.Fan J., Truong Y.K. Nonparametric regression with errors in variables. Ann. Stat. 1993;21:1900–1925. doi: 10.1214/aos/1176349402. [DOI] [Google Scholar]
- 18.Carroll R.J., Hall P. Optimal rates of convergence for deconvolving a density. J. Am. Stat. Assoc. 1988;83:1184–1186. [Google Scholar]
- 19.Delaigle A., Fan J., Carroll R.J. A design-adaptive local polynomial estimator for the errors-in-variables problem. J. Am. Stat. Assoc. 2009;104:348–359. doi: 10.1198/jasa.2009.0114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.McIntyre J., Stefanski L.A. Density estimation with replicate heteroscedastic measurements. Ann. Inst. Stat. Math. 2011;61:81–90. doi: 10.1007/s10463-009-0220-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Pourjafar H., Zardasht V. Estimation of the mean residual life function in the presence of measurement errors. Commun. Stat., Simul. Comput. 2020;49(2):532–555. doi: 10.1080/03610918.2018.1489054. [DOI] [Google Scholar]
- 22.Bagkavos D. Local linear hazard rate estimation and bandwidth selection. Ann. Inst. Stat. Math. 2011;63:1019–1046. doi: 10.1007/s10463-010-0277-6. [DOI] [Google Scholar]
- 23.Hesse C.H. Data-driven deconvolution. J. Nonparametr. Stat. 1999;10(4):343–373. doi: 10.1080/10485259908832766. [DOI] [Google Scholar]
- 24.Delaigle A., Gijbels I. Bootstrap bandwidth selection in kernel density estimation from a contaminated sample. Ann. Inst. Stat. Math. 2004;56(1):19–47. doi: 10.1007/BF02530523. [DOI] [Google Scholar]
- 25.Delaigle A., Gijbels I. Practical bandwidth selection in deconvolution kernel density estimation. Comput. Stat. Data Anal. 2004;45:249–267. doi: 10.1016/S0167-9473(02)00329-8. [DOI] [Google Scholar]
- 26.Wang X.F., Wang B. 2010. Simultaneous Confidence Bands and Bootstrap Bandwidth Selection in Deconvolution with Heteroscedastic Error. [Google Scholar]
- 27.Fan J. On the optimal rates of convergence for nonparametric deconvolution problems. Ann. Stat. 1991;19:1257–1272. [Google Scholar]
- 28.Comte F., Samson A., Stirnemann J.J. Hazard estimation with censoring and measurement error: application to length of pregnancy. Test. 2018;27:338–359. doi: 10.1007/s11749-017-0548-0. [DOI] [Google Scholar]
- 29.Dattner I., Goldenshluger A., Juditsky A. On deconvolution of distribution functions. Ann. Stat. 2011;39:2477–2501. doi: 10.1214/11-AOS907. [DOI] [Google Scholar]
- 30.Dattner I., Reiser B. Estimation of distribution functions in measurement error models. J. Stat. Plan. Inference. 2013;143:479–493. doi: 10.1016/j.jspi.2012.09.004. [DOI] [Google Scholar]
- 31.Fan J. Deconvolution with supersmooth distributions. Can. J. Stat. 1992;20:155–169. doi: 10.2307/3315465. [DOI] [Google Scholar]
- 32.Resnick S.I. Birkhauser; Boston: 1999. A Probability Path. [DOI] [Google Scholar]
- 33.Fan J., Gijbels I. Chapman and Hall; London: 1996. Local Polynomial Modeling and Its Applications. [DOI] [Google Scholar]
- 34.Scott D.W. On optimal and data-based histograms. Biometrika. 1979;66:605–610. doi: 10.1093/biomet/66.3.605. [DOI] [Google Scholar]
- 35.Wand M.P. Data-based choice of histogram bin width. Am. Stat. 1997;51:59–64. doi: 10.1080/00031305.1997.10473591. [DOI] [Google Scholar]
- 36.Jayasinghe C., Zeephongsekul P. On the nonparametric smooth estimation of the reversed hazard rate function. Stat. Methodol. 2012;9(9):364–380. doi: 10.1016/j.stamet.2011.09.003. [DOI] [Google Scholar]
- 37.Jayasinghe C., Zeephongsekul P. Local polynomial fitting of the mean residual life function. IEEE Trans. Reliab. 2013;62(2):317–328. doi: 10.1109/TR.2013.2255774. [DOI] [Google Scholar]
- 38.Shao J. Springer; New York: 2003. Mathematical Statistics. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data will be made available on request.



