Abstract
The standard omega distribution is defined on the unit interval so that it is a probabilistic model for observations in rates and percentages. It is, in fact, the unit form of the exponentiated half logistic distribution. In this work, we first give a detailed shape analysis from which we observe that it is another flexible beta-like distribution. We observe that it can be J-shaped, reverse J-shaped, U-shaped, unimodal and show left and right skewness according to the values of its shape parameters. Contrary to the ordinary beta, it has the advantage of having a clear distribution function. We then discuss the existence and uniqueness of the maximum likelihood estimators and the Bayesian estimate of the parameters. The existence and uniqueness of the maximum likelihood estimators of the parameters will give a great advantage to the possible practitioners of this model since the possibility of finding a spurious solution to the likelihood equations disappears then. The comparison of these estimators with the existing ones for the general omega distribution is made with the help of a simulation study. Two real data fitting demonstrations prove its usefulness among other beta-like distributions such as Kumaraswamy, log-Lindley and Topp–Leone.
Keywords: Bayesian estimation, existence and uniqueness problem, maximum likelihood estimation, omega distribution, unit exponentiated half logistic distribution
1. Introduction
In many practical situations, we often encounter proportion data. A random variable whose values are expressed as percentages or fractions of a whole produces proportion data. For example, acceptance rate, i.e. the percentage of applicants admitted for a college or university in a country, percentage of votes in an election, width to length ratio of a rectangular manufactured item, cost-to-charge ratio, i.e. ratio of the actual cost of care to what the hospital actually bills for care, yield of a chemical reaction taken at a specific temperature, percentage of the useful volume of a hydroelectric plant's water reservoir, and so on.
Such random variables can take any value on the unit interval and a suitable probability distribution like beta is needed for modelling the observed data.
There are many distributions for modelling data which are restricted to a bounded range. Recently, Dombi et al. [7] introduced a new probability distribution referred to as the omega distribution and discussed its application in reliability theory. It has the following probability density function (PDF) and the cumulative distribution function (CDF)
respectively, where , , d>0, and
is the omega function. The main properties of the omega function can be found in Dombi and Jónás [5].
These formulas are simple in the sense that they do not contain any special function. They also do not contain an exponential term. These features give the distribution an important advantage in computational procedures like parameter estimation. The parameters α and β have similar meanings to those of the Weibull distribution. The parameter d clearly determines the domain of the distribution. The hazard rate function of the distribution can exhibit monotonic, constant and bathtub shapes. Dombi et al. [7] showed that the asymptotic omega hazard rate function is just the Weibull hazard rate function. That is, the asymptotic omega distribution is just the Weibull distribution. As is also noted in Dombi and Jónás [6] that when , the log-likelihood function of the omega distribution is identical to the log-likelihood function of the Weibull distribution.
Later, Okorie and Nadarajah [19] derived closed-form expressions for the mean, mode, median, variance, rth raw moment, quantile function, skewness and kurtosis measures of the distribution.
Vasileva [25] investigated the omega probability distribution in the Hausdorff sense and derived a nonlinear equation satisfied by the Hausdorff distance between the CDF of the distribution and the Heaviside function.
Moments of order statistics from the distribution were studied in Alsubie et al. [1]. They also discuss seven methods to estimate the parameters of the distribution, including maximum likelihood (ML). Recently, Özbilen and Genç [20] defined a bivariate extension of the standard form of the distribution via Marshall–Olkin approach.
In this work, we will also concentrate on the standard omega distribution by taking d = 1. We will suggest this form to model rates and percentage data appeared in many fields, e.g. economics, biology, etc. This paper will replace it among the alternatives to the ordinary beta distribution. By doing that, we open the research for alternative models such as beta regression models, beta-binomial models, etc.
We will estimate the parameters by the ML method. Although several estimation methods, including the ML for the parameters of the distribution are proposed by Alsubie et al. [1], the existence and uniqueness problem of the MLEs of the parameters are not studied in the literature. This problem of the ML estimators (MLEs) is important in statistical inference. An ML algorithm may fail for a distribution due to the presence of several local maxima. This may make the solution of the ML estimating equations wrong. Another problem is to obtain the solution on the boundary of the parameter space. There are a lot of works in the literature that aim to show the existence and uniqueness of the MLEs of the parameters of different models. Some of the recent studies include Ghitany et al. [8], Gui [11], Jiang and Gui [15], Sudsawat and Pal [23].
In the general case, we note that the survival function of the distribution is in the form , where
is the parent survival function. That is, the omega distribution belongs to the class of proportional hazard rate models (see, e.g. [21]) which includes several well-known lifetime models such as exponential, Pareto, Lomax and Burr Type XII.
The paper is organized as follows. In Section 2, the standard omega distribution and its shape analysis are given. In Section 3, the ML estimation is discussed and given the observed Fisher information matrix. In Section 4, the Bayesian estimation is discussed. In Section 5, a simulation study comparing the proposed methods with the existing ones is performed. In Section 6, two real data fitting applications are given. The paper is finalized with conclusions. In order to shorten the paper, the proofs of the lemmas and theorems are given in the supplementary file, available online.
2. Standard omega distribution
When d = 1, the standard omega distribution is obtained. It has the following PDF:
| (1) |
and the CDF
It is flexible with bounded domain distribution, like the ordinary beta. It can be different shapes such as J-shaped, reversed J-shaped, U-shaped, unimodal, skewed left and skewed right (Figure 1). However, it has the advantage that its CDF is in a simple mathematical form not containing any special function.
Figure 1.
Plots of the PDF of the standard omega distribution.
Note that the standard omega distribution is, in fact, a unit exponentiated half logistic distribution. That is, the distribution of when X has the exponentiated half logistic distribution. The latter is another useful probability distribution and has many applications in reliability theory (see, e.g. [14,22]). We also note that some of the well-known distributions in the literature are also obtained by this transformation. For example, the Kumaraswamy distribution [16] is obtained from the generalized exponential distribution [12] and log-Lindley distribution [9] is obtained from the generalized Lindley distribution [26] by this transformation.
Henceforward, a random variable following the PDF in (1) will be denoted by .
2.1. Shape of the PDF
In this section, we will revise the shape of the distribution for its standard form. We immediately have the following limit:
and by L'Hôpital's rule, we have the following limit
which is resulted in
As is also noted in [19] for omega distribution that if then the roots of the equation are given by
and
From the analyse above, we can summarize the shape of the PDF as in the following:
and correspond to a U-shaped distribution with minimum at
and correspond to a reverse J-shaped distribution.
, and correspond to a J-shaped distribution.
, and correspond to an increasing-decreasing-increasing-shaped distribution with mode at and antimode at .
and correspond to a unimodal distribution with mode at . Further, if then a right-skewed distribution; otherwise, a left-skewed distribution is obtained.
The shape of the PDF can be seen in Figure 1 for several choices of the parameters. We observe that the PDF can be as flexible as that of the ordinary beta distribution. Thus, one may properly use the standard omega distribution alternatively to the beta distribution in data modelling.
The distribution is closed under power transformation as it is shown in the following theorem.
Theorem 2.1
If , , then .
3. Maximum likelihood estimation
In this section, we will discuss the ML estimation of the model parameters. We will first derive the MLEs of the parameters and then show the existence and uniqueness of the MLEs.
Let be a data set modelled by the distribution. Then, we have , and the log-likelihood function is given by
The log-likelihood estimating equations are given by
| (2) |
and
| (3) |
The MLEs of α and β are obtained by solving Equations (2) and (3), simultaneously. From Equation (2), the MLE of α is given explicitly
| (4) |
as a function of .
By putting (4) into (3) for α, we get , where
The solution of gives the MLE of β, .
Lemma 3.1
Let for all and . Then
Lemma 3.2
Let for all and . Then
Theorem 3.3
We have that
Now, we will search for the limit . We first give some lemmas before to evaluate this limit.
Lemma 3.4 Formula 1.513.1 in [10] —
If , then we have the following series expansion:
Lemma 3.5
Let for all and . Then
where .
Theorem 3.6
We have that
Note that the limit is negative. By Theorem 3.3 and Theorem 3.6, we can say that the function has at least one root in . This means that , the MLE of β, exists in the parameter space.
Now, we will try to show that is unique.
Let . Then, after some arrangements, we have
Let
and
Lemma 3.7
The sum is positive.
Lemma 3.8
For , define
Then .
Lemma 3.9
The sum is positive.
Theorem 3.10
The maximum likelihood estimate uniquely exists in the parameter space .
3.1. Observed fisher information matrix
Let . We give the second order partial derivatives of the log-likelihood function l to form the observed Fisher information matrix
whose elements are given by
From the standard large sample procedures that . Approximate confidence intervals for parameters can be found as and where is the quantile of order of the standard normal distribution and is the respective diagonal element of .
4. Bayesian estimation
Let be the vector of observations. In this section, we will discuss Bayesian estimations (see, e.g. [13]) of the parameters.
Since both parameters are positive, we may suppose the independent gamma priors have the following PDF's:
Then the joint posterior distribution of is
| (5) |
ignoring the normalizing constant. The conditional posterior densities of α and β are given by
and
respectively.
Under the squared error loss function, the Bayes estimators of α and β are given by
and
respectively. Here, C is the normalizing constant ignored in (5). Since these integrals cannot be solved analytically, we resort to the numerical methods, like the Metropolis-Hastings-within-Gibbs algorithm (see, e.g. [17]). The algorithm adapted for the standard omega distribution is given in the following:
Step 1: Set , and t = 1.
Step 2: Generate from and from .
- Step 3: Generate from and calculate
where is the PDF of . Step 4: If then set ; otherwise, set .
- Step 5: Generate from and calculate
Step 6: If then set ; otherwise, set .
Step 7: Set t = t + 1.
Step 8: Repeat steps 2−7 M times.
This generates Markov chains and and it is well known that, after a sufficient number of steps, that is, a burn-in period, the chains approach their stationary distributions (see, e.g. [13]). Now, the Bayes estimate of α and β are given by
where H is a burn-in period. Further, let be the values sorted increasingly. Similarly, . Then highest posterior density (HPD) credible intervals (see, e.g. [2]) for the parameters are given by
and
respectively. Here, and are the smallest integers less than or equal to and , respectively.
5. Simulation study
To examine the performance of the Bayesian estimator and to compare it with the previously proposed estimators, we generate 10,000 samples from for each combination of different parameter values and sample sizes. Here, we choose parameters as , 1.5, 2.5, , 1.5, 2.5 and sample sizes as n = 30, 100. The values of the parameters have been chosen so that we examine the estimation methods under different density shapes. Afterwards, ML, Bayes (B), least square (LS), weighted least squares (WLS), maximum product spacing (MPS), percentile (P), Anderson-Darling (AD), and right-tailed Anderson-Darling (RAD) estimation methods were applied to the simulated data. For the Bayesian estimation, we used the procedure given in Section 4 where gamma priors with and choose different and values for different density shapes as in Table 1. Here, and values were obtained by considering the shape analysis given in Section 2.1. For the details about these estimation methods that are not given in this paper, see Alsubie et al. [1]. They also share a simulation result for these methods, but they only use parameter values that generate reversed J or unimodal shaped densities. We have demonstrated the values of the means and mean squared errors (MSEs) of all the estimators in Tables 2 and 3.
Table 1.
Actual parameter values and the corresponding density shapes and gamma priors.
| Actual values | Prior parameters | |||
|---|---|---|---|---|
| Density shape | α | β | ||
| U-shaped | 0.75 | 0.75 | 1 | 0.5 |
| 1.5 | 0.75 | 1 | 0.5 | |
| J-shaped | 0.75 | 1.5 | 1 | 2 |
| 0.75 | 2.5 | 1 | 2 | |
| 1.5 | 1.5 | 1 | 2 | |
| 1.5 | 2.5 | 1 | 2 | |
| Reversed J-shaped | 2.5 | 0.75 | 3 | 0.5 |
| Unimodal | 2.5 | 1.5 | 3 | 2 |
| 2.5 | 2.5 | 3 | 2 | |
Table 2.
Means of estimates of the parameters and MSEs (in parentheses) for many estimation methods when n = 30.
| n | α | β | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 30 | 0.75 | 0.75 | 0.8248 | 0.8421 | 0.8131 | 0.8093 | 0.7648 | 0.7694 | 0.7743 | 0.7838 | 0.7355 | 0.707 | 0.7602 | 0.7459 | 0.7835 | 0.797 | 0.8021 | 0.8734 |
| (0.0346) | (0.0686) | (0.0294) | (0.0520) | (0.0388) | (0.0871) | (0.0344) | (0.0759) | (0.0213) | (0.0464) | (0.0472) | (0.0759) | (0.0322) | (0.0647) | (0.0380) | (0.1732) | |||
| 1.5 | 0.8326 | 1.6937 | 0.8329 | 1.6944 | 0.7663 | 1.5435 | 0.7779 | 1.578 | 0.7432 | 1.4237 | 0.7588 | 1.465 | 0.7873 | 1.6026 | 0.8084 | 1.77 | ||
| (0.0354) | (0.2803) | (0.0323) | (0.2233) | (0.0384) | (0.3497) | (0.0336) | (0.3079) | (0.0207) | (0.1859) | (0.0499) | (0.2286) | (0.0322) | (0.2621) | (0.0381) | (0.7220) | |||
| 2.5 | 0.8409 | 2.8462 | 0.8214 | 2.6558 | 0.7682 | 2.5903 | 0.7822 | 2.6579 | 0.7514 | 2.3954 | 0.7545 | 2.4078 | 0.7917 | 2.6935 | 0.8147 | 2.9966 | ||
| (0.0363) | (0.7761) | (0.0280) | (0.4083) | (0.0366) | (0.9924) | (0.0317) | (0.8855) | (0.0202) | (0.4971) | (0.0565) | (0.6103) | (0.0318) | (0.7327) | (0.0376) | (2.1468) | |||
| 1.5 | 0.75 | 1.6323 | 0.8032 | 1.548 | 0.7687 | 1.5357 | 0.7525 | 1.551 | 0.7624 | 1.4179 | 0.7005 | 1.5055 | 0.7427 | 1.5678 | 0.7725 | 1.5955 | 0.7977 | |
| (0.1813) | (0.0335) | (0.1171) | (0.0254) | (0.2034) | (0.0403) | (0.1823) | (0.0357) | (0.1197) | (0.0270) | (0.1701) | (0.0473) | (0.1666) | (0.0322) | (0.1978) | (0.0548) | |||
| 1.5 | 1.6316 | 1.6031 | 1.5746 | 1.5803 | 1.5358 | 1.5019 | 1.5508 | 1.5215 | 1.417 | 1.3978 | 1.5052 | 1.4717 | 1.5667 | 1.5417 | 1.5966 | 1.5943 | ||
| (0.1820) | (0.1357) | (0.1234) | (0.1059) | (0.2152) | (0.1603) | (0.1917) | (0.1425) | (0.1200) | (0.1110) | (0.1660) | (0.1266) | (0.1683) | (0.1294) | (0.2029) | (0.2169) | |||
| 2.5 | 1.6357 | 2.6806 | 1.5395 | 2.5318 | 1.5361 | 2.5089 | 1.5515 | 2.5421 | 1.4208 | 2.338 | 1.4957 | 2.427 | 1.569 | 2.5766 | 1.5975 | 2.6627 | ||
| (0.1810) | (0.3730) | (0.1086) | (0.2333) | (0.2034) | (0.4479) | (0.1822) | (0.3968) | (0.1182) | (0.2992) | (0.1764) | (0.3265) | (0.1666) | (0.3584) | (0.1977) | (0.6098) | |||
| 2.5 | 0.75 | 2.7646 | 0.7917 | 2.7206 | 0.7784 | 2.5881 | 0.7479 | 2.6136 | 0.7566 | 2.341 | 0.7019 | 2.4773 | 0.7306 | 2.6389 | 0.7649 | 2.6989 | 0.7826 | |
| (0.6938) | (0.0246) | (0.4343) | (0.0193) | (0.8772) | (0.0290) | (0.7596) | (0.0257) | (0.4181) | (0.0212) | (0.6223) | (0.0390) | (0.6241) | (0.0234) | (0.7769) | (0.0340) | |||
| 1.5 | 2.7649 | 1.5835 | 2.7689 | 1.5904 | 2.5882 | 1.4958 | 2.6137 | 1.5132 | 2.3413 | 1.4038 | 2.486 | 1.468 | 2.6392 | 1.5299 | 2.6992 | 1.5653 | ||
| (0.6939) | (0.0985) | (0.4657) | (0.0804) | (0.8773) | (0.1160) | (0.7596) | (0.1030) | (0.4178) | (0.0847) | (0.4900) | (0.0922) | (0.6242) | (0.0937) | (0.7770) | (0.1359) | |||
| 2.5 | 2.7706 | 2.6458 | 2.7033 | 2.576 | 2.5877 | 2.4973 | 2.6141 | 2.5268 | 2.3466 | 2.3462 | 2.4768 | 2.4326 | 2.6432 | 2.5553 | 2.6991 | 2.6119 | ||
| (0.6900) | (0.2715) | (0.4012) | (0.1828) | (0.8102) | (0.3223) | (0.7091) | (0.2850) | (0.4132) | (0.2295) | (0.4893) | (0.2311) | (0.6191) | (0.2591) | (0.7557) | (0.3790) |
Table 3.
Means of estimates of the parameters and MSEs (in parentheses) for many estimation methods when n = 100.
| n | α | β | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 100 | 0.75 | 0.75 | 0.7905 | 0.7879 | 0.7879 | 0.7795 | 0.7551 | 0.7564 | 0.7661 | 0.7711 | 0.7583 | 0.7369 | 0.7538 | 0.7499 | 0.7645 | 0.7692 | 0.7724 | 0.7955 |
| (0.0084) | (0.0155) | (0.0080) | (0.0142) | (0.0092) | (0.0203) | (0.0076) | (0.0170) | (0.0060) | (0.0126) | (0.0132) | (0.0228) | (0.0079) | (0.0161) | (0.0085) | (0.0301) | |||
| 1.5 | 0.7987 | 1.5852 | 0.7999 | 1.5907 | 0.7566 | 1.5168 | 0.7723 | 1.5564 | 0.7669 | 1.4838 | 0.753 | 1.4901 | 0.7681 | 1.546 | 0.7784 | 1.6105 | ||
| (0.0090) | (0.0639) | (0.0090) | (0.0613) | (0.0092) | (0.0816) | (0.0074) | (0.0700) | (0.0061) | (0.0508) | (0.0142) | (0.0661) | (0.0079) | (0.0653) | (0.0087) | (0.1264) | |||
| 2.5 | 0.8043 | 2.6443 | 0.7996 | 2.5979 | 0.7569 | 2.528 | 0.7776 | 2.6104 | 0.7728 | 2.4764 | 0.7455 | 2.4468 | 0.7702 | 2.5801 | 0.7832 | 2.7096 | ||
| (0.0095) | (0.1725) | (0.0087) | (0.1414) | (0.0091) | (0.2218) | (0.0074) | (0.1949) | (0.0062) | (0.1360) | (0.0154) | (0.1638) | (0.0080) | (0.1780) | (0.0090) | (0.3627) | |||
| 1.5 | 0.75 | 1.5333 | 0.7633 | 1.513 | 0.7541 | 1.505 | 0.7484 | 1.5138 | 0.7537 | 1.4519 | 0.7236 | 1.4986 | 0.746 | 1.515 | 0.7545 | 1.5222 | 0.7613 | |
| (0.0368) | (0.0080) | (0.0331) | (0.0074) | (0.0474) | (0.0105) | (0.0406) | (0.0089) | (0.0333) | (0.0079) | (0.0484) | (0.0142) | (0.0394) | (0.0086) | (0.0432) | (0.0127) | |||
| 1.5 | 1.5375 | 1.5289 | 1.5258 | 1.5252 | 1.5073 | 1.4979 | 1.5165 | 1.509 | 1.4559 | 1.4494 | 1.5002 | 1.4894 | 1.5179 | 1.5108 | 1.5244 | 1.5234 | ||
| (0.0366) | (0.0323) | (0.0333) | (0.0302) | (0.0469) | (0.0421) | (0.0402) | (0.0359) | (0.0324) | (0.0314) | (0.0454) | (0.0377) | (0.0388) | (0.0346) | (0.0420) | (0.0496) | |||
| 2.5 | 1.5395 | 2.5536 | 1.5159 | 2.5139 | 1.5074 | 2.4993 | 1.5168 | 2.5184 | 1.4581 | 2.4214 | 1.4934 | 2.4698 | 1.5181 | 2.5209 | 1.5255 | 2.5446 | ||
| (0.0366) | (0.0897) | (0.0320) | (0.0784) | (0.0476) | (0.1176) | (0.0405) | (0.0989) | (0.0321) | (0.0859) | (0.0493) | (0.0952) | (0.0393) | (0.0958) | (0.0430) | (0.1430) | |||
| 2.5 | 0.75 | 2.5698 | 0.762 | 2.5745 | 0.7598 | 2.5163 | 0.7483 | 2.5346 | 0.7535 | 2.4107 | 0.7271 | 2.4819 | 0.7417 | 2.5364 | 0.7541 | 2.5475 | 0.7581 | |
| (0.1280) | (0.0061) | (0.1183) | (0.0057) | (0.1686) | (0.0078) | (0.1411) | (0.0067) | (0.1121) | (0.0060) | (0.1507) | (0.0113) | (0.1350) | (0.0064) | (0.1449) | (0.0083) | |||
| 1.5 | 2.573 | 1.5252 | 2.5928 | 1.5315 | 2.5185 | 1.4979 | 2.5371 | 1.5084 | 2.4136 | 1.4552 | 2.492 | 1.4893 | 2.5384 | 1.5093 | 2.551 | 1.5179 | ||
| (0.1286) | (0.0248) | (0.1223) | (0.0238) | (0.1674) | (0.0315) | (0.1410) | (0.0270) | (0.1116) | (0.0244) | (0.1283) | (0.0278) | (0.1345) | (0.0260) | (0.1461) | (0.0341) | |||
| 2.5 | 2.5633 | 2.5348 | 2.5622 | 2.5211 | 2.513 | 2.4925 | 2.5292 | 2.5082 | 2.4046 | 2.4185 | 2.4778 | 2.4683 | 2.5312 | 2.5101 | 2.5452 | 2.5257 | ||
| (0.1279) | (0.0668) | (0.1148) | (0.0603) | (0.1702) | (0.0868) | (0.1425) | (0.0737) | (0.1136) | (0.0674) | (0.1337) | (0.0689) | (0.1368) | (0.0713) | (0.1497) | (0.0946) |
According to Tables 2 and 3, we can say that all estimators are consistent since the MSEs tend to zero for increasing n. According to the MSE criteria, Bayes and MPS estimators give similar results, and they give the best results among all estimators. Generally, the Bayes estimator is better at the estimation of β, and the MPS estimator is better at the estimation of α. According to mean estimations, the best results were obtained by LS and P estimators. Here, the Bayes estimator always overestimates the actual parameter values, but it produces better results than the ML estimator, especially for a small value of n. As a note, we observe that the RAD estimator produces poor results when the parameter values produce a J-shaped distribution.
6. Illustrative examples
To see the performances of the proposed ML and Bayes estimators, we consider two real data sets. We do not only fit the distribution to the data sets, but also other beta-like distributions for comparison.
6.1. Failure times data
We consider the failure times of the light-emitting diode (LED) in a life test. The data set was also analysed by Cheng and Wang [3] and Dey et al. [4] and it consists of the observed failure samples generated under used condition. The data divided by 10 are given by 0.018, 0.019, 0.019, 0.034, 0.036, 0.040, 0.044, 0.044, 0.045, 0.046, 0.047, 0.053, 0.057, 0.057, 0.063, 0.065, 0.070, 0.071, 0.071, 0.075, 0.076, 0.076, 0.079, 0.080, 0.085, 0.098, 0.101, 0.107, 0.112, 0.114, 0.115, 0.117, 0.120, 0.123, 0.124, 0.125, 0.126, 0.132, 0.133 0.133, 0.139, 0.142, 0.150, 0.155, 0.158, 0.159, 0.162, 0.168, 0.170 0.179, 0.200, 0.201, 0.204, 0.254, 0.361, 0.376, 0.465, 0.897 . The skewness and kurtosis measures of the data are given, respectively, by 3.746805 and 20.5975. Thus, the data set is positively skewed and have a high kurtosis. A suitable model defined on is needed for modelling this high kurtosis and positive skewness.
We fit the distribution as well as the ordinary beta and some other beta-like distributions [9,16,24] to these data and calculate the ML estimators first. The estimation results are given in Table 4. For the distribution, it can also be seen the location of the MLE of β in the plot of the function (Figure 2(left)). According to the AIC (Akaike Information Criterion), the OS method gives the best fit among others. We also perform a nonparametric bootstrap Kolmogorov–Smirnov (KS) goodness-of-fit test based on 10,000 replications for the proposed method. It gives the result KS = 0.1846 with p-value 0.7141. This supports the OS fit to the data.
Table 4.
MLE's and AIC values for failure time data set.
| Method | loglik | AIC | ||
|---|---|---|---|---|
| SO | 9.2430 | 1.1806 | 58.5950 | −113.1899 |
| (1.9826) | (0.1117) | |||
| Beta | 1.2351 | 7.1832 | 55.5859 | −107.1718 |
| (0.2059) | (1.4116) | |||
| KUM | 1.0486 | 6.3959 | 54.9236 | −105.8472 |
| (0.1121) | (1.4302) | |||
| LL | 0.8638 | 0.0000 | 45.1558 | −86.3116 |
| (0.1024) | (0.1706) | |||
| TL | 0.5904 | 40.0598 | −78.1197 | |
| (0.0775) |
Figure 2.
Plot of the function and location of the MLE of β for the failure times data (left) and CPI data (right).
The asymptotically confidence intervals for parameters α and β are also given, respectively, by and . Their lengths are given by 6.522028 and 0.3675939.
On the other hand, we run the Metropolis-Hastings-within-Gibbs algorithm by taking , H = 1000, and obtain the Bayes estimates of the parameters as and . The highest posterior density (HPD) credible intervals of the parameters α and β are given, respectively, by and . Their lengths are given by 6.218522 and 0.352833. We observe that these intervals are narrower than those for the ML method.
6.2. Consumer price index data
In economics, there are many indicators defined in interval . For instance, poverty rate, unemployment rate, labour force participation rate, gross domestic product, etc. One of the most important indicators for the economics of a country is the inflation rate. We next consider the consumer price index (CPI) data set of 43 countries reported by OECD [18] on web page https://data.oecd.org/price/inflation-cpi.htm#indicator-chart. Inflation measured by CPI is defined as the change in the prices of a basket of goods and services that are typically purchased by specific groups of households. The data are given in Table 5 for convenience. The data set consists of monthly indices and contains the latest data available by March 2022.
Table 5.
Consumer price index (CPI)(%) data set, March 2022 or latest available.
| Country | CPI | Country | CPI | Country | CPI |
|---|---|---|---|---|---|
| Argentina | 55.11 | Hungary | 8.50 | Poland | 11.0 |
| Austria | 5.85 | Iceland | 6.67 | Portugal | 5.33 |
| Belgium | 8.31 | India | 5.04 | Russia | 9.16 |
| Brazil | 10.54 | Indonesia | 2.64 | Saudi Arabia | 2.04 |
| Canada | 5.69 | Ireland | 6.72 | Slovak Republic | 9.04 |
| Chile | 9.41 | Israel | 3.48 | Slovenia | 5.36 |
| China | 1.50 | Italy | 6.46 | South Africa | 5.66 |
| Colombia | 8.53 | Japan | 0.90 | Spain | 9.82 |
| Czech Republic | 12.74 | Korea | 4.14 | Sweden | 5.97 |
| Denmark | 5.37 | Latvia | 11.48 | Switzerland | 2.39 |
| Estonia | 15.19 | Lithuania | 15.72 | Turkey | 61.14 |
| Finland | 5.80 | Luxembourg | 6.14 | United Kingdom | 6.20 |
| France | 3.63 | Mexico | 7.45 | United States | 8.54 |
| Germany | 7.26 | Netherlands | 9.73 | ||
| Greece | 8.85 | Norway | 4.54 |
The skewness and kurtosis measures of the data are given, respectively, by 3.7575 and 16.6902. Thus, the data set is positively skewed and have a high kurtosis. A suitable model defined on is needed for modelling this high kurtosis and positive skewness.
In this case, we just fit three candidate models to the data set, i.e. the , beta and Kumaraswamy distributions. The estimation results are given in Table 6. For the distribution, it can also be seen the location of the MLE of β in the plot of the function (Figure 2(right)). According to the AIC value, the OS method gives the best fit among others. We also perform a nonparametric bootstrap Kolmogorov–Smirnov goodness-of-fit test based on 10,000 replications for the proposed method. It gives the result KS = 0.2338 with p-value 0.7466. This supports the OS fit to the data.
Table 6.
MLE's and AIC values for CPI data set.
| Method | Loglik | AIC | ||
|---|---|---|---|---|
| SO | 13.0525 | 1.1308 | 58.4559 | −112.9117 |
| (3.4611) | (0.1178) | |||
| Beta | 1.2869 | 11.4678 | 57.1275 | −110.2550 |
| (0.2498) | (2.6417) | |||
| KUM | 1.0497 | 9.9412 | 56.4221 | −108.8443 |
| (0.1223) | (2.7952) |
The asymptotically confidence intervals for parameters α and β are and , respectively. Their lengths are given by 11.3863 and 0.3875. The Bayes estimates of the parameters are given by and . The highest posterior density (HPD) credible intervals of the parameters α and β are given, respectively, by and . Their lengths are given by 11.0635 and 0.3780. We observe that these intervals are slightly narrower than those for the ML method.
7. Conclusions
We showed that the distribution may have many different forms and concluded that it is another flexible beta-like distribution. It has the advantage of having a clear and simple CDF. The ML estimates of the two parameters uniquely exist in the parameter space and this is another advantage of the distribution for the possible practitioners of the distribution. Alternatively, the is also amenable under MCMC sampling methods to get Bayes estimates of the parameters. A simulation study that covers both small and large sample sizes was performed for comparison of the estimation methods proposed here with the existing ones.
We showed in the examples that percentage data sets with high skewness and kurtosis can be modelled best by the distribution among other beta-type ones. We also observed in many applications that when the kurtosis becomes smaller, the distribution performs a little bit better than the Kumaraswamy distribution according to the AIC criteria. The real data demonstrations prove its usefulness in modelling rates and percentages.
From the reliability point of view, the model may be a useful alternative. The existence and uniqueness of the ML estimates will be studied under different censoring schemes of a sample from the distribution as further research.
Supplementary Material
Acknowledgments
The authors are very grateful to two referees and an associate editor for helpful comments that improved the paper.
Disclosure statement
No potential conflict of interest was reported by the authors.
References
- 1.Alsubie A., Akhter Z., Athar H., Alam M., Ahmad A.E., Cordeiro G.M., and Afify A.Z., On the omega distribution: Some properties and estimation, Mathematics 9 (2021), p. 656. [Google Scholar]
- 2.Chen M.H. and Shao Q.M., Monte Carlo estimation of Bayesian credible and HPD intervals,J. Comput. Graph. Stat. 8 (1999), pp. 69–92. [Google Scholar]
- 3.Cheng Y.F and Wang F.K., Estimating the Burr XII parameters in constant-stress partially accelerated life tests under multiple censored data, Commun. Stat. Simul. Comput. 41 (2012), pp. 1711–1727. [Google Scholar]
- 4.Dey S., Wang L., and Nassar M., Inference on Nadarajah–Haghighi distribution with constant stress partially accelerated life tests under progressive type-II censoring, J. Appl. Stat. (in press). doi: 10.1080/02664763.2021.1928014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Dombi J. and Jónás T., On an alternative to four notable distribution functions with applications in engineering and the business sciences, Acta Polytech. Hung. 17 (2020), pp. 231–252. [Google Scholar]
- 6.Dombi J. and Jónás T., Advances in the Theory of Probabilistic and Fuzzy Data Scientific Methods with Applications, Springer, Switzerland, 2021. [Google Scholar]
- 7.Dombi J., Jónás T., Toth E.Z., and Arva G., The omega probability distribution and its applications in reliability theory, Qual. Reliab. Eng. Int. 35 (2019), pp. 600–626. [Google Scholar]
- 8.Ghitany M.E., Al-Jarallah R.A., and Balakrishnan N., On the existence and uniqueness of the MLEs of the parameters of a general class of exponentiated distributions, Statistics 47 (2013), pp. 605–612. [Google Scholar]
- 9.Gómez-Déniz E., Sordo M.A., and Calderín-Ojeda E., The Log-Lindley distribution as an alternative to the beta regression model with applications in insurance, Insur. Math. Econ. 54 (2014), pp. 49–57. [Google Scholar]
- 10.Gradshteyn I.S. and Ryzhik I.M., Table of Integrals, Series, and Products, 8th ed., Academic Press, San Diego, 2015. [Google Scholar]
- 11.Gui W., Exponentiated half logistic distribution: Different estimation methods and joint confidence regions, Commun. Stat. Simul. Comput. 46 (2017), pp. 4600–4617. [Google Scholar]
- 12.Gupta R.D. and Kundu D., Generalized exponential distributions, Aust. N. Z. J. Stat. 41 (1999), pp. 173–188. [Google Scholar]
- 13.Hoff P.D., A First Course in Bayesian Statistical Methods, Springer, New York, 2009. [Google Scholar]
- 14.Hu X. and Gui W., Statistical inference for the exponentiated half logistic distribution based on censored data, J. Comput. Appl. Math. 362 (2019), pp. 219–244. [Google Scholar]
- 15.Jiang H. and Gui W., Bayesian inference for the parameters of Kumaraswamy distribution via ranked set sampling, Symmetry 13 (2021), p.1170. [Google Scholar]
- 16.Kumaraswamy P., A generalized probability density function for double-bounded random processes, J. Hydrol. 46 (1980), pp. 79–88. [Google Scholar]
- 17.Luengo D., Martino L., Bugallo M., Elvira V., and Särkkä S., A survey of Monte Carlo methods for parameter estimation, EURASIP J. Adv. Signal Process. 2020 (2020), p. 25. [Google Scholar]
- 18.OECD , Inflation (CPI) (indicator), 2022. doi: 10.1787/eee82e6e-en (accessed on 21 April 2022). [DOI]
- 19.Okorie I.E. and Nadarajah S., On the omega probability distribution, Qual. Reliab. Eng. Int. 35 (2019), pp. 2045–2050. [Google Scholar]
- 20.Özbilen Ö. and Genç A.İ., A bivariate extension of the omega distribution for two-dimensional proportional data, Math. Slovaca (in press). [Google Scholar]
- 21.Popović B.V., Ristic M.M., and Genç A.İ., Dependence properties of multivariate distributions with proportional hazard rate marginals, Appl. Math. Model. 77 (2020), pp. 182–198. [Google Scholar]
- 22.Seo J.I. and Kang S.B., Notes on the exponentiated half logistic distribution, Appl. Math. Model. 39 (2015), pp. 6491–6500. [Google Scholar]
- 23.Sudsawat M. and Pal N., Some inferential results on a two parameter generalized half normal distribution, Am. J. Math. Manag. (in press). doi: 10.1080/01966324.2021.1959469 [DOI] [Google Scholar]
- 24.Topp C.W. and Leone F.C., A family of J-shaped frequency functions, JASA 50 (1955), pp. 209–219. [Google Scholar]
- 25.Vasileva M.T., Some notes on the omega distribution and the pliant probability distribution family, Algorithms 13 (2020), p. 324. [Google Scholar]
- 26.Zakerzadeh H. and Dolati A., Generalized Lindley distribution, J. Math. Ext. 3 (2010), pp. 13–25. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


