Abstract
We introduce a new three-parameter lifetime distribution, the exponentiated Lindley geometric distribution, which exhibits increasing, decreasing, unimodal, and bathtub shaped hazard rates. We provide statistical properties of the new distribution, including shape of the probability density function, hazard rate function, quantile function, order statistics, moments, residual life function, mean deviations, Bonferroni and Lorenz curves, and entropies. We use maximum likelihood estimation of the unknown parameters, and an Expectation-Maximization algorithm is also developed to find the maximum likelihood estimates. The Fisher information matrix is provided to construct the asymptotic confidence intervals. Finally, two real-data examples are analyzed for illustrative purposes.
Keywords: compounding, Lindley distribution, geometric distribution, maximum likelihood estimation, Expectation-Maximization algorithm, lifetime distribution
1. Introduction
Suppose that a company has N systems functioning independently and producing a certain product at a given time, where N is a random variable determined by economy, customers demand, etc. The reason for considering N as a random variable comes from a practical viewpoint, because failure (of a device for example) often occurs due to the present of an unknown number of initial defects in the system. In this paper, we consider the case in which N is taken to be a geometric random variable with the probability mass function given by
for and n is a positive integer. We may take N to follow other discrete distributions, such as binomial, Poisson, etc, whereas they need to be truncated 0 because one must have . Another rationale by taking N to be a geometric random variable is that the “optimum” number can be interpreted as “number to event”, matching up with the definition of a geometric random variable, as commented by [1]. The geometric distribution has been widely used for the number of “systems” in the literature; see, for example, [2,3]. It has also been adopted to obtain some new class of distributions; see [4] for the exponential geometric (EG) distribution, [5] for the exponentiated exponential geometric (EEG) distribution, [6] for the Weibull geometric distribution, [1] for the geometric exponential Poisson (GEP) distribution, to name just a few.
On the other hand, we assume that each of N systems is made of parallel components, and therefore, the system will completely shutdown if all of the components fail. Meanwhile, we assume that the failure times of the components for the ith system, denoted by , are independent and identically distributed (iid) with the cumulative distribution function (cdf) and the probability density function (pdf) . For simplicity of notation, let stand for the failure time of the ith system and X denote the time to failure of the first out of the N functioning systems, i.e., . Then it can be seen from [5] that the conditional cdf of X given N is given by
and the unconditional cdf of X can thus be written as
| (1) |
The new class of distribution in (1) depends on the cdf of the failure times of the components in the system, which may follow some continuous probability distributions, such as the exponential, Lindley, and Weibull distributions. As an illustration, if the failure times of the components for the ith system are iid exponential random variables with the rate parameter , i.e., , then we obtain the EEG distribution due to [5]. Its cdf is given by
| (2) |
Please note that in reliability engineering and lifetime analysis, we often assume that the failure times of the components within each system follow the exponential lifetimes; see, for example [4,5,7], among others. This assumption may be unreasonable because the hazard rate of the exponential distribution is a constant, whereas some real-life systems may not have constant hazard rates, and the components of a system are often more rigid than the system itself. Accordingly, it becomes reasonable to consider the components of a system following a distribution with a non-constant hazard function that has flexible hazard function shapes.
In this paper, we propose a new three-parameter lifetime distribution by compounding the Lindley and geometric distributions based on the new class of distribution in (1). The Lindley distribution was first proposed by [8] in the context of Bayesian statistics, as a counterexample of fiducial statistics. It has recently received considerable attention as an appropriate model to analyze lifetime data especially in applications modeling stress-strength reliability; see, for example, [9,10,11]. Ghitany et al. [12] argue that the Lindley distribution could be a better lifetime model than the exponential distribution through a numerical example and show that the hazard function of the Lindley distribution does not exhibit a constant hazard rate, indicating the flexibility of the Lindley distribution over the exponential distribution. These observations motivate us to study the structure properties of the distribution in (1) when the failure times of the units for the ith system are iid Lindley random variables with the parameter , i.e.,
| (3) |
where the parameter . Its corresponding cdf is given by
| (4) |
where the parameters , , and . We call the distribution as the exponentiated Lindley geometric (ELG) distribution. Indeed, it is necessary to compute the entropy measure for ELG distribution under the assumption that errors are non-Gaussian distributed (e.g., [13]). Other motivations of the ELG distribution are briefly summarized as follows. (i) It contains several lifetime distributions as special cases, such as the Lindley-geometric (LG) distribution due to [14] when . (ii) It can be viewed as a mixture of exponentiated Lindley distributions introduced by [15]. (iii) The ELG distribution is a flexible model which can be widely used for modeling lifetime data in reliability and survival analysis. (iv) It exhibits monotonically increasing, decreasing, unimodal (upper-down bathtub), and bathtub shaped hazard rates but does not exhibit a constant hazard rate, which makes the ELG distribution to be superior to other lifetime distributions, which exhibit only monotonically increasing/decreasing, or constant hazard rates.
The remainder of the paper is organized as follows. In Section 2, we discuss various statistical properties of the new distribution. The maximum-likelihood estimation is considered in Section 3, and an EM algorithm is proposed to find the maximum likelihood estimates because they cannot be obtained in closed form. The maximum-likelihood estimation for censored data is also discussed briefly. In Section 4, two real-data applications are provided for illustrative purposes. Some concluding remarks are given in Section 5.
2. Properties of the ELG distribution
We provide statistical properties of the ELG distribution. These include the pdf and its shape (Section 2.1), hazard rate function and its shape (Section 2.2), quantile function (Section 2.3), order statistics (Section 2.4), expressions for the nth moments (Section 2.5), residual life function (Section 2.6), mean deviations (Section 2.7), Bonferroni and Lorenz curves (Section 2.8), and entropies (Section 2.9).
2.1. Probability Density function
The corresponding pdf of the ELG distribution corresponding to the cdf in (4) is given by
| (5) |
for , , , and .
It should be noted that the pdf in (5) is still a well-defined density function when . Thus, we can define the ELG distribution in (5) to any . As mentioned in Section 1, the ELG distribution includes several special submodels. When , it becomes the LG distribution due to [14]. When and , it turns out to be the Lindley distribution due to [8]. It converges a distribution degenerating at the point 0 when .
Figure 1 displays the pdf of the ELG distribution in (5) with selected values of , and p. We observe from Figure 1 that the shape of the pdf is monotonically decreasing with the modal value of ∞ at when and the shape of the pdf appears upside-down bathtub for . When , we observe that the shape exhibits monotonically decreasing as well as unimodal. This observation coincides with Theorem 1 of [14], which states that the density function of the LG distribution is (i) decreasing for all values of p and θ for which , (ii) unimodal for all values of p and θ for which .
Figure 1.
Plots of the pdf of the ELG distribution for different values of , and p.
2.2. Hazard Rate Function
The failure rate function, also known as the hazard rate (hf) function, is an important characteristic for lifetime modeling. For a continuous distribution with the cdf and the pdf , its failure rate function is defined as
where is the survival function of X. The hf of the ELG distribution is given by
| (6) |
for , , , and .
Figure 2 depicts shapes of the hf with selected values of , and p. We observe that the hf of the ELG distribution is quite flexible. For example, the shape appears monotonically decreasing if is sufficiently small and p is not sufficiently large. The shape appears monotonically increasing for small p and large . The shape appears bathtub-shaped or first increases then bathtub-shaped for . We may conclude that the ELG distribution exhibits increasing, decreasing, upside-down bathtub, and bathtub shaped failure hazard rates, but does not exhibit a constant hazard rate.
Figure 2.
Plots of the hf of the ELG distribution for different values of , and p.
Note also that as , the initial hf behaves as , which implies that for , for , and for .
2.3. Quantile Function
Let Z denote a Lindley random variable with the cdf in (3). We observe from [16] that the quantile function of the Lindley distribution is
| (7) |
where and denotes the negative branch of the Lambert W function (i.e., the solution of the equation ), which can be calculated by using the Lambert-W function in the R package lamW; see [17] in detail.
Let X be a ELG random variable with the cdf in (4). By inverting for , we obtain
It follows from Equation (7) that the quantile function of the ELG distribution is given by
| (8) |
Please note that , so the is unique, which implies that is also unique. Thus, one can use Equation (8) for generating random data from the ELG distribution. In particular, the quartiles of the ELG distribution, respectively, are given by
2.4. Order Statistics
Suppose is a random sample from the ELG distribution. Let be the corresponding order statistics. The pdf for the rth order statistic of the ELG distribution, say , is given by
The corresponding cdf of Y is given by
In practice, we may be interested in studying the asymptotic distribution of the extreme values and . By using L’Hospital’s rule, we have
In addition, by using L’Hospital’s rule, it can be easily shown that
By following Theorem 1.6.2 in [18] we observe that there must be some normalizing constants , , , and , such that
and
as . The form of the normalizing constants can be determined by using Corollary 1.6.3 in [18]. As an illustration, one can see that and , where denotes the inverse function of .
2.5. Moment Properties
Many important features of a distribution can be characterized through its moments, such as dispersion, skewness, and kurtosis. To derive the nth moment of the ELG distribution, we consider the Taylor series expansion of the form
| (9) |
which converges for . This provides that
| (10) |
where is the generalized binomial coefficient. Therefore, we can rewrite Equation (4) as
| (11) |
We observe that the ELG distribution is a mixture of exponentiated Lindley distributions introduced by [15], i.e.,
which is convergent for . They show that if Y is an exponentiated Lindley random variable with parameters and , the nth moment and the moment generating function of Y are, respectively, given by
and
for , where
By using Equation (11), we obtain the nth moment of X can be rewritten as
| (12) |
for . Equation (12) can be adopted to compute the third and fourth central moments of the ELG distribution, which are then used to define skewness and kurtosis, respectively. For instance, based on the first four moments of the ELG distribution, the measures of skewness and kurtosis of the ELG distribution are, respectively, given by
and
The moment generating function of the ELG distribution, denoted by , is given by
Thereafter, we can use to obtain the nth moment about zero of the ELG distribution. In particular, if , then Equation (11) can be simplified to
| (13) |
The corresponding nth moment of X can be simplified as
| (14) |
for , and the moment generating function of the ELG distribution is given by
2.6. Residual Life Function
Given that a component of a system survives up to time , the residual life will be the period beyond t until the time of failure occurs in the system and is thus defined by the conditional random variable . The mean residual life plays an important role in survival analysis and reliability of characterizing lifetime, because it can be used to determine a unique corresponding lifetime distribution. The rth moment of the residual life of the ELG distribution can be obtained by the general formula
| (15) |
where is the survival function defined before. Noting that the ELG distribution is a mixture of exponentiated Lindley distributions, we may calculate by using the expression in Lemma 2 of [15], which is given by
where represents the complementary incomplete gamma function. Let X be an ELG random variable. By using the Taylor series expansion in (9), it can be easily shown that
From the binomial expansion for , we get that the rth order moment of the residual life of the ELG distribution is given by
The mean and variance of the residual life function of the ELG distribution can be easily obtained using and , and are not shown here for simplicity. In a similar way as done for Equation (13), it can be shown that if , then
| (16) |
and the rth order moment of the residual life of the ELG distribution can be written as
2.7. Mean Deviations
We consider the totality of deviations from the mean and median and the mean deviation from the mean, which is often used to estimate the amount of scatter in a population. The mean deviation is a more robust statistic to outliers in the data set than the standard deviation and the mean deviation from the median is a measure of statistical dispersion, which is a more robust statistic to outliers than the sample variance or standard deviation.
Let X denote a random variable with the pdf , the cdf , mean , and median M. The mean deviation about the mean and the mean deviation about the median are defined by
and
respectively. Of particular note is that when , the mean deviations above can be further simplified as
and
2.8. Bonferroni and Lorenz Curves
The Bonferroni and Lorenz curves (Bonferroni 1930) have many practical applications not only in economics and poverty, but also in other fields like reliability, lifetime testing, insurance, and medicine. For a random variable X with cdf , the Bonferroni and Lorenz curves are defined by
| (17) |
where , and
| (18) |
respectively. If X is an ELG random variable with the pdf in (5), we observe Equation (17) can be written as
which is obtained by using Equation (16) with and . By using Equation (18), it follows easily that the Lorenz curve of the ELG distribution is given by .
2.9. Entropies
It is well known that an entropy of a random variable X is a measure of variation of the uncertainty. The Rényi entropy is defined as
where and . The Shannon entropy is defined as E, which is a particular case of the Rényi entropy as . We first observe that
which shows that the Rényi entropy of the ELG distribution is given by
It can be shown that the Shannon entropy of the ELG distribution is given by
which can be easily evaluated using a unidimensional integral. Figure 3 depicts shapes of the Shannon entropy of the ELG distribution with several selected values of and p. It deserves mentioning that the entropy measure of the ELG distribution can be estimated by using numerical integration methods with the (plug-in) estimators found in the following section.
Figure 3.
Plots of the Shannon entropy of the ELG distribution for different values of , and p.
3. Estimation of Parameters
We adopt the maximum likelihood estimation to estimate the unknown parameters (Section 3.1) and develop an Expectation-Maximization (EM) algorithm to find the maximum likelihood estimate (MLE) (Section 3.2). We also discuss the MLEs of the unknown parameters when the data is censored (Section 3.3).
3.1. Maximum Likelihood Estimation
It is well known that the MLE is often used to estimate the unknown parameter of a distribution because of its attractive properties, such as consistency, asymptotic normality, etc. Let be a random sample from the ELG distribution with unknown parameter vector . Then the log-likelihood function is given by
| (19) |
For notational convenience, let
for . The MLEs of the unknown parameters can be obtained by taking the first partial derivatives of Equation (19) with respect to , , and p and putting them equal to 0. We have the following likelihood equations
| (20) |
| (21) |
| (22) |
Please note that the MLEs, respectively , and of , and p cannot be solved analytically. Numerical iteration techniques, such as the Newton-Raphson algorithm, are required to solve these equations, whereas the second derivatives of the log-likelihood are required for all iterations involved in numerical iteration techniques. We thus develop an EM algorithm to estimate the MLEs of the unknown parameters.
For interval estimation of the parameters, we consider suitable pivotal quantities based on the asymptotic properties of the MLEs and approximate the distributions of these quantities by the normal distribution. We observe that
where for . The observed Fisher information matrix of , , and p can be written as
so the variance-covariance matrix of the MLEs , and may be approximated by inverting the matrix and is thus given by
The asymptotic joint distribution of the MLEs , , and can be treated as being approximately multivariate normal and is given by
| (23) |
Since involves the unknown parameters , , and p, we replace these parameters by their corresponding MLEs to obtain an estimate of denoted by
The asymptotic confidence intervals of , , and p are determined by
respectively, where is the upper pth percentile of the standard normal distribution.
The likelihood ratio (LR) can be used to evaluate the difference between the ELG distribution and its special submodels. We partition the parameters of the ELG distribution into , where is the parameter of interest and is the remaining parameters. Consider the hypotheses
| (24) |
The LR statistic for the test of the null hypothesis in (24) is given by
| (25) |
where and are the restricted and unrestricted maximum likelihood estimators under and , respectively. Under , it follows
| (26) |
where denotes convergence in distribution as and is the dimension of the subset of interest. For instance, we can compare the ELG and LG distributions by testing versus . The ELG and Lindley distributions are compared by testing versus .
3.2. Expectation-Maximization Algorithm
Dempster et al. [19] introduce an EM algorithm to estimate the parameters when some observations are treated as incomplete data. Suppose that and represent the observed and hypothetical data, respectively. Here, the hypothetical data can be thought of as missing data because are not observable. We formulate the problem of finding the MLEs as an incomplete data problem, and thus, the EM algorithm is applicable to determine the MLEs of the ELG distribution. Let denote the complete data. To start this algorithm, define the pdf of each for as
The E-step of an EM cycle requires the conditional expectation of , where is the current estimate of in the rthe iteration. Please note that the pdf of Z given X, say , is given by
Thus, the conditional expectation is given by
The log-likelihood function of the complete data after ignoring the constants can be written as
| (27) |
Next the M-step involves the maximization of the pseudo log-likelihood function in (27). The components of the score function are given by
For notational convenience, let
for . By replacing the missing Z’s with their conditional expectations , we obtain an iterative procedure of the EM algorithm given by the following equations.
| (28) |
| (29) |
where
for . Please note that some efficient numerical methods, such as the Newton-Raphson algorithm, are only needed for solving Equations (28) and (29).
3.3. Censored Maximum Likelihood Estimation
Censored data often occur in lifetime data analysis. Several popular mechanisms of censoring, such as type-I censoring and type-II censoring, have received much attention in the literature. The survival function of the ELG distribution has a simple closed-form expression, and therefore, it can be used in analyzing lifetime data in the presence of censoring. We briefly discuss the general case of multicensored data. Suppose that subjects of which
is known to have failed at the times ,
is known to have failed into the interval for ,
is known to have survived at a time for but not observed any longer.
Please note that Type-I censoring and Type-II censoring are contained as particular cases of multicensoring above. The log-likelihood function of of the ELG distribution for this multicensoring takes the form
It is straightforward to derive the first derivatives of the log-likelihood function with respect to the three unknown parameters , , and p. Thereafter, the MLEs of the unknown parameters can be obtained by setting the first derivatives equal to zero, i.e.,
Please note that the Newton-Raphson algorithm or other optimization algorithms may be employed to solve the above system of equations, because the MLEs of the unknown parameters cannot be obtained in closed-forms. Finally, the corresponding information matrix for is too complicated to be presented here.
4. Two Real-Data Applications
In this section, we illustrate the applicability of the ELG distribution using two real-data examples. We use the same data sets to compare the ELG distribution with the Gamma, Weibull, Lindley geometric (LG), Weibull geometric (WG) distributions, whose densities are given by
-
(i) Gamma
-
(ii) Weibull
-
(iii) LG
-
(iv) WG
for , respectively. To compare the ELG distribution with the four distributions listed above, we advocate the Akaike information criterion (AIC), the Bayesian information criterion (BIC), and the AIC with a correction (AICc) for the two-real data sets. In addition, we apply two formal goodness-of-fit tests: the Cramér-von Mises () and Anderson-Darling () statistics to further verify which distribution fits better to the data; see, for example, [5,20], among others. The smaller the value of the considered criterion, the better the fit to the data.
The first data set is about the remission time (in months) of a random sample of 128 bladder cancer patients. This data set presented in Table 1 was studied by [21] in fitting the extended Lomax distribution and [22] for the modified Weibull geometric distribution. Table 2 shows the MLEs of the parameters, AIC, BIC, and AICc for the ELG, Gamma, Weibull, LG, and WG distributions for the first data set. We observe from Table 2 that the ELG distribution and its special case LG provide an improved fit over other distributions that are commonly used for fitting lifetime data. The plots of the fitted probability density and survival function are also shown in Figure 4. Please note that the density and survival functions of the ELG distribution seem to be better than Gamma, Weibull, and WG density and survival functions. In addition, we observe from the values of goodness-of-fit tests in Table 3 that the ELG distribution fits the current data better than other distributions under consideration.
Table 1.
The first data set: the remission time (in months) of a random sample of 128 bladder cancer patients.
| 0.08 | 2.09 | 3.48 | 4.87 | 6.94 | 8.66 | 13.11 | 23.63 | 0.20 | 2.23 |
| 3.52 | 4.98 | 6.97 | 9.02 | 13.29 | 0.40 | 2.26 | 3.57 | 5.06 | 7.09 |
| 9.22 | 13.80 | 25.74 | 0.50 | 2.46 | 3.64 | 5.09 | 7.26 | 9.47 | 14.24 |
| 25.82 | 0.51 | 2.54 | 3.70 | 5.17 | 7.28 | 9.74 | 14.76 | 26.31 | 0.81 |
| 2.62 | 3.82 | 5.32 | 7.32 | 10.06 | 14.77 | 32.15 | 2.64 | 3.88 | 5.32 |
| 7.39 | 10.34 | 14.83 | 34.26 | 0.90 | 2.69 | 4.18 | 5.34 | 7.59 | 10.66 |
| 15.96 | 36.66 | 1.05 | 2.69 | 4.23 | 5.41 | 7.62 | 10.75 | 16.62 | 43.01 |
| 1.19 | 2.75 | 4.26 | 5.41 | 7.63 | 17.12 | 46.12 | 1.26 | 2.83 | 4.33 |
| 5.49 | 7.66 | 11.25 | 17.14 | 79.05 | 1.35 | 2.87 | 5.62 | 7.87 | 11.64 |
| 17.36 | 1.40 | 3.02 | 4.34 | 5.71 | 7.93 | 11.79 | 18.10 | 1.46 | 4.40 |
| 5.85 | 8.26 | 11.98 | 19.13 | 1.76 | 3.25 | 4.50 | 6.25 | 8.37 | 12.02 |
| 2.02 | 3.31 | 4.51 | 6.54 | 8.53 | 12.03 | 20.28 | 2.02 | 3.36 | 6.76 |
| 12.07 | 21.73 | 2.07 | 3.36 | 6.93 | 8.65 | 12.63 | 22.69 |
Table 2.
MLEs of the fitted models, AIC, BIC, and AICc for the first data set.
| Model | Parameters | AIC | BIC | AICc | ||
|---|---|---|---|---|---|---|
| Gamma | = 0.1252 | 830.7356 | 836.4396 | 830.8316 | ||
| Weibull | = 1.0478 | = 9.5607 | 832.1738 | 837.8778 | 832.2698 | |
| LG | = 0.0742 | 823.1859 | 833.742 | 823.2819 | ||
| WG | = 1.6042 | = 0.0286 | = 0.9362 | 826.1842 | 834.7403 | 826.3777 |
| ELG | = 1.0792 | = 0.0699 | = 0.9204 | 824.6214 | 833.1775 | 824.8149 |
Figure 4.
Plots of the estimated density and survival function of the fitted models for the first data set.
Table 3.
Goodness-of-fit tests for the first data set.
| Statistic | ||
|---|---|---|
| Model | ||
| Gamma | 0.11988 | 0.71928 |
| Weibull | 0.13136 | 0.78643 |
| LG | 0.05374 | 0.33827 |
| WG | 0.01493 | 0.09939 |
| ELG | 0.01389 | 0.09498 |
As mentioned in Section 3.1, we can adopt the LR statistic to compare between the ELG distribution and its special submodels. For example, the LR statistic for testing between the LG and ELG distributions (i.e., versus ) is and the corresponding p-value is . Thus, we fail to reject and conclude that there is no statistical difference between the fits to this data using the ELG and its submodel LG. This is quite reasonable because the estimate of in the ELG model is , which is close to 1 in the LG model.
In the second data set, we consider the waiting time (in minutes) before service of 100 bank customers. The data are presented in Table 4. This data set was used by [12] in fitting the Lindley distribution. Table 5 shows the MLEs of the parameters, AIC, BIC, and AICc for the ELG, Gamma, Weibull, LG, and WG distributions for the second data set. Table 5 indicates that the ELG distribution is still a strong competitor to other lifetime distributions. In addition, the plots of the fitted probability density and survival function are shown in Figure 5. Please note that the ELG and WG distributions perform identically and that the empirical and fitted five survival curves almost overlap for this data set, supporting that the ELG distribution fits this data at least as good as the four alternative distributions. In addition, we observe from the values of goodness-of-fit tests in Table 6 that the ELG distribution fits the current data better than the Gamma, Weibull, and LG distributions and is comparable with the WG distribution.
Table 4.
The second data set: the waiting time (in minutes) before service of 100 bank customers.
| 0.8 | 0.8 | 1.3 | 1.5 | 1.8 | 1.9 | 1.9 | 2.1 | 2.6 | 2.7 |
| 2.9 | 3.1 | 3.2 | 3.3 | 3.5 | 3.6 | 4.0 | 4.1 | 4.2 | 4.2 |
| 4.3 | 4.3 | 4.4 | 4.4 | 4.6 | 4.7 | 4.7 | 4.8 | 4.9 | 4.9 |
| 5.0 | 5.3 | 5.5 | 5.7 | 5.7 | 6.1 | 6.2 | 6.2 | 6.2 | 6.3 |
| 6.7 | 6.9 | 7.1 | 7.1 | 7.1 | 7.1 | 7.4 | 7.6 | 7.7 | 8.0 |
| 8.2 | 8.6 | 8.6 | 8.6 | 8.8 | 8.8 | 8.9 | 8.9 | 9.5 | 9.6 |
| 9.7 | 9.8 | 10.7 | 10.9 | 11.0 | 11.0 | 11.1 | 11.2 | 11.2 | 11.5 |
| 11.9 | 12.4 | 12.5 | 12.9 | 13.0 | 13.1 | 13.3 | 13.6 | 13.7 | 13.9 |
| 14.1 | 15.4 | 15.4 | 17.3 | 17.3 | 18.1 | 18.2 | 18.4 | 18.9 | 19.0 |
| 19.9 | 20.6 | 21.3 | 21.4 | 21.9 | 23.0 | 27.0 | 31.6 | 33.1 | 38.5 |
Table 5.
MLEs of the fitted models, AIC, BIC, and AICc for the second data set.
| Model | Parameters | AIC | BIC | AICc | ||
|---|---|---|---|---|---|---|
| Gamma | = 0.2033 | 638.6002 | 643.8106 | 638.724 | ||
| Weibull | = 1.4585 | = 10.9553 | 641.4614 | 646.6717 | 641.5851 | |
| LG | = 0.2027 | 641.8269 | 647.0372 | 641.9506 | ||
| WG | = 1.9789 | = 0.0501 | = 0.82132 | 639.9084 | 647.7239 | 640.1584 |
| ELG | = 1.4602 | = 0.1725 | = 0.5385 | 640.3108 | 648.1263 | 640.5608 |
Figure 5.
Plots of the estimated density and survival function of the fitted models for the second data set.
Table 6.
Goodness-of-fit tests for the second data set.
| Statistic | ||
|---|---|---|
| Model | ||
| Gamma | 0.02761 | 0.18225 |
| Weibull | 0.06294 | 0.39624 |
| LG | 0.05374 | 0.33827 |
| WG | 0.01706 | 0.12365 |
| ELG | 0.01801 | 0.12665 |
5. Concluding Remarks
In this paper, we introduced the exponentiated Lindley geometric distribution, which generalizes the LG distribution due to [14] and the Lindley distribution proposed by [23]. We have studied various statistical properties of the new distribution. Estimations of the unknown parameters of the distribution are discussed based on the maximum likelihood estimation and an EM algorithm is provided for estimating the parameters. In an ongoing project, we study and Bayesian inference of these parameters and results will be reported elsewhere.
Acknowledgments
We greatly are very grateful to the three anonymous reviewers for their constructive comments and suggestions that led to a significant improvements of this paper. An early version of this paper is available in [24].
Author Contributions
M.W. and B.P. initiated and carried out the study. MW drafted the manuscript. Z.X. and B.P. participated in the data analysis and discussion. All authors read and approved the final manuscript.
Funding
The first author was partially supported by Scientific Innovation Program of Sichuan Province (Major Engineering Project: 2018RZ0093), Nanchong Scientific Council (Strategic Cooperation Program between University and City: NC17SY4020).
Conflicts of Interest
The authors declare that they have no competing interests.
References
- 1.Nadarajah S., Cancho V., Ortega E.M. The geometric exponential Poisson distribution. Stat. Methods Appl. 2013;22:355–380. doi: 10.1007/s10260-013-0230-y. [DOI] [Google Scholar]
- 2.Conti M., Gregori E., Panzieri F. Load distribution among replicated web servers: A QoS-based approach. SIGMETRICS Perform. Eval. Rev. 2000;27:12–19. doi: 10.1145/346000.346004. [DOI] [Google Scholar]
- 3.Fricker C., Gast N., Mohamed H. Mean field analysis for inhomogeneous bike sharing systems; Proceedings of the DMTCS; Montreal, QC, Canada. 18–22 June 2012; pp. 365–376. [Google Scholar]
- 4.Adamidis K., Loukas S. A lifetime distribution with decreasing failure rate. Stat. Probab. Lett. 1998;39:35–42. doi: 10.1016/S0167-7152(98)00012-1. [DOI] [Google Scholar]
- 5.Rezaei S., Nadarajah S., Tahghighnia N. A new three-parameter lifetime distribution. Statistics. 2013;47:835–860. doi: 10.1080/02331888.2011.627587. [DOI] [Google Scholar]
- 6.Barreto-Souza W., de Morais A.L., Cordeiro G.M. The Weibull-geometric distribution. J. Stat. Comput. Simul. 2011;81:645–657. doi: 10.1080/00949650903436554. [DOI] [Google Scholar]
- 7.Pararai M., Warahena-Liyanage G., Oluyede B.O. Exponentiated power Lindley-Poisson distribution: Properties and applications. Commun. Stat. Theory Methods. 2017;46:4726–4755. doi: 10.1080/03610926.2015.1076473. [DOI] [Google Scholar]
- 8.Lindley D.V. Fiducial distributions and Bayes’ theorem. J. R. Stat. Soc. Ser. B. Methodol. 1958;20:102–107. doi: 10.1111/j.2517-6161.1958.tb00278.x. [DOI] [Google Scholar]
- 9.Gupta P., Singh B. Parameter estimation of Lindley distribution with hybrid censored data. Int. J. Syst. Assur. Eng. Manag. 2012;4:378–385. doi: 10.1007/s13198-012-0120-y. [DOI] [Google Scholar]
- 10.Mazucheli J., Achcar J.A. The Lindley distribution applied to competing risks lifetime data. Comput. Methods Progr. Biomed. 2011;104:188–192. doi: 10.1016/j.cmpb.2011.03.006. [DOI] [PubMed] [Google Scholar]
- 11.Zakerzadeh Y., Dolati A. Generalized Lindley distribution. J. Math. Ext. 2009;3:13–25. [Google Scholar]
- 12.Ghitany M.E., Atieh B., Nadarajah S. Lindley distribution and its application. Math. Comput. Simul. 2008;78:493–506. doi: 10.1016/j.matcom.2007.06.007. [DOI] [Google Scholar]
- 13.Arellano-Valle R.B., Contreras-Reyes J.E., Stehlík M. Generalized skew-normal negentropy and its application to fish condition factor time series. Entropy. 2017;19:528. doi: 10.3390/e19100528. [DOI] [Google Scholar]
- 14.Zakerzadeh H., Mahmoudi E. A new two parameter lifetime distribution: Model and properties. arXiv. 20121204.4248 [Google Scholar]
- 15.Nadarajah S., Bakouch H.S., Tahmasbi R. A generalized Lindley distribution. Sankhya B. 2011;73:331–359. doi: 10.1007/s13571-011-0025-9. [DOI] [Google Scholar]
- 16.Jodrá P. Computer generation of random variables with lindley or Poisson-Lindley distribution via the lambert W function. Math. Comput. Simul. 2010;81:851–859. doi: 10.1016/j.matcom.2010.09.006. [DOI] [Google Scholar]
- 17.Adler A. lamW: Lambert-W Function; R package version 1.3.0. [(accessed on 18 May 2019)];2017 Available online: https://cran.r-project.org/web/packages/lamW/lamW.pdf.
- 18.Leadbetter M.R., Lindgren G., Rootzén H. Extremes and Related Properties of Random Sequences and Processes. Springer; New York, NY, USA: 1983. (Springer Series in Statistics). [Google Scholar]
- 19.Dempster A.P., Laird N.M., Rubin D.B. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B (Methodol.) 1977;39:1–38. doi: 10.1111/j.2517-6161.1977.tb01600.x. [DOI] [Google Scholar]
- 20.Chen G., Balakrishnan N. A general purpose approximate goodness-of-fit test. J. Qual. Technol. 1995;27:154–161. doi: 10.1080/00224065.1995.11979578. [DOI] [Google Scholar]
- 21.Lemonte A.J., Cordeiro G.M. An extended Lomax distribution. Statistics. 2013;47:800–816. doi: 10.1080/02331888.2011.568119. [DOI] [Google Scholar]
- 22.Wang M., Elbatal I. The modified Weibull geometric distribution. METRON. 2015;73:303–315. doi: 10.1007/s40300-014-0052-1. [DOI] [Google Scholar]
- 23.Biçer C. Statistical inference for geometric process with the power Lindley distribution. Entropy. 2018;20:723. doi: 10.3390/e20100723. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Wang M. A new three-parameter lifetime distribution and associated inference. arXiv. 20131308.4128 [Google Scholar]





