Abstract
In the statistical literature, several discrete distributions have been developed so far. However, in this progressive technological era, the data generated from different fields is getting complicated day by day, making it difficult to analyze this real data through the various discrete distributions available in the existing literature. In this context, we have proposed a new flexible family of discrete models named discrete odd Weibull-G (DOW-G) family. Its several impressive distributional characteristics are derived. A key feature of the proposed family is its failure rate function that can take a variety of shapes for distinct values of the unknown parameters, like decreasing, increasing, constant, J-, and bathtub-shaped. Furthermore, the presented family not only adequately captures the skewed and symmetric data sets, but it can also provide a better fit to equi-, over-, under-dispersed data. After producing the general class, two particular distributions of the DOW-G family are extensively studied. The parameters estimation of the proposed family, are explored by the method of maximum likelihood and Bayesian approach. A compact Monte Carlo simulation study is performed to assess the behavior of the estimation methods. Finally, we have explained the usefulness of the proposed family by using two different real data sets.
Keywords: Discrete distributions, dispersion index, Bayesian method, maximum likelihood method, L-moment statistics, odd Weibull-G family, simulation
2010 Mathematics Subject Classifications: 62E15, 62F10, 62F15
1. Introduction
The Weibull (W) model is a well-known continuous distribution and has been used extensively over the last several decades for modelling data in many areas, especially in engineering, reliability, and biological fields. It is commonly applied for modellig monotone hazard rates. The cumulative distribution function (CDF) of W distribution can be presented as
| (1) |
where λ and β are the scale and shape parameters, respectively. Because of the flexibility of the W distribution, many statisticians proposed several modifications by introducing one or more parameters to the baseline model in Equation (1). For instance, Lai et al. [43], Pal et al. [52], Lee et al. [44], Silva et al. [62], Pinho et al. [55], Almalki and Yuan [7], Sarhan and Apaloo [61], Nofal et al. [51], Al-Babtain et al. [4], Abd EL-Baset and Ghazal [1], and references cited therein.
Alzaatreh et al. [8] developed a new approach in which they used a general form to produce a new family, called the transformed-transformer family. Bourguignon et al. [12] utilized this technique to introduce a new flexible family of continuous distributions and it is known as odd Weibull-G (OW-G) family. The CDF of OW-G family is given by
| (2) |
where is a vector of model parameters ( ) and is the CDF of the baseline distribution. For more details about the generator , see Cooray [17]. Due to the flexibility of this generator, several authors used it for producing flexible families which can be utilized for modelling different types of data. For instance, Tahir et al. [63], Korkmaz and Genç [40], Korkmaz [37], Hamedani et al. [30], Korkmaz et al. [38], El-Morshedy and Eliwa [24], Korkmaz et al. [38], Reyad et al. [58], Bakouch et al. [9], Nascimento et al. [47], Alizadeh et al. [5,6], Korkmaz et al. [39], Afify and Alizadeh [2], Cordeiro et al. [16], and references cited therein.
The concept of discretization generally emerges when it becomes impossible or inconvenient to record the lifetime of a product/device on a continuous scale. These circumstances may appear when the life length needs to be recorded on a discrete scale rather than on a continuous analogue. For example, in survival analysis, the survival times for those having the diseases like lung cancer, or the period from remission to relapse may be recorded as a number of days/weeks; in engineering systems, the number of successful cycle prior to the failure when device work in cycle, the number of times a device is switched on/off. Moreover, in many practical problems, the count phenomenon occurs as, for example, the number of occurrences of earthquakes in a calendar year, the number of absences, the number of accidents, the number of kinds of species in ecology, the number of insurance claims, and so on. Therefore, we can easily infer that it is rational and appropriate to model such situations by a suitable discrete distribution.
Since classical discrete models like Binomial, Poisson, Negative Binomial, and Geometric were not sufficient to analyze different types of discrete data, therefore, Roy [59] proposed a new technique to develop a new discrete distribution by using the survival function of any continuous distribution. This approach is named as survival discretization method (Chakraborty [13]). The one of the important virtue of this methodology is that, the developed discrete model keeps the same form of the survival function as that of its continuous counterpart. Due to this feature many reliability characteristics of the distribution remain unchanged. In recent years this method has been widely used to develop several discrete models. For instance, Roy [60], Inusah and Kozubowski [35], Ghitany and Al-Mutairi [27], Krishna and Pundir [41], Gómez-Déniz [28], Gómez-Déniz and Calderín-Ojeda [29], Bebbington et al. [11], Nekoukhou et al. [48], Bakouch et al. [10], Nekoukhou and Bidram [49,50], Alamatsaz et al. [3], Chandrakant et al. [14], Kus et al. [42], Tyagi et al. [64–66], Eliwa et al. [18,19], El-Morshedy et al. [22,23], Eliwa and El-Morshedy [21], and the references included in the cited articles.
In view of the existing literature, we found that although several discrete models have been proposed over the past few decades, we are still facing a lack of more admirable discrete distributions that adequately capture the diversity of real data sets. Such phenomenon motivates us to provide a flexible family of discrete models for analyzing a broad spectrum of discrete real-world data sets. Therefore, in present article, we propose a family of discrete uni-variate distributions with two additional parameters using survival discretization method. The main objectives of proposing DOW-G family are as follows:
To produce discrete models that not only fit a positively skewed, a negatively skewed, or a symmetric shaped data set, they are also capable enough for fitting equi-, under-, and over-dispersed real data.
To develop discrete distributions whose failure rate functions can take diverse shapes for different values of parameters (eg. increasing, decreasing, constant, J- and bathtub-shaped).
To generate models for modelling probability distribution of count data.
To produce consistently superior fits than other developed discrete distributions with the same baseline model and other popular discrete distributions in the existing literature.
The remaining parts of this article are as follows: Section 2 introduces the DOW-G family. Some statistical characteristics of the DOW-G family are derived in Section 3. Two special models of the proposed family are extensively studied in Section 4. The method of maximum likelihood and the Bayesian approach is used to estimate the unknown parameters of the DOW-G family in Section 5. An extensive simulation study is conducted to investigate the behaviour of different estimation methods in Section 6. Section 7 contains two applications to real data sets. Finally, some important remarks about the presented study are discussed in Section 8.
2. Synthesis of the family
The random variable (RV) X is said to follow the DOW-G family if its CDF is of the form
| (3) |
where , , and . The reliability function (RF) of the DOW-G family is given by
| (4) |
Suppose be independent and identically distributed (IID) integer valued RVs and , then Z ∼ DOW-G provided ( ) ∼ DOW-G family where
| (5) |
The probability mass function (PMF) and hazard rate function (HRF) corresponding to Equation (3) can be expressed as
| (6) |
and
| (7) |
respectively.
3. Distributional properties of the DOW-G family
3.1. Quantile function QF
Under the DOW-G family, the qth QF, say , is the solution of , then
| (8) |
where and denotes the baseline QF. By putting q = 0.5, we can obtain the median of the proposed family. To study the effect of shape parameters on skewness and kurtosis, one can use . The Bowley skewness and Moors kurtosis based on the quantiles can be obtained as
3.2. Moments, dispersion index, skewness and kurtosis
Suppose a RV DOW-G , then the rth moment of the RV X can be proposed by
| (9) |
where . Using the expression of Equation (9), the mean ( ) and variance can be, respectively, formulated as
| (10) |
The dispersion index (DsI) is defined as the variance / |mean|, it determines that whether a given distribution is suitable for equi-, under- or over- dispersed data sets. The DsI is widely used in ecology as a standard measure for measuring repulsion (under dispersion) or clustering (over dispersion). If DsI (DsI ) the distribution is under-dispersed (over-dispersed), whereas the distribution is equi-dispersed at DsI .
The moment generating function (MGF) of the proposed family can be formulated as
| (11) |
The first four partial derivatives of , with respect to t at t = 0, produce the first four raw moments about the origin, i.e. . Furthermore, by using Equations (9) or (11), the skewness and kurtosis based on moments can be computed as skewness variance and kurtosis variance .
3.3. Mean of excess over threshold and tail value at risk
Considerable attention has been paid to measuring operational risk over the past few decades. In this segment, we obtain few risk measures such as mean excess over the threshold (MEOT) and tail value at risk (TVAR) for the DOW-G family. The MEOT of the RV X following DOW-G family is given as
| (12) |
Based on an arbitrary choice of , we can get , then the DOW-G family belongs to a class of discrete distributions having new worse than used in expectation, for more detail refer Marshall and Proschan [45]. Hence, the DOW-G family plays an vital role in the reliability theory. Regarding the TVAR, for any quantile , the TVAR of DOW-G family can be derived as
| (13) |
3.4. Rényi and Shannon entropies
The amount of uncertainty associated with a RV X is referred to as Entropy. It is widely applicable in many areas such as information theory, econometrics, survival analysis, computer science and quantum information, for more details, see Rényi [57]. The Rényi and Shannon entropies of the RV X can be formulated as
| (14) |
and
| (15) |
respectively, where and . It is notable that the Shannon entropy can be derived as a particular case of the Rényi entropy if , i.e. .
3.5. Order statistics and L-moment statistics
Order statistics (ORST) play an important role in various fields of statistical theory and practice. Suppose , be a random sample (RS) from the DOW-G , and let , …, be their corresponding ORST. Then, the CDF of the ith ORST for an integer value of x is proposed as
| (16) |
where . The PMF of the ith ORST can be formulated as
| (17) |
The uth moment of is given by
| (18) |
L-moments (L-MS) are linear combinations of ORSTs and are used to study the shape of a probability distribution. Hosking and Wallis [32] has proposed L-MS to summaries theoretical distribution and observed samples. Let be ith largest observations based on a sample of size n, then the L-MS is
| (19) |
Using Equation (19), we get , , [ ] and , and consequently, we can define some descriptive statistics like L-M of mean, L-M coefficient of variation, L-M coefficient of skewness and L-M coefficient of kurtosis as , , and , respectively.
4. Special models
In this segment, we study two particular distributions of the DOW-G family to demonstrate its viability. Some other main objectives of providing these models are: to evaluate the aforementioned characteristics for particular models of the presented family; to investigate the behaviour of various estimation methods for estimation of unknown parameters (in Section 6); to express the practicality of the developed family in real data analysis through two special models (in Section 7).
4.1. The DOW-Geometric (DOWGeo) distribution
Consider the CDF of the Geo distribution. Then, the PMF of the DOWGeo distribution can be formulated as
| (20) |
where and . Figures 1 and 2 show the PMF ( . . . ) and HRF ( . . . ) of the DOWGeo model, respectively. The PMF can be either unimodal or bimodal and can be used to analyze both a positively skewed and a negatively skewed data set. Furthermore, the HRF can be either increasing, constant, increasing-constant-, J-, and bathtub-shaped. Therefore, the parameters of the DOWGeo model can be fixed to fit most data sets.
Figure 1.
The PMF of the DOWGeo distribution.
Figure 2.
The HRF of the DWGeo distribution.
Regarding the rth moment, it is difficult to express it in an explicit form, and consequently, Maple software is utilized to interpret some of the descriptive statistics of the DOWGeo model. Table 1 lists some descriptive statistics using the DOWGeo model for different values of the parameters and θ.
Table 1.
Some useful descriptive statistics of the DOWGeo distribution.
| Parameter | Measure | ||||||
|---|---|---|---|---|---|---|---|
| β | p | θ | Mean | Variance | Skewness | Kurtosis | DsI |
| 0.5 | 0.1 | 0.00100 | 0.00099 | 31.5753 | 998.0020 | 0.99900 | |
| 0.3 | 0.03034 | 0.03074 | 5.84226 | 37.72465 | 1.01335 | ||
| 0.1 | 0.5 | 0.12093 | 0.15324 | 3.67484 | 18.27296 | 1.26718 | |
| 0.7 | 0.38322 | 0.76523 | 2.92847 | 13.09894 | 1.99681 | ||
| 0.9 | 1.87829 | 10.4785 | 2.53384 | 10.70328 | 5.57875 | ||
| 0.1 | 0.12601 | 0.11215 | 2.33436 | 6.76028 | 0.89003 | ||
| 0.3 | 0.47328 | 0.53516 | 1.44833 | 4.38079 | 1.13074 | ||
| 0.5 | 0.5 | 1.05464 | 1.80762 | 1.22931 | 3.80505 | 1.71396 | |
| 0.7 | 2.38896 | 7.29211 | 1.11821 | 3.54407 | 3.05241 | ||
| 0.9 | 9.05357 | 86.8501 | 1.05653 | 3.41683 | 9.59291 | ||
| 0.1 | 1.11533 | 0.71572 | 0.134371 | 2.07950 | 9.59291 | ||
| 0.3 | 2.54887 | 2.63579 | 2.09458 | 1.03410 | |||
| 0.9 | 0.5 | 4.77186 | 8.02525 | 2.11899 | 1.68178 | ||
| 0.7 | 9.72264 | 30.52447 | 2.13726 | 3.13952 | |||
| 0.9 | 34.0694 | 351.4762 | 3.53164 | 10.3164 | |||
| 5.5 | 0.1 | 1.00000 | |||||
| 0.3 | 1.00000 | ||||||
| 0.5 | 0.1 | 1.00000 | |||||
| 0.7 | 1.00000 | ||||||
| 0.9 | 1.00000 | ||||||
| 0.1 | 0.100000 | 0.09000 | 2.66666 | 8.11111 | 0.90000 | ||
| 0.3 | 0.30000 | 0.21000 | 0.87287 | 1.76190 | 0.70000 | ||
| 0.5 | 0.5 | 0.50000 | 0.25000 | 0.00000 | 1.00000 | 0.50000 | |
| 0.7 | 0.70000 | 0.21000 | 1.76190 | 0.30000 | |||
| 0.9 | 0.90000 | 0.09000 | 8.11111 | 0.10000 | |||
| 0.1 | 5.00111 | 0.90089 | 3.32144 | 0.18013 | |||
| 0.3 | 5.50660 | 1.01886 | 3.37346 | 0.18502 | |||
| 0.5 | 0.9 | 5.96232 | 1.12473 | 3.42405 | 0.18863 | ||
| 0.7 | 6.54227 | 1.26243 | 3.474461 | 0.19296 | |||
| 0.9 | 7.69496 | 1.53654 | 3.58922 | 0.19968 | |||
From Table 1, it can be easily observed that the DOWGeo model is appropriate of modelling over- and (under)-dispersed data; for some selected values of the model parameters DsI , so it can be used as an alternative model to Poisson distribution; it is suitable for modelling negative and positive skewed as well as symmetric data sets; and DOWGeo model can be used to analyze either leptokurtic (kurtosis ) or platykurtic (kurtosis ) data sets. Tables 2–4 reports some numerical computations of MEOT, TVAR and Rényi entropy at x = 5 based on DOWGeo parameters when and q = 0.5.
Table 2.
The MEOT, TVAR and entropy at , and .
| p | |||||||
|---|---|---|---|---|---|---|---|
| Measure ↓ Parameter | 0.05 | 0.1 | 0.3 | 0.5 | 0.7 | 0.9 | 0.95 |
| MEOT | 1.23741 | 1.36315 | 1.81559 | 2.37393 | 3.26988 | 5.54413 | 7.23255 |
| TVAR | 11.82741 | 9.89443 | 7.30231 | 7.37393 | 8.26988 | 10.94776 | 12.90343 |
| Rényi entropy | 1.69774 | 1.76243 | 1.91188 | 2.02716 | 2.15060 | 2.33402 | 2.41787 |
Table 3.
The MEOT, TVAR and entropy at , p = 0.5 and .
| θ | ||||||
|---|---|---|---|---|---|---|
| Measure ↓ Parameter | 0.1 | 0.3 | 0.5 | 0.7 | 0.9 | 0.95 |
| MEOT | 0.99974 | 0.99998 | 1.00000 | 1.00001 | 3.56188 | 9.89159 |
| TVAR | 8.56188 | 15.97817 | ||||
| Rényi entropy | 0.16816 | 0.74610 | 1.30292 | 2.48811 | 3.20516 | |
Table 4.
The MEOT, TVAR and entropy at , p = 0.5 and β grows.
| β | ||||||
|---|---|---|---|---|---|---|
| Measure ↓ Parameter | 0.1 | 0.5 | 1.5 | 2.5 | 3.0 | 5.5 |
| MEOT | 8.81406 | 1.10315 | 1.00000 | 0.99999 | 0.99998 | 0.77272 |
| TVAR | 15.01324 | 258.4995 | ||||
| Rényi entropy | 1.34251 | 1.13804 | 0.74610 | 0.69318 | 0.69314 | 0.69313 |
From Tables 2–4, it is clearly seen that: the MEOT and Rényi entropy increase whereas the TVAR can take bathtub-shaped for some fixed values of β and θ with ; the MEOT and Rényi entropy increase whereas the TVAR have inverse-J-shaped for some fixed values of β and p with ; and the MEOT and Rényi entropy decrease whereas the TVAR increases for some fixed values of θ and p with β grows.
4.2. The DOW-Inverse Weibull DOWIW distribution
Consider the CDF of the inverse Weibull (IW) model. Then, the PMF of the DOWIW distribution can be defined as
| (21) |
where , and . Figures 3 and 4 show the PMF ( . . . ) and HRF ( . . . ) of the DOWIW distribution, respectively. The shape of the PMF is only unimodal, whereas the HRF can be increasing, increasing-constant-, or decreasing.
Figure 3.
The PMF of the DOWIW distribution.
Figure 4.
The HRF of the DOWIW distribution.
Like the DOWGeo model, the DOWIW distribution can be applied to model skewed data set. Moreover, it is appropriate for modelling over-, equi- or under-dispersed data sets.
5. Estimation
5.1. Maximum likelihood estimation
In this segment, unknown parameters of the DOW-G family are estimated by maximum likelihood estimation (MLE). Let be a RS from the proposed family. Then, the likelihood function (l) can be expressed as
| (22) |
Furthermore, the log-likelihood function (L) can be obtained as
| (23) |
where . The maximum likelihood estimators of the unknown parameters p, β and can be achieved by solving the non-linear normal equations obtained by differentiating (23) with respect to the family parameters. The components of the score vector, are
| (24) |
| (25) |
and
| (26) |
where . By putting Equations (24)–(26) equals to zero and solving them, immediately produces the MLEs of unknown parameters of the DOW-G family. Since the analytical solution to the above normal equations is difficult to obtain, therefore, we can solve these equations through an iterative method such as Newton-Raphson.
5.2. Bayesian estimation
When the experimenter has some prior knowledge (in the form of prior distribution) regarding the unknown parameters involved in the model, Bayesian estimation becomes crucial. In such a context, we discuss the estimation of unknown parameters of the DOW-G family under the Bayesian framework. Here, we assume the independent prior densities as , , and , where and are the prior density and set of associated hyperparameters (i.e. parameters involved in the prior distribution), respectively, for the member of the vector parameter . Then, the joint prior distribution of is
| (27) |
where, , and the domain of hyperparameters in depends on the arbitrary choice of prior density . If we choose and such that Equation (27) has minimal or no effect on posterior distribution, we can perform Bayesian study under non-informative priors.
In view of the likelihood function (22), and the joint prior distribution (27), the unnormalized joint posterior density of given can be written as
| (28) |
Loss functions (LFs) have an important role in statistical decision inference and Bayesian theory. Therefore, in order to develop the Bayes estimators (BEs) of unknown parameters (or any of their functions), we ought to consider the question of which type of LF will be used. Here, we use a popular LF, named as a squared error loss function (SELF). It is widely applicable due to its symmetric nature (i.e. it equally penalized negative and positive error). The SELF has the following form
where is any estimate of the parameter ξ, then the BE of ξ under SELF is , here the expectation is carried out under the posterior distribution of the unknown parameter ξ. Thus, in our case, the BE of (any function of parameters p, β, ) with SELF, can be derived as
| (29) |
Since the analytical solution of the expectation in Equation (29) is difficult to derive due to the non-closure form of the joint posterior density (28), therefore we need numerical approximation methods. Here, we utilize two important Markov Chain Monte Carlo (MCMC) approaches, known as Gibbs sampling [25] and Metropolis–Hastings (MH) algorithm [31,46] to generate the RSs from the posterior density and computing posterior quantities of interest. The work of the Gibbs algorithm is to decompose the joint density (28) into full conditional posterior densities of p, β, , so that these densities can be further used to simulate posterior RSs and to achieve the BEs of the unknown parameters.
To implement the Gibbs algorithm, the unnormalized conditional posterior densities of the parameters p, β, and are as follows
| (30) |
| (31) |
| (32) |
where, is the likelihood function in Equation (22). Since the posterior distributions (30)–(32) are difficult to simulate directly by conventional techniques of generating RSs, therefore, we employ MH algorithm to generate posterior RSs for every element of the parameter vector from their respective densities given in Equations (30)–(32).
The detailed steps of MH with in Gibbs algorithm for obtaining BE of (lth element of ), , are as follows:
-
Step I.
Initialize with some starting guess of as and set i = 1.
-
Step II.
Obtain the proposal point from the proposal distribution and u from Uniform distribution . Here, we utilize , and is suitably chosen.
-
Step III.
If , then , otherwise . Here, the notation represents the acceptance probability and has been defined in Equations (30)–(32).
-
Step IV.
Set i = i + 1.
-
Step V.
Rerun steps 2–4, say M times (a large number of times), and store .
-
Step VI.
Under SELF, the BE of , say is obtained as , where is the burn-in-iterations of the Markov Chain.
6. A Monte Carlo simulation study
This section is devoted for an extensive simulation study to examine the behaviour of the various estimation methods discussed in Section 5. For this purpose, we generate samples of different sizes, i.e. n = 25, 50, and 100 from DOWGeo and DOWIW distributions. To draw the required RSs, we assume the parametric values of DOWGeo distribution as (0.5, 0.7, 0.8) and (0.9, 1.3, 0.5), while for DOWIW distribution, the model parameters are taken to be as (0.6, 0.8, 0.3, 1.2) and (0.9, 1.5, 0.5, 0.7). In classical setup, we compute the MLEs of the unknown parameters with their associated standard errors (Ses). The MLEs and Ses of the unknown parameters of DOWGeo distribution are presented in Table 5, whereas Table 6 holds the MLEs and Ses in case of DOWIW distribution.
Table 5.
MLE and Bayes estimates for unknown parameters of DOWGeo distribution.
| True value → | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Sample size | Parameters ↓ | MLE | Se | BE | PSe | MLE | Se | BE | PSe |
| 25 | p | 0.5287 | 0.3740 | 0.4906 | 0.0683 | 0.8807 | 0.3218 | 0.9183 | 0.0315 |
| β | 0.7236 | 0.3349 | 0.7103 | 0.0624 | 1.2667 | 1.2353 | 1.3290 | 0.1243 | |
| θ | 0.8303 | 0.1584 | 0.7875 | 0.0289 | 0.5235 | 0.5427 | 0.5129 | 0.0373 | |
| 50 | p | 0.5117 | 0.2879 | 0.4946 | 0.0501 | 0.9150 | 0.1628 | 0.9108 | 0.0274 |
| β | 0.7168 | 0.2628 | 0.7112 | 0.0445 | 1.3118 | 0.8405 | 1.3107 | 0.1103 | |
| θ | 0.8122 | 0.1180 | 0.7819 | 0.0261 | 0.5112 | 0.4107 | 0.4956 | 0.0345 | |
| 100 | p | 0.5078 | 0.1894 | 0.5059 | 0.0479 | 0.8966 | 0.1143 | 0.9076 | 0.0204 |
| β | 0.6962 | 0.1576 | 0.7023 | 0.0314 | 1.3092 | 0.7469 | 1.3012 | 0.0649 | |
| θ | 0.7961 | 0.0892 | 0.8003 | 0.0113 | 0.5070 | 0.3319 | 0.4989 | 0.0314 | |
Table 6.
MLE and Bayes estimates for unknown parameters of DOWIW distribution.
| True value → | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| Sample size | Parameters ↓ | MLE | Se | BE | PSe | MLE | Se | BE | PSe |
| 25 | p | 0.5476 | 0.9712 | 0.5873 | 0.0526 | 0.9123 | 0.5718 | 0.9111 | 0.0271 |
| β | 0.8369 | 2.4693 | 0.8263 | 0.1934 | 1.5739 | 4.8323 | 1.4857 | 0.1297 | |
| θ | 0.2889 | 0.6648 | 0.3124 | 0.0454 | 0.5348 | 0.9743 | 0.4874 | 0.0433 | |
| α | 1.2332 | 4.0627 | 1.2261 | 0.1274 | 0.6792 | 2.3019 | 0.6917 | 0.0715 | |
| 50 | p | 0.5679 | 0.8459 | 0.5919 | 0.0413 | 0.8869 | 0.4573 | 0.9049 | 0.0205 |
| β | 0.8194 | 1.8607 | 0.8079 | 0.0992 | 1.5349 | 2.9876 | 1.4943 | 0.1203 | |
| θ | 0.2914 | 0.3369 | 0.2941 | 0.0374 | 0.4819 | 0.7260 | 0.4889 | 0.0406 | |
| α | 1.2204 | 3.2751 | 1.2219 | 0.1227 | 0.6898 | 1.7737 | 0.7096 | 0.0672 | |
| 100 | p | 0.6109 | 0.6257 | 0.6071 | 0.0397 | 0.9079 | 0.2951 | 0.8996 | 0.0183 |
| β | 0.8102 | 1.1781 | 0.7989 | 0.0907 | 1.5104 | 2.0373 | 1.5030 | 0.1126 | |
| θ | 0.2975 | 0.3123 | 0.3039 | 0.0314 | 0.4989 | 0.5813 | 0.5011 | 0.0327 | |
| α | 1.2100 | 2.7216 | 1.2041 | 0.1146 | 0.6997 | 1.3602 | 0.7029 | 0.0612 | |
For Bayesian analysis of DOWGeo distribution, , , and are utilized as the independent informative priors for the parameters p, β, and θ, respectively, while in case of DOWIW distribution, the variability of the unknown parameters p, β, θ, and α is measured by assuming independent informative priors as , , , and , respectively. In these prior densities, the hyperparameters have been selected such that the mean of the prior density is approximately equal to the corresponding predetermined value of the parameter. Under this setting, in order to calculate the Bayes estimates of unknown parameters, we generate 51,000 items for each parameter by utilizing MH within Gibbs sampling. The starting 1000 burn-in iterations for each of the chains have been discarded for removing the effects of initial values of the parameters. Also, every 25th observation is stored to neutralize the auto-correlation of successive draws. The convergence diagnostics of each generated chain is investigated by Geweke's [26] test at a credibility level. This diagnostics is also examined by plotting posterior densities, MCMC runs, and auto-correlation functions. The above plots can be seen in Figures S1, S2, S3, and S4 for DOWGeo(0.5, 0.7, 0.8), DOWGeo (0.9, 1.3, 0.5), DOWIW(0.6, 0.8, 0.3, 1.2), and DOWIW (0.9, 1.5, 0.5, 0.7) distributions, respectively.
After the convergence diagnostics, we have used these generated values to obtain Bayes estimates with their associated posterior standard errors (PSes) under SELF. The Bayes estimates and their PSes for the unknown parameters of DOWGeo distribution are given in Table 5, whereas Table 6 consists the Bayes estimates and PSes for the parameters of DOWIW distribution. All numerical computations are performed using open-source software R.
From this simulation study, we have observed that for each pair of sample size and true parametric value, both estimation methods work satisfactorily, but Bayesian method outperforms the MLE with respect to the estimation errors. Also, it is noticed that the estimation error associated with an estimate for both estimation procedures, tends to decrease as we increase the sample size.
7. Real data illustration
In this segment, we illustrate the applications of the DOWGeo and DOWIW models by using two real data sets. The fitting of the models are compared using some well-known measures, namely, , Akaike information criterion (AkIC), correct Akaike information criterion (CAkIC), Hannan-Quinn information criterion (HQIC), Chi-square ( ) with degree of freedom (DF) and the associated P-value. The competitive distributions are provided in Table 7.
Table 7.
The competitive models.
| Distribution | Abbreviation | Author(s) |
|---|---|---|
| Geometric | Geo | – |
| Generalized geometric | GGeo | Gómez-Déniz [28] |
| Discrete Rayleigh | DR | Roy [60] |
| Discrete inverse Rayleigh | DIR | Hussain and Ahmad [33] |
| Discrete inverse Weibull | DIW | Jazi et al. [36] |
| One parameter discrete Lindley | DLi-I | Gómez-Déniz and Calderín-Ojeda [29] |
| Two parameters discrete Lindley | DLi-II | Hussain et al. [34] |
| Three parameters discrete Lindley | DLi-III | Eliwa et al. [19] |
| One parameter discrete flexible model | DFx-I | Eliwa and El-Morshedy [22] |
| Negative Binomial | NeBi | – |
| Poisson | Poi | Poisson [56] |
| Discrete Bilal | DBL | Eliwa et al. [20] |
| Discrete Pareto | DPa | Krishna and Pundir [41] |
| Discrete Burr | DB | Krishna and Pundir [41] |
| Discrete Burr-Hatke | DBH | El-Morshedy et al. [22]. |
| Discrete log-logistic | DLogL | Para and Jan [53] |
| Discrete Lomax distribution | DLo | Para and Jan [54] |
7.1. Data set I: count cysts of kidneys using steroids
This data set is taken from the study of Chan et al. [15]. Here, we examine the fitting capability of the DOWGeo model with some other competitive distributions like: GGeo, Geo, DR, DIR, DIW, NeBi, Poi, DFx-I, DLi-I, DLi-II, DLi-III, DLogL, DB, DLo, and DPa. The MLEs with their associated standard errors (Se) are tabulated in Table 8, whereas Tables 9 and 10 list the goodness of fit statistics (GOFS). In Tables 9 and 10, ExFr and ObFr represent the expected and observed frequencies, respectively.
Table 8.
The MLEs with their corresponding Se for data set I.
| Parameter → | p | β | θ | |||
|---|---|---|---|---|---|---|
| Model ↓ | MLE | Se | MLE | Se | MLE | Se |
| DOWGeo | 0.181 | 0.152 | 0.441 | 0.119 | 0.813 | 0.147 |
| GGeo | – | – | 0.188 | 0.089 | 0.800 | 0.064 |
| Geo | – | – | – | – | 0.582 | 0.030 |
| DR | – | – | – | – | 0.901 | 0.009 |
| DIR | – | – | – | – | 0.554 | 0.049 |
| DIW | – | – | 1.049 | 0.146 | 0.581 | 0.048 |
| NeBi | 0.812 | 0.045 | 0.322 | 0.074 | – | – |
| Poi | 1.390 | 0.112 | – | – | – | – |
| DFx-I | 0.623 | 0.031 | – | – | – | – |
| DLi-I | 0.436 | 0.026 | – | – | – | – |
| DLi-II | 0.581 | 0.045 | 0.001 | 0.058 | – | – |
| DLi-III | 0.582 | 0.005 | 358.728 | 11863.37 | 0.001 | 20.698 |
| DLogL | 0.780 | 0.136 | 1.208 | 0.159 | – | – |
| DB | 0.278 | 0.045 | 1.053 | 0.167 | – | – |
| DLo | 0.152 | 0.098 | 1.830 | 0.952 | – | – |
| DPa | 0.268 | 0.034 | – | – | – | – |
Table 9.
Various fitted distributions with GOFS for data set I based on one parameter competitive models.
| ExFr | |||||||||
|---|---|---|---|---|---|---|---|---|---|
| X | ObFr | DOWGeo | Geo | DR | DIR | Poi | DFx-I | DLi-I | DPa |
| 0 | 65 | 65.006 | 45.980 | 10.890 | 60.888 | 27.398 | 45.256 | 40.286 | 65.842 |
| 1 | 14 | 14.203 | 26.760 | 26.618 | 33.99 | 38.084 | 29.094 | 29.834 | 18.267 |
| 2 | 10 | 8.581 | 15.575 | 29.448 | 8.123 | 26.468 | 16.508 | 18.357 | 8.164 |
| 3 | 6 | 5.947 | 9.064 | 22.296 | 3.004 | 12.264 | 8.893 | 10.336 | 4.513 |
| 4 | 4 | 4.355 | 5.275 | 12.629 | 1.420 | 4.262 | 4.703 | 5.523 | 2.820 |
| 5 | 2 | 3.262 | 3.070 | 5.539 | 0.779 | 1.185 | 2.489 | 2.851 | 1.909 |
| 6 | 2 | 2.458 | 1.787 | 1.914 | 0.473 | 0.274 | 1.335 | 1.437 | 1.368 |
| 7 | 2 | 1.843 | 1.039 | 0.526 | 0.308 | 0.054 | 0.731 | 0.711 | 1.022 |
| 8 | 1 | 1.364 | 0.605 | 0.116 | 0.212 | 0.009 | 0.409 | 0.347 | 0.789 |
| 9 | 1 | 0.991 | 0.352 | 0.020 | 0.152 | 0.001 | 0.234 | 0.167 | 0.626 |
| 10 | 1 | 0.702 | 0.205 | 0.003 | 0.112 | 0 | 0.137 | 0.079 | 0.506 |
| 11 | 2 | 1.288 | 0.288 | 0.001 | 0.539 | 0.001 | 0.211 | 0.072 | 4.174 |
| Total | |||||||||
| 166.605 | 178.767 | 277.778 | 186.547 | 246.210 | 182.288 | 189.110 | 171.192 | ||
| AkIC | 339.209 | 359.533 | 557.556 | 375.094 | 494.420 | 366.575 | 380.220 | 344.384 | |
| CAkIC | 339.436 | 359.570 | 557.593 | 375.131 | 494.457 | 366.612 | 380.257 | 344.421 | |
| HQIC | 342.495 | 360.629 | 558.651 | 376.189 | 495.515 | 367.671 | 381.316 | 345.479 | |
| 0.596 | 19.109 | 306.515 | 40.456 | 89.277 | 31.702 | 34.635 | 3.430 | ||
| DF | 2 | 4 | 4 | 2 | 3 | 4 | 4 | 4 | |
| P-value | 0.742 | 0.489 | |||||||
Table 10.
Various fitted distributions with GOFS for data set I based on competitive models with two or three parameters.
| ExFr | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| X | ObFr | DOWGeo | GGeo | NeBi | DLi-II | DLi-III | DLogL | DB | DLo | DIW |
| 0 | 65 | 65.006 | 62.738 | 56.520 | 46.026 | 46.008 | 63.192 | 64.743 | 61.615 | 63.910 |
| 1 | 14 | 14.203 | 19.665 | 15.885 | 26.768 | 26.765 | 20.101 | 19.177 | 21.023 | 20.699 |
| 2 | 10 | 8.581 | 9.439 | 9.173 | 15.568 | 15.570 | 8.644 | 8.484 | 9.687 | 8.053 |
| 3 | 6 | 5.947 | 5.436 | 6.203 | 9.054 | 9.058 | 4.656 | 4.632 | 5.275 | 4.234 |
| 4 | 4 | 4.355 | 3.463 | 4.502 | 5.266 | 5.269 | 2.864 | 2.863 | 3.197 | 2.599 |
| 5 | 2 | 3.262 | 2.349 | 3.400 | 3.062 | 3.066 | 1.921 | 1.920 | 2.088 | 1.754 |
| 6 | 2 | 2.458 | 1.663 | 2.635 | 1.781 | 1.783 | 1.368 | 1.365 | 1.441 | 1.261 |
| 7 | 2 | 1.843 | 1.213 | 2.079 | 1.036 | 1.037 | 1.019 | 1.013 | 1.038 | 0.949 |
| 8 | 1 | 1.364 | 0.904 | 1.663 | 0.602 | 0.604 | 0.786 | 0.777 | 0.773 | 0.739 |
| 9 | 1 | 0.991 | 0.685 | 1.344 | 0.350 | 0.351 | 0.623 | 0.613 | 0.592 | 0.592 |
| 10 | 1 | 0.702 | 0.525 | 1.095 | 0.204 | 0.204 | 0.504 | 0.494 | 0.463 | 0.485 |
| 11 | 2 | 1.288 | 1.920 | 5.501 | 0.283 | 0.285 | 4.322 | 3.919 | 2.808 | 4.725 |
| Total | ||||||||||
| 166.605 | 168.556 | 168.544 | 178.767 | 178.767 | 171.717 | 171.139 | 170.481 | 172.935 | ||
| AkIC | 339.209 | 341.113 | 340.088 | 361.534 | 363.533 | 347.43 | 346.278 | 344.961 | 349.869 | |
| CAkIC | 339.436 | 341.225 | 344.489 | 361.646 | 363.759 | 347.55 | 346.391 | 345.073 | 349.982 | |
| HQIC | 342.495 | 343.303 | 343.2788 | 363.724 | 366.819 | 349.62 | 348.469 | 347.152 | 352.060 | |
| 0.596 | 2.444 | 4.287 | 19.091 | 19.096 | 4.033 | 2.587 | 3.238 | 6.445 | ||
| DF | 2 | 3 | 4 | 3 | 2 | 3 | 2 | 3 | 3 | |
| P-value | 0.742 | 0.485 | 0.369 | 0.0003 | 0.258 | 0.274 | 0.356 | 0.092 | ||
From Tables 9 and 10 it is noted that, the DPa, GGeo, NeBi, DLogL, DB, DLo and DIW distributions work quite satisfactory for analyzing data set I besides the DOWGeo distribution under condition (P-value ). Depending on , AkIC, CAkIC, HQIC, and P-value, it is easily visible that the DOWGeo model provides the best fit among all other tested models since it has the smallest value of , AkIC, CAkIC, HQIC, and , whereas it achieve the highest P-value associated to the test. Figure S5 depicts the profile of L for each parameter based on data set I, which indicates that the estimators of the parameters are unimodal. Figures 5 and S6 support the results of Tables 9 and 10, and they conclude that data set I follows the DOWGeo, DPa, GGeo, NeBi, DLogL, DBX-II, DLo and DIW models. But, the DOWGeo model is better than other rival models.
Figure 5.
The observed and expected PMFs for data set I.
Table 11 lists some theoretical and empirical descriptive statistics for data set I.
Table 11.
Some useful descriptive statistics for data set I.
| Type ↓ Measures | Mean | Variance | DsI | Skewness | Kurtosis |
|---|---|---|---|---|---|
| Theoretical | 1.39493 | 5.98717 | 4.29207 | 2.34691 | 9.11467 |
| Empirical | 1.39090 | 6.11184 | 4.39416 | 2.29259 |
From Table 11, it is clear that the theoretical measures are closed to empirical ones. Most of the distribution is at the left and platykurtic. Further, the data considered herein suffering from over-dispersion phenomena.
7.2. Data set II: COVID-19 in South Korea
This data set can be viewed at (https://www.worldometers.info/coronavirus/country/south-korea/) and consists the daily new deaths in South Korea for COVID-19 from 15 February to 12 Dec 2020. Here, we examine the fitting capability of the DOWIW model with some other rival distributions like DIW, DLogL, DB, DIR, DBL, DPa, DBH, and Poi. The MLEs with their associated Se are reported in Table 12, whereas Table 13 lists the GOFS.
Table 12.
The MLEs with their corresponding Se for data set II.
| p | β | θ | α | |||||
|---|---|---|---|---|---|---|---|---|
| Model | MLE | Se | MLE | Se | MLE | Se | MLE | Se |
| DOWIW | 0.997 | 0.0002 | 1.064 | 0.069 | 0.991 | 0.003 | 1.083 | 0.102 |
| DIW | – | – | – | – | 0.271 | 0.025 | 1.411 | 0.083 |
| DLogL | 1.716 | 0.095 | 1.878 | 0.107 | – | – | – | – |
| DB | 0.591 | 0.031 | 2.466 | 0.248 | – | – | – | – |
| DIR | – | – | – | – | 0.229 | 0.023 | – | – |
| DBL | 0.707 | 0.010 | – | – | – | – | – | – |
| DPa | 0.377 | 0.021 | – | – | – | – | – | – |
| DBH | 0.904 | 0.020 | – | – | – | – | – | – |
| Poi | 1.901 | 0.079 | – | – | – | – | – | – |
Table 13.
Various fitted distributions with GOFS for data set II.
| E. Fr | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| No. ECB | O. Fr | DOWIW | DIW | DLogL | DB | DIR | DBL | DPa | DBH | Poi |
| 89 | 89.040 | 82.351 | 80.931 | 92.887 | 69.890 | 63.089 | 149.356 | 166.600 | 45.408 | |
| 79 | 74.528 | 103.702 | 92.775 | 97.788 | 140.613 | 89.133 | 50.500 | 54.598 | 86.336 | |
| 49 | 51.862 | 44.390 | 51.431 | 42.676 | 47.685 | 64.868 | 25.478 | 26.667 | 82.074 | |
| 29 | 34.059 | 22.317 | 27.336 | 21.174 | 19.125 | 39.578 | 15.379 | 15.540 | 52.014 | |
| 19 | 21.595 | 12.926 | 15.559 | 12.172 | 9.333 | 22.307 | 10.312 | 10.017 | 24.725 | |
| 17 | 13.351 | 8.248 | 9.518 | 7.754 | 5.198 | 12.037 | 7.387 | 6.885 | 9.402 | |
| 9 | 8.092 | 5.636 | 6.193 | 5.308 | 3.162 | 6.328 | 5.533 | 4.953 | 2.979 | |
| 6 | 4.825 | 4.052 | 4.247 | 3.831 | 2.098 | 3.273 | 4.408 | 3.682 | 0.809 | |
| 6 | 2.387 | 3.031 | 3.061 | 2.879 | 1.429 | 1.674 | 3.466 | 2.808 | 0.192 | |
| 1 | 4.261 | 17.347 | 12.949 | 17.531 | 5.467 | 1.713 | 32.181 | 12.25 | 0.061 | |
| Total | ||||||||||
| 564.625 | 586.855 | 577.011 | 587.652 | 606.870 | 575.338 | 620.466 | 621.0976 | |||
| AIC | 1137.250 | 1177.711 | 1158.023 | 1179.304 | 1215.740 | 1152.676 | 1269.061 | 1242.932 | 1244.195 | |
| CAIC | 1137.384 | 1177.751 | 1158.063 | 1179.344 | 1215.754 | 1152.689 | 1269.075 | 1242.945 | 1244.208 | |
| BIC | 1152.118 | 1185.145 | 1165.457 | 1186.738 | 1219.457 | 1156.393 | 1272.778 | 1246.649 | 1247.912 | |
| HQIC | 1143.198 | 1180.684 | 1160.997 | 1182.278 | 1217.227 | 1154.163 | 1270.548 | 1244.419 | 1245.682 | |
| 2.792 | 41.868 | 25.019 | 44.784 | 92.204 | 28.203 | 128.631 | 109.333 | 115.896 | ||
| D.F | 3 | 6 | 6 | 6 | 6 | 6 | 7 | 6 | 4 | |
| P-value | 0.425 | |||||||||
Based on the fitting measures in Table 13, we can announce that the DOWIW model is the best fitted model. Figure S7 shows the profiles of L for each parameter based on data set II, and it shows that every estimator has a unimodal function. Figures 6 and S8 support the results of Table 13, and we can say that the data set II follows the DOWIW model.
Figure 6.
The observed and expected PMFs for data set II.
Some theoretical and empirical descriptive statistics based on data set II can be viewed in Table 14.
Table 14.
Some useful descriptive statistics for data set II.
| Type ↓ Measures | Mean | Variance | DsI | Skewness | Kurtosis |
|---|---|---|---|---|---|
| Theoretical | 1.9013 | 4.1222 | 2.1681 | 1.2638 | 4.0864 |
| Empirical | 1.8567 | 3.9978 | 2.1531 | 1.3657 |
From Table 14, the empirical mean, variance, DsI and skewness are closed to theoretical ones. Data set II suffering from over-dispersion phenomena, and most of the distribution is at the left with platykurtic.
7.3. Bayesian estimation for real data sets I and II
Here, we compute the BEs with their PSes for the unknown parameters of the DOWGeo and DOWIW distributions using a similar manner discussed in Section 5. Since there is no prior information available regarding the unknown parameters of the special distributions, therefore, we have used non-informative priors for the unknown parameters of the models under study. The calculated estimates with the related estimation errors of DOWGeo and DOWIW distributions are jointly given in Table 15.
Table 15.
Bayes estimates for DOWGeo and DOWIW distributions under real data sets I and II.
| DOWGeo distribution under data set I | DOWIW distribution under data set II | |||
|---|---|---|---|---|
| Parameter ↓ Model | BE | PSe | BE | PSe |
| p | 0.1804 | 0.0105 | 0.9970 | 0.0002 |
| β | 0.4390 | 0.0285 | 1.1105 | 0.0427 |
| θ | 0.8141 | 0.0235 | 0.9869 | 0.0024 |
| α | – | – | 1.0135 | 0.0728 |
From the comparison of Bayes estimates (in Table 15) and MLEs (in Tables 8 and 12), we can easily observe that the result obtained in case of simulation study do hold in case of real-life data sets as well.
8. Concluding remarks
In the present research article, we have introduced a new family of discrete distributions called DOW-G family. Its several important statistical characteristics have been investigated. Two particular distribution of the proposed family are studied in detail. These special models provides good flexibility in terms of shapes for the PDFs. Further, we have noticed that the proposed family can model a positively skewed, a negatively skewed or a symmetric shaped data set. In addition, the DOW-G family can be used quite effectively for modelling a wide variety of failure data because its HRF can take diverse shapes (increasing, decreasing, constant, J-, and bathtub-shaped). Moreover, it is suitable for modelling under-, equi- and over-dispersed data set. In classical and non-classical setups, the method of maximum likelihood and Bayesian approach have been utilized to estimate the unknown parameters of particular models.
An extensive Monte Carlo simulation analysis has been conducted to evaluate the behaviour of above stated estimation methods. The results of simulation studies declare that these two estimation procedures perform quite satisfactorily in estimating unknown parameters of the model, but the Bayesian method dominates the method of maximum likelihood in terms of estimation errors. The flexibility of the DOW-G family has also been exemplified by using two distinctive real data sets. Hence, it is reasonable to say that the special distributions of the developed family can serve as an alternative model to the existing discrete models for modelling count or failure data in various fields including reliability, insurance, medicine, economics, and demography, etc.
It is worth noting that the above research can be expanded into several dimensions, for example- with different types of censoring schemes, we can study the particulars models of the proposed family (see Tyagi et al. [65]); In reliability inference, we can analyze stress-strength reliability or load share systems under special distributions of the DOW-G family. Also, various characteristics of order statistics from the proposed family could be explored and to infer the bi-variate data, a bi-variate discrete family may also be studied.
Supplementary Material
Acknowledgments
The authors thankfully acknowledge the critical suggestions and comments from the associate Editor and referees which greatly helped us in the improvement of the paper.
Disclosure statement
No potential conflict of interest was reported by the author(s).
References
- 1.Abd EL-Baset A.A. and Ghazal M.G.M., Exponentiated additive Weibull distribution, Reliab. Eng. Syst. Saf. 193 (2020), p. 106663. [Google Scholar]
- 2.Afify A.Z. and Alizadeh M., The odd dagum family of distributions: Properties and applications, J. Appl. Probab. Stat. 15 (2020), pp. 45–72. [Google Scholar]
- 3.Alamatsaz M.H., Dey S., Dey T., and Harandi S.S., Discrete generalized Rayleigh distribution, Pakistan J. Statist. 32 (2016), pp. 1–20. [Google Scholar]
- 4.Al-Babtain A., Fattah A.A., Ahmed A.H.N., and Merovci F., The Kumaraswamy-transmuted exponentiated modified Weibull distribution, Commun. Stat. Simul. Comput. 46 (2017), pp. 3812–3832. [Google Scholar]
- 5.Alizadeh M., Afify A.Z., Eliwa M.S., and Ali S., The odd log-logistic Lindley-G family of distributions: Properties, Bayesian and non-Bayesian estimation with applications, Comput. Stat. 35 (2020), pp. 281–308. [Google Scholar]
- 6.Alizadeh M., Emadi M., Doostparast M., Cordeiro G.M., Ortega E.M., and Pescim R.R., A new family of distributions: The Kumaraswamy odd log-logistic, properties and applications, Hacettepe J. Math. Statist. 44 (2015), pp. 1491–1512. [Google Scholar]
- 7.Almalki S.J. and Yuan J, A new modified Weibull distribution, Reliab. Eng. Syst. Saf. 111 (2013), pp. 164–170. [Google Scholar]
- 8.Alzaatreh A., Lee C., and Famoye F, A new method for generating families of continuous distributions, Metron 71 (2013), pp. 63–79. [Google Scholar]
- 9.Bakouch H.S., Chesneau C., and Khan M.N., The extended odd family of probability distributions with practice to a submodel, Filomat 33 (2019), pp. 3855–3867. [Google Scholar]
- 10.Bakouch H.S., Aghababaei M., and Nadarajah S., A new discrete distribution, Statistics 48 (2014), pp. 200–240. [Google Scholar]
- 11.Bebbington M., Lai C.D., Wellington M., and Zitikis R, The discrete additive Weibull distribution: A bathtub-shaped hazard for discontinuous failure data, Reliab. Eng. Syst. Saf. 106 (2012), pp. 37–44. [Google Scholar]
- 12.Bourguignon M., Silva R.B., and Cordeiro G.M., The Weibull-G family of probability distributions, J. Data. Sci. 12 (2014), pp. 53–68. [Google Scholar]
- 13.Chakraborty S., Generating discrete analogues of continuous probability distributions – a survey of methods and constructions, J. Stat. Distrib. Appl. 2 (2015), pp. 1–30. [Google Scholar]
- 14.Chandrakant K., Yogesh M.T., and Manoj K.R., On a discrete analogue of linear failure rate distribution, Am. J. Math. Manage. Sci. 36 (2017), pp. 229–246. [Google Scholar]
- 15.Chan S.K., Riley P.R., Price K.L., McElduff F., Winyard P.J., Welham S.J., Woolf A.S., and Long D.A., Corticosteroid-induced kidney dysmorphogenesis is associated with deregulated expression of known cystogenic molecules, as well as Indian hedgehog, Am. J. Physiology-Renal Physiol. 298 (2010), pp. F346–F356. [DOI] [PubMed] [Google Scholar]
- 16.Cordeiro G.M., Altun E., Korkmaz M.C., Pescim R.R., Afify A.Z., and Yousof H.M., The xgamma family: Censored regression modelling and applications, REVSTAT–Statist. J. 18 (2020), pp. 593–612. [Google Scholar]
- 17.Cooray K., Generalization of the Weibull distribution: The odd Weibull family, Stat. Model. 6 (2006), pp. 265–277. [Google Scholar]
- 18.Eliwa M.S., Alhussain Z.A., and El-Morshedy M., Discrete Gompertz-G family of distributions for over-and under-dispersed data with properties, estimation, and applications, Mathematics 8 (2020a), p. 358. [Google Scholar]
- 19.Eliwa M.S., Altun E., El-Dawoody M., and El-Morshedy M., A new three-parameter discrete distribution with associated INAR(1) process and applications, IEEE Access 8 (2020b), pp. 91150–91162. doi: 10.1109/ACCESS.2020.2993593. [DOI] [Google Scholar]
- 20.Eliwa M.S., El-Morshedy M., and Altun E., A study on discrete Bilal distribution with properties and applications on integer-valued autoregressive process, Revstat – Statist. J. (2020c), pp. 1–30. [Google Scholar]
- 21.Eliwa M.S. and El-Morshedy M., A one-parameter discrete distribution for over-dispersed data: Statistical and reliability properties with applications, J. Appl. Statist. (2021), pp. 1–21. doi: 10.1080/02664763.2021.1905787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.El-Morshedy M., Eliwa M.S., and Altun E., Discrete Burr-Hatke distribution with properties, estimation methods and regression model, IEEE Access 8 (2020a), pp. 74359–74370. [Google Scholar]
- 23.El-Morshedy M., Eliwa M.S., and Nagy H., A new two-parameter exponentiated discrete Lindley distribution: Properties, estimation and applications, J. Appl. Stat. 47 (2020b), pp. 354–375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.El-Morshedy M. and Eliwa M. S., The odd flexible Weibull-H family of distributions: Properties and estimation with applications to complete and upper record data, Filomat 33 (2019), pp. 2635–2652. [Google Scholar]
- 25.Geman S and Geman D, Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images, IEEE. Trans. Pattern. Anal. Mach. Intell. 6 (1984), pp. 721–741. [DOI] [PubMed] [Google Scholar]
- 26.Geweke J., Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments Minneapolis, MN, Federal Reserve Bank of Minneapolis, Research Department, 196, 1991.
- 27.Ghitany M.E. and Al-Mutairi D.K., Estimation methods for the discrete Poisson Lindley distribution, J. Stat. Comput. Simul. 79 (2009), pp. 1–9. [Google Scholar]
- 28.Gómez-Déniz E., Another generalization of the geometric distribution, Test 19 (2010), pp. 399–415. [Google Scholar]
- 29.Gómez-Déniz E. and Calderín-Ojeda E., The discrete Lindley distribution: Properties and applications, J. Stat. Comput. Simul. 81 (2011), pp. 1405–1416. [Google Scholar]
- 30.Hamedani G.G., Altun E., Korkmaz M.Ç., Yousof H.M., and Butt N.S., A new extended G family of continuous distributions with mathematical propertiesm, characterizations and regression modelling, Pakistan J. Statist. Oper. Res. 14 (2018), pp. 737–758. [Google Scholar]
- 31.Hastings W.K, Monte Carlo sampling methods using Markov chains and their applications, Biometrika 57 (1970), pp. 97–109. [Google Scholar]
- 32.Hosking J.R. and Wallis J.R., Regional Frequency Analysis: An Approach Based on L-Moments, Cambridge University Press, Cambridge, 1997. [Google Scholar]
- 33.Hussain T. and Ahmad M, Discrete inverse Rayleigh distribution, Pakistan J. Statist. 30 (2014), pp. 203–222. [Google Scholar]
- 34.Hussain T., Aslam M., and Ahmad M., A two-parameter discrete Lindley distribution, Revista Colombiana De Estadistica 39 (2016), pp. 45–61. [Google Scholar]
- 35.Inusah S. and Kozubowski T.J., A discrete analogue of the Laplce distribution, J. Statist. Plann. Inference 136 (2006), pp. 1090–1102. [Google Scholar]
- 36.Jazi M.A., Lai C.D., and Alamatsaz M.H., A discrete inverse Weibull distribution and estimation of its parameters, Stat. Methodol. 7 (2010), pp. 121–132. [Google Scholar]
- 37.Korkmaz M. Ç., A new family of the continuous distributions: The extended Weibull-G family, Commun. Faculty Sci. Univ. Ankara Ser. A1 Math. Statist. 68 (2018), pp. 248–270. [Google Scholar]
- 38.Korkmaz M.Ç, Altun E., Yousof H.M., and Hamedani G.G., The odd power Lindley generator of probability distributions: Properties, characterizations and regression modelling, Int. J. Stat. Probab. 8 (2019), pp. 70–89. [Google Scholar]
- 39.Korkmaz M.Ç., Altun E., Yousof H.M., and Hamedani G.G., The Hjorth's IDB generator of distributions: Properties, characterizations, regression modelling and applications, J. Statist. Theory Appl. 19 (2020), pp. 59–74. [Google Scholar]
- 40.Korkmaz M.Ç. and Genç A.İ., A new generalized two-sided class of distributions with an emphasis on two-sided generalized normal distribution, Commun. Statist.-Simul. Comput. 46 (2017), pp. 1441–1460. [Google Scholar]
- 41.Krishna H. and Pundir P.S., Discrete Burr and discrete Pareto distributions, Stat. Methodol. 6 (2009), pp. 177–188. [Google Scholar]
- 42.Kus C., Akdogan Y., Asgharzadeh A., Kinaci I., and Karakaya K., Binomial-discrete Lindley distribution, Commun. Faculty Sci. Univ. Ankara Ser. A1-Math. Statist. 68 (2018), pp. 401–411. [Google Scholar]
- 43.Lai C.D., Xie M., and Murthy D.N.P., A modified Weibull distribution, IEEE Trans. Reliab. 52 (2003), pp. 33–37. [Google Scholar]
- 44.Lee C., Famoye F., and Olumolade O., Beta-Weibull distribution: Some properties and applications to censored data, J. Modern Appl. Statist. Methods 6 (2007), p. 17. [Google Scholar]
- 45.Marshall A.W. and Proschan F., Classes of distributions applicable in replacement with renewal theory implications, Sixth Berkeley Symposium I, 1970, pp. 395–416.
- 46.Metropolis N. and Ulam S., The Monte Carlo method, J. Amer. Statist. Assoc. 44 (1949), pp. 335–341. [DOI] [PubMed] [Google Scholar]
- 47.Nascimento A.D., Silva K.F., Cordeiro G.M., Alizadeh M., Yousof H.M., and Hamedani G.G., The odd Nadarajah-Haghighi family of distributions: Properties and applications, Studia Scientiarum Mathematicarum Hungarica 56 (2019), pp. 185–210. [Google Scholar]
- 48.Nekoukhou V., Alamatsaz M.H., and Bidram H., Discrete generalized exponential distribution of a second type, Statistics 47 (2013), pp. 876–887. [Google Scholar]
- 49.Nekoukhou V. and Bidram H., The exponentiated discrete Weibull distribution, Statist. Oper. Res. Trans. 39 (2015a), pp. 127–146. [Google Scholar]
- 50.Nekoukhou V. and Bidram H., A new four-parameter discrete distribution with bathtub and unimodal failure rate, J. Appl. Statist. 42 (2015b), pp. 2654–2670. [Google Scholar]
- 51.Nofal Z.M., Afify A.Z., Yousof H.M., Granzotto D.C., and Louzada F., Kumaraswamy transmuted exponentiated additive Weibull distribution, Int. J. Statist. Probab. 5 (2016), pp. 78–99. [Google Scholar]
- 52.Pal M., Ali M.M., and Woo J., Exponentiated weibull distribution, Statistica 66 (2006), pp. 139–147. [Google Scholar]
- 53.Para B.A. and Jan T.R., Discrete version of log-logistic distribution and its applications in genetics, Int. J. Modern Math. Sci. 14 (2016a), pp. 407–422. [Google Scholar]
- 54.Para B.A. and Jan T.R., On discrete three parameter burr type XII and discrete Lomax distributions and their applications to model count data from medical science, Biom. Biostatist. Int. J. 4 (2016b), pp. 1–15. [Google Scholar]
- 55.Pinho L.G.B., Cordeiro G.M., and Nobre J.S., The gamma-exponentiated Weibull distribution, J. Statist. Theory Appl. 11 (2012), pp. 379–395. [Google Scholar]
- 56.Poisson S.D., Probabilité des jugements en matiére criminelle et en matiére civile, précédées des régles génerales du calcul des probabilitiés, Paris, Bachelier, 1837, pp. 206–207.
- 57.Rényi A, On measures of entropy and information, Math. Statist. Probab. 1 (1961), pp. 547–561. [Google Scholar]
- 58.Reyad H., Korkmaz M.Ç., Afify A.Z., Hamedani G.G., and Othman S., The Fréchet topp Leone-G family of distributions: Properties, characterizations and applications, Ann. Data Sci. 8 (2019), pp. 345–366. [Google Scholar]
- 59.Roy D., The discrete normal distribution, Commun. Statist. Theory Methods 32 (2003), pp. 1871–1883. [Google Scholar]
- 60.Roy D., Discrete Rayleigh distribution, IEEE Trans. Reliab. 53 (2004), pp. 255–260. [Google Scholar]
- 61.Sarhan A.M. and Apaloo J, Exponentiated modified Weibull extension distribution, Reliab. Eng. System Saf. 112 (2013), pp. 137–144. [Google Scholar]
- 62.Silva G.O., Ortega E.M., and Cordeiro G.M., The beta modified Weibull distribution, Lifetime. Data. Anal. 16 (2010), pp. 409–430. [DOI] [PubMed] [Google Scholar]
- 63.Tahir M.H., Cordeiro G.M., Alizadeh M., Mansoor M., Zubair M., and Hamedani G.G., The odd generalized exponential family of distributions with applications, J. Statist. Distrib. Appl. 2 (2015), p. 1. [Google Scholar]
- 64.Tyagi A., Choudhary N., and Singh B., Discrete additive Perks–Weibull distribution: Properties and applications, Life Cycle Reliab. Saf. Eng. 8 (2019), pp. 183–199. [Google Scholar]
- 65.Tyagi A., Choudhary N., and Singh B., A new discrete distribution: Theory and applications to discrete failure lifetime and count data, J. Appl. Probab. Statist. 15 (2020), pp. 117–143. [Google Scholar]
- 66.Tyagi A., Choudhary N., and Singh B., Inferences on discrete Rayleigh distribution under type-II censored data using a tie-run approach, Int. J. Agricult. Stat. Sci. 16 (2020), pp. 939–951. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






