Skip to main content
Journal of Applied Statistics logoLink to Journal of Applied Statistics
. 2021 May 24;49(11):2928–2952. doi: 10.1080/02664763.2021.1928018

A discrete analogue of odd Weibull-G family of distributions: properties, classical and Bayesian estimation with applications to count data

M El-Morshedy a,b,CONTACT, M S Eliwa b, Abhishek Tyagi c
PMCID: PMC9336501  PMID: 35909662

Abstract

In the statistical literature, several discrete distributions have been developed so far. However, in this progressive technological era, the data generated from different fields is getting complicated day by day, making it difficult to analyze this real data through the various discrete distributions available in the existing literature. In this context, we have proposed a new flexible family of discrete models named discrete odd Weibull-G (DOW-G) family. Its several impressive distributional characteristics are derived. A key feature of the proposed family is its failure rate function that can take a variety of shapes for distinct values of the unknown parameters, like decreasing, increasing, constant, J-, and bathtub-shaped. Furthermore, the presented family not only adequately captures the skewed and symmetric data sets, but it can also provide a better fit to equi-, over-, under-dispersed data. After producing the general class, two particular distributions of the DOW-G family are extensively studied. The parameters estimation of the proposed family, are explored by the method of maximum likelihood and Bayesian approach. A compact Monte Carlo simulation study is performed to assess the behavior of the estimation methods. Finally, we have explained the usefulness of the proposed family by using two different real data sets.

Keywords: Discrete distributions, dispersion index, Bayesian method, maximum likelihood method, L-moment statistics, odd Weibull-G family, simulation

2010 Mathematics Subject Classifications: 62E15, 62F10, 62F15

1. Introduction

The Weibull (W) model is a well-known continuous distribution and has been used extensively over the last several decades for modelling data in many areas, especially in engineering, reliability, and biological fields. It is commonly applied for modellig monotone hazard rates. The cumulative distribution function (CDF) of W distribution can be presented as

F(x;λ,β)=1eλxβ;x0,(λ,β)>0, (1)

where λ and β are the scale and shape parameters, respectively. Because of the flexibility of the W distribution, many statisticians proposed several modifications by introducing one or more parameters to the baseline model in Equation (1). For instance, Lai et al. [43], Pal et al. [52], Lee et al. [44], Silva et al. [62], Pinho et al. [55], Almalki and Yuan [7], Sarhan and Apaloo [61], Nofal et al. [51], Al-Babtain et al. [4], Abd EL-Baset and Ghazal [1], and references cited therein.

Alzaatreh et al. [8] developed a new approach in which they used a general form to produce a new family, called the transformed-transformer family. Bourguignon et al. [12] utilized this technique to introduce a new flexible family of continuous distributions and it is known as odd Weibull-G (OW-G) family. The CDF of OW-G family is given by

F(x;λ,β;Φ)=1eλ(G(x;Φ)1G(x;Φ))β;xϰR, (2)

where Φ is a vector of model parameters ( 1×j;j=1,2,3,,k) and G(x;Φ) is the CDF of the baseline distribution. For more details about the generator G(x;Φ)/(1G(x;Φ)), see Cooray [17]. Due to the flexibility of this generator, several authors used it for producing flexible families which can be utilized for modelling different types of data. For instance, Tahir et al. [63], Korkmaz and Genç [40], Korkmaz [37], Hamedani et al. [30], Korkmaz et al. [38], El-Morshedy and Eliwa [24], Korkmaz et al. [38], Reyad et al. [58], Bakouch et al. [9], Nascimento et al. [47], Alizadeh et al. [5,6], Korkmaz et al. [39], Afify and Alizadeh [2], Cordeiro et al. [16], and references cited therein.

The concept of discretization generally emerges when it becomes impossible or inconvenient to record the lifetime of a product/device on a continuous scale. These circumstances may appear when the life length needs to be recorded on a discrete scale rather than on a continuous analogue. For example, in survival analysis, the survival times for those having the diseases like lung cancer, or the period from remission to relapse may be recorded as a number of days/weeks; in engineering systems, the number of successful cycle prior to the failure when device work in cycle, the number of times a device is switched on/off. Moreover, in many practical problems, the count phenomenon occurs as, for example, the number of occurrences of earthquakes in a calendar year, the number of absences, the number of accidents, the number of kinds of species in ecology, the number of insurance claims, and so on. Therefore, we can easily infer that it is rational and appropriate to model such situations by a suitable discrete distribution.

Since classical discrete models like Binomial, Poisson, Negative Binomial, and Geometric were not sufficient to analyze different types of discrete data, therefore, Roy [59] proposed a new technique to develop a new discrete distribution by using the survival function of any continuous distribution. This approach is named as survival discretization method (Chakraborty [13]). The one of the important virtue of this methodology is that, the developed discrete model keeps the same form of the survival function as that of its continuous counterpart. Due to this feature many reliability characteristics of the distribution remain unchanged. In recent years this method has been widely used to develop several discrete models. For instance, Roy [60], Inusah and Kozubowski [35], Ghitany and Al-Mutairi [27], Krishna and Pundir [41], Gómez-Déniz [28], Gómez-Déniz and Calderín-Ojeda [29], Bebbington et al. [11], Nekoukhou et al. [48], Bakouch et al. [10], Nekoukhou and Bidram [49,50], Alamatsaz et al. [3], Chandrakant et al. [14], Kus et al. [42], Tyagi et al. [64–66], Eliwa et al. [18,19], El-Morshedy et al. [22,23], Eliwa and El-Morshedy [21], and the references included in the cited articles.

In view of the existing literature, we found that although several discrete models have been proposed over the past few decades, we are still facing a lack of more admirable discrete distributions that adequately capture the diversity of real data sets. Such phenomenon motivates us to provide a flexible family of discrete models for analyzing a broad spectrum of discrete real-world data sets. Therefore, in present article, we propose a family of discrete uni-variate distributions with two additional parameters using survival discretization method. The main objectives of proposing DOW-G family are as follows:

  • To produce discrete models that not only fit a positively skewed, a negatively skewed, or a symmetric shaped data set, they are also capable enough for fitting equi-, under-, and over-dispersed real data.

  • To develop discrete distributions whose failure rate functions can take diverse shapes for different values of parameters (eg. increasing, decreasing, constant, J- and bathtub-shaped).

  • To generate models for modelling probability distribution of count data.

  • To produce consistently superior fits than other developed discrete distributions with the same baseline model and other popular discrete distributions in the existing literature.

The remaining parts of this article are as follows: Section 2 introduces the DOW-G family. Some statistical characteristics of the DOW-G family are derived in Section 3. Two special models of the proposed family are extensively studied in Section 4. The method of maximum likelihood and the Bayesian approach is used to estimate the unknown parameters of the DOW-G family in Section 5. An extensive simulation study is conducted to investigate the behaviour of different estimation methods in Section 6. Section 7 contains two applications to real data sets. Finally, some important remarks about the presented study are discussed in Section 8.

2. Synthesis of the family

The random variable (RV) X is said to follow the DOW-G family if its CDF is of the form

FX(x;p,β,Φ)=1p(G(x+1;Φ)1G(x+1;Φ))β;xN0, (3)

where p=eλ p(0,1), β(0,) and N0={0,1,2,3,}. The reliability function (RF) of the DOW-G family is given by

RX(x;p,β,Φ)=p(G(x+1;Φ)1G(x+1;Φ))β;xN0. (4)

Suppose X1,X2,,Xn be independent and identically distributed (IID) integer valued RVs and Z=min(X1,X2,,Xn), then Z ∼ DOW-G (x;pn,β,Φ) provided Xi( i=1,2,,n) ∼ DOW-G (z;p,β,Φ) family where

RZ(x;p,β,Φ)=i=1nPr[Xix]=pn(G(x+1;Φ)1G(x+1;Φ))β;xN0 (5)

The probability mass function (PMF) and hazard rate function (HRF) corresponding to Equation (3) can be expressed as

fx(x;p,β,Φ)=p(G(x;Φ)1G(x;Φ))βp(G(x+1;Φ)1G(x+1;Φ))β;xN0 (6)

and

h(x;p,β,Φ)=1p(G(x+1;Φ)1G(x+1;Φ))β(G(x;Φ)1G(x;Φ))β;xN0, (7)

respectively.

3. Distributional properties of the DOW-G family

3.1. Quantile function (QF )

Under the DOW-G family, the qth QF, say xq, is the solution of FX(xq;p,β,Φ)q=0;xq>0, then

xq=G1(1[ln(1q)ln(p)]1/β+1)1, (8)

where q(0,1) and G1denotes the baseline QF. By putting q = 0.5, we can obtain the median of the proposed family. To study the effect of shape parameters on skewness and kurtosis, one can use xq. The Bowley skewness and Moors kurtosis based on the quantiles can be obtained as

Bowley Skewness=x34+x142x12x34x14,and Moors kurtosis=x38x18+x78x58x68x28.

3.2. Moments, dispersion index, skewness and kurtosis

Suppose a RV X DOW-G (p,β,Φ), then the rth moment of the RV X can be proposed by

μr=x=1[xr(x1)r]p(G(x;Φ)1G(x;Φ))β;xN0, (9)

where μr=x=0xrfx(x;p,β,Φ). Using the expression of Equation (9), the mean ( μ1) and variance can be, respectively, formulated as

μ1=x=1pG(x;Φ)1G(x;Φ)β and variance=x=12x1pG(x;Φ)1G(x;Φ)β(μ1)2. (10)

The dispersion index (DsI) is defined as the variance / |mean|, it determines that whether a given distribution is suitable for equi-, under- or over- dispersed data sets. The DsI is widely used in ecology as a standard measure for measuring repulsion (under dispersion) or clustering (over dispersion). If DsI <1 (DsI >1) the distribution is under-dispersed (over-dispersed), whereas the distribution is equi-dispersed at DsI =1.

The moment generating function (MGF) of the proposed family can be formulated as

MX(t)=x=0l=0(xt)ll!(p(G(x;Φ)1G(x;Φ))βp(G(x+1;Φ)1G(x+1;Φ))β);xN0. (11)

The first four partial derivatives of MX(t), with respect to t at t = 0, produce the first four raw moments about the origin, i.e. μr=drdtrMX(t)|t=0. Furthermore, by using Equations (9) or (11), the skewness and kurtosis based on moments can be computed as skewness =(μ33μ2μ1+2μ13)/(variance )3/2 and kurtosis =(μ44μ3μ1+6μ2μ123μ14)/(variance )2.

3.3. Mean of excess over threshold and tail value at risk

Considerable attention has been paid to measuring operational risk over the past few decades. In this segment, we obtain few risk measures such as mean excess over the threshold (MEOT) and tail value at risk (TVAR) for the DOW-G family. The MEOT of the RV X following DOW-G family is given as

m(x)=E(Xx|Xx)=p(G(x+1;Φ)1G(x+1;Φ))βj=xp(G(j+1;Φ)1G(j+1;Φ))β;xN0. (12)

Based on an arbitrary choice of G(x;Φ), we can get m(0)m(x), then the DOW-G family belongs to a class of discrete distributions having new worse than used in expectation, for more detail refer Marshall and Proschan [45]. Hence, the DOW-G family plays an vital role in the reliability theory. Regarding the TVAR, for any quantile xq, the TVAR of DOW-G family can be derived as

TVARq(X)=E(X|Xxq)=xq+11FX(xq;p,β,Φ)xxq(xxq)fx(x;p,β,Φ)=xq+11FX(xq;p,β,Φ)(μ1xq+x=0xq1(xqx)fx(x;p,β,Φ))=xq+p(G(xq+1;Φ)1G(xq+1;Φ))β(μ1xq+x=0xq1(xqx)[p(G(x;Φ)1G(x;Φ))βp(G(x+1;Φ)1G(x+1;Φ))β]);xN0. (13)

3.4. Rényi and Shannon entropies

The amount of uncertainty associated with a RV X is referred to as Entropy. It is widely applicable in many areas such as information theory, econometrics, survival analysis, computer science and quantum information, for more details, see Rényi [57]. The Rényi and Shannon entropies of the RV X can be formulated as

Iδ(X)=11δlogx=0(p(G(x;Φ)1G(x;Φ))βp(G(x+1;Φ)1G(x+1;Φ))β)δ;xN0 (14)

and

I(X)=x=0{p(G(x;Φ)1G(x;Φ))βp(G(x+1;Φ)1G(x+1;Φ))β}log{p(G(x;Φ)1G(x;Φ))βp(G(x+1;Φ)1G(x+1;Φ))β};xN0, (15)

respectively, where δ(0,) and δ1. It is notable that the Shannon entropy can be derived as a particular case of the Rényi entropy if δ1, i.e. I(X)=limδ=1Iδ(X).

3.5. Order statistics and L-moment statistics

Order statistics (ORST) play an important role in various fields of statistical theory and practice. Suppose X1,X2,, Xn be a random sample (RS) from the DOW-G (p,β,Φ), and let X1:n,X2:n, …,  Xn:n be their corresponding ORST. Then, the CDF of the ith ORST Xi:n for an integer value of x is proposed as

Fi:n(x;p,β,Φ)=k=in(nk)[Fi(x;p,β,Φ)]k[1Fi(x;p,β,Φ)]nk=k=inj=0nk(1)j(nk)(nkj)[Fi(x;p,β,Φ)]k+j=k=inj=0nkm=0k+jΔ(n,k)(m,j)Fi(x;pm,β,Φ)), (16)

where Δ(n,k)(m,j)=(1)j+m(nk)(nkj)(k+jm). The PMF of the ith ORST can be formulated as

fi:n(x;p,β,Φ)=k=inj=0nkm=0k+jΔ(n,k)(m,j)fi(x;pm,β,Φ). (17)

The uth moment of Zi:n is given by

Ψi:nu=E(Zi:nu)=z=0k=inj=0nkm=0k+jΔ(n,k)(m,j)zufi(x;pm,β,Φ). (18)

L-moments (L-MS) are linear combinations of ORSTs and are used to study the shape of a probability distribution. Hosking and Wallis [32] has proposed L-MS to summaries theoretical distribution and observed samples. Let X(i|n) be ith largest observations based on a sample of size n, then the L-MS is

Λr=1rs=0r1(1)s(r1s)E(Xrs:r). (19)

Using Equation (19), we get Λ1=E(X1:1), Λ2=12E(X2:2+X1:2), Λ3=13[ E(X3:3X2:3)E(X2:3+X1:3)] and Λ4=14{E[(X4:4X3:4)+(X2:4X1:4)]2E(X3:4X2:4)}, and consequently, we can define some descriptive statistics like L-M of mean, L-M coefficient of variation, L-M coefficient of skewness and L-M coefficient of kurtosis as Λ1, Λ2Λ1, Λ3Λ2 and Λ4Λ2, respectively.

4. Special models

In this segment, we study two particular distributions of the DOW-G family to demonstrate its viability. Some other main objectives of providing these models are: to evaluate the aforementioned characteristics for particular models of the presented family; to investigate the behaviour of various estimation methods for estimation of unknown parameters (in Section 6); to express the practicality of the developed family in real data analysis through two special models (in Section 7).

4.1. The DOW-Geometric (DOWGeo) distribution

Consider the CDF of the Geo distribution. Then, the PMF of the DOWGeo distribution can be formulated as

fx(x;p,β,θ)=p(θx1)βp(θ(x+1)1)β;xN0, (20)

where β>0 and 0<p,θ<1. Figures 1 and 2 show the PMF ( a:p=0.9,β=0.5,θ=0.9. b:p=0.9,β=0.5,θ=0.5. c:p=0.5,β=0.5,θ=0.9. d:p=0.9,β=5,θ=0.9) and HRF ( a:p=0.9,β=0.5,θ=0.5. b:p=0.9,β=0.5,θ=0.1. c:p=0.1,β=0.5,θ=0.9. d:p=0.9,β=0.1,θ=0.9) of the DOWGeo model, respectively. The PMF can be either unimodal or bimodal and can be used to analyze both a positively skewed and a negatively skewed data set. Furthermore, the HRF can be either increasing, constant, increasing-constant-, J-, and bathtub-shaped. Therefore, the parameters of the DOWGeo model can be fixed to fit most data sets.

Figure 1.

Figure 1.

The PMF of the DOWGeo distribution.

Figure 2.

Figure 2.

The HRF of the DWGeo distribution.

Regarding the rth moment, it is difficult to express it in an explicit form, and consequently, Maple software is utilized to interpret some of the descriptive statistics of the DOWGeo model. Table 1 lists some descriptive statistics using the DOWGeo model for different values of the parameters p,β and θ.

Table 1.

Some useful descriptive statistics of the DOWGeo distribution.

Parameter Measure
β p θ Mean Variance Skewness Kurtosis DsI
0.5   0.1 0.00100 0.00099 31.5753 998.0020 0.99900
    0.3 0.03034 0.03074 5.84226 37.72465 1.01335
  0.1 0.5 0.12093 0.15324 3.67484 18.27296 1.26718
    0.7 0.38322 0.76523 2.92847 13.09894 1.99681
    0.9 1.87829 10.4785 2.53384 10.70328 5.57875
    0.1 0.12601 0.11215 2.33436 6.76028 0.89003
    0.3 0.47328 0.53516 1.44833 4.38079 1.13074
  0.5 0.5 1.05464 1.80762 1.22931 3.80505 1.71396
    0.7 2.38896 7.29211 1.11821 3.54407 3.05241
    0.9 9.05357 86.8501 1.05653 3.41683 9.59291
    0.1 1.11533 0.71572 0.134371 2.07950 9.59291
    0.3 2.54887 2.63579 0.01709 2.09458 1.03410
  0.9 0.5 4.77186 8.02525 0.07148 2.11899 1.68178
    0.7 9.72264 30.52447 0.09810 2.13726 3.13952
    0.9 34.0694 351.4762 0.11116 3.53164 10.3164
5.5 0.1   1.0×10177147 1.0×10177147 3.2×1088573 1.0×10177147 1.00000
  0.3   3.9×1092627 3.9×1092627 1.5×1046413 2.5×1092626 1.00000
  0.5 0.1 2.7×1053327 2.7×1053327 1.9×1026663 3.6×1053326 1.00000
  0.7   3.8×1027441 3.8×1027441 1.6×1013720 2.6×1027440 1.00000
  0.9   1.5×108106 1.5×108106 7.9×104052 6.3×108105 1.00000
  0.1   0.100000 0.09000 2.66666 8.11111 0.90000
  0.3   0.30000 0.21000 0.87287 1.76190 0.70000
  0.5 0.5 0.50000 0.25000 0.00000 1.00000 0.50000
  0.7   0.70000 0.21000 0.87287 1.76190 0.30000
  0.9   0.90000 0.09000 2.66666 8.11111 0.10000
  0.1   5.00111 0.90089 0.52137 3.32144 0.18013
  0.3   5.50660 1.01886 0.54209 3.37346 0.18502
  0.5 0.9 5.96232 1.12473 0.56793 3.42405 0.18863
  0.7   6.54227 1.26243 0.59542 3.474461 0.19296
  0.9   7.69496 1.53654 0.64448 3.58922 0.19968

From Table 1, it can be easily observed that the DOWGeo model is appropriate of modelling over- and (under)-dispersed data; for some selected values of the model parameters DsI =1, so it can be used as an alternative model to Poisson distribution; it is suitable for modelling negative and positive skewed as well as symmetric data sets; and DOWGeo model can be used to analyze either leptokurtic (kurtosis >3) or platykurtic (kurtosis <3) data sets. Tables 2–4 reports some numerical computations of MEOT, TVAR and Rényi entropy at x = 5 based on DOWGeo parameters when δ=2 and q = 0.5.

Table 2.

The MEOT, TVAR and entropy at β=2.5, θ=0.9 and p1.

  p
Measure ↓ Parameter 0.05 0.1 0.3 0.5 0.7 0.9 0.95
MEOT 1.23741 1.36315 1.81559 2.37393 3.26988 5.54413 7.23255
TVAR 11.82741 9.89443 7.30231 7.37393 8.26988 10.94776 12.90343
Rényi entropy 1.69774 1.76243 1.91188 2.02716 2.15060 2.33402 2.41787

Table 3.

The MEOT, TVAR and entropy at β=1.5, p = 0.5 and θ1.

  θ
Measure ↓ Parameter 0.1 0.3 0.5 0.7 0.9 0.95
MEOT 0.99974 0.99998 1.00000 1.00001 3.56188 9.89159
TVAR 1.3×10301029544 1.6×1015277 1.7×10150 2.2×106 8.56188 15.97817
Rényi entropy 1.4×108 0.16816 0.74610 1.30292 2.48811 3.20516

Table 4.

The MEOT, TVAR and entropy at θ=0.5, p = 0.5 and β grows.

  β
Measure ↓ Parameter 0.1 0.5 1.5 2.5 3.0 5.5
MEOT 8.81406 1.10315 1.00000 0.99999 0.99998 0.77272
TVAR 15.01324 258.4995 1.7×10150 1.1×109483 2.2×1075271 1.6×102371279759
Rényi entropy 1.34251 1.13804 0.74610 0.69318 0.69314 0.69313

From Tables 24, it is clearly seen that: the MEOT and Rényi entropy increase whereas the TVAR can take bathtub-shaped for some fixed values of β and θ with p1; the MEOT and Rényi entropy increase whereas the TVAR have inverse-J-shaped for some fixed values of β and p with θ1; and the MEOT and Rényi entropy decrease whereas the TVAR increases for some fixed values of θ and p with β grows.

4.2. The DOW-Inverse Weibull (DOWIW ) distribution

Consider the CDF of the inverse Weibull (IW) model. Then, the PMF of the DOWIW distribution can be defined as

fx(x;p,β,θ,α)=p(θxα1)βp(θ(x+1)α1)β;xN0{0}, (21)

where 0<p,θ<1, α>0 and β>0. Figures 3 and 4 show the PMF ( a:p=0.9,β=0.5,θ=0.1,α=3.8. b:p=0.9,β=0.9,θ=0.9,α=0.2. c:p=0.5,β=1.3,θ=0.1,α=1.2. d:p=0.5,β=2.3,θ=0.1,α=1.2) and HRF ( a:p=0.9,β=0.5,θ=0.1,α=2.8. b:p=0.9,β=0.9,θ=0.1,α=2.8. c:p=0.9,β=0.9,θ=0.9,α=0.9. d:p=0.5,β=0.3,θ=0.1,α=1.2) of the DOWIW distribution, respectively. The shape of the PMF is only unimodal, whereas the HRF can be increasing, increasing-constant-, or decreasing.

Figure 3.

Figure 3.

The PMF of the DOWIW distribution.

Figure 4.

Figure 4.

The HRF of the DOWIW distribution.

Like the DOWGeo model, the DOWIW distribution can be applied to model skewed data set. Moreover, it is appropriate for modelling over-, equi- or under-dispersed data sets.

5. Estimation

5.1. Maximum likelihood estimation

In this segment, unknown parameters of the DOW-G family are estimated by maximum likelihood estimation (MLE). Let X1,X2,,Xn be a RS from the proposed family. Then, the likelihood function (l) can be expressed as

l(x_;p,β,Φ)=i=1n(p[Ψ(xi;Φj)]βp[Ψ(xi+1;Φj)]β), (22)

Furthermore, the log-likelihood function (L) can be obtained as

L(x_;p,β,Φ)=i=1nlog(p[Ψ(xi;Φj)]βp[Ψ(xi+1;Φj)]β), (23)

where Ψ(;Φj)=G(;Φj)1G(;Φj). The maximum likelihood estimators of the unknown parameters p, β and  Φ can be achieved by solving the non-linear normal equations obtained by differentiating (23) with respect to the family parameters. The components of the score vector, B(p,β,Φ)=(Lp,Lβ,LΦ)T, are

Bp=1pi=1n[Ψ(xi;Φj)]βp[Ψ(xi;Φj)]β[Ψ(xi+1;Φj)]βp[Ψ(xi+1;Φj)]βp[Ψ(xi;Φj)]βp[Ψ(xi+1;Φj)]β, (24)
Bβ=ln(p)i=1n[Ψ(xi;Φj)]βp[Ψ(xi;Φj)]βln(Ψ(xi;Φj)[Ψ(xi+1;Φj)]βp[Ψ(xi+1;Φj)]βln(Ψ(xi+1;Φj)p[Ψ(xi;Φj)]βp[Ψ(xi+1;Φj)]β (25)

and

BΦj=ln(p)βi=1n[Ψ(xi;Φj)]β1p[Ψ(xi;Φj)]β[Ψ(xi;Φj]Φj[Ψ(xi+1;Φj)]β1p[Ψ(xi+1;Φj)]β[Ψ(xi+1;Φj]Φjp[Ψ(xi;Φj)]βp[Ψ(xi+1;Φj)]β, (26)

where [Ψ(;Φj)]Φj=Ψ(;Φj)Φj;j=1,2,3,,k. By putting Equations (24)–(26) equals to zero and solving them, immediately produces the MLEs of unknown parameters of the DOW-G family. Since the analytical solution to the above normal equations is difficult to obtain, therefore, we can solve these equations through an iterative method such as Newton-Raphson.

5.2. Bayesian estimation

When the experimenter has some prior knowledge (in the form of prior distribution) regarding the unknown parameters involved in the model, Bayesian estimation becomes crucial. In such a context, we discuss the estimation of unknown parameters of the DOW-G family under the Bayesian framework. Here, we assume the independent prior densities as pBeta(a1,b1), βGamma(a2,b2), and Φjdj(Φj,τj);j=1,2,,k, where dj(Φj,τj) and τj are the prior density and set of associated hyperparameters (i.e. parameters involved in the prior distribution), respectively, for the jth member of the vector parameter Φ. Then, the joint prior distribution of (p,β,Φ is

π(p,β,Φ)=1Beta(a1,b1)pa11(1p)b11.b2a2Γa2βa21eb2β.j=1kdj(Φj,τj), (27)

where, a1,b1,a2,b2>0, and the domain of hyperparameters in τj depends on the arbitrary choice of prior density dj. If we choose a1,b1,a2,b2 and τj such that Equation (27) has minimal or no effect on posterior distribution, we can perform Bayesian study under non-informative priors.

In view of the likelihood function (22), and the joint prior distribution (27), the unnormalized joint posterior density of (p,β,Φ) given X_ can be written as

Π(p,β,Φ|x_)l(x_;p,β,Φ).π(p,β,Φ). (28)

Loss functions (LFs) have an important role in statistical decision inference and Bayesian theory. Therefore, in order to develop the Bayes estimators (BEs) of unknown parameters (or any of their functions), we ought to consider the question of which type of LF will be used. Here, we use a popular LF, named as a squared error loss function (SELF). It is widely applicable due to its symmetric nature (i.e. it equally penalized negative and positive error). The SELF has the following form

Loss(ξ,ξ^)(ξ^ξ)2,

where ξ^ is any estimate of the parameter ξ, then the BE of ξ under SELF is ξ^=Eξ(ξ|x_), here the expectation is carried out under the posterior distribution of the unknown parameter ξ. Thus, in our case, the BE of g(p,β,Φ) (any function of parameters p, β, Φ) with SELF, can be derived as

g^BE(p,β,Φ|x_)=Ep,β,Φj|x_(g(p,β,Φ))=010Φ1Φkg(p,β,Φ)Π(p,β,Φ|x_)dΦkdΦ1dβdp. (29)

Since the analytical solution of the expectation in Equation (29) is difficult to derive due to the non-closure form of the joint posterior density (28), therefore we need numerical approximation methods. Here, we utilize two important Markov Chain Monte Carlo (MCMC) approaches, known as Gibbs sampling [25] and Metropolis–Hastings (MH) algorithm [31,46] to generate the RSs from the posterior density and computing posterior quantities of interest. The work of the Gibbs algorithm is to decompose the joint density (28) into full conditional posterior densities of p, β, Φ, so that these densities can be further used to simulate posterior RSs and to achieve the BEs of the unknown parameters.

To implement the Gibbs algorithm, the unnormalized conditional posterior densities of the parameters p, β, and Φj;j=1,2,,k are as follows

Π1(p|x_,β,Φ)l(x_;p,β,Φ)pa11(1p)b11, (30)
Π2(β|x_,p,Φ)l(x_;p,β,Φ)βa21eb2β, (31)
ΠΦj(Φj|x_,p,β,Φ)l(x_;p,β,Φ)dj(Φj,τj);j=1,2,,k, (32)

where, l(x;p,β,Φ) is the likelihood function in Equation (22). Since the posterior distributions (30)–(32) are difficult to simulate directly by conventional techniques of generating RSs, therefore, we employ MH algorithm to generate posterior RSs for every element of the parameter vector Θ=(p,β,Φ1,Φ2,,Φk) from their respective densities given in Equations (30)–(32).

The detailed steps of MH with in Gibbs algorithm for obtaining BE of Θl (lth element of Θ), l=1,2,,k+2, are as follows:

  • Step I.

    Initialize with some starting guess of Θl as Θl(0) and set i = 1.

  • Step II.

    Obtain the proposal point Θl(i) from the proposal distribution q(Θl(i1),Θl(i))=N(Θ^l(i),Var(Θ^l(i))) and u from Uniform distribution U(0,1). Here, we utilize Θ^l(i)=Θl(0), and Var(Θ^l(i)) is suitably chosen.

  • Step III.

    If uρ(Θl(i1),Θl(i))=min(Πl(Θl(i)|x_)Πl(Θl(i1)|x_)×q(Θl(i),Θl(i1))q(Θl(i1),Θl(i)),1), then Θl(i)=Θl(i), otherwise Θl(i)=Θl(i1). Here, the notation ρ(Θl(i1),Θl(i)) represents the acceptance probability and Πl has been defined in Equations (30)–(32).

  • Step IV.

    Set i = i + 1.

  • Step V.

    Rerun steps 2–4, say M times (a large number of times), and store Θl(i);i=1,2,,m.

  • Step VI.

    Under SELF, the BE of Θl, say Θ^l is obtained as Θ^l=1Mm0i=m0+1MΘl(i), where m0 is the burn-in-iterations of the Markov Chain.

6. A Monte Carlo simulation study

This section is devoted for an extensive simulation study to examine the behaviour of the various estimation methods discussed in Section 5. For this purpose, we generate samples of different sizes, i.e. n = 25, 50, and 100 from DOWGeo and DOWIW distributions. To draw the required RSs, we assume the parametric values of DOWGeo distribution as (0.5, 0.7, 0.8) and (0.9, 1.3, 0.5), while for DOWIW distribution, the model parameters are taken to be as (0.6, 0.8, 0.3, 1.2) and (0.9, 1.5, 0.5, 0.7). In classical setup, we compute the MLEs of the unknown parameters with their associated standard errors (Ses). The MLEs and Ses of the unknown parameters of DOWGeo distribution are presented in Table 5, whereas Table 6 holds the MLEs and Ses in case of DOWIW distribution.

Table 5.

MLE and Bayes estimates for unknown parameters of DOWGeo distribution.

  True value → (p,β,θ)=(0.5,0.7,0.8) (p,β,θ)=(0.9,1.3,0.5)
Sample size Parameters ↓ MLE Se BE PSe MLE Se BE PSe
25 p 0.5287 0.3740 0.4906 0.0683 0.8807 0.3218 0.9183 0.0315
  β 0.7236 0.3349 0.7103 0.0624 1.2667 1.2353 1.3290 0.1243
  θ 0.8303 0.1584 0.7875 0.0289 0.5235 0.5427 0.5129 0.0373
50 p 0.5117 0.2879 0.4946 0.0501 0.9150 0.1628 0.9108 0.0274
  β 0.7168 0.2628 0.7112 0.0445 1.3118 0.8405 1.3107 0.1103
  θ 0.8122 0.1180 0.7819 0.0261 0.5112 0.4107 0.4956 0.0345
100 p 0.5078 0.1894 0.5059 0.0479 0.8966 0.1143 0.9076 0.0204
  β 0.6962 0.1576 0.7023 0.0314 1.3092 0.7469 1.3012 0.0649
  θ 0.7961 0.0892 0.8003 0.0113 0.5070 0.3319 0.4989 0.0314

Table 6.

MLE and Bayes estimates for unknown parameters of DOWIW distribution.

  True value → (p,β,θ,α)=(0.6,0.8,0.3,1.2) (p,β,θ,α)=(0.9,1.5,0.5,0.7)
Sample size Parameters ↓ MLE Se BE PSe MLE Se BE PSe
25 p 0.5476 0.9712 0.5873 0.0526 0.9123 0.5718 0.9111 0.0271
  β 0.8369 2.4693 0.8263 0.1934 1.5739 4.8323 1.4857 0.1297
  θ 0.2889 0.6648 0.3124 0.0454 0.5348 0.9743 0.4874 0.0433
  α 1.2332 4.0627 1.2261 0.1274 0.6792 2.3019 0.6917 0.0715
50 p 0.5679 0.8459 0.5919 0.0413 0.8869 0.4573 0.9049 0.0205
  β 0.8194 1.8607 0.8079 0.0992 1.5349 2.9876 1.4943 0.1203
  θ 0.2914 0.3369 0.2941 0.0374 0.4819 0.7260 0.4889 0.0406
  α 1.2204 3.2751 1.2219 0.1227 0.6898 1.7737 0.7096 0.0672
100 p 0.6109 0.6257 0.6071 0.0397 0.9079 0.2951 0.8996 0.0183
  β 0.8102 1.1781 0.7989 0.0907 1.5104 2.0373 1.5030 0.1126
  θ 0.2975 0.3123 0.3039 0.0314 0.4989 0.5813 0.5011 0.0327
  α 1.2100 2.7216 1.2041 0.1146 0.6997 1.3602 0.7029 0.0612

For Bayesian analysis of DOWGeo distribution, Beta(a1,b1), Gamma(a2,b2), and Beta(a3,b3) are utilized as the independent informative priors for the parameters p, β, and θ, respectively, while in case of DOWIW distribution, the variability of the unknown parameters p, β, θ, and α is measured by assuming independent informative priors as Beta(a4,b4), Gamma(a5,b5), Beta(a6,b6), and Gamma(a7,b7), respectively. In these prior densities, the hyperparameters have been selected such that the mean of the prior density is approximately equal to the corresponding predetermined value of the parameter. Under this setting, in order to calculate the Bayes estimates of unknown parameters, we generate 51,000 items for each parameter by utilizing MH within Gibbs sampling. The starting 1000 burn-in iterations for each of the chains have been discarded for removing the effects of initial values of the parameters. Also, every 25th observation is stored to neutralize the auto-correlation of successive draws. The convergence diagnostics of each generated chain is investigated by Geweke's [26] test at a 95% credibility level. This diagnostics is also examined by plotting posterior densities, MCMC runs, and auto-correlation functions. The above plots can be seen in Figures S1, S2, S3, and S4 for DOWGeo(0.5, 0.7, 0.8), DOWGeo (0.9, 1.3, 0.5), DOWIW(0.6, 0.8, 0.3, 1.2), and DOWIW (0.9, 1.5, 0.5, 0.7) distributions, respectively.

After the convergence diagnostics, we have used these generated values to obtain Bayes estimates with their associated posterior standard errors (PSes) under SELF. The Bayes estimates and their PSes for the unknown parameters of DOWGeo distribution are given in Table 5, whereas Table 6 consists the Bayes estimates and PSes for the parameters of DOWIW distribution. All numerical computations are performed using open-source software R.

From this simulation study, we have observed that for each pair of sample size and true parametric value, both estimation methods work satisfactorily, but Bayesian method outperforms the MLE with respect to the estimation errors. Also, it is noticed that the estimation error associated with an estimate for both estimation procedures, tends to decrease as we increase the sample size.

7. Real data illustration

In this segment, we illustrate the applications of the DOWGeo and DOWIW models by using two real data sets. The fitting of the models are compared using some well-known measures, namely, L, Akaike information criterion (AkIC), correct Akaike information criterion (CAkIC), Hannan-Quinn information criterion (HQIC), Chi-square ( χ2) with degree of freedom (DF) and the associated P-value. The competitive distributions are provided in Table 7.

Table 7.

The competitive models.

Distribution Abbreviation Author(s)
Geometric Geo
Generalized geometric GGeo Gómez-Déniz [28]
Discrete Rayleigh DR Roy [60]
Discrete inverse Rayleigh DIR Hussain and Ahmad [33]
Discrete inverse Weibull DIW Jazi et al. [36]
One parameter discrete Lindley DLi-I Gómez-Déniz and Calderín-Ojeda [29]
Two parameters discrete Lindley DLi-II Hussain et al. [34]
Three parameters discrete Lindley DLi-III Eliwa et al. [19]
One parameter discrete flexible model DFx-I Eliwa and El-Morshedy [22]
Negative Binomial NeBi
Poisson Poi Poisson [56]
Discrete Bilal DBL Eliwa et al. [20]
Discrete Pareto DPa Krishna and Pundir [41]
Discrete Burr DB Krishna and Pundir [41]
Discrete Burr-Hatke DBH El-Morshedy et al. [22].
Discrete log-logistic DLogL Para and Jan [53]
Discrete Lomax distribution DLo Para and Jan [54]

7.1. Data set I: count cysts of kidneys using steroids

This data set is taken from the study of Chan et al. [15]. Here, we examine the fitting capability of the DOWGeo model with some other competitive distributions like: GGeo, Geo, DR, DIR, DIW, NeBi, Poi, DFx-I, DLi-I, DLi-II, DLi-III, DLogL, DB, DLo, and DPa. The MLEs with their associated standard errors (Se) are tabulated in Table 8, whereas Tables 9 and 10 list the goodness of fit statistics (GOFS). In Tables 9 and 10, ExFr and ObFr represent the expected and observed frequencies, respectively.

Table 8.

The MLEs with their corresponding Se for data set I.

Parameter → p β θ
Model ↓ MLE Se MLE Se MLE Se
DOWGeo 0.181 0.152 0.441 0.119 0.813 0.147
GGeo 0.188 0.089 0.800 0.064
Geo 0.582 0.030
DR 0.901 0.009
DIR 0.554 0.049
DIW 1.049 0.146 0.581 0.048
NeBi 0.812 0.045 0.322 0.074
Poi 1.390 0.112
DFx-I 0.623 0.031
DLi-I 0.436 0.026
DLi-II 0.581 0.045 0.001 0.058
DLi-III 0.582 0.005 358.728 11863.37 0.001 20.698
DLogL 0.780 0.136 1.208 0.159
DB 0.278 0.045 1.053 0.167
DLo 0.152 0.098 1.830 0.952
DPa 0.268 0.034

Table 9.

Various fitted distributions with GOFS for data set I based on one parameter competitive models.

    ExFr
X ObFr DOWGeo Geo DR DIR Poi DFx-I DLi-I DPa
0 65 65.006 45.980 10.890 60.888 27.398 45.256 40.286 65.842
1 14 14.203 26.760 26.618 33.99 38.084 29.094 29.834 18.267
2 10 8.581 15.575 29.448 8.123 26.468 16.508 18.357 8.164
3 6 5.947 9.064 22.296 3.004 12.264 8.893 10.336 4.513
4 4 4.355 5.275 12.629 1.420 4.262 4.703 5.523 2.820
5 2 3.262 3.070 5.539 0.779 1.185 2.489 2.851 1.909
6 2 2.458 1.787 1.914 0.473 0.274 1.335 1.437 1.368
7 2 1.843 1.039 0.526 0.308 0.054 0.731 0.711 1.022
8 1 1.364 0.605 0.116 0.212 0.009 0.409 0.347 0.789
9 1 0.991 0.352 0.020 0.152 0.001 0.234 0.167 0.626
10 1 0.702 0.205 0.003 0.112 0 0.137 0.079 0.506
11 2 1.288 0.288 0.001 0.539 0.001 0.211 0.072 4.174
Total 110 110 110 110 110 110 110 110 110
L   166.605 178.767 277.778 186.547 246.210 182.288 189.110 171.192
AkIC   339.209 359.533 557.556 375.094 494.420 366.575 380.220 344.384
CAkIC   339.436 359.570 557.593 375.131 494.457 366.612 380.257 344.421
HQIC   342.495 360.629 558.651 376.189 495.515 367.671 381.316 345.479
χ2   0.596 19.109 306.515 40.456 89.277 31.702 34.635 3.430
DF   2 4 4 2 3 4 4 4
P-value   0.742 <0.001 <0.001 <0.001 <0.001 <0.001 <0.001 0.489

Table 10.

Various fitted distributions with GOFS for data set I based on competitive models with two or three parameters.

    ExFr
X ObFr DOWGeo GGeo NeBi DLi-II DLi-III DLogL DB DLo DIW
0 65 65.006 62.738 56.520 46.026 46.008 63.192 64.743 61.615 63.910
1 14 14.203 19.665 15.885 26.768 26.765 20.101 19.177 21.023 20.699
2 10 8.581 9.439 9.173 15.568 15.570 8.644 8.484 9.687 8.053
3 6 5.947 5.436 6.203 9.054 9.058 4.656 4.632 5.275 4.234
4 4 4.355 3.463 4.502 5.266 5.269 2.864 2.863 3.197 2.599
5 2 3.262 2.349 3.400 3.062 3.066 1.921 1.920 2.088 1.754
6 2 2.458 1.663 2.635 1.781 1.783 1.368 1.365 1.441 1.261
7 2 1.843 1.213 2.079 1.036 1.037 1.019 1.013 1.038 0.949
8 1 1.364 0.904 1.663 0.602 0.604 0.786 0.777 0.773 0.739
9 1 0.991 0.685 1.344 0.350 0.351 0.623 0.613 0.592 0.592
10 1 0.702 0.525 1.095 0.204 0.204 0.504 0.494 0.463 0.485
11 2 1.288 1.920 5.501 0.283 0.285 4.322 3.919 2.808 4.725
Total 110 110 110 110 110 110 110 110 110 110
L   166.605 168.556 168.544 178.767 178.767 171.717 171.139 170.481 172.935
AkIC   339.209 341.113 340.088 361.534 363.533 347.43 346.278 344.961 349.869
CAkIC   339.436 341.225 344.489 361.646 363.759 347.55 346.391 345.073 349.982
HQIC   342.495 343.303 343.2788 363.724 366.819 349.62 348.469 347.152 352.060
χ2   0.596 2.444 4.287 19.091 19.096 4.033 2.587 3.238 6.445
DF   2 3 4 3 2 3 2 3 3
P-value   0.742 0.485 0.369 0.0003 <0.0001 0.258 0.274 0.356 0.092

From Tables 9 and 10 it is noted that, the DPa, GGeo, NeBi, DLogL, DB, DLo and DIW distributions work quite satisfactory for analyzing data set I besides the DOWGeo distribution under condition (P-value >0.05). Depending on L, AkIC, CAkIC, HQIC, χ2 and P-value, it is easily visible that the DOWGeo model provides the best fit among all other tested models since it has the smallest value of L, AkIC, CAkIC, HQIC, and χ2, whereas it achieve the highest P-value associated to the χ2 test. Figure S5 depicts the profile of L for each parameter based on data set I, which indicates that the estimators of the parameters are unimodal. Figures 5 and S6 support the results of Tables 9 and 10, and they conclude that data set I follows the DOWGeo, DPa, GGeo, NeBi, DLogL, DBX-II, DLo and DIW models. But, the DOWGeo model is better than other rival models.

Figure 5.

Figure 5.

The observed and expected PMFs for data set I.

Table 11 lists some theoretical and empirical descriptive statistics for data set I.

Table 11.

Some useful descriptive statistics for data set I.

Type ↓ Measures Mean Variance DsI Skewness Kurtosis
Theoretical 1.39493 5.98717 4.29207 2.34691 9.11467
Empirical 1.39090 6.11184 4.39416 2.29259 8.17345

From Table 11, it is clear that the theoretical measures are closed to empirical ones. Most of the distribution is at the left and platykurtic. Further, the data considered herein suffering from over-dispersion phenomena.

7.2. Data set II: COVID-19 in South Korea

This data set can be viewed at (https://www.worldometers.info/coronavirus/country/south-korea/) and consists the daily new deaths in South Korea for COVID-19 from 15 February to 12 Dec 2020. Here, we examine the fitting capability of the DOWIW model with some other rival distributions like DIW, DLogL, DB, DIR, DBL, DPa, DBH, and Poi. The MLEs with their associated Se are reported in Table 12, whereas Table 13 lists the GOFS.

Table 12.

The MLEs with their corresponding Se for data set II.

  p β θ α
Model MLE Se MLE Se MLE Se MLE Se
DOWIW 0.997 0.0002 1.064 0.069 0.991 0.003 1.083 0.102
DIW 0.271 0.025 1.411 0.083
DLogL 1.716 0.095 1.878 0.107
DB 0.591 0.031 2.466 0.248
DIR 0.229 0.023
DBL 0.707 0.010
DPa 0.377 0.021
DBH 0.904 0.020
Poi 1.901 0.079

Table 13.

Various fitted distributions with GOFS for data set II.

    E. Fr
No. ECB O. Fr DOWIW DIW DLogL DB DIR DBL DPa DBH Poi
0 89 89.040 82.351 80.931 92.887 69.890 63.089 149.356 166.600 45.408
1 79 74.528 103.702 92.775 97.788 140.613 89.133 50.500 54.598 86.336
2 49 51.862 44.390 51.431 42.676 47.685 64.868 25.478 26.667 82.074
3 29 34.059 22.317 27.336 21.174 19.125 39.578 15.379 15.540 52.014
4 19 21.595 12.926 15.559 12.172 9.333 22.307 10.312 10.017 24.725
5 17 13.351 8.248 9.518 7.754 5.198 12.037 7.387 6.885 9.402
6 9 8.092 5.636 6.193 5.308 3.162 6.328 5.533 4.953 2.979
7 6 4.825 4.052 4.247 3.831 2.098 3.273 4.408 3.682 0.809
8 6 2.387 3.031 3.061 2.879 1.429 1.674 3.466 2.808 0.192
9 1 4.261 17.347 12.949 17.531 5.467 1.713 32.181 12.25 0.061
Total 304 304 304 304 304 304 304 304 304 304
L   564.625 586.855 577.011 587.652 606.870 575.338 633.5307 620.466 621.0976
AIC   1137.250 1177.711 1158.023 1179.304 1215.740 1152.676 1269.061 1242.932 1244.195
CAIC   1137.384 1177.751 1158.063 1179.344 1215.754 1152.689 1269.075 1242.945 1244.208
BIC   1152.118 1185.145 1165.457 1186.738 1219.457 1156.393 1272.778 1246.649 1247.912
HQIC   1143.198 1180.684 1160.997 1182.278 1217.227 1154.163 1270.548 1244.419 1245.682
χ2   2.792 41.868 25.019 44.784 92.204 28.203 128.631 109.333 115.896
D.F   3 6 6 6 6 6 7 6 4
P-value   0.425 0.001 0.001 0.001 0.001 0.001 0.001 0.001 0.001

Based on the fitting measures in Table 13, we can announce that the DOWIW model is the best fitted model. Figure S7 shows the profiles of L for each parameter based on data set II, and it shows that every estimator has a unimodal function. Figures 6 and S8 support the results of Table 13, and we can say that the data set II follows the DOWIW model.

Figure 6.

Figure 6.

The observed and expected PMFs for data set II.

Some theoretical and empirical descriptive statistics based on data set II can be viewed in Table 14.

Table 14.

Some useful descriptive statistics for data set II.

Type ↓ Measures Mean Variance DsI Skewness Kurtosis
Theoretical 1.9013 4.1222 2.1681 1.2638 4.0864
Empirical 1.8567 3.9978 2.1531 1.3657 4.2367

From Table 14, the empirical mean, variance, DsI and skewness are closed to theoretical ones. Data set II suffering from over-dispersion phenomena, and most of the distribution is at the left with platykurtic.

7.3. Bayesian estimation for real data sets I and II

Here, we compute the BEs with their PSes for the unknown parameters of the DOWGeo and DOWIW distributions using a similar manner discussed in Section 5. Since there is no prior information available regarding the unknown parameters of the special distributions, therefore, we have used non-informative priors for the unknown parameters of the models under study. The calculated estimates with the related estimation errors of DOWGeo and DOWIW distributions are jointly given in Table 15.

Table 15.

Bayes estimates for DOWGeo and DOWIW distributions under real data sets I and II.

  DOWGeo distribution under data set I DOWIW distribution under data set II
Parameter ↓ Model BE PSe BE PSe
p 0.1804 0.0105 0.9970 0.0002
β 0.4390 0.0285 1.1105 0.0427
θ 0.8141 0.0235 0.9869 0.0024
α 1.0135 0.0728

From the comparison of Bayes estimates (in Table 15) and MLEs (in Tables 8 and 12), we can easily observe that the result obtained in case of simulation study do hold in case of real-life data sets as well.

8. Concluding remarks

In the present research article, we have introduced a new family of discrete distributions called DOW-G family. Its several important statistical characteristics have been investigated. Two particular distribution of the proposed family are studied in detail. These special models provides good flexibility in terms of shapes for the PDFs. Further, we have noticed that the proposed family can model a positively skewed, a negatively skewed or a symmetric shaped data set. In addition, the DOW-G family can be used quite effectively for modelling a wide variety of failure data because its HRF can take diverse shapes (increasing, decreasing, constant, J-, and bathtub-shaped). Moreover, it is suitable for modelling under-, equi- and over-dispersed data set. In classical and non-classical setups, the method of maximum likelihood and Bayesian approach have been utilized to estimate the unknown parameters of particular models.

An extensive Monte Carlo simulation analysis has been conducted to evaluate the behaviour of above stated estimation methods. The results of simulation studies declare that these two estimation procedures perform quite satisfactorily in estimating unknown parameters of the model, but the Bayesian method dominates the method of maximum likelihood in terms of estimation errors. The flexibility of the DOW-G family has also been exemplified by using two distinctive real data sets. Hence, it is reasonable to say that the special distributions of the developed family can serve as an alternative model to the existing discrete models for modelling count or failure data in various fields including reliability, insurance, medicine, economics, and demography, etc.

It is worth noting that the above research can be expanded into several dimensions, for example- with different types of censoring schemes, we can study the particulars models of the proposed family (see Tyagi et al. [65]); In reliability inference, we can analyze stress-strength reliability or load share systems under special distributions of the DOW-G family. Also, various characteristics of order statistics from the proposed family could be explored and to infer the bi-variate data, a bi-variate discrete family may also be studied.

Supplementary Material

Supplemental_Material

Acknowledgments

The authors thankfully acknowledge the critical suggestions and comments from the associate Editor and referees which greatly helped us in the improvement of the paper.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

  • 1.Abd EL-Baset A.A. and Ghazal M.G.M., Exponentiated additive Weibull distribution, Reliab. Eng. Syst. Saf. 193 (2020), p. 106663. [Google Scholar]
  • 2.Afify A.Z. and Alizadeh M., The odd dagum family of distributions: Properties and applications, J. Appl. Probab. Stat. 15 (2020), pp. 45–72. [Google Scholar]
  • 3.Alamatsaz M.H., Dey S., Dey T., and Harandi S.S., Discrete generalized Rayleigh distribution, Pakistan J. Statist. 32 (2016), pp. 1–20. [Google Scholar]
  • 4.Al-Babtain A., Fattah A.A., Ahmed A.H.N., and Merovci F., The Kumaraswamy-transmuted exponentiated modified Weibull distribution, Commun. Stat. Simul. Comput. 46 (2017), pp. 3812–3832. [Google Scholar]
  • 5.Alizadeh M., Afify A.Z., Eliwa M.S., and Ali S., The odd log-logistic Lindley-G family of distributions: Properties, Bayesian and non-Bayesian estimation with applications, Comput. Stat. 35 (2020), pp. 281–308. [Google Scholar]
  • 6.Alizadeh M., Emadi M., Doostparast M., Cordeiro G.M., Ortega E.M., and Pescim R.R., A new family of distributions: The Kumaraswamy odd log-logistic, properties and applications, Hacettepe J. Math. Statist. 44 (2015), pp. 1491–1512. [Google Scholar]
  • 7.Almalki S.J. and Yuan J, A new modified Weibull distribution, Reliab. Eng. Syst. Saf. 111 (2013), pp. 164–170. [Google Scholar]
  • 8.Alzaatreh A., Lee C., and Famoye F, A new method for generating families of continuous distributions, Metron 71 (2013), pp. 63–79. [Google Scholar]
  • 9.Bakouch H.S., Chesneau C., and Khan M.N., The extended odd family of probability distributions with practice to a submodel, Filomat 33 (2019), pp. 3855–3867. [Google Scholar]
  • 10.Bakouch H.S., Aghababaei M., and Nadarajah S., A new discrete distribution, Statistics 48 (2014), pp. 200–240. [Google Scholar]
  • 11.Bebbington M., Lai C.D., Wellington M., and Zitikis R, The discrete additive Weibull distribution: A bathtub-shaped hazard for discontinuous failure data, Reliab. Eng. Syst. Saf. 106 (2012), pp. 37–44. [Google Scholar]
  • 12.Bourguignon M., Silva R.B., and Cordeiro G.M., The Weibull-G family of probability distributions, J. Data. Sci. 12 (2014), pp. 53–68. [Google Scholar]
  • 13.Chakraborty S., Generating discrete analogues of continuous probability distributions – a survey of methods and constructions, J. Stat. Distrib. Appl. 2 (2015), pp. 1–30. [Google Scholar]
  • 14.Chandrakant K., Yogesh M.T., and Manoj K.R., On a discrete analogue of linear failure rate distribution, Am. J. Math. Manage. Sci. 36 (2017), pp. 229–246. [Google Scholar]
  • 15.Chan S.K., Riley P.R., Price K.L., McElduff F., Winyard P.J., Welham S.J., Woolf A.S., and Long D.A., Corticosteroid-induced kidney dysmorphogenesis is associated with deregulated expression of known cystogenic molecules, as well as Indian hedgehog, Am. J. Physiology-Renal Physiol. 298 (2010), pp. F346–F356. [DOI] [PubMed] [Google Scholar]
  • 16.Cordeiro G.M., Altun E., Korkmaz M.C., Pescim R.R., Afify A.Z., and Yousof H.M., The xgamma family: Censored regression modelling and applications, REVSTAT–Statist. J. 18 (2020), pp. 593–612. [Google Scholar]
  • 17.Cooray K., Generalization of the Weibull distribution: The odd Weibull family, Stat. Model. 6 (2006), pp. 265–277. [Google Scholar]
  • 18.Eliwa M.S., Alhussain Z.A., and El-Morshedy M., Discrete Gompertz-G family of distributions for over-and under-dispersed data with properties, estimation, and applications, Mathematics 8 (2020a), p. 358. [Google Scholar]
  • 19.Eliwa M.S., Altun E., El-Dawoody M., and El-Morshedy M., A new three-parameter discrete distribution with associated INAR(1) process and applications, IEEE Access 8 (2020b), pp. 91150–91162. doi: 10.1109/ACCESS.2020.2993593. [DOI] [Google Scholar]
  • 20.Eliwa M.S., El-Morshedy M., and Altun E., A study on discrete Bilal distribution with properties and applications on integer-valued autoregressive process, Revstat – Statist. J. (2020c), pp. 1–30. [Google Scholar]
  • 21.Eliwa M.S. and El-Morshedy M., A one-parameter discrete distribution for over-dispersed data: Statistical and reliability properties with applications, J. Appl. Statist. (2021), pp. 1–21. doi: 10.1080/02664763.2021.1905787. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.El-Morshedy M., Eliwa M.S., and Altun E., Discrete Burr-Hatke distribution with properties, estimation methods and regression model, IEEE Access 8 (2020a), pp. 74359–74370. [Google Scholar]
  • 23.El-Morshedy M., Eliwa M.S., and Nagy H., A new two-parameter exponentiated discrete Lindley distribution: Properties, estimation and applications, J. Appl. Stat. 47 (2020b), pp. 354–375. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.El-Morshedy M. and Eliwa M. S., The odd flexible Weibull-H family of distributions: Properties and estimation with applications to complete and upper record data, Filomat 33 (2019), pp. 2635–2652. [Google Scholar]
  • 25.Geman S and Geman D, Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images, IEEE. Trans. Pattern. Anal. Mach. Intell. 6 (1984), pp. 721–741. [DOI] [PubMed] [Google Scholar]
  • 26.Geweke J., Evaluating the accuracy of sampling-based approaches to the calculation of posterior moments Minneapolis, MN, Federal Reserve Bank of Minneapolis, Research Department, 196, 1991.
  • 27.Ghitany M.E. and Al-Mutairi D.K., Estimation methods for the discrete Poisson Lindley distribution, J. Stat. Comput. Simul. 79 (2009), pp. 1–9. [Google Scholar]
  • 28.Gómez-Déniz E., Another generalization of the geometric distribution, Test 19 (2010), pp. 399–415. [Google Scholar]
  • 29.Gómez-Déniz E. and Calderín-Ojeda E., The discrete Lindley distribution: Properties and applications, J. Stat. Comput. Simul. 81 (2011), pp. 1405–1416. [Google Scholar]
  • 30.Hamedani G.G., Altun E., Korkmaz M.Ç., Yousof H.M., and Butt N.S., A new extended G family of continuous distributions with mathematical propertiesm, characterizations and regression modelling, Pakistan J. Statist. Oper. Res. 14 (2018), pp. 737–758. [Google Scholar]
  • 31.Hastings W.K, Monte Carlo sampling methods using Markov chains and their applications, Biometrika 57 (1970), pp. 97–109. [Google Scholar]
  • 32.Hosking J.R. and Wallis J.R., Regional Frequency Analysis: An Approach Based on L-Moments, Cambridge University Press, Cambridge, 1997. [Google Scholar]
  • 33.Hussain T. and Ahmad M, Discrete inverse Rayleigh distribution, Pakistan J. Statist. 30 (2014), pp. 203–222. [Google Scholar]
  • 34.Hussain T., Aslam M., and Ahmad M., A two-parameter discrete Lindley distribution, Revista Colombiana De Estadistica 39 (2016), pp. 45–61. [Google Scholar]
  • 35.Inusah S. and Kozubowski T.J., A discrete analogue of the Laplce distribution, J. Statist. Plann. Inference 136 (2006), pp. 1090–1102. [Google Scholar]
  • 36.Jazi M.A., Lai C.D., and Alamatsaz M.H., A discrete inverse Weibull distribution and estimation of its parameters, Stat. Methodol. 7 (2010), pp. 121–132. [Google Scholar]
  • 37.Korkmaz M. Ç., A new family of the continuous distributions: The extended Weibull-G family, Commun. Faculty Sci. Univ. Ankara Ser. A1 Math. Statist. 68 (2018), pp. 248–270. [Google Scholar]
  • 38.Korkmaz M.Ç, Altun E., Yousof H.M., and Hamedani G.G., The odd power Lindley generator of probability distributions: Properties, characterizations and regression modelling, Int. J. Stat. Probab. 8 (2019), pp. 70–89. [Google Scholar]
  • 39.Korkmaz M.Ç., Altun E., Yousof H.M., and Hamedani G.G., The Hjorth's IDB generator of distributions: Properties, characterizations, regression modelling and applications, J. Statist. Theory Appl. 19 (2020), pp. 59–74. [Google Scholar]
  • 40.Korkmaz M.Ç. and Genç A.İ., A new generalized two-sided class of distributions with an emphasis on two-sided generalized normal distribution, Commun. Statist.-Simul. Comput. 46 (2017), pp. 1441–1460. [Google Scholar]
  • 41.Krishna H. and Pundir P.S., Discrete Burr and discrete Pareto distributions, Stat. Methodol. 6 (2009), pp. 177–188. [Google Scholar]
  • 42.Kus C., Akdogan Y., Asgharzadeh A., Kinaci I., and Karakaya K., Binomial-discrete Lindley distribution, Commun. Faculty Sci. Univ. Ankara Ser. A1-Math. Statist. 68 (2018), pp. 401–411. [Google Scholar]
  • 43.Lai C.D., Xie M., and Murthy D.N.P., A modified Weibull distribution, IEEE Trans. Reliab. 52 (2003), pp. 33–37. [Google Scholar]
  • 44.Lee C., Famoye F., and Olumolade O., Beta-Weibull distribution: Some properties and applications to censored data, J. Modern Appl. Statist. Methods 6 (2007), p. 17. [Google Scholar]
  • 45.Marshall A.W. and Proschan F., Classes of distributions applicable in replacement with renewal theory implications, Sixth Berkeley Symposium I, 1970, pp. 395–416.
  • 46.Metropolis N. and Ulam S., The Monte Carlo method, J. Amer. Statist. Assoc. 44 (1949), pp. 335–341. [DOI] [PubMed] [Google Scholar]
  • 47.Nascimento A.D., Silva K.F., Cordeiro G.M., Alizadeh M., Yousof H.M., and Hamedani G.G., The odd Nadarajah-Haghighi family of distributions: Properties and applications, Studia Scientiarum Mathematicarum Hungarica 56 (2019), pp. 185–210. [Google Scholar]
  • 48.Nekoukhou V., Alamatsaz M.H., and Bidram H., Discrete generalized exponential distribution of a second type, Statistics 47 (2013), pp. 876–887. [Google Scholar]
  • 49.Nekoukhou V. and Bidram H., The exponentiated discrete Weibull distribution, Statist. Oper. Res. Trans. 39 (2015a), pp. 127–146. [Google Scholar]
  • 50.Nekoukhou V. and Bidram H., A new four-parameter discrete distribution with bathtub and unimodal failure rate, J. Appl. Statist. 42 (2015b), pp. 2654–2670. [Google Scholar]
  • 51.Nofal Z.M., Afify A.Z., Yousof H.M., Granzotto D.C., and Louzada F., Kumaraswamy transmuted exponentiated additive Weibull distribution, Int. J. Statist. Probab. 5 (2016), pp. 78–99. [Google Scholar]
  • 52.Pal M., Ali M.M., and Woo J., Exponentiated weibull distribution, Statistica 66 (2006), pp. 139–147. [Google Scholar]
  • 53.Para B.A. and Jan T.R., Discrete version of log-logistic distribution and its applications in genetics, Int. J. Modern Math. Sci. 14 (2016a), pp. 407–422. [Google Scholar]
  • 54.Para B.A. and Jan T.R., On discrete three parameter burr type XII and discrete Lomax distributions and their applications to model count data from medical science, Biom. Biostatist. Int. J. 4 (2016b), pp. 1–15. [Google Scholar]
  • 55.Pinho L.G.B., Cordeiro G.M., and Nobre J.S., The gamma-exponentiated Weibull distribution, J. Statist. Theory Appl. 11 (2012), pp. 379–395. [Google Scholar]
  • 56.Poisson S.D., Probabilité des jugements en matiére criminelle et en matiére civile, précédées des régles génerales du calcul des probabilitiés, Paris, Bachelier, 1837, pp. 206–207.
  • 57.Rényi A, On measures of entropy and information, Math. Statist. Probab. 1 (1961), pp. 547–561. [Google Scholar]
  • 58.Reyad H., Korkmaz M.Ç., Afify A.Z., Hamedani G.G., and Othman S., The Fréchet topp Leone-G family of distributions: Properties, characterizations and applications, Ann. Data Sci. 8 (2019), pp. 345–366. [Google Scholar]
  • 59.Roy D., The discrete normal distribution, Commun. Statist. Theory Methods 32 (2003), pp. 1871–1883. [Google Scholar]
  • 60.Roy D., Discrete Rayleigh distribution, IEEE Trans. Reliab. 53 (2004), pp. 255–260. [Google Scholar]
  • 61.Sarhan A.M. and Apaloo J, Exponentiated modified Weibull extension distribution, Reliab. Eng. System Saf. 112 (2013), pp. 137–144. [Google Scholar]
  • 62.Silva G.O., Ortega E.M., and Cordeiro G.M., The beta modified Weibull distribution, Lifetime. Data. Anal. 16 (2010), pp. 409–430. [DOI] [PubMed] [Google Scholar]
  • 63.Tahir M.H., Cordeiro G.M., Alizadeh M., Mansoor M., Zubair M., and Hamedani G.G., The odd generalized exponential family of distributions with applications, J. Statist. Distrib. Appl. 2 (2015), p. 1. [Google Scholar]
  • 64.Tyagi A., Choudhary N., and Singh B., Discrete additive Perks–Weibull distribution: Properties and applications, Life Cycle Reliab. Saf. Eng. 8 (2019), pp. 183–199. [Google Scholar]
  • 65.Tyagi A., Choudhary N., and Singh B., A new discrete distribution: Theory and applications to discrete failure lifetime and count data, J. Appl. Probab. Statist. 15 (2020), pp. 117–143. [Google Scholar]
  • 66.Tyagi A., Choudhary N., and Singh B., Inferences on discrete Rayleigh distribution under type-II censored data using a tie-run approach, Int. J. Agricult. Stat. Sci. 16 (2020), pp. 939–951. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental_Material

Articles from Journal of Applied Statistics are provided here courtesy of Taylor & Francis

RESOURCES