Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2022 Mar 21;36:105339. doi: 10.1016/j.rinp.2022.105339

A new generalization of Gull Alpha Power Family of distributions with application to modeling COVID-19 mortality rates

Mutua Kilai a,, Gichuhi A Waititu b, Wanjoya A Kibira b, Huda M Alshanbari c, M El-Morshedy d,e
PMCID: PMC8935977  PMID: 35340982

Abstract

This paper proposes a new generalization of the Gull Alpha Power Family of distribution, namely the exponentiated generalized gull alpha power family of distribution abbreviated as (EGGAPF) with two additional parameters. This proposed family of distributions has some well known sub-models. Some of the basic properties of the distribution like the hazard function, survival function, order statistics, quantile function, moment generating function are investigated. In order to estimate the parameters of the model the method of maximum likelihood estimation is used. To assess the performance of the MLE estimates a simulation study was performed. It is observed that with increase in sample size, the average bias, and the RMSE decrease. A distribution from this family is fitted to two real data sets and compared to its sub-models. It can be concluded that the proposed distribution outperforms its sub-models.

Keywords: Gull Alpha Power Family, Exponentiated generalized distribution, Quantile function, Cramer–Von Misses test, Maximum likelihood estimation

Introduction

Probability distributions have been of good use in various areas of science including reliability engineering, survival analysis, financial time series, econometrics, social sciences among others. Because of the applicability of probability distributions in real life, over the last decade researchers have dedicated efforts to modify and generate new distributions in order to address challenges of lifetime data. In order to guarantee flexibility of distributions, the number of shape parameters that characterize the cumulative distribution function matters. Many methods have been devised to add one or more parameters to the cumulative distribution function of a distribution which makes the resulting distribution to be flexible and rich for modeling data.

A list of the methods include the Beta generalized family developed by [1], Kumaraswamy-G family by [2], McDonald-G family by [3], Weibull X by [4], Gamma-X by [5], Beta Mashall–Olkin family of distributions by [6], a new Weibull G family by [7], the Alpha Power Transformation family by [8], the Marshall–Olkins Alpha Power family by [9], the Transmuted Alpha Power-G Family by [10], the Alpha-Power Marshall Olkins G distribution by [11], the Gull Alpha Power of the Ampadu Type by [12] and Families of distributions arising from the quantile of generalized lambda distribution by [13] are some of the notable generalizations and generators developed recently among others.

A new method called exponentiated method where the cumulative distribution function is raised to a parameter was introduced by [14]. Many traditional distributions have been extended using this method, the exponentiated exponential distribution by [15]. For other distributions extended using this criteria the reader is referred to the articles [16], [17], [18] and [19]. The addition of one shape parameter to the cumulative distribution function have a shortcoming in that it handles only the skewness, and fails to control kurtosis. [20] introduced a new method of adding two extra shape parameters called the exponentiated generalized class of distributions and studied its properties. Several authors have used this method to extend distributions for example [21] extended the Mashall Olkins family of distributions using this approach [22] developed the exponentiated generalized standardized half-logistic distribution.

Due to the availability of the above methods of generating new continuous distributions, the exponential distribution has been extended in many researches a few generalizations of the exponential distribution include the exponentiated generalized mashall olkins exponential distribution by [21], the Gompertz inverse exponential distribution studied by [23], odd generalized exponential distribution by [24], weibull exponential distribution by [25] among other generalizations.

Recently, [26] developed the Gull Alpha Power Family (GAPF) with CDF and PDF:

FGAPF(y)=αW(y;φ)αW(y;φ)ifα>0,α1W(y;φ)ifα=1 (1)
fGAPF(y)=log(α)α1W(y;φ)(w(y;φ)W(y;φ))+w(y;φ)α1W(y;φ);α>0,α1 (2)

New extension of the GAPF is proposed by considering its PDF and CDF as the baseline in the exponentiated generalized class studied by [20]. The new family of distributions is named the Exponentiated-Generalized Gull Alpha Power. The primary motivation to extend this family is:

  • 1.

    To develop distributions that are characterized by different shapes of the hazard function

  • 2.

    To generate distribution that offer a better fit compared to other models with the same baseline distribution

  • 3.

    To construct distributions that are heavy tailed in order to model diverse real data sets.

  • 4.

    To develop distributions that are characterized by different shapes.

Some special families from this new family of distributions are displayed in Table 1

Table 1.

Some special families.

α a b Family
1 1 1 W(y)
α 1 1 Gull Alpha Power
1 a b Exponentiated class of distributions
α 1 b Exponentiated Gull Alpha Power family

The remainder of the article is organized as, the new family of distribution is presented. Important properties of the EGGAP family such as CDF, PDF, HRF and survival function are discussed. Some important sub models for the family with the shapes of the PDF and HRF are presented. Estimation of parameters and simulation to investigate the properties of the estimators are provided. In order to compare the proposed distribution with well known models in literature COVID-19 data is used and the results tabulated. In the final section, conclusions are made.

Exponentiated generalized gull alpha power family of distributions

The CDF of exponentiated generalized family of distributions developed by [20] has CDF given in Eq. (3).

F(y;φ)EGG=[1{1W(y;φ)}a]b,a>0,b>0 (3)

and Eq. (4) gives the PDF:

f(y;φ)EGG=ab{1W(y;φ)}a1[1{1W(y;φ)}a]b1w(y;φ),a,b>0 (4)

The EGGAP family of distribution is obtained by inserting Eqs. (1), (2) into Eqs. (3), (4) respectively. The new family has CDF given as:

FEGGAPF(y,α,a,b,φ)=[1(W¯(y;φ))a]bα>0,1,a,b>0 (5)

and Eq. (6) gives the PDF:

fEGGAPF(y,α,a,b,φ)=(ablog(α)α1W(y;φ)(w(y;φ)W(y;φ))+abw(y;φ)α1W(y;φ))×(W¯(y;φ))a1×[1(W¯(y;φ))a]b1α>0,1,a,b>0 (6)

W¯(y;φ)=1αW(y;φ)αW(y;φ) denotes the survival function for the baseline distribution, φ are the baseline distribution parameters and the baseline CDF is W(y;φ).

Eq. (7) gives the survival function for the new family:

F¯EGGAPF(y;Θ)=1[1(W¯(y))a]ba,b>0 (7)

The hazard function for the new family of distributions is

hEGGAPF(y;Θ)=fEGGAPF1[1(W¯(y))a]b (8)

fEGGAPF is defined in Eq. (6)

EGGAP-exponential distribution

The EGGAPE(a,b,α,λ) a special case of the EGGAP family is explained. The pdfs and the hrf for some choices of parameter values are explored to determine the shapes that the family can assume. The exponential distribution has PDF w(y)=λeλy and CDF W(y)=1eλy,y>0. The extension of the exponential distribution EGGAPE(a,b,α,λ) has the PDF, CDF and HRF as:

W(y)=[1(1α(1eλy)α1eλy)a]b,λ,a,b>0,α>0,1 (9)
w(y)=abαeλyλeλyabαeλylog(α)λ(1eλy)eλy(1α(1eλy)α1eλy)1a[1(1α(1eλy)α1eλy)a]1b (10)
τ(y)=K{(1α(1eλy)α1eλy)a1[1(1α(1eλy)α1eλy)a]b1}1[1(1α(1eλy)α1eλy)a]b (11)

where K=abαeλyλeλyabαeλylog(α)λ(1eλy)eλy

As observed in Fig. 2 the hazard shape of the new distributions can take various forms from the decreasing, increasing, inverted bathtub shape to bathtub. Results from Fig. 2 illustrates versatility of the new distribution in handling data characterized with several shapes of the hazard (see Fig. 1).

Fig. 2.

Fig. 2

EGGAPE hazard shapes.

Fig. 1.

Fig. 1

EGGAPE PDF shapes.

Mathematical properties of EGGAP family of distributions

Useful expansion

Consider:

(1x)d=l=0(1)ddlxl (12)

for any integer d and |x|<1. Using the expansion in Eq. (12) in Eq. (5) we can express the cdf of EGGAP(α,a,b,φ) as:

FEGGAP(α,a,b,φ)=[1(W¯(y;φ))a]b=l=0(1)lblW¯(y;φ)al=l=0(1)lbls=0(1)salsWEGAP(y;φ)s=l=0s=0(1)l+sblalsWEGAP(y;φ)s=s=0ωsWEGAP(y;φ)s (13)

By differentiating (13) we obtain:

fEGGAP(α,a,b,φ)=fEGAP(y;φ)s=0sωsWEGAP(y;φ)s1=s=0ωsddyGEGAP(y;φ)s (14)

where ωs=l=0(1)l+sbmals

Eqs. (13), (14) confirms that the CDF and PDF of EGGAP(α,a,b,φ) are a linear combination of Exponentiated-Gull Alpha Power family of distributions. This result is not new, it has been shown by [20].

Quantile function

Inverting the cdf of EGGAP family of distribution using mathematica software we get

Q(p)=G1(W(y)[(1+(1p1b)1alog(α))×1α]log(α)),α>0,α1

W(y) is the product log function defined as:

W(y)=i=1n(1)n1nn2(n1)!zn

Using this formula, random numbers y can be generated from EGGAP(a,b,α,λ) as:

y=G1(W(y)[(1+(1p1b)1alog(α))×1α]log(α)) (15)

For example the quantile function for the EGGAPE(a,b,α,λ) is given as:

Qy(u)=1λlog{log(α)logα+W1((1+(1u1b)1a)log(α)α)},α>0,α1 (16)

Table 2 shows the quantile values for some parameter values with relation to Eq. (16).

Table 2.

Some quantile values.

y Q(y) of EGGAPE(1.5,0.8,0.5,0.2)
0.1 0.373692
0.2 0.889970
0.3 1.490652
0.4 2.175624
0.5 2.964027
0.6 3.896447
0.7 5.053373
0.8 6.617515
0.9 9.170003

To provide a numerical example on how to simulate random observations from the EGGAPE distribution Eq. (16) is used and the R language to generate 1000 EGGAPE(1.4,1.3,0.8,0.3) random variables.

As shown in Fig. 3, the plot show the adequacy of the model for practical application.

Fig. 3.

Fig. 3

Plots of the EGGAPE(1.4,1.3,0.8,0.3) for simulated data with n = 1000.

In addition Eq. (16) is used to get Moors Kurtosis and Bowley’s Skewness which exists even if the distribution does not have moments and they are less sensitive to outliers.

The Bowley skewness defined by [27] is given as:

Q(0.75)+Q(0.25)2Q(0.5)Q(0.75)Q(0.25)

Moors Kurtosis coefficient, [28] is defined as:

Q(0.375)Q(0.125)+Q(0.875)Q(0.25)Q(0.75)Q(0.25)

As shown in Fig. 4 shows the Moors Kurtosis with different baseline parameter values. The extra shape parameters have an effect on the kurtosis of the EGGAPE distribution showing the impact of the parameters.

Fig. 4.

Fig. 4

Kurtosis plot for EGGAPE distribution for α=0.75 and λ=0.2.

The Bowley’s skewness is given in Fig. 5. It is clear that the extra shape parameters have an effect on the skewness of the EGGAPE distribution showing the impact of the parameters.

Fig. 5.

Fig. 5

Bowley’s Skewness for EGGAPE distribution for λ=0.2 and α=0.75.

Order statistics

Assume y1,y2,,yn is a sample from the EGGAP(a,b,α,φ) family. Eq. (17) shows the rth order statistic PDF:

fr:n(y,Θ)=n!(r1)!(nr)!fEGGAP(y,Θ)[FEGGAP(y,Θ)]r1×[1FEGGAP(y,Θ)]nr=n!(r1)!(nr)!h=0nr(1)hnrhfEGGAP(y,Θ)×[FEGGAP(y,Θ)]h+r1 (17)

Next, making use of the expansions described in Section “Useful Expansion”, EGGAP(α,a,b,φ) is derived and given as:

fr:n(y,Θ)=n!(r1)!(nr)!h=0nr(1)hnrhfEGGAP(y,Θ)×s=0sωsWEGAP(y;φ)s1×[k=0ωkWEGAP(y;φ)k]h+r1 (18)

Upon further simplification, Eq. (18) reduces to:

fr:n(y,Θ)=n!(r1)!(nr)!h=0nr(1)hnrhfEGGAP(y,Θ)×s=0sωsWEGAP(y;φ)s1×k=0dh+r1,kWEGAP(y;φ)k=n!(r1)!(nr)!h=0nr(1)hnrh×[fEGGAP(y,Θ)k,s=0sωsdh+r1,kWEGAP(y;φ)k+s1] (19)

Moment generating function

MGF of EGGAP family of distributions is derived. By definition the MGF is given by:

MY(t)=E[eYt]

Following methodology used in [21], the MGF of the EGGAP family of distribution is:

MY(t)=E[eYt]=eytfEGGAP(y;α,a,b,φ)dy=eyts=0ωsddyWEGAP(y;φ)sdy=s=0ωseytddyWEGAP(y;φ)sdy=s=0ωsMY(t) (20)

It is notably clear that the MGF of the EGGAP family can be written in terms of the MGF of the exponentiated Gull alpha power distribution. The MGF can be used to obtain moments for the EGGAPE distribution displayed in Table 3. Parameters used for numerical integration included: I:a=1.2,b=0.1,α=0.2,λ=0.6,II:a=4.6,b=5.2,α=0.1,λ=2.2, III:a=3.5,b=2.5,α=2.2,λ=0.9, IV:a=4.0,b=3.5,α=2.8,λ=1.2 and V:a=3.2,b=3.1,α=0.1,λ=2.2. Table 3 shows that in terms of the variance and means the EGGAPE distribution is very versatile. The coefficient of skewness values demonstrate the shape of the distribution assumes right skewed and symmetrical shapes. The EGGAPE distribution can be mesokurtic or leptokurtic based on the values of the coefficient of kurtosis.

Table 3.

The EGGAPE distribution’s moments.

r I II III IV V
μ1 0.4296 0.5825 0.2580 0.1505 0.6271
μ2 1.1481 0.3722 0.1021 0.0313 0.4505
μ3 4.8405 0.2594 0.0571 0.0085 0.3651
μ4 27.3025 0.1964 0.0429 0.0029 0.3301
μ5 191.849 0.1609 0.0419 0.00123 0.3308
SD 0.9816 0.1813 0.1885 0.0930 0.2394
CV 2.2848 0.3112 0.7306 0.6179 0.3817
CS 3.7211 0.7200 1.8532 1.5179 0.7805
CK 21.7079 3.9878 9.1077 6.8693 4.1267

Entropy

Entropy is an index that measures the degree of uncertainty. Eq. (21) gives the Renyi entropy developed by [29]:

IR(γ)=11ϑlog(0fϑ(y)dy) (21)

where ϑ>0 and ϑ1.

Eq. (22) gives the Shannon entropy

ηy=E{log[f(y)]} (22)

Direct calculation of the Shannon entropy yields:

ηy=E[log(ab)log[log(α)]+2log(α)[1W(y;φ)]log(W(y;φ))(α1)log(W¯(y;φ))(b1)log[1W¯(y;φ)]]=log(ab)log[log(α)]+2log(α)2log(α)W(y;φ)E[log(W(y;φ))](α1)E[log(W¯(y;φ))](b1)E[log[1W¯(y;φ)]] (23)

The Renyi Entropy for different parameter values is displayed in Table 4. I:a=1.5,b=0.7,α=2.5,λ=0.5.II:a=2.5,b=1.5,α=0.8,λ=0.3.III:a=1.5,b=2.5,α=2.0,λ=1.5.

Table 4.

Some values of Renyi entropy.

δ I II III
0.1 2.4980 2.9219 1.6761
0.5 0.7697 1.9139 0.4364
0.9 0.2433 1.6650 0.1182
1.5 −0.1974 1.4967 −0.0880
2.5 −0.7559 1.3667 −0.2405

Estimation

The maximum likelihood estimation approach is used to estimate the parameters of the new family. Assume y1,y2,y3,,yn is a sample of size n from EGGAP family of distribution where α,a,b,andφ are parameters and φ is a vector of parameters which is associated with the baseline distribution. Let Θ=(α,a,b,φ)T be the t×1 vector of parameters, then the corresponding log-likelihood function for Θ is:

l=nlog(ab)+log(log(α))+i=1n[1W(y;φ)]log(α)+i=1nlogW(y;φ)+i=1n[1W(y;φ)]log(α)+(α1)i=1nlogW¯(y;φ)+(b1)×i=1nlog[1W¯(y;φ)]a=nlog(ab)+log(log(α))+2i=1n[1W(y;φ)]log(α)+i=1nlogW(y;φ)+(α1)i=1nlogW¯(y;φ)+(b1)i=1nlog[1W¯(y;φ)]a (24)

Score function components Mn(Θ)=(lα,la,lb,lφ) are the following:

lb=nb+i=1nlog[1W¯(y;φ)]a (25)
la=na+(b1)[1W¯(y;φ)]alog[1W¯(y;φ)] (26)
lα=1αlog(α)+2[i=1nlog[1W¯(y;φ)]α]+i=1nlog[W¯(y;φ)] (27)

and for w=1,2,,p

lφk=2log(α)i=1nWk(y;φ)+Wk(y;φ)Wk(y;φ)+(α1)W¯k(y;φ)W¯k(y;φ)α(b1)W¯k(y;φ)[1W¯k(y;φ)]a1[1W¯k(y;φ)]a (28)

where Wk(y;φ)=Wk(y;φ)φk and W¯k(y;φ)=Wk¯(y;φ)φk.

The MLEs Θˆ=(αˆ,aˆ,bˆ,φˆ) can be obtained by equating Eqs. (25) to (28) to zero and obtaining solutions through the numerical method using an appropriate software.

Monte Carlo Simulation study

The effectiveness of the MLE method to estimate the parameters of the EGGAPE distribution by the use of Monte Carlo Simulation is discussed. The nlminb() function in R is used to numerically evaluate the MLEs performance. The parameters’ (a,b,α,λ) average bias (AB) and root mean squared error (RMSE) are determined.

In order to conduct the simulation study, Eq. (16) was utilized to generate different samples of different lengths using different parameter values. The simulation study made 2000 iterations for the following choices of sample sizes n=100,,2000 and the initial parameter values a=0.9,b=0.3,α=0.6 and λ=0.5 in set I and a=0.7,b=0.8,α=0.4 and λ=0.3 in set II.

The MLEs are estimated for each item that is (aˆ,bˆ,αˆ,λˆ) for n=100,,2000 and the AB and RMSE calculated by:

AB=1Mi=1M(φˆφ) (29)

and

RMSE=1Mi=1M(φˆφ)2 (30)

where φ=a,b,α,λ

Simulation algorithm

The following simulation algorithm was used

  • i.

    Specification of the initial values of the model parameters

  • ii.

    Using Eq. (19) generate a random sample of size n

  • iii.

    Evaluate the values of the estimates using the maximum likelihood method

  • iv.

    Calculate the AB and the RMSE for all the parameters

Conclusions from simulation study

Fig. 6, Fig. 7, Fig. 8, Fig. 9, Fig. 10, Fig. 11 summarize the findings of the simulation study. The mean of the estimated parameters, the average bias and the RMSE are determined. From the simulation graphs we can confidently say that the MLE estimates the unknown parameters effectively and the resulting parameters are close to the actual values when the sample size increases. Moreover, average bias and RMSE decrease with increase in sample size.

Fig. 6.

Fig. 6

Estimated MLEs for set I (a=0.9,b=0.3,α=0.6,λ=0.5).

Fig. 7.

Fig. 7

Average Bias for set I (a=0.9,b=0.3,α=0.6,λ=0.5).

Fig. 8.

Fig. 8

RMSE for set I (a=0.9,b=0.3,α=0.6,λ=0.5).

Fig. 9.

Fig. 9

Estimated MLEs for set II (a=0.7,b=0.8,α=0.4,λ=0.3).

Fig. 10.

Fig. 10

Average Bias for set II (a=0.7,b=0.8,α=0.4,λ=0.3).

Fig. 11.

Fig. 11

RMSE for set II (a=0.7,b=0.8,α=0.4,λ=0.3).

Practical illustration using real life data

This section examines real world data set for COVID-19 mortality rates to show the value and the flexibility of the EGGAPE distribution. The study compared the proposed distribution’s goodness-of-fit test results and criterion measures to those of its sub-models.

The motivation of the EGGAPE distribution is its usefulness in data analysis problems in various fields especially survival data analysis. Recently a number of distributions have been developed to model mortality rates of COVID-19. The CDFs of the competing models include:

  • 1
    The exponential(E) distribution
    W(y,λ)=1eλy (31)
  • 2
    Gull Alpha Power Exponential(GAPE)
    W(y,α,λ)=α(1eλy)α1eλy (32)
  • 3
    Exponentiated Gull Alpha Power Exponential(EGAPE)
    W(y,α,λ)=(α(1eλy)α1eλy)b (33)
  • 4
    Exponentiated Exponential(EE) distribution
    W(y,λ,b)=(1eλy)b (34)
  • 5
    Exponentiated Generalized exponential(EGE) distribution
    W(y,λ,a,b)=(1(eλy)a)b (35)

Goodness-of-fit test statistics considered are the, Anderson Darling (A*), Cramer–von-Misses (W*) and the Kolmogorov–Smirnov (K–S) statistic with the associated p values in addition to the log-likelihood function are presented. To select the best model, the model that has the lowest log-likelihood, and the lowest A*. W* and K–S is the best model for modeling data sets.

Data set 1: United Kingdom Covid-19 mortality rate

Table 5 represents COVID-19 data from the United Kingdom for a 76-day period beginning on April 15 and ending on June 30, 2020. The data was retrieved from https://covid19.who.int/.

Table 5.

UK COVID-19 mortality rates data.

0.0587 0.0863 0.1165 0.1247 0.1277 0.1303 0.1652 0.2079 0.2395 0.2751
0.2845 0.2992 0.3188 0.3317 0.3446 0.3553 0.3622 0.3926 0.3926 0.4110
0.4633 0.4690 0.4954 0.5139 0.5696 0.5837 0.6197 0.6365 0.7096 0.7193
0.7444 0.8590 1.0438 1.0602 1.1305 1.1468 1.1533 1.2260 1.2707 1.3423
1.4149 1.5709 1.6017 1.6083 1.6324 1.6998 1.8164 1.8392 1.8721 1.9844
2.1360 2.3987 2.4153 2.5225 2.7087 2.7946 3.3609 3.3715 3.7840 3.9042
4.1969 4.3451 4.4627 4.6477 5.3664 5.4500 5.7522 6.4241 7.0657 7.4456
8.2307 9.6315 10.1870 11.1429 11.2019 11.4584

To begin with exploratory analysis was done which comprises of the descriptive statistics, the TTT plot, a histogram and a boxplot of the data set. The Total Time on Test gives a pictorial representation of the failure time. The shape of the TTT curve enables researchers to decide on the failure rate of the data, [30]. As seen in Fig. 12 the data has a decreasing failure rate that is recommended for using a modification of the exponential distribution.

Fig. 12.

Fig. 12

TTT plot, boxplot and histogram of the UK-COVID 19 data.

For UK-COVID data it is platykurtic and right-skewed, as can be seen in Table 6.

Table 6.

COVID-19 mortality rates in the United Kingdom summary statistics.

Statistic Minimum Maximum Mean Std.dev Median CK CS
Value 0.0587 11.458 2.437 2.936 1.248 1.957 1.669

The MLEs and standard errors are portrayed in Table 7. Most Parameters of the fitted distributions were significant at 5% level since by the standard error test.

Table 7.

The estimates and SEs for first data set.

Model aˆ bˆ αˆ λˆ
EGGAPE 0.141(0.020) 1.220(0.191) 2.660(0.025) 2.300(0.031)
EGAPE 0.939(0.156) 1.739(0.502) 0.282(0.079)
GAPE 0.291(0.073) 1.781(0.442)
EGE 0.378(0.732) 0.934(1.807) 0.805(0.118)
EE 0.353(0.056) 0.800(0.116)
E 0.410(0.047)

From Table 8, in comparison to the other models, the AIC, K–S, and W* values for the EGGAPE distribution are the lowest, while the log-likelihood is the highest. The EGGAPE distribution, according to formal goodness-of-fit tests, gives a better fit than its sub-models since its K–S, Anderson–Darling, and W* values are the least.

Table 8.

He log-likelihood function of EGGAPE distribution along with goodness-of-fit measures for first data set.

Model l AIC A* K–S p-value W*
EGGAPE(Proposed) −139.166 286.371 0.3688 0.0681 0.8727 0.0508
EGAPE(New) −141.305 288.610 0.6792 0.0889 0.5851 0.0996
GAPE(New) −141.377 286.755 0.0667 0.0981 0.4579 0.1394
EGE −142.504 291.341 0.9279 0.1002 0.4301 0.1394
EE −142.503 289.006 0.9289 0.0982 0.4560 0.1396
E −143.704 289.409 0.9167 0.1447 0.0829 0.1376

The results of the LRT test are displayed in Table 9. The results revealed that the EGGAPE is a better fit than its sub-models at 5% and 10% level of significance.

Table 9.

Likelihood ratio test for UK COVID-19 mortality data.

Model Hypothesis LRT P-values
EGAPE (New) H0:a=1 vs. H1:a1 4.278 0.0386
GAPE (New) H0:a=b=1 vs. H1:a=b1 4.422 0.1095
EGE H0:α=1 vs. H1:α1 6.676 0.0098
EE H0:a=α=1 vs. H1:a,α1 6.674 0.0355
E H0:a=b=α=1 vs. H1:a,b1 9.077 0.0283

Fig. 13 shows the densities of the fitted distributions.

Fig. 13.

Fig. 13

Fitted densities plot for UK COVID-19 data set.

Data set 2: Kenya COVID-19 daily cases

Table 10 represents the daily number of cases of COVID-19 for Kenya for the time span of 56 days starting at 28 March 2020 and ending at 22 May 2020. Source of the data was https://covid19.who.int.

Table 10.

Kenya COVID-19 daily cases data.

7 4 8 9 22 29 12 4 16 16
16 14 7 3 5 5 2 6 11 8
9 9 12 16 8 11 15 7 17 16
7 12 8 11 10 12 15 24 30 25
45 47 25 14 28 23 28 15 22 21
23 49 57 25 51 66 80 52

Table 11 displays the descriptive statistics for the Kenya daily COVID-19 data. From the descriptive statistic, the data is right skewed and platykurtic.

Table 11.

Summary statistics for Kenya daily COVID-19 cases.

Statistic Minimum Maximum Mean Std.dev Median Kurtosis Skewness
Value 2 80 20.178 16.84 15.000 2.167 1.612

The TTT is a pictorial representation of the failure time. The shape of the TTT curve enables researchers to decide on the failure rate of the data. As seen in Fig. 14 the data has a modified bathtub failure rate that is recommended for using a modification of the exponential distribution.

Fig. 14.

Fig. 14

TTT plot, boxplot and histogram of the Kenya-Covid 19 daily cases data.

The MLEs for the parameters and their standard errors are shown in Table 12. Parameters for most of the fitted distributions were significant at 5% level since by the standard error test, the values of the standard errors of the parameters are less than half the value of the parameter.

Table 12.

The estimates and SEs for second data set.

Model aˆ bˆ αˆ λˆ
EGGAPE 0.398(0.164) 3.777(1.062) 2.611 (0.192) 0.142(0.057)
EGAPE 2.922(0.792) 2.105(0.461) 0.053(0.014)
GAPE 0.083(0.013) 0.189(0.133)
EGE 0.208 (0.818) 0.366(1.439) 2.055(0.426)
EE 0.076(0.011) 2.071(0.431)
E 0.049(0.006)

As depicted in Table 13, in comparison to the other models, the AIC, K–S, and W* values for the EGGAPE distribution are the lowest, while the log-likelihood is the highest. The EGGAPE distribution, according to formal goodness-of-fit tests, gives a better fit than its sub-models since its K–S, Anderson–Darling, and W* values are the least.

Table 13.

The log-likelihood function of EGGAPE distribution along with goodness-of-fit measures for second data set.

Model l AIC A* K–S p-value W*
EGGAPE(Proposed) −216.370 440.740 0.276 0.086 0.7969 0.041
EGAPE(New) −217.122 440.243 0.403 0.114 0.452 0.060
GAPE(New) −220.730 445.461 0.1.109 0.128 0.318 0.173
EGE −218.405 443.049 0.730 0.125 0.379 0.112
EE −218.523 441.047 0.731 0.122 0.376 0.113
E −224.258 450.517 0.835 0.186 0.041 0.129

The visual display for the estimated PDF of the competing models is presented in Fig. 15.

Fig. 15.

Fig. 15

Fitted Densities plot Kenya daily cases for COVID-19.

Future work

It should be noted that the proposed generator of distributions only looks at cases which were complete (uncensored). However, in real life scenarios, incomplete cases arise especially in real medical research. Therefore, the study recommends that for future works, censored data may be used to demonstrate the applicability of the developed models. In this study, the maximum likelihood estimation has been used to develop the estimators of the parameters, for further studies a comparison of the parameter estimation procedures; maximum likelihood estimation, method of moments and the maximum probability space can be considered. In many instances, covariates exists and they may have an effect on the outcome variable. Therefore, in this context parametric regression models for studying the relationship between the output and input variables may be constructed using the proposed distributions. Further, Bayesian estimation and prediction for the EGGAPE distribution using informative priors can be considered as an open area for future research.

Conclusions

The EGGAP family of distributions has been developed and studies. The expressions for the basic statistical properties including the probability distribution function, hazard rate function, survival function, order statistics and the quantile have been derived successfully. A special case of the family called EGGAPE distribution has been explored and the estimates of the parameters of the sub-family have been obtained through the maximum likelihood method. The AB and the RMSE were used to evaluate the performance of the estimators of the new distribution. We can conclude that the MLE estimates are consistent and both the AB and the RMSE decrease with increase in sample size. It has been shown that the new distribution provides a better fit than that of its sub models in data that is characterized by monotone and non-monotone hazard rates through application to real data.

CRediT authorship contribution statement

Mutua Kilai: Supervision, Validation, Visualization, Writing – original draft, Writing – review editing. Gichuhi A. Waititu: Supervision, Validation, Visualization, Writing – original draft, Writing – review editing. Wanjoya A. Kibira: Supervision, Validation, Visualization, Writing – original draft, Writing – review editing. Huda M. Alshanbari: Supervision, Validation, Visualization, Writing – original draft, Writing – review editing.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

The first author thanks the Pan African University, Institute for Basic Sciences Technology and Innovation for supporting this work. Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2022R299), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia .

References

  • 1.Eugene N., Lee C., Famoye F. Beta-normal distribution and its applications. Comm Statist Theory Methods. 2002;31(4):497–512. [Google Scholar]
  • 2.Cordeiro G.M., de Castro M. A new family of generalized distributions. J Stat Comput Simul. 2011;81(7):883–898. [Google Scholar]
  • 3.Alexander C., Cordeiro G.M., Ortega E.M., Sarabia J.M. Generalized beta-generated distributions. Comput Statist Data Anal. 2012;56(6):1880–1897. [Google Scholar]
  • 4.Alzaatreh A., Famoye F., Lee C. Weibull-Pareto distribution and its applications. Comput Statist Data Anal. 2013;42(9):1673–1691. [Google Scholar]
  • 5.Alzaatreh A., Famoye F., Lee C. The gamma-normal distribution: Properties and applications. Comput Statist Data Anal. 2014;69:67–80. [Google Scholar]
  • 6.Alizadeh M., Cordeiro G.M., De Brito E., Demétrio C.G.B. The beta Marshall–Olkin family of distributions. J Stat Distrib Appl. 2015;2(1):1–18. [Google Scholar]
  • 7.Tahir M.H., Zubair M., Mansoor M., Cordeiro G.M., Ali̇zadehk M., Hamedani G.G. A new Weibull-G family of distributions. Hacet J Math Stat. 2016;45(2):629–647. [Google Scholar]
  • 8.Mead M.E., Cordeiro G.M., Afify A.Z., Al Mofleh H. The alpha power transformation family: properties and applications. Int J Stat Probab. 2019;(2019):525–545. [Google Scholar]
  • 9.Nassar M., Kumar D., Dey S., Cordeiro G.M., Afify A.Z. The Marshall–Olkin alpha power family of distributions with applications. J Comput Appl Math. 2019;351(2019):41–53. [Google Scholar]
  • 10.Eghwerido J.T., Efe-Eyefia E., Zelibe S.C. The transmuted alpha power-G family of distributions. J Stat Manage Syst. 2020;(2020):1–38. [Google Scholar]
  • 11.Eghwerido J.T., Oguntunde P.E., Agu F.I. The alpha power Marshall–Olkin-G distribution: properties, and applications. Sankhya A. 2021:1–26. [Google Scholar]
  • 12.Ampadu C.B. Gull alpha power of the ampadu-type: Properties and applications. Earthline J Math Sci. 2021;6(1):187–207. [Google Scholar]
  • 13.Aldeni M., Lee C., Famoye F. Families of distributions arising from the quantile of generalized lambda distribution. J Stat Distrib Appl. 2017;4(1):1–18. [Google Scholar]
  • 14.Gupta R.C., Gupta P.L., Gupta R.D. Modeling failure time data by lehman alternatives. Comm Statist Theory Methods. 1998;27(4):887–904. [Google Scholar]
  • 15.Gupta R.D., Kundu D. Exponentiated exponential family: an alternative to gamma and Weibull distributions. Biometrical J J Math Methods Biosci. 2001;43(1):117–130. [Google Scholar]
  • 16.Rather A.A., Subramanian C. Exponentiated Mukherjee-Islam distribution. J Stat Appl Probab. 2018;7(2):357–361. [Google Scholar]
  • 17.Hajar N.M., Abu-Zinadah H.H. Some statistical properties of the beta exponentiated gompertz distribution. IOSR J Math. 2018;14(1):31–40. [Google Scholar]
  • 18.Sarhan A.M., Apaloo J. Exponentiated modified Weibull extension distribution. Reliab Eng Syst Saf. 2013;112:137–144. [Google Scholar]
  • 19.Huang S., Oluyede B.O. Exponentiated kumaraswamy-dagum distribution with applications to income and lifetime data. J Stat Distrib Appl. 2014;1(1):1–20. [Google Scholar]
  • 20.Cordeiro G.M., Ortega E.M., da Cunha D.C. The exponentiated generalized class of distributions. J Data Sci. 2013;11(1):1–27. [Google Scholar]
  • 21.Handique L., Chakraborty S., de Andrade T.A. The exponentiated generalized marshall–olkin family of distribution: its properties and applications. Ann Data Sci. 2019;6(3):391–411. [Google Scholar]
  • 22.Cordeiro G.M., de Andrade T.A., Bourguignon M., Gomes-Silva F. The exponentiated generalized standardized half-logistic distribution. Int J Stat Probab. 2017;6(3):24–42. [Google Scholar]
  • 23.Oguntunde P.E., Khaleel M.A., Adejumo A.O., Okagbue H.I., Opanuga A.A., Owolabi F.O. The gompertz inverse exponential (GoIE) distribution with applications. Cogent Math Stat. 2018;5(1) [Google Scholar]
  • 24.Owoloko E.A., Oguntunde P.E., Adejumo A.O. Performance rating of the transmuted exponential distribution: an analytical approach. SpringerPlus. 2015;4(1):1–15. doi: 10.1186/s40064-015-1590-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Oguntunde P.E., Balogun O.S., Okagbue H.I., Bishop S.A. The Weibull-exponential distribution: Its properties and applications. J Appl Sci. 2015;15(11):1305–1311. [Google Scholar]
  • 26.Ijaz M., Asim S.M., Farooq M., Khan S.A., Manzoor M., Asim S.M., Farooq M., Khan S.A., Manzoor S. A gull alpha power Weibull distribution with applications to real and simulated data. Plos One. 2020;15(6) doi: 10.1371/journal.pone.0233080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Kenney J.F., Keeping E.S. Linear regression and correlation. Math Stat. 1962;1:252–285. [Google Scholar]
  • 28.Moors J.J.A. A quantile alternative for kurtosis. J R Stat Soc Ser D (Stat) 1988;37(1):25–32. [Google Scholar]
  • 29.Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Contributions to the Theory of Statistics. University of California Press; 1961. Rényi, alfréd; pp. 547–561. [Google Scholar]
  • 30.Aarset M.V. How to identify a bathtub hazard rate. Trans Reliab. 1987;36(1):106–108. [Google Scholar]

Articles from Results in Physics are provided here courtesy of Elsevier

RESOURCES