The Exponentiated Gumbel–Weibull {Logistic} Distribution with Application to Nigeria’s COVID-19 Infections Data

Patrick Osatohanmwen; Eferhonore Efe-Eyefia; Francis O Oyegue; Joseph E Osemwenkhae; Sunday M Ogbonmwan; Benson A Afere

doi:10.1007/s40745-022-00373-0

. 2022 Mar 19;9(5):909–943. doi: 10.1007/s40745-022-00373-0

The Exponentiated Gumbel–Weibull {Logistic} Distribution with Application to Nigeria’s COVID-19 Infections Data

Patrick Osatohanmwen ^1,^✉, Eferhonore Efe-Eyefia ², Francis O Oyegue ³, Joseph E Osemwenkhae ³, Sunday M Ogbonmwan ³, Benson A Afere ⁴

PMCID: PMC8934027 PMID: 38624783

Abstract

A new flexible univariate probability distribution was defined in this paper. The new distribution is so called the ‘exponentiated Gumbel–Weibull {logistic} distribution’ and it arose by using the exponentiated Gumbel distribution to generate a generalized Weibull distribution using the logit function or the quantile function of the logistic distribution as a link. The new distribution was observed to be both unimodal and bimodal as well as exhibits various shape and tail properties consistent with data arising from several real life phenomena. A detail study of its statistical properties was carried out and the maximum likelihood method was used in the estimation of its parameters. The new distribution was applied in fitting the reported daily number of infections due to the COVID-19 pandemic in Nigeria. Five other datasets were further used to ascertain the flexibility of the new distribution in fitting data sets with different statistical properties.

Keywords: T–R {Y} family, Gumbel distribution, Weibull distribution, Maximum likelihood estimation, Monte Carlo Simulations

Introduction

The science of data is one which involves the use of some methodologies from disparate fields in extracting information from data usually for policy purposes. These methodologies include statistical methodologies, scientific methodologies, artificial intelligence as well as data analysis methodologies [1–3]. These methodologies come handy in aggregating, cleaning, preparing data for analysis, manipulating data as well finding specific patterns or trajectories that data follow. Within the vanguard of statistical modeling of data, the practice is usually to find a stochastic model which best describe the behavior of a given data. These stochastic models are usually completely specified as probability distribution functions from which other desirable properties of the data are obtained for either policy making or for further investigations. The need to obtain appropriate distribution functions which can best describe the stochastic behavior of data sets arising from several real life situations is one of the major drives for the development of new and more flexible families of probability distributions. Within the context of applications, the classical probability distribution functions have been found to be unable to adequately fit data sets with varying shape and tail properties in many studies and hence the increasing volumes of research devoted so far to generalized them and in the process increase their flexibility. Several methods have been put forward in the literature for the generalization of a probability distribution [4–12] each with their attendant benefits and shortcomings.

The COVID-19 pandemic is one which has ravage the entire world and accompanying it are economic, social and behavioral challenges and responses. Several studies, using mathematical models, statistical models, behavioral models and those involving artificial intelligence frameworks have been put forward already to explain the evolution, transmission and the impacts of the pandemic in several countries of the world using data on the daily, weekly or monthly number of infections from the disease [13–21]. However, it is important to state that data of this nature tends to possess one or more characteristics which classical probability distributions as used in statistical modeling may not be able to capture when they are used to describe them. For example, data of this sort tends to be highly skewed either to the right or to the left with the possibility of having some outlying observation and hence, a classical distribution like the normal distribution cannot be used to fit such data and it becomes imperative to use a very flexible distribution to fit data of this sort such as generalized families of distributions. In this paper a new probability distribution which is a generalization of the classical Weibull distribution is developed and used to fit the daily number of infections from the COVID-19 pandemic in Nigeria. The new distribution is further used in fitting five other data sets in order to demonstrate how flexible it can be.

The rest of the paper is organized thus. In Sect. 2, the new distribution is presented. A discussion on some of the statistical properties of the distribution is contained in Sect. 3. The process of using the maximum likelihood method for the estimation of the parameters of the distribution is contained in Sect. 4 while application of the distribution to real data sets is carried out in Sect. 5. The paper closes in Sect. 6 with summary and conclusion.

The New Distribution

Supposed $T$ is a random variable following the exponentiated Gumbel distribution defined by [22] with the cumulative distribution function (cdf), probability density function (pdf) and quantile function given respectively by

F_{T} (x) = 1 - {\{1 - exp [- exp (- \frac{x - k}{c})]\}}^{β},

f_{T} (x) = \frac{β}{c} exp (- \frac{x - k}{c}) exp [- exp (- \frac{x - k}{c})] {\{1 - exp [- exp (- \frac{x - k}{c})]\}}^{β - 1},

Q_{T} (p) = k - c log \{- log [1 - {(1 - p)}^{1 / β}]\},

c, β > 0, - \infty \leq x, k \leq \infty, x \geq k, 0 < p < 1 .

Suppose also that $R$ is a Weibull random variable with cdf, pdf and quantile function given respectively by

F_{R} (x) = 1 - e^{- {(x / λ)}^{α}},

f_{R} (x) = \frac{α}{λ} {(\frac{x}{λ})}^{α - 1} e^{- {(x / λ)}^{α}},

Q_{R} (p) = λ {[- log (1 - p)]}^{1 / α},

x > 0, α, λ > 0, 0 < p < 1 .

Let $Y$ be a standard logistic random variable with cdf, pdf and quantile function given respectively

F_{Y} (x) = \frac{1}{1 + e^{- x}},

f_{Y} (x) = \frac{e^{- x}}{{(1 + e^{- x})}^{2}},

\begin{matrix} Q_{Y} (p) = & log (\frac{p}{1 - p}), \\ - \infty \leq x \leq \infty, 0 < p < 1 . \end{matrix}

The cdf

F (x) = \int_{- \infty}^{Q_{Y} (F_{R} (x))} f_{T} (t) d t = F_{T} (Q_{Y} (F_{R} (x)))

is a valid cdf and from (1) we have the cdf of the 5-parameter exponentiated Gumbel–Weibull {logistic} (EGuWL) distribution given as

\begin{matrix} F (x) = & 1 - {\{1 - exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]\}}^{β}, \\ x, α, β, c, λ > 0, - \infty \leq k \leq \infty . \end{matrix}

The pdf corresponding to (2) is expressed as

\begin{matrix} f (x) = & \frac{α β e^{k / c}}{λ c} {(x / λ)}^{α - 1} e^{α} {(e^{α} - 1)}^{- 1 - 1 / c} exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}] \times \\ {\{1 - exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]\}}^{β - 1}, \\ x, α, β, c, λ > 0, - \infty \leq k \leq \infty, \end{matrix}

where the parameters $α, β, c$ and $k$ control the shape of the distribution and $λ$ is scale parameter. The graphs of the pdf in (3) are shown in Figs. 1, 2 and 3 for various combinations of parameter values. The quantile function corresponding to the cdf in (1) is given by

\begin{matrix} Q (p) = & λ {\{log [{(- e^{- k / c} log (1 - {(1 - p)}^{1 / β}))}^{- c} + 1]\}}^{1 / α}, \\ α, β, c, λ > 0, - \infty \leq k \leq \infty, 0 < p < 1 . \end{matrix}

Fig. 1 — EGuWL density showing skewness to the right

Fig. 2 — EGuWL density showing symmetry and bimodality

Fig. 3 — EGuWL density showing bimodality and left skewness

In Fig. 1, for fixed values of the parameters $λ$ and $k$ we observe that the EGuWL density is highly skewed to the right when the parameters $α, β and c$ are varied. In fact, for decreasing (increasing) values of parameter $α (parameter β)$ the density falls exponentially. This behavior shows that the EGuWL distribution can be very effective in fitting highly right-skewed data sets with possibility of outliers or reverse-J shaped data sets. In Fig. 2, for fixed values of $λ$ and $β$ and varied values of $α, c and k$ the EGuWL density can be bimodal and almost symmetric. For negative values of $k$ and increasing (decreasing) values parameter $α (parameter c)$ , the EGuWL density is bimodal and for non-negative values of the parameter $k$ and increasing (decreasing) values of parameter $α (parameter c)$ , the EGuWL density is almost symmetric. This highlights that the EGuWL distribution can be used for fitting bimodal and near symmetric data sets. In Fig. 3, the EGuWL density is also observed to possess left-skewness. In fact, for fixed values of $λ$ and $α$ the density is skewed to the left when the value of $β$ is decreasing and when the values of $k and c$ is increasing. This also shows that the EGuWL distribution can also be used to fit left-skewed data sets. Observe that in the Figs. 1, 2 and 3, the value of the parameter $λ$ is always fixed, this is because $λ$ is a scale parameter and its value does not affect the shape of the density.

Proposition 1:

Suppose $X$ is an EGuWL random variable and $U$ and $T$ are uniform random variable defined on (0, 1) and exponentiated Gumbel random variable respectively, then.

(i)
$X = λ {[log (e^{T} + 1)]}^{1 / α},$
(ii)
$X = λ {\{log [{(- e^{- k / c} log (1 - {(1 - U)}^{1 / β}))}^{- c} + 1]\}}^{1 / α} .$

Proof:

The proof of (i) and (ii) follow from (1) and (4) respectively. Proposition 1 is very useful for simulating random samples from the EGuWL distribution by first simulating from the exponentiated Gumbel distribution or the uniform distribution and applying the transformation accordingly. The relation in (i) can also be used to determine the moments of the EGuWL distribution.

Statistical Properties of the New Distribution

Here we present some essential statistical properties of the EGuWL distribution. A discussion on the hazard function is used to begin the section.

Hazard Function

The hazard function of the EGuWL distribution is expressed as

\begin{matrix} h (x) = & \frac{α β e^{k / c}}{λ c} {(x / λ)}^{α - 1} e^{α} {(e^{α} - 1)}^{- 1 - 1 / c} exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}] \times \\ {\{1 - exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]\}}^{- 1}, \\ x, α, β, c, λ > 0, - \infty \leq k \leq \infty . \end{matrix}

Figures 4, 5, 6 display the shape of the EGuWL hazard function for various combinations of parameter values. Figures 4, 5, 6 show that the EGuWL hazard can be decreasing, increasing and upside down bathtub. These results are very useful in lifetime data analysis.

Fig. 4 — EGuWL hazard showing decreasing and upside down bathtub shapes

Fig. 5 — EGuWL hazard showing increasing shapes

Fig. 6 — EGuWL hazard showing increasing shapes

Mode

Proposition 2:

The mode(s) of the EGuWL distribution is either at $x = 0$ or it will satisfy the equation.

w (x) = (β - 1) x {(e^{α} - 1)}^{} A (x) t (x),

where

A (x) = {\{1 - exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]\}}^{- 1},

w (x) = (α - 1) (e^{α} - 1) - α {(x / λ)}^{α} + (α / c) {(x / λ)}^{α} e^{α} [e^{k / c} {(e^{α} - 1)}^{- 1 / c} - 1],

t (x) = \frac{α e^{k / c}}{λ c} {(x / λ)}^{α - 1} e^{α} {(e^{α} - 1)}^{- 1 - 1 / c} exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}] .

Proof:

As observed from the graphs of the EGuWL density, the distribution can be both unimodal and bimodal. On differentiating the EGuWL density w.r.t $x$ , one obtains.

f^{'} (x) = β A (x) (1 - F (x)) t (x) \{w (x) {[x {(e^{α} - 1)}^{}]}^{- 1} - (β - 1) A (x) t (x)\} .

The derivative $f^{'} (x)$ does not exist when $x = 0$ . Other critical point(s) satisfy $f^{'} (x) = 0$ , hence the EGuWL distribution mode(s) will either be at $x = 0$ or it will satisfy the equation

w (x) = (β - 1) x {(e^{α} - 1)}^{} A (x) t (x) .

Remark 1:

Observe that the expression $w (x) {[x {(e^{α} - 1)}^{}]}^{- 1} - (β - 1) A (x) t (x)$ is a factor of $f^{'} (x)$ and has the same sign as $f^{'} (x)$ . Analytical solution of (6) for $x$ is not possible. However, (6) can be solved numerically in order to obtain the desired mode(s).

Moments

An expression for computing the $r th$ non-central moments of the EGuWL distribution can easily be obtained by making using of the relationship between the EGuWL random variable $X$ and the exponentiated Gumbel random variable $T$ as specified in Proposition 1(i). In particular, the relation $X = λ {[log (e^{T} + 1)]}^{1 / α}$ implies that

μ_{r}^{'} = E (X^{r}) = λ^{r} E \{{[log (e^{T} + 1)]}^{r / α}\} .

Since $X$ which is an EGuWL random variable is a transformed exponentiated Gumbel random variable $T$ following from proposition 1(i), its moments can be obtained as if one is obtaining the moments of the exponentiated Gumbel random variable $T$ hence the density function of the exponentiated Gumbel distribution will be used in obtaining the moments instead of the more complex density function of the EGuWL distribution and this is a major result in this paper. It follows that

μ_{r}^{'} = \frac{β λ^{r}}{c} \int_{- \infty}^{\infty} {[log (e^{t} + 1)]}^{r / α} e^{- (t - k) / c} exp (- e^{- (t - k) / c}) {[1 - exp (- e^{- (t - k) / c})]}^{β - 1} d t .

The $r$ th non-central moments of the EGuWL distribution are computed from the relation in (7). The mean $(μ)$ , variance $(σ^{2})$ , skewness $(S)$ and kurtosis $(K)$ of the EGuWL distribution are given respectively as

μ = μ_{1}^{'},

σ^{2} = μ^{2} + μ_{2}^{'} - 2 μ^{2},

S = \frac{μ_{3}^{,} - 3 μ μ_{2}^{'} + 2 μ^{2}}{{(μ_{2}^{'} - μ^{2})}^{3 / 2}}

K = \frac{μ_{4}^{,} - 4 μ μ_{3}^{'} + 6 μ^{2} μ_{2}^{,} - 3 μ^{4}}{{(μ_{2}^{,} - μ^{2})}^{2}} .

The quantile function can also be used in computing the skewness and kurtosis of a distribution, especially when such quantile function exists in a simple analytic form. Galton [23] proposed a quantile measure based approach for evaluating skewness while Moor [24] did the same for Kurtosis. Galton’s skewness and Moor’s kurtosis are evaluated using the relations

S = \frac{Q (6 / 8) - 2 Q (4 / 8) + Q (2 / 8)}{Q (6 / 8) - Q (2 / 8)},

K = \frac{Q (7 / 8) - Q (5 / 8) + Q (3 / 8) - Q (1 / 8)}{Q (6 / 8) - Q (2 / 8)} .

Since the quantile function of the EGuWL distribution exists in a simple analytic form as expressed in (4), the above expressions can be used in computing the skewness and kurtosis of the EGuWL distribution. 3-D plots of the Galton’s skewness and the Moore’s kurtosis of the EGuWL distribution for some selected parameters values are presented in Fig. 7.

Fig. 7 — Galton’s skewness (S) and Moore’s kurtosis (K) for the EGuWL distribution (k = 0, λ = 1, β = 0.5)

Entropy

Shannon [25] offered a probabilistic definition of entropy. The Shannon entropy $η_{X}$ of a random variable $X$ following a known probability distribution is a measure of variation of uncertainty.

Proposition 3:

The Shannon entropy of a random variable $X$ following the EGuWL distribution can be expressed as.

η_{X} = η_{T} - μ_{T} - log (α / λ) - Z (α, β, c, k),

where

$η_{T}$ and $μ_{T}$ are respectively the Shannon entropy and mean of the exponentiated Gumbel distribution,

\begin{matrix} Z (α, β, c, k) = & \frac{β}{c} \int_{- \infty}^{\infty} \{[2 log (e^{- t} + 1)] + [((α - 1) / α) log (log (e^{t} + 1))] - log (e^{t} + 1)\} \\ \times e^{- (t - k) / c} exp (- e^{- (t - k) / c}) {[1 - exp (- e^{- (t - k) / c})]}^{β - 1} d t . \end{matrix}

Proof:

For a random variable $X$ with density function $f (x)$ , the Shannon Entropy of $X$ is defined as.

η_{X} = E [- log (f (X))] .

The pdf $f (x)$ corresponding to the cdf $F (x)$ in (1) can be written as

f (x) = f_{R} (x) \frac{f_{T} (Q_{Y} (F_{R} (x)))}{f_{Y} (Q_{Y} (F_{R} (x)))},

and hence

f (X) = f_{R} (X) \frac{f_{T} (Q_{Y} (F_{R} (X)))}{f_{Y} (Q_{Y} (F_{R} (X)))} .

Observe that from (1), $t = Q_{Y} (F_{R} (x))$ and hence $T = Q_{Y} (F_{R} (X))$ . It follows that

f (X) = f_{R} (X) \frac{f_{T} (T)}{f_{Y} (T)}

and

η_{X} = E [- log (f (X))] = E [- log (f_{T} (T))] - E [log (f_{R} (X))] + E [log (f_{Y} (T))] .

It follows that

η_{X} = η_{T} - E (T) - 2 E [log (e^{- T} + 1)] - E [log (f_{R} (X))],

and consequently

η_{X} = η_{T} - μ_{T} - 2 E [log (e^{- T} + 1)] - E [log (f_{R} (X))] .

From Proposition 1(i) we have that $X = λ {[log (e^{T} + 1)]}^{1 / α}$ and thus

log (f_{R} (X)) = log (α / λ) + ((α - 1) / α) log [log (e^{T} + 1)] - log (e^{T} + 1) .

It follows that

E [log (f_{R} (X))] = log (α / λ) + ((α - 1) / α) E \{log [log (e^{T} + 1)]\} - E [log (e^{T} + 1)] .

Thus

\begin{matrix} η_{X} = & η_{T} - μ_{T} - log (α / λ) - 2 E [log (e^{- T} + 1)] - ((α - 1) / α) \\ E \{log [log (e^{T} + 1)]\} + E [log (e^{T} + 1)], \end{matrix}

where

E [log (e^{- T} + 1)] = \frac{β}{c} \int_{- \infty}^{\infty} log (e^{- t} + 1) e^{- (t - k) / c} exp (- e^{- (t - k) / c}) {[1 - exp (- e^{- (t - k) / c})]}^{β - 1} d t,

E [log (e^{T} + 1)] = \frac{β}{c} \int_{- \infty}^{\infty} log (e^{t} + 1) e^{- (t - k) / c} exp (- e^{- (t - k) / c}) {[1 - exp (- e^{- (t - k) / c})]}^{β - 1} d t,

E \{log [log (e^{T} + 1)]\} = \frac{β}{c} \int_{- \infty}^{\infty} log {[log (e^{t} + 1)]}^{} e^{- (t - k) / c} exp (- e^{- (t - k) / c}) {[1 - exp (- e^{- (t - k) / c})]}^{β - 1} d t,

The integrals in (9)–(11) exist because

|log [log (e^{t} + 1)]| \leq |log (e^{t} + 1)| \leq log 2 + t when t > 0,

|log [log (e^{t} + 1)]| \leq |log (e^{t} + 1)| \leq log 2 when t < 0,

|log [log (e^{t} + 1)]| \leq |log (e^{- t} + 1)| \leq log 2 + t when t < 0,

and

|log [log (e^{t} + 1)]| \leq |log (e^{- t} + 1)| \leq log 2 when t > 0 .

Hence

η_{X} = η_{T} - μ_{T} - log (α / λ) - Z (α, β, c, k)

where

\begin{matrix} Z (α, β, c, k) = & \frac{β}{c} \int_{- \infty}^{\infty} \{[2 log (e^{- t} + 1)] + [((α - 1) / α) log (log (e^{t} + 1))] - log (e^{t} + 1)\} \times \\ e^{- (t - k) / c} exp (- e^{- (t - k) / c}) {[1 - exp (- e^{- (t - k) / c})]}^{β - 1} d t . \end{matrix}

Remark 2:

It can be easily verified that.

η_{T} = log c - log β + γ + 1 - (β - 1) E \{log [1 - G (X)]\},

where $G (.)$ is the cdf of the Gumbel distribution, $γ = 0.57722$ is the Euler’s constant and

E \{log [1 - G (X)]\} = \frac{1}{c} \int_{- \infty}^{\infty} log [1 - G (x)] e^{- (x - k) / c} exp (- e^{- (x - k) / c}) d x .

An expression for $μ_{T}$ was given in [22] as

μ_{T} = Γ (β + 1) \sum_{n = 0}^{\infty} {(- 1)}^{n} \frac{[k + c γ + c log (n + 1)]}{(n + 1)! Γ (β - n)},

where $Γ (.)$ is the complete gamma function.

Estimation

Here the maximum likelihood method of estimation of parameters is presented for the estimation of the parameters of the EGuWL distribution.

Maximum Likelihood Method of Estimation of the Parameters of the EGuWL Distribution

For a complete random independent sample $x_{1}, x_{2}, \dots, x_{n}$ of size $n$ , the log-likelihood function of the EGuWL distribution is

\begin{matrix} L = & n (log α + log β + k / c - log k - log λ) + (α - 1) \sum_{i = 1}^{n} log (x_{i} / λ) + \sum_{i = 1}^{n} {(x_{i} / λ)}^{α} \\ + (- 1 - 1 / c) \sum_{i = 1}^{n} log (e^{α} - 1) - e^{k / c} \sum_{i = 1}^{n} {(e^{α} - 1)}^{- 1 / c} \\ + (β - 1) \sum_{i = 1}^{n} log \{1 - exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]\} . \end{matrix}

Suppose $Θ = {(α β c k λ)}^{T}$ be the unknown parameter vector, the associated score function is given by

U (Θ) = {(\frac{\partial L}{\partial α} \frac{\partial L}{\partial β} \frac{\partial L}{\partial c} \frac{\partial L}{\partial k} \frac{\partial L}{\partial λ})}^{T},

where $\frac{\partial L}{\partial α}, \frac{\partial L}{\partial β}, \frac{\partial L}{\partial c}, \frac{\partial L}{\partial k} and \frac{\partial L}{\partial λ}$ are the partial derivatives of the log-likelihood function w.r.t. to each parameter and are given by

\begin{matrix} \frac{\partial L}{\partial α} = & n / α + \sum_{i = 1}^{n} log (x_{i} / λ) + \sum_{i = 1}^{n} {(x_{i} / λ)}^{α} log (x_{i} / λ) + (- 1 - 1 / c) \sum_{i = 1}^{n} \frac{{(x_{i} / λ)}^{α} log (x_{i} / λ) e^{α}}{e^{α} - 1} \\ + e^{k / c} / c \sum_{i = 1}^{n} {(x_{i} / λ)}^{α} log (x_{i} / λ) e^{α} {(e^{α} - 1)}^{- 1 - 1 / c} \\ - \frac{(β - 1) e^{k / c}}{c} \sum_{i = 1}^{n} \frac{{(x_{i} / λ)}^{α} log (x_{i} / λ) e^{α} {(e^{α} - 1)}^{- 1 - 1 / c} exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]}{1 - exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]}, \end{matrix}

\frac{\partial L}{\partial β} = n / β + \sum_{i = 1}^{n} log \{1 - exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]\},

\begin{matrix} \frac{\partial L}{\partial c} = - \frac{nk}{c^{2}} + \frac{1}{c^{2}} \sum_{i = 1}^{n} log (e^{α} - 1) \\ - \frac{e^{k / c}}{c^{2}} \sum_{i = 1}^{n} [{(e^{α} - 1)}^{- 1 / c} log (e^{α} - 1) - k {(e^{α} - 1)}^{- 1 / c}] \\ + \frac{(β - 1) e^{k / c}}{c^{2}} \\ \times \sum_{i = 1}^{n} \frac{[{(e^{α} - 1)}^{- 1 / c} log (e^{α} - 1) - k {(e^{α} - 1)}^{- 1 / c}] exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]}{1 - exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]}, \end{matrix}

\frac{\partial L}{\partial k} = \frac{n}{c} - \frac{n}{k} - \frac{e^{k / c}}{c} \sum_{i = 1}^{n} {(e^{α} - 1)}^{- 1 / c} + \frac{(β - 1) e^{k / c}}{c} \sum_{i = 1}^{n} \frac{{(e^{α} - 1)}^{- 1 / c} exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]}{1 - exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]},

\begin{matrix} \frac{\partial L}{\partial λ} = & - \frac{n}{λ} - \frac{n (α - 1)}{λ} - \frac{α}{λ^{2}} \sum_{i = 1}^{n} x_{i} {(x_{i} / λ)}^{α - 1} - \frac{α (- 1 - 1 / c)}{λ^{2}} \sum_{i = 1}^{n} \frac{x_{i} {(x_{i} / λ)}^{α - 1} e^{α}}{e^{α} - 1} \\ - \frac{α e^{k / c}}{λ^{2} c} \sum_{i = 1}^{n} x_{i} {(x_{i} / λ)}^{α - 1} e^{α} {(e^{α} - 1)}^{- 1 - 1 / c} \\ + \frac{α (β - 1) e^{k / c}}{λ^{2} c} \\ \times \sum_{i = 1}^{n} \frac{x_{i} {(x_{i} / λ)}^{α - 1} e^{α} {(e^{α} - 1)}^{- 1 - 1 / c} exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]}{1 - exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}]} . \end{matrix}

The maximum likelihood estimate of $Θ$ is obtained by solving the non-linear systems of equations $U (Θ) = 0$ . Since the resulting systems of equations are not in closed form, the solutions can be found numerically using any of the Newton’s type algorithms.

The Fisher information matrix (FIM) of the EGuWL distribution is the $5 \times 5$ symmetric matrix given by

I (Θ) = - E_{Θ} [\begin{matrix} \begin{matrix} I_{α α} I_{α β} I_{α c} I_{α k} I_{α λ} \end{matrix} \\ I_{β α} I_{β β} I_{β c} I_{β k} I_{β λ} \\ I_{c α} I_{c β} I_{cc} I_{ck} I_{c λ} \\ I_{k α} I_{k β} I_{kc} I_{kk} I_{k λ} \\ I_{λ α} I_{λ β} I_{λ c} I_{λ k} I_{λ λ} \end{matrix}],

where the elements $I_{ij} (Θ) = [\frac{\partial^{2} L}{\partial Θ_{i} \partial Θ_{j}}] .$ Thus, the elements of the FIM can be obtained by realizing the second order partial derivatives of the log-likelihood function w.r.t. to the parameters. These elements can be numerically obtained by using the R software. The total FIM, $I (Θ),$ can be approximated by

J (\hat{Θ}) \approx {[- {(\frac{\partial^{2} L}{\partial Θ_{i} \partial Θ_{j}}|}_{Θ = \hat{Θ}}]}_{5 \times 5} .

For real data, $J (\hat{Θ})$ is obtained after the maximum likelihood estimate of $Θ$ is gotten, which implies the convergence of the iterative numerical procedure involved in finding such estimate.

Suppose $\hat{Θ}$ is the maximum likelihood estimate of $Θ$ . Under the usual regularity conditions and that the parameters are in the interior of the parameter space, but not on the boundary, we have: $\sqrt{n} (\hat{Θ} - Θ) \to^{d} N_{5} (0, I^{- 1} (Θ)),$ where $I^{- 1} (Θ)$ is the inverse of the expected FIM, which also corresponds to the variance–covariance matrix of the parameters. The asymptotic behavior is still valid if $I^{- 1} (Θ)$ is replaced by the inverse of the observed information matrix evaluated at $\hat{Θ},$ that is $J^{- 1} (\hat{Θ})$ . The multivariate normal distribution with mean vector $0 = {(00000)}^{T}$ and covariance matrix $I^{- 1} (Θ)$ can be used to construct confidence intervals for the EGuWL parameters. The approximate $100 (1 - ω) %$ two-sided confidence interval for the parameters $α, β, c, k and λ$ are given by

\hat{α} \pm Z_{ω / 2} \sqrt{^{} I_{α α}^{- 1} (\hat{Θ})}, \hat{β} \pm Z_{ω / 2} \sqrt{^{} I_{β β}^{- 1} (\hat{Θ})}, \hat{c} \pm Z_{ω / 2} \sqrt{^{} I_{cc}^{- 1} (\hat{Θ}),}

\hat{k} \pm Z_{ω / 2} \sqrt{^{} I_{kk}^{- 1} (\hat{Θ}),} \hat{λ} \pm Z_{ω / 2} \sqrt{^{} I_{λ λ}^{- 1} (\hat{Θ})},

respectively, where $I_{α α}^{- 1} (\hat{Θ}), I_{β β}^{- 1} (\hat{Θ}), I_{cc}^{- 1} (\hat{Θ}), I_{kk}^{- 1} (\hat{Θ}) and I_{λ λ}^{- 1} (\hat{Θ})$ are diagonal elements of $I^{- 1} (\hat{Θ})$ and $Z_{ω / 2}$ is the upper ${(ω / 2)}^{th}$ percentile of a standard normal distribution.

Monte Carlo Simulations

Here we conduct a Monte Carlo simulations study to assess the performance and efficiency of the maximum likelihood estimators of the parameters of the EGuWL distribution. The performance of the maximum likelihood estimators are examined for different sample sizes and different combinations of parameter values. The simulation is repeated for $N = 5000$ times using the sample sizes $n = 25, 80, 150, 400, 800 and 1500$ and parameter combination values $I : α = 5, β = 1.5, c = 4, k = - 2, λ = 1.5$ and $I I : α = 2, β = 1.5, c = 2.5, k = 2, λ = 3$ Random samples are simulated from the EGuWL distribution using Proposition 1(i) and five quantities are computed in the simulations and these include:

Mean estimates (ME) of the maximum likelihood estimator of the parameter $Θ = (α β c k λ)$ where
$ME = \frac{1}{N} \sum_{i = 1}^{N} \hat{Θ} ;$
Average bias (AVB) of the maximum likelihood estimator of the parameter $Θ = (α β c k λ)$ where
$AVB = \frac{1}{N} \sum_{i = 1}^{N} (\hat{Θ} - Θ) ;$
Root mean squared error (RSME) of the maximum likelihood estimator of the parameter

$Θ = (α β c k λ)$ where
$RMSE = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(\hat{Θ} - Θ)}^{2} ;}$
Coverage probability (CP) of 95% confidence intervals of the parameters $Θ = (α β c k λ)$ i.e., the percentage of intervals that contain the true value of parameter $Θ ;$
Average width (AW) of 95% confidence intervals of the parameter $Θ = (α β c k λ)$ .

Tables 1 and 2 contain the results for the quantities ME, AVB, RMSE, AW and CP. In Tables 1 and 2, it can be observed that ME of all the parameters reduce as the sample size increases and moves toward their true values. The AVB of all the parameters are all positive and reduce as the sample size increases. The RMSE and the AW of all the parameters also reduce as the sample size increases.

Table 1.

Results of Monte Carlo simulations $α = 5, β = 1.5, c = 4, k = - 2, λ = 1.5$

Parameter	Sample size	ME	AVB	RMSE	AW	CP
$β$	n = 25	2.3662	2.8721	9.5568	92.1923	0.92
	n = 80	1.9902	2.8001	9.2332	43.7943	0.94
	n = 150	1.8189	1.6123	4.9977	19.0392	0.92
	n = 400	1.7231	0.6956	1.8794	6.2785	0.95
	n = 800	1.6070	0.2885	0.8791	3.1967	0.96
	n = 1500	1.5158	0.1722	0.5319	2.1018	0.95
$α$	n = 25	6.6151	1.6151	2.8147	22.4692	0.99
	n = 80	5.9234	0.9234	2.1205	11.6721	0.98
	n = 150	5.6044	0.6044	1.7176	7.9471	0.97
	n = 400	5.3479	0.3479	1.0629	4.2522	0.97
	n = 800	5.1483	0.1483	0.6956	2.7024	0.97
	n = 1500	5.0368	0.0368	0.4752	1.8867	0.96
$λ$	n = 25	1.5495	0.0495	0.2718	2.2593	0.99
	n = 80	1.5594	0.0594	0.2367	1.3777	0.97
	n = 150	1.5397	0.0397	0.2124	1.0284	0.95
	n = 400	1.5396	0.0396	0.1434	0.5947	0.94
	n = 800	1.5163	0.0163	0.1006	0.4082	0.94
	n = 1500	1.5080	0.0081	0.0734	0.2959	0.95
$k$	n = 25	− 0.5616	1.4384	6.9987	61.4781	1
	n = 80	− 0.6047	1.8336	5.4006	28.2781	1
	n = 150	− 0.6947	1.3053	3.8439	17.0276	0.99
	n = 400	− 1.3415	0.6585	2.0352	7.9190	0.98
	n = 800	− 1.6746	0.3254	1.1400	4.7119	0.99
	n = 1500	− 1.7674	0.2326	0.7742	3.2086	0.95
$c$	n = 25	6.0365	2.0365	5.2700	41.5927	0.98
	n = 80	5.7615	1.7615	4.2515	21.6274	0.97
	n = 150	5.2363	1.2363	3.2890	14.0813	0.95
	n = 400	4.7095	0.7095	1.9105	7.0457	0.98
	n = 800	4.3271	0.3271	1.1555	4.2893	0.97
	n = 1500	4.1631	0.1631	0.7403	2.9448	0.96

Open in a new tab

Table 2.

Results of Monte Carlo simulations $α = 2, β = 1.5, c = 2.5, k = 2, λ = 3$

Parameter	Sample size	ME	AVB	RMSE	AW	CP
$β$	n = 25	2.9043	28.2944	33.9312	398.123	0.97
	n = 80	2.8575	19.1266	26.1717	333.010	0.99
	n = 150	2.8018	14.6778	20.2345	271.750	0.99
	n = 400	2.2214	5.9548	18.6956	99.3318	0.99
	n = 800	2.0757	3.6372	10.4671	55.8275	1
	n = 1500	2.0091	2.3331	5.7627	28.5450	0.99
$α$	n = 25	5.3867	3.3870	4.5350	18.6944	1
	n = 80	4.0405	2.0405	3.2464	10.4061	0.97
	n = 150	3.1945	1.1945	2.3676	7.1001	0.97
	n = 400	2.2856	0.2856	0.8372	2.9608	0.96
	n = 800	2.0716	0.0716	0.4053	1.8213	0.98
	n = 1500	2.0002	0.0002	0.2412	1.2770	0.98
$λ$	n = 25	4.8666	1.8660	2.5276	10.7823	0.77
	n = 80	4.7721	1.7721	2.4828	9.6358	0.78
	n = 150	4.4782	1.4783	2.2842	9.1945	0.95
	n = 400	3.8396	0.8396	1.5297	6.9524	0.95
	n = 800	3.5479	0.5479	1.0833	5.0864	0.97
	n = 1500	3.4106	0.4106	0.8283	3.5243	0.99
$k$	n = 25	4.0900	2.0900	8.7545	90.8129	0.98
	n = 80	4.6124	2.6123	7.9091	49.9287	0.97
	n = 150	4.5187	2.5187	7.5799	36.1367	0.86
	n = 400	2.8963	0.8963	2.8955	14.2318	0.99
	n = 800	2.5846	0.5846	1.6062	8.4314	0.99
	n = 1500	2.4372	0.4373	1.0703	5.5644	0.99
$c$	n = 25	6.2989	3.7989	7.0401	47.6068	0.99
	n = 80	5.5586	3.0586	6.0645	27.1930	0.99
	n = 150	4.7446	2.2446	5.2725	18.9548	0.99
	n = 400	3.2178	0.7178	1.9232	6.8753	1
	n = 800	2.8741	0.3741	0.9428	3.7113	1
	n = 1500	2.7351	0.2351	0.5228	2.3270	0.99

Open in a new tab

Remark 3

The simulations was also conducted for other sets of combination of parameter values namely $α = 4, β = 4.5, c = 2, k = 0, λ = 0.5$ , $α = 2.5, β = 3, c = 5, k = - 5, λ = 1$ and $α = 4, β = 2.5, c = 1.5, k = 5, λ = 2.5$ and the results followed similar pattern as obtained in Tables 1 and 2. To conserve space, they are not reported.

Applications

The EGuWL distribution will be applied to fit the daily number of reported infections from the COVID-19 pandemic in Nigeria. Five other data sets will also be used to demonstrate its flexibility. The fit of the EGuWL distribution will be compared with those of other models in its class.

(i)
Application to Nigeria’s COVID—19 data

For the first application, the EGuWL is used to fit the daily number of reported infections from the COVID-19 pandemic in Nigeria for a seven months period (20th March–19th October, 2020). The data set was obtained from the website of the National Center for Disease Control (NCDC) at http://covid19.ncdc.gov.ng/. The data set is unimodal, right-skewed and platykurtic (skewness = 0.4671, excess kurtosis = − 0.8916). The data set is contained in Table 3.

The Weibull (W), exponentiated Gumbel (EGu) [22], the beta exponential (BE) [26], the beta generalized exponential (BGE) [27] and the Gumbe Weibull {logistic} (GuWL) [28] distributions are also used to fit the data and their fits are compared with that of the EGuWL distribution. The BE, BGE and GuWL densities are given respectively by

f_{BE} (x) = \frac{Γ (a + b)}{Γ (a) Γ (b)} λ e^{- b λ x^{}} {(1 - e^{- λ x^{}})}^{a - 1}, x, a, b, λ, > 0,

\begin{matrix} f_{BGE} (x) & = \frac{β Γ (a + b)}{λ Γ (a) Γ (b)} e^{- x / λ^{}} {(1 - e^{- x / λ})}^{a β - 1} {[1 - {(1 - e^{- x / λ^{}})}^{β}]}^{b - 1}, \\ x, a, b, β, λ, > 0, \end{matrix}

\begin{matrix} f_{GuWL} (x) = & \frac{α e^{k / c}}{λ c} {(x / λ)}^{α - 1} e^{α} {(e^{α} - 1)}^{- 1 - 1 / c} \\ exp [- e^{k / c} {(e^{α} - 1)}^{- 1 / c}], x, α, c, λ, > 0, - \infty < k < \infty . \end{matrix}

The results from fitting the COVID-19 data which include the estimate of the parameters, the standard errors of these estimated parameters, the loglikelihood (loglik) values, the Akaike Information Criterion (AIC) values and the Kolmogorov–Smirnov (K–S) statistic values (the corresponding p values are also reported) of all the fitted distributions are reported in Table 4. Figure 8 shows the graph of all the fitted densities alongside the histogram of the data. The results in Table 4 clearly show that the EGuWL distribution provided the best fit for the data by possessing the smallest AIC value as well as the highest p value of the K–S statistic.

Table 3.

Daily number of infections from COVID-19 (20th March–19th October, 2020)

4, 4, 10, 8, 10, 4, 7, 14, 5, 19, 22, 20, 8, 35, 10, 25, 5, 18, 6, 16, 22, 14, 17, 13, 5, 20, 30, 34, 35, 51, 48, 86, 38, 117, 91, 108, 114, 87, 91, 64, 195, 196, 204, 238, 220, 170, 245,148, 195, 381, 386, 239, 248, 242, 146, 184, 193, 288, 176, 388, 216, 226, 284, 339, 245, 265, 313, 229, 276, 389, 182, 387, 553, 307, 416, 241, 348, 350, 328, 389, 260, 315, 663, 409, 681, 627, 501, 403, 573, 490, 587, 745, 667, 661, 436, 675, 452, 649, 594, 684, 779, 490, 566, 561, 790, 626, 454, 603, 544, 575, 503, 460, 499, 575, 664, 571, 595, 463, 643, 595, 600, 653, 556, 562, 576, 543, 604, 591, 438, 555, 648, 624, 404, 481, 462, 386, 304, 288, 304, 457, 354, 443, 453, 437, 290, 423, 453, 373, 329, 325, 298, 417, 410, 593, 476, 340, 601, 322, 321, 252, 221, 296, 160, 250, 138, 143, 239, 216, 125, 162, 100, 155, 296, 176, 197, 188, 160, 79, 132, 90, 126, 131, 221, 189, 97, 195, 176, 111, 125, 213, 136, 126, 136, 187, 201, 153, 126, 160, 58, 120, 118, 155, 103, 151, 111, 163, 164, 225, 179, 148, 212, 113, 133, 118

Open in a new tab

Table 4.

Maximum likelihood fit of the COVID-19 data

Distribution	W	BE	EGu	BGE	GuWL	EGuWL
Parameter estimates	$\begin{matrix} \hat{α} = & 1.2085 \\ (0.0693) \end{matrix}$	$\begin{matrix} \hat{a} = & 0.9643 \\ (0.1632) \end{matrix}$	$\begin{matrix} \hat{\hat{β}} = & 0.0773 \\ (0.0053) \end{matrix}$	$\begin{matrix} \hat{a} = & 3.5162 \\ (50.047) \end{matrix}$	$\begin{matrix} \hat{a} = & 1.6014 \\ (0.0943) \end{matrix}$	$\begin{matrix} \hat{a} = & 1.6073 \\ (0.4738) \end{matrix}$
	$\begin{matrix} \hat{λ} = & 303.53 \\ (17.926) \end{matrix}$	$\begin{matrix} \hat{b} = & 0.0809 \\ (0.0174) \end{matrix}$	$\begin{matrix} \hat{c} = & 21.440 \\ (0.0020) \end{matrix}$	$\begin{matrix} \hat{b} = & 0.8909 \\ (1.9056) \end{matrix}$	$\begin{matrix} \hat{c} = & 5.0092 \\ (0.6222) \end{matrix}$	$\begin{matrix} \hat{\hat{β}} = & 24.317 \\ (34.992) \end{matrix}$
		$\begin{matrix} \hat{λ} = & 0.0430 \\ (0.0087) \end{matrix}$	$\begin{matrix} \hat{k} = & 11.008 \\ (0.0573) \end{matrix}$	$\begin{matrix} \hat{\hat{β}} = & 0.3231 \\ (4.7859) \end{matrix}$	$\begin{matrix} \hat{k} = & 1.9668 \\ (0.4694) \end{matrix}$	$\begin{matrix} \hat{c} = & 10.240 \\ (3.4696) \end{matrix}$
				$\begin{matrix} \hat{λ} = & 234.86 \\ (488.64) \end{matrix}$	$\begin{matrix} \hat{λ} = & 114.45 \\ (3.7350) \end{matrix}$	$\begin{matrix} \hat{k} = & 14.289 \\ (7.4607) \end{matrix}$
						$\begin{matrix} \hat{λ} = & 205.27 \\ (79.159) \end{matrix}$
Log Likelihood	$- 1420.29$	$- 1425.26$	$- 14.32 . 10$	$- 1423.82$	$- 1405.78$	$- 1403.23$
AIC	$2844.58$	$2856.52$	$2870.20$	$2855.64$	$2819.56$	$2816.46$
K-S p-value	$0.0738$ $0.1845$	$0.1175$ $0.0050$	$0.0973$ $0.0324$	$0.0848$ $0.0872$	$0.0629$ $0.351$	$0.0622$ $0.3636$

Open in a new tab

(Standard error of estimates in parenthesis)

Fig. 8 — Graph of the fitted densities for the COVID-19 data

(ii)
Application to Aluminum Coupons data

For the second application, the EGuWL distribution is used to fit the fatigue time of 101 6061-T6 Aluminum Coupons cut parallel to the direction of rolling and oscillated at 18 cycles per second (cps). The data set was reported in [29] and presented in Table 5. The data set is unimodal, right-skewed and leptokurtic (Skewness = 0.3355 and excess kurtosis = 1.1687). The beta normal (BN) [6], the beta Weibull (BW) [30], the beta Burr XII (BBXII) [31], Gumbel–Burr XII {logistic} (GuBXIIL) [32] and the GuWL distributions are also used to fit the data set and their fits are compared with that of the EGuWL distribution. The BN, BW, BBXII and the GuBXIIL densities are given respectively by

\begin{matrix} f_{BN} (x) = & \frac{Γ (a + b)}{c Γ (a) Γ (b)} ϕ (\frac{x - k}{c}) {[Φ (\frac{x - k}{c})]}^{a - 1} {[1 - Φ (\frac{x - k}{c})]}^{b - 1}, \\ a, b, c > 0, - \infty < x, k < \infty, \end{matrix}

Table 5.

Fatigue time of 101 6061-T6 Aluminum Coupons

70, 90, 96, 97, 99, 100, 103, 104,104,105,107,108, 108, 108,109, 109, 112, 112,113, 114, 114, 114, 116, 119, 120, 120,120, 121, 121, 123, 124, 124, 124, 124, 124, 128, 128, 129,129, 130, 130, 130, 131, 131, 131, 131, 131, 132, 132, 132,133, 134, 134, 134, 134, 134, 136, 136, 137, 138, 138, 138,139, 139, 141, 141, 142, 142, 142, 142, 142, 142, 144, 144,145, 146, 148, 148, 149, 151, 151, 152, 155, 156, 157, 157,157, 157, 158, 159, 162, 163, 163, 164, 166, 166, 168, 170,174, 196, 212

Open in a new tab

$ϕ (.)$ and $Φ (.)$ are the pdf and cdf of the normal distribution respectively,

\begin{matrix} f_{BW} (x) = & \frac{α Γ (a + b)}{λ Γ (a) Γ (b)} {(x / λ)}^{α - 1} e^{- b {(x / λ)}^{α}} {(1 - e^{- {(x / λ)}^{α}})}^{a - 1}, \\ x, α, a, b, λ > 0, \end{matrix}

\begin{matrix} f_{BBXII} (x) = & \frac{α c Γ (a + b)}{λ Γ (a) Γ (b)} {(x / λ)}^{α - 1} {[1 + {(x / λ)}^{α}]}^{- b c - 1} {\{1 - {[1 + {(x / λ)}^{α}]}^{- c}\}}^{a - 1}, \\ x, α, a, b, c, λ > 0, \end{matrix}

\begin{matrix} f_{GuBXIIL} (x) = & \frac{α β e^{k / c}}{λ c} {(x / λ)}^{α - 1} {(1 + {(x / λ)}^{α})}^{β - 1} {[{(1 + {(x / λ)}^{α})}^{β} - 1]}^{- 1 - 1 / c} \\ exp \{- e^{k / c} {[{(1 + {(x / λ)}^{α})}^{β} - 1]}^{- 1 / c}\}, \\ x, α, β, c, λ > 0, - \infty < k < \infty . \end{matrix}

The results from fitting the Aluminum Coupons which include the estimate of the parameters, the standard errors of these estimated parameters, the loglikelihood (loglik) values, the Akaike Information Criterion (AIC) values and the Kolmogorov–Smirnov (K–S) statistic values (the corresponding p values are also reported) of all the fitted distributions are reported in Table 6. Figure 9 shows the graph of all the fitted densities alongside the histogram of the data. The results in Table 6 clearly show that the EGuWL distribution provided the best fit for the data by possessing the smallest AIC value as well as the highest p value of the K–S statistic.

(iii)
Application to the Kevlar 49/epoxy strands failure times data (pressure at 70%)

Table 6.

Maximum likelihood fit of the Aluminum Coupons

Distribution	BW	BN	GuWL	BBXII	GuBXIIL	EGuWL
Parameter estimates	$\begin{matrix} \hat{a} = & 6.2469 \\ (6.6825) \end{matrix}$	$\begin{matrix} \hat{a} = & 8.1285 \\ (30.488) \end{matrix}$	$\begin{matrix} \hat{α} = & 2.2590 \\ (0.0028) \end{matrix}$	$\begin{matrix} \hat{a} = & 124.92 \\ (197.607) \end{matrix}$	$\begin{matrix} \hat{α} = & 1.0580 \\ (0.5606) \end{matrix}$	$\begin{matrix} \hat{α} = & 2.0623 \\ (1.4468) \end{matrix}$
	$\begin{matrix} \hat{b} = & 1.4300 \\ (2.6866) \end{matrix}$	$\begin{matrix} \hat{b} = & 1.6931 \\ (3.4807) \end{matrix}$	$\begin{matrix} \hat{c} = & 4.0733 \\ (0.3075) \end{matrix}$	$\begin{matrix} \hat{b} = & 52.778 \\ (50.597) \end{matrix}$	$\begin{matrix} \hat{β} = & 2.2834 \\ (2.3011) \end{matrix}$	$\begin{matrix} \hat{β} = & 1.7546 \\ (2.7663) \end{matrix}$
	$\begin{matrix} \hat{α} = & 2.7639 \\ (1.7179) \end{matrix}$	$\begin{matrix} \hat{c} = & 44.2964 \\ (64.704) \end{matrix}$	$\begin{matrix} \hat{k} = & 11.025 \\ (0.4289) \end{matrix}$	$\begin{matrix} \hat{c} = & 0.7183 \\ (0.5679) \end{matrix}$	$\begin{matrix} \hat{c} = & 0.4774 \\ (0.4626) \end{matrix}$	$\begin{matrix} \hat{c} = & 3.7812 \\ (5.4660) \end{matrix}$
	$\begin{matrix} \hat{λ} = & 106.53 \\ (24.282) \end{matrix}$	$\begin{matrix} \hat{c} = & 86.946 \\ (106.44) \end{matrix}$	$\begin{matrix} \hat{λ} = & 43.216 \\ (0.0028) \end{matrix}$	$\begin{matrix} \hat{α} = & 1.1465 \\ (0.6420) \end{matrix}$	$\begin{matrix} \hat{k} = & 17.6749 \\ (17.136) \end{matrix}$	$\begin{matrix} \hat{k} = & 9.9375 \\ (11.154) \end{matrix}$
				$\begin{matrix} \hat{λ} = & 35.835 \\ (54.408) \end{matrix}$	$\begin{matrix} \hat{λ} = & 12.4511 \\ (5.9684) \end{matrix}$	$\begin{matrix} \hat{λ} = & 44.614 \\ (2.1010) \end{matrix}$
Log Likelihood	− 456.67	− 456.88	− 456.61	− 457.90	− 475.18	− 455.59
AIC	921.34	921.75	921.21	925.80	960.35	921.18
K–S	0.0654	0.0647	0.0750	0.0913	0.1329	0.0611
p value	0.7550	0.7673	0.5936	0.3482	0.0514	0.8222

Open in a new tab

(Standard error of estimates in parenthesis)

Fig. 9 — Graph of the fitted densities for the Aluminum Coupons data

For the third application, the EGuWL distribution is used to fit the Kevlar 49/epoxy strands failure times data (pressure at 70%). The data set was reported in [28]. The data set is multimodal, platykurtic, and approximately symmetric. (skewness = 0.0998, excess kurtosis = − 0.79). The data set is presented in Table 7. The BN, BW, GuWL, beta exponentiated Weibull (BEW) [33] and the Gumbel–Weibull {logistic} Poisson (GuWLP) [12] distributions are also used to fit the data set and their fits are compared with that of the EGuWL distribution. The BEW and the GuWLP densities are given respectively by

\begin{matrix} f_{BEW} (x) = & \frac{α β Γ (a + b)}{λ Γ (a) Γ (b)} {(x / λ)}^{α - 1} e^{- {(x / λ)}^{α}} {(1 - e^{- {(x / λ)}^{α}})}^{a β - 1} \\ \times {[1 - {(1 - e^{- {(x / λ)}^{α}})}^{β}]}^{b - 1}, x, a, α, b, β, λ, > 0, \end{matrix}

\begin{matrix} f_{GuWLP} (x) = \frac{β α e^{k / c}}{λ c (e^{β} - 1)} {(x / λ)}^{α - 1} e^{α} {(e^{α} - 1)}^{- 1 - 1 / c} \\ \times exp \{- e^{k / c} {(e^{α} - 1)}^{- 1 / c}\} \\ \times exp \{β [1 - exp (- e^{k / c} {(e^{α} - 1)}^{- 1 / c})]\}, x, α, λ, c > 0, β, k \in R . \end{matrix}

Table 7.

Kevlar 49/epoxy strands failure times data (pressure at 70%)

1051, 1337, 1389, 1921, 1942, 2322, 3629, 4006, 4012, 4063, 4921, 5445, 5620, 5817, 5905, 5956, 6068, 6121, 6473, 7501, 7886, 8108, 8546, 8666, 8831, 9106, 9711, 9806, 10,205, 10,396, 10,861, 11,026, 11,214, 11,362, 11,604, 11,608, 11,745, 11,762, 11,895, 12,044, 13,520, 13,670, 14,110, 14,496, 15,395, 16,179, 17,092, 17,568, 17,568

Open in a new tab

The results from fitting the Kevlar 49/epoxy strands failure times data (pressure at 70%) which include the estimate of the parameters, the standard errors of these estimated parameters, the loglikelihood (loglik) values, the Akaike Information Criterion (AIC) values and the Kolmogorov–Smirnov (K–S) statistic values (the corresponding p values are also reported) of all the fitted distributions are reported in Table 8. Figure 10 shows the graph of all the fitted densities alongside the histogram of the data. The results in Table 8 clearly show that the EGuWL distribution provided the best fit for the data by possessing the smallest AIC value as well as the highest p value of the K -S statistic.

(iv)
Application to the Kevlar 49/epoxy strands failure times data (pressure at 90%)

Table 8.

Maximum likelihood fit of the Kevlar 49/epoxy strands failure times data (pressure at 70%)

Distribution	BW	BN	GuWL	BEW	GuWLP	EGuWL
Parameter estimates	$\begin{matrix} \hat{a} = & 0.4877 \\ (0.1222) \end{matrix}$	$\begin{matrix} \hat{a} = & 0.1150 \\ (0.1489) \end{matrix}$	$\begin{matrix} \hat{α} = & 2.6741 \\ (0.3582) \end{matrix}$	$\begin{matrix} \hat{a} = & 7.9008 \\ (75.977) \end{matrix}$	$\begin{matrix} \hat{α} = & 2.0438 \\ (0.3801) \end{matrix}$	$\begin{matrix} \hat{α} = & 2.6093 \\ (0.0127) \end{matrix}$
	$\begin{matrix} \hat{b} = & 0.1183 \\ (0.0189) \end{matrix}$	$\begin{matrix} \hat{b} = & 0.0806 \\ (0.1068) \end{matrix}$	$\begin{matrix} \hat{c} = & 4.1036 \\ (1.0343) \end{matrix}$	$\begin{matrix} \hat{b} = & 0.1498 \\ (0.0479) \end{matrix}$	$\begin{matrix} \hat{β} = & - 0.606 \\ (2.5157) \end{matrix}$	$\begin{matrix} \hat{β} = & 0.2237 \\ (0.1811) \end{matrix}$
	$\begin{matrix} \hat{α} = & 2.6980 \\ (0.0423) \end{matrix}$	$\begin{matrix} \hat{c} = & 1087.1 \\ (794.87) \end{matrix}$	$\begin{matrix} \hat{k} = & 1.1546 \\ (0.7454) \end{matrix}$	$\begin{matrix} \hat{β} = & 0.0194 \\ (0.4225) \end{matrix}$	$\begin{matrix} \hat{c} = & 3.6367 \\ (1.9862) \end{matrix}$	$\begin{matrix} \hat{c} = & 2.1905 \\ (1.4673) \end{matrix}$
	$\begin{matrix} \hat{λ} = & 5002.4 \\ (0.0509) \end{matrix}$	$\begin{matrix} \hat{k} = & 7796.1 \\ (1390.6) \end{matrix}$	$\begin{matrix} \hat{λ} = & 6116.9 \\ (246.5) \end{matrix}$	$\begin{matrix} \hat{α} = & 2.4884 \\ (0.2836) \end{matrix}$	$\begin{matrix} \hat{k} = & 1.6822 \\ (3.1542) \end{matrix}$	$\begin{matrix} \hat{k} = & - 1.558 \\ (1.7711) \end{matrix}$
				$\begin{matrix} \hat{λ} = & 5000.0 \\ (0.3352) \end{matrix}$	$\begin{matrix} \hat{λ} = & 4555.4 \\ (69.076) \end{matrix}$	$\begin{matrix} \hat{λ} = & 4611.9 \\ (0.04750) \end{matrix}$
Log Likelihood	479.49	− 480.41	− 479.49	− 480.0	− 478.86	− 478.40$
AIC	966.97	968.81	966.97	970.0	960.35	966.80
K–S	0.0764	0.0832	0.0742	0.0755	0.0701	0.0607
p value	0.9165	0.8590	0.9316	0.9227	0.9556	0.9888

Open in a new tab

(Standard error of estimates in parenthesis)

Fig. 10 — Graph of the fitted densities for the Kevlar 49/epoxy strands failure times data (pressure at 70%)

For the fourth application, the EGuWL distribution is used to fit the Kevlar 49/epoxy strands failure times data (pressure at 90%). The data set was reported in [28]. The data set is unimodal, leptokurtic, and highly skewed to the right (reverse J-shape) (skewness = 3.0472, excess kurtosis = 14.4745). The data set is presented in Table 9. The BN, BW, GuWL, exponentiated Weibull (EW) [5] and the GuWLP distributions are also used to fit the data set and their fits are compared with that of the EGuWL distribution. The EW density is given by

f_{EW} (x) = \frac{α β}{λ} {(x / λ)}^{α - 1} e^{- {(x / λ)}^{α}} {(1 - e^{- {(x / λ)}^{α}})}^{β - 1}, x, α, β, λ, > 0 .

Table 9.

Kevlar 49/epoxy strands failure times data (pressure at 90%)

0.01, 0.01, 0.02, 0.02, 0.02, 0.03, 0.03, 0.04, 0.05, 0.06, 0.07, 0.07, 0.08, 0.09, 0.09, 0.10, 0.10, 0.11, 0.11, 0.12, 0.13, 0.18, 0.19, 0.20, 0.23, 0.24, 0.24, 0.29, 0.34, 0.35, 0.36, 0.38, 0.40, 0.42, 0.43, 0.52, 0.54, 0.56, 0.60, 0.60, 0.63, 0.65, 0.67, 0.68, 0.72, 0.72, 0.72, 0.73, 0.79, 0.79, 0.80, 0.80, 0.83, 0.85, 0.90, 0.92, 0.95, 0.99, 1.00, 1.01, 1.02, 1.03, 1.05, 1.10, 1.10, 1.11, 1.15, 1.18, 1.20, 1.29, 1.31, 1.33, 1.34, 1.40, 1.43, 1.45, 1.50, 1.51, 1.52, 1.53, 1.54, 1.54, 1.55, 1.58, 1.60, 1.63, 1.64, 1.80, 1.80, 1.81, 2.02, 2.05, 2.14, 2.17, 2.33, 3.03, 3.03, 3.34, 4.20, 4.69, 7.89

Open in a new tab

The results from fitting the Kevlar 49/epoxy strands failure times data (pressure at 90%) which include the estimate of the parameters, the standard errors of these estimated parameters, the loglikelihood (loglik) values, the Akaike Information Criterion (AIC) values and the Kolmogorov–Smirnov (K–S) statistic values (the corresponding p values are also reported) of all the fitted distributions are reported in Table 10. Figure 11 shows the graph of all the fitted densities alongside the histogram of the data. The results in Table 10 clearly show that the EGuWL distribution provided the best fit for the data by possessing the smallest AIC value as well as the highest p value of the K -S statistic.

(v)
Application to the Australian Athletes' Height Data

Table 10.

Maximum likelihood fit of the Kevlar 49/epoxy strands failure times data (pressure at 90%)

Distribution	BW	BN	GuWL	EW	GuWLP	EGuWL
Parameter estimates	$\begin{matrix} \hat{a} = & 0.7609 \\ (0.1240) \end{matrix}$	$\begin{matrix} \hat{a} = & 10.590 \\ (5.3031) \end{matrix}$	$\begin{matrix} \hat{α} = & 30.9196 \\ (0.0888) \end{matrix}$	$\begin{matrix} \hat{\hat{β}} = & 0.7932 \\ (0.2870) \end{matrix}$	$\begin{matrix} \hat{α} = & 0.8861 \\ (0.2282) \end{matrix}$	$\begin{matrix} \hat{α} = & 0.9413 \\ (0.0027) \end{matrix}$
	$\begin{matrix} \hat{b} = & 0.2157 \\ (0.0241) \end{matrix}$	$\begin{matrix} \hat{b} = & 0.0949 \\ (0.0594) \end{matrix}$	$\begin{matrix} \hat{c} = & 3.2739 \\ (0.6359) \end{matrix}$	$\begin{matrix} \hat{α} = & 1.0602 \\ (0.2398) \end{matrix}$	$\begin{matrix} \hat{β} = & - 0.596 \\ (3.2028) \end{matrix}$	$\begin{matrix} \hat{β} = & 0.7281 \\ (0.3845) \end{matrix}$
	$\begin{matrix} \hat{α} = & 1.0513 \\ (0.0027) \end{matrix}$	$\begin{matrix} \hat{c} = & 0.4639 \\ (0.1702) \end{matrix}$	$\begin{matrix} \hat{k} = & 1.9375 \\ (0.8810) \end{matrix}$	$\begin{matrix} \hat{λ} = & 1.2176 \\ (0.3932) \end{matrix}$	$\begin{matrix} \hat{c} = & 2.9498 \\ (1.8805) \end{matrix}$	$\begin{matrix} \hat{c} = & 3.1163 \\ (1.0502) \end{matrix}$
	$\begin{matrix} \hat{λ} = & 0.2588 \\ (0.0027) \end{matrix}$	$\begin{matrix} \hat{k} = & - 0.846 \\ (0.2472) \end{matrix}$	$\begin{matrix} \hat{λ} = & 0.2068 \\ (0.0713) \end{matrix}$		$\begin{matrix} \hat{k} = & 1.3918 \\ (3.3291) \end{matrix}$	$\begin{matrix} \hat{k} = & 1.6064 \\ (1.5398) \end{matrix}$
					$\begin{matrix} \hat{λ} = & 0.1992 \\ (0.1150) \end{matrix}$	$\begin{matrix} \hat{λ} = & 0.1736 \\ (0.0027) \end{matrix}$
Log Likelihood	− 102.17	− 129.81	− 100.94	− 102.79	− 100.16	− 99.80
AIC	212.34	267.62	209.88	211.57	210.31	209.59
K–S	0.0784	0.1219	0.0689	0.0844	0.0683	0.0629
p value	0.5385	0.0913	0.6983	0.4433	0.7078	0.7953

Open in a new tab

(Standard error of estimates in parenthesis)

Fig. 11 — Graph of the fitted densities for the Kevlar 49/epoxy strands failure times data (pressure at 90%)

For the fifth application, the EGuWL distribution is used to fit the heights (in centimeters) of 100 female Australian athletes. The data set was collected by the Australian Institute of Sport and reported in [28]. The data set is unimodal, leptokurtic, and left-skewed (skewness = − 0.5684, excess kurtosis = 1.3212). The data set is presented in Table 11. The BN, GuWL, EW, Weibull–Pareto {exponential} (WPE) [34] and the beta skew normal (BSN) [35] distributions are also used to fit the data set and their fits are compared with that of the EGuWL distribution. The WPE and the BSN densities are given by

f_{WPE} (x) = \frac{β c}{x} {[β log {(x / λ)}^{}]}^{c - 1} exp \{- {[β log {(x / λ)}^{}]}^{c}\}, x > λ, c, β, λ, > 0,

f_{BSN} (x) = \frac{2 Γ (a + b)}{Γ (a) Γ (b)} ϕ (z) Φ (α z) {[Φ (z ; α)]}^{a - 1} {[1 - Φ (z ; α)]}^{b - 1},

z = (x - k) / c, a, b, c > 0, - \infty < x, α, k < \infty, Φ (z ; α) = Φ (z) - 2 T (z, α),

Table 11.

Australian Athletes' Height Data

148.9, 149.0, 156.0, 156.9, 157.9, 158.9, 162.0, 162.0, 162.5, 163.0, 163.9, 165.0, 166.1, 166.7, 167.3, 167.9, 168.0, 168.6, 169.1, 169.8, 169.9, 170.0, 170.0, 170.3, 170.8, 171.1, 171.4, 171.4, 171.6, 171.7, 172.0, 172.2, 172.3, 172.5, 172.6, 172.7, 173.0, 173.3, 173.3, 173.5, 173.6, 173.7, 173.8, 174.0, 174.0, 174.0, 174.1, 174.1, 174.4, 175.0, 175.0, 175.0, 175.3, 175.6, 176.0, 176.0, 176.0, 176.0, 176.8, 177.0, 177.3, 177.3, 177.5, 177.5, 177.8, 177.9, 178.0, 178.2, 178.7, 178.9, 179.3, 179.5, 179.6, 179.6, 179.7, 179.7, 179.8, 179.9, 180.2, 180.2, 180.5, 180.5, 180.9, 181.0, 181.3, 182.1, 182.7, 183.0, 183.3, 183.3, 184.6, 184.7, 185.0, 185.2, 186.2, 186.3, 188.7, 189.7, 193.4, 195.9

Open in a new tab

$ϕ (.)$ and $Φ (.)$ are the pdf and cdf of the normal distribution respectively, $T (., .)$ is the Owen’s T function.

The results from fitting the Heights data which include the estimate of the parameters, the standard errors of these estimated parameters, the loglikelihood (loglik) values, the Akaike Information Criterion (AIC) values and the Kolmogorov–Smirnov (K–S) statistic values (the corresponding p values are also reported) of all the fitted distributions are reported in Table 12. Figure 12 shows the graph of all the fitted densities alongside the histogram of the data. The results in Table 12 clearly show that the EGuWL distribution provided the best fit for the data by possessing the smallest AIC value as well as the highest p value of the K -S statistic.

(vi)
Application to Australian Athletes’ sum of skin folds data

Table 12.

Maximum likelihood fit of the Heights data

Distribution	WPE	BN	GuWL	EW	BSN	EGuWL
Parameter estimates	$\begin{matrix} \hat{c} = & 8.1892 \\ (3.3757) \end{matrix}$	$\begin{matrix} \hat{a} = & 0.9773 \\ (1.3802) \end{matrix}$	$\begin{matrix} \hat{α} = & 12.433 \\ (0.0400) \end{matrix}$	$\begin{matrix} \hat{\hat{β}} = & 2.7803 \\ (1.3356) \end{matrix}$	$\begin{matrix} \hat{a} = & 0.9419 \\ (1.1986) \end{matrix}$	$\begin{matrix} \hat{α} = & 12.358 \\ (0.0027) \end{matrix}$
	$\begin{matrix} \hat{β} = & 2.8428 \\ (1.1247) \end{matrix}$	$\begin{matrix} \hat{b} = & 8.3144 \\ (25.382) \end{matrix}$	$\begin{matrix} \hat{c} = & 3.5464 \\ (0.2775) \end{matrix}$	$\begin{matrix} \hat{α} = & 14.748 \\ (3.2334) \end{matrix}$	$\begin{matrix} \hat{b} = & 7.5836 \\ (19.702) \end{matrix}$	$\begin{matrix} \hat{β} = & 0.9717 \\ (0.3854) \end{matrix}$
	$\begin{matrix} \hat{λ} = & 125.11 \\ (17.253) \end{matrix}$	$\begin{matrix} \hat{c} = & 13.280 \\ (14.537) \end{matrix}$	$\begin{matrix} \hat{k} = & 6.3124 \\ (0.3764) \end{matrix}$	$\begin{matrix} \hat{λ} = & 170.28 \\ (4.5382) \end{matrix}$	$\begin{matrix} \hat{c} = & 10.129 \\ (12.262) \end{matrix}$	$\begin{matrix} \hat{c} = & 4.1148 \\ (0.9673) \end{matrix}$
		$\begin{matrix} \hat{k} = & 193.97 \\ (29.487) \end{matrix}$	$\begin{matrix} \hat{λ} = & 148.84 \\ (0.0547) \end{matrix}$		$\begin{matrix} \hat{k} = & 100.19 \\ (129.59) \end{matrix}$	$\begin{matrix} \hat{k} = & 7.5580 \\ (1.5104) \end{matrix}$
					$\begin{matrix} \hat{α} = & 4.1711 \\ (12.143) \end{matrix}$	$\begin{matrix} \hat{λ} = & 146.48 \\ (0.0027) \end{matrix}$
Log Likelihood	− 351.49	− 350.30	− 350.14	− 351.44	−350.30	−349.02
AIC	708.97	708.60	708.28	708.89	210.31	708.04
K–S	0.0801	0.0721	0.0587	0.0711	0.0722	0.0534
p value	0.5171	0.6489	0.8607	0.6662	0.6472	0.9230

Open in a new tab

(Standard error of estimates in parenthesis)

Fig. 12 — Graph of the fitted densities for the Heights data

For the last application, the EGuWL distribution is used to fit the sum skin folds of 100 female Australian athletes. The data set was collected by the Australian Institute of Sport and reported in [28]. The data set is unimodal, leptokurtic, and right-skewed (skewness = 0.7878, excess kurtosis = 0.7320). The data set is presented in Table 13. The BN, GuWL, WPE, EW and BW distributions are also used to fit the data set and their fits are compared with that of the EGuWL distribution. The results from fitting the sum of skin folds data which include the estimate of the parameters, the standard errors of these estimated parameters, the loglikelihood (loglik) values, the Akaike Information Criterion (AIC) values and the Kolmogorov–Smirnov (K–S) statistic values (the corresponding p-values are also reported) of all the fitted distributions are reported in Table 14. Figure 13 shows the graph of all the fitted densities alongside the histogram of the data. The results in Table 14 clearly show that the EGuWL distribution provided the best fit for the data by possessing the smallest AIC value as well as the highest p value of the K–S statistic.

Table 13.

Australian Athletes' sum of skin folds data

33.8, 36.8, 38.2, 41.1, 41.6, 42.3, 43.5, 43.5, 46.1, 46.2, 46.3, 47.5, 47.6, 48.4, 49.0, 49.9, 50.0, 52.5, 52.6, 54.6,54.6, 55.6, 56.8, 57.9, 58.9, 59.4, 61.9, 62.6, 62.9, 65.1, 67.0, 68.3, 68.9, 69.9, 70.0, 71.3, 71.6, 73.9, 74.7, 74.9, 75.1,75.2, 76.2, 76.8, 77.0, 80.1, 80.3, 80.3, 80.3, 80.6, 83.0, 87.2, 88.2, 89.0,90.2, 90.4, 91.0, 91.2, 95.4, 96.8, 97.2, 97.9, 98.0, 98.1, 98.3, 98.5, 99.8, 99.9, 101.1, 102.8, 102.8,103.6,103.6, 104.6, 106.9, 109.0, 109.1, 109.5, 109.6, 110.2, 110.7, 111.1, 113.5, 114.0, 115.9, 117.8, 122.1,123.6, 125.9, 126.4, 126.4, 131.9, 136.3,143.5, 148.9,156.6,156.6, 171.1, 181.7, 200.8

Open in a new tab

Table 14.

Maximum likelihood fit of the sum of skin folds data

Distribution	BW	BN	GuWL	EW	WPE	EGuWL
Parameter estimates	$\begin{matrix} \hat{a} = & 4.3509 \\ (1.0637) \end{matrix}$	$\begin{matrix} \hat{a} = & 9.7706 \\ (0.4008) \end{matrix}$ $\begin{matrix} \hat{b} = & 0.1967 \\ (0.0223) \end{matrix}$	$\begin{matrix} \hat{α} = & 2.2868 \\ (0.9179) \end{matrix}$	$\begin{matrix} \hat{\hat{β}} = & 6.5077 \\ (6.3889) \end{matrix}$	$\begin{matrix} \hat{c} = & 3.6318 \\ (1.0579) \end{matrix}$	$\begin{matrix} \hat{α} = & 2.7590 \\ k 3 (0.0052) \end{matrix}$
	$\begin{matrix} \hat{b} = & 0.1641 \\ (0.0176) \end{matrix}$	$\begin{matrix} \hat{b} = & 0.1967 \\ (0.0223) \end{matrix}$	$\begin{matrix} \hat{c} = & 1.3310 \\ (0.3218) \end{matrix}$	$\begin{matrix} \hat{α} = & 1.2244 \\ (0.4407) \end{matrix}$	$\begin{matrix} \hat{β} = & 0.7183 \\ (0.1801) \end{matrix}$	$\begin{matrix} \hat{β} = & 0.1792 \\ (0.0180) \end{matrix}$
	$\begin{matrix} \hat{α} = & 1.9999 \\ (0.0026) \end{matrix}$	$\begin{matrix} \hat{c} = & 25.4309 \\ (0.9049) \end{matrix}$	$\begin{matrix} \hat{k} = & 0.0459 \\ (1.3030) \end{matrix}$	$\begin{matrix} \hat{λ} = & 41.487 \\ (24.646) \end{matrix}$	$\begin{matrix} \hat{λ} = & 23.0468 \\ (7.6267) \end{matrix}$	$\begin{matrix} \hat{c} = & 0.6422 \\ (0.0052) \end{matrix}$
	$\begin{matrix} \hat{λ} = & 33.310 \\ (0.0026) \end{matrix}$	$\begin{matrix} \hat{k} = & 9.1517 \\ (4.3612) \end{matrix}$	$\begin{matrix} \hat{λ} = & 81.639 \\ (27.344) \end{matrix}$			$\begin{matrix} \hat{k} = & - 0.916 \\ (0.0052) \end{matrix}$
						$\begin{matrix} \hat{λ} = & 65.951 \\ (0.0052) \end{matrix}$
Log Likelihood	− 486.25	− 487.06	− 486.28	− 487.27	− 486.07	− 485.23
AIC	980.50	982.10	980.55	980.54	978.13	980.47
K–S	0.0725	0.0711	0.0704	0.0809	0.0825	0.0598
p value	0.6424	0.6925	0.6778	0.5042	0.4782	0.8463

Open in a new tab

(Standard error of estimates in parenthesis)

Fig. 13 — Graph of the fitted densities for the sum of skin folds data

Summary and Conclusion

A new flexible probability distribution called the exponentiated Gumbel–Weibull {logistic} distribution has been defined and studied in this paper. The new distribution has been applied in modeling the daily number of infections from the novel COVID-19 pandemic in Nigeria. Five other data sets which exhibit various shape and tail behaviors have been further used to buttress the flexibility of the new distribution. The performance of the distribution in fitting the various data sets have been compared with those of other probability distributions in its class and results obtained showed that the new distribution gave the best fits. We hope the new distribution will attract further usage in fitting data sets from other fields.

Author Contributions

The first draft of the manuscript was written by Patrick Osatohanmwen and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Funding

No funding was received for conducting this study.

Declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Ethics approval

Ethical standards as recommended by the journal and in line with global best practices have been followed in the course of wrting the article as well as in the reporting of the results conatined therein.

Data Availability

All data as used in the article and in the generation of results are contained in the body of the article and where necesary, URL address have been provided to also acess them.

Code Availability

The codes used in the article can be obtained upon request from the corresponding author.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Olson DL, Shi Y (2007) Introduction to business data mining. McGraw-Hill/Irwin, New York [Google Scholar]
2.Shi Y, Tian YJ, Kou G, Peng Y, Li JP (2011) Optimization based data mining: theory and applications. Springer, Berlin [Google Scholar]
3.Tien JM (2017) Internet of things, real-time decision making, and artificial intelligence. Ann Data Sci 4(2):149–178 [Google Scholar]
4.Azzalini A (1985) A class of distributions which includes the normal ones. Scand J Stat 12:171–178 [Google Scholar]
5.Mudholkar GS, Srivastava DK (1993) Exponentiated Weibull family for analyzing bathtub failure-rate data. IEEE Trans Reliab 42:299–302. 10.1109/24.229504 [Google Scholar]
6.Eugene N, Lee C, Famoye F (2002) Beta-normal distribution and its applications. Commun Stat Theory Methods 31:497–512. 10.1081/STA-120003130 [Google Scholar]
7.Shaw WT, Buckley IR (2009) The alchemy of probability distributions: beyond Gram-Charlier expansions and a skew-kurtotic-normal distribution from a rank transmutation map. arXiv:0901.0434
8.Cordeiro GM, de Castro M (2011) A new family of generalized distributions. J Stat Comput Simul 81:883–898. 10.1080/00949650903530745 [Google Scholar]
9.Cordeiro GM, Ortega GM, da Cunha DCC (2013) The exponentiated generalized class of distributions. J Data Sci 11:1–27 [Google Scholar]
10.Alzaatreh A, Lee C, Famoye F (2014) T – normal family of distributions: a new approach to generalize the normal distribution. J Stat Distrib Appl 1:16 [Google Scholar]
11.Osatohanmwen P, Oyegue FO, Ajibade B, Ewere F (2020) A new generalized family of distributions on the unit interval: the T - kumaraswamy family of distributions. J Data Sci 18(2):218–236 [Google Scholar]
12.Osatohanmwen P, Oyegue FO, Ogbonmwan SM (2020) The T – R Y power series family of probability distributions. J Egypt Math Soc 28:29. 10.1186/s42787-020-00083-7 [Google Scholar]
13.Liu Z, Magal P, Seydi O, Webb G (2020) Predicting the cumulative number of cases for the COVID-19 epidemic in China from early data. arXiv:2002.12298v1 [DOI] [PubMed]
14.Roosa K, Lee Y, Luo R, Kirpich A, Rothenberg A, Hyman JM, Yan P, Chowell G (2020) Real-time forecasts of the COVID-19 epidemic in China from February 5th to February 24th, 2020. Infect Dis Model 5:256–263. 10.1016/j.idm.2020.02.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Tang B, Bragazzi NL, Li Q, Tang S, Xiao Y, Wu J (2020) An updated estimation of the risk of transmission of the novel corona virus (2019-nCov). Infect Dis Model 5:248–255. 10.1016/j.idm.2020.02.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Tang B, Wang X, Li Q, Bragazzi NL, Tang S, Xiao Y, Wu J (2020) Estimation of the transmission risk of the 2019-nCov and its implication for public health intervention. J Clin Med 9(2):462 [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Wu JT, Leung K, Leung GM (2020) Nowcasting and forecasting the potential domestic and international spread of the 2019-nCov outbreak originating in Wuhan, China: a modelling study. Lancet 395:689–697. 10.1016/s0140-6736(20)30260-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Osatohanmwen P, Oyegue FO, Ogbonmwan SM (2020) Modeling the daily number of reported cases of infection from the COVID-19 Pandemic in Nigeria: a stochastic approach. Earthline J Math Sci 5(2):217–235. 10.34198/ejms.5221.217235 [Google Scholar]
19.Guan C, Liu W, Cheng JYC (2021) Using social media to predict the stock market crash and rebound amid the pandemic: the digital ‘Haves’ and ‘Have-mores.’ Ann Data Sci. 10.1007/s40745-021-00353-w [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Li J, Guo K, Herrera Viedma E, Lee H, Liu J, Zhong Z, Gomes L, Filip FG, Fang SC, Özdemir MS, Liu XH, Lu G, Sh Y (2020) Culture vs policy: more global collaboration to effectively combat COVID-19. Innovation. 10.1016/j.xinn.2020.100023 [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Liu Y, Gu Z, Xia S, Shi B, Zhou X, Shi Y, Liu J (2020) What are the underlying transmission patterns of COVID-19 outbreak? An age-specific social contact characterization. EClincialMedicine 22:100354 [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Nadarajah S (2006) The exponentiated Gumbel distribution with climate application. Environmetrics 17:13–23. 10.1002/env.739 [Google Scholar]
23.Galton F (1883) Enquiries into human faculty and its development. Macmillan and Company, London [Google Scholar]
24.Moor JJ (1988) A quantile alternative for Kurtosis. Statistician 37:25–32 [Google Scholar]
25.Shannon CE (1948) A mathematical theory of communication. Bell Syst Tech J 27:379–432 [Google Scholar]
26.Nadarajah S, Kotz S (2006) The beta exponential distribution. Reliab Eng Syst Saf 91:689–697. 10.1016/j.ress.2005.05.008 [Google Scholar]
27.Barreto-Souza W, Santos AHS, Cordeiro GM (2010) The beta generalized exponential distribution. J Stat Comput Simul 80:159–172. 10.1080/00949650802552402 [Google Scholar]
28.Al-Aqtash R, Lee C, Famoye F (2014) Gumbel - Weibull distribution: properties and application. J Mod App Stat Method 13:201–225. 10.22237/jmasm/1414815000 [Google Scholar]
29.Birnbaum ZW, Saunders SC (1969) A new family of life distributions. J App Prob 6:637–652. 10.2307/3212003 [Google Scholar]
30.Famoye F, Lee C, Olumolade O (2005) The beta-Weibull distribution. J Stat Theory Appl 4:121–136 [Google Scholar]
31.Paranaiba PF, Ortega EMM, Cordeiro GM, Pescim R (2013) The beta Burr XII distribution with application to lifetime data. Comput Stat Data Anal 55:1118–1136. 10.1016/j.csda.2010.09.009 [Google Scholar]
32.Osatohanmwen P, Oyegue FO, Ogbonmwan SM (2019) A new Member from the T-X family of distributions: the Gumbel-Burr XII distribution and its properties. Sankhya A 81:298–322. 10.1007/s13171-017-0110-x [Google Scholar]
33.Cordeiro GM, Gomes AE, da-Silva CQ, Ortega EMM (2013) The beta exponentiated Weibull distribution. J Stat Comput Simul 83(1):114–138. 10.1080/00949655.2011.615838 [Google Scholar]
34.Alzaatreh A, Lee C, Famoye F (2013) Weibull-pareto distribution and its applications. Commun Stat Theory Methods 42:1673–1691. 10.1080/03610926.2011.599002 [Google Scholar]
35.Mameli V, Musio M (2013) A generalization of the beta skew-normal distribution: the beta skew-normal. Commun Statist Theory Methods 42:2229–2244. 10.1080/03610926.2011.607530 [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All data as used in the article and in the generation of results are contained in the body of the article and where necesary, URL address have been provided to also acess them.

[CR1] 1.Olson DL, Shi Y (2007) Introduction to business data mining. McGraw-Hill/Irwin, New York [Google Scholar]

[CR2] 2.Shi Y, Tian YJ, Kou G, Peng Y, Li JP (2011) Optimization based data mining: theory and applications. Springer, Berlin [Google Scholar]

[CR3] 3.Tien JM (2017) Internet of things, real-time decision making, and artificial intelligence. Ann Data Sci 4(2):149–178 [Google Scholar]

[CR4] 4.Azzalini A (1985) A class of distributions which includes the normal ones. Scand J Stat 12:171–178 [Google Scholar]

[CR5] 5.Mudholkar GS, Srivastava DK (1993) Exponentiated Weibull family for analyzing bathtub failure-rate data. IEEE Trans Reliab 42:299–302. 10.1109/24.229504 [Google Scholar]

[CR6] 6.Eugene N, Lee C, Famoye F (2002) Beta-normal distribution and its applications. Commun Stat Theory Methods 31:497–512. 10.1081/STA-120003130 [Google Scholar]

[CR7] 7.Shaw WT, Buckley IR (2009) The alchemy of probability distributions: beyond Gram-Charlier expansions and a skew-kurtotic-normal distribution from a rank transmutation map. arXiv:0901.0434

[CR8] 8.Cordeiro GM, de Castro M (2011) A new family of generalized distributions. J Stat Comput Simul 81:883–898. 10.1080/00949650903530745 [Google Scholar]

[CR9] 9.Cordeiro GM, Ortega GM, da Cunha DCC (2013) The exponentiated generalized class of distributions. J Data Sci 11:1–27 [Google Scholar]

[CR10] 10.Alzaatreh A, Lee C, Famoye F (2014) T – normal family of distributions: a new approach to generalize the normal distribution. J Stat Distrib Appl 1:16 [Google Scholar]

[CR11] 11.Osatohanmwen P, Oyegue FO, Ajibade B, Ewere F (2020) A new generalized family of distributions on the unit interval: the T - kumaraswamy family of distributions. J Data Sci 18(2):218–236 [Google Scholar]

[CR12] 12.Osatohanmwen P, Oyegue FO, Ogbonmwan SM (2020) The T – R Y power series family of probability distributions. J Egypt Math Soc 28:29. 10.1186/s42787-020-00083-7 [Google Scholar]

[CR13] 13.Liu Z, Magal P, Seydi O, Webb G (2020) Predicting the cumulative number of cases for the COVID-19 epidemic in China from early data. arXiv:2002.12298v1 [DOI] [PubMed]

[CR14] 14.Roosa K, Lee Y, Luo R, Kirpich A, Rothenberg A, Hyman JM, Yan P, Chowell G (2020) Real-time forecasts of the COVID-19 epidemic in China from February 5th to February 24th, 2020. Infect Dis Model 5:256–263. 10.1016/j.idm.2020.02.002 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Tang B, Bragazzi NL, Li Q, Tang S, Xiao Y, Wu J (2020) An updated estimation of the risk of transmission of the novel corona virus (2019-nCov). Infect Dis Model 5:248–255. 10.1016/j.idm.2020.02.001 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Tang B, Wang X, Li Q, Bragazzi NL, Tang S, Xiao Y, Wu J (2020) Estimation of the transmission risk of the 2019-nCov and its implication for public health intervention. J Clin Med 9(2):462 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Wu JT, Leung K, Leung GM (2020) Nowcasting and forecasting the potential domestic and international spread of the 2019-nCov outbreak originating in Wuhan, China: a modelling study. Lancet 395:689–697. 10.1016/s0140-6736(20)30260-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Osatohanmwen P, Oyegue FO, Ogbonmwan SM (2020) Modeling the daily number of reported cases of infection from the COVID-19 Pandemic in Nigeria: a stochastic approach. Earthline J Math Sci 5(2):217–235. 10.34198/ejms.5221.217235 [Google Scholar]

[CR19] 19.Guan C, Liu W, Cheng JYC (2021) Using social media to predict the stock market crash and rebound amid the pandemic: the digital ‘Haves’ and ‘Have-mores.’ Ann Data Sci. 10.1007/s40745-021-00353-w [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Li J, Guo K, Herrera Viedma E, Lee H, Liu J, Zhong Z, Gomes L, Filip FG, Fang SC, Özdemir MS, Liu XH, Lu G, Sh Y (2020) Culture vs policy: more global collaboration to effectively combat COVID-19. Innovation. 10.1016/j.xinn.2020.100023 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Liu Y, Gu Z, Xia S, Shi B, Zhou X, Shi Y, Liu J (2020) What are the underlying transmission patterns of COVID-19 outbreak? An age-specific social contact characterization. EClincialMedicine 22:100354 [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Nadarajah S (2006) The exponentiated Gumbel distribution with climate application. Environmetrics 17:13–23. 10.1002/env.739 [Google Scholar]

[CR23] 23.Galton F (1883) Enquiries into human faculty and its development. Macmillan and Company, London [Google Scholar]

[CR24] 24.Moor JJ (1988) A quantile alternative for Kurtosis. Statistician 37:25–32 [Google Scholar]

[CR25] 25.Shannon CE (1948) A mathematical theory of communication. Bell Syst Tech J 27:379–432 [Google Scholar]

[CR26] 26.Nadarajah S, Kotz S (2006) The beta exponential distribution. Reliab Eng Syst Saf 91:689–697. 10.1016/j.ress.2005.05.008 [Google Scholar]

[CR27] 27.Barreto-Souza W, Santos AHS, Cordeiro GM (2010) The beta generalized exponential distribution. J Stat Comput Simul 80:159–172. 10.1080/00949650802552402 [Google Scholar]

[CR28] 28.Al-Aqtash R, Lee C, Famoye F (2014) Gumbel - Weibull distribution: properties and application. J Mod App Stat Method 13:201–225. 10.22237/jmasm/1414815000 [Google Scholar]

[CR29] 29.Birnbaum ZW, Saunders SC (1969) A new family of life distributions. J App Prob 6:637–652. 10.2307/3212003 [Google Scholar]

[CR30] 30.Famoye F, Lee C, Olumolade O (2005) The beta-Weibull distribution. J Stat Theory Appl 4:121–136 [Google Scholar]

[CR31] 31.Paranaiba PF, Ortega EMM, Cordeiro GM, Pescim R (2013) The beta Burr XII distribution with application to lifetime data. Comput Stat Data Anal 55:1118–1136. 10.1016/j.csda.2010.09.009 [Google Scholar]

[CR32] 32.Osatohanmwen P, Oyegue FO, Ogbonmwan SM (2019) A new Member from the T-X family of distributions: the Gumbel-Burr XII distribution and its properties. Sankhya A 81:298–322. 10.1007/s13171-017-0110-x [Google Scholar]

[CR33] 33.Cordeiro GM, Gomes AE, da-Silva CQ, Ortega EMM (2013) The beta exponentiated Weibull distribution. J Stat Comput Simul 83(1):114–138. 10.1080/00949655.2011.615838 [Google Scholar]

[CR34] 34.Alzaatreh A, Lee C, Famoye F (2013) Weibull-pareto distribution and its applications. Commun Stat Theory Methods 42:1673–1691. 10.1080/03610926.2011.599002 [Google Scholar]

[CR35] 35.Mameli V, Musio M (2013) A generalization of the beta skew-normal distribution: the beta skew-normal. Commun Statist Theory Methods 42:2229–2244. 10.1080/03610926.2011.607530 [Google Scholar]

PERMALINK

The Exponentiated Gumbel–Weibull {Logistic} Distribution with Application to Nigeria’s COVID-19 Infections Data

Patrick Osatohanmwen

Eferhonore Efe-Eyefia

Francis O Oyegue

Joseph E Osemwenkhae

Sunday M Ogbonmwan

Benson A Afere

Abstract

Introduction

The New Distribution

Fig. 1.

Fig. 2.

Fig. 3.

Proposition 1:

Proof:

Statistical Properties of the New Distribution

Hazard Function

Fig. 4.

Fig. 5.

Fig. 6.

Mode

Proposition 2:

Proof:

Remark 1:

Moments

Fig. 7.

Entropy

Proposition 3:

Proof:

Remark 2:

Estimation

Maximum Likelihood Method of Estimation of the Parameters of the EGuWL Distribution

Monte Carlo Simulations

Table 1.

Table 2.

Remark 3

Applications

Table 3.

Table 4.

Fig. 8.

Table 5.

Table 6.

Fig. 9.

Table 7.

Table 8.

Fig. 10.

Table 9.

Table 10.

Fig. 11.

Table 11.

Table 12.

Fig. 12.

Table 13.

Table 14.

Fig. 13.

Summary and Conclusion

Author Contributions

Funding

Declarations

Conflict of interest

Ethics approval

Data Availability

Code Availability

Footnotes

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases