Bayesian and Classical Inference for the Generalized Log-Logistic Distribution with Applications to Survival Data

Abdisalam Hassan Muse; Samuel Mwalili; Oscar Ngesa; Saad J Almalki; Gamal A Abd-Elmougod

doi:10.1155/2021/5820435

. 2021 Oct 11;2021:5820435. doi: 10.1155/2021/5820435

Bayesian and Classical Inference for the Generalized Log-Logistic Distribution with Applications to Survival Data

Abdisalam Hassan Muse ^1,^✉, Samuel Mwalili ², Oscar Ngesa ³, Saad J Almalki ⁴, Gamal A Abd-Elmougod ⁵

PMCID: PMC8523281 PMID: 34671390

Abstract

The generalized log-logistic distribution is especially useful for modelling survival data with variable hazard rate shapes because it extends the log-logistic distribution by adding an extra parameter to the classical distribution, resulting in greater flexibility in analyzing and modelling various data types. We derive the fundamental mathematical and statistical properties of the proposed distribution in this paper. Many well-known lifetime special submodels are included in the proposed distribution, including the Weibull, log-logistic, exponential, and Burr XII distributions. The maximum likelihood method was used to estimate the unknown parameters of the proposed distribution, and a Monte Carlo simulation study was run to assess the estimators' performance. This distribution is significant because it can model both monotone and nonmonotone hazard rate functions, which are quite common in survival and reliability data analysis. Furthermore, the proposed distribution's flexibility and usefulness are demonstrated in a real-world data set and compared to its submodels, the Weibull, log-logistic, and Burr XII distributions, as well as other three-parameter parametric survival distributions, such as the exponentiated Weibull distribution, the three-parameter log-normal distribution, the three-parameter (or the shifted) log-logistic distribution, the three-parameter gamma distribution, and an exponentiated Weibull distribution. The proposed distribution is plausible, according to the goodness-of-fit, log-likelihood, and information criterion values. Finally, for the data set, Bayesian inference and Gibb's sampling performance are used to compute the approximate Bayes estimates as well as the highest posterior density credible intervals, and the convergence diagnostic techniques based on Markov chain Monte Carlo techniques were used.

1. Introduction

Applied statisticians use many probability distributions for reliability and survival studies. The distributions could be applied in different fields such as medicine, engineering, economy, industrial and physical fields, and so many other fields. Exponential distributions, generalized exponential distributions, gamma distributions, generalized gamma distributions, extreme value distributions, Weibull distributions, log-logistic distributions, log-normal distributions, Burr XII distributions, and generalized Weibull distributions are among the most frequently used distributions in survival and reliability analysis.

Typically, researchers in reliability and survival analysis are concerned with the development of new probability models. Log-logistic (LL) distribution is one of the parametric distributions that can be used as a life-testing distribution because of the simplicity of its cumulative distribution and survival function which can both be stated in closed form and because it belongs to the Scale-Shape family [1]. LL is one of the right-skewed, heavy-tailed functions that can be used as an alternative to a log-normal distribution. It resembles the log-normal distribution in shape but has heavier tails. Log-logistic distribution is particularly applicable to model nonmonotone (i.e., unimodal) hazard functions.

It is well understood that the log-logistic model is not appropriate for modelling where the failure rate is monotonic when analyzing time-to-event data with parametric models. It is suitable to use an extension of the model which has a monotone hazard function. Departures from the monotonicity of distribution are typically studied in terms of its shape or more specifically in terms of its skewness (also referred to as asymmetry) and kurtosis.

In this study, we focus on a modification of the log-logistic model because it resembles the log-normal distribution in shape but is better suited for the application in the analysis of survival data when dealing with incomplete data, such as censored observations which are common in such data [2]. The presence of incomplete observations causes difficulties when using log-normal or inverse Gaussian models, since the survival functions in these cases are complicated. On the other hand, since the logarithms of small positive numbers are large negative numbers, the log-normal distribution may give undue weight to very short survival times [1]. For the reasons stated above, we will focus on the log-logistic model whose hazard rate exhibits the aforementioned behaviour.

However, due to the log-logistic model's symmetric property, it may be inadequate for cases where the hazard rate is heavily tailed or skewed, as well as for modelling censored survival data [3–5]. In this study, we studied a modification (or generalization) of the log-logistic parametric survival model and referring to this as the generalized log-logistic distribution given in [6]. The generalized log-logistic distribution reflects the structure of the heavy tails and the skewness and it significantly outperformed the log-logistic distribution in general.

In the statistical literature, with the aim of increasing the versatility of the log-logistic distribution in modelling survival time data, different generalized forms of the distribution have recently been proposed, including a new extension of the LL distribution with applications to actuarial data sets [7], alpha power transformed LL distribution [8, 9], transmuted four-parameter generalized LL distribution [10, 11], a new three-parameter LL distribution [12], extended log-logistic distribution [13], exponentiated LL geometric distribution [14], the LL Weibull distribution [15], beta LL distribution [16], McDonald LL distribution [2], transmuted LL distribution [17], Marshal-Olkin LL distribution [18], the Zografos-Balakrishnan LL distribution [19], and exponentiated LL distribution [20]. More details about the modifications and recent generalizations of the log-logistic distribution can be found in [21].

In addition, other authors have studied the Bayesian inference of the LL distribution and some of its generalizations. dos Santos et al. [22] developed a Bayesian analysis of the transmuted LL distribution. Yahaya and Dewu [23] studied the Bayesian estimation of the scale parameter for the LL distribution using Chi-square and Maxwell priors. Abbas and Tang [24] studied the objective Bayesian analysis of the LL distribution using the reference and Jeffreys prior. Al-Shomrani et al. [25] focused on the application of the Markov chain Monte Carlo (McMC) techniques for estimating the unknown parameters of the LL distribution. Guure et al. [26] explored the Bayesian inference of the LL distribution for the interval-censored data. Kang et al. [27] proposed the noninformative priors for the LL distribution. Chaudhary and Kumar [28] studied the Bayesian estimation of the three-parameter exponentiated LL distribution. Akhtar et al. [29] discussed the Bayesian analysis of the LL distribution using the Laplace approximation. Chaudhary [30] proposed the Bayesian analysis of the two-parameter exponentiated LL distribution.

The log-logistic distribution has large-scale applications in analyzing time-to-event data. The model is closed under both proportionality (multiplication) of failure time and proportionality of odds, though it is not a proportional hazard (PH) model. However, regarding this issue, Khan and Khosa [6] presented generalized log-logistic distribution that belongs to the proportional hazard models. The proposed distribution has similar properties to the 2-parameter log-logistic distribution and approaches the Weibull distribution in limit. However, its statistical and mathematical properties, as well as inferential procedures, have not received attention so far. On the other hand, they discussed the classical inference of the proposed distribution under the PH regression framework. However, much work still has to be done. In this paper, we focused on the Bayesian and classical inference of the generalized log-logistic distribution as a generalized distribution, not as a regression model.

Additionally, for the applied cases, especially in the survival modelling, the GLL model could be applicable in the following cases: (1) modelling the “asymmetric monotonically right-skewed” heavy tail data sets; (2) modelling the “bathtub-shaped hazard rate” data sets like data set I; (3) in “survival analysis,” the GLL distribution could be chosen for modelling proportional hazard frameworks; (4) in the medical field, the GLL distribution could be considered in modelling the “bladder cancer data sets” which have “reversed bathtub-shaped HRF” as illustrated in data set I; and (5) in the reliability and survival analysis, the proposed distribution can be an alternative to the Weibull distribution since it can be closed under both accelerated failure time (AFT) and PH models since the Weibull distribution fails to model unimodal data. For these based on ground reasons, we are motivated to study and introduce the GLL distribution.

Thus, the main goal of this research article is to propose and study a generalized log-logistic distribution, which extends the exponential, Weibull, log-logistic, and Burr XII distributions, with the hope that the proposed distribution may have a better fit compared to these distributions and other 3-parametric distributions in certain practical situations. In addition, we would provide a comprehensive account of the mathematical and statistical properties of the proposed model. The proposed model's formulae are simple and tractable, and, with the use of modern computer software and its numerical capabilities, the proposed model could be a great addition to the arsenal of applied mathematicians and statisticians in the areas like medicine, engineering, economics, social sciences, and biology, among others. Finally, we discussed the Bayesian model formulation of the proposed distribution.

The rest of the paper is organized as follows. Section 2 describes the distribution functions for the GLL distribution, its submodel distributions, and some of its basic properties. Some mathematical properties of the GLL distribution are derived in Section 3. Section 4 describes the maximum likelihood for the estimation parameters of GLL distribution. Section 5 discusses the findings of a simulation study that was conducted to estimate and compare the performances of the proposed estimators. Section 6 presents an analysis of a real-life data set. The Bayesian model formulation for the proposed distribution is discussed in Section 7. Section 8 presents the Bayesian analysis of a real-life data set using Markov chain Monte Carlo techniques. Finally, Section 9 summarizes the study with some concluding remarks.

2. The Generalized Log-Logistic Distribution

The generalized log-logistic distribution is a continuous probability distribution with positive support ℝ on a subset of (0, ∞) with three parameters. It is a generalization of the two-parameter log-logistic distribution. The generalization of log-logistic distribution for censored survival data can be traced back to Singh et al. [3] who discussed a generalized log-logistic distribution and applied it to censored survival data and proposed a generalized log-logistic model and introduced the shape parameter and then they used it to fit a lung cancer data. Prentice [31] proposed a generalization for quantile response data and discussed several of its uses.

Since many continuous probability distributions are commonly applied for parametric models in survival analysis like the exponential, Gompertz, Weibull, log-normal, log-logistic, and the gamma distribution, GLL is also applicable for survival data analysis. There are a number of probability functions that are related to continuous probability distributions; we will concentrate on functions that are related to the lifetime distributions as a random variable in this study.

2.1. Hazard (Failure) Rate Function

SThe hazard (failure) rate function plays an important role in survival analysis. It is the most popular function for analyzing and modelling lifetime data because of its intuitive interpretation of the amount of risk to fail associated with a unit time t, applicable for describing the lifetime distribution of engineered and other components. The hazard rate is more informative than all of the other functions in lifetime distributions. Because of this, the authors in [6] started their work by defining the hazard rate of the GLL distribution. Cox and Oakes [32] described the reason why the hazard rate is considered when we are dealing with the survival data. They gave a number of reasons including the fact that hazard rate-based models are often convenient when there is incomplete information (censoring) or there are several types of failure rates; also hazard rate is a special form of the intensity function, and last but not least the hazard rate function can be derived from all other functions that we use to describe lifetime distributions.

The hazard rate function describes how the instantaneous failure rate changes over time. For the GLL distribution, the hazard rate function plots are given in

\begin{matrix} h (x; θ) = \frac{α k {(k x)}^{α - 1}}{[1 + {(η x)}^{α}]}, x \geq 0, k, α, η > 0, \end{matrix}

(1)

where k > 0, β > 0, η > 0 are parameters and θ=(k, α, η)′.

It can be easily seen from equation (1) that the hazard rate function is monotonically decreasing for α ≤ 1 and unimodal when α ≤ 1. That is, it initially increases to a maximum at t= [(α − 1)/λ^α]^(1/α) and then decreases to zero monotonically as t ⟶∞. The HRF plots are shown in Figure 1.

The hazard curve of the GLL distribution.

2.2. Submodels

The proposed distribution consists of a number of important submodels that are widely used in parametric survival modelling. These include the log-logistic distribution, the standard log-logistic distribution, the Burr XII distribution, the Weibull distribution, and the exponential distribution. The propositions below relate the GLL to the log-logistic, standard log-logistic, Burr XII, Weibull, and exponential distributions.

2.2.1. Log-Logistic Distribution

Proposition 1 . —

Let X ~ GLL(α, k, η). If η depends on k via k=η, then the hazard rate function of (1) reduces to the hazard rate function of the log-logistic distribution.

Proof —

The hazard rate function of the generalized log-logistic distribution is given by

$\begin{matrix} h (x; θ) = \frac{α k {(k x)}^{α - 1}}{[1 + {(η x)}^{α}]} . \end{matrix}$ (2)

If we replace η=k, it gives us

$\begin{matrix} h (x; θ) = \frac{α k {(k x)}^{α - 1}}{[1 + {(k x)}^{α}]} = \frac{α k {(k x)}^{α - 1}}{[1 + {(k x)}^{α}]}, \end{matrix}$ (3)

which is the hazard rate function form of a log-logistic distribution with the two unknown parameters (k, α). When θ=(k, α)′, k=(1/β) is the rate parameter.

It is easy to verify that the hazard rate function of the log-logistic distribution is monotonically decreasing for 0 < α ≤ 1 and unimodal for α > 1 (decreases and then increases with the maximum at x=(1/k)(α − 1)^(1/α)).

2.2.2. Standard Log-Logistic Distribution

Proposition 2 . —

Let X ~ GLL(α, k, η). If η depends on k via k=η=1, then the hazard rate function of (1) reduces to the hazard rate function of the standard log-logistic distribution.

Proof —

The hazard rate function of the generalized log-logistic distribution is given by

$\begin{matrix} h (x; θ) = \frac{α k {(k x)}^{α - 1}}{[1 + {(η x)}^{α}]} . \end{matrix}$ (4)

If we replace η=k=1, it gives us

$\begin{matrix} h (x; θ) = \frac{α \cdot 1 {(1 \cdot x)}^{α - 1}}{[1 + {(1 \cdot x)}^{α}]} \\ = \frac{α {(x)}^{α - 1}}{[1 + x^{α}]}, \end{matrix}$ (5)

which is the hazard rate function form of a standard log-logistic distribution with one unknown parameter (α). Hence, the proof.

It should be noted that x > 0, is the distribution‘s support, and α is the distribution's shape parameter. It is easy to verify that the hazard rate function of the log-logistic distribution is monotonically decreasing for 0 < α ≤ 1 and unimodal for α > 1 (decreases and then increases with the maximum at x=(α − 1)^(1/α)).

2.2.3. Burr XII Distribution

Proposition 3 . —

Let X ~ GLL(α, k, η). If η depends on k via η=kλ^−(1/α), λ > 0, then the hazard rate function of (1) reduces to the hazard rate function of the Burr XII distribution.

Proof —

The hazard rate function of the generalized log-logistic distribution is given by

$\begin{matrix} h (x; θ) = \frac{α k {(k x)}^{α - 1}}{[1 + {(η x)}^{α}]} . \end{matrix}$ (6)

If we replace η=kλ^−(1/α), it gives us

$\begin{matrix} h (x; θ) = \frac{α k {(k x)}^{α - 1}}{[1 + {(k λ^{- (1 / α)} x)}^{α}]} \\ = \frac{α k {(k x)}^{α - 1}}{[1 + (k λ^{- (α / α)} x^{α})]} = \frac{α k x^{α - 1}}{[1 + x^{α}]}, \end{matrix}$ (7)

which is the hazard rate function form of a Burr XII distribution with two unknown parameters (α, k). Hence, the proof.

The Burr XII hazard function is monotonically decreasing for α ≤ 1 and upside-down bathtub shapes curve for α > 1 (which means that it initially increases, attains a maximum at x=(α − 1)^(1/α), and then decreases to zero at (x⟶∞).

2.2.4. Weibull Distribution

Proposition 4 . —

Let X ~ GLL(α, k, η). If η^α⟶0, then the hazard rate function of the GLL (1) approaches the hazard rate function of the Weibull distribution.

Proof —

If we now let η^α⟶0, then, from the hazard rate function of the GLL given by

$\begin{matrix} h (x; θ) = \frac{α k {(k x)}^{α - 1}}{[1 + {(η x)}^{α}]}, \end{matrix}$ (8)

we have that

$\begin{matrix} h (x; θ) = \frac{α k {(k x)}^{α - 1}}{[1 + (0]}, \end{matrix}$ (9)

which by simplifying gives

$\begin{matrix} h (t; θ) = α k {(k x)}^{α - 1}, \end{matrix}$ (10)

which is a hazard function of a Weibull distribution with the unknown parameters (α, k). This property of the GLL enables it to handle monotonically increasing hazard satisfactorily with α > 1 and λ close to zero (very small).

It is clear from (10) that, for 0 < α < 1, the hazard rate function decreases, for α > 1, the hazard rate function increases, and for α=1, the hazard rate function decreases.

The distribution reduces to exponential for α=1.

2.2.5. Exponential Distribution

Proposition 5 . —

Similarly, if we now let α=1, then the hazard rate function of (10) reduces to the hazard rate function of the exponential distribution.

Proof —

From (10), we have that the hazard rate function is

$\begin{matrix} h (t; θ) = α k {(k x)}^{α - 1}, \end{matrix}$ (11)

and if we replace α=1,

$\begin{matrix} h (t; θ) = k \cdot 1 {(1 \cdot t)}^{1 - 1}, \end{matrix}$ (12)

which by simplifying gives

$\begin{matrix} h (t; θ) = k, \end{matrix}$ (13)

which is the hazard rate function of an exponential distribution. This property makes the exponential distribution be inadequate to describe survival data. Hence, the proof.

The summary of the submodels for the proposed distribution is summarized in Table 1.

Table 1.

Summary of submodels from the GLL distribution.

Distributions	α	η	k
Log-logistic distribution	α	η=k	k=η
Weibull distribution	η ^α⟶0	η ^α⟶0	k
Exponential distribution	α=1	η⟶0	k
Standard log-logistic distribution	α	η=k=1	k= η=1
Burr XII distribution	α	η=kλ^−(1/α), λ > 0	η=kλ^−(1/α), λ > 0

Open in a new tab

2.3. The Probability Density Function

The pdf of the GLL distribution with three unknown parameters can be obtained by applying the following equation and the pdf plots are shown in Figure 2.

\begin{matrix} f (x; θ) = h (x, θ) \exp \{- \int_{0}^{x} h (x) d x\} . \end{matrix}

(14)

Simplifying gives

\begin{matrix} f (x; θ) = \frac{α k {(k x)}^{α - 1}}{{[1 + {(η x)}^{α}]}^{(k^{α} / η^{α}) + 1}}, x \geq 0, k, α, η > 0. \end{matrix}

(15)

2.4. The Survival (or Reliability) Function

The survival (reliability) function of the GLL distribution that represents the probability that observation does not fail until t is given below and its plots are shown in Figure 3.

\begin{matrix} S (x; θ) = \frac{f (x; θ)}{h (x; θ)} . \end{matrix}

(16)

The survival curves of the GLL distribution.

Simplifying gives

\begin{matrix} S (x; θ) = {[1 + {(η x)}^{α}]}^{- (k^{α} / η^{α})}, x \geq 0, k, α, η > 0. \end{matrix}

(17)

2.5. Cumulative Distribution Function of the GLL Distribution

The cumulative distribution function (CDF), also known as the lifetime distribution function, of the GLL distribution is of the form below and the CDF plots are shown in Figure 4.

\begin{matrix} F (x; θ) = \frac{{[1 + {(η x)}^{α}]}^{(k^{α} / η^{α})} - 1}{{[1 + {(η x)}^{α}]}^{(k^{α} / η^{α})}}, x \geq 0, k, α, η > 0, \end{matrix}

(18)

where k > 0, β > 0, η > 0 are parameters and θ=(k, α, η)′.

2.6. The Reversed Hazard Rate Function

The reversed hazard rate (also known as the retro hazard) is defined as the ratio of pdf to the corresponding CDF. The retro hazard is written as follows:

\begin{matrix} r (x; θ) = \frac{f (x; θ)}{F (x; θ)} . \end{matrix}

(19)

Reversed hazard rate function plays an important role in the analysis of censored data and in the estimation of the survival function. The following equation gives us the basic relationship between hazard rate function and the reversed hazard rate function.

\begin{matrix} r (x; θ) = \frac{h (x; θ) S (x; θ)}{1 - S (x; θ)} . \end{matrix}

(20)

The applications of hazard rate function in survival analysis are well known. Recently, the reversed hazard rate function has gained popularity among applied statisticians; for more information, see [33, 34]. Block et al. [33] showed that the hazard rate function plays an essential role in the analysis of right-censored data, while the retro hazard plays an essential role in the analysis of left-censored data.

The reversed hazard rate function of the GLL distribution takes the form

\begin{matrix} r (x; θ) = \frac{f (x; θ)}{F (x; θ)} = \frac{(α k {(k t)}^{α - 1} / {[1 + {(λ x)}^{α}]}^{(k^{α} / λ^{α}) + 1})}{{[1 + {(λ x)}^{α}]}^{(k^{α} / λ^{α})}} . \end{matrix}

(21)

Simplifying gives

\begin{matrix} r (x; θ) = \frac{α k {(k x)}^{α - 1}}{{[1 + {(λ x)}^{α}]}^{(k^{α} / λ^{α}) + 1} - [1 + {(λ x)}^{α}]}, x \geq 0, k, α, η > 0. \end{matrix}

(22)

The reversed hazard rate plots are shown in Figure 5.

The reversed hazard rate curves of the GLL distribution.

2.7. The Cumulative Hazard Function

The cumulative hazard function of the GLL distribution takes the form

\begin{matrix} H (x; θ) = - \log S (x; θ) = \int_{0}^{x} h (x; θ) d x . \end{matrix}

(23)

Simplifying gives

\begin{matrix} H (x; θ) = \frac{k^{α}}{λ^{α}} \log [1 + {(λ x)}^{α}], x \geq 0, k, α, η > 0, \end{matrix}

(24)

where k > 0, α > 0, λ > 0 are parameters and θ=(k, α, η)′.

2.8. The Hazard Rate Average (FRA) Function

The HRA function of X is expressed as

\begin{matrix} HRA (x; θ) = \frac{H (x; θ)}{x} = \frac{\int_{0}^{x} h (x; θ) d x}{x}, x > 0, \end{matrix}

(25)

where H(x; θ) is the cumulative hazard function. An analysis of HRA(x; θ) on t enables us to find increasing hazard rate average and decreasing hazard rate average.

3. Some Mathematical Properties of the GLL Distribution

In this section, we present some mathematical properties of the GLL distribution. The functions that we discussed in Section 2 are not the only ways that we can define the GLL distribution, but there are other mathematical functions that we can use to describe the lifetime distributions of a random variable X. These include quantile function and its related results, moments and its related properties, r^th central moments, residual life and reversed residual life functions, and other mathematical properties.

3.1. The Quantile Function and Related Results

The quantile function (which is the inverse of the CDF) is crucial in statistical and quantitative data analysis. A probability distribution can be defined in terms of either the quantile function or the cumulative distribution function [35]. The quantiles of the proposed distribution with various parameter values are given in Table 2.

Table 2.

Quantiles of the proposed distribution for different parameter values.

Quantiles	(k, α, η)
Quantiles	(0.5, 0.5, 0.5)	(5.0, 1.5, 1.5)	(4.0, 4.0, 2.5)	(3.0, 2.0, 3.0)	(5.0, 3.0, 2.0)
0.1	0.0247	0.0449	0.1427	0.1111	0.0945
0.2	0.1250	0.0745	0.1725	0.1667	0.1216
0.3	0.3673	0.1026	0.1945	0.2182	0.1424
0.4	0.8889	0.1314	0.2134	0.2722	0.1608
0.5	1.9999	0.1627	0.2312	0.3333	0.1783
0.6	4.4999	0.1985	0.2489	0.4082	0.1961
0.7	10.8889	0.2421	0.2681	0.5092	0.2155
0.8	32.0000	0.3006	0.2906	0.6667	0.2385
0.9	162.0000	0.3972	0.3222	1.0000	0.2707

Open in a new tab

Theorem 1 . —

If T ~ GLL(k, α, η), then the quantile function, lower quartile, median, and the upper quartile of the GLL distribution, respectively, are given by

$\begin{matrix} X_{q} = F^{- 1} (q; k, α, η) = \frac{{\{{[1 / (1 - p)]}^{(η^{α} / k^{α})} - 1\}}^{(1 / α)}}{η}, \end{matrix}$ (26)

$\begin{matrix} X_{q_{1}} = \frac{{\{{[4 / 3]}^{(η^{α} / k^{α})} - 1\}}^{(1 / α)}}{η}, \end{matrix}$ (27)

$\begin{matrix} X_{q_{2}} = Median = \frac{{\{2^{(η^{α} / k^{α})} - 1\}}^{(1 / α)}}{η}, \end{matrix}$ (28)

$\begin{matrix} X_{q_{3}} = \frac{{\{4^{(η^{α} / k^{α})} - 1\}}^{(1 / α)}}{η} . \end{matrix}$ (29)

Proof —

The quantile function of GLL distribution is derived by finding the value of Q for which

$\begin{matrix} 1 - {[1 + {(η x)}^{α}]}^{- (k^{α} / η^{α})} = p, \\ X_{q} = F^{- 1} (q; k, α, η) = {[1 + {(η q)}^{α}]}^{- (k^{α} / η^{α})} = 1 - p \\ = \frac{1}{{[1 + {(η q)}^{α}]}^{(k^{α} / η^{α})}} = 1 - p \\ = {[1 + {(η q)}^{α}]}^{(k^{α} / η^{α})} = \frac{1}{1 - p} \\ = 1 + {(η q)}^{α} = {(\frac{1}{1 - p})}^{(η^{α} / k^{α})} \\ = {(η q)}^{α} = {(\frac{1}{1 - p})}^{(η^{α} / k^{α})} - 1 \\ = η q = {\{{[\frac{1}{1 - p}]}^{(η^{α} / k^{α})} - 1\}}^{(1 / α)}, \\ ∴ q = \frac{{\{{[1 / (1 - p)]}^{(η^{α} / k^{α})} - 1\}}^{(1 / α)}}{η}, \end{matrix}$ (30)

where p ∈ [0,1). k > 0, α > 0, η > 0. Hence the proof.

Similarly, we can prove (27)–(29) by applying the following values: the lower quartile = 1/4, median = 2/4 = 1/2, and the upper quartile = 3/4.

Lower quartile is

$\begin{matrix} X_{q_{1}} = \frac{{\{{[4 / 3]}^{(η^{α} / k^{α})} - 1\}}^{(1 / α)}}{η} . \end{matrix}$ (31)

Median is

$\begin{matrix} X_{q_{2}} = median = \frac{{\{2^{(η^{α} / k^{α})} - 1\}}^{(1 / α)}}{η} . \end{matrix}$ (32)

Upper quartile is

$\begin{matrix} X_{q_{3}} = \frac{{\{4^{(η^{α} / k^{α})} - 1\}}^{(1 / α)}}{η} . \end{matrix}$ (33)

3.1.1. Skewness and Kurtosis

The following relationship defines the mathematical form of the Galton Skewness and Moors Kurtosis of the GLL model with three parameters:

\begin{matrix} S_{K} = \frac{Q (3 / 4) + Q (1 / 4) - 2 Q (2 / 4)}{Q (3 / 4) - Q (1 / 4)}, \\ K_{M} = \frac{Q (7 / 8) + Q (3 / 8) - Q (5 / 8) - Q (1 / 8)}{Q (6 / 8) - Q (2 / 8)}, \end{matrix}

(34)

where Q describes different quartile values.

The above equations can be determined as functions of the GLL quantile function. The advantages of these measures are that they are less sensitive in the presence of outliers and that they exist even when the distribution is lacking moments.

3.2. The Random Deviate Generation Functions

Let U be a random variable with a uniform distribution (0,1) and an inverse CDF, F(.). Then any sample drawn from F⁻¹(u) is assumed to have been drawn from F(.). As a result, using GLL (k, α, η), the random deviate can be generated as follows:

\begin{matrix} x = \frac{{\{{[1 / (1 - u) - 1]}^{(λ^{α} / k^{α})}\}}^{(1 / α)}}{λ}, 0 < u < 1, \end{matrix}

(35)

where u follows U(0,1) distribution.

3.3. The r^th Moments and Related Results

Numerous important characteristics and properties of a probability distribution such as mean, variance, kurtosis, and skewness can be obtained from its moments. Moments are extremely important and play a central role in statistical analysis, especially in applications. The important moment functions, such as the moments, r^th moment, r^th central moment, mean, variance, skewness, and kurtosis of the proposed distribution, are presented.

Theorem 2 . —

If T ~ GLL (k, α, η), then the r^th power, negative moments, and logarithmic moments are given, respectively, by

$\begin{matrix} E (T^{r}) = \frac{k^{α}}{η^{α + r}} \frac{Γ ((k^{α} / η^{α}) - (r / α)) Γ ((r / α) + 1)}{Γ ((k^{α} / η^{α}) + 1)}, for \frac{α k^{α}}{η^{α}} > r, \end{matrix}$ (36)

$\begin{matrix} E (T^{- r}) = \frac{λ^{α + r}}{k^{α}} \frac{Γ ((k^{α} / η^{α}) + 1)}{Γ ((k^{α} / η^{α}) - (r / α)) Γ ((r / α) + 1)} . \end{matrix}$ (37)

Proof —

We have

$\begin{matrix} E (T^{r}) = \int_{0}^{\infty} t^{r} f (t; k, α, η) d t = \int_{0}^{\infty} t^{r} \frac{α k {(k t)}^{α - 1}}{{[1 + {(η t)}^{α}]}^{(k^{α} / η^{β}) + 1}} d t = \frac{α k}{Γ ((k^{α} / η^{α}) + 1)} \int_{0}^{\infty} t^{r} \frac{{(k t)}^{α - 1}}{1 + {(η t)}^{α}} d t \\ = \frac{k^{α}}{η^{α + r}} \frac{Γ ((k^{α} / η^{α}) - (r / α)) Γ ((r / α) + 1)}{Γ ((k^{α} / η^{α}) + 1)}, for \frac{α k^{α}}{η^{α}} > r . \end{matrix}$ (38)

Similarly, we can prove (37).

3.3.1. Mean and Variance

Corollary 1 . —

If T ~ GLL (k, α, η), then the mean and variance are given, respectively, as follows.

The mean of the GLL distribution is

\begin{matrix} μ = E (T) = \frac{k^{α}}{η^{α}} \frac{Γ ((k^{α} / η^{α}) - (1 / α)) Γ ((1 / α) + 1)}{Γ ((k^{α} / η^{α}) + 1)} . \end{matrix}

(39)

This is provided that (αk^α/η^α) > 1.

The Variance of the GLL distribution is

\begin{matrix} σ^{2} = V (T) = E (T^{2}) - {(E (T))}^{2} \\ = \frac{k^{α}}{η^{α + 2}} \frac{Γ ((k^{α} / η^{α}) - (2 / α)) Γ ((2 / α) + 1)}{Γ ((k^{α} / η^{α}) + 1)} - {(\frac{k^{α}}{η^{α}} \frac{Γ ((k^{α} / η^{α}) - (1 / α)) Γ ((1 / α) + 1)}{Γ ((k^{α} / η^{α}) + 1)})}^{2} . \end{matrix}

(40)

This is provided that (αk^α/η^α) > 2.

3.4. The r^th Central Moments

Corollary 2 . —

If T ~ GLL (k, α, η), then the cumulants of the first, second, and r^th central moments, are given, respectively, by

$\begin{matrix} c_{1} = μ_{1}^{'} = E (T) = \frac{k^{α}}{η^{α}} \frac{Γ ((k^{α} / η^{α}) - (1 / α)) Γ ((1 / α) + 1)}{Γ ((k^{α} / η^{α}) + 1)}, \\ c_{2} = μ_{2}^{'} - μ_{1}^{2'} = E (T^{2}) - {(E (T))}^{2} = \frac{k^{α}}{η^{α + 2}} \frac{Γ ((k^{α} / η^{α}) - (2 / α)) Γ ((2 / α) + 1)}{Γ ((k^{α} / η^{α}) + 1)} - {(\frac{k^{β}}{η^{α}} \frac{Γ ((k^{α} / η^{α}) - (1 / α)) Γ ((1 / α) + 1)}{Γ ((k^{α} / η^{α}) + 1)})}^{2}, \\ c_{r} = μ_{r}^{'} - \sum_{n = 1}^{r - 1} (\begin{matrix} r - 1 \\ n - 1 \end{matrix}) c_{n} μ_{r - m}^{'} = \frac{k^{α}}{η^{α + r}} \frac{Γ ((k^{α} / η^{α}) - (r / α)) Γ ((r / α) + 1)}{Γ ((k^{α} / η^{α}) + 1)} \\ - \sum_{n = 1}^{r - 1} (\begin{matrix} r - 1 \\ n - 1 \end{matrix}) c_{n} \frac{k^{α}}{η^{α + (r - n)}} \frac{Γ ((k^{α} / η^{α}) - ((r - n) / α)) Γ (((r - n) / α) + 1)}{Γ ((k^{α} / η^{α}) + 1)} . \end{matrix}$ (41)

Hence, from Corollary 2, we can derive the skewness and kurtosis of the GLL distribution by computing, respectively:

\begin{matrix} Skewness = \frac{c_{3}}{{(σ^{2})}^{(3 / 2)}}, \\ Kurtosis = \frac{c_{4}}{{(σ^{2})}^{2}} . \end{matrix}

(42)

3.5. Residual and Reverse Residual Life

The residual life has broader applications in survival analysis and risk management. The residual lifetime of the GLL random variable is calculated as follows:

\begin{matrix} R_{(t)} (x) = \frac{S (x + t)}{S (t)}, \\ R_{(t)} (x) = \frac{{[1 + {(η (x + t))}^{α}]}^{- (k^{α} / η^{α})}}{{[1 + {(η t)}^{α}]}^{- (k^{α} / η^{α})}} . \end{matrix}

(43)

In addition, the reverse residual life of the generalized log-logistic random variable can be calculated as follows:

\begin{matrix} {\hat{R}}_{(t)} (x) = \frac{S (x - t)}{S (t)}, \\ {\hat{R}}_{(t)} (x) = \frac{{[1 + {(η (x - t))}^{α}]}^{- (k^{α} / η^{α})}}{{[1 + {(η t)}^{α}]}^{- (k^{α} / η^{α})}} . \end{matrix}

(44)

From Table 3, the GLL distribution is clearly numerically versatile in terms of means and variance. Furthermore, the values of CS show that it can be right-skewed, nearly symmetrical, or slightly left-skewed. The CK values show that the GLL distribution can be mesokurtic, leptokurtic, or platykurtic. All of these characteristics demonstrate the GLL distribution flexibility, which remains appealing for modelling purposes.

Table 3.

1st five moments, standard deviation, skewness, and kurtosis of the GLL distribution for some parameter values.

Moments	(k, α, η)
Moments	(0.5, 0.5, 0.5)	(1.0, 1.5, 1.5)	(1.5, 2.0, 2.5)	(2.0, 5.0, 3.0)	(1.0, 1.0, 2.0)	(4.0, 4.5, 0.2)	(5.0, 4.0, 0.5)
μ₁′	0.1034	0.2065	0.2432	0.2795	0.1547	0.2281	0.1813
μ₂′	0.0567	0.1292	0.1482	0.1741	0.0893	0.0554	0.0354
μ₃′	0.0388	0.0925	0.1036	0.1204	0.0619	0.0141	0.0073
μ₄′	0.0294	0.0715	0.0787	0.0900	0.0471	0.0037	0.0016
μ₅′	0.0237	0.0581	0.0631	0.0711	0.0380	0.0010	0.0004
SD	0.2146	0.2943	0.2984	0.3098	0.2557	0.0575	0.0509
CV	2.0743	1.4250	1.2270	1.1081	1.6529	0.2521	0.2805
CS	2.3743	1.1784	0.9109	0.6100	1.6648	−0.1784	−0.0871
CK	7.8842	3.0318	2.5240	2.0238	4.6628	2.8081	2.7479

Open in a new tab

The mean and variance plots for different values of alpha and kappa parameters are shown in Figure 6, while the skewness and kurtosis plots are shown in Figure 7.

The mean and variance plot for several combinations of alpha and kappa parameters.

The skewness and kurtosis plot for several combinations of alpha and kappa.

4. Maximum Likelihood Estimation (MLE)

In this section, the unknown parameters of the generalized log-logistic distribution based on a complete sample are estimated using the maximum likelihood method. Let X₁, X₂, …, X_n indicate a random sample of the complete GLL data, and then the sample's likelihood function is given as

\begin{matrix} L = \prod_{i = 1}^{n} f (x_{i}, α, k, η), \\ L (x; α, k, η) = \prod_{i = 1}^{n} \frac{α k {(k x_{i})}^{α - 1}}{{[1 + {(η x_{i})}^{α}]}^{(k^{α} / η^{α}) + 1}} . \end{matrix}

(45)

The log-likelihood function may be expressed as

\begin{matrix} ℓ = n \log (α k) + (α - 1) \sum_{i = 1}^{n} \log (k x_{i}) - \sum_{i = 1}^{n} \log [1 + {(η x_{i})}^{α}] - (\frac{k}{η}) \sum_{i = 1}^{n} \log [1 + {(η x_{i})}^{α}] . \end{matrix}

(46)

By taking the first derivatives of the log-likelihood function in equation (48) with respect to α, k, and η and fixing the outcome to zero, we have

\begin{matrix} \frac{\partial ℓ}{\partial α} = \frac{n}{α} + \sum_{i = 1}^{n} log (k x_{i}) - \sum_{i = 1}^{n} \{\frac{({(η x_{i})}^{α} \log (η x_{i}))}{[1 + {(η x_{i})}^{α}]}\} - (\frac{k}{η}) \sum_{i = 1}^{n} \{\frac{({(η x_{i})}^{α} \log (η x_{i}))}{[1 + {(η x_{i})}^{α}]}\}, \end{matrix}

(47)

\begin{matrix} \frac{\partial ℓ}{\partial k} = \frac{n}{k} + n k (α - 1) - \frac{1}{η} \sum_{i = 1}^{n} log (1 + η x_{i}), \end{matrix}

(48)

\begin{matrix} \frac{\partial ℓ}{\partial η} = - \sum_{i = 1}^{n} \{\frac{({(η x_{i})}^{α} \log (η x_{i}))}{[1 + {(η x_{i})}^{α}]}\} - \frac{k}{η^{2}} \sum_{i = 1}^{n} \{\frac{({(η x_{i})}^{α} \log (η x_{i}))}{[1 + {(η x_{i})}^{α}]}\} . \end{matrix}

(49)

It is worth noting that the MLEs $\hat{α}, \hat{k} and \hat{η}$ of α, k, and η, respectively, can be obtained by equating the results to zero and numerically solving the system of nonlinear equations. Because the expected information matrix is complicated, the observed information matrix J(θ) is used to construct confidence intervals for the model parameters. The observed information matrix is given by

\begin{matrix} J (θ) = - [\begin{matrix} \frac{\partial^{2} ℓ}{\partial^{2} α} & \frac{\partial^{2} ℓ}{\partial α \partial k} & \frac{\partial^{2} ℓ}{\partial α \partial η} \\ \frac{\partial^{2} ℓ}{\partial^{2} k} & \frac{\partial^{2} ℓ}{\partial k \partial \partial} \\ \frac{\partial^{2} ℓ}{\partial^{2} η} \end{matrix}], \end{matrix}

(50)

where θ=(α, k, η)′. When the usual regularity conditions are met and the parameters are within the parameter space's interior but not on the boundary, $\sqrt{n} (≅ θ - θ)$ converges in distribution to N₃(0, I⁻¹(θ)), where I(θ) is the expected information matrix. When I(θ) is replaced by the observed information matrix evaluated at J(θ), the asymptotic behaviour remains valid. The asymptotic multivariate normal distribution N₃(0, J⁻¹(θ)) can be used to generate 100(1 − τ)% two-sided confidence intervals for the model parameters, where τ is the significant level.

5. Monte Carlo Simulation Study

In this section, we assess the performance of the MLEs estimators for a finite sample of size n using a Monte Carlo simulation study. The simulation study based on the generalized log-logistic distribution is carried out to examine the average biases (ABs), the mean square errors (MSEs), the root mean square errors (RMSEs), and maximum likelihood estimates (MLEs) for the model parameters α, k, and η. The simulation experiment was carried out using a variety of simulations with varying sample sizes and parameter values. To generate random samples for the GLL, the quantile function is given in equation [26]. The simulation study was repeated 1500 times, each with sample sizes n=50,100, …, 1500, and the following parameter scenarios in set I: α=0.9, k=0.5and η=2.5, and the following parameter scenarios in set II: α=0.8, k=0.4 and η=2.0.

The MLEs of the GLL model are determined via the nlminb () R-function with the argument method = “BFGS”; see supplementary materials (available here). For each piece of simulated data, say, ( $\hat{α}, \hat{k}, \hat{η}$ ) for i=1,2,…, 1000, the AB, RMSE, and MSE of the parameters were computed by

\begin{matrix} AB = \frac{1}{N} \sum_{i = 1}^{N} (\hat{θ} - θ), \\ MSE = \frac{1}{N} \sum_{i = 1}^{N} {(\hat{θ} - θ)}^{2}, \\ RMSE = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(\hat{θ} - θ)}^{2}}, \end{matrix}

(51)

where θ=α, k and η.

The MLE, AB, and RMSE values of the parameters α, k and η are displayed from various sample sizes. Based on these findings, we conclude that the MLEs perform quite well in estimating the model parameters and that the estimates are fairly stable and are nearer to the true values for these sample sizes. Table 4 and Figures 8–11 show that as the sample size increases, the MSE and RMSE decrease as expected. Furthermore, as the sample size increases, the AB decreases. In addition, the MLEs of the parameters of the model are very close to the true value. As a result, the maximum likelihood estimates and their asymptotic results can be applied to construct confidence intervals for the model parameters even for a small sample size.

Table 4.

Monte Carlo simulation results for the GLL distribution: MLE, AB, MSEs, and RMSEs.

Parameters	n	I			II
Parameters	n	MLE	AB	RMSE	MLE	AB	RMSE
α	50	2.320	1.420	5.273	2.330	1.530	7.149
	100	1.281	0.381	2.386	1.097	0.297	2.263
	300	0.995	0.095	1.369	0.937	0.137	1.512
	600	0.921	0.021	0.5207	0.840	0.040	0.619
	900	0.908	0.008	0.067	0.804	0.004	0.058
	1200	0.905	0.005	0.060	0.803	0.003	0.049
	1500	0.904	0.004	0.054	0.804	0.004	0.045

k	50	1.246	0.746	2.306	1.217	0.817	2.802
	100	0.792	0.292	1.235	0.613	0.213	1.093
	300	0.571	0.071	0.665	0.463	0.063	0.467
	600	0.511	0.011	0.135	0.422	0.022	0.199
	900	0.508	0.008	0.095	0.405	0.005	0.081
	1200	0.507	0.007	0.082	0.404	0.004	0.066
	1500	0.505	0.005	0.073	0.405	0.005	0.063

η	50	3.280	0.780	3.404	3.033	1.033	3.944
	100	2.780	0.280	1.800	2.281	0.281	1.614
	300	2.612	0.112	0.904	2.056	0.056	0.806
	600	2.554	0.054	0.588	2.046	0.046	0.543
	900	2.542	0.042	0.500	2.019	0.019	0.442
	1200	2.526	0.026	0.409	2.014	0.014	0.360
	1500	2.520	0.020	0.370	2.030	0.030	0.340

Open in a new tab

Plots for MLEs and biases of the GLL model for set I of the table.

Plots for MSEs and RMSEs of the GLL distribution for the values of set I in the table.

Plots for MLEs and biases of the GLL distribution for the values of set II in the table.

Plots for MSEs and RMSEs of the GLL distribution for the values of set II in the table.

6. Data Analysis

In this section, the proposed distribution is fully applied to real-world data set which is taken from literature to demonstrate the ability of the new model. We compare the proposed distribution with the other three parametric survival distributions including gamma, log-normal, log-logistic, exponentiated Weibull, and the Weibull distribution. Also, we have compared the GLL distribution with some of its submodels with two-parameter distribution, namely, Weibull, log-logistic, and the Burr XII distributions.

The density functions of the fitted models are as follows.

(1)
Weibull distribution:
$\begin{matrix} f (t) = α k {(k t)}^{α - 1} \exp \{- {(k t)}^{α}\} . \end{matrix}$ (52)
(2)
Log-logistic distribution:
$\begin{matrix} f (t) = \frac{α k {(k t)}^{α - 1}}{{[1 + {(k t)}^{α}]}^{2}} . \end{matrix}$ (53)
(3)
Burr XII distribution:
$\begin{matrix} f (t) = \frac{α k t^{α - 1}}{{[1 + t^{α}]}^{- k - 1}} . \end{matrix}$ (54)
(4)
Exponentiated Weibull distribution:
$\begin{matrix} f (t) = α k λ {(k t)}^{α - 1} {(1 - exp \{- {(k t)}^{α}\})}^{λ - 1} \exp \{- {(k t)}^{α}\} . \end{matrix}$ (55)
(5)
Three-parameter log-logistic distribution (or shifted log-logistic distribution):
$\begin{matrix} f (t) = \frac{α / β {((t - μ) / β)}^{α - 1}}{{[1 + {((t - μ) / k β)}^{α}]}^{2}} . \end{matrix}$ (56)
(6)
Three-parameter log-normal distribution:
$\begin{matrix} f (t) = \frac{α}{β} {(\frac{t - μ}{β})}^{α - 1} \exp \{- {(\frac{t - μ}{k β})}^{α}\} . \end{matrix}$ (57)
(7)
Three-parameter Weibull distribution:
$\begin{matrix} f (t) = \frac{\exp \{- (1 / 2) {((\log (t - μ) - α) / β)}^{2}\}}{\sqrt{2 π} β (x - μ)} . \end{matrix}$ (58)
(8)
Three-parameter Gamma distribution:
$\begin{matrix} f (t) = \frac{{(t - μ)}^{α - 1} \exp - ((t - μ) / β)}{β^{α} Γ (α)}, \end{matrix}$ (59)
where t > μ.

Certain analytical measures are taken into account in order to determine which distribution best fits the applied data. These analytical measures include four discrimination measures: AIC (Akaike Information Criterion), CAIC (Consistent Akaike Information Criterion), BIC (Bayesian Information Criterion), and HQIC (Hannan-Quin Information Criterion). In addition, there are two goodness-of-fit tests: Anderson–Darling (A^∗) and Cramer-von Mises (W^∗).

The AIC is

\begin{matrix} AIC = 2 k - 2 l . \end{matrix}

(60)

The BIC is

\begin{matrix} BIC = k \ln (n) - 2 l . \end{matrix}

(61)

The CAIC is

\begin{matrix} CAIC = \frac{2 n k}{n - k - 1} - 2 l . \end{matrix}

(62)

The HQIC is

\begin{matrix} HQIC = 2 k \ln (\ln (n)) - 2 l, \end{matrix}

(63)

where l represents the log-likelihood function evaluated as the MLEs, n denotes the sample size, and k denotes the number of model parameters. The goodness-of-fit measures under consideration are as follows.

The Anderson–Darling (A^∗) test statistic is given by

\begin{matrix} A^{*} = - n - \frac{1}{n} \sum_{i = 1}^{n} (2 l - 1) \times [\ln G (X_{i}) + \ln \{1 - G (X_{n - i + 1})\}] . \end{matrix}

(64)

The Cramer-von Mises (W^∗) test statistic is given by

\begin{matrix} W^{*} = \frac{1}{12 n} + \sum_{i = 1}^{n} {[\frac{2 i - 1}{2 n} + G (X_{i})]}^{2}, \end{matrix}

(65)

where x_i is the ith observation in the sample and n is the sample size; x_i is calculated when the data is sorted in ascending order.

The best model is the one with the lowest AIC, BIC, CAIC, and HQIC, as well as the A^∗, W^∗, and K-S tests. Moreover, the best model is also chosen as the one having the highest value of the log-likelihood function, and p values for the K-S statistics are also used to compare the competitive models.

6.1. Likelihood Ratio Test for Submodels

The GLL distribution has five submodels, namely, log-logistic distribution, Weibull distribution, Burr XII distribution, exponential distribution, and the standard log-logistic distribution. Hence, we have employed the likelihood ratio criterion to test the following hypotheses:

H₀ : η^α⟶0; that is, the sample is from Weibull distribution. $H_{1} : η^{α} \underset{not}{⟶} 0$ ; that is, the sample is GLL
H₀ : η=k; that is, the sample is from log-logistic distribution. H₁ : η ≠ k; that is, the sample is GLL
H₀ : kλ^−(1/α), λ > 0; that is, the sample is from Burr XII distribution. H₁ : kλ^−(1/α), λ ≤ 0; that is, the sample is GLL
H₀ : η=k=1; that is, the sample is from the standard log-logistic distribution. H₁ : η ≠ 1, k ≠ 1; that is, the sample is GLL
H₀ : η=0&α=1; that is, the sample is from an exponential distribution. H₁ : η ≠ 0&α ≠ 1; that is, the sample is GLL

The likelihood ratio test (LRT) is given by

\begin{matrix} LR = - 2 \ln \frac{(L ({\hat{θ}}^{*}; x))}{(L (\hat{θ}; x))}, \end{matrix}

(66)

where ${\hat{θ}}^{*}$ represents the restricted Maximum likelihood estimates under the null hypothesis H₀ and $\hat{θ}$ represents the unrestricted Maximum likelihood estimates under the alternative hypothesis H₁. Under the null hypothesis, the LRT follows Chi-square distribution with degrees of freedom (df) (df_alt − df_null). If the p value is less than 0.05, the null hypothesis is rejected.

6.2. An Application to Bladder Cancer Data Set

The following real-world data set is used to demonstrate the proposed methodology. The data in Table 5 below show the remission times (in months) of a sample of 128 bladder cancer patients. The data set is available in [36]. The descriptive statistics for the data set are shown in Table 6 and the likelihood ratio test statistics for the data set are given in Table 7.

Table 5.

The remission times (in months) of a sample of 128 bladder cancer patients.

3.88, 5.32, 7.39, 10.34, 14.83, 34.26, 0.90, 2.69, 4.18, 5.34, 7.59, 10.66, 15.96, 36.66, 1.05, 2.69, 4.23, 5.41, 7.62, 10.75, 16.62, 43.01, 1.19, 2.75, 4.26, 5.41, 7.63, 17.12, 46.12, 1.26, 2.83, 4.33, 5.49, 7.66, 11.25, 17.14, 79.05, 1.35, 2.87, 5.62, 7.87, 11.64, 17.36, 1.40, 3.02, 4.34, 5.71, 7.93, 0.08, 2.09, 3.48, 4.87, 6.94, 8.66, 13.11, 23.63, 0.20, 2.23, 3.5, 4.98, 6.97, 9.02, 13.29, 0.40, 2.26, 3.57, 5.06, 7.09, 9.22, 13.80, 25.74, 0.50, 2.46, 3.64, 5.09, 7.26, 9.47, 14.24, 25.82, 0.51, 2.54, 3.70, 5.17, 7.28, 9.74, 14.76, 26.31, 0.81, 2.62, 3.82, 5.32, 7.32, 10.06, 14.77, 32.15, 2.64, 11.79, 18.10, 1.46, 4.40, 5.85, 8.26, 11.98, 19.13, 1.76, 3.25, 4.50, 6.25, 8.37, 12.02, 2.02, 3.31, 4.51, 6.54, 8.53, 12.03, 20.28, 2.02, 3.36, 6.76, 12.07, 21.73, 2.00, 3.36, 6.93, 8.65, 12.63, 22.69.

Open in a new tab

Table 6.

Descriptive statistics of data set I.

Mean	Median	Mode	Variance	Skewness	Kurtosis	Minimum	Maximum
9.365	6.395	5	110.435	3.286	15.481	0.08	79.05

Open in a new tab

Table 7.

Likelihood ratio test statistic for data set I.

Distribution	Hypothesis	LRT	p values
W2	H ₀ : η^α⟶0 vs H₁ : H₀ is false	8.676	0.003
LL2	H ₀ : η^α=k vs H₁ : H₀ is false	10.819	0.001
Burr XII	H ₀ : kλ^−(1/α), λ > 0 vs H₁ : H₀ is false	87.472	<0.001
Ex	H ₀ : η=0&α=1 vs H₁ : H₀ is false	9.182	0.010
Standard LL	H ₀ : η=k=1 vs H₁ : H₀ is false	190.150	<0.001

Open in a new tab

For data set I, the asymptotic variance-covariance matrix for the estimated GLL parameters is given by

\begin{matrix} J^{- 1} = [\begin{matrix} 3.0929 \times 10^{- 4} & 1.7255 \times 10^{- 3} & 5.8513 \times 10^{- 4} \\ 1.7255 \times 10^{- 3} & 3.1612 \times 10^{- 2} & 5.9347 \times 10^{- 3} \\ 5.8513 \times 10^{- 4} & 5.9347 \times 10^{- 3} & 1.5958 \times 10^{- 3} \end{matrix}] . \end{matrix}

(67)

The information criterion values in Table 8 and the goodness-of-fit tests in Table 9 both demonstrate the superiority of the proposed model over the other competing models.

Table 8.

Information criterion for data set I.

Distribution	AIC	BIC	CAIC	HQIC
GLL	825.564	834.120	825.756	829.040
LN3	826.723	835.279	826.916	830.199
LL2	826.937	835.641	827.033	829.254
ExpW	827.393	835.949	827.586	830.869
LL3	827.458	836.014	827.651	830.934
G3	831.955	840.511	832.148	835.431
W2	832.163	837.868	832.259	834.481
W3	832.665	841.221	832.858	836.141
Burr XII	910.959	916.663	911.055	913.276

Open in a new tab

Table 9.

MLE estimators of the model parameters, the log-likelihood, and goodness-of-fit statistics for data set I.

Distributions	Estimates (SEs)	ℓ	W ^∗	A ^∗	K − S (p value)
GLL (α, kη)	α = 1.410 (0.174)	−409.78	0.019	0.128	0.034 (0.999)
	k = 0.134 (0.017)
	η = 0.077 (0.038)

ExpW (α, kλ)	α = 0.275 (0.146)	−410.70	0.045	0.291	0.044 (0.967)
	k = 0.676 (0.136)
	λ = 2.636 (1.161)

LL3 (α, β, γ)	α = 0.535 (0.061)	−410.73	0.019	0.135	0.038 (0.993)
	β = 1.863 (0.106)
	μ = −0.293 (0.358)

LN3 (α, β, γ)	α = 0.877 (0.090)	−410.36	0.017	0.115	0.029 (0.998)
	β = 1.925 (0.111)
	μ = −0.623 (0.372)

G3 (α, β, γ)	α = 1.098 (0.134)	−412.98	0.125	0.778	0.067 (0.618)
	β = 8.424 (1.238)
	μ = 0.075 (0.018)

W3 (α, β, γ)	α = 1.031 (0.072)	−413.33	0.134	0.839	0.080 (0.387)
	β = 9.743 (0.908)
	μ = 0.077 (0.013)

W2 (α, β)	α = 1.049 (0.068)	−414.08	0.131	0.784	0.071 (0.545)
W2 (α, β)	k = 9.576 (0.854)	−414.08	0.131	0.784	0.071 (0.545)

BXII (α, β)	α = 2.342 (0.356)	−453.48	0.752	4.564	0.251 (<0.005)
BXII (α, β)	k = 0.233 (0.040)	−453.48	0.752	4.564	0.251 (<0.005)

LL2 (α, β)	α = 0.578 (0.043)	−411.47	0.043	0.310	0.041 (0.984)
LL2 (α, β)	k = 1.805 (0.088)	−411.47	0.043	0.310	0.041 (0.984)

Open in a new tab

The estimated pdf and CDF of the proposed distribution corresponding to the real-world data set are shown in Figure 12 and the Kaplan–Meier and PP plots for the proposed distribution are shown in Figure 13.

Estimated pdf and CDF of the GLL distribution corresponding to data set I.

PP and Kaplan–Meier plots of the GLL distribution corresponding to data set I.

6.2.1. TTT Plot

The total time test (TTT) plot plays a central role in determining the best model to fit the given data in terms of the hazard rates. This plot depicts the various forms of the hazard rate. A straight line on the TTT plot indicates that the given data has a constant hazard rate. If the plot is convex, the hazard rate will be decreased; if it is concave, the hazard rates will be increased. The plot for the bathtub shape is first convex and then concave. Similarly, if the hazard rate has an inverted bathtub shape, it will increase first (or concave) and then decrease (or convex). The TTT plot is calculated by using the following formula:

\begin{matrix} G (\frac{r}{n}) = \frac{\sum_{i = 1}^{r} x_{i : n} + (n - r) x_{i : n}}{\sum_{i = 1}^{r} x_{i : n}}, r = x_{i : n} = 1,2, \dots, n, \end{matrix}

(68)

where x_i:n are the order statistics.

The TTT and box plots of the data set are presented in Figure 14. These plots indicate that the empirical hazard rate function of the 1st data set is bathtub shape, monotonically increasing.

The estimated fitted pdfs and CDFs of data set I for the competitive models are shown in Figure 15.

Some estimated fitted densities and cumulative functions of data set I.

7. Bayesian Model Formulation

Given a set of data x=(x₁, x₂, …, x_n) from GLL (α, k, η), the likelihood function of the model is given by

\begin{matrix} L (α, k, η | x) = {(α k)}^{n} \prod_{i = 1}^{n} {(k x_{i})}^{α - 1} \prod_{i = 1}^{n} {[1 + {(η x_{i})}^{α}]}^{- ((k^{α} / η^{α}) + 1)} . \end{matrix}

(69)

The Bayesian model is built by specifying the prior distribution for the model parameters α, k and η and then multiplying with the likelihood function L(α, k, η|x) for the given data x=(x₁, x₂, …, x_n) to obtain the posterior distribution function using the Bayes theorem. The prior distribution of α, k and η is denoted as p(α, k, η).

The joint posterior is

\begin{matrix} p (α, k, η | x) \propto L (α, k, η | x) p (α, k, η) . \end{matrix}

(70)

7.1. Prior Distribution

We assumed independent noninformative gamma priors for the parameters of the proposed model in this study due to the flexibility of gamma distributions in accommodating many possible shapes for the types of parameters involved in the proposed distribution. Furthermore, they enable efficient posterior calculations and the recovery of the noninformative distribution for each parameter. Many research papers in the literature consider taking these priors into account (see [28, 37–41]).

For the model parameters, we assume independent gamma priors: α ~ G(a₁, b₁), k ~ G(a₂, b₂), and η ~ G(a₃, b₃).

\begin{matrix} p (α) = \frac{b_{1}^{a_{1}}}{Γ (a_{1})} α^{a_{1} - 1} \exp (- b_{1} α), α > 0, a_{1} > 0, b_{1} > 0, \\ p (k) = \frac{b_{2}^{a_{2}}}{Γ (a_{2})} k^{a_{2} - 1} \exp (- b_{2} k), α > 0, a_{2} > 0, b_{2} > 0, \\ p (η) = \frac{b_{3}^{a_{3}}}{Γ (a_{3})} η^{a_{3} - 1} \exp (- b_{3} η), η > 0, a_{3} > 0, b_{3} > 0. \end{matrix}

(71)

Hence, we have

\begin{matrix} p (α, k, η) = p (α) p (k) p (η) . \end{matrix}

(72)

7.2. Posterior Distribution

The posterior expression can be obtained, up to proportionality, by multiplying the likelihood by the prior, and this can be written as

\begin{matrix} p (α, k, η | x) \propto α^{a_{1} + n - 1} k^{a_{2} + n - 1} η^{a_{3} + n - 1} e^{- (b_{1} α + b_{2} k + b_{3} η)} L_{1}, \end{matrix}

(73)

where

\begin{matrix} L_{1} = {(α k)}^{n} \prod_{i = 1}^{n} {(k x_{i})}^{α - 1} \prod_{i = 1}^{n} {[1 + {(η x_{i})}^{α}]}^{- ((k^{α} / η^{α}) + 1)} . \end{matrix}

(74)

The posterior is complicated, and there are no closed-form inferences. As a result, we, propose using McMC techniques to simulate samples from the posterior, allowing for simple sample-based inferences.

7.3. Gibbs Sampler: Algorithm

Markov chains require a stationary distribution in order to perform Markov chain Monte Carlo calculations. These chains can be built in a variety of ways. Over the last decade, the following Monte Carlo sampling techniques for assessing high-dimensional posterior integrals have already been developed. Others are Metropolis-Hastings's sampling, Monte Carlo importance sampling, Gibb's sampling, and others. The most popular McMC sampling algorithm in the Bayesian survival inference computation literature is Gibbs' sampling, which is primarily a special case of Metropolis-Hastings's sampling. Gibb's sampling is preferred in high-dimensional numerical computation.

By using Gibbs's sampling, we only need to know the full conditional distribution. To carry out Gibbs's sampling, the basic scheme is as follows:

Step 1: compute the posterior distribution, up to proportionality, and specify the full conditionals, using equation (71), of the model parameters α, η and k as follows.
- (i)
  Full conditional of α given η, k and x:
  $\begin{matrix} p (α | η, k, x) \propto α^{a_{1} + n - 1} e^{- (b_{1} α)} L_{1} . \end{matrix}$ (75)
- (ii)
  Full conditional of k given α, η and x:
  $\begin{matrix} p (k | α, η, x) \propto k^{a_{2} + n - 1} e^{- (b_{2} k)} L_{1} . \end{matrix}$ (76)
- (iii)
  Full conditional of η given α, k and x:
  $\begin{matrix} p (η | α, k, x) \propto η^{a_{3} + n - 1} e^{- (b_{3} η)} L_{1} . \end{matrix}$ (77)
Step 2: select an initial value θ⁽⁰⁾=(α⁽⁰⁾, k⁽⁰⁾, η⁽⁰⁾) to start the chain.
Step 3: suppose that, at the ith step, θ=(α, η, k) takes the value θ⁽ⁱ⁾=(α⁽ⁱ⁾, k⁽ⁱ⁾, η⁽ⁱ⁾); then, from full conditionals, generate
$\begin{matrix} α^{(i + 1)}, from p (α | k^{(i)}, η^{(i)}, x), \\ k^{(i + 1)}, from p (k | α^{(i + 1)}, η^{(i)}, x), \\ η^{(i + 1)}, from p (η | α^{(i + 1)}, k^{(i + 1)}, x) . \end{matrix}$ (78)
Step 4: this completes a transition from θ⁽ⁱ⁾ to θ⁽ⁱ⁺¹⁾.
Step 5: repeat Step 3 N times.

8. Bayesian Analysis

In this work, we assumed the independent gamma priors for α ~ G(a₁, b₁), k ~ G(a₂, b₂), and η ~ G(a₃, b₃) with hyperparameter values (a₁=b₁=a₂=b₂=a₃=b₃=1.0).

8.1. Convergence Diagnostics

The proposed model is built with the goal of calculating Bayesian estimates for GLL parameters using the McMC method. Due to the Ergodic property of the Markov chain, all inferences are based on the assumption that it will converge. Hence, the McMC convergence diagnostic is crucial. If the simulated sample gives an acceptable approximation for the posterior density, the inferences are correct. Several convergence diagnostic analyses are used to determine whether the chains have converged, including the following.

8.1.1. Geweke's Convergence Diagnostic

Geweke's diagnostic, also called Geweke's z-score diagnostic, focuses on comparing the first and last parts of a chain. It is, in fact, a frequentist comparison, of means, with 95 percent of the values falling between −2 and 2, as proposed by [42]. All three values of the three parameters for the three chains in Figure 16 are between −2 and 2.

Geweke's diagnostic plot for alpha, eta, and kappa parameters.

8.1.2. Autocorrelation Diagnostics

The autocorrelation plot for the parameters is shown in Figure 17.

Autocorrelation plot for the alpha, eta, and kappa parameters.

8.1.3. Heidelberger and Welch's Convergence Diagnostic

Schruben [43] and Schruben et al. [44] proposed detecting nonstationarity in simulation output using a spectral analysis approach to estimate the sample mean variance. They applied the Cramer-von Mises statistic and Brownian bridge theory to test the null hypothesis of stationarity of the Markov chain.

Heidelberger and Welch [45] applied the aforementioned test to introduce a comprehensive method for generating a confidence interval of a predetermined width for the mean of a parameter when the chain has an initial transient (a state when the algorithm has not reached stationarity yet). They computed a test statistic (based on the Cramer-von Mises test statistic) to reject or accept the null hypothesis that the Markov chain belongs to a stationary distribution. A single chain was subjected to diagnostic.

8.1.4. Raftery and Lewis's Diagnostic

Raftery and Lewis [46, 47] proposed “a method for a single chain that tests for chain convergence to the target distribution and estimates the run-lengths required to properly estimate quantiles of functions of the parameters.”

In this study, we applied a quantile of interest (0.025), the desired level of accuracy of ±0.0005, and a probability of 0.95 to attain the indicated degree of accuracy.

8.1.5. Brooks–Gelman–Rubin (BGR) Convergence Diagnostic

The fact that the lines for all of the parameters are close to 1 indicates convergence from BGR plots as shown in Figure 18.

BGR plots for alpha, eta, and kappa parameters.

In this section, a summary of some common statistical convergence diagnostics tests is provided in Table 10.

Table 10.

Summary of some statistical convergence diagnostic tests.

Parameter	Geweke's diagnostic	Raftery and Lewis	Heidelberger-Welch
Parameter	Pr > \|z\|	Total no. of samp.	p value	Stationarity test	Halfwidth test
Alpha	−1.1992	3823	0.072	Passed	Passed
Eta	−0.5711	4338	0.690	Passed	Passed
Kappa	0.4144	4106	0.980	Passed	Passed

Open in a new tab

8.1.6. Ergodic Mean (Running Mean) Plot

The running mean, also known as the ergodic mean, is the average of all samples up to and including a specific iteration. It is used to observe the McMC chains' convergence pattern. Figure 19 shows a time-series graph of each parameter and it displays the running mean (or ergodic mean) plots for the three parameters of the GLL distribution. The running mean plots of alpha, eta, and kappa show that the chains converge to the values in Table 11 after N iterations.

The ergodic mean plots for alpha, eta, and kappa.

Table 11.

Numerical summaries of posterior properties for the GLL model with gamma priors based on an McMC sample.

Characteristics	Chain 1			Chain 2			Chain 3
Characteristics	α	η	k	α	η	k	α	η	k
Mean	1.444	0.094	0.144	1.437	0.093	0.144	1.441	0.093	0.143
SD	0.175	0.041	0.018	0.172	0.040	0.018	0.174	0.040	0.018
Naïve SE	0.002	0.001	0.0002	0.002	0.001	0.0002	0.002	0.001	0.0002
Time-series SE	0.003	0.0003	0.0003	0.003	0.0003	0.0004	0.003	0.001	0.0003
MC error	0.001	0.0004	0.0001	0.001	0.0003	0.0002	0.002	0.002	0.0001
Minimum	0.967	0.003	0.090	0.933	0.001	0.090	0.965	0.001	0.088
2.5th percentile	1.139	0.027	0.112	1.134	0.025	0.112	1.134	0.024	0.112
Q1	1.319	0.065	0.131	1.316	0.064	0.131	1.316	0.064	0.131
Medium (Q2)	1.432	0.090	0.142	1.425	0.088	0.143	1.425	0.090	0.142
Q3	1.550	0.118	0.155	1.548	0.117	0.155	1.553	0.118	0.155
97.5th percentile	1.825	0.184	0.182	1.820	0.180	0.181	1.820	0.180	0.183
Maximum	2.240	0.307	0.238	2.087	0.281	2.392	2.233	0.290	0.233
Mode	1.450	0.090	0.145	1.450	0.090	0.145	1.350	0.090	0.145
Variance	0.031	0.002	0.0003	1.636	0.002	0.0003	0.030	0.002	0.0003
Skewness	0.463	0.655	0.484	0.372	0.514	0.375	0.365	0.524	0.423
Kurtosis	0.372	0.821	0.612	0.153	0.344	0.301	0.044	0.420	0.412
95% credible interval	(1.139, 1.825)	(0.027, 0.184)	(0.112, 0.182)	(1.134, 1.820)	(0.025, 0.180)	(0.112, 0.181)	(1.134, 1.820)	(0.024, 0.180)	(0.112, 0.183)
95% HPD interval	(1.113, 1.784)	(0.021, 0.174)	(0.112, 0.181)	(1.104, 1.764)	(0.021, 0.171)	(0.110, 0.179)	(1.107, 1.779)	(0.019, 0.172)	(0.108, 0.178)

Open in a new tab

8.2. Posterior Analysis

In this section, we present numerical and visual summaries of the posterior distribution for each of the three chains. The joint posterior distribution for the proposed model was estimated using the JAGS software [48]. For each proposed model, we ran three parallel chains with 50,000 iterations and a burn-in of 5,000. Chains were thinned by storing every fifth iteration to reduce autocorrelation in the sample. The use of various convergence diagnostic tools ensured convergence to the joint posterior.

8.2.1. Numerical Summary

We have considered different quantities of interest and their numeric data based on an McMC sample of posterior properties for generalized log-logistic distribution. The McMC simulation results include the results of of the posterior mean, posterior standard deviation, naïve standard error, time-series standard error, Markov chain error, the posterior five-point summary statistics (minimum, lower quartile (Q1), median (Q2), upper quartile (Q3), and maximum), the posterior skewness, posterior kurtosis, 2.5th percentile, 97.5th percentile, and the credible interval followed by the highest probability density (HPD).

The naïve standard error is defined as a measure of simulation error in the mean rather than posterior uncertainty.

\begin{matrix} naive SE = \frac{posterior SD}{\sqrt{n}} . \end{matrix}

(79)

The time-series SE adjusts the “naïve” SE for autocorrelation.

8.2.2. Visual Summary

In this subsection, we have considered different graphs for a visual summary of the posterior properties; those include the box plot, density strip plots, histogram, and trace plots for the parameters. These graphs and plots provide a nearly complete picture of the parameters' posterior uncertainty [49]. We applied the posterior sample (α^(j), k^(j) and η^(j)), j=1,…, 15000, to draw these graphs.

(1) Box Plots. The boxes in Figure 20 represent interquartile ranges, and the line in the middle of each box is the median; the arms of each box extend to encompass the central 95 percent of the distribution, and their ends thus correspond to the 2.5 percent and 97.5 percent quartiles, respectively.

The box plots for the alpha, eta, and kappa parameters.

(2) Density and Histogram Plots. Histogram can provide information about the behaviour in the tails, skewness, data outliers, and the presence of multimodal behaviour. The graphs in Figure 21 can provide us with a nearly complete picture of the posterior uncertainty about the GLL parameters, while the graphs in Figure 22 show a comparison of the full density and partial density of the parameters.

Kernel density estimate and the histogram plots for alpha, eta, and kappa parameters.

Density plots for the parameters comparing the whole chains with their last parties.

(3) Trace Plots. A trace plot, also known as “a time-series plot,” is a representation of the iteration number versus the value of the parameter drawn at each iteration. Because the plots do not show long-term increasing or decreasing trends but rather resemble a horizontal band in Figure 23, we can conclude that the chains have converged.

The trace plots for alpha, eta, and kappa parameters.

9. Conclusions

This work introduced and presented results on the mathematical and statistical properties of the generalized log-logistic distribution. The GLL model contains several parametric survival submodels that could be used in a variety of statistics and probability applications. Statistical properties such as quantile function and their related results, moments and their related results, r^th central moments, and residual and reversed residual life were derived. We have also considered the Bayesian and classical inference of the unknown parameters of the proposed distribution when the data is uncensored or complete. The Bayesian estimates are obtained using the Gibbs sampling method under the assumption of independent gamma priors on the shape and scale parameters. It is worth noting that when prior information is available, Bayes estimates clearly outperform maximum likelihood estimates. To assess the behaviour of the estimators, Monte Carlo simulations are run. The proposed distribution was also applied to a real-world data set and provided a better fit than its submodels and other common parametric survival distributions based on goodness-of-fit statistics, log-likelihood function, and information criterion values. As a result, we conclude that the GLL is the most appropriate model among the distributions considered and it is a very competitive model for explaining lifetime phenomena.

This work has numerous potential extensions. In practice, for example, the presence of explanatory variables and long-term survivals is common. Furthermore, a regression model for both complete and incomplete (or censored) data could be beneficial. As a result, our framework can be further researched in these contexts. The GLL distribution could also be useful in studies comprising survival models such as accelerated failure time, competing risks, mixture cure, frailty, multiple states, and joint survival models, as well as longitudinal data.

Acknowledgments

This paper was supported by Taif University Researchers Supporting Project (no. TURDP-2020/253), Taif University, Taif, Saudi Arabia.

Data Availability

The data used to support the findings of this study are included within the article.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Supplementary Materials

The supplementary materials are available in the supplementary file.

Click here for additional data file.^{(16.9KB, docx)}

References

1.Gupta R. C., Akman O., Lvin S. A study of log-logistic model in survival analysis. Biometrical Journal . 1999;41(4):431–443. doi: 10.1002/(sici)1521-4036(199907)41:4<431::aid-bimj431>3.0.co;2-u. [DOI] [Google Scholar]
2.Tahir M. H., Mansoor M., Zubair M., Hamedani G. G., Science C. McDonald log-logistic distribution with an application to breast cancer data. Journal of Statistical Theory and Applications . 2014;13(1):65–82. doi: 10.2991/jsta.2014.13.1.6. [DOI] [Google Scholar]
3.Singh K. P., Lee C. M.-S., George E. O. On generalized log-logistic model for censored survival data. Biometrical Journal . 1988;30(7):843–850. doi: 10.1002/bimj.4710300714. [DOI] [Google Scholar]
4.Bennett S. Log-logistic regression models for survival data. Applied Statistics . 1983;32(2):165–171. doi: 10.2307/2347295. [DOI] [Google Scholar]
5.Kalbfleisch J. D., Prentice R. L. Marginal likelihoods based on cox’s regression and life model. Biometrika . 1973;60(2):267–278. doi: 10.1093/biomet/60.2.267. [DOI] [Google Scholar]
6.Khan S. A., Khosa S. K. Generalized log-logistic proportional hazard model with applications in survival analysis. Journal of Statistical Distributions and Applications . 2015;3(1) doi: 10.1186/s40488-016-0054-z. [DOI] [Google Scholar]
7.Alfaer N. M., Gemeay A. M., Aljohani H. M., Afify A. Z. The extended log-logistic distribution: inference and actuarial applications. Mathematics . 2021;9(12):p. 1386. doi: 10.3390/math9121386. [DOI] [Google Scholar]
8.Aldahlan M. A. Alpha power transformed log-logistic distribution with application to breaking stress data. Advances in Mathematical Physics . 2020;2020:9. doi: 10.1155/2020/2193787.2193787 [DOI] [Google Scholar]
9.Malik A. S., Ahmad S. P. An extension of log-logistic distribution for analyzing survival data. Pakistan Journal of Statistics and Operation Research . 2020;16(4):789–801. doi: 10.18187/pjsor.v16i4.2961. [DOI] [Google Scholar]
10.Adeyinka F. S. On transmuted four parameters generalized log-logistic distribution. International Journal of Statistical Distributions and Applications . 2019;5(2):p. 32. doi: 10.11648/j.ijsd.20190502.12. [DOI] [Google Scholar]
11.Granzotto D. C. T., Louzada F. The transmuted log-logistic distribution: modeling, inference, and an application to a polled tabapua race time up to first calving data. Communications in Statistics-Theory and Methods . 2015;44(16):3387–3402. doi: 10.1080/03610926.2013.775307. [DOI] [Google Scholar]
12.Shakhatreh M. K. A new three-parameter extension of the log-logistic distribution with applications to survival data. Communications in Statistics-Theory and Methods . 2018;47(21):5205–5226. doi: 10.1080/03610926.2017.1388399. [DOI] [Google Scholar]
13.Lima S. R., Cordeiro G. M. The extended log-logistic distribution: properties and application. Anais da Academia Brasileira de Ciências . 2017;89(1):3–17. doi: 10.1590/0001-3765201720150579. [DOI] [PubMed] [Google Scholar]
14.Mendoza N. V. R., Ortega E. M. M., Cordeiro G. M. The exponentiated-log-logistic geometric distribution: dual activation. Communications in Statistics-Theory and Methods . 2016;45(13):3838–3859. doi: 10.1080/03610926.2014.909937. [DOI] [Google Scholar]
15.Oluyede B., Foya S., Warahena-Liyanage G., Huang S. The log-logistic weibull distribution with applications to lifetime data. Austrian Journal of Statistics . 2016;45(3):43–69. doi: 10.17713/ajs.v45i3.107. [DOI] [Google Scholar]
16.Lemonte A. J. The beta log-logistic distribution. Brazilian Journal of Probability and Statistics . 2014;28(3):313–332. doi: 10.1214/12-bjps209. [DOI] [Google Scholar]
17.Aryal G. R. Transmuted log-logistic distribution. Journal of Statistics Applications & Probability . 2013;2(1):11–20. doi: 10.12785/jsap/020102. [DOI] [Google Scholar]
18.Gui W. Marshall-olkin extended log-logistic distribution and its application in minification processes. Applied Mathematical Sciences . 2013;7(77–80):3947–3961. doi: 10.12988/ams.2013.35268. [DOI] [Google Scholar]
19.Ramos M. W. A. The zografos-balakrishnan log-logistic distribution: properties and applications. Journal of Statistical Theory and Applications . 2013;12(3):244–255. doi: 10.2991/jsta.2013.12.3.2. [DOI] [Google Scholar]
20.Rosaiah K., Nagarjuna K. M., Siva Kumar D. C. U., Rao B. S. Exponential–log logistic additive failure rate model. International Journal of Scientific and Research Publications, . 2014;4(1):2250–3153. [Google Scholar]
21.Muse A. H., Mwalili S. M., Ngesa O. On the log-logistic distribution and its generalizations: a survey. International Journal of Statistics and Probability . 2021;10(3):p. 93. doi: 10.5539/ijsp.v10n3p93. [DOI] [Google Scholar]
22.dos Santos C., Granzotto D., Tomazella V., Louzada F. Hierarchical transmuted log-logistic model: a subjective bayesian analysis. Journal of Risk and Financial Management . 2018;11(1):p. 13. doi: 10.3390/jrfm11010013. [DOI] [Google Scholar]
23.Yahaya A., Dewu M. W. Bayesian estimation of scale parameter of the log-logistic distribution under the assumption of chi-square and maxwell. ATBU Journal of Science, Technology and Education . 2016;4(3):39–46. [Google Scholar]
24.Abbas K., Tang Y. Objective bayesian analysis for log-logistic distribution. Communications in Statistics-Simulation and Computation . 2016;45(8):2782–2791. doi: 10.1080/03610918.2014.925925. [DOI] [Google Scholar]
25.Al-Shomrani A. A., Shawky A. I., Arif O. H., Aslam M. Log-logistic distribution for survival data analysis using MCMC. Springerplus . 2016;5(1) doi: 10.1186/s40064-016-3476-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Guure C. B., Ibrahim N. A., Dwomoh D., Bosomprah S. Bayesian statistical inference of the loglogistic model with interval-censored lifetime data. Journal of Statistical Computation and Simulation . 2015;85(8):1567–1583. doi: 10.1080/00949655.2014.881813. [DOI] [Google Scholar]
27.Kang S. G., Lee K., Lee W. D. Noninformative priors for the generalized half-normal distribution. Journal of the Korean Surgical Society . 2014;43(1):19–29. doi: 10.1016/j.jkss.2013.06.003. [DOI] [Google Scholar]
28.Chaudhary A. K., Kumar V. Bayesian estimation of three-parameter exponentiated log-logistic distribution. International Journal of Statistika and Mathematika . 2014;9(2):66–81. [Google Scholar]
29.Akhtar M., Khan A., Akhtar M. T., Khan A. A. A log-logistic distribution as a reliability model: a bayesian analysis. American Journal of Mathematics and Statistics . 2014;4(3):162–170. [Google Scholar]
30.Chaudhary A. K. Bayesian analysis of two-parameter exponentiated log-logistic distribution. Pravaha . 2007;25(1):1–12. [Google Scholar]
31.Prentice R. L. A generalization of the probit and logit methods for dose response curves. Biometrics . 1976;32(4):761–768. doi: 10.2307/2529262. [DOI] [PubMed] [Google Scholar]
32.Cox D., Oakes D. Analysis of Survival Data . 4. Vol. 21. Boca Raton, FL, USA: Chapman and Hall/CRC; 1984. [Google Scholar]
33.Block H. W., Savits T. H., Singh H. The reversed hazard rate function. Probability in the Engineering and Informational Sciences . 1998;12(1):69–90. doi: 10.1017/s0269964800005064. [DOI] [Google Scholar]
34.Gupta R. C., Wu H. Analyzing survival data by proportional reversed hazard model. International Journal of Reliability and Applications . 2001;2(1):1–26. [Google Scholar]
35.Midhu N. N., Sankaran P. G., Unnikrishnan Nair N. A class of distributions with the linear mean residual quantile function and it’s generalizations. Statistical Methodology . 2013;15:1–24. doi: 10.1016/j.stamet.2013.03.002. [DOI] [Google Scholar]
36.Lee E. T., Wang J. Statistical Methods for Survival Data Analysis . Vol. 476. Hoboken, NJ, USA: John Wiley & Sons; 2003. [Google Scholar]
37.Alvares D., Rubio F. J. A tractable bayesian joint model for longitudinal and survival data. Statistics in Medicine . 2021;40(19):4213–4229. doi: 10.1002/sim.9024. [DOI] [PubMed] [Google Scholar]
38.Alvares D., Lázaro E., Gómez-Rubio V., Armero C. Bayesian survival analysis with BUGS. Statistics in Medicine . 2021;40(12):2975–3020. doi: 10.1002/sim.8933. [DOI] [PubMed] [Google Scholar]
39.Lesaffre E., Lawson A. B. Bayesian Biostatistics . Hoboken, NJ, USA: John Wiley & Sons; 2012. [Google Scholar]
40.Christensen R., Johnson W., Branscum A., Hanson T. E. Bayesian Ideas and Data Analysis: An Introduction for Scientists and Statisticians . Boca Raton, FL, USA: CRC Press; 2010. [Google Scholar]
41.Alvares D., Rubio F. J. A tractable Bayesian joint model for longitudinal and survival data. Statistics in Medicine . 2021;40(19):4213–4229. doi: 10.1002/sim.9024. [DOI] [PubMed] [Google Scholar]
42.Geweke J. Evaluating the accuracy of sampling-based approaches to the calculations of posterior moments. Bayesian Statistics . 1992;4:641–649. [Google Scholar]
43.Schruben L. W. Detecting initialization bias in simulation output. Operations Research . 1982;30(3):569–590. doi: 10.1287/opre.30.3.569. [DOI] [Google Scholar]
44.Schruben L., Singh H., Tierney L. Optimal tests for initialization bias in simulation output. Operations Research . 1983;31(6):1167–1178. doi: 10.1287/opre.31.6.1167. [DOI] [Google Scholar]
45.Heidelberger P., Welch P. D. Simulation run length control in the presence of an initial transient. Operations Research . 1983;31(6):1109–1144. doi: 10.1287/opre.31.6.1109. [DOI] [Google Scholar]
46.Raftery A. E., Lewis S. M. The number of iterations, convergence diagnostics and generic Metropolis algorithms. Practical Markov Chain Monte Carlo . 1995;7(98):763–773. [Google Scholar]
47.Raftery A. E., Lewis S. M. [Practical Markov chain Monte Carlo]: comment: one long run with diagnostics: implementation strategies for Markov chain Monte Carlo. Statistical Science . 1992;7(4):493–497. doi: 10.1214/ss/1177011143. [DOI] [Google Scholar]
48. M. M. Plummer, RJAGS: bayesian graphical models using MCMC, R package, version 4-8, 2019.
49.Fernández-i-Marín X. GGMCMC: analysis of MCMC samples and bayesian inference. Journal of Statistical Software . 2016;70(9) doi: 10.18637/jss.v070.i09. [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

The supplementary materials are available in the supplementary file.

Click here for additional data file.^{(16.9KB, docx)}

Data Availability Statement

The data used to support the findings of this study are included within the article.

[B1] 1.Gupta R. C., Akman O., Lvin S. A study of log-logistic model in survival analysis. Biometrical Journal . 1999;41(4):431–443. doi: 10.1002/(sici)1521-4036(199907)41:4<431::aid-bimj431>3.0.co;2-u. [DOI] [Google Scholar]

[B2] 2.Tahir M. H., Mansoor M., Zubair M., Hamedani G. G., Science C. McDonald log-logistic distribution with an application to breast cancer data. Journal of Statistical Theory and Applications . 2014;13(1):65–82. doi: 10.2991/jsta.2014.13.1.6. [DOI] [Google Scholar]

[B3] 3.Singh K. P., Lee C. M.-S., George E. O. On generalized log-logistic model for censored survival data. Biometrical Journal . 1988;30(7):843–850. doi: 10.1002/bimj.4710300714. [DOI] [Google Scholar]

[B4] 4.Bennett S. Log-logistic regression models for survival data. Applied Statistics . 1983;32(2):165–171. doi: 10.2307/2347295. [DOI] [Google Scholar]

[B5] 5.Kalbfleisch J. D., Prentice R. L. Marginal likelihoods based on cox’s regression and life model. Biometrika . 1973;60(2):267–278. doi: 10.1093/biomet/60.2.267. [DOI] [Google Scholar]

[B6] 6.Khan S. A., Khosa S. K. Generalized log-logistic proportional hazard model with applications in survival analysis. Journal of Statistical Distributions and Applications . 2015;3(1) doi: 10.1186/s40488-016-0054-z. [DOI] [Google Scholar]

[B7] 7.Alfaer N. M., Gemeay A. M., Aljohani H. M., Afify A. Z. The extended log-logistic distribution: inference and actuarial applications. Mathematics . 2021;9(12):p. 1386. doi: 10.3390/math9121386. [DOI] [Google Scholar]

[B8] 8.Aldahlan M. A. Alpha power transformed log-logistic distribution with application to breaking stress data. Advances in Mathematical Physics . 2020;2020:9. doi: 10.1155/2020/2193787.2193787 [DOI] [Google Scholar]

[B9] 9.Malik A. S., Ahmad S. P. An extension of log-logistic distribution for analyzing survival data. Pakistan Journal of Statistics and Operation Research . 2020;16(4):789–801. doi: 10.18187/pjsor.v16i4.2961. [DOI] [Google Scholar]

[B10] 10.Adeyinka F. S. On transmuted four parameters generalized log-logistic distribution. International Journal of Statistical Distributions and Applications . 2019;5(2):p. 32. doi: 10.11648/j.ijsd.20190502.12. [DOI] [Google Scholar]

[B11] 11.Granzotto D. C. T., Louzada F. The transmuted log-logistic distribution: modeling, inference, and an application to a polled tabapua race time up to first calving data. Communications in Statistics-Theory and Methods . 2015;44(16):3387–3402. doi: 10.1080/03610926.2013.775307. [DOI] [Google Scholar]

[B12] 12.Shakhatreh M. K. A new three-parameter extension of the log-logistic distribution with applications to survival data. Communications in Statistics-Theory and Methods . 2018;47(21):5205–5226. doi: 10.1080/03610926.2017.1388399. [DOI] [Google Scholar]

[B13] 13.Lima S. R., Cordeiro G. M. The extended log-logistic distribution: properties and application. Anais da Academia Brasileira de Ciências . 2017;89(1):3–17. doi: 10.1590/0001-3765201720150579. [DOI] [PubMed] [Google Scholar]

[B14] 14.Mendoza N. V. R., Ortega E. M. M., Cordeiro G. M. The exponentiated-log-logistic geometric distribution: dual activation. Communications in Statistics-Theory and Methods . 2016;45(13):3838–3859. doi: 10.1080/03610926.2014.909937. [DOI] [Google Scholar]

[B15] 15.Oluyede B., Foya S., Warahena-Liyanage G., Huang S. The log-logistic weibull distribution with applications to lifetime data. Austrian Journal of Statistics . 2016;45(3):43–69. doi: 10.17713/ajs.v45i3.107. [DOI] [Google Scholar]

[B16] 16.Lemonte A. J. The beta log-logistic distribution. Brazilian Journal of Probability and Statistics . 2014;28(3):313–332. doi: 10.1214/12-bjps209. [DOI] [Google Scholar]

[B17] 17.Aryal G. R. Transmuted log-logistic distribution. Journal of Statistics Applications & Probability . 2013;2(1):11–20. doi: 10.12785/jsap/020102. [DOI] [Google Scholar]

[B18] 18.Gui W. Marshall-olkin extended log-logistic distribution and its application in minification processes. Applied Mathematical Sciences . 2013;7(77–80):3947–3961. doi: 10.12988/ams.2013.35268. [DOI] [Google Scholar]

[B19] 19.Ramos M. W. A. The zografos-balakrishnan log-logistic distribution: properties and applications. Journal of Statistical Theory and Applications . 2013;12(3):244–255. doi: 10.2991/jsta.2013.12.3.2. [DOI] [Google Scholar]

[B20] 20.Rosaiah K., Nagarjuna K. M., Siva Kumar D. C. U., Rao B. S. Exponential–log logistic additive failure rate model. International Journal of Scientific and Research Publications, . 2014;4(1):2250–3153. [Google Scholar]

[B21] 21.Muse A. H., Mwalili S. M., Ngesa O. On the log-logistic distribution and its generalizations: a survey. International Journal of Statistics and Probability . 2021;10(3):p. 93. doi: 10.5539/ijsp.v10n3p93. [DOI] [Google Scholar]

[B22] 22.dos Santos C., Granzotto D., Tomazella V., Louzada F. Hierarchical transmuted log-logistic model: a subjective bayesian analysis. Journal of Risk and Financial Management . 2018;11(1):p. 13. doi: 10.3390/jrfm11010013. [DOI] [Google Scholar]

[B23] 23.Yahaya A., Dewu M. W. Bayesian estimation of scale parameter of the log-logistic distribution under the assumption of chi-square and maxwell. ATBU Journal of Science, Technology and Education . 2016;4(3):39–46. [Google Scholar]

[B24] 24.Abbas K., Tang Y. Objective bayesian analysis for log-logistic distribution. Communications in Statistics-Simulation and Computation . 2016;45(8):2782–2791. doi: 10.1080/03610918.2014.925925. [DOI] [Google Scholar]

[B25] 25.Al-Shomrani A. A., Shawky A. I., Arif O. H., Aslam M. Log-logistic distribution for survival data analysis using MCMC. Springerplus . 2016;5(1) doi: 10.1186/s40064-016-3476-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B26] 26.Guure C. B., Ibrahim N. A., Dwomoh D., Bosomprah S. Bayesian statistical inference of the loglogistic model with interval-censored lifetime data. Journal of Statistical Computation and Simulation . 2015;85(8):1567–1583. doi: 10.1080/00949655.2014.881813. [DOI] [Google Scholar]

[B27] 27.Kang S. G., Lee K., Lee W. D. Noninformative priors for the generalized half-normal distribution. Journal of the Korean Surgical Society . 2014;43(1):19–29. doi: 10.1016/j.jkss.2013.06.003. [DOI] [Google Scholar]

[B28] 28.Chaudhary A. K., Kumar V. Bayesian estimation of three-parameter exponentiated log-logistic distribution. International Journal of Statistika and Mathematika . 2014;9(2):66–81. [Google Scholar]

[B29] 29.Akhtar M., Khan A., Akhtar M. T., Khan A. A. A log-logistic distribution as a reliability model: a bayesian analysis. American Journal of Mathematics and Statistics . 2014;4(3):162–170. [Google Scholar]

[B30] 30.Chaudhary A. K. Bayesian analysis of two-parameter exponentiated log-logistic distribution. Pravaha . 2007;25(1):1–12. [Google Scholar]

[B31] 31.Prentice R. L. A generalization of the probit and logit methods for dose response curves. Biometrics . 1976;32(4):761–768. doi: 10.2307/2529262. [DOI] [PubMed] [Google Scholar]

[B32] 32.Cox D., Oakes D. Analysis of Survival Data . 4. Vol. 21. Boca Raton, FL, USA: Chapman and Hall/CRC; 1984. [Google Scholar]

[B33] 33.Block H. W., Savits T. H., Singh H. The reversed hazard rate function. Probability in the Engineering and Informational Sciences . 1998;12(1):69–90. doi: 10.1017/s0269964800005064. [DOI] [Google Scholar]

[B34] 34.Gupta R. C., Wu H. Analyzing survival data by proportional reversed hazard model. International Journal of Reliability and Applications . 2001;2(1):1–26. [Google Scholar]

[B35] 35.Midhu N. N., Sankaran P. G., Unnikrishnan Nair N. A class of distributions with the linear mean residual quantile function and it’s generalizations. Statistical Methodology . 2013;15:1–24. doi: 10.1016/j.stamet.2013.03.002. [DOI] [Google Scholar]

[B36] 36.Lee E. T., Wang J. Statistical Methods for Survival Data Analysis . Vol. 476. Hoboken, NJ, USA: John Wiley & Sons; 2003. [Google Scholar]

[B37] 37.Alvares D., Rubio F. J. A tractable bayesian joint model for longitudinal and survival data. Statistics in Medicine . 2021;40(19):4213–4229. doi: 10.1002/sim.9024. [DOI] [PubMed] [Google Scholar]

[B38] 38.Alvares D., Lázaro E., Gómez-Rubio V., Armero C. Bayesian survival analysis with BUGS. Statistics in Medicine . 2021;40(12):2975–3020. doi: 10.1002/sim.8933. [DOI] [PubMed] [Google Scholar]

[B39] 39.Lesaffre E., Lawson A. B. Bayesian Biostatistics . Hoboken, NJ, USA: John Wiley & Sons; 2012. [Google Scholar]

[B40] 40.Christensen R., Johnson W., Branscum A., Hanson T. E. Bayesian Ideas and Data Analysis: An Introduction for Scientists and Statisticians . Boca Raton, FL, USA: CRC Press; 2010. [Google Scholar]

[B41] 41.Alvares D., Rubio F. J. A tractable Bayesian joint model for longitudinal and survival data. Statistics in Medicine . 2021;40(19):4213–4229. doi: 10.1002/sim.9024. [DOI] [PubMed] [Google Scholar]

[B42] 42.Geweke J. Evaluating the accuracy of sampling-based approaches to the calculations of posterior moments. Bayesian Statistics . 1992;4:641–649. [Google Scholar]

[B43] 43.Schruben L. W. Detecting initialization bias in simulation output. Operations Research . 1982;30(3):569–590. doi: 10.1287/opre.30.3.569. [DOI] [Google Scholar]

[B44] 44.Schruben L., Singh H., Tierney L. Optimal tests for initialization bias in simulation output. Operations Research . 1983;31(6):1167–1178. doi: 10.1287/opre.31.6.1167. [DOI] [Google Scholar]

[B45] 45.Heidelberger P., Welch P. D. Simulation run length control in the presence of an initial transient. Operations Research . 1983;31(6):1109–1144. doi: 10.1287/opre.31.6.1109. [DOI] [Google Scholar]

[B46] 46.Raftery A. E., Lewis S. M. The number of iterations, convergence diagnostics and generic Metropolis algorithms. Practical Markov Chain Monte Carlo . 1995;7(98):763–773. [Google Scholar]

[B47] 47.Raftery A. E., Lewis S. M. [Practical Markov chain Monte Carlo]: comment: one long run with diagnostics: implementation strategies for Markov chain Monte Carlo. Statistical Science . 1992;7(4):493–497. doi: 10.1214/ss/1177011143. [DOI] [Google Scholar]

[B48] 48. M. M. Plummer, RJAGS: bayesian graphical models using MCMC, R package, version 4-8, 2019.

[B49] 49.Fernández-i-Marín X. GGMCMC: analysis of MCMC samples and bayesian inference. Journal of Statistical Software . 2016;70(9) doi: 10.18637/jss.v070.i09. [DOI] [Google Scholar]

PERMALINK

Bayesian and Classical Inference for the Generalized Log-Logistic Distribution with Applications to Survival Data

Abdisalam Hassan Muse

Samuel Mwalili

Oscar Ngesa

Saad J Almalki

Gamal A Abd-Elmougod

Abstract

1. Introduction

2. The Generalized Log-Logistic Distribution

2.1. Hazard (Failure) Rate Function

Figure 1.

2.2. Submodels

2.2.1. Log-Logistic Distribution

Proposition 1 . —

Proof —

2.2.2. Standard Log-Logistic Distribution

Proposition 2 . —

Proof —

2.2.3. Burr XII Distribution

Proposition 3 . —

Proof —

2.2.4. Weibull Distribution

Proposition 4 . —

Proof —

2.2.5. Exponential Distribution

Proposition 5 . —

Proof —

Table 1.

2.3. The Probability Density Function

Figure 2.

2.4. The Survival (or Reliability) Function

Figure 3.

2.5. Cumulative Distribution Function of the GLL Distribution

Figure 4.

2.6. The Reversed Hazard Rate Function

Figure 5.

2.7. The Cumulative Hazard Function

2.8. The Hazard Rate Average (FRA) Function

3. Some Mathematical Properties of the GLL Distribution

3.1. The Quantile Function and Related Results

Table 2.

Theorem 1 . —

Proof —

3.1.1. Skewness and Kurtosis

3.2. The Random Deviate Generation Functions

3.3. The rth Moments and Related Results

Theorem 2 . —

Proof —

3.3.1. Mean and Variance

Corollary 1 . —

3.4. The rth Central Moments

Corollary 2 . —

3.5. Residual and Reverse Residual Life

Table 3.

Figure 6.

Figure 7.

4. Maximum Likelihood Estimation (MLE)

5. Monte Carlo Simulation Study

Table 4.

Figure 8.

Figure 9.

Figure 10.

Figure 11.

6. Data Analysis

6.1. Likelihood Ratio Test for Submodels

6.2. An Application to Bladder Cancer Data Set

Table 5.

Table 6.

Table 7.

Table 8.

Table 9.

Figure 12.

Figure 13.

6.2.1. TTT Plot

Figure 14.

Figure 15.

7. Bayesian Model Formulation

7.1. Prior Distribution

7.2. Posterior Distribution

3.3. The r^th Moments and Related Results

3.4. The r^th Central Moments