Modeling the survival times of the COVID-19 patients with a new statistical model: A case study from China

Xiaofeng Liu; Zubair Ahmad; Ahmed M Gemeay; Alanazi Talal Abdulrahman; E H Hafez; N Khalil

doi:10.1371/journal.pone.0254999

. 2021 Jul 26;16(7):e0254999. doi: 10.1371/journal.pone.0254999

Modeling the survival times of the COVID-19 patients with a new statistical model: A case study from China

Xiaofeng Liu ¹, Zubair Ahmad ^2,^*, Ahmed M Gemeay ³, Alanazi Talal Abdulrahman ⁴, E H Hafez ⁵, N Khalil ⁶

Editor: Feng Chen⁷

PMCID: PMC8312982 PMID: 34310646

Abstract

Over the past few months, the spread of the current COVID-19 epidemic has caused tremendous damage worldwide, and unstable many countries economically. Detailed scientific analysis of this event is currently underway to come. However, it is very important to have the right facts and figures to take all possible actions that are needed to avoid COVID-19. In the practice and application of big data sciences, it is always of interest to provide the best description of the data under consideration. The recent studies have shown the potential of statistical distributions in modeling data in applied sciences, especially in medical science. In this article, we continue to carry this area of research, and introduce a new statistical model called the arcsine modified Weibull distribution. The proposed model is introduced using the modified Weibull distribution with the arcsine-X approach which is based on the trigonometric strategy. The maximum likelihood estimators of the parameters of the new model are obtained and the performance these estimators are assessed by conducting a Monte Carlo simulation study. Finally, the effectiveness and utility of the arcsine modified Weibull distribution are demonstrated by modeling COVID-19 patients data. The data set represents the survival times of fifty-three patients taken from a hospital in China. The practical application shows that the proposed model out-classed the competitive models and can be chosen as a good candidate distribution for modeling COVID-19, and other related data sets.

1 Introduction

The first outbreak of the current COVID-19 epidemic was first seen in the popular seafood market in the Chinese city of Wuhan, where large numbers of people come to buy or sell seafood. As of December 31, 2019, a total of 27 cases of COVD-19 epidemic were reported by the WMHC (Wuhan Municipal Health Commission). After highly effecting some Chines cities and provinces, this pandemic transmitted to other countries via air routes [1].

Almost every country around the globe has paid a huge price in terms of financial and human loss, and yet, some countries are paying and in sacrifice process. The countries having large population densities and low health facilities have higher chances of being in critical situations during this pandemic [2].

The main serious and common symptoms attributed to COVID-19 are sore throat (13.9%), dry cough (67.7%), Fever (87.9%), shortness of breath (18.6%), headache (13.6%), fatigue (38.1%), sputum production (33.4%), muscle pain (14.8%), nausea (5.0%), nasal congestion (4.8%), haemoptysis (0.9%) and conjunctival congestion (0.8%). For more detail about the symptoms of COVID-19 pandemic; see [3]. The % of symptoms are displayed in Fig 1.

The comparison of COVID-19 epidemic between different countries is worth of studies and is of great concern. In this regard, researchers are devoting great efforts to make comparisons between different countries. For some previous attempts to compare this epidemic in Italy and China; see [4] for details. Comparison of COVID-19 in Europe, USA, and South Korea is provided in [5]. The COVID-19 pandemic in Australia has been discussed in [6].

Modeling the spread of COVID-19 in Lebanon is provided in [7]. A case study form Spain has been studied in [8]. Mathematical analysis of COVID-19 in Mexico is provided in [9]. A case study form Brazil is discussed in [10]. The progress of COVID-19 epidemic in Pakistan is studied by [11]. A mathematical model for COVID-19 transmission dynamics in India is provided by [12]. Al-Babtain et al. [13] introduced two case studies in Saudi Arabia, the first one about COVID-19 infections from 24 March to 12 April, 2020 and the second about numbers of daily recover patients in the same period of time. Comparison of COVID-19 events in Asian countries has been carried out in [14]. The comparison between Iran and mainland China has appeared in [15]. The comparison between the two neighbour countries Iran and Pakistan has appeared in [16]. A case of the COVID-19 pandemic in Indonesia has been discussed in [17]. For more information, reader can refer to [18–27].

In the current situation, it is of great interest to study more about COVID-19 to make comparison between different countries. In the domain and practice of big data science, to provide the best description of the data under consideration is a prominent research topic. The recent studies have pointed out the applicability of statistical models to provide the best description of the random phenomena. In this article, we focus on this research area of distribution theory, and introduce a new statistical model to provide the best fit to data in linked with COVID-19 and other related events.

The modified Weibull distribution is one of the most prominent modifications of the Weibull distribution which is introduced to improve the fitting power of the exponential, Rayleigh, linear failure rate and Weibull distributions; see [28]. We further carry this area of distribution theory and introduce a new prominent version of the modified Weibull distribution to improve its fitting power. A random variable X, is said to follow the modified Weibull distribution with shape parameter α and scale parameters κ₁ and κ₂, if its cdf (cumulative distribution function) denoted F(x;Ξ), is given by

\begin{matrix} F (x; Ξ) = 1 - e^{- κ_{1} x^{α} - κ_{2} x}, x \geq 0, α, κ_{1}, κ_{2} > 0, \end{matrix}

(1)

where Ξ = (α, κ₁, κ₂). The pdf (probability density function) corresponding to expression Eq 1 is

\begin{matrix} f (x; Ξ) = (α κ_{1} x^{α - 1} + κ_{2}) e^{- κ_{1} x^{α} - κ_{2} x}, x > 0 . \end{matrix}

In this article, we focus on proposing a new modification of the modified Weibull distribution called the arcsine modified Weibull (ASM-Weibull) distribution. The ASM-Weibull distribution is introduced by adopting the approach of the arcsine-X distributions of [29], which can be obtained as a sub-case of [30]. The cdf and pdf of the arcsine-X distributions are given, respectively, by

\begin{matrix} G (x) = \frac{2}{π} a r c s i n e (F (x; Ξ)), x \in R . \end{matrix}

(2)

where F(x;Ξ) is cdf of the baseline random variable. The respective pdf is

\begin{matrix} g (x) = \frac{2}{π} \frac{f (x; Ξ)}{\sqrt{1 - F {(x; Ξ)}^{2}}}, x \in R . \end{matrix}

The cdf of the proposed ASM-Weibull distribution is obtained by using the expression 1 in 2. The flexibility and applicability of the ASM-Weibull distribution are examined via an application to the survival times of the COVID-19 patient data.

2 The arcsine-modified weibull model

In this section, we introduce the ASM-Weibull distribution. A random variable X, is said to follow the ASM-Weibull distribution, if its cdf is given by

\begin{matrix} G (x) = \frac{2}{π} a r c s i n e (1 - e^{- κ_{1} x^{α} - κ_{2} x}), x \geq 0, α, κ_{1}, κ_{2} > 0 . \end{matrix}

(3)

The density function corresponding to 3 is given by

\begin{matrix} g (x) = \frac{2}{π} \frac{(α κ_{1} x^{α - 1} + κ_{2}) e^{- κ_{1} x^{α} - κ_{2} x}}{\sqrt{1 - {(1 - e^{- κ_{1} x^{α} - κ_{2} x})}^{2}}}, x > 0 . \end{matrix}

(4)

Some possible behaviors of the pdf of the ASM-Weibull distribution are shown in Fig 2. The plots in the right panel of Fig 2, are sketched for α = 6.5, κ₁ = 1.5, κ₂ = 0.5 (red-line), α = 5.4, κ₁ = 0.5, κ₂ = 0.9 (green-line), α = 4.6, κ₁ = 1.5, κ₂ = 1.5 (black-line), α = 3.8, κ₁ = 1.5, κ₂ = 2.5 (blue-line). Whereas, the plots in the left panel of Fig 2, are presented for α = 0.5, κ₁ = 1.5, κ₂ = 1.5 (red-line), α = 2.5, κ₁ = 2.5, κ₂ = 0.1 (green-line), α = 1.6, κ₁ = 1.2, κ₂ = 0.5 (black-line), α = 2.8, κ₁ = 1.8, κ₂ = 1.2 (blue-line).

Some possible behaviors of the hazard rate function (hrf) h(x) of the ASM-Weibull distribution are shown in Fig 3. The plots provided in Fig 3, are sketched for α = 0.5, κ₁ = 0.5, κ₂ = 1 (red-line), α = 1.5, κ₁ = 0.8, κ₂ = 0.5 (green-line), α = 1.5, κ₁ = 2.1, κ₂ = 1.5 (blue-line),α = 1.2, κ₁ = 0.9, κ₂ = 1.2 (gold-line), α = 2.4, κ₁ = 0.5, κ₂ = 1.8 (black-line). From the plots provided in Fig 3, we can see that the proposed model captures different important behaviours of the hrf such as increasing, decreasing, unimodal also called upside down bathtub, modified unimodal and most importantly bathtub shapes.

3 Basic mathematical properties

This section deals with the computation of some statistical properties of the ASM-Weibull distribution.

3.1 Quantile function

Let X denote the ASM-Weibull random variable with cdf 3, then the qf (quantile function) of X, denoted Q(u), is given by

\begin{matrix} Q (u) = κ_{1} x^{α} + κ_{2} x + log (1 - s i n (\frac{π}{2} u)) . \end{matrix}

(5)

where u has the uniform distribution on the interval (0,1).

3.2 Moments

This subsection deals with the computation of r^th moment of the ASM-Weibull distribution that can be further used to obtain important characteristics. It is often employed in computing the main properties and characteristics of the distribution (as an example of this characteristics skewness, central tendency, dispersion, and kurtosis). In this section, we derive the r^th moment of the ASM-Weibull distribution as follows

\begin{matrix} μ_{r}^{'} = \frac{2}{π} \int_{0}^{\infty} x^{r} \frac{(α κ_{1} x^{α - 1} + κ_{2}) e^{- κ_{1} x^{α} - κ_{2} x}}{\sqrt{1 - {(1 - e^{- κ_{1} x^{α} - κ_{2} x})}^{2}}} d x . \end{matrix}

(6)

Using binomial series that is convergent when |t| < 1 (see, https://socratic.org/questions/how-do-you-use-the-binomial-series-to-expand-f-x-1-sqrt-1-x-2), we have

\begin{matrix} \frac{1}{\sqrt{1 - t^{2}}} = \sum_{n = 0}^{\infty} \frac{1 \times 3 \times 5 \times \dots \times (2 n - 1)}{n! 2^{n}} t^{2 n} . \end{matrix}

(7)

Using 7, we have

\begin{matrix} \frac{1}{\sqrt{1 - {(1 - e^{- κ_{1} x^{α} - κ_{2} x})}^{2}}} = \sum_{n = 0}^{\infty} \frac{1 \times 3 \times 5 \times \dots \times (2 n - 1)}{n! 2^{n}} {(1 - e^{- κ_{1} x^{α} - κ_{2} x})}^{2 n} . \end{matrix}

(8)

The expression 8 can also be written as

\begin{matrix} \frac{1}{\sqrt{1 - {(1 - e^{- κ_{1} x^{α} - κ_{2} x})}^{2}}} & = \sum_{n = 0}^{\infty} \sum_{i = 0}^{2 n} (\begin{matrix} 2 n \\ i \end{matrix}) {(- 1)}^{i} \frac{1 \times 3 \times 5 \times \dots \times (2 n - 1)}{n! 2^{n}} \\ \times e^{- i κ_{1} x^{α} - i κ_{2} x} . \end{matrix}

(9)

Using expression 9 in 6, we have

\begin{matrix} μ_{r}^{'} & = \frac{2}{π} \sum_{n = 0}^{\infty} \sum_{i = 0}^{2 n} (\begin{matrix} 2 n \\ i \end{matrix}) {(- 1)}^{i} \frac{1 \times 3 \times 5 \times \dots \times (2 n - 1)}{n! 2^{n}} \\ \times \int_{0}^{\infty} x^{r} (α κ_{1} x^{α - 1} + κ_{2}) e^{- κ_{1} x^{α} (i + 1) - κ_{2} x (i + 1)} d x . \end{matrix}

(10)

Using the series e^−t, we have

\begin{matrix} e^{- t} = \sum_{j = 0}^{\infty} \frac{{(- 1)}^{j}}{j!} t^{j} . \end{matrix}

(11)

Let t = κ₁(i + 1)x^α, then using the expression 11, we get

\begin{matrix} e^{- κ_{1} (i + 1) x^{α}} = \sum_{j = 0}^{\infty} \frac{{(- 1)}^{j} {(i + 1)}^{j} κ_{1}^{j}}{j!} x^{j α} . \end{matrix}

(12)

Using expression 12 in 10, we get

\begin{matrix} μ_{r}^{'} & = \frac{2}{π} \sum_{n = 0}^{\infty} \sum_{i = 0}^{2 n} (\begin{matrix} 2 n \\ i \end{matrix}) {(- 1)}^{i + j} {(i + 1)}^{j} κ_{1}^{j} \frac{1 \times 3 \times 5 \times \dots \times (2 n - 1)}{n! 2^{n}} \\ \times \int_{0}^{\infty} x^{r + j α} (α κ_{1} x^{α - 1} + κ_{2}) e^{- κ_{2} x (i + 1)} d x, \end{matrix}

\begin{matrix} μ_{r}^{'} & = \frac{2}{π} \sum_{n = 0}^{\infty} \sum_{i = 0}^{2 n} (\begin{matrix} 2 n \\ i \end{matrix}) {(- 1)}^{i + j} {(i + 1)}^{j} κ_{1}^{j} \frac{1 \times 3 \times 5 \times \dots \times (2 n - 1)}{n! 2^{n}} \\ \times [α κ_{1} \int_{0}^{\infty} x^{r + α (j + 1) - 1} e^{- κ_{2} (i + 1) x} d x + κ_{2} \int_{0}^{\infty} x^{r + j α} e^{- κ_{2} (i + 1) x} d x], \end{matrix}

\begin{matrix} μ_{r}^{'} & = \frac{2}{π} \sum_{n = 0}^{\infty} \sum_{i = 0}^{2 n} (\begin{matrix} 2 n \\ i \end{matrix}) {(- 1)}^{i + j} {(i + 1)}^{j} κ_{1}^{j} \frac{1 \times 3 \times 5 \times \dots \times (2 n - 1)}{n! 2^{n}} \\ \times [α κ_{1} \frac{Γ (r + α (j + 1))}{{(κ_{2} (i + 1))}^{r + α (j + 1)}} + κ_{2} \frac{Γ (r + j α + 1)}{{(κ_{2} (i + 1))}^{r + j α + 1}}] . \end{matrix}

For different values of α and κ₂, and fixed value of κ₁, the plots of mean, variance, skewness, and kurtosis of the ASM-Weibull distribution are presented in Figs 4 and 5.

Furthermore, the mgf (moment generating function) of the ASM-Weibull distribution denoted by M_X(t) has the form

\begin{matrix} M_{X} (t) = \frac{2}{π} \sum_{r, n = 0}^{\infty} \sum_{i = 0}^{2 n} (\begin{matrix} 2 n \\ i \end{matrix}) {(- 1)}^{i} \frac{1 \times 3 \times 5 \times \dots \times (2 n - 1) t^{r}}{r! n! 2^{n}} η_{r, i, α, κ_{1}, κ_{2}} . \end{matrix}

4 Maximum likelihood estimation

Here, we derive the maximum likelihood estimators (MLEs) of the ASM-Weibull distribution parameters based on the complete samples only. Let x₁, x₂, …, x_n represent the observed values from the ASM-Weibull distribution with parameters α, κ₁ and κ₂. Corresponding to Eq 4, the total log-likelihood function ℓ(α, κ₁, κ₂) is given by

\begin{matrix} ℓ (α, κ_{1}, κ_{2}) & = n log (\frac{2}{π}) + \sum_{i = 1}^{n} log (α κ_{1} x_{i}^{α - 1} + κ_{2}) - \sum_{i = 1}^{n} κ_{1} x_{i}^{α} - \sum_{i = 1}^{n} κ_{2} x_{i} \\ - \frac{1}{2} \sum_{i = 1}^{n} (1 - {(1 - e^{- κ_{1} x_{i}^{α} - κ_{2} x_{i}})}^{2}) . \end{matrix}

(13)

The numerical maximization of ℓ(α, κ₁, κ₂) can be done either by using the computer software or via differentiation on behalf of α, κ₁ and κ₂. Corresponding to Eq 13, the partial derivatives are as follows

\begin{matrix} \frac{\partial}{\partial α} ℓ (α, κ_{1}, κ_{2}) & = κ_{1} \sum_{i = 1}^{n} \frac{x_{i}^{α} (log x_{i}) e^{- κ_{1} x_{i}^{α}} (1 - e^{- κ_{1} x_{i}^{α} - κ_{2} x_{i}})}{(1 - {(1 - e^{- κ_{1} x_{i}^{α} - κ_{2} x_{i}})}^{2})} - κ_{1} \sum_{i = 1}^{n} x_{i}^{α} (log x_{i}) \\ + κ_{1} \sum_{i = 1}^{n} \frac{x_{i}^{α - 1} [α (log x_{i}) + 1]}{(α κ_{1} x^{α - 1} + κ_{2})}, \end{matrix}

(14)

\begin{matrix} \frac{\partial}{\partial κ_{1}} ℓ (α, κ_{1}, κ_{2}) = \sum_{i = 1}^{n} \frac{x_{i}^{α} e^{- κ_{1} x_{i}^{α}} (1 - e^{- κ_{1} x_{i}^{α} - κ_{2} x_{i}})}{(1 - {(1 - e^{- κ_{1} x_{i}^{α} - κ_{2} x_{i}})}^{2})} - \sum_{i = 1}^{n} x_{i}^{α} + \sum_{i = 1}^{n} \frac{α x_{i}^{α - 1}}{(α κ_{1} x^{α - 1} + κ_{2})}, \end{matrix}

(15)

\begin{matrix} \frac{\partial}{\partial κ_{2}} ℓ (α, κ_{1}, κ_{2}) = \sum_{i = 1}^{n} \frac{x_{i} e^{- κ_{2} x_{i}} (1 - e^{- κ_{1} x_{i}^{α} - κ_{2} x_{i}})}{(1 - {(1 - e^{- κ_{1} x_{i}^{α} - κ_{2} x_{i}})}^{2})} - \sum_{i = 1}^{n} x_{i} + \sum_{i = 1}^{n} \frac{1}{(α κ_{1} x^{α - 1} + κ_{2})} . \end{matrix}

(16)

Solving $\frac{\partial}{\partial α} ℓ (α, κ_{1}, κ_{2}) = \frac{\partial}{\partial κ_{1}} ℓ (α, κ_{1}, κ_{2}) = \frac{\partial}{\partial κ_{2}} ℓ (α, κ_{1}, κ_{2}) = 0$ , yield the MLEs $(\hat{α}, \hat{κ_{1}}, \hat{κ_{2}})$ of (α, κ₁, κ₂). For more information and extensive reading about MLEs, we refer to [31–34].

5 Simulation study

In this section of the paper, we provide a brief Monte Carlo simulation study to evaluate the MLEs of the ASM-Weibull distribution parameters. The ASM-Weibull distribution is easily simulated by inverting the expression 3. Let U has a uniform distribution U(0,1), then the nonlinear equation by inverting Eq 3 is

\begin{matrix} κ_{1} x^{α} + κ_{2} x + log (1 - s i n (\frac{π}{2} u)) . \end{matrix}

(17)

The simulation is performed for two different sets of parameters (i) α = 0.7, κ₁ = 1, κ₂ = 0.5, and (ii) α = 1.2, κ₁ = 1.4, κ₂ = 0.5.

The random number generation is obtained via the inverse cdf. The inverse process and simulation results are obtained via a statistical software R using (rootSolve) library with command mle. The sample size selected as n = 10, 20, …, 500 and the Monte Carlo replications made was 500 times. For the maximization of the expression 13, the algorithm “LBFGS-B” is used with optim(). For i = 1, 2, …, 500, the MLEs $(\hat{α}, \hat{κ_{1}}, \hat{κ_{2}})$ of (α, κ₁, κ₂) are obtained for each set of simulated data. The assessing tools such as biases and mean square errors (MSEs) are considered. These quantities are calculated as follows

\begin{matrix} B i a s (Θ) = \frac{1}{500} \sum_{i = 1}^{500} (\hat{Θ} - Θ), \end{matrix}

and

\begin{matrix} M S E (Θ) = \frac{1}{500} \sum_{i = 1}^{500} {(\hat{Θ} - Θ)}^{2}, \end{matrix}

where Θ = (α, κ₁, κ₂). The coverage probabilities (CPs) are calculated at the 95% confidence interval (C.I).

The summary measures of the simulated data presented in Table 1 and the box plots are provided in Fig 6.

Table 1. The summary measures of the simulated data sets.

Simulated Data	Min.	1st Qu.	Median	Mean	3rd Qu.	Max.
Set 1	0.001	1.013	2.710	4.736	6.232	59.042
Set 2	0.0000	0.0738	0.4550	1.9608	1.5947	46.0600

Open in a new tab

For the simulated data set 1, (i) the histogram and Kernel density estimator are presented in Fig 7, (ii) the fitted pdf and cdf are sketched in Fig 8, and (iii) the Kaplan-Meier survival and QQ (quantile-quantile) plots are provided in Fig 9. Corresponding to the first set of parameters values, the simulations results are provided in Table 2.

Table 2. The Monte Carlo simulation results of the ASM-Weibull distribution.

	Set 1: α = 0.7, κ₁ = 1, κ₂ = 0.5
n	Par.	MLE	MSEs	Biases	Confidence Interval	CPs	Variances
10	α	0.95163	0.30983	0.24163	(0.30356, 1.47970)	0.982	0.90602
	κ₁	1.41624	2.97415	0.41624	(-4.62888, 3.46137)	0.990	1.56130
	κ₂	0.81004	0.41880	0.21004	(-1.73901, 3.15911)	0.884	2.92010
20	α	0.90693	0.27431	0.20693	(0.40446, 1.20940)	0.978	0.84216
	κ₁	1.39378	2.18688	0.35937	(-2.39180, 3.19793)	0.976	1.20960
	κ₂	0.79625	0.37974	0.19625	(-1.41940, 2.89190)	0.890	2.71498
100	α	0.86174	0.21262	0.14174	(0.54005, 0.94342)	0.906	0.61058
	κ₁	1.36827	1.41030	0.29827	(-1.11438, 3.06107)	0.956	0.98438
	κ₂	0.73479	0.29131	0.15479	(-0.41042, 1.68002)	0.912	1.45309
200	α	0.83339	0.15829	0.10339	(0.56393, 0.90399)	0.926	0.47525
	κ₁	1.32276	0.60629	0.16276	(-0.41898, 2.54452)	0.950	0.17330
	κ₂	0.63017	0.14327	0.13017	(-0.18576, 1.44611)	0.908	0.57153
300	α	0.79676	0.10642	0.09676	(0.58486, 0.86866)	0.903	0.15241
	κ₁	1.21962	0.39405	0.10962	(-0.15800, 2.19726)	0.924	0.11109
	κ₂	0.60906	0.11851	0.10906	(-0.04422, 1.26235)	0.958	0.36099
400	α	0.76610	0.07600	0.05610	(0.59241, 0.83979)	0.924	0.03982
	κ₁	1.15222	0.29089	0.09222	(0.04500, 2.17945)	0.934	0.07579
	κ₂	0.57035	0.09127	0.07035	(0.03073, 1.10997)	0.948	0.29648
500	α	0.71224	0.00547	0.02224	(0.60464, 0.83984)	0.940	0.00359
	κ₁	1.08212	0.12997	0.03212	(0.01549, 1.99875)	0.937	0.07283
	κ₂	0.52466	0.06617	0.02664	(0.06971, 1.12761)	0.929	0.24828

Open in a new tab

For the simulated data set 2, (i) the histogram and Kernel density estimator are presented in Fig 10, (ii) the fitted pdf and cdf are sketched in Fig 11, and (iii) the Kaplan-Meier survival and QQ (quantile-quantile) plots are provided in Fig 12. Whereas, the corresponding simulation results are given in Table 3.

Table 3. The Monte Carlo simulation results of the ASM-Weibull distribution.

	Set 1: α = 0.5, κ₁ = 1.2, κ₂ = 1
n	Par.	MLE	MSEs	Biases	Confidence Interval	CPs	Variances
10	α	0.66582	0.53651	0.48321	(0.21252, 0.95390)	0.980	0.13576
	κ₁	1.40571	1.36161	0.50571	(0.74426, 2.15565)	0.932	1.37804
	κ₂	1.38516	0.56682	0.38516	(0.47090, 2.24122)	0.986	1.13841
20	α	0.63265	0.47684	0.43265	(0.28860, 0.77669)	0.958	0.10550
	κ₁	1.38657	1.25844	0.46576	(0.69955, 1.83109)	0.912	1.26905
	κ₂	1.31583	0.96998	0.35837	(0.20597, 1.98737)	0.958	1.06494
100	α	0.61496	0.43314	0.38556	(0.36773, 0.72455)	0.926	0.09422
	κ₁	1.32931	1.21851	0.40310	(0.40705, 2.39326)	0.910	1.17722
	κ₂	1.27044	0.94830	0.27044	(0.13789, 2.89926)	0.978	1.10603
200	α	0.58326	0.37053	0.31410	(0.39251, 0.69401)	0.912	0.05642
	κ₁	1.30819	1.09100	0.35819	(-0.80415, 2.00575)	0.924	1.09821
	κ₂	1.21057	0.76538	0.21057	(-1.2665r, 2.17690)	0.872	1.01865
300	α	0.56773	0.27831	0.12262	(0.40840, 0.65870)	0.918	0.02077
	κ₁	1.27689	0.97988	0.16786	(-0.70062, 1.96376)	0.908	0.92467
	κ₂	1.16786	0.71312	0.20786	(-0.78556, 2.10588)	0.900	0.83218
400	α	0.53587	0.10437	0.04126	(0.40902, 0.58272)	0.904	0.00996
	κ₁	1.23966	0.68606	0.10673	(-0.53937, 1.67285)	0.920	0.59047
	κ₂	1.10152	0.79517	0.14525	(-0.53139, 1.96108)	0.896	0.68642
500	α	0.51290	0.00171	0.00706	(0.42042, 0.56538)	0.966	0.00136
	κ₁	1.21124	0.39390	0.04124	(-0.31099, 1.43478)	0.978	0.47598
	κ₂	1.08239	0.47754	0.09392	(-0.27358, 1.86431)	0.976	0.56431

Open in a new tab

6 Applications to COVID-19 data sets

The main interest of the derivation of the ASM-Weibull distribution is its use in data analysis objectives, which makes it useful in many fields, particularly, in the fields dealing with lifetime analysis. Here, this feature is illustrated via taking two sets of data related to COVID-19 epidemic events.

We illustrate the best fitting power of the ASM-Weibull as compared with the other two parameters, three parameters and four parameters well-known lifetime competitive distributions namely: inverse Weibull (IW), extend odd Weibull exponential (ETOWE), Kumaraswamy Weibull (Ku-W), odd log-logistic modified Weibull (OLL-MW), and Frechet Weibull (FW) distributions. The pdfs of the competitive models are

Ku-W distribution
$\begin{matrix} f (x) = a b α κ_{1} x^{α - 1} e^{- κ_{1} x^{α}} {(1 - e^{- κ_{1} x^{α}})}^{a - 1} {[1 - {(1 - e^{- κ_{1} x^{α}})}^{a}]}^{b - 1}, x > 0 . \end{matrix}$
OLL-MW
$\begin{matrix} f (x) = \frac{a (α κ_{1} x^{α - 1} + κ_{2}) e^{- a κ_{1} x^{α} - a κ_{2} x} {(1 - e^{- κ_{1} x^{α} - κ_{2} x})}^{a - 1}}{{(e^{- a κ_{1} x^{α} - a κ_{2} x} + {(1 - e^{- κ_{1} x^{α} - κ_{2} x})}^{a})}^{2}} x > 0 . \end{matrix}$
FW
$\begin{matrix} f (x) = \frac{a α b^{α} {(\frac{κ_{1}}{x})}^{a α} e^{- b^{α} {(\frac{κ_{1}}{x})}^{a α}}}{x}, x > 0 . \end{matrix}$
IW
$\begin{matrix} f (x) = b a^{b} x^{- b - 1} e^{- {(\frac{a}{x})}^{b}}, x > 0 . \end{matrix}$
ETOWE
$\begin{matrix} f (x) = α λ e^{λ x} {(e^{λ x} - 1)}^{α - 1} {(b {(e^{λ x} - 1)}^{α} + 1)}^{- \frac{b + 1}{b}}, x > 0 . \end{matrix}$

We show that the ASM-Weibull distribution provides the best fit to the lifetime data related to the COVID-19 epidemic. The term “best fit” is used in the sense that the proposed model has smaller values of the criterion selected for comparison. These criterion consist of some discrimination measures. These measures are

The AIC (Akaike information criterion)
$\begin{matrix} A I C = 2 k - 2 ℓ, \end{matrix}$
The CAIC (Corrected Akaike information criterion)
$\begin{matrix} C A I C = \frac{2 n k}{n - k - 1} - 2 ℓ, \end{matrix}$
The BIC (Bayesian information criterion)
$\begin{matrix} B I C = k log (n) - 2 ℓ, \end{matrix}$
The HQIC (Hannan-Quinn information criterion)
$\begin{matrix} H Q I C = 2 k log (log (n)) - 2 ℓ, \end{matrix}$
where ℓ is the value of the log-likelihood function under the MLE, k refers to the number of parameters of the model, and n is the sample size.

In addition to these measures, we also consider other important goodness of fit measures including the Anderson-Darling (AD) statistic, Cramer-von Mises (CM) statistic and the Kolmogorov-Smirnov (KS) statistic with p-value, for detail information about these measures see [35]. A model with the lowest values of the above mentioned measures could be chosen as the best model for the real data set.

For the computation of the numerical results, we use the Newton-Raphson iteration procedure with optim() R-function with the argument method =“BFGS” to estimate the model parameters. The numerical estimates of the unknown parameters of the ASM-Weibull and other fitted distributions are obtained using the R-script AdequacyModel with the “BFGS” algorithm.

6.1 Survival times of the COVID-19 patients data

In this subsection, we consider the survival times of patients suffering from the COVID-19 epidemic in China. The considered data set representing the survival times of patients from the time admitted to the hospital until death. Among them, a group of fifty-three (53) COVID-19 patients were found in critical condition in hospital from January to February 2020.

Among them, 37 patients (70%) were men and 16 women (30%). 40 patients (75%) were diagnosed with chronic diseases, especially including high blood pressure, and diabetes. 47 patients (88%) had common clinical symptoms of the flu, 42 patients (81%) were coughing, 37 (69%) were short of breath, and 28 patients (53%) had fatigue. 50 (95%) patients had bilateral pneumonia showed by the chest computed tomographic scans. The data set can be retrieved from https://www.worldometers.info/coronavirus/ and is given by: 0.054, 0.064, 0.704, 0.816, 0.235, 0.976, 0.865, 0.364, 0.479, 0.568, 0.352, 0.978, 0.787, 0.976, 0.087, 0.548, 0.796, 0.458, 0.087, 0.437, 0.421, 1.978, 1.756, 2.089, 2.643, 2.869, 3.867, 3.890, 3.543, 3.079, 3.646, 3.348, 4.093, 4.092, 4.190, 4.237, 5.028, 5.083, 6.174, 6.743, 7.274, 7.058, 8.273, 9.324, 10.827, 11.282, 13.324, 14.278, 15.287, 16.978, 17.209, 19.092, 20.083.

The summary measures of the first data are provided in Table 4. Whereas, The histogram of COVID-19 data along with the total time test (TTT) plot are sketched in Fig 13, shows that the data set is right-skewed (histogram).

Table 4. The summary measures of the first COVID-19 data.

Min.	1st Qu.	Median	Mean	3rd Qu.	Max.
0.054	0.704	3.079	4.787	6.743	20.083

Open in a new tab

The MLEs of the ASM-Weibull and other models are provided in Table 5. The discrimination measures of the fitted distributions are provided in Table 6, and the goodness of fit measures are provided in Table 7.

Table 5. Corresponding to the first COVID-19 data, the estimated values of the parameters of the fitted distributions.

Model	$\hat{α}$	$\hat{κ_{1}}$	$\hat{κ_{2}}$	$\hat{σ}$	$\hat{a}$	$\hat{b}$	$\hat{λ}$
ASM-Weibull	0.8290	0.5332	0.0156
MOW	0.7046	0.4404		1.4888
Ku-W	0.7968	3.2500			1.0967	0.0959
T-MW	0.7864	0.2709	0.0227				0.1145
B-MW	0.4432	2.9385	0.0288		2.8855	0.2178
OLL-MW	0.7846	0.7693	0.0970		1.9086
FW	0.5196	0.5045			1.1804	1.9838

Open in a new tab

Table 6. Corresponding to the first COVID-19 data, the discrimination measures of the fitted models.

Model	AIC	CAIC	BIC	HQIC
ASM-Weibull	272.3406	272.8304	278.2515	274.6136
MOW	273.3886	273.8784	279.2994	275.6616
Ku-W	274.1946	275.0279	282.0757	277.2253
T-MW	274.7410	275.5743	282.6222	277.7717
B-MW	274.0107	275.2873	283.8622	277.7991
OLL-MW	275.1094	275.9765	284.4875	278.0921
FW	292.8950	293.7280	300.7760	295.9260

Open in a new tab

Table 7. Corresponding to the first COVID-19 data, the goodness of fit measures of the fitted models.

Model	CM	AD	KS	p-value
ASM-Weibull	0.0723	0.4555	0.1226	0.4022
MOW	0.0772	0.4957	0.1290	0.3405
Ku-W	0.0740	0.4640	0.1386	0.2600
T-MW	0.0756	0.4750	0.1240	0.3885
B-MW	0.0797	0.4976	0.1435	0.3024
OLL-MW	0.0814	0.5209	0.1506	0.2815
FW	0.2473	1.6930	0.1435	0.2250

Open in a new tab

From the values of the criteria provided in Tables 6 and 7, we see that the ASM-Weibull model is far from the concurrence. Indeed, for the COVID-19 lifetime data, for instance, it satisfies smaller values of the AIC, CAIC, BIC, HQIC, CM, AD, KS and high p-value against the AIC, CAIC, BIC, HQIC, CM, AD, KS and high p-value for the second-best distribution.

Furthermore, for the COVID-19 lifetime data, a graphical check of the fit of the ASM-Weibull model are presented in Fig 14. For this purpose, we consider the curves of the estimated pdf, cdf, PP (probability-probability) and Kaplan-Meier survival plots of the ASM-Weibull distribution. For the ASM-Weibull model, the estimated cdf and pdf are given by $G (x; \hat{α}, \hat{κ_{1}}, \hat{κ_{2}})$ and $g (x; \hat{α}, \hat{κ_{1}}, \hat{κ_{2}})$ , respectively, where G(x;α, κ₁, κ₂) is defined by Eq 3, g(x;α, κ₁, κ₂) is defined by Eq 4, and $(\hat{α}, \hat{κ_{1}}, \hat{κ_{2}})$ are the obtained MLEs for (α, κ₁, κ₂). For instance, based on Eq 3, the second row of Table 5, and the plot of the ASM-Weibull distribution in Fig 14 representing the estimated cdf is given by

\begin{matrix} G (x) = \frac{2}{π} a r c s i n e (1 - e^{- 0.5332 x^{0.8290} - 0.0156 x}), x \geq 0, \end{matrix}

with estimated pdf

\begin{matrix} g (x) = \frac{2}{π} \frac{(0.4420 x^{0.8290 - 1} + 0.0156) e^{- 0.5332 x^{0.8290} - 0.0156 x}}{\sqrt{1 - {(1 - e^{- 0.5332 x^{0.8290} - 0.0156 x})}^{2}}}, x > 0 . \end{matrix}

From the plots sketched in Fig 14, we see that the ASM-Weibull curves are closer to the corresponding empirical objects. The above practical results show that the ASM-Weibull distribution is an efficient model to adjust the considered survival times of the COVID-19’s patients in China.

Furthermore, graphical display of the existence and uniqueness of the MLEs are shown in Figs 15 and 16, respectively. Fig 15 confirms the existence of MLEs as the log-likelihood function intersects the x-axis at one point. Furthermore, Fig 16 shows that the MLEs are unique as the log-likelihood function has global maximum roots.

6.2 Second real data set

The second data set represents the mortality rate of the COVID-19 patients in Canada. This data set is available at https://covid19.who.int/, and given by: 3.1091, 3.3825, 3.1444, 3.2135, 2.4946, 3.5146, 4.9274, 3.3769, 6.8686, 3.0914, 4.9378, 3.1091, 3.2823, 3.8594, 4.0480, 4.1685, 3.6426, 3.2110, 2.8636, 3.2218, 2.9078, 3.6346, 2.7957, 4.2781, 4.2202, 1.5157, 2.6029, 3.3592, 2.8349, 3.1348, 2.5261, 1.5806, 2.7704, 2.1901, 2.4141, 1.9048.

Corresponding to this data set, the summary measures are provided in Table 8. Whereas, the histogram and TTT plots are sketched in Fig 17.

Table 8. The summary measures of the second COVID-19 data set.

Min.	1st Qu.	Median	Mean	3rd Qu.	Max.
1.516	2.789	3.178	3.282	3.637	6.869

Open in a new tab

For the second data set, the MLEs of the ASM-Weibull and other models are provided in Table 9. The discrimination and goodness of fit measures are provided in Tables 10 and 11, respectively.

Table 9. Corresponding to the second COVID-19 data, the estimated values of the parameters of the fitted distributions.

Model	$\hat{α}$	$\hat{κ_{1}}$	$\hat{κ_{2}}$	$\hat{a}$	$\hat{b}$	$\hat{λ}$
ASM-Weibull	3.47668	0.020029	0.000918
Ku-W	2.63782	0.448372			1.52414	0.525173
OLL-MW	4.56716	6.99444× 10⁻²⁹	0.001007		0.606009
FW	1.40346	1.7383			2.25808	2.11702
IW				2.70436	3.16912
ETOWE	1.07377				5.84409 × 10⁸	1.65812 × 10⁸

Open in a new tab

Table 10. Corresponding to the second COVID-19 data, the discrimination measures of the fitted models.

Model	AIC	CAIC	BIC	HQIC
ASM-Weibull	109.437	110.187	114.188	111.095
Ku-W	119.739	121.03	126.074	121.95
OLL-MW	151.286	152.576	157.62	153.497
FW	113.84	115.13	120.174	116.051
IW	109.84	110.204	113.007	110.946
ETOWE	163.56	164.31	168.31	165.218

Open in a new tab

Table 11. Corresponding to the second COVID-19 data, the goodness of fit measures of the fitted models.

Model	CM	AD	KS	p-value
ASM-Weibull	1.21231	0.212535	0.154695	0.355032
Ku-W	3.52584	0.703955	0.265631	0.0124359
OLL-MW	8.81608	1.96755	0.397571	0.000022
FW	1.61803	0.274479	0.173757	0.227156
IW	1.61803	0.274479	0.173757	0.227156
ETOWE	8.94624	1.86779	0.40961	0.000011

Open in a new tab

From the values of the selected criteria reported in Tables 10 and 11, we see that the ASM-Weibull model is a better model as it has the smaller values of the AIC, CAIC, BIC, HQIC, CM, AD, KS and high p-value against the AIC, CAIC, BIC, HQIC, CM, AD, KS and high p-value for the second-best distribution.

Furthermore, for the second COVID-19 data, the graphical display of the pdf, cdf, PP (probability-probability) and Kaplan-Meier survival plots of the ASM-Weibull distribution are presented in Fig 18. The graphs sketched in Fig 18, show that the ASM-Weibull distribution provide the best description to the COVID-19 mortality rate data. For the second data set, the likelihood function is plotted in Figs 19 and 20, which confirms the existence and uniqueness properties of the MLEs, respectively.

7 Concluding remarks

The two-parameter Weibull model has shown great applicability in the practice of statistical sciences particularly, reliability engineering, biomedical and financial sciences. In this study, a new modification of the Weibull model is introduced using the modified Weibull distribution with the “Arcsine strategy”. The proposed model is called the arcsine modified Weibull distribution. The maximum likelihood estimators of the ASM-Weibull parameters are obtained and a Monte Carlo simulation study is conducted. To show the applicability of the ASM-Weibull model, two real-life data sets related to COVID-19 events are considered. The comparison of the proposed model is made with the other well-known competitors. To figure out the close fitting of the fitted distributions, certain analytical tools including four discrimination measures and three goodness of fit measures as well as the p-value are considered. Based on these analytical measures, we showed that the ASM-Weibull model provides a better fit than the other competitors, supported by graphical sketching and numerical tools. Furthermore, corresponding to COVID-19 data sets, the log-likelihood function is also plotted confirming the existence and uniqueness properties of the MLEs. We hope that beyond the scope of this paper, the ASM-Weibull can be applied to analyze other forms of the COVID-19 data.

Data Availability

All relevant data are within the manuscript.

Funding Statement

The author(s) received no specific funding for this work.

References

1. Li Q., Guan X., Wu P., Wang X., Zhou L., Tong Y., et al. (2020). Early transmission dynamics in Wuhan, China, of novel coronavirus–infected pneumonia. New England journal of medicine. doi: 10.1056/NEJMoa2001316 [DOI] [PMC free article] [PubMed] [Google Scholar]
2. Bhattacharya S., & Paul S. (2020). The behaviour of infection, survival and testing effort variables of SARS-CoV-2: A theoretical modelling based on optimization technique. Results in physics, 19, 103568. doi: 10.1016/j.rinp.2020.103568 [DOI] [PMC free article] [PubMed] [Google Scholar]
3. Huang C., Wang Y., Li X., Ren L., Zhao J., Hu Y., et al. (2020). Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The lancet, 395, 497–506. doi: 10.1016/S0140-6736(20)30183-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
4. Nesteruk I. G. (2020). Coronavirus epidemic outbreak in Europe. Comparison with the dynamics in mainland China. KPI Science News, 1, 10–17. [Google Scholar]
5. Nesteruk I. (2020). Comparison of the coronavirus pandemic dynamics in Europe, USA and South Korea. medRxiv. [Google Scholar]
6. Chang S. L., Harding N., Zachreson C., Cliff O. M., & Prokopenko M. (2020). Modelling transmission and control of the COVID-19 pandemic in Australia. Nature communications, 11(1), 1–13. doi: 10.1038/s41467-020-19393-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
7. Kharroubi S. A. (2020). Modeling the spread of COVID-19 in Lebanon: A Bayesian perspective. Frontiers in Applied Mathematics and Statistics, 6, 1–40. doi: 10.3389/fams.2020.00040 [DOI] [Google Scholar]
8. Ahmar A. S., & Del Val E. B. (2020). Sutte ARIMA: Short-term forecasting method, a case: Covid-19 and stock market in Spain. Science of the Total Environment, 729, 138883. doi: 10.1016/j.scitotenv.2020.138883 [DOI] [PMC free article] [PubMed] [Google Scholar]
9. de León U. A. P., Pérez, Á. G., & Avila-Vales E. (2020). An SEIARD epidemic model for COVID-19 in Mexico: mathematical analysis and state-level forecast. Chaos, Solitons & Fractals, 140, 110165. doi: 10.1016/j.chaos.2020.110165 [DOI] [PMC free article] [PubMed] [Google Scholar]
10. de Paula Lobo A., Cardoso-dos-Santos A. C., Rocha M. S., Pinheiro R. S., Bremm J. M., Macário E. M., et al. (2020). COVID-19 epidemic in Brazil: Where are we at?. International Journal of Infectious Diseases, 97, 382–385. doi: 10.1016/j.ijid.2020.06.044 [DOI] [PMC free article] [PubMed] [Google Scholar]
11. Abid K., Bari Y. A., Younas M., Tahir Javaid S., & Imran A. (2020). Progress of COVID-19 Epidemic in Pakistan. Asia Pacific Journal of Public Health, 32(4), 154–156. doi: 10.1177/1010539520927259 [DOI] [PMC free article] [PubMed] [Google Scholar]
12. Samui P., Mondal J., & Khajanchi S. (2020). A mathematical model for COVID-19 transmission dynamics with a case study of India. Chaos, Solitons & Fractals, 140, 110173. doi: 10.1016/j.chaos.2020.110173 [DOI] [PMC free article] [PubMed] [Google Scholar]
13. Al-Babtain A. A., Elbatal I., Al-Mofleh H., Gemeay A. M., Afify A. Z., & Sarg A. M. (2021). The Flexible Burr XG Family: Properties, Inference, and Applications in Engineering Science. Symmetry, 13(3), 474. doi: 10.3390/sym13030474 [DOI] [Google Scholar]
14. Zuo M., Khosa S.K., Ahmad Z., & Almaspoor Z. (2020). Comparison of COVID-19 pandemic dynamics in Asian countries with statistical modeling. Computational and mathematical methods in medicine, 2020. doi: 10.1155/2020/4296806 [DOI] [PMC free article] [PubMed] [Google Scholar]
15. Zhao J., Ahmad Z., & Almaspoor Z. (2021a). Modeling the covid-19 pandemic dynamics in iran and china, Computers, Materials & Continua, 67, 2111–2122. doi: 10.32604/cmc.2021.014259 [DOI] [Google Scholar]
16. Zhao J., Ahmad Z., Almaspoor Z., El-Morshedy M., & Afify A.Z. (2021b). Modeling COVID-19 Pandemic Dynamics in Two Asian Countries, Computers, Materials & Continua, 67, 965–977. doi: 10.32604/cmc.2021.014553 [DOI] [Google Scholar]
17. Abdy M., Side S., Annas S., Nur W., & Sanusi W. (2021). An SIR epidemic model for COVID-19 spread with fuzzy parameter: the case of Indonesia. Advances in Difference Equations, 2021(1), 1–17. doi: 10.1186/s13662-021-03263-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
18. Atangana E., & Atangana A. (2020). Facemasks simple but powerful weapons to protect against COVID-19 spread: Can they have sides effects?. Results in physics, 19, 103425. doi: 10.1016/j.rinp.2020.103425 [DOI] [PMC free article] [PubMed] [Google Scholar]
19. Maier B. F., & Brockmann D. (2020). Effective containment explains sub-exponential growth in recent confirmed COVID-19 cases in China. Science, 368, 742–746. doi: 10.1126/science.abb4557 [DOI] [PMC free article] [PubMed] [Google Scholar]
20. Sarkodie S. A., & Owusu P. A. (2020). Investigating the cases of novel coronavirus disease (COVID-19) in China using dynamic statistical techniques. Heliyon, 6, e03747. doi: 10.1016/j.heliyon.2020.e03747 [DOI] [PMC free article] [PubMed] [Google Scholar]
21. Thabet S. T., Abdo M.S., Shah K., & Abdeljawad T. (2020). Study of transmission dynamics of COVID-19 mathematical model under ABC fractional order derivative. Results in Physics, 19, 103507. doi: 10.1016/j.rinp.2020.103507 [DOI] [PMC free article] [PubMed] [Google Scholar]
22. Teamah A.E.A., Elbanna A.A., & Gemeay A.M. (2020). Frechet-Weibull distribution with applications to earthquakes data sets, Pakistan Journal of Statistics, 36, 1–18. [Google Scholar]
23. Din A., Shah K., Seadawy A., Alrabaiah H., & Baleanu D. (2020). On a new conceptual mathematical model dealing the current novel coronavirus-19 infectious disease. Results in Physics, 19, 103510. doi: 10.1016/j.rinp.2020.103510 [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Liu Y., Gayle A. A., Wilder-Smith A., & Rocklöv J. (2020). The reproductive number of COVID-19 is higher compared to SARS coronavirus. Journal of travel medicine. 27, 1–16. doi: 10.1093/jtm/taaa021 [DOI] [PMC free article] [PubMed] [Google Scholar]
25. Chen Y., Cheng J., Jiang Y., & Liu K. (2020). A time delay dynamical model for outbreak of 2019-nCoV and the parameter identification. Journal of Inverse and Ill-posed Problems, 28(2), 243–250. doi: 10.1515/jiip-2020-0010 [DOI] [Google Scholar]
26. Roosa K., Lee Y., Luo R., Kirpich A., Rothenberg R., Hyman J. M., et al. (2020). Real-time forecasts of the COVID-19 epidemic in China from February 5th to February 24th, 2020. Infectious Disease Modelling, 5, 256–263. doi: 10.1016/j.idm.2020.02.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Ahmad, Z., Khosa, S.K., Yusuf, M., Alamri, O.A., & Emam, O. (2021). A New Flexible Statistical Model: Simulation and Modeling the Survival Times of the COVID-19’s patients in China, Complexity, 2021.
28. Sarhan A. M., & Zaindin M. (2009). Modified Weibull distribution. Applied Sciences, 11, 123–136. [Google Scholar]
29. Tung Y. L., Ahmad Z., & Mahmoudi E. (2021). The Arcsine-X Family of Distributions with Applications to Financial Sciences. Computer Systems Science & Engineering, 37, 1–13. [Google Scholar]
30. He W., Ahmad Z., Afify A. Z., & Goual H. (2020). The arcsine exponentiated-X family: validation and insurance application. Complexity, 2020. doi: 10.1155/2020/8394815 [DOI] [Google Scholar]
31. Chen F., & Chen S. (2011). Injury severities of truck drivers in single-and multi-vehicle accidents on rural highways. Accident Analysis & Prevention, 43(5), 1677–1688. doi: 10.1016/j.aap.2011.03.026 [DOI] [PubMed] [Google Scholar]
32. Chen F., Chen S., & Ma X. (2018). Analysis of hourly crash likelihood using unbalanced panel data mixed logit model and real-time driving environmental big data. Journal of safety research, 65, 153–159. doi: 10.1016/j.jsr.2018.02.010 [DOI] [PubMed] [Google Scholar]
33. Dong B., Ma X., Chen F., & Chen S. (2018). Investigating the differences of single-vehicle and multivehicle accident probability using mixed logit model. Journal of Advanced Transportation, 2018. doi: 10.1155/2018/2702360 [DOI] [Google Scholar]
34. Al-Babtain A.A., Gemeay A.M., & Afify A.Z. (2021). Estimation methods for the discrete poisson-lindley and discrete lindley distributions with actuarial measures and applications in medicine, Journal of King Saud University—Science, 33, 101224. doi: 10.1016/j.jksus.2020.10.021 [DOI] [Google Scholar]
35. Ahmad Z., Mahmoudi E., Hamedani G. G., & Kharazmi O. (2020). New methods to define heavy-tailed distributions with applications to insurance data. Journal of Taibah University for Science, 14(1), 359–382. doi: 10.1080/16583655.2020.1741942 [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All relevant data are within the manuscript.

[pone.0254999.ref001] 1. Li Q., Guan X., Wu P., Wang X., Zhou L., Tong Y., et al. (2020). Early transmission dynamics in Wuhan, China, of novel coronavirus–infected pneumonia. New England journal of medicine. doi: 10.1056/NEJMoa2001316 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref002] 2. Bhattacharya S., & Paul S. (2020). The behaviour of infection, survival and testing effort variables of SARS-CoV-2: A theoretical modelling based on optimization technique. Results in physics, 19, 103568. doi: 10.1016/j.rinp.2020.103568 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref003] 3. Huang C., Wang Y., Li X., Ren L., Zhao J., Hu Y., et al. (2020). Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. The lancet, 395, 497–506. doi: 10.1016/S0140-6736(20)30183-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref004] 4. Nesteruk I. G. (2020). Coronavirus epidemic outbreak in Europe. Comparison with the dynamics in mainland China. KPI Science News, 1, 10–17. [Google Scholar]

[pone.0254999.ref005] 5. Nesteruk I. (2020). Comparison of the coronavirus pandemic dynamics in Europe, USA and South Korea. medRxiv. [Google Scholar]

[pone.0254999.ref006] 6. Chang S. L., Harding N., Zachreson C., Cliff O. M., & Prokopenko M. (2020). Modelling transmission and control of the COVID-19 pandemic in Australia. Nature communications, 11(1), 1–13. doi: 10.1038/s41467-020-19393-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref007] 7. Kharroubi S. A. (2020). Modeling the spread of COVID-19 in Lebanon: A Bayesian perspective. Frontiers in Applied Mathematics and Statistics, 6, 1–40. doi: 10.3389/fams.2020.00040 [DOI] [Google Scholar]

[pone.0254999.ref008] 8. Ahmar A. S., & Del Val E. B. (2020). Sutte ARIMA: Short-term forecasting method, a case: Covid-19 and stock market in Spain. Science of the Total Environment, 729, 138883. doi: 10.1016/j.scitotenv.2020.138883 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref009] 9. de León U. A. P., Pérez, Á. G., & Avila-Vales E. (2020). An SEIARD epidemic model for COVID-19 in Mexico: mathematical analysis and state-level forecast. Chaos, Solitons & Fractals, 140, 110165. doi: 10.1016/j.chaos.2020.110165 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref010] 10. de Paula Lobo A., Cardoso-dos-Santos A. C., Rocha M. S., Pinheiro R. S., Bremm J. M., Macário E. M., et al. (2020). COVID-19 epidemic in Brazil: Where are we at?. International Journal of Infectious Diseases, 97, 382–385. doi: 10.1016/j.ijid.2020.06.044 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref011] 11. Abid K., Bari Y. A., Younas M., Tahir Javaid S., & Imran A. (2020). Progress of COVID-19 Epidemic in Pakistan. Asia Pacific Journal of Public Health, 32(4), 154–156. doi: 10.1177/1010539520927259 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref012] 12. Samui P., Mondal J., & Khajanchi S. (2020). A mathematical model for COVID-19 transmission dynamics with a case study of India. Chaos, Solitons & Fractals, 140, 110173. doi: 10.1016/j.chaos.2020.110173 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref013] 13. Al-Babtain A. A., Elbatal I., Al-Mofleh H., Gemeay A. M., Afify A. Z., & Sarg A. M. (2021). The Flexible Burr XG Family: Properties, Inference, and Applications in Engineering Science. Symmetry, 13(3), 474. doi: 10.3390/sym13030474 [DOI] [Google Scholar]

[pone.0254999.ref014] 14. Zuo M., Khosa S.K., Ahmad Z., & Almaspoor Z. (2020). Comparison of COVID-19 pandemic dynamics in Asian countries with statistical modeling. Computational and mathematical methods in medicine, 2020. doi: 10.1155/2020/4296806 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref015] 15. Zhao J., Ahmad Z., & Almaspoor Z. (2021a). Modeling the covid-19 pandemic dynamics in iran and china, Computers, Materials & Continua, 67, 2111–2122. doi: 10.32604/cmc.2021.014259 [DOI] [Google Scholar]

[pone.0254999.ref016] 16. Zhao J., Ahmad Z., Almaspoor Z., El-Morshedy M., & Afify A.Z. (2021b). Modeling COVID-19 Pandemic Dynamics in Two Asian Countries, Computers, Materials & Continua, 67, 965–977. doi: 10.32604/cmc.2021.014553 [DOI] [Google Scholar]

[pone.0254999.ref017] 17. Abdy M., Side S., Annas S., Nur W., & Sanusi W. (2021). An SIR epidemic model for COVID-19 spread with fuzzy parameter: the case of Indonesia. Advances in Difference Equations, 2021(1), 1–17. doi: 10.1186/s13662-021-03263-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref018] 18. Atangana E., & Atangana A. (2020). Facemasks simple but powerful weapons to protect against COVID-19 spread: Can they have sides effects?. Results in physics, 19, 103425. doi: 10.1016/j.rinp.2020.103425 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref019] 19. Maier B. F., & Brockmann D. (2020). Effective containment explains sub-exponential growth in recent confirmed COVID-19 cases in China. Science, 368, 742–746. doi: 10.1126/science.abb4557 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref020] 20. Sarkodie S. A., & Owusu P. A. (2020). Investigating the cases of novel coronavirus disease (COVID-19) in China using dynamic statistical techniques. Heliyon, 6, e03747. doi: 10.1016/j.heliyon.2020.e03747 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref021] 21. Thabet S. T., Abdo M.S., Shah K., & Abdeljawad T. (2020). Study of transmission dynamics of COVID-19 mathematical model under ABC fractional order derivative. Results in Physics, 19, 103507. doi: 10.1016/j.rinp.2020.103507 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref022] 22. Teamah A.E.A., Elbanna A.A., & Gemeay A.M. (2020). Frechet-Weibull distribution with applications to earthquakes data sets, Pakistan Journal of Statistics, 36, 1–18. [Google Scholar]

[pone.0254999.ref023] 23. Din A., Shah K., Seadawy A., Alrabaiah H., & Baleanu D. (2020). On a new conceptual mathematical model dealing the current novel coronavirus-19 infectious disease. Results in Physics, 19, 103510. doi: 10.1016/j.rinp.2020.103510 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref024] 24. Liu Y., Gayle A. A., Wilder-Smith A., & Rocklöv J. (2020). The reproductive number of COVID-19 is higher compared to SARS coronavirus. Journal of travel medicine. 27, 1–16. doi: 10.1093/jtm/taaa021 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref025] 25. Chen Y., Cheng J., Jiang Y., & Liu K. (2020). A time delay dynamical model for outbreak of 2019-nCoV and the parameter identification. Journal of Inverse and Ill-posed Problems, 28(2), 243–250. doi: 10.1515/jiip-2020-0010 [DOI] [Google Scholar]

[pone.0254999.ref026] 26. Roosa K., Lee Y., Luo R., Kirpich A., Rothenberg R., Hyman J. M., et al. (2020). Real-time forecasts of the COVID-19 epidemic in China from February 5th to February 24th, 2020. Infectious Disease Modelling, 5, 256–263. doi: 10.1016/j.idm.2020.02.002 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0254999.ref027] 27.Ahmad, Z., Khosa, S.K., Yusuf, M., Alamri, O.A., & Emam, O. (2021). A New Flexible Statistical Model: Simulation and Modeling the Survival Times of the COVID-19’s patients in China, Complexity, 2021.

[pone.0254999.ref028] 28. Sarhan A. M., & Zaindin M. (2009). Modified Weibull distribution. Applied Sciences, 11, 123–136. [Google Scholar]

[pone.0254999.ref029] 29. Tung Y. L., Ahmad Z., & Mahmoudi E. (2021). The Arcsine-X Family of Distributions with Applications to Financial Sciences. Computer Systems Science & Engineering, 37, 1–13. [Google Scholar]

[pone.0254999.ref030] 30. He W., Ahmad Z., Afify A. Z., & Goual H. (2020). The arcsine exponentiated-X family: validation and insurance application. Complexity, 2020. doi: 10.1155/2020/8394815 [DOI] [Google Scholar]

[pone.0254999.ref031] 31. Chen F., & Chen S. (2011). Injury severities of truck drivers in single-and multi-vehicle accidents on rural highways. Accident Analysis & Prevention, 43(5), 1677–1688. doi: 10.1016/j.aap.2011.03.026 [DOI] [PubMed] [Google Scholar]

[pone.0254999.ref032] 32. Chen F., Chen S., & Ma X. (2018). Analysis of hourly crash likelihood using unbalanced panel data mixed logit model and real-time driving environmental big data. Journal of safety research, 65, 153–159. doi: 10.1016/j.jsr.2018.02.010 [DOI] [PubMed] [Google Scholar]

[pone.0254999.ref033] 33. Dong B., Ma X., Chen F., & Chen S. (2018). Investigating the differences of single-vehicle and multivehicle accident probability using mixed logit model. Journal of Advanced Transportation, 2018. doi: 10.1155/2018/2702360 [DOI] [Google Scholar]

[pone.0254999.ref034] 34. Al-Babtain A.A., Gemeay A.M., & Afify A.Z. (2021). Estimation methods for the discrete poisson-lindley and discrete lindley distributions with actuarial measures and applications in medicine, Journal of King Saud University—Science, 33, 101224. doi: 10.1016/j.jksus.2020.10.021 [DOI] [Google Scholar]

[pone.0254999.ref035] 35. Ahmad Z., Mahmoudi E., Hamedani G. G., & Kharazmi O. (2020). New methods to define heavy-tailed distributions with applications to insurance data. Journal of Taibah University for Science, 14(1), 359–382. doi: 10.1080/16583655.2020.1741942 [DOI] [Google Scholar]

PERMALINK

Modeling the survival times of the COVID-19 patients with a new statistical model: A case study from China

Xiaofeng Liu

Zubair Ahmad

Ahmed M Gemeay

Alanazi Talal Abdulrahman

E H Hafez

N Khalil

Roles

Abstract

1 Introduction

Fig 1. Graphical display for the percentage of the symptoms of COVID-19.

2 The arcsine-modified weibull model

Fig 2. Different density plots of the ASM-Weibull distribution.

Fig 3. Different hrf plots of the ASM-Weibull distribution.

3 Basic mathematical properties

3.1 Quantile function

3.2 Moments

Fig 4. Plots of the mean, variance, skewness and kurtosis for κ1 = 1.2 and different values of α and κ2.

Fig 5. Plots of the mean, variance, skewness and kurtosis for κ1 = 0.5 and different values of α and κ2.

4 Maximum likelihood estimation

5 Simulation study

Table 1. The summary measures of the simulated data sets.

Fig 6. Box plots of the simulated data set 1 (red color) and set 2 (blue color).

Fig 7. The histogram and Kernel density estimator of the simulated data set 1.

Fig 8. The fitted pdf and cdf of the ASM-Weibull model for the simulated data set 1.

Fig 9. The QQ and Kaplan-Meier survival plots of the ASM-Weibull model for the simulated data set 1.

Table 2. The Monte Carlo simulation results of the ASM-Weibull distribution.

Fig 10. The histogram and Kernel density estimator of the simulated data set 2.

Fig 11. The fitted pdf and cdf of the ASM-Weibull model for the simulated data set 2.

Fig 12. The QQ and Kaplan-Meier survival plots of the ASM-Weibull model for the simulated data set 2.

Table 3. The Monte Carlo simulation results of the ASM-Weibull distribution.

6 Applications to COVID-19 data sets

6.1 Survival times of the COVID-19 patients data

Table 4. The summary measures of the first COVID-19 data.

Fig 13. Histogram and TTT plot of the COVID-19 data 1.

Table 5. Corresponding to the first COVID-19 data, the estimated values of the parameters of the fitted distributions.

Table 6. Corresponding to the first COVID-19 data, the discrimination measures of the fitted models.

Table 7. Corresponding to the first COVID-19 data, the goodness of fit measures of the fitted models.

Fig 14. The estimated pdf, cdf, PP and Kaplan-Meier survival plots of the ASM-Weibull distribution for the COVID-19 data.

Fig 15. This figure indicates the existences of the log likelihood function as each curve intersect the x-axis at one point.

Fig 16. This figure indicates that the log likelihood roots are global maximum.

6.2 Second real data set

Table 8. The summary measures of the second COVID-19 data set.

Fig 17. Histogram and TTT plot of the second COVID-19 data set.

Table 9. Corresponding to the second COVID-19 data, the estimated values of the parameters of the fitted distributions.

Table 10. Corresponding to the second COVID-19 data, the discrimination measures of the fitted models.

Table 11. Corresponding to the second COVID-19 data, the goodness of fit measures of the fitted models.

Fig 18. The estimated pdf, cdf, PP and Kaplan-Meier survival plots of the ASM-Weibull distribution for the second COVID-19 data.

Fig 19. This figure indicates the existences of the log likelihood function as each curve intersect the x-axis at one point for the second data set.

Fig 20. This figure indicates that the log likelihood roots are global maximum for the second data set.

7 Concluding remarks

Data Availability

Funding Statement

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Fig 4. Plots of the mean, variance, skewness and kurtosis for κ₁ = 1.2 and different values of α and κ₂.

Fig 5. Plots of the mean, variance, skewness and kurtosis for κ₁ = 0.5 and different values of α and κ₂.