A stationary Weibull process and its applications

Debasis Kundu

doi:10.1080/02664763.2022.2073585

. 2022 May 18;50(13):2681–2700. doi: 10.1080/02664763.2022.2073585

A stationary Weibull process and its applications

Debasis Kundu ^1,^CONTACT

PMCID: PMC10503463 PMID: 37720249

Abstract

In this paper we introduce a discrete-time and continuous state-space Markov stationary process ${X_{n}; n = 1, 2, \dots}$ , where $X_{n}$ has a two-parameter Weibull distribution, $X_{n}$ 's are dependent and there is a positive probability that $X_{n} = X_{n + 1}$ . The motivation came from the gold price data where there are several instances for which $X_{n} = X_{n + 1}$ . Hence, the existing methods cannot be used to analyze this data. We derive different properties of the proposed Weibull process. It is observed that the joint cumulative distribution function of $X_{n}$ and $X_{n + 1}$ has a very convenient copula structure. Hence, different dependence properties and dependence measures can be obtained. The maximum likelihood estimators cannot be obtained in explicit forms, we have proposed a simple profile likelihood method to compute these estimators. We have used this model to analyze two synthetic data sets and one gold price data set of the Indian market, and it is observed that the proposed model fits quite well with the data set.

Keywords: Weibull distribution, exponential distribution, maximum likelihood estimators, minification process, maximum likelihood predictor

AMS Subject Classifications: 62F10, 62F03, 62H12

1. Introduction

The aim of this paper is to introduce a new discrete-time and continuous state-space Markov Weibull process ${X_{n}; n = 1, 2, \dots}$ , which is flexible and it has certain distinct features so that it can be used in practice in different areas. In this case, the marginals are non-negative two-parameter Weibull distributions and $X_{n}$ s are dependent. It is a lag-1 process based on a minimization approach and $X_{n} = X_{n + 1}$ with a positive probability. Therefore, if there is a non-negative time series data which are positively skewed and there is a positive probability that the consecutive values might be equal then the proposed model can be used quite effectively to analyze this data set.

The motivation for this work came when we were trying to analyze gold price data in the Indian market during the month of December 2020 and January 2021. It is observed that the data are coming from a positive valued lag-1 stationary process, there are several instances for which $X_{n} = X_{n + 1}$ , and this cannot be ignored. It may be mentioned that there are several positive valued stationary processes available in the literature, for example, the exponential process by Tavares [14], Weibull and Gamma processes by Sim [13], the logistic process by Arnold [1], Pareto process by Arnold and Hallet [2], semi-Parero process by Pillai [12], generalized Weibull process by Jayakumar and Girish Babu [6], see also Yeh et al. [15], Arnold and Robertson [3], Jose et al. [7] and the reference cited therein. But none of these can be applied in this case, as in all these cases $P (X_{n} = X_{n + 1}) = 0$ .

We have provided different properties of the proposed process ${X_{n}}$ . The Weibull process ${X_{n}}$ has one shape parameter and two scale parameters. If the shape parameter is one, it becomes a stationary exponential process. The generation from ${X_{n}}$ is quite straightforward, hence, different simulation experiments can be performed quite conveniently. The joint distribution of $X_{n}$ and $X_{n + 1}$ has a very convenient copula structure. Therefore, several dependence properties and the dependence measures can be computed quite conveniently. We provide a characterization of the process. The marginals and the joint PDF can take a variety of shapes. The autocovariance and autocorrelation are not in convenient forms. However, we have provided the necessary expressions in Appendix for completeness purposes.

The maximum likelihood estimators (MLEs) cannot be obtained in explicit form. Moreover, it cannot be obtained in a routine manner. Based on some re-parameterization and using the profile likelihood method the MLEs can be obtained. The parametric bootstrap method can be used to compute the confidence intervals of the unknown parameters. We propose a goodness of fit test based on the parametric bootstrap approach. We have analyzed two synthetic data sets and one gold price data set for two months of the Indian market. It is observed that the proposed model fits the gold price data set quite well and it can be used quite effectively to analyze the gold price in the Indian market. It may be mentioned that the proposed process is a lag one process, but it can be extended to a lag-q process also, and it has been indicated how it can be done.

I think the major difference between the existing literature and the present manuscript is in its construction. The present manuscript allows to have ties, which are not available in the literature. Moreover, in the present manuscript we have provided the detailed inference procedure and showed with real data examples how it can be implemented in practice. This seems to be the main contribution of the present manuscript.

The rest of the paper is organized as follows. In Section 2 we have defined the Weibull process and provided its different properties. The maximum likelihood estimators have been discussed in Section 3. In Section 4 one goodness of fit test has been proposed based on the parametric bootstrap approach. The analyses of three data sets have been presented in Section 5. Finally, we conclude the paper in Section 6.

2. Weibull process and its properties

We will use the following notations in this paper. A Weibull random variable with the shape parameter $α > 0$ and the scale parameter $λ > 0$ has the following probability density function (PDF);

f_{W E} (x; α, λ) = {\begin{cases} α λ x^{α - 1} e^{- λ x^{α}} & if x > 0, \\ 0 & if x \leq 0, \end{cases}

(1)

and it will be denoted by WE $(α, λ)$ . It has the following cumulative distribution function and hazard function, respectively, for x>0;

\begin{aligned} F_{W E} (x; α, λ) & = 1 - e^{- λ x^{α}}, \\ h_{W E} (x; α, λ) & = α λ x^{α - 1} . \end{aligned}

The mean and variance of WE $(α, λ)$ is

\frac{1}{λ^{1 / α}} Γ (\frac{1}{α} + 1) and \frac{1}{λ^{2 / α}} [Γ (\frac{2}{α} + 1) - {(Γ (\frac{1}{α} + 1))}^{2}],

(2)

respectively. A uniform random variable on (0,1) will be denoted by $U (0, 1)$ . Now, we are in a position to define the Weibull process.

Definition 2.1

Suppose $U_{0}, U_{1}, \dots$ are independent identically distributed (i.i.d.) $U (0, 1)$ random variables, then for $λ_{0} > 0$ , $λ_{1} > 0$ and $α > 0$ , let us define a new sequence of random variables ${X_{n}; n = 1, 2, \dots}$ , where

$X_{n} = min {{[- \frac{1}{λ_{0}} \ln U_{n}]}^{\frac{1}{α}}, {[- \frac{1}{λ_{1}} \ln U_{n - 1}]}^{\frac{1}{α}}} .$ (3)

Then the sequence of random variables ${X_{n}}$ is called a Weibull process.

A Weibull process as defined in (3) will be denoted by WEP $(α, λ_{0}, λ_{1})$ . Here α is the shape parameter and $λ_{0}$ and $λ_{1}$ are the scale parameters. It may be mentioned that a location parameter can easily be incorporated into the model, which has not been tried here. The name Weibull process comes from the following results.

Theorem 2.1

If ${X_{n}}$ is as defined in (3), then

${X_{n}}$ is a stationary process.

$X_{n}$ follows WE $(α, λ_{0} + λ_{1})$ .

Proof.

Part (i) follows from the definition. To prove Part (ii), note that

$\begin{aligned} P (X_{n} > x) & = P {{[- \frac{1}{λ_{0}} \ln U_{n}]}^{\frac{1}{α}} > x, {[- \frac{1}{λ_{1}} \ln U_{n - 1}]}^{\frac{1}{α}} > x} \\ = P {U_{n} < e^{- λ_{0} x^{α}}, U_{n - 1} < e^{- λ_{1} x^{α}}} \\ = P {U_{n} < e^{- λ_{0} x^{α}}} P {U_{n - 1} < e^{- λ_{1} x^{α}}} \\ = e^{- (λ_{0} + λ_{1}) x^{α}} . \end{aligned}$

The following result characterizes the Weibull process.

Theorem 2.2

Let $X_{1} \sim$ WE $(α, λ_{0} + λ_{1})$ , and $U_{i}$ s are i.i.d. random variables with an absolute continuous distribution function $F (\cdot)$ on (0,1). Then the process as defined in (3) is a strictly stationary Markov process if and only if $U_{0} \sim U (0, 1)$ .

Proof.

‘If’ part is trivial. Now to prove the ‘only if’ part, we assume $S (x) = 1 - F (x)$ , and $F^{'} (x) = - S^{'} (x) = f (x)$ , for x>0. Therefore, for $x > 0$ ,

$e^{- (λ_{0} + λ_{1}) x^{α}} = F (e^{- λ_{0} x^{α}}) F (e^{- λ_{1} x^{α}}) .$

If we write $y = e^{- x^{α}}$ , then for 0<y<1,

$y^{λ_{0} + λ_{1}} = F (y^{λ_{0}}) F (y^{λ_{1}}) \Rightarrow \frac{F (y^{λ_{0}})}{y^{λ_{0}}} \times \frac{F (y^{λ_{1}})}{y^{λ_{1}}} = 1 \Rightarrow \frac{F (y)}{y} = 1 \Rightarrow F (y) = y . ■$

Now we present the joint distribution of $X_{n}$ and $X_{n + m}$ , for $m \geq 1$ .

Theorem 2.3

If ${X_{n}}$ satisfies (3), then the joint survival function of $X_{n}$ and $X_{n + m}$ , $S_{n, n + m} (x, y) = P (X_{n} > x, X_{n + m} > y)$ is

$S_{n, n + m} (x, y) = {\begin{cases} e^{- (λ_{0} + λ_{1}) x^{α}} e^{- (λ_{0} + λ_{1}) y^{α}} & if m \geq 2 \\ e^{- λ_{1} x^{α}} e^{- λ_{0} y^{α}} g (x, y) & if m = 1, \end{cases}$ (4)

where $g (x, y) = min {e^{- λ_{0} x^{α}}, e^{- λ_{1} y^{α}}}$ .

Proof.

The proof is quite simple and it is avoided.

The above theorem indicates that $X_{n}$ and $X_{n + m}$ are dependent for m = 1, and they are independently distributed if m>1. This makes the process as the lag-1 process. The joint distribution function of $X_{n}$ and $X_{n + 1}$ will help to develop the dependence properties of the Weibull process and we would like to study it in more detail. The joint survival function of $X_{n}$ and $X_{n + 1}$ can be written explicitly as

S_{n, n + 1} (x, y) = {\begin{cases} e^{- λ_{1} x^{α}} e^{- (λ_{0} + λ_{1}) y^{α}} & if λ_{0} x^{α} < λ_{1} y^{α} \\ e^{- (λ_{0} + λ_{1}) x^{α}} e^{- λ_{0} y^{α}} & if λ_{0} x^{α} > λ_{1} y^{α} \\ e^{- z \frac{λ_{1}^{2} + λ_{0}^{2} + λ_{0} λ_{1}}{λ_{0} λ_{1}}} & if λ_{0} x^{α} = λ_{1} y^{α} = z . \end{cases}

(5)

Therefore, if $λ_{0} = λ_{1} = λ$ , then

S_{n, n + 1} (x, y) = {\begin{cases} e^{- λ x^{α}} e^{- 2 λ y^{α}} & if x < y \\ e^{- 2 λ x^{α}} e^{- λ y^{α}} & if x > y \\ e^{- 3 λ z^{α}} & if x = y = z . \end{cases}

(6)

It may be mentioned that (6) is the joint survival function of the Marshall–Olkin bivariate Weibull distribution, and its properties have been well studied in the literature. See for example Kundu and Dey [8] and Kundu and Gupta [10] and the references cited therein. It can be easily seen that the $S_{n, n + 1} (x, y)$ , for $0 < δ < 1$ , has the following survival copula

\tilde{C} (u, v) = {\begin{cases} u^{δ} v & if u^{δ} > v^{1 - δ} \\ u v^{1 - δ} & if u^{δ} \leq v^{1 - δ} . \end{cases}

(7)

The corresponding copula density function becomes

c (u, v) = \frac{\partial^{2}}{\partial u \partial v} \tilde{C} (u, v) = {\begin{cases} δ u^{δ - 1} & if u^{δ} > v^{1 - δ} \\ (1 - δ) v^{- δ} & if u^{δ} \leq v^{1 - δ} . \end{cases}

(8)

Based on the copula density function, the Spearman's ρ and Kendall's τ can be obtained as

\begin{aligned} ρ & = \frac{3 δ (1 - δ)}{δ^{2} - δ + 2} \\ τ & = \frac{δ (1 - δ) (1 - δ (1 - δ))}{δ^{3} + δ (1 - δ) + δ^{2} (1 - δ^{2}) + (1 - δ)^{3}}, \end{aligned}

respectively.

We will introduce the following regions, which will be used later.

\begin{aligned} S_{1} & = {(x, y); x > 0, y > 0, β x < y} \\ S_{2} & = {(x, y); x > 0, y > 0, β x > y} \\ C & = {(x, y); x > 0, y > 0, β x = y} . \end{aligned}

Here, $β = (λ_{0} / λ_{1})^{1 / α}$ , and it may be noted that the curve C has the parametric form $(t, γ (t))$ , for $0 < t < \infty$ , where $γ (t) = β t$ . The following results are needed for further development.

Theorem 2.4

If ${X_{n}}$ satisfies (3), then the joint survival function of $X_{n}$ and $X_{n + 1}$ can be written as

$S_{n, n + 1} (x, y) = p S_{a} (x, y) + (1 - p) S_{s} (x, y),$ (9)

here $p = \frac{λ_{0}^{2} + λ_{1}^{2}}{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}$ ,

$S_{s} (x, y) = {(g (x, y))}^{\frac{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}{λ_{0} λ_{1}}},$

and $S_{a} (x, y)$ can be obtained by subtraction, i.e.

$S_{a} (x, y) = {\begin{cases} \frac{1}{p} e^{- λ_{1} x^{α}} e^{- (λ_{0} + λ_{1}) y^{α}} - \frac{1 - p}{p} e^{- \frac{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}{λ_{0}} y^{α}} & if β x < y \\ \frac{1}{p} e^{- (λ_{0} + λ_{1}) x^{α}} e^{- λ_{0} y^{α}} - \frac{1 - p}{p} e^{- \frac{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}{λ_{1}} x^{α}} & if β x > y . \end{cases}$ (10)

Proof.

See in Appendix.

Now we provide the joint probability density function (PDF) of $X_{n}$ and $X_{n + 1}$ , and because of the Markov property, it will be useful to compute the joint PDF of $X_{1}, \dots, X_{n}$ . It should be mentioned that since the joint distribution (survival) function is not an absolutely continuous distribution function the joint PDF does not exist in the terms of two-dimensional Lebesgue measure dominating. In this case, we need to consider the dominating measure in a different way similarly as in Bemis et al. [4]. Here, the dominating measure is the two-dimensional Lebesgue measure on $S_{1} \cup S_{2}$ , and one-dimensional Lebesgue measure on the curve C. Based on the above dominating measure the joint PDF of $X_{n}$ and $X_{n + 1}$ , for $x >$ and $y > 0$ , can be written as follows.

Theorem 2.5

If $X_{n}$ satisfies (3), then the joint PDF of $X_{n}$ and $X_{n + 1}$ is

$f_{n, n + 1} (x, y) = {\begin{cases} f_{1} (x, y) & if β x < y, \\ f_{2} (x, y) & if β x > y, \\ f_{0} (x) & if β x = y, \end{cases}$ (11)

where

$\begin{aligned} f_{1} (x, y) & = f_{W E} (x; α, λ_{1}) f_{W E} (y; α, λ_{0} + λ_{1}), \\ f_{2} (x, y) & = f_{W E} (x; α, λ_{0} + λ_{1}) f_{W E} (y; α, λ_{0}), \\ f_{0} (x) & = \frac{α λ_{0}}{β} x^{α - 1} e^{- \frac{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}{λ_{1}} x^{α}} . \end{aligned}$

Proof.

See in Appendix.

The following conditional PDF will be useful for prediction purposes. The conditional PDF of $X_{n + 1}$ given $X_{n}$ can be written as follows:

f_{X_{n + 1} | X_{n} = x} (y) = {\begin{cases} \frac{λ_{1}}{λ_{0} + λ_{1}} e^{λ_{0} x^{α}} f_{W E} (y; α, λ_{0} + λ_{1}) & if β x < y \\ f_{W E} (y; α, λ_{0}) & if β x > y \\ \frac{λ_{0}}{λ_{0} + λ_{1}} e^{- \frac{λ_{0}^{2}}{λ_{1}} x^{α}} & if β x = y \end{cases}

(12)

When α = 1, it becomes an exponential process. When $λ_{0} = λ_{1} = λ$ , then the joint PDF of $X_{n}$ and $X_{n + 1}$ for a Weibull process is

f_{n, n + 1} (x, y) = {\begin{cases} 2 α^{2} λ^{2} x^{α - 1} e^{- λ x^{α}} y^{α - 1} e^{- 2 λ y^{α}} & if x < y, \\ 2 α^{2} λ^{2} x^{α - 1} e^{- 2 λ x^{α}} y^{α - 1} e^{- λ y^{α}} & if x > y, \\ α λ x^{α - 1} e^{- 3 λ x^{α}} & if x = y . \end{cases}

The conditional PDF of $X_{n + 1}$ given $X_{n}$ can be written as

f_{X_{n + 1} | X_{n} = x} (y) = {\begin{cases} \frac{1}{2} e^{λ x^{α}} f_{W E} (y; α, 2 λ) & if x < y \\ f_{W E} (y; α, λ) & if y > x \\ \frac{1}{2} e^{- λ x^{α}} & if x = y \end{cases}

It follows that both for the Weibull and exponential processes,

P (X_{n} = X_{n + 1}) = P (X_{n} < X_{n + 1}) = P (X_{n} > X_{n + 1}) = \frac{1}{3} .

The autocovariance and autocorrelation of a Weibull process cannot be obtained in convenient forms. In the case of exponential process, however, they can be obtained in explicit forms. We have provided all the necessary expressions in Appendix 2, for completeness purposes.

The joint PDF of $X_{1}, \dots, X_{n}$ can be obtained as

\begin{aligned} f_{X_{1}, \dots, X_{n}} (x_{1}, \dots, x_{n}) & = f_{X_{n} | X_{n - 1}, \dots, X_{1}} (x_{n}) \times \dots \times f_{X_{2} | X_{1}} (x_{2}) \times f_{X_{1}} (x_{1}) \end{aligned}

(13)

\begin{aligned} = f_{X_{n} | X_{n - 1}} (x_{n}) \times \dots \times f_{X_{2} | X_{1}} (x_{2}) \times f_{X_{1}} (x_{1}) \end{aligned}

(14)

\begin{aligned} = \frac{f_{X_{n - 1}, X_{n}} (x_{n - 1}, x_{n})}{f_{X_{n - 1}} (x_{n - 1})} \times \dots \times \frac{f_{X_{1}, X_{2}} (x_{1}, x_{2})}{f_{X_{1}} (x_{1})} \times f_{X_{1}} (x_{1}) \end{aligned}

(15)

\begin{aligned} = \frac{\prod_{i = 1}^{n - 1} f_{X_{i}, X_{i + 1}} (x_{i}, x_{i + 1})}{\prod_{i = 2}^{n - 1} f_{X_{i}} (x_{i})} . \end{aligned}

(16)

Note that (13) is obtained by the conditioning approach, (14) is obtained by using the Markov property, (15) is obtained by using the conditional density function, and the last step is obtained by simple algebraic calculation.

Now we will be discussing about the stopping time. It may be mentioned that the stopping time has been discussed quite extensively in the time series literature, see for example Christensen [5], Novikov and Shiryaev [11], and see the references cited therein. Let L>0 be a fixed real number, and let us define a new random variable N, such for $k \geq 1$ ,

{N = k} \Leftrightarrow {X_{1} > L, \dots, X_{k - 1} > L, X_{k} \leq L} .

Then clearly, N is a stopping time. Now for $λ = max {λ_{0}, λ_{1}}$ , first we obtain

\begin{aligned} P (X_{1} > L, \dots, X_{k} > L) & = P (U_{0} < e^{- λ_{1} L^{α}}, U_{1} < e^{- λ_{0} L^{α}}, \dots U_{k - 1} < e^{- λ_{1} L^{α}}, U_{k} < e^{- λ_{0} L^{α}}) \\ = e^{- (λ_{0} + λ_{1}) L^{α}} - e^{- (k - 1) λ L^{α}} \\ P (N = 1) & = P (X_{1} \leq L) = 1 - e^{- (λ_{0} + λ_{1}) L^{α}} \\ P (N = 2) & = P (X_{1} > L, X_{2} \leq L) = P (X_{1} > L) - P (X_{1} > L, X_{2} > L) \\ = e^{- (λ_{0} + λ_{1}) L^{α}} \\ - P (U_{1} < e^{- λ_{0} L^{α}}, U_{0} < e^{- λ_{1} L^{α}}, U_{2} < e^{- λ_{0} L^{α}}, U_{1} < e^{- λ_{1} L^{α}}) \\ = e^{- (λ_{0} + λ_{1}) L^{α}} (1 - e^{- λ L^{α}}) \\ P (N = 3) & = P (X_{1} > L, X_{2} > L, X_{3} \leq L) \\ = P (X_{1} > L, X_{2} > L) - P (X_{1} > L, X_{2} > L, X_{3} > L) \\ = e^{- (λ_{0} + λ_{1}) L^{α}} e^{- λ L^{α}} (1 - e^{- λ L^{α}}) \\ P (N = k) & = e^{- (λ_{0} + λ_{1}) L^{α}} e^{- (k - 2) λ L^{α}} (1 - e^{- λ L^{α}}) . \end{aligned}

The probability generating function of N can be obtained as

\begin{aligned} G (z) & = E (z^{N}) = \sum_{k = 1}^{\infty} P (N = k) z^{k} \\ = \frac{z (1 - e^{- (λ_{0} + λ_{1}) L^{α}}) + z^{2} (e^{- (λ_{0} + λ_{1}) L^{α}} - e^{- λ L^{α}})}{(1 - z e^{- λ L^{α}})} . \end{aligned}

Using the probability generating function, different moments and other properties can be easily derived.

3. Maximum likelihood estimators

In this section we consider the maximum likelihood estimators of the unknown parameters of a Weibull process based on a sample of size n, namely $x_{1}, \dots, x_{n}$ . We consider two cases separately.

Case 1: $λ_{0}$ = $λ_{1}$

In this case it is assumed that $λ_{0} = λ_{1} = λ$ . Our problem is to estimate α and λ based on $D = {x_{1}, \dots, x_{n}}$ . We use the following notations: $I = {1, \dots, n - 1}$

I_{1} = {i : i \in I, x_{i} < x_{i + 1}}, I_{2} = {i : i \in I, x_{i} > x_{i + 1}}, I_{0} = {i : i \in I, x_{i} = x_{i + 1}} .

The number of elements in $I_{0}$ , $I_{1}$ and $I_{2}$ are denoted by $n_{0}$ , $n_{1}$ and $n_{2}$ , respectively. Based on the joint PDF (16), the log-likelihood function can be written as

\begin{aligned} l (α, λ | D) & = \sum_{i \in I_{0} \cup I_{1} \cup I_{2}} \ln f_{X_{i}, X_{i + 1}} (x_{i}, x_{i + 1}) - \sum_{i = 2}^{n} \ln f_{X_{i}} (x_{i}) \\ = (n_{1} + n_{2} + 1) \ln α + (n_{1} + n_{2} + 1) \ln λ \\ + (α - 1) (\ln x_{1} + \sum_{i \in I_{1} \cup I_{2}} \ln x_{i + 1}) - λ g_{1} (α | D) \end{aligned}

(17)

where

g_{1} (α | D) = \sum_{i \in I_{1}} (x_{i}^{α} + 2 x_{i + 1}^{α}) + \sum_{i \in I_{2}} (2 x_{i}^{α} + x_{i + 1}^{α}) + 3 \sum_{i \in I_{0}} x_{i}^{α} - 2 \sum_{i = 2}^{n - 1} x_{i}^{α} .

It is immediate that for any α, $g_{1} (α | D) > 0$ . Hence, for a given α, the MLE of λ, say $\hat{λ} (α)$ can be obtained as

\hat{λ} (α) = \frac{n_{1} + n_{2} + 1}{g_{1} (α | D)},

(18)

and the MLE of α, say $\hat{α}$ can be obtained by maximizing

\begin{aligned} h (α) & = (n_{1} + n_{2} + 1) \ln α + (n_{1} + n_{2} + 1) (\ln (n_{1} + n_{2} + 1) - \ln g (α | D)) \\ + α (\ln x_{1} + \sum_{i \in I_{1} \cup I_{2}} \ln x_{i + 1}) . \end{aligned}

(19)

Once, $\hat{α}$ is obtained, then the MLE of λ, say $\hat{λ}$ can be obtained as $\hat{λ} (\hat{α})$ . Due to the complicated nature of $h (α)$ it is difficult to prove that it is a unimodal function. But in our data analysis, it is observed that $h (α)$ is a unimodal function, and it will be explained later. In the case of exponential process, the MLE of λ can be obtained as

\hat{λ} = \frac{n_{1} + n_{2} + 1}{g_{1} (1 | D)} .

(20)

Case 2: $λ_{0} \neq λ_{1}$

In this section, we consider the case when $λ_{0}$ and $λ_{1}$ are arbitrary. We use the following notations

\begin{aligned} I_{1} (β) & = {i : i \in I, β x_{i} < x_{i + 1}} \\ I_{2} (β) & = {i : i \in I, β x_{i} > x_{i + 1}} \\ I_{0} (β) & = {i : i \in I, β x_{i} = x_{i + 1}}, \end{aligned}

and $n_{0} (β) = | I_{0} (β) |$ , $n_{1} (β) = | I_{1} (β) |$ and $n_{2} (β) = | I_{2} (β) |$ . Here, β is same as defined before. Based on the data vector $D$ , the log-likelihood function of $λ_{0}$ , $λ_{1}$ and α becomes

\begin{aligned} l (λ_{0}, λ_{1}, α | D) & = (n_{1} (β) + n_{2} (β) + 1) \ln α + n_{1} (β) \ln λ_{1} + (n_{2} (β) + n_{0} (β)) \ln λ_{0} \\ + (n_{1} (β) + n_{2} (β) + 2 - n) \ln (λ_{0} + λ_{1}) \\ + (α - 1) {\ln x_{1} + \sum_{i \in I_{1} (β) \cup I_{2} (β)} \ln x_{i + 1}} \\ - λ_{1} \sum_{i \in I_{1} (β)} x_{i}^{α} - (λ_{0} + λ_{1}) \sum_{i \in I_{2} (β)} x_{i}^{α} - \frac{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}{λ_{1}} \sum_{i \in_{0} (β)} x_{i}^{α} \\ - (λ_{0} + λ_{1}) \sum_{i \in I_{1} (β)} x_{i + 1}^{α} - λ_{0} \sum_{i \in I_{2} (β)} x_{i + 1}^{α} \\ - n_{0} (β) \ln β + (λ_{0} + λ_{1}) \sum_{i = 2}^{n - 1} x_{i}^{α} . \end{aligned}

(21)

It is not trivial to maximize (21) directly. Hence, we use the following re-parameterization. We use the following transformed parameters, $(γ, λ_{1}, α)$ , where $γ = \frac{λ_{0}}{λ_{1}}$ and based on the transformed parameters, the log-likelihood function can be written as

\begin{aligned} l (γ, λ_{1}, α) & = (n_{1} (β) + n_{2} (β) + 1) (\ln λ_{1} + \ln α) \\ - λ_{1} g_{2} (α, γ | D) - (n_{0} (β) - 1) \ln (1 + γ) \\ + ((n_{0} (β) - \frac{1}{α}) + n_{2} (β)) \ln (γ) + (α - 1) h_{2} (D), \end{aligned}

(22)

where

\begin{aligned} g_{2} (α, γ | D) & = \sum_{i \in I_{1} (β)} x_{i}^{α} + (1 + γ) (\sum_{i \in I_{1} (β)} x_{i + 1}^{α} + \sum_{i \in I_{2} (β)} x_{i}^{α}) + γ \sum_{i \in I_{2} (β)} x_{i + 1}^{α} \\ + (γ^{2} + γ + 1) \sum_{i \in I_{0} (β)} x_{i}^{α} - (1 + γ) \sum_{i = 2}^{n - 1} x_{i}^{α} \\ h_{2} (D) & = \ln x_{1} + \sum_{i \in I_{1} (β) \cup I_{2} (β)} \ln x_{i + 1} . \end{aligned}

We propose to use the profile likelihood method to maximize (22). For fixed γ and α (β is also fixed in that case), first, we maximize (22) with respect to $λ_{1}$ , say ${\hat{λ}}_{1} (γ, α)$ , and it can be obtained in explicit form as

{\hat{λ}}_{1} (γ, α) = \frac{n_{1} (β) + n_{2} (β) + 1}{g_{2} (α, γ | D)} .

The MLEs of γ and α, say $\hat{γ}$ and $\hat{α}$ , respectively, can be obtained by maximizing $l (γ, {\hat{λ}}_{1} (γ, α), α)$ . Finally, the MLE of $λ_{1}$ can be obtained as ${\hat{λ}}_{1} (\hat{γ}, \hat{α})$ . We will denote this as ${\hat{λ}}_{1}$ . Due to complicated nature of the function $g_{2} (α, γ | D)$ , it is difficult to prove that it has a unique maximum. But in our data analysis, it is observed from the contour plot that $g_{2} (α, γ | D)$ is a unimodal function. In both the cases we have suggested the parametric bootstrap method to construct confidence intervals of the unknown parameters. They can be very easily implemented in practice. In the case of exponential process, the MLE of $λ_{1}$ for a given γ can be obtained as

{\hat{λ}}_{1} (γ) = \frac{n_{1} (β) + n_{2} (β) + 1}{g_{2} (1, γ | D)},

and the MLE of γ can be obtained by maximizing $l (γ, {\hat{λ}}_{1} (γ), 1)$ . It is a one-dimensional optimization problem.

4. Goodness of fit

In this section, we provide a goodness of fit test so that whether a given data set comes from a Weibull process or not can be tested. Suppose ${x_{1}, \dots, x_{n}}$ is a sample from a stationary sequence ${X_{1}, \dots, X_{n}}$ . We want to test the following null hypothesis

H_{0} : {X_{1}, \dots, X_{n}} \sim WEP (α, λ_{0}, λ_{1}) .

Let us use the following notations. We denote $X_{1 : n} < \dots < X_{n : n}$ as the ordered ${X_{1}, \dots, X_{n}}$ , similarly, $x_{1 : n} < \dots < x_{n : n}$ as the ordered ${x_{1}, \dots, x_{n}}$ , and $a_{1} = E_{H_{0}} (X_{1 : n}), \dots, a_{n} = E_{H_{0}} (X_{n : n})$ as their ordered expected values under $H_{0}$ . Here, $a_{1}, \dots, a_{n}$ depend on $α, λ_{0}, λ_{1}$ , but we do not make it explicit. We use the following statistic for goodness of fit test.

W_{n} = max_{1 \leq i \leq n} | X_{i : n} - a_{i} | .

It is expected that if $H_{0}$ is true, then $W_{n}$ should be small. Hence, we use the following test criterion for a given level of significance $0 < β < 1$

Reject H_{0} if W_{n} > c_{n} (β),

where $c (β)$ is such that

P_{H_{0}} (W_{n} > c_{n} (β) = β .

Note that $c_{n} (β)$ also depends on $α, λ_{0}, λ_{1}$ , but we do not make it explicit for brevity. It is difficult to obtain $c_{n} (β)$ theoretically even for large n. Hence, we propose to use the parametric bootstrap technique to approximate $c_{n} (β)$ from a given observed sample ${x_{1}, \dots, x_{n}}$

Hence, if $w_{n} = max_{1 \leq i \leq n} | x_{i : n} - {\hat{a}}_{i} | > {\hat{c}}_{n} (β)$ , then we reject the null hypothesis with β% level of significance, otherwise we accept the null hypothesis.

5. Data analysis

In this section we have analyzed three data sets; two synthetic data sets and one real gold price data set of the Indian market for two months. The main idea of these data analyses is to see how the proposed MLEs work in practice and also how the proposed model works in real life.

5.1. Synthetic data set 1:

In this section, we analyze one synthetic data set, and it has been generated using the following model specification: α = 2.0, $λ_{0} = λ_{1} = 1.0$ , n = 50. The data set (Data Set 1) has been presented in Figure 1.

Figure 1. — Synthetic data set with α = 2.0, $λ_{0} = λ_{1} = 1$ .

In this case it is observed $n_{0}$ = 17, $n_{1}$ = 18 and $n_{2}$ = 14. We would like to compute the MLEs of the unknown parameters based on the assumption $λ_{0} = λ_{1} = λ$ . It involves solving a one-dimensional optimization problem. The profile log-likelihood function of α has been presented in Figure 2.

Figure 2. — The profile log-likelihood of the synthetic data set.

It is a unimodal function. Based on the profile maximization we obtain the MLEs of α and λ as $\hat{α}$ = 1.912 and $\hat{λ}$ = 1.068. The associate 95% confidence intervals are (1.714, 2.106) and (0.878, 1.245), respectively.

5.2. Synthetic data set 2:

In this section, we analyze one synthetic data set, and it has been generated using the following model specification: α = 3.0, $λ_{0}$ = 0.15, $λ_{1}$ = 0.04 and n = 75. The data set (Data Set 2) has been presented in Figure 3.

Figure 3. — Synthetic data set with α = 3.0, $λ_{0}$ = 0.15 and $λ_{1}$ = 0.04.

Now we would like to compute the MLEs of the unknown parameters. We have adopted the two-dimensional grid search method to compute the MLEs of the unknown parameters. The MLEs of α, $λ_{0}$ and $λ_{1}$ become $\hat{α}$ = 3.344, ${\hat{λ}}_{0}$ = 0.154 and ${\hat{λ}}_{1}$ = 0.029. The associated 95% bootstrap confidence intervals become (2.876, 3.954), (0.137, 0.173) and (0.019, 0.047), respectively.

5.3. Gold price data

In this section, we present the analyses of the gold price data in India for two months period from 1 December 2020 to 31 January 2021. The data represents the price of one gram of gold in Indian rupees. It is presented in Figure 4. In this case n = 62, the minimum, maximum and median values are 4280, 4580 and 4355, respectively. There are 29, 22 and 10 cases, so that ${x_{i} < x_{i + 1}}$ , ${x_{i} > x_{i + 1}}$ and ${x_{i} = x_{i + 1}}$ , respectively. We have performed the run test on the entire data set ${x_{1}, \dots, x_{62}}$ , there are 28 runs, and the associated p value is less than 0.001. Hence, we reject the null hypothesis that they are independently distributed.

Figure 4. — Gold price data in India (rupees/gram) from 1 December 2020 to 31 January 2021.

We have plotted the autocorrelation function (ACF) and the partial autocorrelation (PACF) of the gold price data in Figures 5 and 6, respectively. It is clear from the ACF and PACF that although $x_{i}$ and $x_{i + k}$ are correlated, they are uncorrelated given $x_{i + 1}, \dots, x_{i + k - 1}$ , for $k = 1, 2, \dots$ .

Figure 5. — Autocorrelation function of the gold price data.

Figure 6. — Partial autocorrelation function of the gold price data.

We have performed run tests on two lag-1 series. The number of runs are 16 and 17, respectively. The associated p values are 0.07 and 0.18, respectively. We have performed run tests on three lag-2 series also. The number of runs are 12, 12 and 11, respectively. The associated p values are 0.56, 0.56, 0.25, respectively. Hence, based on the p values we cannot reject the null hypothesis that lag-2 observations are independently distributed. We have fitted Weibull distribution to all the three lag-2 series, the Kolmogorov–Smirnov distances and the associated p values reported in brackets are 0.1453 (0.7920), 0.2163 (0.3066) and 0.2227 (0.2743). Based on the p values we cannot reject the null hypothesis that lag-2 observations are from i.i.d. Weibull distribution.

Now we would like to compute the MLEs of the unknown parameters under the assumption $λ_{1} = λ_{2} = λ$ . It may be mentioned that in this case reasonable estimates of α and λ can be obtained in explicit forms without solving any optimization. In case of a Weibull distribution, the approximate MLEs of the unknown parameters can be obtained in explicit forms by expanding the log-likelihood function using the first-order Taylor series expansion, see for example Kundu and Gupta [9]. Based on this approach we can obtain estimates of α and λ from the odd sequence as well as from the even sequence of the data. By taking the averages of these two estimates, we obtain estimates of α and λ as 2.9878 and 0.1154, respectively.

Now we would like to obtain the MLEs of α and λ by maximizing the log-likelihood function. The profile log-likelihood function of α has been plotted in Figure 7. By maximizing the profile log-likelihood function, we obtain the MLE of α as 2.6640, the MLE of λ as 0.0729 and the associated log-likelihood value becomes −116.9681. Based on parametric bootstrapping the associated 95% confidence intervals of α and λ are (2.1375, 3.1231) and (0.0548, 0.0976), respectively.

Figure 7. — The profile log-likelihood function of α.

Further, we have calculated the MLEs of the unknown parameters when $λ_{0} \neq λ_{1}$ . We have computed the MLEs of the unknown parameters by maximizing the profile log-likelihood of γ and α, i.e. $l (γ, {\hat{λ}}_{1} (γ, α), α)$ , with respect to γ and α. It is being performed by using the grid search method, and the MLEs are as follows: $\hat{α}$ = 3.2238, ${\hat{λ}}_{0}$ = 0.0917, ${\hat{λ}}_{1}$ = 0.0192 and the corresponding log-likelihood value becomes −115.9317. The associated 95% confidence intervals of α, $λ_{0}$ and $λ_{1}$ become (2.6213, 3.8231), (0.0529, 0.1412), (0.0123, 0.0204), respectively.

It is clear that based on the BIC model selection criterion we prefer the model WIP( $α, λ, λ$ ) than WIP $(α, λ_{0}, λ_{1})$ . Now we would like to see whether both the models fit the data or not. We have used the bootstrap method proposed in Section 4 with B = 1000. The histogram of the generated ${w^{b} : 1 \leq b \leq 1000}$ when $λ_{0} = λ_{1} = λ$ and when $λ_{0} \neq λ_{1}$ are provided in Figures 8 and 9, respectively.

Figure 8. — Histogram of the generated test statistics when $λ_{0} = λ_{1}$ .

Figure 9. — Histogram of the generated test statistics when $λ_{0} \neq λ_{1}$ .

The test statistic for the model WIP( $α, λ, λ$ ) is 0.1685, and the associated p-value is greater than 0.90. It seems it provides a good fit for the data set. The test statistic for the model WIP( $α, λ_{0}, λ_{1}$ ) is 0.6291, and the associated p value is less than 0.05. Hence, it does not provide a good fit for the data set.

6. Conclusions

In this paper, we have proposed a new discrete-time and continuous state-space stochastic process based on the Weibull distributions. The distinct feature of the proposed process is that there is a positive probability that $X_{n} = X_{n + 1}$ , for some n. Hence, this model can be used quite effectively when there are ties in the two consecutive time points. We have studied different properties of the proposed process, and also provided the inference procedures of the unknown parameters.

Note that the proposed stochastic process is a lag-1 process, but it can be easily extended to lag-q process as follows: Suppose $U_{0}, U_{1}, U_{2}, \dots$ are independently and identically distributed (i.i.d.) uniform $U (0, 1)$ random variables, and $α > 0, λ_{0} > 0, λ_{1}, \dots, λ_{q} > 0$ . Then

X_{n} = min {{[- \frac{1}{λ_{0}} \ln U_{n}]}^{\frac{1}{α}}, \dots, {[- \frac{1}{λ_{q}} \ln U_{n - q}]}^{\frac{1}{α}}}

is a lag-q stationary Weibull process. It can be easily checked that there is a positive probability that $X_{n} = X_{n + 1} = \dots = X_{n + k}$ , for some n, and for $k = 1, \dots, q$ . It also has a convenient copula structure. It will be interesting to develop different properties and classical inferences of this process. More work is needed in this direction.

Acknowledgements

The author would like to thank two unknown reviewers for their constructive suggestions, which have helped to improve the paper significantly.

Appendices.

Appendix 1. Proofs

Proof Proof of Theorem 2.4 —

Note that p and $S_{a} (x, y)$ can be obtained from $S_{n, n + 1} (x, y)$ as follows:

$p = \int_{0}^{\infty} \int_{0}^{\infty} \frac{\partial^{2} S_{n, n + 1} (x, y)}{\partial x \partial y} d x d y,$

and

$p S_{a} (x, y) = \int_{y}^{\infty} \int_{x}^{\infty} \frac{\partial^{2} S_{n, n + 1} (u, v)}{\partial u \partial v} d u d v .$

Now, from

$\frac{\partial^{2} S_{n, n + 1} (x, y)}{\partial x \partial y} = {\begin{cases} f_{1} (x, y) & if (x, y) \in S_{1} \\ f_{2} (x, y) & if (x, y) \in S_{2}, \end{cases}$

where

$\begin{aligned} f_{1} (x, y) & = f_{W E} (x; α, λ_{1}) f_{W E} (y; α, λ_{0} + λ_{1}) \\ f_{2} (x, y) & = f_{W E} (x; α, λ_{0} + λ_{1}) f_{W E} (y; α, λ_{0}) . \end{aligned}$

Since

$\begin{aligned} \int_{0}^{\infty} \int_{β x}^{\infty} f_{1} (x, y) d y d x & = \frac{λ_{1}^{2}}{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}} and \int_{0}^{\infty} \int_{y / β}^{\infty} f_{2} (x, y) d x d y = \frac{λ_{0}^{2}}{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}, \\ p & = \frac{λ_{0}^{2} + λ_{1}^{2}}{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}} . \end{aligned}$

Using this p, $S_{a} (x, y)$ can be obtained by simple integration, and after that $S_{s} (x, y)$ can be obtained by subtraction.

Alternatively, a simple probabilistic argument also can be given as follows. Suppose A is the following event

$A = {[- \frac{1}{λ_{0}} \ln U_{n}] < [- \frac{1}{λ_{1}} \ln U_{n - 1}]} \cap {[- \frac{1}{λ_{1}} \ln U_{n}] < [- \frac{1}{λ_{0}} \ln U_{n + 1}]},$

then

$P (A) = P (U_{n} > U_{n - 1}^{\frac{λ_{0}}{λ_{1}}}, U_{n} > U_{n + 1}^{\frac{λ_{1}}{λ_{0}}}) = \int_{0}^{1} u^{\frac{λ_{0}}{λ_{1}} + \frac{λ_{1}}{λ_{0}}} d u = \frac{λ_{0} λ_{1}}{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}} = 1 - p .$

Moreover,

$P (X_{n} > x, X_{n + 1} > y) = P (X_{n} > x, X_{n + 1} > y | A) P (A) + P (X_{n} > x, X_{n + 1} > y | A^{'}) P (A^{'}),$

and

$P (X_{n} > x, X_{n + 1} > y | A) = P (U_{n} < e^{- λ_{0} x^{α}}, U_{n} < e^{- λ_{1} y^{α}}) = g (x, y) .$

The rest can be obtained by subtraction.

Proof Proof of Theorem 2.5 —

We need to show that for all $0 < x, y < \infty$ ,

$S_{n, n + 1} (x, y) = \int \int_{B_{1}} f_{1} (u, v) d u d v + \int \int_{B_{2}} f_{2} (u, v) d u d v + \int_{h (x, y)}^{\infty} f_{0} (t) | γ^{'} (t) | d t,$

here for $R (x, y) = {(u, v); x \leq u < \infty, y \leq v < \infty}$ , $B_{1} = R (x, y) \cap S_{1}$ , $B_{2} = R (x, y) \cap S_{2}$ , and $h (x, y) = max {x, \frac{y}{β}}$ . It has already been shown in Theorem 2.4 that

$\iint_{B_{1}} f_{1} (u, v) d u d v + \int \int_{B_{2}} f_{2} (u, v) d u d v = p S_{a} (x, y),$

hence, the result is proved if we can show

$\int_{h (x, y)}^{\infty} f_{0} (t) | γ^{'} (t) | d t = (1 - p) S_{s} (x, y) .$

Since, $| γ^{'} (t) | = β$ and $(1 - p) = \frac{λ_{0} λ_{1}}{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}$ ,

$\begin{aligned} \int_{h (x, y)}^{\infty} f_{0} (t) | γ^{'} (t) | d t & = \int_{h (x, y)} α λ_{0} t^{α - 1} e^{- \frac{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}{λ_{1}} t^{α}} d t = (1 - p) \int_{v (x, y)} e^{- u} d u, \end{aligned}$

where $v (x, y) = max {\frac{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}{λ_{1}} x^{α}, \frac{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}{λ_{0}} y^{α}}$ . Let us remember,

$S_{s} (x, y) = {\begin{cases} e^{- \frac{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}{λ_{0}} y^{α}} & if y > β x \\ e^{- \frac{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}{λ_{1}} x^{α}} & if y < β x . \end{cases}$

Hence, the result follows

Appendix 2. Autocovariance and autocorrelation functions

In this section, we provide all the expressions of the autocorrelation function of the GE process mainly for completeness purposes. First, we will calculate $E (X_{n + 1} X_{n})$ . If $I_{A}$ denotes the indicator function on the set A, then

\begin{aligned} E (X_{n + 1} X_{n}) & = E (X_{n + 1} X_{n} \cdot I_{{β X_{n} < X_{n + 1}}}) + E (X_{n + 1} X_{n} \cdot I_{{β X_{n} > X_{n + 1}}}) \\ + E (X_{n + 1} X_{n} \cdot I_{{β X_{n} = X_{n + 1}}}) \\ = E_{X_{n}} (X_{n} E_{{X_{n + 1} | X_{n}}} (X_{n + 1} \cdot I_{{β X_{n} < X_{n + 1}}} | X_{n}) \\ + E_{X_{n}} (X_{n} E_{{X_{n + 1} | X_{n}}} (X_{n + 1} \cdot I_{{β X_{n} > X_{n + 1}}} | X_{n}) \\ + E_{X_{n}} (X_{n} E_{{X_{n + 1} | X_{n}}} (X_{n + 1} \cdot I_{{β X_{n} = X_{n + 1}}} | X_{n}) . \end{aligned}

Now

\begin{aligned} E_{X_{n}} (X_{n} E_{{X_{n + 1} | X_{n}}} (X_{n + 1} \cdot I_{{β X_{n} = X_{n + 1}}} | X_{n}) & = α β λ_{0} \int_{0}^{\infty} x^{α + 1} e^{- \frac{λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1}}{λ_{1}} x^{α}} d x \\ = Γ (\frac{2}{α} + 1) \frac{(λ_{0} λ_{1})^{1 + \frac{1}{α}}}{(λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1})^{\frac{2}{α} + 1}} \end{aligned}

If we denote $Γ (x, a) = \int_{x}^{\infty} t^{a - 1} e^{- t} d t$ and $γ (x, a) = \int_{0}^{x} t^{a - 1} e^{- t} d t$ as incomplete gamma functions, then

\begin{aligned} E_{X_{n}} (X_{n} E_{{X_{n + 1} | X_{n}}} (X_{n + 1} \cdot I_{{β X_{n} < X_{n + 1}}} | X_{n}) \\ = α^{2} λ_{1} \int_{0}^{\infty} x^{α} e^{- λ_{1} x^{α}} {\int_{β x}^{\infty} y^{α} e^{- (λ_{0} + λ_{1}) y^{α}} d y} d x \\ = \frac{α λ_{1}}{(λ_{0} + λ_{1})^{1 / α}} \int_{0}^{\infty} x^{α} e^{- λ_{1} x^{α}} Γ (\frac{λ_{0} (λ_{0} + λ_{1}) x^{α}}{λ_{1}}, \frac{1}{α} + 1) d x . \\ E_{X_{n}} (X_{n} E_{{X_{n + 1} | X_{n}}} (X_{n + 1} \cdot I_{{β X_{n} > X_{n + 1}}} | X_{n}) \\ = α^{2} λ_{0} (λ_{0} + λ_{1}) \int_{0}^{\infty} x^{α} e^{- (λ_{0} + λ_{1}) x^{α}} {\int_{0}^{β x} y^{α} e^{- λ_{0} y^{α}} d y} d x \\ = \frac{α (λ_{0} + λ_{1})}{λ_{0}^{1 / α}} \int_{0}^{\infty} x^{α} e^{- (λ_{0} + λ_{1}) x^{α}} γ (\frac{λ_{0}^{2} x^{α}}{λ_{1}}, \frac{1}{α} + 1) d x . \end{aligned}

We have already indicated the mean and variance of a Weibull random variable in (2). Now based on the above expressions, the autocovariance and autocorrelation functions can be obtained. In the case of exponential process i.e. when α = 1, the above expressions can be obtained in explicit forms. For example

\begin{aligned} E_{X_{n}} (X_{n} E_{{X_{n + 1} | X_{n}}} (X_{n + 1} \cdot I_{{β X_{n} = X_{n + 1}}} | X_{n}) = \frac{2 (λ_{0} λ_{1})^{2}}{(λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1})^{3}} \\ E_{X_{n}} (X_{n} E_{{X_{n + 1} | X_{n}}} (X_{n + 1} \cdot I_{{β X_{n} < X_{n + 1}}} | X_{n}) \\ = \frac{λ_{1}}{λ_{0} + λ_{1}} \int_{0}^{\infty} x e^{- λ_{1} x} (e^{- \frac{λ_{0} (λ_{0} + λ_{1})}{λ_{1}} x} + \frac{λ_{0} (λ_{0} + λ_{1})}{λ_{1}} x e^{- \frac{λ_{0} (λ_{0} + λ_{1})}{λ_{1}} x}) d x \\ = \frac{λ_{1}^{3}}{(λ_{0} + λ_{1}) (λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1})^{2}} + \frac{2 λ_{0} λ_{1}^{3}}{(λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1})^{3}} \\ E_{X_{n}} (X_{n} E_{{X_{n + 1} | X_{n}}} (X_{n + 1} \cdot I_{{β X_{n} > X_{n + 1}}} | X_{n}) \\ = \frac{λ_{0} + λ_{1}}{λ_{0}} \int_{0}^{\infty} x e^{- (λ_{0} + λ_{1}) x} (1 - e^{- \frac{λ_{0}^{2}}{λ_{1}} x} - \frac{λ_{0}^{2}}{λ_{1}} x e^{- \frac{λ_{0}^{2}}{λ_{1}} x}) d x \\ = \frac{1}{λ_{0} (λ_{0} + λ_{1})} - \frac{λ_{1}^{2} (λ_{0} + λ_{1})}{λ_{0} (λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1})^{2}} - \frac{2 λ_{0} λ_{1}^{2} (λ_{0} + λ_{1})}{(λ_{0}^{2} + λ_{1}^{2} + λ_{0} λ_{1})^{3}} . \end{aligned}

Since,

E (X_{n}) = E (X_{n + 1}) = \frac{1}{λ_{0} + λ} and V (X_{n}) = V (X_{n + 1}) = \frac{1}{(λ_{0} + λ)^{2}},

the autocovariance and autocorrelation can be obtained in explicit forms.

Disclosure statement

No potential conflict of interest was reported by the author(s).

References

1.Arnold B.C., Logistic process involving Markovian minimization, Commun. Stat. – Theor. Meth. 22 (1993), pp. 1699–1707. [Google Scholar]
2.Arnold B.C. and Hallet T.J., A characterization of the Pareto process among stationary processes of the form $X_{n} = c min (X_{n - 1}, Y_{n})$ , Stat. Probab. Lett. 8 (1989), pp. 377–380. [Google Scholar]
3.Arnold B.C. and Robertson C.A., Autoregressive logistic processes, J. Appl. Probab. 26 (1989), pp. 524–531. [Google Scholar]
4.Bemis B., Bain L.J., and Higgins J.J., Estimation and hypothesis testing for the parameters of a bivariate exponential distribution, J. Am. Stat. Assoc. 67 (1972), pp. 927–929. [Google Scholar]
5.Christensen S., Phase-type distributions and optimal stopping for autoregressive processes, J. Appl. Probab. 49 (2012), pp. 22–39. [Google Scholar]
6.Jayakumar K. and Girish Babu M., Some generalizations of Weibull distribution and related processes, J. Stat. Theor. Appl. 14 (2015), pp. 425–434. [Google Scholar]
7.Jose K.K., Ristić M.M., and Joseph A, Marshall-Olkin bivariate Weibull distributions and processes, Stat. Pap. 52 (2011), pp. 789–798. [Google Scholar]
8.Kundu D. and Dey A.K, Estimating the parameters of the Marshall Olkin bivariate Weibull distribution by EM Algorithm, Comput. Stat. Data Anal. 53 (2009), pp. 956–965. [Google Scholar]
9.Kundu D. and Gupta R.D., Estimation of $P (Y < X)$ for Weibull distribution, IEEE Trans. Reliab. 55 (2006), pp. 270–280. [Google Scholar]
10.Kundu D. and Gupta A., Bayes estimation for the Marshall-Olkin bivariate Weibull distribution, Comput. Stat. Data Anal. 57 (2013), pp. 271–281. [Google Scholar]
11.Novikov A. and Shiryaev A., On solution of the optimal stopping problem for processes with independent increments, Stochastics. 79 (2007), pp. 393–406. [Google Scholar]
12.Pillai R.N., Semi-Pareto processes, J. Appl. Probab. 28 (1991), pp. 461–465. [Google Scholar]
13.Sim C.H., Simulation of Weibull and gamma autoregressive stationary processes, Commun. Stat. – Simul. Comput. 15 (1986), pp. 1141–1146. [Google Scholar]
14.L.V, Tavares, An exponential Markovian stationary process, J. Appl. Probab. 17 (1980), pp. 1117–1120. [Google Scholar]
15.Yeh H.C., Arnold B.C., and Robertson C.A, Pareto process, J. Appl. Probab. 25 (1988), pp. 291–301. [Google Scholar]

[CIT0001] 1.Arnold B.C., Logistic process involving Markovian minimization, Commun. Stat. – Theor. Meth. 22 (1993), pp. 1699–1707. [Google Scholar]

[CIT0002] 2.Arnold B.C. and Hallet T.J., A characterization of the Pareto process among stationary processes of the form $X_{n} = c min (X_{n - 1}, Y_{n})$ , Stat. Probab. Lett. 8 (1989), pp. 377–380. [Google Scholar]

[CIT0003] 3.Arnold B.C. and Robertson C.A., Autoregressive logistic processes, J. Appl. Probab. 26 (1989), pp. 524–531. [Google Scholar]

[CIT0004] 4.Bemis B., Bain L.J., and Higgins J.J., Estimation and hypothesis testing for the parameters of a bivariate exponential distribution, J. Am. Stat. Assoc. 67 (1972), pp. 927–929. [Google Scholar]

[CIT0005] 5.Christensen S., Phase-type distributions and optimal stopping for autoregressive processes, J. Appl. Probab. 49 (2012), pp. 22–39. [Google Scholar]

[CIT0006] 6.Jayakumar K. and Girish Babu M., Some generalizations of Weibull distribution and related processes, J. Stat. Theor. Appl. 14 (2015), pp. 425–434. [Google Scholar]

[CIT0007] 7.Jose K.K., Ristić M.M., and Joseph A, Marshall-Olkin bivariate Weibull distributions and processes, Stat. Pap. 52 (2011), pp. 789–798. [Google Scholar]

[CIT0008] 8.Kundu D. and Dey A.K, Estimating the parameters of the Marshall Olkin bivariate Weibull distribution by EM Algorithm, Comput. Stat. Data Anal. 53 (2009), pp. 956–965. [Google Scholar]

[CIT0009] 9.Kundu D. and Gupta R.D., Estimation of $P (Y < X)$ for Weibull distribution, IEEE Trans. Reliab. 55 (2006), pp. 270–280. [Google Scholar]

[CIT0010] 10.Kundu D. and Gupta A., Bayes estimation for the Marshall-Olkin bivariate Weibull distribution, Comput. Stat. Data Anal. 57 (2013), pp. 271–281. [Google Scholar]

[CIT0011] 11.Novikov A. and Shiryaev A., On solution of the optimal stopping problem for processes with independent increments, Stochastics. 79 (2007), pp. 393–406. [Google Scholar]

[CIT0012] 12.Pillai R.N., Semi-Pareto processes, J. Appl. Probab. 28 (1991), pp. 461–465. [Google Scholar]

[CIT0013] 13.Sim C.H., Simulation of Weibull and gamma autoregressive stationary processes, Commun. Stat. – Simul. Comput. 15 (1986), pp. 1141–1146. [Google Scholar]

[CIT0014] 14.L.V, Tavares, An exponential Markovian stationary process, J. Appl. Probab. 17 (1980), pp. 1117–1120. [Google Scholar]

[CIT0015] 15.Yeh H.C., Arnold B.C., and Robertson C.A, Pareto process, J. Appl. Probab. 25 (1988), pp. 291–301. [Google Scholar]

PERMALINK

A stationary Weibull process and its applications

Debasis Kundu

Abstract

1. Introduction

2. Weibull process and its properties

Definition 2.1

Theorem 2.1

Proof.

Theorem 2.2

Proof.

Theorem 2.3

Proof.

Theorem 2.4

Proof.

Theorem 2.5

Proof.

3. Maximum likelihood estimators

4. Goodness of fit

5. Data analysis

5.1. Synthetic data set 1:

Figure 1.

Figure 2.

5.2. Synthetic data set 2:

Figure 3.

5.3. Gold price data

Figure 4.

Figure 5.

Figure 6.

Figure 7.

Figure 8.

Figure 9.

6. Conclusions

Acknowledgements

Appendices.

Appendix 1. Proofs

Proof Proof of Theorem 2.4 —

Proof Proof of Theorem 2.5 —

Appendix 2. Autocovariance and autocorrelation functions

Disclosure statement

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases