Almost unbiased modified ridge-type estimator: An application to tourism sector data in Egypt

Tarek Mahmoud Omara

doi:10.1016/j.heliyon.2022.e10684

Heliyon. 2022 Sep 22;8(9):e10684. doi: 10.1016/j.heliyon.2022.e10684

PMCID: PMC9526164 PMID: 36193526

Abstract

This paper introduces an almost unbiased modified ridge-type estimator (AUMRTE) to address the problems that arise from multicollinearity. The estimator combines the main features of two shrinkage estimators, the modified ridge-type estimator (MRTE) and the almost unbiased estimator (AUE). We studied the theoretical performance of the proposed estimator in terms of the mean square error (MSE) and found that it is superior to the MRTE and to the almost unbiased two-parameter estimator (AUTE). We also ran a simulation study that used the simulated MSE (SMSE), squared bias (SB) and generalized cross-validation (GCV) as criteria to compare the estimators. The simulation results showed that the proposed estimator outperforms the competing estimators across several factors and still works well at high levels of correlation. Finally, we examined the behavior of the proposed estimator on real data by applying it to tourism sector data in Egypt; the results were consistent with the theoretical results.

Keywords: Multicollinearity, Liu-type estimator, Ridge-type estimator, Almost unbiased modified ridge-type estimator

1. Introduction

Multicollinearity arises when the explanatory variables are highly correlated. It harms the ordinary least squares (OLS) estimator, since it inflates the MSE and makes the estimator unstable (Yalian and Hu, 2012). The ridge estimator (RE), introduced by Hoerl and Kennard (1970), is a stable way to address multicollinearity. This estimator is biased; it adds information to overcome the ill-conditioning of the $(X^{T} X)$ matrix. In the same context, Liu (1993) proposed a biased estimator, called the Liu estimator (LE), that combines the Stein estimator introduced by Stein (1956) with the RE. At high levels of multicollinearity the matrix $(X^{T} X)$ suffers from ill-conditioning with a large condition number, and a small value of the ridge parameter cannot reduce the condition number enough to overcome it. Therefore, Liu (2003) introduced the Liu-type estimator (LTE), which depends on two parameters that work together to reduce the condition number and, at the same time, improve the fit and the properties of the estimator (Zhai et al., 2020). Ozkale and Kaciranlar (2007) suggested the two-parameter estimator (TE), which contains the OLS, RE and Liu estimators as special cases. The RE and LE depend on the OLS estimator, so they can be used at low levels of multicollinearity. In contrast, the TE and LTE can depend on any estimator, whether OLS or another one, so they can be used at any level of multicollinearity. Sakallıoglu and Kaçıranlar (2008) and Yang and Chang (2010) modified the LTE so that it depends on the RE. This biased estimator is more efficient than the RE, LE and LTE. Furthermore, Omara (2019) modified the TE so that it depends on the RE. In addition, Yang and Chang (2010) developed a two-parameter estimator. Aslam and Ahmed (2020) suggested a class of biased estimators, the modified two-parameter estimator.
Dorugade (2014) introduced a new biased estimator called the ridge-type estimator (RTE). Lukman et al. (2019) modified the ridge-type estimator and proposed a new biased estimator called the modified ridge-type estimator (MRTE). At the same time, Lukman et al. (2019) modified the ridge-type estimator with new prior information.

On the other hand, many studies aim to reduce the bias of an estimator while keeping its MSE small. The almost unbiased estimator is an important biased estimator that is used to reduce the bias of shrinkage estimators. There is a continuing need to improve the almost unbiased estimator so that it copes with multicollinearity. In this direction, the statistical literature improves the performance of the almost unbiased estimator by replacing the OLS estimator with more efficient shrinkage estimators. In this context, Singh et al. (1986a, 1986b) suggested the almost unbiased ridge estimator. Furthermore, Singh et al. (1986a, 1986b) presented the almost unbiased Liu estimator. In addition, Akdeniz and Kaciranlar (1995) suggested the almost unbiased generalized Liu estimator (AUGLE). For more on almost unbiased estimators, we refer readers to Alheety and Kibria (2009), Alheety et al. (2021), Algamal (2021), Al-Taweel and Algamal (2020) and Al-Taweel and Algamal (2022). This study suggests a new almost unbiased shrinkage estimator, which we call the almost unbiased modified ridge-type estimator (AUMRTE). This estimator merges the almost unbiased Liu estimator (AULE) with the MRTE.

The study is organized as follows. Section 2 presents the model (subsection 2.1), the proposed estimator (subsection 2.2) and the choice of the biasing parameters (subsection 2.3). Section 3 compares the estimators in terms of the MSE. Section 4 gives the simulation study and its results. Section 5 analyzes real data. Finally, section 6 gives the conclusions.

2. Methodology

2.1. The regression model and shrinkage estimators

Consider the linear regression model:

Y = X β + μ

(1)

where Y is an $n \times 1$ vector of responses, X is an $n \times p$ matrix of explanatory variables, β is a $p \times 1$ vector of coefficients and μ is an $n \times 1$ vector of error terms that follows a normal distribution, $μ \sim N (0, σ^{2} I_{n})$ . We simplify the linear regression model by using the canonical form. Since the matrix $(X^{T} X)$ is symmetric, $Λ = Z^{T} Z = H^{T} X^{T} X H = diag (λ_{1}, λ_{2}, \dots, λ_{p})$ , where H is the orthogonal matrix of eigenvectors and $λ_{i}$ , $i = 1, 2, \dots, p$ , are the eigenvalues of the $(X^{T} X)$ matrix. The canonical model for the model in Eq. (1) is:

Y = Z γ + μ

(2)

where $Z = X H$ and $γ = H^{T} β$ .

For Eq. (2), Hoerl and Kennard (1970) suggested the RE to overcome multicollinearity, which takes the form:

{\hat{γ}}_{RE} (k) = {(Λ + k I)}^{- 1} Z^{T} Y = H_{RE} {\hat{γ}}_{OLS}, \quad k > 0

where k is the ridge parameter, $H_{RE} = {(Λ + k I)}^{- 1} Λ$ and ${\hat{γ}}_{OLS} = Λ^{- 1} Z^{T} Y$ is the OLS estimator.

To overcome high levels of multicollinearity, Liu (1993) suggested the LE, which takes the form:

{\hat{γ}}_{LE} (d) = {(Λ + I)}^{- 1} (Z^{T} Y + d {\hat{γ}}_{OLS}) = H_{LE} {\hat{γ}}_{OLS}, \quad 0 < d < 1

where d is the biasing parameter and $H_{LE} = {(Λ + I)}^{- 1} (Λ + d I)$ .

To obtain a biased estimator that is better able to deal with multicollinearity, Liu (2003) introduced the LTE as:

{\hat{γ}}_{LTE} (k, d) = {(Λ + k I)}^{- 1} (Z^{T} Y - d b^{⁎}), \quad k > 0, - \infty < d < \infty

where $b^{⁎}$ is any estimator of the coefficient vector.

Ozkale and Kaciranlar (2007) suggested the two-parameter estimator, which takes the form:

{\hat{γ}}_{TE} (k, d) = {(Λ + k I)}^{- 1} (Z^{T} Y + k d {\hat{γ}}_{OLS}) = H_{TE} {\hat{γ}}_{OLS}, \quad k > 0, 0 < d < 1

where $H_{TE} = {(Λ + k I)}^{- 1} (Λ + k d I)$ .

Dorugade (2014) modified the RE and suggested the RTE, defined as:

{\hat{γ}}_{RTE} (k, d) = {(Λ + k d I)}^{- 1} Z^{T} Y, \quad k > 0, 0 < d < 1

Lukman et al. (2019) modified the RTE and suggested the MRTE as:

{\hat{γ}}_{MRTE} (k, d) = {(Λ + k (1 + d) I)}^{- 1} Z^{T} Y = H_{MRTE} {\hat{γ}}_{OLS}, \quad k > 0, 0 < d < 1

(3)

where $H_{MRTE} = {(Λ + k (1 + d) I)}^{- 1} Λ$ . The expectation, bias, covariance and MSE of the MRTE are:

E ({\hat{γ}}_{MRTE} (k, d)) = H_{MRTE} γ

Bias ({\hat{γ}}_{MRTE} (k, d)) = (H_{MRTE} - I) γ

Cov ({\hat{γ}}_{MRTE} (k, d)) = σ^{2} H_{MRTE} Λ^{- 1} H_{MRTE}^{T}

(4)

MSE ({\hat{γ}}_{MRTE} (k, d)) = σ^{2} H_{MRTE} Λ^{- 1} H_{MRTE}^{T} + (H_{MRTE} - I) γ γ^{T} {(H_{MRTE} - I)}^{T}

MSE ({\hat{γ}}_{MRTE} (k, d)) = σ^{2} \sum_{i = 1}^{p} \frac{λ_{i}}{{(λ_{i} + k (1 + d))}^{2}} + \sum_{i = 1}^{p} \frac{k^{2} {(1 + d)}^{2} γ_{i}^{2}}{{(λ_{i} + k (1 + d))}^{2}}
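The scalar MSE of the MRTE can be evaluated directly from the eigenvalues. The following is a minimal sketch, not the author's code: the function names and the numerical values of $λ_{i}$ , $γ_{i}$ and $σ^{2}$ are hypothetical, and the squared-bias term is computed from the bias $(H_{MRTE} - I) γ$ , whose i-th component is $- k (1 + d) γ_{i} / (λ_{i} + k (1 + d))$ .

```python
def ols_mse(lams, sigma2):
    """Scalar MSE of OLS in canonical form: sigma^2 * sum(1 / lambda_i)."""
    return sigma2 * sum(1.0 / lam for lam in lams)


def mrte_mse(lams, gammas, sigma2, k, d):
    """Scalar MSE of MRTE: variance term plus squared-bias term."""
    c = k * (1.0 + d)  # amount added to every eigenvalue
    variance = sigma2 * sum(lam / (lam + c) ** 2 for lam in lams)
    sq_bias = sum((c / (lam + c)) ** 2 * g ** 2 for lam, g in zip(lams, gammas))
    return variance + sq_bias


# Hypothetical ill-conditioned example: one eigenvalue is close to zero.
lams = [10.0, 1.0, 0.01]
gammas = [1.0, 1.0, 1.0]
print("OLS :", ols_mse(lams, sigma2=1.0))
print("MRTE:", mrte_mse(lams, gammas, sigma2=1.0, k=0.1, d=0.5))
```

With a near-zero eigenvalue the OLS variance term dominates, which is the situation in which shrinkage is expected to pay off.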

Kadiyala (1984) suggested a class of almost unbiased estimators (AUE), defined as:

{\hat{γ}}_{AUE} = [I + {(Λ + I)}^{- 1} (1 - d)] {\hat{γ}}_{OLS}, \quad 0 < d < 1

(5)

There is a continuing need to improve the almost unbiased estimators that are used to overcome multicollinearity. In this direction, the statistical literature improves the performance of these estimators by replacing the OLS estimator with more efficient shrinkage estimators. Akdeniz and Kaciranlar (1995) suggested the almost unbiased generalized Liu estimator (AUGLE), defined as:

{\hat{γ}}_{AUGLE} = [I + (I - D) {(Λ + I)}^{- 1}] {\hat{γ}}_{LE} = [I - {(I - D)}^{2} {(Λ + I)}^{- 2}] {\hat{γ}}_{OLS}, \quad 0 < d_{i} < 1

where $D = diag (d_{1}, d_{2}, \dots, d_{p})$ .

Wu and Young (2013) introduced the almost unbiased two-parameter estimator (AUTE) as:

{\hat{γ}}_{AUTE} (k, d) = [I + k (1 - d) {(Λ + k I)}^{- 1}] {\hat{γ}}_{TE} (k, d) = [I - k^{2} {(1 - d)}^{2} {(Λ + k I)}^{- 2}] {\hat{γ}}_{OLS} = H_{AUTE} {\hat{γ}}_{OLS}, \quad k > 0, 0 < d < 1

where $H_{AUTE} = [I - k^{2} {(1 - d)}^{2} {(Λ + k I)}^{- 2}]$ . Its expectation, bias, covariance and MSE are:

E ({\hat{γ}}_{AUTE} (k, d)) = H_{AUTE} γ

Bias ({\hat{γ}}_{AUTE} (k, d)) = (H_{AUTE} - I) γ

Cov ({\hat{γ}}_{AUTE} (k, d)) = σ^{2} H_{AUTE} Λ^{- 1} H_{AUTE}

(6)

MSE ({\hat{γ}}_{AUTE} (k, d)) = σ^{2} H_{AUTE} Λ^{- 1} H_{AUTE} + (H_{AUTE} - I) γ γ^{T} {(H_{AUTE} - I)}^{T}

MSE ({\hat{γ}}_{AUTE} (k, d)) = σ^{2} \sum_{i = 1}^{p} \frac{1}{λ_{i}} {(1 - \frac{k^{2} {(1 - d)}^{2}}{{(λ_{i} + k)}^{2}})}^{2} + \sum_{i = 1}^{p} {(\frac{k^{2} {(1 - d)}^{2}}{{(λ_{i} + k)}^{2}})}^{2} γ_{i}^{2}

2.2. The proposed estimator:

To begin, we recall the definition of an almost unbiased estimator given by Xu and Yang (2011).

Definition

Let $\hat{β}$ be a biased estimator of β such that $Bias (\hat{β}) = E (\hat{β}) - β = H β$ , so that $E (\hat{β} - H β) = β$ . Replacing the unknown β in the correction term by $\hat{β}$ , we define the almost unbiased estimator as $\tilde{β} = \hat{β} - H \hat{β} = (I - H) \hat{β}$ .
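To see why the estimator in this definition is called almost unbiased, one can compute its bias directly. The short derivation below is a sketch that assumes H is a fixed (non-random) matrix and uses $E (\hat{β}) = (I + H) β$ , which follows from the definition of the bias:

```latex
E(\tilde{β}) = (I - H) E(\hat{β}) = (I - H)(I + H) β = (I - H^{2}) β,
\qquad
Bias(\tilde{β}) = E(\tilde{β}) - β = - H^{2} β .
```

When H is small in norm (less than one), the bias of $\tilde{β}$ is therefore of second order in H, whereas the bias of $\hat{β}$ is of first order.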

Because the MRTE is preferred over the RE, LE and RTE, it is a natural candidate to merge with the AUE. Using Eq. (3) and Eq. (5), we define the new biased estimator (AUMRTE) as:

${\hat{γ}}_{AUMRTE} (k, d) = [I - ({(Λ + k (1 + d) I)}^{- 1} Λ - I)] {\hat{γ}}_{MRTE} = [2 I - {(Λ + k (1 + d) I)}^{- 1} Λ] {(Λ + k (1 + d) I)}^{- 1} Λ {\hat{γ}}_{OLS} = [I - k^{2} {(1 + d)}^{2} {(Λ + k (1 + d) I)}^{- 2}] {\hat{γ}}_{OLS} = H_{AUMRTE} {\hat{γ}}_{OLS}, \quad k > 0, 0 < d < 1$

The expectation, bias, covariance and MSE of the new estimator are:

$E ({\hat{γ}}_{AUMRTE} (k, d)) = H_{AUMRTE} γ$

$b i a s ({\hat{γ}}_{AUMRTE} (k, d)) = (H_{AUMRTE} - I) γ$

$C o v ({\hat{γ}}_{AUMRTE} (k, d)) = σ^{2} H_{AUMRTE} Λ^{- 1} H_{AUMRTE}$ (7)

$M S E ({\hat{γ}}_{AUMRTE} (k, d)) = σ^{2} H_{AUMRTE} Λ^{- 1} H_{AUMRTE} + (H_{AUMRTE} - I) {γ γ}^{T} {(H_{AUMRTE} - I)}^{T} = σ^{2} \sum_{i = 1}^{p} \frac{1}{λ_{i}} {(1 - \frac{k^{2} {(1 + d)}^{2}}{{(λ_{i} + k (1 + d))}^{2}})}^{2} + \sum_{i = 1}^{p} {(\frac{k^{2} {(1 + d)}^{2}}{{(λ_{i} + k (1 + d))}^{2}})}^{2} γ_{i}^{2}$ (8)
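Because all matrices in Eq. (8) are diagonal, the AUMRTE can be handled one eigenvalue at a time. The following is a minimal sketch under that observation; the function names are hypothetical and the numbers are illustrative only. It also checks numerically that the two forms of the shrinkage factor in the definition of the AUMRTE agree, that is $(2 - h_{i}) h_{i} = 1 - v_{i}^{2}$ with $h_{i} = λ_{i} / (λ_{i} + k (1 + d))$ and $v_{i} = k (1 + d) / (λ_{i} + k (1 + d))$ .

```python
def aumrte_factor(lam, k, d):
    """Diagonal entry of H_AUMRTE for eigenvalue lam: 1 - v^2."""
    c = k * (1.0 + d)
    v = c / (lam + c)
    return 1.0 - v ** 2


def aumrte_mse(lams, gammas, sigma2, k, d):
    """Scalar MSE of AUMRTE following Eq. (8)."""
    factors = [aumrte_factor(lam, k, d) for lam in lams]
    variance = sigma2 * sum(f ** 2 / lam for f, lam in zip(factors, lams))
    # The bias component is (factor - 1) * gamma_i = -v_i^2 * gamma_i.
    sq_bias = sum((1.0 - f) ** 2 * g ** 2 for f, g in zip(factors, gammas))
    return variance + sq_bias


# Check the algebraic identity used in the definition of AUMRTE.
lam, k, d = 2.0, 0.5, 0.3
h = lam / (lam + k * (1.0 + d))  # MRTE shrinkage factor
print((2.0 - h) * h, aumrte_factor(lam, k, d))

# Illustrative MSE value for hypothetical eigenvalues and coefficients.
print(aumrte_mse([10.0, 1.0, 0.01], [1.0, 1.0, 1.0], 1.0, k=0.1, d=0.5))
```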

The AUMRTE has some special cases, as follows:

a. If $k = 0$ and $d = 0$ then ${\hat{γ}}_{AUMRTE} (0, 0) = {\hat{γ}}_{OLS}$ .

b. If $d = 1$ and k is replaced by $\frac{k}{2}$ then ${\hat{γ}}_{AUMRTE} (\frac{k}{2}, 1) = {\hat{γ}}_{AURE} (k)$ .

c. If $d + 1 = d$ , then ${\hat{γ}}_{AUMRTE} (k, d) = {\hat{γ}}_{AUTE} (k)$ .

2.3. Choosing the shrinkage parameters ( $k, d$ )

For the proposed estimator, we now describe how the shrinkage parameters $k, d$ are chosen. Let $e = 1 + d$ and write $MSE ({\hat{γ}}_{AUMRTE} (k, d)) = σ^{2} \sum_{i = 1}^{p} \frac{1}{λ_{i}} {(1 - v_{i}^{2})}^{2} + \sum_{i = 1}^{p} v_{i}^{4} γ_{i}^{2}$ where $v_{i} = k (1 + d) / (λ_{i} + k (1 + d)) = k e / (λ_{i} + k e)$ . To obtain the optimal value of d, fix k, differentiate the MSE in Eq. (8) with respect to d and set the result to zero.

\frac{\partial M S E ({\hat{γ}}_{AUMRTE} (k, d))}{\partial d} = \frac{\partial M S E ({\hat{γ}}_{AUMRTE} (k, d))}{\partial v_{i}} \frac{\partial v_{i}}{\partial d} = 0

Since $\partial v_{i} / \partial d = k λ_{i} / {(λ_{i} + k (1 + d))}^{2} \neq 0$ , then

\frac{\partial MSE ({\hat{γ}}_{AUMRTE} (k, d))}{\partial v_{i}} = - 4 σ^{2} \sum_{i = 1}^{p} \frac{v_{i} (1 - v_{i}^{2})}{λ_{i}} + 4 \sum_{i = 1}^{p} v_{i}^{3} γ_{i}^{2} = 0

- σ^{2} \sum_{i = 1}^{p} (1 - v_{i}^{2}) + \sum_{i = 1}^{p} λ_{i} v_{i}^{2} γ_{i}^{2} = 0

- σ^{2} + σ^{2} v_{i}^{2} + λ_{i} v_{i}^{2} γ_{i}^{2} = 0

(9)

Since $v_{i} = k e / (λ_{i} + k e)$ , then for Eq. (9), we find

e_{o p t} = \frac{λ_{i} \sqrt{\frac{σ^{2}}{σ^{2} + λ_{i} γ_{i}^{2}}}}{k (1 - \sqrt{\frac{σ^{2}}{σ^{2} + λ_{i} γ_{i}^{2}}})}

and

d_{o p t} = \frac{λ_{i} \sqrt{\frac{σ^{2}}{σ^{2} + λ_{i} γ_{i}^{2}}} - k (1 - \sqrt{\frac{σ^{2}}{σ^{2} + λ_{i} γ_{i}^{2}}})}{k (1 - \sqrt{\frac{σ^{2}}{σ^{2} + λ_{i} γ_{i}^{2}}})}

(10)

For practical purposes, $σ^{2}$ and $γ_{i}^{2}$ are replaced with their estimates ${\hat{σ}}^{2}$ and ${\hat{γ}}_{i}^{2}$ . Then Eq. (10) becomes

d_{o p t} = \frac{λ_{i} \sqrt{\frac{{\hat{σ}}^{2}}{{\hat{σ}}^{2} + λ_{i} {\hat{γ}}_{i}^{2}}} - k (1 - \sqrt{\frac{{\hat{σ}}^{2}}{{\hat{σ}}^{2} + λ_{i} {\hat{γ}}_{i}^{2}}})}{k (1 - \sqrt{\frac{{\hat{σ}}^{2}}{{\hat{σ}}^{2} + λ_{i} {\hat{γ}}_{i}^{2}}})}

(11)

To obtain the optimal value of k, fix d, differentiate the MSE in Eq. (8) with respect to k and set the result to zero.

\frac{\partial M S E ({\hat{γ}}_{AUMRTE} (k, d))}{\partial k} = \frac{\partial M S E ({\hat{γ}}_{AUMRTE} (k, d))}{\partial v_{i}} \frac{\partial v_{i}}{\partial k} = 0

Since $\partial v_{i} / \partial k = λ_{i} (1 + d) / {(λ_{i} + k (1 + d))}^{2} \neq 0$ , then for Eq. (9), we get

k_{o p t} = \frac{λ_{i} \sqrt{\frac{σ^{2}}{σ^{2} + λ_{i} γ_{i}^{2}}}}{(d + 1) (1 - \sqrt{\frac{σ^{2}}{σ^{2} + λ_{i} γ_{i}^{2}}})}

(12)

Following Kibria (2003), who proposed the arithmetic mean of the k values, we propose the arithmetic mean of $k_{o p t}$ as

k_{o p t} = \frac{1}{p} \sum_{i = 1}^{p} \frac{λ_{i} \sqrt{\frac{σ^{2}}{σ^{2} + λ_{i} γ_{i}^{2}}}}{(d + 1) (1 - \sqrt{\frac{σ^{2}}{σ^{2} + λ_{i} γ_{i}^{2}}})}

(13)

For practical purposes, $σ^{2}$ and $γ_{i}^{2}$ are replaced with their estimates ${\hat{σ}}^{2}$ and ${\hat{γ}}_{i}^{2}$ . Then Eq. (13) becomes

k_{o p t} = \frac{1}{p} \sum_{i = 1}^{p} \frac{λ_{i} \sqrt{\frac{{\hat{σ}}^{2}}{{\hat{σ}}^{2} + λ_{i} {\hat{γ}}_{i}^{2}}}}{(d + 1) (1 - \sqrt{\frac{{\hat{σ}}^{2}}{{\hat{σ}}^{2} + λ_{i} {\hat{γ}}_{i}^{2}}})}

(14)
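The plug-in rules in Eq. (11) and Eq. (14) are simple to evaluate once estimates of $σ^{2}$ and $γ_{i}$ are available. The following is a minimal sketch, assuming those estimates are supplied by the user; the function names and inputs are hypothetical. Note that the component-wise value from Eq. (11) depends on i and need not fall inside $(0, 1)$ , so how it is combined or truncated in practice is left open here.

```python
import math


def target_v(lam, gamma_hat, sigma2_hat):
    """Root of Eq. (9): v_i = sqrt(sigma^2 / (sigma^2 + lambda_i * gamma_i^2))."""
    return math.sqrt(sigma2_hat / (sigma2_hat + lam * gamma_hat ** 2))


def d_opt(lam, gamma_hat, sigma2_hat, k):
    """Component-wise d from Eq. (11) for a fixed k (requires gamma_hat != 0)."""
    w = target_v(lam, gamma_hat, sigma2_hat)
    return (lam * w - k * (1.0 - w)) / (k * (1.0 - w))


def k_opt(lams, gammas_hat, sigma2_hat, d):
    """Arithmetic-mean k from Eq. (14) for a fixed d."""
    terms = []
    for lam, g in zip(lams, gammas_hat):
        w = target_v(lam, g, sigma2_hat)
        terms.append(lam * w / ((d + 1.0) * (1.0 - w)))
    return sum(terms) / len(terms)


# Hypothetical inputs.
lams = [10.0, 1.0, 0.01]
gammas_hat = [1.0, 0.5, 2.0]
k = k_opt(lams, gammas_hat, sigma2_hat=1.0, d=0.5)
print(k, [d_opt(lam, g, 1.0, k) for lam, g in zip(lams, gammas_hat)])
```

A quick sanity check is that plugging the component-wise d back into $v_{i} = k (1 + d) / (λ_{i} + k (1 + d))$ reproduces the root of Eq. (9).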

Generalized cross-validation (GCV) is an important method for obtaining the optimal shrinkage parameters k and d. It balances the prediction accuracy of the estimator against the bias caused by shrinkage (Arashi et al., 2021). The GCV has received attention in the statistical literature: Omara (2019) used GCV to choose the shrinkage parameters of the two-parameter ridge estimator, and Roozbeh et al. (2020) used GCV as a criterion to compare estimators in ridge rank regression. The GCV has good properties and is also simple to use. We can choose the parameters $k_{G C V}$ and $d_{G C V}$ of the proposed estimator by minimizing the following unobservable risk function, defined as

R_{AUMRTE} (d, k) = \frac{1}{n} {(y - \hat{y})}^{T} (y - \hat{y})

where $\hat{y} = Z {\hat{γ}}_{AUMRTE} (k, d) = L (k, d) Y$ and $L (k, d) = Z [I - k^{2} {(1 + d)}^{2} {(Λ + k (1 + d) I)}^{- 2}] {(Z^{T} Z)}^{- 1} Z^{T}$ , which can be regarded as the almost unbiased modified ridge-type hat matrix of y. The $GCV (k, d)$ is then defined as

GCV (k, d) = \frac{{‖ y - Z {\hat{γ}}_{AUMRTE} (k, d) ‖}^{2}}{{(1 - n^{- 1} tr (L (k, d)))}^{2}} = \frac{{‖ (I - L (k, d)) y ‖}^{2}}{{(1 - n^{- 1} tr (L (k, d)))}^{2}}
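In canonical form the GCV criterion can be computed without forming the hat matrix. The following is a minimal sketch, not the author's Matlab code. It relies on two facts that follow from the definitions above: $tr (L (k, d)) = \sum_{i} (1 - v_{i}^{2})$ , and, because the OLS residual is orthogonal to the columns of Z, the residual sum of squares equals the OLS residual sum of squares plus $\sum_{i} λ_{i} v_{i}^{4} {\hat{γ}}_{OLS, i}^{2}$ . The function names, the grid and all numerical inputs are hypothetical.

```python
def aumrte_gcv(k, d, lams, gamma_ols, rss_ols, n):
    """GCV(k, d) for AUMRTE computed from canonical-form quantities."""
    c = k * (1.0 + d)
    v = [c / (lam + c) for lam in lams]
    # Residual sum of squares: OLS part plus the part induced by shrinkage.
    rss = rss_ols + sum(lam * vi ** 4 * g ** 2
                        for lam, vi, g in zip(lams, v, gamma_ols))
    # tr(L) is the sum of the diagonal entries 1 - v_i^2 of H_AUMRTE.
    trace = sum(1.0 - vi ** 2 for vi in v)
    return rss / (1.0 - trace / n) ** 2


def best_k_on_grid(d, lams, gamma_ols, rss_ols, n, k_grid):
    """Grid value of k that minimizes GCV for a fixed d."""
    return min(k_grid, key=lambda k: aumrte_gcv(k, d, lams, gamma_ols, rss_ols, n))


# Hypothetical inputs: eigenvalues, canonical OLS estimates, OLS RSS, sample size.
lams = [10.0, 1.0, 0.01]
gamma_ols = [1.0, 0.5, 2.0]
k_grid = [0.001 * j for j in range(501)]
k_gcv = best_k_on_grid(0.5, lams, gamma_ols, rss_ols=4.0, n=20, k_grid=k_grid)
print(k_gcv, aumrte_gcv(k_gcv, 0.5, lams, gamma_ols, 4.0, 20))
```

Since the criterion depends on k and d only through the product $k (1 + d)$ , the sketch searches over k for a fixed d; searching over d for a fixed k works the same way.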

Various other methods exist for estimating the shrinkage parameters; see, for example, Suhail and Kibria (2021) and Lukman et al. (2019), among others.

3. The MSE comparison

In this section, we compare the AUMRTE with the AUTE and the MRTE in terms of the MSE. To check the superiority of the proposed estimator, we need the following lemmas:

Lemma 1

Let C be a positive definite (p.d.) matrix and c be a vector; then $C - c c^{T} \geq 0$ if and only if $c^{T} C^{- 1} c \leq 1$ . See Farebrother (1976).

Lemma 2

Let ${\hat{β}}_{i} = H_{i} y$ , $i = 1, 2$ , be two estimators of β with bias vectors $b_{i} = Bias ({\hat{β}}_{i})$ . Assume that $\nabla = Cov ({\hat{β}}_{1}) - Cov ({\hat{β}}_{2})$ is p.d.; then $MSE ({\hat{β}}_{1}) - MSE ({\hat{β}}_{2}) \geq 0$ if and only if $b_{2}^{T} {(\nabla + b_{1} b_{1}^{T})}^{- 1} b_{2} \leq 1$ . See Trenkler and Toutenburg (1990).

Theorem 1

Consider the two estimators ${\hat{γ}}_{AUTE} (k, d)$ and ${\hat{γ}}_{AUMRTE} (k, d)$ of γ. Assume that $\nabla_{1} = Cov ({\hat{γ}}_{AUTE} (k, d)) - Cov ({\hat{γ}}_{AUMRTE} (k, d))$ is p.d.; then $MSE ({\hat{γ}}_{AUTE} (k, d)) - MSE ({\hat{γ}}_{AUMRTE} (k, d)) \geq 0$ if and only if $γ^{T} {(H_{AUMRTE} - I)}^{T} {(\nabla_{1} + (H_{AUTE} - I) γ γ^{T} {(H_{AUTE} - I)}^{T})}^{- 1} (H_{AUMRTE} - I) γ \leq 1$ .

Proof

From Eq. (6) and (7), we find that:

$\nabla_{1} = Cov ({\hat{γ}}_{AUTE} (k, d)) - Cov ({\hat{γ}}_{AUMRTE} (k, d)) = σ^{2} \, diag {\{ \frac{1}{λ_{i}} [{(1 - \frac{k^{2} {(1 - d)}^{2}}{{(λ_{i} + k)}^{2}})}^{2} - {(1 - \frac{k^{2} {(1 + d)}^{2}}{{(λ_{i} + k (1 + d))}^{2}})}^{2}] \}}_{i = 1}^{p}$

Since both ratios inside the brackets lie in $(0, 1)$ , this diagonal matrix is p.d. if and only if, for every i,

$\frac{k^{2} {(1 - d)}^{2}}{{(λ_{i} + k)}^{2}} - \frac{k^{2} {(1 + d)}^{2}}{{(λ_{i} + k (1 + d))}^{2}} < 0$

$\frac{{(1 - d)}^{2} {(λ_{i} + k (1 + d))}^{2} - {(1 + d)}^{2} {(λ_{i} + k)}^{2}}{{(λ_{i} + k)}^{2} {(λ_{i} + k (1 + d))}^{2}} < 0$

This inequality is true if and only if

${(1 - d)}^{2} {(λ_{i} + k (1 + d))}^{2} - {(1 + d)}^{2} {(λ_{i} + k)}^{2} < 0$

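This inequality holds for $k > 0$ and $0 < d < 1$ , since both $(1 - d) (λ_{i} + k (1 + d))$ and $(1 + d) (λ_{i} + k)$ are positive and their difference is $(1 + d) (λ_{i} + k) - (1 - d) (λ_{i} + k (1 + d)) = 2 d λ_{i} + k d (1 + d) > 0$ .

By Lemma 2, the proof is complete.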
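The key inequality in the proof of Theorem 1 can also be checked numerically. The following is a minimal sketch that evaluates the expression $(1 - d)^{2} (λ_{i} + k (1 + d))^{2} - (1 + d)^{2} (λ_{i} + k)^{2}$ over a hypothetical grid of positive eigenvalues, $k > 0$ and $0 < d < 1$ ; the function name and the grid values are illustrative assumptions, and a grid check is of course no substitute for the algebraic argument.

```python
from itertools import product


def theorem1_expression(lam, k, d):
    """Left-hand side of the final inequality in the proof of Theorem 1."""
    return ((1.0 - d) ** 2 * (lam + k * (1.0 + d)) ** 2
            - (1.0 + d) ** 2 * (lam + k) ** 2)


# Hypothetical grid covering well-conditioned and ill-conditioned cases.
lam_grid = [0.001, 0.1, 1.0, 10.0, 1000.0]
k_grid = [0.01, 0.1, 1.0, 10.0]
d_grid = [0.1, 0.5, 0.9]

values = [theorem1_expression(lam, k, d)
          for lam, k, d in product(lam_grid, k_grid, d_grid)]
print("largest value on the grid:", max(values))
print("all negative:", all(v < 0 for v in values))
```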

Theorem 2

Consider the two estimators ${\hat{γ}}_{MRTE} (k, d)$ and ${\hat{γ}}_{AUMRTE} (k, d)$ of γ. Assume that $\nabla_{2} = Cov ({\hat{γ}}_{MRTE} (k, d)) - Cov ({\hat{γ}}_{AUMRTE} (k, d))$ is p.d.; then $MSE ({\hat{γ}}_{MRTE} (k, d)) - MSE ({\hat{γ}}_{AUMRTE} (k, d)) \geq 0$ if and only if

$γ^{T} {(H_{AUMRTE} - I)}^{T} {(\nabla_{2} + (H_{MRTE} - I) γ γ^{T} {(H_{MRTE} - I)}^{T})}^{- 1} (H_{AUMRTE} - I) γ \leq 1 .$

Proof

From Eq. (4) and (7), we find that:

$\nabla_{2} = C o v ({\hat{γ}}_{MRT E} (k, d)) - C o v ({\hat{γ}}_{AUMRTE} (k, d)) = σ^{2} \sum_{i = 1}^{p} [{(\frac{λ_{i}}{{(λ_{i} + k (1 + d))}^{2}})}^{2} - \frac{1}{λ_{i}} {(1 - \frac{k^{2} {(1 + d)}^{2}}{{(λ_{i} + k (1 + d))}^{2}})}^{2}] = σ^{2} \sum_{i = 1}^{p} [{(\frac{λ_{i}}{{(λ_{i} + k (1 + d))}^{2}})}^{2} - \frac{1}{λ_{i}} {(\frac{{(λ_{i} + k (1 + d))}^{2} - k^{2} {(1 + d)}^{2}}{{(λ_{i} + k (1 + d))}^{2}})}^{2}] = σ^{2} \sum_{i = 1}^{p} [\frac{λ_{i}^{3} {(λ_{i} + k (1 + d))}^{4} - {({(λ_{i} + k (1 + d))}^{2} - k^{2} {(1 + d)}^{2})}^{2}}{λ_{i} {(λ_{i} + k (1 + d))}^{4}}]$

This inequality is true if and only if

$λ_{i}^{3} {(λ_{i} + k (1 + d))}^{4} - {({(λ_{i} + k (1 + d))}^{2} - k^{2} {(1 + d)}^{2})}^{2} > 0$

$λ_{i}^{3} {(λ_{i} + k (1 + d))}^{4} - {(λ_{i} + k (1 + d))}^{4} + 2 k^{2} {(λ_{i} + k (1 + d))}^{2} {(1 + d)}^{2} - k^{4} {(1 + d)}^{4} > 0$

$(λ_{i}^{3} - 1) {(λ_{i} + k (1 + d))}^{4} - k^{4} {(1 + d)}^{4} + 2 k^{2} {(λ_{i} + k (1 + d))}^{2} {(1 + d)}^{2} > 0$

This inequality is true for $0 < d < 1$ and $k > 0$ ; then, according to Lemma 2, the proof is complete.

4. Simulation study

To verify the effectiveness of the proposed estimator relative to the other estimators under a set of factors, we run the following simulation study. The comparisons are made between the suggested estimator and the MRTE and AUTE. All comparisons are performed in Matlab. To begin, we use the following model to generate the dependent variable:

y_{i} = \sum_{j = 1}^{p} x_{i j} β_{j} + μ_{i}, i = 1, 2, \dots, n

where $μ \sim i i d N (0, σ^{2})$ .

Several factors are considered in this simulation. The level of correlation between the explanatory variables is an important factor for comparing the estimators. To control the degree of correlation between the explanatory variables, we follow McDonald and Galarneau (1975) and generate the explanatory variables by

x_{i j} = {(1 - ρ^{2})}^{1 / 2} ω_{i j} + ρ ω_{i, p + 1}, i = 1, 2, \dots, n, j = 1, 2, \dots, p

where the $ω_{i j}$ are pseudo-random numbers such that $ω \sim i i d N (0, 1)$ and ρ is specified so that the correlation between any two explanatory variables is $ρ^{2}$ . To show the effect of different levels of correlation between the explanatory variables, we choose $ρ = 0.90, 0.95, 0.99$ . Moreover, we use three sample sizes, $n = 50, 100$ and 200. We take the number of explanatory variables as $p = 5$ and 30, and the variance of the error term as $σ^{2} = 0.1$ and 20. The simulation is repeated 2000 times, and we use the simulated MSE (SMSE) and squared bias (SB), given in Eq. (15) and Eq. (16), as criteria to compare the estimators. In addition, to guard against overfitting, we use the GCV as a further criterion.

S M S E (\hat{β}) = \frac{\sum_{r = 1}^{2000} {({\hat{β}}_{r} - β)}^{T} ({\hat{β}}_{r} - β)}{2000}

(15)

SB (\hat{β}) = {(\frac{\sum_{r = 1}^{2000} {\hat{β}}_{r}}{2000} - β)}^{T} (\frac{\sum_{r = 1}^{2000} {\hat{β}}_{r}}{2000} - β)

(16)

where ${\hat{β}}_{r}$ is the estimate in the $r^{t h}$ replication of the simulation. The value of β is set to $β_{p = 5} = {[2, 2, 2, 2, 2]}^{T}$ and $β_{p = 30} = {[2, 2, \dots, 2]}^{T}$ . To select the values of k and d we use the following plan:

■ For AUMRTE, we use Eq. (11) to Eq. (14).
■ For MRTE, we follow Lukman et al. (2019) and use $d_{o p t} (MRTE) = (\frac{{\hat{σ}}^{2}}{k {\hat{γ}}_{i}^{2}}) - 1$ and the arithmetic mean of $k_{o p t} (MRTE)$ , $k_{o p t} (MRTE) = \frac{1}{p} \sum_{i = 1}^{p} \frac{{\hat{σ}}^{2}}{(1 + d) {\hat{γ}}_{i}^{2}}$ .
■ For AUTE, we follow Wu and Young (2013) and use $d_{o p t} (AUTE) = 1 - \frac{(λ_{i} + k) σ}{k} \sqrt{\frac{1}{σ^{2} + λ_{i} γ_{i}^{2}}}$ and the arithmetic mean of $k_{o p t} (AUTE)$ , $k_{o p t} (AUTE) = \frac{1}{p} \sum_{i = 1}^{p} \frac{σ λ_{i}}{(1 - d) \sqrt{σ^{2} + λ_{i} γ_{i}^{2}} - σ}$ .
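The data-generating step and the two summary criteria of the simulation design can be sketched as follows. This is a minimal illustration in Python, not the Matlab code used in the study: the function names are hypothetical, the estimator itself is left out, and the list of estimates passed to the criteria is a toy input. The design generator follows the McDonald and Galarneau (1975) scheme, and the criteria follow Eq. (15) and Eq. (16) with the number of replications taken from the length of the input.

```python
import random


def generate_design(n, p, rho, rng):
    """n x p design: x_ij = sqrt(1 - rho^2) * w_ij + rho * w_i,p+1."""
    rows = []
    for _ in range(n):
        w = [rng.gauss(0.0, 1.0) for _ in range(p + 1)]
        rows.append([(1.0 - rho ** 2) ** 0.5 * w[j] + rho * w[p]
                     for j in range(p)])
    return rows


def smse(estimates, beta):
    """Simulated MSE: average squared distance between estimates and beta."""
    total = sum(sum((b_hat - b) ** 2 for b_hat, b in zip(est, beta))
                for est in estimates)
    return total / len(estimates)


def squared_bias(estimates, beta):
    """Squared bias: squared distance between the mean estimate and beta."""
    reps = len(estimates)
    mean = [sum(est[j] for est in estimates) / reps for j in range(len(beta))]
    return sum((m - b) ** 2 for m, b in zip(mean, beta))


rng = random.Random(2022)            # fixed seed, chosen arbitrarily
X = generate_design(n=50, p=5, rho=0.95, rng=rng)
toy_estimates = [[1.9, 2.1], [2.2, 1.8], [2.0, 2.0]]  # stand-in for beta-hat_r
print(len(X), len(X[0]))
print(smse(toy_estimates, [2.0, 2.0]), squared_bias(toy_estimates, [2.0, 2.0]))
```

In a full replication loop, each design would be paired with simulated errors to form y, and the estimator under study would supply one row of `estimates` per replication.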

To rank the selected estimators, the criteria SMSE, SB and GCV were used. The simulation results for these criteria are summarized in Table 1, Table 2, Table 3. The results show that the AUMRTE has the smallest GCV in every setting and the smallest SMSE and SB in most settings. In contrast, the MRTE has the largest SMSE and SB in most settings. Additionally, at $p = 5$ the AUMRTE tends to perform best at $ρ = 0.95$ compared with the other correlation levels, whereas at $p = 30$ it tends to perform best at $ρ = 0.90$ . Moreover, for all estimators, the SMSE, SB and GCV tend to increase as the error variance and the number of explanatory variables increase, and tend to decrease as the sample size increases. In most cases the results for SMSE, SB and GCV agree. According to the GCV, increasing the sample size and decreasing the number of explanatory variables have a larger positive effect on the performance of the estimators. As the correlation becomes high, the efficiency of the estimators decreases, and it is worst at $ρ = 0.99$ .

Table 1.

The SMSE values of the estimators under different factors.

p	n	ρ	$σ^{2} = 0.1$			$σ^{2} = 20$
p	n	ρ	MRTE	AUTE	AUMRTE	MRTE	AUTE	AUMRTE
5	50	0.90	0.0212	0.0145	0.0088	0.0951	0.0882	0.0521
		0.95	0.0135	0.0124	0.0064	0.0665	0.0616	0.0328
		0.99	0.0193	0.0149	0.0072	0.0813	0.0917	0.0339

	100	0.90	0.0179	0.0118	0.0081	0.0733	0.0649	0.0491
		0.95	0.0179	0.0095	0.0053	0.0645	0.0592	0.0353
		0.99	0.0154	0.0115	0.0061	0.0684	0.0633	0.0399

	200	0.90	0.0081	0.0042	0.0039	0.0557	0.0412	0.0321
		0.95	0.0054	0.0088	0.0021	0.0293	0.0201	0.0119
		0.99	0.0028	0.0059	0.0017	0.0113	0.0092	0.0107

30	50	0.90	0.0508	0.0479	0.0286	0.1991	0.0941	0.0928
		0.95	0.0685	0.0622	0.0292	0.2012	0.0962	0.0999
		0.99	0.0717	0.0719	0.0305	0.2054	0.0998	0.1021

	100	0.90	0.0554	0.0451	0.0125	0.0912	0.0804	0.0664
		0.95	0.0691	0.0618	0.0141	0.1054	0.0921	0.0691
		0.99	0.0737	0.0792	0.064	0.1098	0.0978	0.0709

	200	0.90	0.0192	0.0149	0.0108	0.0628	0.0611	0.0492
		0.95	0.0204	0.0158	0.0111	0.0707	0.0744	0.0709
		0.99	0.0211	0.0164	0.0122	0.0777	0.0791	0.0717


Table 2.

The $SB (\hat{β})$ values of the estimators under different factors.

p	n	ρ	$σ^{2} = 0.1$			$σ^{2} = 20$
p	n	ρ	MRTE	AUTE	AUMRTE	MRTE	AUTE	AUMRTE
5	50	0.90	0.0091	0.0073	0.0049	0.0228	0.0199	0.0147
		0.95	0.0071	0.0088	0.0041	0.0199	0.0228	0.0318
		0.99	0.0082	0.0064	0.0047	0.0208	0.0182	0.0117

	100	0.90	0.0066	0.0052	0.0033	0.0097	0.0081	0.0048
		0.95	0.0039	0.0021	0.0028	0.0066	0.0072	0.0029
		0.99	0.0048	0.0048	0.0031	0.0077	0.0081	0.0047

	200	0.90	0.0061	0.0047	0.0025	0.0114	0.0097	0.0111
		0.95	0.0033	0.0011	0.0011	0.0098	0.0087	0.0098
		0.99	0.0044	0.0021	0.0021	0.0109	0.0094	0.0104

30	50	0.90	0.0108	0.0099	0.0082	0.0335	0.0333	0.0301
		0.95	0.0111	0.0102	0.0088	0.0369	0.0399	0.0304
		0.99	0.0124	0.0105	0.0099	0.0392	0.0431	0.0391

	100	0.90	0.0101	0.0094	0.0088	0.0111	0.0105	0.0101
		0.95	0.0118	0.0091	0.0074	0.0109	0.0103	0.0098
		0.99	0.0147	0.0092	0.0073	0.0108	0.0104	0.0097

	200	0.90	0.0092	0.0081	0.0066	0.0108	0.0102	0.0092
		0.95	0.0099	0.0084	0.0077	0.0111	0.0112	0.0098
		0.99	0.0121	0.0089	0.0081	0.0116	0.0131	0.0103


Table 3.

The $GCV (\hat{β})$ values of the estimators under different factors.

p	n	ρ	$σ^{2} = 0.1$			$σ^{2} = 20$
p	n	ρ	MRTE	AUTE	AUMRTE	MRTE	AUTE	AUMRTE
5	50	0.90	6.3652	5.0891	1.0012	6.6054	5.3065	1.0932
		0.95	6.5426	5.1325	1.0156	6.7742	5.5243	1.1318
		0.99	6.7082	5.4089	1.0827	6.9271	5.6284	1.3084

	100	0.90	4.0962	3.0514	0.9564	5.1802	2.9905	0.9654
		0.95	4.1564	3.0328	1.0015	5.3652	2.8638	0.9912
		0.99	4.2291	3.0632	1.0314	5.5021	2.4732	1.0214

	200	0.90	2.0185	1.9521	0.6965	4.6582	2.0658	0.9051
		0.95	2.0012	2.2518	0.8876	4.1068	2.6391	1.0325
		0.99	1.9834	2.3212	0.9765	4.0619	2.9653	1.0912

30	50	0.90	9.3255	8.9142	2.0879	9.9085	9.8256	2.7354
		0.95	9.7621	9.0698	2.1864	10.0304	9.9241	2.9941
		0.99	9.9025	9.3262	2.6152	10.2142	10.0251	3.2014

	100	0.90	7.6253	7.1982	1.9021	7.9056	7.3654	2.0214
		0.95	7.8641	7.2356	1.9214	8.0654	7.6021	2.3127
		0.99	7.9054	7.8321	1.9325	8.3712	8.0654	2.4085

	200	0.90	5.3242	4.8947	1.0245	5.9142	5.0214	1.0524
		0.95	5.5931	4.9921	1.0382	6.0325	5.1521	1.0934
		0.99	5.9342	5.0932	1.0634	6.1821	5.3921	1.1186


5. Application to the tourism sector data in Egypt

In this section, we confirm our results by an application to tourism data in Egypt (1995 to 2019). We take the data from the Central Agency for Public Mobilization and Statistics (https://www.capmas.gov.eg/). We use the GDP of tourism as the dependent variable (y) and the number of tourist nights (X₁), the number of tourists (X₂), total investments in tourism (X₃) and the number of workers in tourism (X₄) as explanatory variables. We use the SMSE, SB, R² and GCV as criteria. First, it is necessary to verify that multicollinearity exists between the explanatory variables and to determine its level. We found that the correlations between the explanatory variables lie between 0.684 and 0.893, which indicates the existence of multicollinearity. To determine the level of multicollinearity, we use the condition number $C N = \sqrt{λ_{m a x} / λ_{m i n}}$ , where $λ_{m a x}$ and $λ_{m i n}$ are the largest and smallest eigenvalues of the $(X^{T} X)$ matrix. We find that the eigenvalues of the $(X^{T} X)$ matrix are 3151, 435.65 and 2.94, so that $C N = 32.73$ . This result indicates a strong level of multicollinearity.
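The condition number check is a one-line computation. The following is a minimal sketch that assumes the reported eigenvalues are read as 3151, 435.65 and 2.94; the function name is hypothetical. Because the printed eigenvalues are rounded, the result may differ from the reported 32.73 in the last digit.

```python
import math


def condition_number(eigenvalues):
    """CN = sqrt(lambda_max / lambda_min) for the eigenvalues of X'X."""
    return math.sqrt(max(eigenvalues) / min(eigenvalues))


cn = condition_number([3151.0, 435.65, 2.94])
print(round(cn, 2))
```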

To choose the initial values of β, we followed a group of studies on the factors affecting tourism in Egypt, including Abd El-hamed (2021), Elawa (2014) and Dibas (2001). From these it was concluded that the approximate average effects of a unit increase in the number of tourist nights, the number of tourists, total investments in the tourism sector and the number of workers in the tourism sector are 0.521, 0.607, 0.318 and 0.154, respectively. We therefore take the initial value of β as $β = {[0.105, 0.521, 0.607, 0.318, 0.154]}^{T}$ . Table 4 summarizes the estimated models and the values of SMSE, SB, R² and GCV for the estimators.

Table 4.

The estimated coefficients and the values of SMSE, SB, R² and GCV for the estimators.

Coefficients	Case (1)			Case (2)			Case (3)
	$k_{o p t}, d_{o p t}$			$k_{G C V}, d_{o p t}$			$k_{o p t}, d_{G C V}$
	MRTE	AUTE	AUMRTE	MRTE	AUTE	AUMRTE	MRTE	AUTE	AUMRTE
β₀	0.315	0.357	0.328	0.220	0.251	0.204	0.224	0.238	0.241
β₁	0.782	0.711	0.702	0.547	0.557	0.528	0.664	0.608	0.611
β₂	0.682	0.642	0.637	0.604	0.614	0.635	0.589	0.599	0.582
β₃	0.405	0.425	0.435	0.339	0.358	0.445	0.507	0.514	0.504
β₄	0.152	0.195	0.138	0.204	0.228	0.237	0.243	0.251	0.248

MSE	0.054	0.068	0.0082	0.011	0.0024	0.00057	0.0092	0.00097	8.24 × 10⁻⁴

SB	0.047	0.051	0.0057	0.0087	0.00091	1.34 × 10⁻⁴	0.0075	1.94 × 10⁻⁴	1.32 × 10⁻⁴

GCV	12.814	12.325	11.815	12.581	12.183	11.708	11.732	11.5514	11.2913

R²	0.6647	0.6624	0.6831	0.6845	0.7154	0.7408	0.6925	0.7225	0.7584


The results in Table 4 indicate that the signs of the coefficients are identical for all estimators. In addition, the coefficient of determination $R^{2}$ of the AUMRTE reaches 0.6831, 0.7408 and 0.7584 in cases (1) to (3) of Table 4, which is greater than for the other estimators. These results indicate that the AUMRTE improves the prediction accuracy of the model. In addition, the AUMRTE has the lowest SMSE, SB and GCV in all cases, while the MRTE has the largest GCV in all cases and the largest SMSE and SB in cases (2) and (3).

Fig. 1 shows the ratio of the observed to the expected value against the years. For the AUMRTE, the ratios in all years are close to the 100% line, which indicates that the AUMRTE gives the model high predictive power. In the same way, the AUTE gives the model higher predictive power than the MRTE. To get a clearer view of the AUMRTE, we plot the GCV value versus d and k for the AUMRTE in Fig. 2. This figure shows three cases of optimal values of k and d that minimize the GCV: ( $k_{o p t}$ , $d_{o p t}$ ), ( $k_{G C V}$ , $d_{o p t}$ ) and ( $k_{o p t}$ , $d_{G C V}$ ). In Fig. 2, the GCV curves are convex and have a global minimum. In addition, for the AUMRTE, the lowest value of the GCV occurs in case (3), where it reaches 11.2913.

Fig. 1. Plots of the ratio of observation to expected value $(Y / \hat{Y})$ versus years.

Fig. 2. Plots of the $GCV (k, d)$ value versus d and k for AUMRTE.

6. Conclusion

This paper suggested a new almost unbiased modified ridge-type estimator (AUMRTE) to deal with multicollinearity. Theoretical comparisons based on the MSE were made between the AUMRTE and each of the MRTE and AUTE. These comparisons showed the superiority of the AUMRTE over both the MRTE and the AUTE. The simulation study used SMSE, SB and GCV as criteria for comparison, and its results supported the theoretical study. The simulation results also showed that the AUMRTE works well at high levels of correlation. For the real application, the estimator was applied to data on the GDP of the Egyptian tourism sector. The results of the application showed that the AUMRTE improves the prediction accuracy of the model. In addition, the results of the application were consistent with the simulation results.

Declarations

Author contribution statement

Tarek Mahmoud Omara: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper.

Funding statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Data availability statement

Data included in article/supp. material/referenced in article.

Declaration of interests statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.

References

Abd El-hamed R. Tourism investments and their contribution to the national income of Egypt. J. Fac. Polit. Econ. 2021;11
Akdeniz F., Kaciranlar S. An almost unbiased generalized Liu estimator and unbiased estimation of the bias and MSE. Commun. Stat. A. 1995;24(7):1789–1797.
Al-Taweel Y., Algamal A. Some almost unbiased ridge regression estimators for the zero-inflated negative binomial regression model. Period. Eng. Nat. Sci. 2020;8(1):248–255.
Al-Taweel Y., Algamal A. Some almost unbiased ridge regression estimators for the zero-inflated Poisson model. J. Appl. Eng. Math. 2022;12(1):235–246.
Algamal A. Almost unbiased ridge estimator in the count data regression models. Electron. J. Appl. Stat. Anal. 2021;14(1):44–57.
Alheety M.A., Kibria B.M.G. On the Liu and almost unbiased estimators in presence of multicollinearity with heteroscedastic or correlated error. Surv. Math. Appl. 2009;4:155–168.
Alheety M.I., Qasim M., Mansson K., Kibria B.M.G. Modified almost unbiased two-parameter estimator for the Poisson regression model with an application to accident data. SORT. 2021;45(2):121–142.
Arashi M., Roozbeh M., Hamzah A., Gasparini M. Ridge regression and its applications in genetic studies. PLoS ONE. 2021;16(4) doi: 10.1371/journal.pone.0245376.
Aslam M., Ahmed S. The modified Liu-ridge-type estimator: a new class of biased estimators to address multicollinearity. Commun. Stat., Simul. Comput. 2020
Dibas M. Egyptian Forum for Creativity and Development, Vol. 1. 2001. Tourist attraction: its nature, characteristics and factors affecting it.
Dorugade A. A modified two-parameter estimator in linear regression. Stat. Transl. 2014;15(1):23–36.
Elawa Z. Evaluating the impact of tourism activity on economic growth in Egypt. Arab Econ. J. 2014;21(65)
Farebrother R. Further results on the mean square error of ridge regression. J. R. Stat. Soc. 1976;38:248–250.
Hoerl A., Kennard R. Ridge regression: biased estimation for nonorthogonal problems. Technometrics. 1970;12(1):55–67.
Kadiyala K. A class of almost unbiased and efficient estimators of regression coefficients. Econ. Lett. 1984;16(3–4):293–296.
Kibria B.M.G. Performance of some new ridge regression estimators. Commun. Stat., Simul. Comput. 2003;32:419–435.
Liu K. A new class of biased estimate in linear regression. Commun. Stat., Theory Methods. 1993;22(2):393–402.
Liu K. Using Liu-type estimator to combat collinearity. Commun. Stat., Theory Methods. 2003;32(5):1009–1020.
Lukman A., Ayinde K., Binuomote S., Clement O. Modified ridge-type estimator to combat multicollinearity: application to chemical data. J. Chemom. 2019;33(5)
McDonald G., Galarneau D. A Monte Carlo evaluation of some ridge-type estimators. J. Am. Stat. Assoc. 1975;70:407–416.
Omara T. Modifying two-parameter ridge Liu estimator based on ridge estimation. Pak. J. Stat. Oper. Res. 2019;14(4):881–890.
Ozkale M.R., Kaciranlar S. The restricted and unrestricted two-parameter estimators. Commun. Stat., Theory Methods. 2007;36:2707–2725.
Roozbeh M., Arashi N., Hamzah A. Generalized cross-validation for simultaneous optimization of tuning parameters in ridge regression. Iran. J. Sci. Technol., Trans. A, Sci. 2020;44:473–485.
Sakallıoglu S., Kaçıranlar S. A new biased estimator based on ridge estimation. Stat. Pap. 2008;49:669–689.
Singh B., Chaubey P., Dwivedi D. An almost unbiased ridge estimator. Indian J. Stat. 1986;49(3):342–346.
Singh B., Chaubey P., Dwivedi D. An almost unbiased ridge estimator. Sankhya B. 1986;48(3):342–346.
Stein C. In: Proceedings of the Third Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1. Neyman J., editor. 1956. Inadmissibility of the usual estimator for the mean of a multivariate normal distribution; pp. 197–206.
Suhail M., Kibria B.M.G. Quantile-based robust ridge m-estimator for linear regression model in presence of multicollinearity and outliers. Commun. Stat., Simul. Comput. 2021;50(11):3194–3206.
Trenkler G., Toutenburg H. Mean square error matrix comparisons between biased estimators-an overview of recent results. Stat. Pap. 1990;31:165–179.
Xu W., Yang H. More on the bias and variance comparisons of the restricted almost unbiased estimators. Commun. Stat. A. 2011;40(22):4053–4064.
Yalian L., Hu Y. A new Liu-type estimator in linear regression model. Stat. Pap. 2012;53:427–437.
Yang H., Chang X. A new two-parameter estimator in linear regression. Commun. Stat., Theory Methods. 2010;39(6):923–934.
Zhai M., Liu G., Tao Q., Wang K., Chen Y., Pan G., Xin M. The Liu-type estimator based on parameter optimization and its application in SBAS deformation model inversion. IEEE Access. 2020;9:1076–1086.
