Skip to main content
Heliyon logoLink to Heliyon
. 2022 Sep 22;8(9):e10684. doi: 10.1016/j.heliyon.2022.e10684

Almost unbiased modified ridge-type estimator: An application to tourism sector data in Egypt

Tarek Mahmoud Omara a,b,
PMCID: PMC9526164  PMID: 36193526

Abstract

This paper introduces an almost unbiased modified ridge-type estimator (AUMRTE) to avoid problems arising from multicollinearity. This estimator has the important features of the two important shrinkage estimators, the modified ridge-type estimator (MRTE) and almost unbiased estimator (AUE). We investigated the theoretical excellence of the proposed estimator according to the mean square error (MSE). We found that it has the superiority than the (MRTE) and almost unbiased two-parameter estimator (AUTE). Moreover, we run the simulation study, which depended on the simulated MSE (SMSE), squared bias (SB) and generalized cross-validation (GCV) as criteria to compare the estimators. The simulation results showed that the proposed estimator has the superiority than the estimators under comparison at several factors and at the same time, it works well at the high level of correlation. In addition, we investigated the behavior of the present estimator applying the real data. Under this trend, we applied the estimator to the tourism sector data in Egypt, which the results were consistent with the theoretical results.

Keywords: Multicollinearity, Liu-type estimator, Ridge-type estimator, Almost unbiased modified ridge-type estimator


Multicollinearity; Liu-type estimator; Ridge-type estimator; Almost unbiased modified ridge-type estimator

1. Introduction

The multicollinearity appears when the explanatory variables have high correlated. This problem has bad effect on the ordinary least squares (OLS) estimator, since it makes (MSE) is high and the estimator becomes unstable Yalian and Hu (2012). The ridge estimator (RE) which introduced by Hoerl and Kennard (1970) is stable solution to address the multicollinearity. This estimator is biased and it's adding more information to overcome the ill condition for the (XTX) matrix. In the same context, Liu (1993) proposed the biased estimator that called Liu estimator (LE) that mingling the stein estimator that introduced by Stein (1956) and (RE). For high level of multicollinearity the matrix (XTX) safer from ill condition with large condition number. The small value of ridge parameter cannot reduce the condition number by enough to overcome the ill condition. So that, Liu (2003) introduced Liu_type estimator (LTE) that depended on two parameters make together to reduce the condition number and at the same time improve the fitting and properties of the estimator Zhai et al. (2020). Ozkale and Kaciranlar (2007) suggest two-parameter estimator (TE), which has many features, since it contains the OLS, RE, Liu estimators in private situations. In fact, the (RE) and (LE) depend on OLS estimator, so we can use them in the case of low level of multicollinearity. Otherwise, the (TE) and (LTE) depend on any estimator, whether OLS or any other estimator. So that, we can use it at any level of multicolinearity. Sakallıoglu and Kaçıranlar (2008) and Yang and Chang (2010) modify the (LTE) in which it depends on (RE). This biased estimator has superior efficient than (RE), (LT) and (LTE). Furthermore, Omara (2019) modify the (TE) in which it depends on (RE). In addition, Yang and Chang (2010) develop a two-parameter estimator. Aslam and Ahmed (2020) suggested the class of biased estimator modify two-parameter estimator. Dorugade (2014) introduced the new biased estimator called ridge-type estimator (RTE). Lukman et al. (2019) modified the ridge-type estimator and proposed the new biased estimator called modify ridge-type estimator (MRTE). At the same time, Lukman et al. (2019) modified the ridge-type estimator with new prior information.

On the other hand, many studies go to minimize the estimators bias and at the same time keeping the MSE small. The almost unbiased estimator is one of important biased estimator which used to reduce the biased for the shrinkage estimators. There is a continuous need to improve the almost unbiased estimator to overcome the multicolinearity. In this direction, the statistical literature goes to improve the almost unbiased estimator performance by replacing the OLS estimator with more efficient shrinkage estimators. In this context, Singh et al. (1986a, 1986b) suggested almost unbiased ridge estimator. Furthermore, Singh et al. (1986a, 1986b) presented almost unbiased Liu estimator. In addition, Akdeniz and Kaciranlar (1995) suggested almost unbiased generalized Liu estimator (AUGLE). More on almost unbiased estimators, we refer our readers to Alheety and Kibria (2009), Alheety et al. (2021), Algamal (2021), Al-Taweel and Algamal (2020) and Al-Taweel and Algamal (2022). This study suggests new almost unbiased shrinkage estimator which we called almost unbiased modified ridge-type estimator (AUMRTE). This estimator merges the almost unbiased Liu estimator (AULE) with (MRTE).

The planning of the study is as follows: In section 2, we illustrated the model (subsection 2.1), proposed the estimators (subsection 2.2), and the provided the biasing parameters (subsection 2.3). The performance of the estimators according to the MSE is illustrated in section 3. The simulation study and its results are given in section 4. A real data are analyzed in section 5. Finally, the concluding are put in section 6.

2. Methodology

2.1. The regression model and shrinkage estimators

Conceder the linear regression model:

Y=Xβ+μ (1)

Where Y is n×1 is vector of response regression, X is n×p explanatory matrix, β is p×1 vector of coefficients and μ is n×1 vector of error term which follow a normal distribution such that μN(0,σ2In). We simplify the linear regression model by use the canonical form. Since the matrix (XTX) is symmetric matrix, then Λ=ZTZ=HXTXHT=diag(λ1,λ2,,λp) where H and λi, i=1,2,,p are the eigenvector and eigenvalues of the (XTX) matrix. The canonical model for the model in Eq. (1) is:

Y=Zγ+μ (2)

where Z=XH and γ=HTβ.

For Eq. (2), (Hoerl and Kennard (1970) suggested RE to overcome the multicolinearity which formed as:

γˆRE(k)=(Λ+kI)1ZTY,k>0=HREγˆOLS

where k is ridge parameter, HRE=(Λ+kI)1Λ and γˆOLS is OLS estimator.

To overcome the high level of multicolinearity, (Liu, 1993) suggested LE which formed as:

γˆLE(d)=(Λ+I)1(ZTYdγˆOLS),0<d<1=HLEγˆOLS

where d is the biasing parameter and HLE=(Λ+I)1(ΛdI).

To get a new biased estimator has more ability to deal with multicolinearity, (Liu, 2003) introduced the LTE as:

γˆLTE(k,d)=(Λ+kI)1(ZTYdb),k>0,<d<

where b is the any estimator for β.

Ozkale and Kaciranlar (2007) suggested two-parameter estimator, which formed as:

γˆTE(K,d)=(Λ+kI)1(ZTYkdγˆOLS),0<d<1,=HTEγˆOLS

where HTE=(Λ+kI)1(ΛkdI).

Dorugade (2014) modified the RE and suggested RTE which defined as:

γˆRTE(k,d)=(Λ+kdI)1ZTY,k>0,0<d<1

Lukman et al. (2019) modified the RTE and suggested MRTE as:

γˆMRTE(k,d)=(Λ+k(1+d)I)1ZTY,k>0,0<d<1=HMRTEγˆOLS (3)

where HMRTE=(Λ+k(1+d)I)1Λ

E(γˆMRTE(k,d))=HMRTEγ
bias(γˆMRTE(k,d))=(HMRTEI)γ
Cov(γˆMRTE(k,d))=σ2HMRTEΛHMRTET (4)
MSE(γˆMRTE(k,d))=σ2HMRTEΛHMRTET+(HMRTEI)γγT(HMRTEI)T
MSE(γˆMRTE(k,d))=σ2i=1pλi(λi+k(1+d))2+i=1pγi2(λi+k(1+d)I)2

Kadiyala (1984) suggested class of almost unbiased estimator (AUE) which defined as:

γˆAUE=[I+(Λ+I)1(1d)]γˆOLS0<d<1 (5)

There is a continuous need to improve the almost unbiased estimator which used to overcome the multicolinearity. In this direction, the statistical literature goes to improve the performance of these estimators by replacing the OLS estimator with more efficient punishing estimators. Akdeniz and Kaciranlar (1995) suggested almost unbiased generalized Liu estimator (AUGLE) which defined as:

γˆAULE=[I+(ID)(Λ+I)1]γˆliu0<d<1=[I(ID)2(Λ+I)2]γˆOLS

where D=dig(d1,d2,,dp).

Wu and Young (2013) introduced almost unbiased two-parameter estimator (AUTE) as:

γˆAUTE=[I+k(Id)(Λ+kI)1]γˆliu0<d<1=[Ik2(Id)2(Λ+kI)2]γˆOLS=HAUTEγˆOLS

where HAUTE=[Ik2(Id)2(Λ+kI)2]

E(γˆAUTE(k,d))=HAUTEγ
bias(γˆAUTE(k,d))=(HAUTEI)γ
Cov(γˆAUTE(k,d))=σ2HAUTEΛ1HAULE (6)
MSE(γˆAUTE(k,d))=σ2HAUTEΛ1HAUTE+(HAUTEI)γγT(HAUTEI)T
MSE(γˆAUTE(k,d))=σ2i=1p1λi(1k2(1d)2(λi+kI)2)2+i=1p(k2(1d)2(λi+kI)2)2γi2

2.2. The proposed estimator:

In begging, we illustrated the (Xu and Yang, 2011) definition of almost unbiased estimator.

Definition

Let βˆ be a biased estimator of the β such that Bias(βˆ)=E(βˆ)β=Hβ and (βˆHβ)=β, then we defined the almost unbiased estimator as β˜=βˆHβ=(IH)βˆ.

Because of the preference of the (MRTE) estimator over the (RE), (LE) and (MRTE), we find that, this estimator is a candidate to merge with (AUE). We use Eq. (3) and Eq. (5) then defined this new biased estimator (AUMRTE) as:

γˆAUMRTE(k,d)=[I((Λ+k(1+d)I)1ΛI)]γˆMRTEk>0,0<d<1=[2I(Λ+k(1+d)I)1Λ](Λ+k(1+d)I)1ΛγˆOLS=[Ik2(1+d)2(Λ+k(1+d)I)2]γˆOLS=HAUMRTEγˆOLS

The expected, biased and MSE for the new estimator is formed as:

E(γˆAUMRTE(k,d))=HAUMRTEγ
bias(γˆAUMRTE(k,d))=(HAUMRTEI)γ
Cov(γˆAUMRTE(k,d))=σ2HAUMRTEΛ1HAUMRTE (7)
MSE(γˆAUMRTE(k,d))=σ2HAUMRTEΛ1HAUMRTE+(HAUMRTEI)γγT(HAUMRTEI)T=σ2i=1p1λi(1k2(1+d)2(λi+k(1+d))2)2+i=1p(k2(1+d)2(λi+k(1+d))2)2γi2 (8)

The AUMRTE estimator has some special case that we can follow as:

  • a.

    If k=0 and d=0 then γˆAUMRTE(0,0)=γˆOLS.

  • b.

    If d=1 and k=k2 then γˆAUMRTE(k2,1)=γˆAURE(k).

  • c.

    If d+1=d, then γˆAUMRTE(k,d)=γˆAUTE(k).

2.3. Choosing the shrinkage parameters (k,d)

For the proposed estimator, we determine the chosen method for the shrinkage parameters k,d. Let e=1+d and MSE(γˆAUMRTE(k,d))=σ2i=1p1λi(1vi2)2+i=1pvi4γi2 where vi=k(1+d)/(λi+k(1+d))=ke/(λi+ke). To get the optimal value of d, let k fixed and take the derivative of MSE in Eq. (8) with respect to d, then equalize the result to zero.

MSE(γˆAUMRTE(k,d))d=MSE(γˆAUMRTE(k,d))vivid=0

Since vi/d=kλi/(λi+k(1+d))20, then

MSE(γˆAUMRTE(k,d))vi=4σ2i=1pvi(1vi2)λi+4i=1pvi3γi2=0σ2i=1p(1vi2)+i=1pλivi2γi2=0σ2+σ2vi2+λivi2γi2=0 (9)

Since vi=ke/(λi+ke), then for Eq. (9), we find

eopt=λiσ2σ2+λiγi2k(1σ2σ2+λiγi2)

and

dopt=λiσ2σ2+λiγi2k(1σ2σ2+λiγi2)k(1σ2σ2+λiγi2) (10)

For workable applied purpose, σ2 and γi2 are replaced with σˆ2 and γˆi2. Then the Eq. (10) become

dopt=λiσˆ2σˆ2+λiγˆi2k(1σˆ2σˆ2+λiγˆi2)k(1σˆ2σˆ2+λiγˆi2) (11)

To get the optimal value of k, let d fixed and take the derivative of MSE in Eq. (8) with respect to the parameter k, then equalize the result to zero.

MSE(γˆAUMRTE(k,d))k=MSE(γˆAUMRTE(k,d))vivik=0

Since vi/k=λi(1+d)/(λi+k(1+d))20, then for Eq. (9), we get

kopt=λiσ2σ2+λiγi2(d+1)(1σ2σ2+λiγi2) (12)

If we follow the Kibria (2003) which proposed the arithmetic means of k value, then we can propose the arithmetic means of kopt as

kopt=1pi=1pλiσ2σ2+λiγi2(d+1)(1σ2σ2+λiγi2) (13)

For workable applied purpose, σ2 and γi2 are replaced with σˆ2 and γˆi2. Then the Eq. (13) becomes

kopt=1pi=1pλiσˆ2σˆ2+λiγˆi2(d+1)(1σˆ2σˆ2+λiγˆi2) (14)

One of important methods that used to obtain the optimal shrinkage parameters k and d, is a generalized cross-validation (GCV). This method makes a equilibrium between the estimator's prediction accuracy and the bias which caused by the shrinkage the estimator (Arashi et al., 2021). The GCV received attention in the statistical literature, (Omara, 2019) use GCV to choose the shrinkage parameters for two-parameter ridge estimator. Moreover, (Roozbeh et al., 2020) use GCV as a criterion to compare the estimators for ridge rank regression. In fact, the GCV has good properties and at the same time it has a simplicity. We can choose the parameters for the propose estimator kGCV and dGCV by minimizing the following unobservable risk function which defined as

RAUMRTE(d,k)=1n(yyˆ)T(yyˆ)

where yˆ=ZγˆAUMRTE(d,k)=L(d,k)Y and L(d,k)=Z[Ik2(1+d)2(Λ+k(1+d)I)2](ZTZ)1ZT which can be defined as almost unbiased modified ridge-type hat matrix of y. Then the GCV(k.d) defined as

GCV(k.d)=yZγˆAUMRTE(k.d)2(In1tr(L(k,d)))2=(IL(k,d))y2(In1tr(L(k,d)))2

There are various methods exist to estimate shrinkage parameters. To mention a few, see Suhail and Kibria (2021) and Lukman et al. (2019) among others.

3. The MSE comparison

In this section, we make a comparison between AUTE and MRTE according to the (MSE). To check the superiority of the proposed estimator, we need the following lemma:

Lemma 1

Let C be a positive definite (p.d) matrix and c be a vector, then CccT0 , iff cTC1c1 . See Farebrother (1976) .

Lemma 2

Let βˆi=Hiy , i=1,2 be two estimators of β. Assume that =Cov(βˆ1)Cov(βˆ2) be a p.d , then MSE(βˆ1)MSE(βˆ2)0 , iff (H2I)T(+(H1I)T(H1I)T)1(H2I)1 . See Trenkler and Toutenburg, 1990 .

Theorem 1

Consider γˆAUTE(k,d) and γˆAUMRTE(k,d) are two estimators of β. Assume that 1=Cov(γˆAUTE(k,d))Cov(γˆAUMRTE(k,d))be a(p.d) , then MSE(γˆAUTE(k,d))MSE(γˆAUMRTE(k,d))0 , iff (HAUMRTEIn)(1+(HAUTEI)T(HAUTEI)T)1(HAUMRTEIn)T1 .

Proof

From Eq. (6) and (7), we find that:

1=Cov(γˆAUTE(k,d))Cov(γˆAUMRTE(k,d))=σ2[i=1p1λi[(1k2(1d)2(λi+k)2)2(1k2(1+d)2(λi+k(1+d))2)2]]

This inequality is (p.d) if and only if

k2(1d)2(λi+k)2k2(1+d)2(λi+k(1+d))2<0
(1d)2(λi+k(1+d))2(1+d)2(λi+k)2(λi+k)2(λi+k(1+d))2<0

This inequality is true if and only if

(1d)2(λi+k(1+d))2(1+d)2(λi+k)2<0

This inequality is true for k>0 and 0<d<1.

By using Lemma 2, the proof completed.

Theorem 2

Consider γˆMRTE(k,d)andγˆAUMRTE(k,d) are two estimators of β. Assume that 2=Cov(γˆMRTE(k,d))Cov(γˆAUMRTE(k,d))be a(p.d) , then MSE(γˆMRTE(k,d))MSE(γˆAUMRTE(k,d))0 , iff

(HAUMRTEIn)(2+(HMRTEI)T(HMRTEI)T)1×(HAUMRTEIn)T1.

Proof

From Eq. (4) and (7), we find that:

2=Cov(γˆMRTE(k,d))Cov(γˆAUMRTE(k,d))=σ2i=1p[(λi(λi+k(1+d))2)21λi(1k2(1+d)2(λi+k(1+d))2)2]=σ2i=1p[(λi(λi+k(1+d))2)21λi((λi+k(1+d))2k2(1+d)2(λi+k(1+d))2)2]=σ2i=1p[λi3(λi+k(1+d))4((λi+k(1+d))2k2(1+d)2)2λi(λi+k(1+d))4]

This inequality is true if and only if

λi3(λi+k(1+d))4((λi+k(1+d))2k2(1+d)2)2>0
λi3(λi+k(1+d))4(λi+k(1+d))4+2k2(λi+k(1+d))2(1+d)2k4(1+d)4>0
(λi31)(λi+k(1+d))4k4(1+d)4+2k2(λi+k(1+d))2(1+d)2>0

This inequality is true for 0<d<1 and k>0, then according to Lemma 2, the proof completed.

4. Simulation study

To verify the effectiveness of the proposed estimator comparing with the other estimators according to a set of factors, we run the following simulation study. The comparisons are made between the suggested estimator and the MRTE and AUTE. All these comparisons are performed by the Matlab Programming. On the beginning, we use the following model to determine dependent variable data:

yi=j=1pxijβi+μi,i=1,2,,n

where μiidN(0,σ2).

There are many factors are candidate in this simulation. The levels of correlation between the explanatory variables are important factor for comparing the variables. To ensure that there are degrees of this correlation between the explanatory variables, we follow McDonald and Galarneau (1975), and generates the explanatory variables by

xij=(1ρ2)1/2ωij+ρωi,p+1,i=1,2,,n,j=1,2,,p

where ωij, s are pseudo-random numbers such that ωiidN(0,1) and ρ is the correlation between the explanatory variables. To show the effect of the levels of correlation between the explanatory variables at the different levels, we choose this correlation as ρ=0.90,0.95,0.99. Moreover, we use the three levels of sample size n=50,100 and 200. We take the number of explanatory p=5 and 30. We take the variance of the error term as σ2=0.1 and 20. In addition, the simulation is repeated 2000 times and we use the simulated MSE (SMSE) and squared bias (SB) as a criterion to compare between the estimators, such that they are formed at Eq. (15) and Eq. (16). In addition, in order to avoid the over fitting, we use the (GCV) as a reliable criterion.

SMSE(βˆ)=r=12000(βˆrβ)T(βˆrβ)2000 (15)
SB(βˆ)=(r=12000βˆrT2000β)T(r=12000βˆrT2000β) (16)

where βˆr is the estimator in rth iteration of the simulation. The initial value for β is selected as βp=5=[2,2,2,2,2]T and βp=30=[2,2,,2]T. To select the value of k and d we use the follow the plan:

  • For AUMRTE, we use the Eq. (11) to Eq. (14).

  • For MRTE, we follow Lukman et al. (2019) and use dopt(MRTE)=(σˆ2kγi2)1 and the arithmetic means of kopt(MRTE) as
    kopt(MRTE)=1pi=1pσˆ2(1+d)γˆi2
  • For AUTE, we follow Wu and Young (2013) and use dopt(AUTE)=1(λi+k)σk1σ2λiγi2 and the arithmetic means of kopt(AUTE) as
    kopt(AUTE)=1pi=1pσλi(1+d)σ2+λiγi2σ

To determine the priority of the selected estimators, the criteria SMSE, SB and GCV were used. We summarized the simulation results according to these criteria in Table 1, Table 2, Table 3. The results show that, the AUMRTE has the smallest SMSE, SB and GCV at all the factors. In contrast, the MRTE got the largest SMSE and SB at the all factors. Additionally, at p=5, AUMRTE performs well at ρ=0.95 compared to the other correlation levels but at p=30, it performs well at ρ=0.90 compared to the other correlation levels. Moreover, at all estimators, the SMSE, SB and GCV are tended to increase with increasing of the variance of the error and the number of explanatory variables and at the same time it tends to decrease with increasing of sample size. It is clear that, at the most case there is agreement between the results of SMSE, SB and GCV. According to GCV, we find that there is a greater positive effect of increasing the size of sample and decreasing the number of explanatory variable on the work of estimators. When the correlation be trend to high, the efficiency of the estimators decreases, and it is worse at ρ=0.99.

Table 1.

The SMSE value for the estimators at deferent factors.

p n ρ σ2=0.1
σ2=20
MRTE AUTE AUMRTE MRTE AUTE AUMRTE
5 50 0.90 0.0212 0.0145 0.0088 0.0951 0.0882 0.0521
0.95 0.0135 0.0124 0.0064 0.0665 0.0616 0.0328
0.99 0.0193 0.0149 0.0072 0.0813 0.0917 0.0339



100 0.90 0.0179 0.0118 0.0081 0.0733 0.0649 0.0491
0.95 0.0179 0.0095 0.0053 0.0645 0.0592 0.0353
0.99 0.0154 0.0115 0.0061 0.0684 0.0633 0.0399



200 0.90 0.0081 0.0042 0.0039 0.0557 0.0412 0.0321
0.95 0.0054 0.0088 0.0021 0.0293 0.0201 0.0119
0.99 0.0028 0.0059 0.0017 0.0113 0.0092 0.0107



30 50 0.90 0.0508 0.0479 0.0286 0.1991 0.0941 0.0928
0.95 0.0685 0.0622 0.0292 0.2012 0.0962 0.0999
0.99 0.0717 0.0719 0.0305 0.2054 0.0998 0.1021



100 0.90 0.0554 0.0451 0.0125 0.0912 0.0804 0.0664
0.95 0.0691 0.0618 0.0141 0.1054 0.0921 0.0691
0.99 0.0737 0.0792 0.064 0.1098 0.0978 0.0709



200 0.90 0.0192 0.0149 0.0108 0.0628 0.0611 0.0492
0.95 0.0204 0.0158 0.0111 0.0707 0.0744 0.0709
0.99 0.0211 0.0164 0.0122 0.0777 0.0791 0.0717

Table 2.

The SB(βˆ) value for the estimators at deferent factors.

p n ρ σ2=0.1
σ2=20
MRTE AUTE AUMRTE MRTE AUTE AUMRTE
5 50 0.90 0.0091 0.0073 0.0049 0.0228 0.0199 0.0147
0.95 0.0071 0.0088 0.0041 0.0199 0.0228 0.0318
0.99 0.0082 0.0064 0.0047 0.0208 0.0182 0.0117



100 0.90 0.0066 0.0052 0.0033 0.0097 0.0081 0.0048
095 0.0039 0.0021 0.0028 0.0066 0.0072 0.0029
0.99 0.0048 0.0048 0.0031 0.0077 0.0081 0.0047



200 0.90 0.0061 0.0047 0.0025 0.0114 0.0097 0.0111
095 0.0033 0.0011 0.0011 0.0098 0.0087 0.0098
0.99 0.0044 0.0021 0.0021 0.0109 0.0094 0.0104



30 50 0.90 0.0108 0.0099 0.0082 0.0335 0.0333 0.0301
0.95 0.0111 0.0102 0.0088 0.0369 0.0399 0.0304
0.99 0.0124 0.0105 0.0099 0.0392 0.0431 0.0391



100 0.90 0.0101 0.0094 0.0088 0.0111 0.0105 0.0101
0.95 0.0118 0.0091 0.0074 0.0109 0.0103 0.0098
0.99 0.0147 0.0092 0.0073 0.0108 0.0104 0.0097



200 0.90 0.0092 0.0081 0.0066 0.0108 0.0102 0.0092
0.95 0.0099 0.0084 0.0077 0.0111 0.0112 0.0098
0.99 0.0121 0.0089 0.0081 0.0116 0.0131 0.0103

Table 3.

The GCV(βˆ) value for the estimators at deferent factors.

p n ρ σ2=0.1
σ2=20
MRTE AUTE AUMRTE MRTE AUTE AUMRTE
5 50 0.90 6.3652 5.0891 1.0012 6.6054 5.3065 1.0932
0.95 6.5426 5.1325 1.0156 6.7742 5.5243 1.1318
0.99 6.7082 5.4089 1.0827 6.9271 5.6284 1.3084



100 0.90 4.0962 3.0514 0.9564 5.1802 2.9905 0.9654
095 4.1564 3.0328 1.0015 5.3652 2.8638 0.9912
0.99 4.2291 3.0632 1.0314 5.5021 2.4732 1.0214



200 0.90 2.0185 1.9521 0.6965 4.6582 2.0658 0.9051
095 2.0012 2.2518 0.8876 4.1068 2.6391 1.0325
0.99 1.9834 2.3212 0.9765 4.0619 2.9653 1.0912



30 50 0.90 9.3255 8.9142 2.0879 9.9085 9.8256 2.7354
0.95 9.7621 9.0698 2.1864 10.0304 9.9241 2.9941
0.99 9.9025 9.3262 2.6152 10.2142 10.0251 3.2014



100 0.90 7.6253 7.1982 1.9021 7.9056 7.3654 2.0214
0.95 7.8641 7.2356 1.9214 8.0654 7.6021 2.3127
0.99 7.9054 7.8321 1.9325 8.3712 8.0654 2.4085



200 0.90 5.3242 4.8947 1.0245 5.9142 5.0214 1.0524
0.95 5.5931 4.9921 1.0382 6.0325 5.1521 1.0934
0.99 5.9342 5.0932 1.0634 6.1821 5.3921 1.1186

5. Application to the tourism sector data in Egypt

In this section, we confirm our results by application to the tourism data in Egypt (1995: 2019). We take the data form the Central Agency for Public Mobilization and Statistics (https://www.capmas.gov.eg/). We use the GDP of the tourism as dependent variable (y) and number of tourist nights (X1), number of tourists (X2) and total investments in the tourism (X3) and number of workers in the tourism (X4) as explanatory variables. We use the (SMSE), (SB), (R2) and GCV as a criteria's. Firstly, it is necessary to verify the existence of a multicollinearity between the independent variables and what is the level of it is. In this direction, we found that the correlation between the explanatory variables is between 0.684 and 0.893, which indicates the existence of a multicollinearity. To determine the level of multicollinearity, we use the condition number CN=λmax/λmin, where λmaxandλmin are the largest and lowest eigenvalues for the (XTX) matrix. We find that, the eigenvalues of the (XTX) matrix equal 3151,435.65 and 2.94, then the CN=32.73. This result means that there is strong level of multicollinearity.

For choosing the initial values of β, we follow the group of studies and articles related to the factors affecting tourism in Egypt were followed, including (Abd El-hamed (2021), Elawa (2014), Dibas (2001)), through which it was concluded that the approximate average of the effect of a unit increase of both the number of tourist nights, the number of tourist, total investments in the tourism sector and the number of workers in the tourism sector are 0.521, 0.607, 0.318 and 0.154. So we can suggest the initial value of β as β=[0.105,0.521,0.607,0.318,0.154]T. We summarize the results of estimate the model and the value of (SMSE), (SB), (R2) and GCV for the estimators at Table 4.

Table 4.

The coefficients of estimators and the value of (SMSE), (SB), R2 and GCV.

Coefficients Case (1)
Case (2)
Case (3)
kopt,dopt
kGCV,dopt
kopt,dGCV
MRTE AUTE AUMRTE MRTE AUTE AUMRTE MRTE AUTE AUMRTE
β0 0.315 0.357 0.328 0.220 0.251 0.204 0.224 0.238 0.241
β1 0.782 0.711 0.702 0.547 0.557 0.528 0.664 0.608 0.611
β2 0.682 0.642 0.637 0.604 0.614 0.635 0.589 0.599 0.582
β3 0.405 0.425 0.435 0.339 0.358 0.445 0.507 0.514 0.504
β4 0.152 0.195 0.138 0.204 0.228 0.237 0.243 0.251 0.248



MSE 0.054 0.068 0.0082 0.011 0.0024 0.00057 0.0092 0.00097 8.24 × 10−4



SB 0.047 0.051 0.0057 0.0087 0.00091 1.34 × 10−4 0.0075 1.94 × 10−4 1.32 × 10−4



GCV 12.814 12.325 11.815 12.581 12.183 11.708 11.732 11.5514 11.2913



R2 0.6647 0.6624 0.6831 0.6845 0.7154 0.7408 0.6925 0.7225 0.7584

The results in Table 4 indicate the sign of the coefficients are identical for all estimators. In addition, the results of coefficient of determination R2 of AUMRTE reach to 0.6831, 0.7408 and 0.7584 at cases (1-3) for Table 4 which is greater than the other estimators. These results indicate that the AUMTE improve the prediction accuracy for the model. In addition, for all cases, the AUMRTE and MRTE have the lowest and largest value of SMSE, SB and GCV.

Fig. 1 shows the ratio of the observation to the expected value against the years. At AUMRTE, the ratios at all years are close to the line of 100%, which indicate that the AUMRTE gives the model high predictive power. In the same way the AUTE give the model high predictive power than the MRTE. For get the good vision about the AUMRTE, we plot the GCV value ve. d and k for AUMRTE and presented in Fig. 2. This figure, shows three cases of optimal values of k and d that minimize the GCV's, these cases are (kopt, dopt-kGCV, dopt-kopt, dGCV). For Fig. 2, the GCV's are convex functions and at the same time have a global minimum. In addition, it shown that, for AUMTE, the lowest value of GCV occurs in case (b), which it reached to 11.2913.

Figure 1.

Figure 1

Plots of the ratio of observation to expected value (Y/Yˆ) versus years.

Figure 2.

Figure 2

Plots of the GCV(k,d) value versus d and k for AUMRTE.

6. Conclusion

This paper suggested a new almost unbiased ridge-type estimator (AUMRTE) to deal with multicollinearity. Theoretical comparisons were made between the AUMRTE and each of MRTE and AUTE based on (MSE). These comparisons showed that the superiority of the AUMRTE over both MRTE and AUTE. The simulation study used SMSE, SB and GCV as criteria for comparison, and its results supported the theoretical study. The simulation study results also showed that the AUMRTE is work well at the high level of correlation. For the real application, it was applied to the data of the GDP of the Egyptian tourism sector. The results of the application showed that the AUMTE improve the prediction accuracy for the model. In addition, the results of applied were confirmed with the simulation study results.

Declarations

Author contribution statement

Tarek Mahmoud Omara: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper.

Funding statement

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Data availability statement

Data included in article/supp. material/referenced in article.

Declaration of interests statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.

References

  1. Abd El-hamed R. Tourism investments and their contribution to the national income of Egypt. J. Fac. Polit. Econ. 2021;11 [Google Scholar]
  2. Akdeniz F., Kaciranlar S. An almost unbiased generalized Liu estimator and unbiased estimation of the bias and MSE. Commun. Stat. A. 1995;24(7):1789–1797. [Google Scholar]
  3. Al-Taweel Y., Algamal A. Some almost unbiased ridge regression estimators for the zero-inflated negative binomial regression model. Period. Eng. Nat. Sci. 2020;8(1):248–255. [Google Scholar]
  4. Al-Taweel Y., Algamal A. Some almost unbiased ridge regression estimators for the zero-inflated Poisson model. J. Appl. Eng. Math. 2022;12(1):235–246. [Google Scholar]
  5. Algamal A. Almost unbiased ridge estimator in the count data regression models. Electron. J. Appl. Stat. Anal. 2021;14(1):44–57. [Google Scholar]
  6. Alheety M.A., Kibria B.M.G. On the Liu and almost unbiased estimators in presence of multicollinearity with heteroscedastic or correlated error. Surv. Math. Appl. 2009;4:155–168. [Google Scholar]
  7. Alheety M.I., Qasim M., Mansson K., Kibria B.M.G. Modified almost unbiased two-parameter estimator for the Poisson regression model with an application to accident data. SORT. 2021;45(2):121–142. [Google Scholar]
  8. Arashi M., Roozbeh M., Hamzah A., Gasparini M. Ridge regression and its applications in genetic studies. PLoS ONE. 2021;16(4) doi: 10.1371/journal.pone.0245376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Aslam M., Ahmed S. The modified Liu-ridge-type estimator: a new class of biased estimators to address multicollinearity. Commun. Stat., Simul. Comput. 2020 [Google Scholar]
  10. Dibas M. Egyptian Forum for Creativity and Development, Vol. 1. 2001. Tourist attraction: its nature, characteristics and factors affecting it. [Google Scholar]
  11. Dorugade A. A modified two-parameter estimator in linear regression. Stat. Transl. 2014;15(1):23–36. [Google Scholar]
  12. Elawa Z. Evaluating the impact of tourism activity on economic growth in Egypt. Arab Econ. J. 2014;21(65) [Google Scholar]
  13. Farebrother R. Further results on the mean square error of ridge regression. J. R. Stat. Soc. 1976;38:248–250. [Google Scholar]
  14. Hoerl A., Kennard R. Ridge regression: biased estimation for nonorthogonal problems. Technometrics. 1970;12(1):55–67. [Google Scholar]
  15. Kadiyala K. A class of almost unbiased and efficient estimators of regression coefficients. Econ. Lett. 1984;16(3–4):293–296. [Google Scholar]
  16. Kibria B.M.G. Performance of some new ridge regression estimators. Commun. Stat., Simul. Comput. 2003;32:419–435. [Google Scholar]
  17. Liu K. A new class of biased estimate in linear regression. Commun. Stat., Theory Methods. 1993;22(2):393–402. [Google Scholar]
  18. Liu K. Using Liu-type estimator to combat collinearity. Commun. Stat., Theory Methods. 2003;32(5):1009–1020. [Google Scholar]
  19. Lukman A., Ayinde K., Binuomote S., Clement O. Modified ridge-type estimator to combat multicollinearity: application to chemical data. J. Chemom. 2019;33(5) [Google Scholar]
  20. McDonald G., Galarneau D. A Monte Carlo evaluation of some ridge-type estimators. J. Am. Stat. Assoc. 1975;70:407–416. [Google Scholar]
  21. Omara T. Modifying two-parameter ridge Liu estimator based on ridge estimation. Pak. J. Stat. Oper. Res. 2019;14(4):881–890. [Google Scholar]
  22. Ozkale M.R., Kaciranlar S. The restricted and unrestricted two-parameter estimators. Commun. Stat., Theory Methods. 2007;36:2707–2725. [Google Scholar]
  23. Roozbeh M., Arashi N., Hamzah A. Generalized cross-validation for simultaneous optimization of tuning parameters in ridge regression. Iran. J. Sci. Technol., Trans. A, Sci. 2020;44:473–485. [Google Scholar]
  24. Sakallıoglu S., Kaçıranlar S. A new biased estimator based on ridge estimation. Stat. Pap. 2008;49:669–689. [Google Scholar]
  25. Singh B., Chaubey P., Dwivedi D. An almost unbiased ridge estimator. Indian J. Stat. 1986;49(3):342–346. [Google Scholar]
  26. Singh B., Chaubey P., Dwivedi D. An almost unbiased ridge estimator. Sankhya B. 1986;48(3):342–346. [Google Scholar]
  27. Stein C. In: Proceedings of the Third Berkley Symposium on Mathematical and Statistics Probability, Vol. 1. Neyman J., editor. 1956. Inadmissibility of the usual estimator for mean of multivariate normal distribution; pp. 197–206. [Google Scholar]
  28. Suhail M., Kibria B.M.G. Quantile-based robust ridge m-estimator for linear regression model in presence of multicollinearity and outliers. Commun. Stat., Simul. Comput. 2021;50(11):3194–3206. [Google Scholar]
  29. Trenkler G., Toutenburg H. Mean square error matrix comparisons between biased estimators-an overview of recent results. Stat. Pap. 1990;31:165–179. [Google Scholar]
  30. Xu W., Yang H. More on the bias and variance comparisons of the restricted almost unbiased estimators. Commun. Stat. A. 2011;40(22):4053–4064. [Google Scholar]
  31. Yalian L., Hu Y. A new Liu-type estimator in linear regression model. Stat. Pap. 2012;53:427–437. [Google Scholar]
  32. Yang H., Chang X. A new two-parameter estimator in linear regression. Commun. Stat., Theory Methods. 2010;39(6):923–934. [Google Scholar]
  33. Zhai M., Liu G., Tao Q., Wang K., Chen Y., Pan G., Xin M. The Liu-type estimator based on parameter optimization and its application in SBAS deformation model inversion. IEEE Access. 2020;9:1076–1086. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data included in article/supp. material/referenced in article.


Articles from Heliyon are provided here courtesy of Elsevier

RESOURCES