Shrinkage estimation of non-negative mean vector with unknown covariance under balance loss

Hamid Karamikabir; Mahmoud Afshari; Mohammad Arashi

doi:10.1186/s13660-018-1919-0

. 2018 Dec 3;2018(1):331. doi: 10.1186/s13660-018-1919-0

Shrinkage estimation of non-negative mean vector with unknown covariance under balance loss

Hamid Karamikabir ¹, Mahmoud Afshari ^1,^✉, Mohammad Arashi ²

PMCID: PMC6280813 PMID: 30839820

Abstract

Parameter estimation in multivariate analysis is important, particularly when parameter space is restricted. Among different methods, the shrinkage estimation is of interest. In this article we consider the problem of estimating the p-dimensional mean vector in spherically symmetric models. A dominant class of Baranchik-type shrinkage estimators is developed that outperforms the natural estimator under the balance loss function, when the mean vector is restricted to lie in a non-negative hyperplane. In our study, the components of the diagonal covariance matrix are assumed to be unknown. The performance evaluation of the proposed class of estimators is checked through a simulation study along with a real data analysis.

Keywords: Baranchik-type estimator, Balance loss function, Restricted estimator, Shrinkage estimator, Spherical distribution

Introduction

Shrinkage estimation is a method to improve a raw estimator in some sense, by combining it with other information. Although the shrinkage estimator is biased, it is well known that it has minimum quadratic risk compared to natural estimators (mostly the maximum likelihood estimator).

Mean vector (location) parameter estimation is an important problem in the context of shrinkage estimation, specially when some components of location parameter are restricted to be situated in a specific space. In this respect, Fourdrinier and Ouassou [6] initiated the restricted estimation problem of the mean for the general spherical model with known covariance and Fourdrinier et al. [7] studied the restricted estimation in the latter specified general spherical model, under three different constraints; see also Fourdrinier et al. [9]. Fourdrinier and Marchand [5] studied constraints with the form $\sum_{i = 1}^{p} \frac{{(θ_{i} - τ_{i})}^{2}}{σ^{2}} \leq m^{2}$ , with known $τ_{1}, \dots, τ_{p}$ , $σ^{2}$ , and m when $X_{i} \sim N (θ_{i}, σ^{2})$ , $i = 1, \dots, p$ on spheres of radius α centered at $(τ_{1}, \dots, τ_{p})$ . Kortbi and Marchand [12] exhibited a truncated linear estimator under the constraint $∥ θ ∥ \leq m$ , in the multivariate normal model. Marchand and Strawderman [16] developed a unified approach for minimax estimation for a restricted parameter space. Kubokawa et al. [13] considered minimax shrinkage estimation of a location vector of a spherically symmetric distribution under a concave squared error loss. Also Chang and Strawderman [3] studied a shrinkage estimation of p positive normal means under sum of squared errors loss. Recently, Hoque et al. [10] investigated the performance of the shrinkage estimator of the parameters of a simple linear regression model under the asymmetric loss (LINEX loss criterion). For more details on this topic, we refer to Marchand and Strawderman [15], Silvapulle and Sen [18] and van Eeden [20], among others.

Here, we develop the approach of Fourdrinier et al. [7], in which they estimated location parameter-vector when some components are non-negative, for unknown covariance matrix under balance loss function. We specifically address the Baranchik-type estimators for our purpose.

The paper is outlined as follows: In Sect. 2, some preliminary results are addressed. Section 3 includes the main result, where we give the conditions under which the proposed class of shrinkage estimators dominates the natural estimator under balance loss function, while the numerical performance analysis is investigated by a simulation study in Sect. 4. In Sect. 5, we use the air pollution dataset of USA cities to further demonstrate the superior performance of the shrinkage estimation. The paper is concluded in Sect. 6.

Preliminaries

In this section, we consider the spherical distribution as the parent model, and introduce the natural and the Baranchik-type shrinkage estimator for estimation of restricted parameter space. A $p \times 1$ random vector X is said to have a spherically symmetric distribution (or simply spherical distribution) if X and ΛX have the same distribution for all $p \times p$ orthogonal matrices Λ. Important members are the multivariate normal ( $N_{p} (0, σ^{2} I_{p})$ ), the “ε-contaminated” normal, and multivariate t distributions. For evaluating the performance of the estimators, we need to set a measure. In this paper, we use the balance loss function.

Definition 2.1

Suppose that X is a random vector having a spherical distribution with unknown mean vector parameter θ and scalar variational component $σ^{2}$ . the balance error loss function, BEL( $δ_{0}$ ) is defined as follows:

L_{ω, δ_{0}} (θ, δ) = ω \frac{{∥ δ - δ_{0} ∥}^{2}}{σ^{2}} + (1 - ω) \frac{{∥ δ - θ ∥}^{2}}{σ^{2}}, 0 \leq ω < 1,

where $δ_{0}$ ia a target estimator.

The special case of the balanced error loss function is weighted quadratic loss when $ω = 0$ . The balance loss function was introduced by Zellner [21] to reflect two criteria: goodness of fit and precision of estimation. Then the associated risk function with respect to (1), will be $R (θ, δ) = E_{θ} [L (θ, δ)]$ . For more details about the use of this loss, we refer to Zinodiny et al. [22], Peng et al. [17], Cao and He [2] and Zinodiny et al. [23], to mention a few.

Assume $(X, U)$ is a $p + k$ random vector having a spherically symmetric distribution around the $p + k$ vector $(θ, 0)$ , $dim X = dim θ = p$ and $dim U = dim 0 = k$ . Further, suppose that the scalar variational component $σ^{2}$ is unknown which will be posed for X. We wish to estimate $θ = {(θ_{1}, \dots, θ_{p})}^{T}$ by $δ = {(δ_{1}, \dots, δ_{p})}^{T}$ under the balance loss function. Here, we consider the cases where the members of a subset of $θ_{i} \geq 0$ , $i = 1, \dots, p$ , are non-negative, i.e., $θ_{1} \geq 0, θ_{2} \geq 0, \dots, θ_{q} \geq 0$ and where $θ_{q + 1}, θ_{q + 2}, \dots, θ_{p}$ are unrestricted. Further, let the scale matrix be equal to $σ^{2} I_{p}$ with unknown $σ^{2}$ and $S^{2}$ is an unbiased estimator of $σ^{2}$ , independent of X.

Define $γ_{q} (X) = (γ_{q, 1} (X), \dots, γ_{q, p} (X))$ , for $j = 1, 2, \dots, q$ , as

γ_{q, j} (X) = {\begin{matrix} - X_{j}, & X_{j} < 0, \\ 0, & X_{j} \geq 0, \end{matrix}

and $γ_{q, j} (X) = 0$ if $j > q$ . Then the natural and Baranchik-type shrinkage estimators are, respectively, defined as

δ_{q}^{(1)} (X) = X + γ_{q} (X),

δ_{q}^{(2)} (X, U) = X + γ_{q} (X) + U^{T} U g (X, S),

where $g (X, S)$ has the form

g (X, S) = - \frac{c S^{2} r (\frac{{∥ X ∥}^{2}}{S^{2}})}{{∥ X ∥}^{2}} X,

for some constant c. Furthermore, suppose that the function $r : R^{+} \to [0, 1]$ is twice differentiable and concave. To see the original form of the Baranchik-type shrinkage estimators, refer to Baranchik [1]. In the sequel, we need the following results.

Definition 2.2

A continuous function $f : R^{p} \to R$ is super-harmonic at a point $x_{0} \in R^{p}$ if, for every $r > 0$ , the average of f over the surface of the sphere $S_{r} (x_{0}) = {x : ∥ x - x_{0} ∥ = r}$ is less than or equal to $f (x_{0})$ . The function f is super-harmonic in $R^{p}$ if it is super-harmonic at each $x_{0} \in R^{p}$ .

Lemma 2.1

If $f : R^{p} \to R$ is twice differentiable, then f is super-harmonic in $R^{p}$ if and only if for all $x \in R^{p}$ ,

\nabla \cdot f (x) = \sum_{i = 1}^{p} \frac{\partial^{2}}{\partial x_{i}^{2}} f (x) \leq 0 .

Lemma 2.2

Let Y be a random variable, and $g (y)$ and $h (y)$ any functions for which $E [g (Y)]$ , $E [(h (Y)]$ , and $E [g (Y) h (Y)]$ exist. Then:

If one of the functions $g (\cdot)$ and $h (\cdot)$ is nonincreasing and the other is nondecreasing,
$E [g (Y) h (Y)] \leq E [g (Y)] E [h (Y)] .$
If both functions are either nondecreasing or nonincreasing,
$E [g (Y) h (Y)] \geq E [g (Y)] E [h (Y)] .$

For the proofs of Lemmas 2.1 and 2.2, see Lehmann and Casella [14].

Main result

In this section, we propose the superiority conditions for which the specified shrinkage estimator (4) outperforms the natural one (3). For our purpose, we consider unimodal spherical distributions. Similar to Jafari Jozani et al. [11], the target estimator can be the part of the shrinkage estimator. Let

δ_{0}^{(1)} (X, U) = X + (1 - ω) U^{T} U g (X, S) .

δ_{0}^{(2)} (X) = X + (1 - ω) γ_{q} (X) .

Hence

\begin{aligned} \begin{aligned} δ_{q}^{(1)} (X) & = δ_{0}^{(1)} (X, U) + γ_{q} (X) - (1 - ω) U^{T} U g (X, S) \\ = δ_{0}^{(2)} (X) + ω γ_{q} (X), \end{aligned} \\ \begin{aligned} δ_{q}^{(2)} (X, U) & = δ_{0}^{(1)} (X, U) + γ_{q} (X) + ω U^{T} U g (X, S) \\ = δ_{0}^{(2)} (X) + ω γ_{q} (X) + U^{T} U g (X, S) . \end{aligned} \end{aligned}

Considering these two estimators, the difference in risk for $i = 1, 2$ has the form

\begin{array}{rcl} Δ R_{ω, δ_{0}^{(i)}} (θ, δ) & = & R_{ω, δ_{0}^{(i)}} (θ, δ_{q}^{(2)}) - R_{ω, δ_{0}^{(i)}} (θ, δ_{q}^{(1)}) \\ = & \frac{1}{σ^{2}} E_{θ} [ω ({∥ δ_{q}^{(2)} - δ_{0}^{(i)} ∥}^{2} - {∥ δ_{q}^{(1)} - δ_{0}^{(i)} ∥}^{2}) \\ + (1 - ω) ({∥ δ_{q}^{(2)} - θ ∥}^{2} - {∥ δ_{q}^{(1)} - θ ∥}^{2})] \\ = & \frac{1}{σ^{2}} E_{θ} [ω ({∥ X + γ_{q} (X) + U^{T} U g (X, S) - δ_{0}^{(i)} ∥}^{2} \\ - {∥ X + γ_{q} (X) - δ_{0}^{(i)} ∥}^{2}) \\ + (1 - ω) ({∥ X + γ_{q} (X) + U^{T} U g (X, S) - θ ∥}^{2} \\ - {∥ X + γ_{q} (X) - θ ∥}^{2})] \\ = & \frac{1}{σ^{2}} E_{θ} [{(U^{T} U)}^{2} {∥ g (X, S) ∥}^{2} + 2 (1 - ω) U^{T} U g^{T} (X, S) (X - θ) \\ + 2 (1 - ω) U^{T} U g^{T} (X, S) γ_{q} (X) \\ + 2 ω U^{T} U g^{T} (X, S) (X + γ_{q} (X) - δ_{0}^{(i)})] . \end{array}

Replacing the estimators $δ_{0}^{(1)} (X)$ and $δ_{0}^{(2)} (X)$ in (9), the risk differences for $i = 1, 2$ are given by the following:

\begin{aligned} Δ R^{(1)} = & Δ R_{ω, δ_{0}^{(1)}} (θ, δ) \\ = & \frac{1}{σ^{2}} E_{θ} [{(U^{T} U)}^{2} {∥ g (X, S) ∥}^{2} + 2 (1 - ω) U^{T} U g^{T} (X, S) (X - θ) \\ + 2 (1 - ω) U^{T} U g^{T} (X, S) γ_{q} (X) \\ + 2 ω U^{T} U g^{T} (X, S) (γ_{q} (X) - (1 - ω) U^{T} U g (X, S))] \\ = & \frac{1}{σ^{2}} E_{θ} [(1 - 2 ω + 2 ω^{2}) {(U^{T} U)}^{2} {∥ g (X, S) ∥}^{2} \\ + 2 (1 - ω) U^{T} U g^{T} (X, S) (X - θ) \\ + 2 U^{T} U g^{T} (X, S) γ_{q} (X)]; \end{aligned}

\begin{aligned} Δ R^{(2)} = & Δ R_{ω, δ_{0}^{(2)}} (θ, δ) \\ = & \frac{1}{σ^{2}} E_{θ} [{(U^{T} U)}^{2} {∥ g (X, S) ∥}^{2} + 2 (1 - ω) U^{T} U g^{T} (X, S) (X - θ) \\ + 2 (1 - ω) U^{T} U g^{T} (X, S) γ_{q} (X) - 2 ω^{2} U^{T} U g^{T} (X, S) γ_{q} (X)] \\ = & \frac{1}{σ^{2}} E_{θ} [{(U^{T} U)}^{2} {∥ g (X, S) ∥}^{2} + 2 (1 - ω) U^{T} U g^{T} (X, S) (X - θ) \\ + 2 (1 - ω + ω^{2}) U^{T} U g^{T} (X, S) γ_{q} (X)] . \end{aligned}

Inside the expectations (10) and (11), the second term depends on θ. To avoid this, we use the following lemmas.

Lemma 3.1

(Fourdrinier and Strawderman [8])

For every weakly differentiable function $g : R^{p} \to R^{p}$ , for every integer s and for every $θ \in R^{p}$ we have

E_{θ} [{(U^{T} U)}^{s} g {(X, S)}^{T} (X - θ)] = \frac{1}{k + 2 s} E_{θ} [{(U^{T} U)}^{s + 1} \nabla \cdot g (X, S)]

provided these expectations exist.

Lemma 3.2

(Stein [19]) Suppose that $X \sim N_{p} (θ, σ^{2} I_{p})$ and $g : R^{p} \to R^{p}$ with known $σ^{2}$ , then

E_{θ} [{(X - θ)}^{T} g (X, S)] = σ^{2} E [\nabla \cdot g (X, S)] .

Taking $s = 1$ in Lemma 3.1, for weakly differentiable function g, the risk differences (10) and (11) become

\begin{aligned} Δ R^{(1)} = & \frac{1}{σ^{2}} E_{θ} [(1 - 2 ω + 2 ω^{2}) {(U^{T} U)}^{2} {∥ g (X, S) ∥}^{2} \\ + \frac{2 (1 - ω)}{k + 2} {(U^{T} U)}^{2} \nabla \cdot g (X, S) \\ + 2 U^{T} U g^{T} (X, S) γ_{q} (X)], \end{aligned}

\begin{aligned} Δ R^{(2)} = & \frac{1}{σ^{2}} E_{θ} [{(U^{T} U)}^{2} {∥ g (X, S) ∥}^{2} + \frac{2 (1 - ω)}{k + 2} {(U^{T} U)}^{2} \nabla \cdot g (X, S) \\ + 2 (1 - ω + ω^{2}) U^{T} U g^{T} (X, S) γ_{q} (X)] . \end{aligned}

In order to further analyze the risk difference, we need the following results.

Lemma 3.3

(Fourdrinier et al. [7])

If r is a non-negative, differentiable and concave real-valued function, then r is nondecreasing on $R^{+}$ and the function $r (t) / t$ is nonincreasing on $R^{+}$ . Furthermore, if in addition r is twice differentiable, then the function $r ({∥ x ∥}^{2}) / {∥ x ∥}^{2}$ is super-harmonic for $p \geq 4$ .

Lemma 3.4

Assume X is a real-valued random variable with symmetric unimodal distribution about $θ \in R^{+}$ . If f is a non-negative function on $R^{+}$ , then

E_{θ} [f (X^{2}) \frac{X^{2}}{σ^{2}} I_{[X < 0]}] \leq \frac{1}{2} E_{θ} [\frac{{(X - θ)}^{2}}{σ^{2}} f (X^{2})] .

Proof

According to the symmetry and unimodality of the distribution, X has a density of the form $h ({(X - θ)}^{2})$ with h nonincreasing. Thus, we can write

\begin{aligned} E_{θ} [\frac{f (X^{2})}{σ^{2}} {X^{2} I_{[X < 0]} - \frac{1}{2} (X^{2} - 2 θ X + θ^{2})}] \\ = E_{θ} [\frac{f (X^{2})}{σ^{2}} {X^{2} I_{[X < 0]} - \frac{1}{2} (X^{2} + 2 θ X + θ^{2}) I_{[X < 0]} - \frac{1}{2} (X^{2} - 2 θ X + θ^{2}) I_{[X \geq 0]}}] \\ = E_{θ} [\frac{f (X^{2})}{σ^{2}} {(\frac{1}{2} X^{2} + θ X - \frac{1}{2} θ^{2}) I_{[X < 0]} - (\frac{1}{2} X^{2} - θ X + \frac{1}{2} θ^{2}) I_{[X \geq 0]}}] . \end{aligned}

By the conditioning expectation (14) on $| X |$ we have the following expectation:

\begin{array}{rcl} = & E_{θ} [\frac{f (X^{2})}{σ^{2}} (\frac{1}{2} X^{2} + θ X - \frac{1}{2} θ^{2}) I_{[X < 0]} | | X |] \\ - E_{θ} [\frac{f (X^{2})}{σ^{2}} (\frac{1}{2} X^{2} - θ X + \frac{1}{2} θ^{2}) I_{[X \geq 0]} | | X |] \\ = & \int_{I_{[\frac{1}{2} X^{2} - θ | x | - \frac{1}{2} θ^{2} > 0]}} \frac{f (x^{2})}{σ^{2}} (\frac{1}{2} x^{2} - θ | x | - \frac{1}{2} θ^{2}) h ({(- | x | - θ)}^{2}) d x \\ - \int_{I_{[\frac{1}{2} x^{2} - θ | x | - \frac{1}{2} θ^{2} > 0]}} \frac{f (x^{2})}{σ^{2}} (\frac{1}{2} x^{2} - θ | x | + \frac{1}{2} θ^{2}) h ({(| x | - θ)}^{2}) d x \\ \leq & \int_{I_{[\frac{1}{2} x^{2} - θ | x | - \frac{1}{2} θ^{2} > 0]}} \frac{f (x^{2})}{σ^{2}} (- θ^{2}) h ({(| x | - θ)}^{2}) d x \\ = & E_{θ} [\frac{f (x^{2})}{σ^{2}} I_{[\frac{1}{2} x^{2} - θ | x | - \frac{1}{2} θ^{2} > 0]} (- θ^{2})] \leq 0 . \end{array}

The result follows since in (15), for all $θ > 0$ , we have ${(- | X | - θ)}^{2} \geq {(| X | - θ)}^{2}$ and $h ({(- | X | - θ)}^{2}) \leq h ({(| X | - θ)}^{2})$ . □

We now state the main result.

Theorem 3.1

The shrinkage estimator $δ_{q}^{(2)} (X, U)$ dominates the natural estimator $δ_{q}^{(1)} (X)$ under the BEL( $δ_{0}^{(1)}$ ), if the following conditions hold:

$p > \frac{q (k + 2)}{2 (1 - ω) (k - 2)} + 2$ ,
$0 < c \leq \frac{(2 (1 - ω) \frac{p - 2}{k + 2} - \frac{q}{k - 2})}{(1 - 2 ω + 2 ω^{2})} \frac{E_{σ = 1} (S^{2})}{E_{σ = 1} (S^{4})}$ .

Proof

Since $0 \leq r (\cdot) \leq 1$ is a non-negative, differentiable and concave function by Lemma 3.3, we have $r^{'} (\cdot) \geq 0$ . Using Lemma 3.1 and by the conditioning risk difference $Δ R^{(1)}$ on $(S = s)$ , we have the following inequality:

\begin{aligned} \frac{1}{σ^{2}} E_{S^{2}} (E_{θ} [c^{2} (1 - 2 ω + 2 ω^{2}) {(U^{T} U)}^{2} \frac{r^{2} (\frac{{∥ X ∥}^{2}}{S^{2}}) S^{4}}{{∥ X ∥}^{2}} \\ - 4 c (1 - ω) \frac{{(U^{T} U)}^{2} r^{'} (\frac{{∥ X ∥}^{2}}{S^{2}})}{k + 2} \\ - 2 c (1 - ω) \frac{(p - 2) {(U^{T} U)}^{2} r (\frac{{∥ X ∥}^{2}}{S^{2}}) S^{2}}{(k + 2) {∥ X ∥}^{2}} \\ + 2 c (U^{T} U) \frac{r (\frac{{∥ X ∥}^{2}}{S^{2}}) S^{2}}{{∥ X ∥}^{2}} \sum_{i = 1}^{q} X_{i}^{2} I_{[X_{i} \leq 0]}] | S = s) \\ \leq \frac{1}{σ^{2}} E_{S^{2}} (E_{θ} [{(U^{T} U)}^{2} \frac{r (\frac{{∥ X ∥}^{2}}{S^{2}})}{{∥ X ∥}^{2}} c (c (1 - 2 ω + 2 ω^{2}) S^{4} - 2 (1 - ω) S^{2} \frac{p - 2}{k + 2} \\ + 2 S^{2} \frac{\sum_{i = 1}^{q} X_{i}^{2} I_{[X_{i} \leq 0]}}{U^{T} U})] | S = s) . \end{aligned}

Suppose $X_{1}^{q} = (X_{1}, \dots, X_{q})$ , $η = (θ_{1}, \dots, θ_{q})$ , $X_{q + 1}^{p} = (X_{q + 1}, \dots, X_{p})$ , $μ = (θ_{q + 1}, \dots, θ_{p})$ , $Z = σ^{- 1} (X - θ)$ , $V = (Z_{1}, \dots, Z_{q})$ and $T = (Z_{q + 1}, \dots, Z_{p})$ . Hence, $V = σ^{- 1} (X_{1}^{q} - η)$ and $T = σ^{- 1} (X_{q + 1}^{p} - μ)$ . Since ${∥ X ∥}^{2} = {∥ X_{1}^{q} ∥}^{2} + {∥ X_{q + 1}^{p} ∥}^{2}$ , $X_{1}^{q} = σ V + η$ and $X_{q + 1}^{p} = σ T + μ$ . Let $W^{2} = V^{'} V + U^{'} U$ . Then, assuming $σ = 1$ , an upper bound on the conditional expression (16) by Lemma 3.4 is given by

\begin{aligned} E_{S^{2}} {E_{θ} [{(W^{2} - V^{T} V)}^{2} \frac{r (({∥ X_{1}^{q} ∥}^{2} + {∥ X_{q + 1}^{p} ∥}^{2}) / S^{2})}{{∥ X_{1}^{q} ∥}^{2} + {∥ X_{q + 1}^{p} ∥}^{2}} c (c (1 - 2 ω + 2 ω^{2}) S^{4} \\ - 2 (1 - ω) S^{2} \frac{p - 2}{k + 2} + S^{2} \frac{V^{T} V}{W^{2} - V^{T} V})] | S = s} . \end{aligned}

Using Lemma 3.3, $\frac{r (({∥ X_{1}^{q} ∥}^{2} + {∥ X_{q + 1}^{p} ∥}^{2}) / S^{2})}{{∥ X_{1}^{q} ∥}^{2} + {∥ X_{q + 1}^{p} ∥}^{2}}$ for $p \geq 4$ is super-harmonic and as a result, in $\frac{{∥ X ∥}^{2}}{S^{2}}$ , is nondecreasing. Therefore, the conditional risk difference (17) given $W^{2}$ and T is

\begin{aligned} c E_{θ} [{(W^{2} - V^{T} V)}^{2} \frac{r (({∥ X_{1}^{q} ∥}^{2} + {∥ X_{q + 1}^{p} ∥}^{2}) / s^{2})}{{∥ X_{1}^{q} ∥}^{2} + {∥ X_{q + 1}^{p} ∥}^{2}} \\ \times (c (1 - 2 ω + 2 ω^{2}) s^{4} - 2 (1 - ω) s^{2} \frac{p - 2}{k + 2} + s^{2} \frac{V^{T} V}{W^{2} - V^{T} V}) | W^{2}, T] \\ \leq c E_{θ} [{(W^{2} - V^{T} V)}^{2} \frac{r (({∥ X_{1}^{q} ∥}^{2} + {∥ X_{q + 1}^{p} ∥}^{2}) / s^{2})}{{∥ X_{1}^{q} ∥}^{2} + {∥ X_{q + 1}^{p} ∥}^{2}} | W^{2}, T] \\ \times E_{θ} [(c (1 - 2 ω + 2 ω^{2}) s^{4} - 2 (1 - ω) s^{2} \frac{p - 2}{k + 2} \\ + s^{2} \frac{V^{T} V}{W^{2} - V^{T} V}) | W^{2}, T] . \end{aligned}

In equality (18), by Lemma 2.2, for fixed $W^{2}$ and T, we see that $E_{θ} [\frac{r (({∥ X_{1}^{q} ∥}^{2} + {∥ X_{q + 1}^{p} ∥}^{2}) / s^{2})}{{∥ X_{1}^{q} ∥}^{2} + {∥ X_{q + 1}^{p} ∥}^{2}} | W^{2}, T]$ is nonincreasing in $V^{T} V$ by Lemma A.4 of Fourdrinier et al. [7]. It suffices to show that the second conditional expectation in (18) is non-positive. Since $U^{T} U$ and $V^{T} V$ have distributions $χ_{k}^{2}$ and $χ_{q}^{2}$ , respectively, $(V^{T} V) / W^{2}$ is distributed according to $B e t a (\frac{q}{2}, \frac{k}{2})$ and hence we get

E [V^{T} V / (W^{2} - V^{T} V)] = q / (k - 2) .

Then the risk difference is non-positive if

0 < c \leq \frac{(2 (1 - ω) \frac{p - 2}{k + 2} - \frac{q}{k - 2})}{(1 - 2 ω + 2 ω^{2})} \frac{E_{σ = 1} (S^{2})}{E_{σ = 1} (S^{4})} .

Simple calculations show that c is positive if and only if

k > \frac{4 (1 - ω) (p - 2) + 2 q}{2 (1 - ω) (p - 2) - q} .

This completes the proof. □

In a similar fashion, we have the following result, stated without proof.

Theorem 3.2

The shrinkage estimator $δ_{q}^{(2)} (X, U)$ dominates the natural estimator $δ_{q}^{(1)} (X)$ under the BEL( $δ_{0}^{(2)}$ ), if the following conditions hold:

$p > \frac{q (1 - ω + ω^{2}) (k + 2)}{2 (1 - ω) (k - 2)} + 2$ ,
$0 < c \leq (2 (1 - ω) \frac{p - 2}{k + 2} - (1 - ω + ω^{2}) \frac{q}{k - 2}) \frac{E_{σ = 1} (S^{2})}{E_{σ = 1} (S^{4})}$ .

The following result is for the p-variate normal distribution, a particular member of the spherical class.

Proposition 3.1

Assume the parent distribution $N_{p} (θ, σ^{2} I_{p})$ with unknown $σ^{2}$ . Then the shrinkage estimator $X + γ_{q} (X) + g (X, S)$ dominates the natural estimator $X + γ_{q} (X)$ under the BEL( $δ_{0}^{(i)}$ ), if the following conditions hold:

For BEL( $δ_{0}^{(1)}$ ): $p > \frac{q}{2 (1 - ω)} + 2$ , $0 < c \leq \frac{(2 (1 - ω) (p - 2) - q)}{(1 - 2 ω + 2 ω^{2})} \frac{E_{σ = 1} (S^{2})}{E_{σ = 1} (S^{4})}$ .
For BEL( $δ_{0}^{(2)}$ ): $p > \frac{(1 - ω + ω^{2}) q}{2 (1 - ω)} + 2$ , $0 < c \leq (2 (1 - ω) (p - 2) - (1 - ω + ω^{2}) q) \frac{E_{σ = 1} (S^{2})}{E_{σ = 1} (S^{4})}$ .

Proof

The proof is similar to that of Theorem 3.1. However, we use Lemma 3.2, instead of Lemma 3.1. □

Simulation

To evaluate the performance of a Baranchik-type shrinkage estimator, in this section, we conduct a Monte Carlo simulation study to compare its risk with that of the natural estimator for the 14-variate t distribution with 13 degrees of freedom. Risk values are obtained from 1000 Monte Carlo replications, and plotted in Figs. 1 and 2, for different values q and w. In these figures θ is selected as $(j, 0, \dots, 0)$ and $j = 0, 0.1, 0.2, \dots, 10$ . In this case, $∥ θ ∥ = θ^{T} θ = \sum_{i = 1}^{p} θ_{i} = j^{2}$ .

Risk curve for $δ_{0}^{(1)} (X)$ , $p = 14$ , black line for $q = 5$ and red line for $q = 10$ for different values of ω

Risk curve for $δ_{0}^{(2)} (X)$ , $p = 14$ , black line for $q = 5$ and red line for $q = 10$ for different values of ω

In Figs. 1 and 2, the (Baranchik-type) shrinkage estimator risk curve is below that of the natural estimator, i.e., the shrinkage estimator dominates the natural estimator. Further, it is seen by increasing the amount of w, the risk difference gets larger, which is a bonus in our study.

Air pollution data

In this section, we further investigate the superior performance of the Baranchik-type shrinkage estimator compared to the natural estimator. For this sake, we use the air pollution dataset of USA cities in 1981, from Everitt and Hothorn [4]. They fitted a p-variate normal distribution to this dataset. Here, we have the following list of variables: SO2 content of air in micrograms per cubic meter (SO2), average annual temperature in degrees Fahrenheit (temp), number of manufacturing enterprises employing 20 or more workers (manu), population size (1970 census) in thousands (popul), average annual wind speed in miles per hour (wind), average annual precipitation in inches (precip), average number of days with precipitation per year (predays). We have implemented a bootstrap analysis to evaluate the risk functions. Table 1 lists the values of risk difference $(Δ R^{(i)})$ for different values of w and $σ^{2}$ , for targeted estimators $δ_{0}^{(1)} (X)$ and $δ_{0}^{(2)} (X)$ , respectively. All the values in these tables are negative. (A negative value is a sign of $R_{ω, δ_{0}^{(i)}} (θ, δ_{q}^{(2)}) \leq R_{ω, δ_{0}^{(i)}} (θ, δ_{q}^{(1)})$ .) The same conclusions as for the figures in the previous section can also be obtained.

Table 1.

Values of risk difference for $p = 7$

(ω,q)	(0.3,5)	(0.5,3)	(0.7,2)
$Δ R^{(1)}$	−0.0004641284	−0.0003845636	−0.0000994561
$Δ R^{(2)}$	−0.0004105216	−0.0002643874	−0.0000819120

Open in a new tab

Conclusion

In this paper, the estimation of a restricted parameter space is considered using a class of general shrinkage type estimators under a balance loss function. The class of Baranchik-type shrinkage estimators is considered as a competitor to the well-known James–Stein ones. Since the scalar scale component was unknown, we used another random variable, $S^{2}$ say, independent from the model under study. Theoretical findings of this paper are further supported by some numerical analyses. It is observed that the Baranchik-type shrinkage estimator is always superior to the natural estimator, regardless of the weight value in balance loss function. The result of this paper can stimulate the research in the direction of the mean estimation in restricted parameter space.

Acknowledgements

The authors would like to thank the editors and reviewers for their valuable comments, which greatly improved the readability of this paper.

Authors’ contributions

The authors have equally made contributions. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests. The authors state that no funding source or sponsor has participated in the realization of this work.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Hamid Karamikabir, Email: h_karamikabir@yahoo.com.

Mahmoud Afshari, Email: afshar@pgu.ac.ir.

Mohammad Arashi, Email: m_arashi_stat@yahoo.com.

References

1.Baranchik A.J. A family of minimax estimators of the mean of a multivariate normal distribution. Ann. Math. Stat. 1970;41(2):642–645. doi: 10.1214/aoms/1177697104. [DOI] [Google Scholar]
2.Cao M.X., He D. Admissibility of linear estimators of the common mean parameter in general linear models under a balanced loss function. J. Multivar. Anal. 2017;153:246–254. doi: 10.1016/j.jmva.2016.10.003. [DOI] [Google Scholar]
3.Chang Y.T., Strawderman W.E. Simultaneous estimation of p positive normal means with common unknown variance. Stat. Probab. Lett. 2017;121:83–89. doi: 10.1016/j.spl.2016.10.012. [DOI] [Google Scholar]
4.Everitt B., Hothorn T. An Introduction to Applied Multivariate Analysis with R. New York: Springer; 2011. [Google Scholar]
5.Fourdrinier D., Marchand E. On Bayes estimators with uniform priors on spheres and their comparative performance with maximum likelihood estimators for estimating bounded multivariate normal means. J. Multivar. Anal. 2010;101:1390–1399. doi: 10.1016/j.jmva.2010.01.011. [DOI] [Google Scholar]
6.Fourdrinier D., Ouassou I. Estimation of the mean of a spherically symmetric distribution with constraints on the norm. Can. J. Stat. 2000;28(2):399–415. doi: 10.2307/3315987. [DOI] [Google Scholar]
7.Fourdrinier D., Ouassou I., Strawderman W.E. Estimation of a parameter vector when some components are restricted. J. Multivar. Anal. 2003;86:14–27. doi: 10.1016/S0047-259X(02)00045-3. [DOI] [Google Scholar]
8.Fourdrinier D., Strawderman W.E. A paradox concerning shrinkage estimators: should a known scale parameter be replaced by an estimated value in the shrinkage factor. J. Multivar. Anal. 1996;59(2):109–140. doi: 10.1006/jmva.1996.0056. [DOI] [Google Scholar]
9.Fourdrinier D., Strawderman W.E., Wells M.T. Estimation of a location parameter with restrictions or “vague information” for spherically symmetric distributions. Ann. Inst. Stat. Math. 2006;58:73–92. doi: 10.1007/s10463-005-0001-0. [DOI] [Google Scholar]
10.Hoque Z., Wesolowski J., Hossain S. Shrinkage estimator of regression model under asymmetric loss. Commun. Stat., Theory Methods. 2018;47(22):5547–5557. doi: 10.1080/03610926.2017.1397169. [DOI] [Google Scholar]
11.Jafari Jozani M., Marchand E., Parsian A. On estimation with weighted balanced-type loss function. Stat. Probab. Lett. 2006;76:733–780. doi: 10.1016/j.spl.2005.10.026. [DOI] [Google Scholar]
12.Kortbi O., Marchand E. Truncated linear estimation of abounded multivariate normal mean. J. Stat. Plan. Inference. 2012;142:2607–2618. doi: 10.1016/j.jspi.2012.03.022. [DOI] [Google Scholar]
13.Kubokawa T., Marchand E., Strawderman W.E. On improved shrinkage estimators for concave loss. Stat. Probab. Lett. 2015;96:241–246. doi: 10.1016/j.spl.2014.09.024. [DOI] [Google Scholar]
14.Lehmann E.L., Casella G. Theory of Point Estimation. 2. NewYork: Springer; 1998. [Google Scholar]
15.Marchand E., Strawderman W.E. Estimation in restricted parameter spaces: a review, a festschrift for herman Rubin. Lect. Notes Monogr. Ser. 2004;45:21–44. doi: 10.1214/lnms/1196285377. [DOI] [Google Scholar]
16.Marchand E., Strawderman W.E. A unified minimax result for restricted parameter spaces. Bernoulli. 2012;18(2):635–643. doi: 10.3150/10-BEJ336. [DOI] [Google Scholar]
17.Peng P., Guikai Hu G., Liang J. All admissible linear predictors in the finite populations with respect to inequality constraints under a balanced loss function. J. Multivar. Anal. 2015;140:113–122. doi: 10.1016/j.jmva.2015.05.003. [DOI] [Google Scholar]
18.Silvapulle M.J., Sen P.K. Constrained Statistical Inference Inequality, Order, and Shape Restrictions. New Jersey: Wiley; 2005. [Google Scholar]
19.Stein C.M. Estimation of the mean of a multivariate normal distribution. Ann. Stat. 1981;9(6):1135–1151. doi: 10.1214/aos/1176345632. [DOI] [Google Scholar]
20.van Eeden C. Restricted Parameter Space Estimation Problems, Admissibility and Minimaxity Properties. New York: Springer; 2006. [Google Scholar]
21.Zellner A. Bayesian and non-Bayesian estimation using balanced loss functions. In: Berger J.O., Gupta S.S., editors. Statistical Decision Theory and Methods, Volume V. New York: Springer; 1994. pp. 337–390. [Google Scholar]
22.Zinodiny S., Rezaei S., Nadarajah S. Bayes minimax estimation of the multivariate normal mean vector under balanced loss function. Stat. Probab. Lett. 2014;93:96–101. doi: 10.1016/j.spl.2014.06.022. [DOI] [Google Scholar]
23.Zinodiny S., Rezaei S., Nadarajah S. Bayes minimax estimation of the mean matrix of matrix-variate normal distribution under balanced loss function. Stat. Probab. Lett. 2017;125:110–120. doi: 10.1016/j.spl.2017.02.003. [DOI] [Google Scholar]

[CR1] 1.Baranchik A.J. A family of minimax estimators of the mean of a multivariate normal distribution. Ann. Math. Stat. 1970;41(2):642–645. doi: 10.1214/aoms/1177697104. [DOI] [Google Scholar]

[CR2] 2.Cao M.X., He D. Admissibility of linear estimators of the common mean parameter in general linear models under a balanced loss function. J. Multivar. Anal. 2017;153:246–254. doi: 10.1016/j.jmva.2016.10.003. [DOI] [Google Scholar]

[CR3] 3.Chang Y.T., Strawderman W.E. Simultaneous estimation of p positive normal means with common unknown variance. Stat. Probab. Lett. 2017;121:83–89. doi: 10.1016/j.spl.2016.10.012. [DOI] [Google Scholar]

[CR4] 4.Everitt B., Hothorn T. An Introduction to Applied Multivariate Analysis with R. New York: Springer; 2011. [Google Scholar]

[CR5] 5.Fourdrinier D., Marchand E. On Bayes estimators with uniform priors on spheres and their comparative performance with maximum likelihood estimators for estimating bounded multivariate normal means. J. Multivar. Anal. 2010;101:1390–1399. doi: 10.1016/j.jmva.2010.01.011. [DOI] [Google Scholar]

[CR6] 6.Fourdrinier D., Ouassou I. Estimation of the mean of a spherically symmetric distribution with constraints on the norm. Can. J. Stat. 2000;28(2):399–415. doi: 10.2307/3315987. [DOI] [Google Scholar]

[CR7] 7.Fourdrinier D., Ouassou I., Strawderman W.E. Estimation of a parameter vector when some components are restricted. J. Multivar. Anal. 2003;86:14–27. doi: 10.1016/S0047-259X(02)00045-3. [DOI] [Google Scholar]

[CR8] 8.Fourdrinier D., Strawderman W.E. A paradox concerning shrinkage estimators: should a known scale parameter be replaced by an estimated value in the shrinkage factor. J. Multivar. Anal. 1996;59(2):109–140. doi: 10.1006/jmva.1996.0056. [DOI] [Google Scholar]

[CR9] 9.Fourdrinier D., Strawderman W.E., Wells M.T. Estimation of a location parameter with restrictions or “vague information” for spherically symmetric distributions. Ann. Inst. Stat. Math. 2006;58:73–92. doi: 10.1007/s10463-005-0001-0. [DOI] [Google Scholar]

[CR10] 10.Hoque Z., Wesolowski J., Hossain S. Shrinkage estimator of regression model under asymmetric loss. Commun. Stat., Theory Methods. 2018;47(22):5547–5557. doi: 10.1080/03610926.2017.1397169. [DOI] [Google Scholar]

[CR11] 11.Jafari Jozani M., Marchand E., Parsian A. On estimation with weighted balanced-type loss function. Stat. Probab. Lett. 2006;76:733–780. doi: 10.1016/j.spl.2005.10.026. [DOI] [Google Scholar]

[CR12] 12.Kortbi O., Marchand E. Truncated linear estimation of abounded multivariate normal mean. J. Stat. Plan. Inference. 2012;142:2607–2618. doi: 10.1016/j.jspi.2012.03.022. [DOI] [Google Scholar]

[CR13] 13.Kubokawa T., Marchand E., Strawderman W.E. On improved shrinkage estimators for concave loss. Stat. Probab. Lett. 2015;96:241–246. doi: 10.1016/j.spl.2014.09.024. [DOI] [Google Scholar]

[CR14] 14.Lehmann E.L., Casella G. Theory of Point Estimation. 2. NewYork: Springer; 1998. [Google Scholar]

[CR15] 15.Marchand E., Strawderman W.E. Estimation in restricted parameter spaces: a review, a festschrift for herman Rubin. Lect. Notes Monogr. Ser. 2004;45:21–44. doi: 10.1214/lnms/1196285377. [DOI] [Google Scholar]

[CR16] 16.Marchand E., Strawderman W.E. A unified minimax result for restricted parameter spaces. Bernoulli. 2012;18(2):635–643. doi: 10.3150/10-BEJ336. [DOI] [Google Scholar]

[CR17] 17.Peng P., Guikai Hu G., Liang J. All admissible linear predictors in the finite populations with respect to inequality constraints under a balanced loss function. J. Multivar. Anal. 2015;140:113–122. doi: 10.1016/j.jmva.2015.05.003. [DOI] [Google Scholar]

[CR18] 18.Silvapulle M.J., Sen P.K. Constrained Statistical Inference Inequality, Order, and Shape Restrictions. New Jersey: Wiley; 2005. [Google Scholar]

[CR19] 19.Stein C.M. Estimation of the mean of a multivariate normal distribution. Ann. Stat. 1981;9(6):1135–1151. doi: 10.1214/aos/1176345632. [DOI] [Google Scholar]

[CR20] 20.van Eeden C. Restricted Parameter Space Estimation Problems, Admissibility and Minimaxity Properties. New York: Springer; 2006. [Google Scholar]

[CR21] 21.Zellner A. Bayesian and non-Bayesian estimation using balanced loss functions. In: Berger J.O., Gupta S.S., editors. Statistical Decision Theory and Methods, Volume V. New York: Springer; 1994. pp. 337–390. [Google Scholar]

[CR22] 22.Zinodiny S., Rezaei S., Nadarajah S. Bayes minimax estimation of the multivariate normal mean vector under balanced loss function. Stat. Probab. Lett. 2014;93:96–101. doi: 10.1016/j.spl.2014.06.022. [DOI] [Google Scholar]

[CR23] 23.Zinodiny S., Rezaei S., Nadarajah S. Bayes minimax estimation of the mean matrix of matrix-variate normal distribution under balanced loss function. Stat. Probab. Lett. 2017;125:110–120. doi: 10.1016/j.spl.2017.02.003. [DOI] [Google Scholar]

PERMALINK

Shrinkage estimation of non-negative mean vector with unknown covariance under balance loss

Hamid Karamikabir

Mahmoud Afshari

Mohammad Arashi

Abstract

Introduction

Preliminaries

Definition 2.1

Definition 2.2

Lemma 2.1

Lemma 2.2

Main result

Lemma 3.1

Lemma 3.2

Lemma 3.3

Lemma 3.4

Proof

Theorem 3.1

Proof

Theorem 3.2

Proposition 3.1

Proof

Simulation

Figure 1.

Figure 2.

Air pollution data

Table 1.

Conclusion

Acknowledgements

Authors’ contributions

Competing interests

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Shrinkage estimation of non-negative mean vector with unknown covariance under balance loss

Hamid Karamikabir

Mahmoud Afshari

Mohammad Arashi

Abstract

Introduction

Preliminaries

Definition 2.1

Definition 2.2

Lemma 2.1

Lemma 2.2

Main result

Lemma 3.1

Lemma 3.2

Lemma 3.3

Lemma 3.4

Proof

Theorem 3.1

Proof

Theorem 3.2

Proposition 3.1

Proof

Simulation

Figure 1.

Figure 2.

Air pollution data

Table 1.

Conclusion

Acknowledgements

Authors’ contributions

Competing interests

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases