Abstract
Parameter estimation in multivariate analysis is important, particularly when parameter space is restricted. Among different methods, the shrinkage estimation is of interest. In this article we consider the problem of estimating the p-dimensional mean vector in spherically symmetric models. A dominant class of Baranchik-type shrinkage estimators is developed that outperforms the natural estimator under the balance loss function, when the mean vector is restricted to lie in a non-negative hyperplane. In our study, the components of the diagonal covariance matrix are assumed to be unknown. The performance evaluation of the proposed class of estimators is checked through a simulation study along with a real data analysis.
Keywords: Baranchik-type estimator, Balance loss function, Restricted estimator, Shrinkage estimator, Spherical distribution
Introduction
Shrinkage estimation is a method to improve a raw estimator in some sense, by combining it with other information. Although the shrinkage estimator is biased, it is well known that it has minimum quadratic risk compared to natural estimators (mostly the maximum likelihood estimator).
Mean vector (location) parameter estimation is an important problem in the context of shrinkage estimation, specially when some components of location parameter are restricted to be situated in a specific space. In this respect, Fourdrinier and Ouassou [6] initiated the restricted estimation problem of the mean for the general spherical model with known covariance and Fourdrinier et al. [7] studied the restricted estimation in the latter specified general spherical model, under three different constraints; see also Fourdrinier et al. [9]. Fourdrinier and Marchand [5] studied constraints with the form , with known , , and m when , on spheres of radius α centered at . Kortbi and Marchand [12] exhibited a truncated linear estimator under the constraint , in the multivariate normal model. Marchand and Strawderman [16] developed a unified approach for minimax estimation for a restricted parameter space. Kubokawa et al. [13] considered minimax shrinkage estimation of a location vector of a spherically symmetric distribution under a concave squared error loss. Also Chang and Strawderman [3] studied a shrinkage estimation of p positive normal means under sum of squared errors loss. Recently, Hoque et al. [10] investigated the performance of the shrinkage estimator of the parameters of a simple linear regression model under the asymmetric loss (LINEX loss criterion). For more details on this topic, we refer to Marchand and Strawderman [15], Silvapulle and Sen [18] and van Eeden [20], among others.
Here, we develop the approach of Fourdrinier et al. [7], in which they estimated location parameter-vector when some components are non-negative, for unknown covariance matrix under balance loss function. We specifically address the Baranchik-type estimators for our purpose.
The paper is outlined as follows: In Sect. 2, some preliminary results are addressed. Section 3 includes the main result, where we give the conditions under which the proposed class of shrinkage estimators dominates the natural estimator under balance loss function, while the numerical performance analysis is investigated by a simulation study in Sect. 4. In Sect. 5, we use the air pollution dataset of USA cities to further demonstrate the superior performance of the shrinkage estimation. The paper is concluded in Sect. 6.
Preliminaries
In this section, we consider the spherical distribution as the parent model, and introduce the natural and the Baranchik-type shrinkage estimator for estimation of restricted parameter space. A random vector X is said to have a spherically symmetric distribution (or simply spherical distribution) if X and ΛX have the same distribution for all orthogonal matrices Λ. Important members are the multivariate normal (), the “ε-contaminated” normal, and multivariate t distributions. For evaluating the performance of the estimators, we need to set a measure. In this paper, we use the balance loss function.
Definition 2.1
Suppose that X is a random vector having a spherical distribution with unknown mean vector parameter θ and scalar variational component . the balance error loss function, BEL() is defined as follows:
| 1 |
where ia a target estimator.
The special case of the balanced error loss function is weighted quadratic loss when . The balance loss function was introduced by Zellner [21] to reflect two criteria: goodness of fit and precision of estimation. Then the associated risk function with respect to (1), will be . For more details about the use of this loss, we refer to Zinodiny et al. [22], Peng et al. [17], Cao and He [2] and Zinodiny et al. [23], to mention a few.
Assume is a random vector having a spherically symmetric distribution around the vector , and . Further, suppose that the scalar variational component is unknown which will be posed for X. We wish to estimate by under the balance loss function. Here, we consider the cases where the members of a subset of , , are non-negative, i.e., and where are unrestricted. Further, let the scale matrix be equal to with unknown and is an unbiased estimator of , independent of X.
Define , for , as
| 2 |
and if . Then the natural and Baranchik-type shrinkage estimators are, respectively, defined as
| 3 |
| 4 |
where has the form
| 5 |
for some constant c. Furthermore, suppose that the function is twice differentiable and concave. To see the original form of the Baranchik-type shrinkage estimators, refer to Baranchik [1]. In the sequel, we need the following results.
Definition 2.2
A continuous function is super-harmonic at a point if, for every , the average of f over the surface of the sphere is less than or equal to . The function f is super-harmonic in if it is super-harmonic at each .
Lemma 2.1
If is twice differentiable, then f is super-harmonic in if and only if for all ,
| 6 |
Lemma 2.2
Let Y be a random variable, and and any functions for which , , and exist. Then:
- If one of the functions and is nonincreasing and the other is nondecreasing,
- If both functions are either nondecreasing or nonincreasing,
For the proofs of Lemmas 2.1 and 2.2, see Lehmann and Casella [14].
Main result
In this section, we propose the superiority conditions for which the specified shrinkage estimator (4) outperforms the natural one (3). For our purpose, we consider unimodal spherical distributions. Similar to Jafari Jozani et al. [11], the target estimator can be the part of the shrinkage estimator. Let
| 7 |
| 8 |
Hence
Considering these two estimators, the difference in risk for has the form
| 9 |
Replacing the estimators and in (9), the risk differences for are given by the following:
| 10 |
| 11 |
Inside the expectations (10) and (11), the second term depends on θ. To avoid this, we use the following lemmas.
Lemma 3.1
(Fourdrinier and Strawderman [8])
For every weakly differentiable function , for every integer s and for every we have
provided these expectations exist.
Lemma 3.2
(Stein [19]) Suppose that and with known , then
Taking in Lemma 3.1, for weakly differentiable function g, the risk differences (10) and (11) become
| 12 |
| 13 |
In order to further analyze the risk difference, we need the following results.
Lemma 3.3
(Fourdrinier et al. [7])
If r is a non-negative, differentiable and concave real-valued function, then r is nondecreasing on and the function is nonincreasing on . Furthermore, if in addition r is twice differentiable, then the function is super-harmonic for .
Lemma 3.4
Assume X is a real-valued random variable with symmetric unimodal distribution about . If f is a non-negative function on , then
Proof
According to the symmetry and unimodality of the distribution, X has a density of the form with h nonincreasing. Thus, we can write
| 14 |
By the conditioning expectation (14) on we have the following expectation:
| 15 |
The result follows since in (15), for all , we have and . □
We now state the main result.
Theorem 3.1
The shrinkage estimator dominates the natural estimator under the BEL(), if the following conditions hold:
,
.
Proof
Since is a non-negative, differentiable and concave function by Lemma 3.3, we have . Using Lemma 3.1 and by the conditioning risk difference on , we have the following inequality:
| 16 |
Suppose , , , , , and . Hence, and . Since , and . Let . Then, assuming , an upper bound on the conditional expression (16) by Lemma 3.4 is given by
| 17 |
Using Lemma 3.3, for is super-harmonic and as a result, in , is nondecreasing. Therefore, the conditional risk difference (17) given and T is
| 18 |
In equality (18), by Lemma 2.2, for fixed and T, we see that is nonincreasing in by Lemma A.4 of Fourdrinier et al. [7]. It suffices to show that the second conditional expectation in (18) is non-positive. Since and have distributions and , respectively, is distributed according to and hence we get
Then the risk difference is non-positive if
Simple calculations show that c is positive if and only if
This completes the proof. □
In a similar fashion, we have the following result, stated without proof.
Theorem 3.2
The shrinkage estimator dominates the natural estimator under the BEL(), if the following conditions hold:
,
.
The following result is for the p-variate normal distribution, a particular member of the spherical class.
Proposition 3.1
Assume the parent distribution with unknown . Then the shrinkage estimator dominates the natural estimator under the BEL(), if the following conditions hold:
For BEL(): , .
For BEL(): , .
Proof
The proof is similar to that of Theorem 3.1. However, we use Lemma 3.2, instead of Lemma 3.1. □
Simulation
To evaluate the performance of a Baranchik-type shrinkage estimator, in this section, we conduct a Monte Carlo simulation study to compare its risk with that of the natural estimator for the 14-variate t distribution with 13 degrees of freedom. Risk values are obtained from 1000 Monte Carlo replications, and plotted in Figs. 1 and 2, for different values q and w. In these figures θ is selected as and . In this case, .
Figure 1.
Risk curve for , , black line for and red line for for different values of ω
Figure 2.
Risk curve for , , black line for and red line for for different values of ω
In Figs. 1 and 2, the (Baranchik-type) shrinkage estimator risk curve is below that of the natural estimator, i.e., the shrinkage estimator dominates the natural estimator. Further, it is seen by increasing the amount of w, the risk difference gets larger, which is a bonus in our study.
Air pollution data
In this section, we further investigate the superior performance of the Baranchik-type shrinkage estimator compared to the natural estimator. For this sake, we use the air pollution dataset of USA cities in 1981, from Everitt and Hothorn [4]. They fitted a p-variate normal distribution to this dataset. Here, we have the following list of variables: SO2 content of air in micrograms per cubic meter (SO2), average annual temperature in degrees Fahrenheit (temp), number of manufacturing enterprises employing 20 or more workers (manu), population size (1970 census) in thousands (popul), average annual wind speed in miles per hour (wind), average annual precipitation in inches (precip), average number of days with precipitation per year (predays). We have implemented a bootstrap analysis to evaluate the risk functions. Table 1 lists the values of risk difference for different values of w and , for targeted estimators and , respectively. All the values in these tables are negative. (A negative value is a sign of .) The same conclusions as for the figures in the previous section can also be obtained.
Table 1.
Values of risk difference for
| (ω,q) | (0.3,5) | (0.5,3) | (0.7,2) |
|---|---|---|---|
| −0.0004641284 | −0.0003845636 | −0.0000994561 | |
| −0.0004105216 | −0.0002643874 | −0.0000819120 |
Conclusion
In this paper, the estimation of a restricted parameter space is considered using a class of general shrinkage type estimators under a balance loss function. The class of Baranchik-type shrinkage estimators is considered as a competitor to the well-known James–Stein ones. Since the scalar scale component was unknown, we used another random variable, say, independent from the model under study. Theoretical findings of this paper are further supported by some numerical analyses. It is observed that the Baranchik-type shrinkage estimator is always superior to the natural estimator, regardless of the weight value in balance loss function. The result of this paper can stimulate the research in the direction of the mean estimation in restricted parameter space.
Acknowledgements
The authors would like to thank the editors and reviewers for their valuable comments, which greatly improved the readability of this paper.
Authors’ contributions
The authors have equally made contributions. All authors read and approved the final manuscript.
Competing interests
The authors declare that they have no competing interests. The authors state that no funding source or sponsor has participated in the realization of this work.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Hamid Karamikabir, Email: h_karamikabir@yahoo.com.
Mahmoud Afshari, Email: afshar@pgu.ac.ir.
Mohammad Arashi, Email: m_arashi_stat@yahoo.com.
References
- 1.Baranchik A.J. A family of minimax estimators of the mean of a multivariate normal distribution. Ann. Math. Stat. 1970;41(2):642–645. doi: 10.1214/aoms/1177697104. [DOI] [Google Scholar]
- 2.Cao M.X., He D. Admissibility of linear estimators of the common mean parameter in general linear models under a balanced loss function. J. Multivar. Anal. 2017;153:246–254. doi: 10.1016/j.jmva.2016.10.003. [DOI] [Google Scholar]
- 3.Chang Y.T., Strawderman W.E. Simultaneous estimation of p positive normal means with common unknown variance. Stat. Probab. Lett. 2017;121:83–89. doi: 10.1016/j.spl.2016.10.012. [DOI] [Google Scholar]
- 4.Everitt B., Hothorn T. An Introduction to Applied Multivariate Analysis with R. New York: Springer; 2011. [Google Scholar]
- 5.Fourdrinier D., Marchand E. On Bayes estimators with uniform priors on spheres and their comparative performance with maximum likelihood estimators for estimating bounded multivariate normal means. J. Multivar. Anal. 2010;101:1390–1399. doi: 10.1016/j.jmva.2010.01.011. [DOI] [Google Scholar]
- 6.Fourdrinier D., Ouassou I. Estimation of the mean of a spherically symmetric distribution with constraints on the norm. Can. J. Stat. 2000;28(2):399–415. doi: 10.2307/3315987. [DOI] [Google Scholar]
- 7.Fourdrinier D., Ouassou I., Strawderman W.E. Estimation of a parameter vector when some components are restricted. J. Multivar. Anal. 2003;86:14–27. doi: 10.1016/S0047-259X(02)00045-3. [DOI] [Google Scholar]
- 8.Fourdrinier D., Strawderman W.E. A paradox concerning shrinkage estimators: should a known scale parameter be replaced by an estimated value in the shrinkage factor. J. Multivar. Anal. 1996;59(2):109–140. doi: 10.1006/jmva.1996.0056. [DOI] [Google Scholar]
- 9.Fourdrinier D., Strawderman W.E., Wells M.T. Estimation of a location parameter with restrictions or “vague information” for spherically symmetric distributions. Ann. Inst. Stat. Math. 2006;58:73–92. doi: 10.1007/s10463-005-0001-0. [DOI] [Google Scholar]
- 10.Hoque Z., Wesolowski J., Hossain S. Shrinkage estimator of regression model under asymmetric loss. Commun. Stat., Theory Methods. 2018;47(22):5547–5557. doi: 10.1080/03610926.2017.1397169. [DOI] [Google Scholar]
- 11.Jafari Jozani M., Marchand E., Parsian A. On estimation with weighted balanced-type loss function. Stat. Probab. Lett. 2006;76:733–780. doi: 10.1016/j.spl.2005.10.026. [DOI] [Google Scholar]
- 12.Kortbi O., Marchand E. Truncated linear estimation of abounded multivariate normal mean. J. Stat. Plan. Inference. 2012;142:2607–2618. doi: 10.1016/j.jspi.2012.03.022. [DOI] [Google Scholar]
- 13.Kubokawa T., Marchand E., Strawderman W.E. On improved shrinkage estimators for concave loss. Stat. Probab. Lett. 2015;96:241–246. doi: 10.1016/j.spl.2014.09.024. [DOI] [Google Scholar]
- 14.Lehmann E.L., Casella G. Theory of Point Estimation. 2. NewYork: Springer; 1998. [Google Scholar]
- 15.Marchand E., Strawderman W.E. Estimation in restricted parameter spaces: a review, a festschrift for herman Rubin. Lect. Notes Monogr. Ser. 2004;45:21–44. doi: 10.1214/lnms/1196285377. [DOI] [Google Scholar]
- 16.Marchand E., Strawderman W.E. A unified minimax result for restricted parameter spaces. Bernoulli. 2012;18(2):635–643. doi: 10.3150/10-BEJ336. [DOI] [Google Scholar]
- 17.Peng P., Guikai Hu G., Liang J. All admissible linear predictors in the finite populations with respect to inequality constraints under a balanced loss function. J. Multivar. Anal. 2015;140:113–122. doi: 10.1016/j.jmva.2015.05.003. [DOI] [Google Scholar]
- 18.Silvapulle M.J., Sen P.K. Constrained Statistical Inference Inequality, Order, and Shape Restrictions. New Jersey: Wiley; 2005. [Google Scholar]
- 19.Stein C.M. Estimation of the mean of a multivariate normal distribution. Ann. Stat. 1981;9(6):1135–1151. doi: 10.1214/aos/1176345632. [DOI] [Google Scholar]
- 20.van Eeden C. Restricted Parameter Space Estimation Problems, Admissibility and Minimaxity Properties. New York: Springer; 2006. [Google Scholar]
- 21.Zellner A. Bayesian and non-Bayesian estimation using balanced loss functions. In: Berger J.O., Gupta S.S., editors. Statistical Decision Theory and Methods, Volume V. New York: Springer; 1994. pp. 337–390. [Google Scholar]
- 22.Zinodiny S., Rezaei S., Nadarajah S. Bayes minimax estimation of the multivariate normal mean vector under balanced loss function. Stat. Probab. Lett. 2014;93:96–101. doi: 10.1016/j.spl.2014.06.022. [DOI] [Google Scholar]
- 23.Zinodiny S., Rezaei S., Nadarajah S. Bayes minimax estimation of the mean matrix of matrix-variate normal distribution under balanced loss function. Stat. Probab. Lett. 2017;125:110–120. doi: 10.1016/j.spl.2017.02.003. [DOI] [Google Scholar]


