Poisson Counts, Square Root Transformation and Small Area Estimation: Square Root Transformation

Malay Ghosh; Tamal Ghosh; Masayo Y Hirose

doi:10.1007/s13571-021-00269-8

. 2021 Oct 11;84(2):449–471. doi: 10.1007/s13571-021-00269-8

Poisson Counts, Square Root Transformation and Small Area Estimation

Square Root Transformation

Malay Ghosh ^1,^✉, Tamal Ghosh ¹, Masayo Y Hirose ²

PMCID: PMC8503421 PMID: 34658600

Abstract

The paper intends to serve two objectives. First, it revisits the celebrated Fay-Herriot model, but with homoscedastic known error variance. The motivation comes from an analysis of count data, in the present case, COVID-19 fatality for all counties in Florida. The Poisson model seems appropriate here, as is typical for rare events. An empirical Bayes (EB) approach is taken for estimation. However, unlike the conventional conjugate gamma or the log-normal prior for the Poisson mean, here we make a square root transformation of the original Poisson data, along with square root transformation of the corresponding mean. Proper back transformation is used to infer about the original Poisson means. The square root transformation makes the normal approximation of the transformed data more justifiable with added homoscedasticity. We obtain exact analytical formulas for the bias and mean squared error of the proposed EB estimators. In addition to illustrating our method with the COVID-19 example, we also evaluate performance of our procedure with simulated data as well.

Keywords: COVID19, Empirical Bayes, Fay-Herriot model, Random Effects Model, Stein-type shrinkage estimators.

Introduction

Small area estimation is now a topic of global importance. Methodologies abound, and many of these are finding real-life applications.

Normal theory small area estimation pervades the literature, and the pioneering (Fay and Herriot, 1979) method is most often used in real life applications. The Fay-Herriot model is a normal theory mixed effects area-level model. The model needs to assume known error variances in order to avoid non-identifiability, whereas in reality, these are sample estimates.

The present article deals with the Fay-Herriot model with known constant error variance. The motivation came from an analysis of count data, in particular, COVID-19 data, to find estimates of fatality for all counties in the state of Florida. The Poisson model is used, but we make a square transformation of the original data, and the corresponding mean parameters to attain a closer approximation to normality with added homoscedasticity. This is in contrast to the log-transformation, where also one typically assumes normality of the transformed data. However, transformation of the original data in the log-scale bears the potential hazard of leaving out zero counts, which on most occasions, can affect the conclusion significantly. Further, our approach allows one to develop Stein-type shrinkage estimators for small area means and study their properties analytically.

Variable transformation in the small area context has been addressed before. The logarithmic transformation with the assumption of log-normal distribution of the original data is most commonly used, for example, in modeling income distributions. See for example, Slud and Maiti (2006) and Ghosh et al. (2015). Recently, Hirose et al. (2021) considered an arc-sin transformation of binomial proportions for small area estimation. Sugasawa and Kubokawa (2017) suggested a non-explicit EB estimator and performed an analysis based on the dual power transformation similar to that of Hirose et al. (2021).

The remaining sections are as follows. In Section 2, we introduce the square root transformation, develop Stein-type shrinkage estimators for the transformed data motivated from an empirical Bayes point of view, and then back transform properly to estimate the original parameters of interest. In Sections 3 and 4, we obtain exact expressions for the bias and the mean squared error of our shrinkage estimator. Finally, Section 5 we also obtain an estimator of the mean squared error correct up to order $O (m^{- 1})$ where m is the number of small areas. Section 6 contains an illustration of the proposed method to estimate the number of deaths due to COVID-19 in each county. A simulation study is undertaken in Section 7. Some final remarks are made in Section 8.

Empirical Bayes Estimators

Suppose there are m areas with counts y_i for i-th area. We assume y_i are independently distributed from Poisson(λ_i) for i-th area. We transform $z_{i} = \sqrt{y_{i}}$ and with the usual variance stabilizing square root transformation so that V (z_i) is approximately 1/4. We begin with

z_{i} | 𝜃_{i} \overset{i n d}{\sim} N (𝜃_{i}, 1 / 4) where 𝜃_{i} = \sqrt{λ_{i}} (i = 1, ..., m) .

Following the customary approach, we consider independent $N (x_{i}^{⊤} β, A)$ priors for the 𝜃_i with p-dimensional auxiliary variables x_i and regression parameter $β \in ℝ^{p}$ where m > p + 4. The posterior $𝜃_{i} | z_{i} \overset{i n d}{\sim} N ((1 - B) z_{i} + B x_{i}^{⊤} β, (1 - B) / 4)$ where $B = \frac{1 / 4}{1 / 4 + A}$ . Thus, Bayes estimator of λ_i is

{\hat{λ}}_{i}^{B} = E (λ_{i} | z_{i}) = E (𝜃_{i}^{2} | z_{i}) = (1 - B) / 4 + {(1 - B) z_{i} + B x_{i}^{⊤} β}^{2}

We now turn towards empirical Bayes (EB) estimation of the λ_i. Writing X = (x₁,...,x_m)^⊤, Z = (z₁,...,z_m)^⊤ and $\hat{β} = {(X^{⊤} X)}^{- 1} X^{⊤} Z$ , it follows that marginally $| | Z - X \hat{β} | |^{2} \sim \frac{1}{4 B} χ_{m - p}^{2}$ . Here X is a m × p matrix with rank p. Following Efron and Morris (1973), an EB estimator of B is $\hat{B} = \frac{m - p - 2}{4 | | Z - X \hat{β} | |^{2}}$ . Thus an EB estimator of λ_i is

{\hat{λ}}_{i}^{E B} = (1 - \hat{B}) / 4 + {(1 - \hat{B}) z_{i} + \hat{B} x_{i}^{⊤} \hat{β}}^{2}

For proving our technical results, we find it convenient also to define

{\tilde{λ}}_{i}^{E B} = (1 - B) / 4 + {(1 - B) z_{i} + B x_{i}^{⊤} \hat{β}}^{2},

which is also an EB estimator of λ_i if the shrinkage factor B were known. Also, for notational simplicity we write m₀ = m − p hereafter.

Bias of ${\hat{λ}}^{E B}$

For both bias and mean squared error (MSE) calculations for ${\hat{λ}}_{i}^{E B}$ we need the following lemmas.

Lemma 1.

Let $(\begin{matrix} X \\ Y \end{matrix}) \sim N [(\begin{matrix} μ_{1} \\ μ_{2} \end{matrix}), (\begin{matrix} σ_{1}^{2} & σ_{12} \\ σ_{12} & σ_{2}^{2} \end{matrix})] .$ Then

\begin{array}{rcl} V a r (X^{2}) = 2 σ_{1}^{4} + 4 μ_{1}^{2} σ_{1}^{2} \\ V a r (Y^{2}) = 2 σ_{2}^{4} + 4 μ_{2}^{2} σ_{2}^{2} \\ C o v (X^{2}, Y^{2}) = 2 σ_{12}^{2} + 4 μ_{1} μ_{2} σ_{12} \end{array}

Lemma 2.

Define $s_{i} = x_{i}^{⊤} {(X^{⊤} X)}^{- 1} x_{i}$ , $u_{i 1} = (1 - B) z_{i} + B x_{i}^{⊤} \hat{β}$ and $u_{i 2} = (1 - B) z_{i} + B x_{i}^{⊤} β$ then,

[\begin{matrix} u_{i 1} \\ u_{i 2} \end{matrix}] s i m N [(\begin{matrix} x_{i}^{⊤} β \\ x_{i}^{⊤} β \end{matrix}), (\begin{matrix} A (1 - B) + (2 - B) s_{i} / 4 & (1 - B) (A + s_{i} / 4) \\ (1 - B) (A + s_{i} / 4) & A (1 - B) \end{matrix})] .

Proof 1.

The result follows from the independence of $\hat{β}$ and $z_{i} - x_{i} \hat{β}$ and noting that $\hat{β} \sim N (β, (1 / 4 + A) {(X^{⊤} X)}^{- 1})$ while $z_{i} - x_{i}^{⊤} \hat{β} \sim N (0, (1 / 4 + A) (1 - s_{i}))$ . □

Lemma 3.

(i)
$| | Z - X \hat{β} | |$ is distributed independently of $(\frac{(z_{1} - x_{1}^{⊤} \hat{β})}{| | Z - X \hat{β} | |}, \dots, \frac{(z_{m} - x_{m}^{⊤} \hat{β})}{| | Z - X \hat{β} | |})$ .
(ii)
$E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{k}}{| | Z - X \hat{β} | |^{k}}] = \frac{E {(z_{i} - x_{i}^{⊤} \hat{β})}^{k}}{E (| | Z - X \hat{β} | |^{k})}$ for all positive integers k.
(iii)
$E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{k}}{| | Z - X \hat{β} | |^{l}}] = 0$ if k is an odd positive integer and 0 < l < k.

Proof 2.

Marginally, $Z \sim N (X β, (1 / 4 + A) I_{m})$ . Hence, $(\hat{β}, | | Z - X \hat{β} | |)$ is complete sufficient for (β,A), while $(\frac{(z_{1} - x_{1}^{⊤} \hat{β})}{| | Z - X \hat{β} | |}, \dots, \frac{(z_{m} - x_{m}^{⊤} \hat{β})}{| | Z - X \hat{β} | |})$ is ancillary. This proves (i) by an application of Basu’s Theorem (Basu, 1955).

Now for any positive integer k, and by part (i) in Lemma 3, we have,

\begin{array}{rcl} E {(z_{i} - x_{i}^{⊤} \hat{β})}^{k} & = & E [| | Z - X \hat{β} | |^{k} \frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{k}}{| | Z - X \hat{β} | |^{k}}] \\ = & E [| | Z - X \hat{β} | |^{k}] E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{k}}{| | Z - X \hat{β} | |^{k}}] . \end{array}

This leads to $E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{k}}{| | Z - X \hat{β} | |^{k}}] = \frac{E {(z_{i} - x_{i}^{⊤} \hat{β})}^{k}}{E (| | Z - X \hat{β} | |^{k})}$ .

Next noting that $((z_{1} - x_{1}^{⊤} \hat{β}), \dots, (z_{m} - x_{m}^{⊤} \hat{β})) \overset{d}{=} - ((z_{1} - x_{1}^{⊤} \hat{β}), \dots, (z_{m} - x_{m}^{⊤} \hat{β}))$ it follows that

\begin{array}{rcl} (\frac{(z_{1} - x_{1}^{⊤} \hat{β})}{| | Z - X \hat{β} | |}, \dots, \frac{(z_{m} - x_{m}^{⊤} \hat{β})}{| | Z - X \hat{β} | |}) \overset{d}{=} (\frac{- (z_{1} - x_{1}^{⊤} \hat{β})}{| | Z - X \hat{β} | |}, \dots, \frac{- (z_{m} - x_{m}^{⊤} \hat{β})}{| | Z - X \hat{β} | |}) \end{array}

Therefore, $\frac{(z_{i} - x_{i}^{⊤} \hat{β})}{| | Z - X \hat{β} | |}$ is symmetric random variable around 0 and its all odd moments are 0. Hence,

\begin{array}{rcl} E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{k}}{| | Z - X \hat{β} | |^{l}}] & = & E [| | Z - X \hat{β} | |^{k - l} \frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{k}}{| | Z - X \hat{β} | |^{k}}] \\ = & E ([| | Z - X \hat{β} | |^{k - l}] E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{k}}{| | Z - X \hat{β} | |^{k}}] \end{array}

We get the second equality in the above equations using part (i) of this Lemma. Proof of (iii) is complete after observing k is an odd integer. □

Lemma 4.

(i)
$E (\hat{B}) = B$ if m₀ > 2.
(ii)
$E ({\hat{B}}^{2}) = \frac{m_{0} - 2}{m_{0} - 4} B^{2}$ if m₀ > 4.
(iii)
$E (1 / \hat{B}) = \frac{m_{0}}{(m_{0} - 2) B}$ if m₀ > 2.
(iv)
$E (1 / {\hat{B}}^{2}) = \frac{m_{0} (m_{0} + 2)}{{(m_{0} - 2)}^{2} B^{2}}$ if m₀ > 2.
(v)
$E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{2}}{| | Z - X \hat{β} | |^{2}}] = \frac{(1 - s_{i})}{m_{0}}$ .
(vi)
$E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{4}}{| | Z - X \hat{β} | |^{4}}] = \frac{3 {(1 - s_{i})}^{2}}{m_{0} (m_{0} + 2)}$ .

Proof 3.

The proof (i) to (iv) follows by noting that $\hat{B} = \frac{m_{0} - 2}{4 | | Z - X \hat{β} | |^{2}}$ and $| | Z - X \hat{β} | |^{2} \sim \frac{1}{4 B} χ_{m_{0}}^{2}$ . To prove (v) and (vi) we also need part (ii) of Lemma 3. □

Now we start with calculation of the bias $E ({\hat{λ}}_{i}^{E B} - λ_{i})$ . The following theorem is proved.

Theorem 1.

Suppose $z_{i} | 𝜃_{i} \overset{i n d}{\sim} N (𝜃_{i}, 1 / 4)$ with priors $𝜃_{i} \overset{i n d}{\sim} N (x_{i}^{⊤} β, A)$ and m₀ > 2. Then bias of the EB estimator ${\hat{λ}}_{i}^{E B}$ in Eq. 2 for $λ_{i} = 𝜃_{i}^{2}$ is given by

B i a s ({\hat{λ}}_{i}^{E B}) = E ({\hat{λ}}_{i}^{E B} - λ_{i}) = \frac{2 - B}{4} (s_{i} + \frac{2 (1 - s_{i})}{m_{0}}) .

Proof 4.

We begin with the partition

E ({\hat{λ}}_{i}^{E B} - λ_{i}) = E ({\hat{λ}}_{i}^{B} - λ_{i}) + E ({\tilde{λ}}_{i}^{E B} - {\hat{λ}}_{i}^{B}) + E ({\hat{λ}}_{i}^{E B} - {\tilde{λ}}_{i}^{E B})

By Lemmas 1 and 2,

\begin{array}{rcl} E ({\tilde{λ}}_{i}^{E B} - {\hat{λ}}_{i}^{B}) & = & E (u_{i 1}^{2} - u_{i 2}^{2}) = V a r (u_{i 1}) + {(x_{i} β)}^{2} - V a r (u_{i 2}) - {(x_{i} β)}^{2} \\ = & (2 - B) s_{i} / 4 \end{array}

Noticing $E (λ_{i}) = E (E (λ_{i} | z_{i})) = E ({\hat{λ}}_{i}^{B})$ , we have $E ({\hat{λ}}_{i}^{B} - λ_{i}) = 0$ . It is easy to see that

\begin{array}{rcl} {\hat{λ}}_{i}^{E B} - {\tilde{λ}}_{i}^{E B} & = & (B - \hat{B}) / 4 + 2 x_{i}^{⊤} \hat{β} (B - \hat{B}) (z_{i} - x_{i}^{⊤} \hat{β}) \\ + {({\hat{B}}^{2} - B^{2}) - 2 (\hat{B} - B)} {(z_{i} - x_{i}^{⊤} \hat{β})}^{2} \\ = & A_{1} + A_{2} + A_{3}, \end{array}

where $A_{1} = (B - \hat{B}) / 4, A_{2} = 2 x_{i}^{⊤} \hat{β} (B - \hat{B})$ and $A_{3} = {({\hat{B}}^{2} - B^{2}) - 2 (\hat{B} - B)} {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}$ . The expectation of A₁ is 0 since $\hat{B} = \frac{m_{0} - 2}{4 | | Z - X \hat{β} | |^{2}}$ is unbiased estimator of B. Now,

\begin{array}{rcl} E (A_{2}) & = & E (x_{i}^{⊤} \hat{β}) E [(B - \hat{B}) (z_{i} - x_{i}^{⊤} \hat{β})] \\ = & E (x_{i}^{⊤} \hat{β}) E (B (z_{i} - x_{i}^{⊤} \hat{β})) - E (x_{i}^{⊤} \hat{β}) E [\frac{m_{0} - 2}{4 | | Z - X \hat{β} | |^{2}} (z_{i} - x_{i}^{⊤} \hat{β})] \\ = & 0 \end{array}

The first equality in Eq. 9 is by independence of $\hat{β}$ and $((z_{1} - x_{1}^{⊤} \hat{β}), \dots,$ $(z_{n} - x_{n}^{⊤} \hat{β}))$ . The third equality holds by part (iii) of Lemma 3 since $(z_{i} - x_{i}^{⊤} \hat{β}) \sim N (0, (1 / 4 + A) (1 - s_{i}))$ .

Finally, we simplify A₃. By (i) and (iii) of Lemma 3 and Lemma 4,

\begin{array}{rcl} E ({\hat{λ}}_{i}^{E B} - {\tilde{λ}}_{i}^{E B}) & = & E [{({\hat{B}}^{2} - B^{2}) - 2 (\hat{B} - B)} {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}] \\ = & E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{2}}{| | Z - X \hat{β} | |^{2}}] E [{({\hat{B}}^{2} - B^{2}) - 2 (\hat{B} - B)} \frac{m_{0} - 2}{4 \hat{B}}] \\ = & \frac{(1 - s_{i})}{m_{0}} \frac{(m_{0} - 2)}{4} E [\hat{B} - B^{2} / \hat{B} - 2 + 2 B / \hat{B}] \\ = & \frac{(1 - s_{i})}{m_{0}} \frac{(m_{0} - 2)}{4} [B - B \frac{m_{0}}{m_{0} - 2} - 2 + 2 \frac{m_{0}}{m_{0} - 2}] \\ = & \frac{1 - s_{i}}{2 m_{0}} (2 - B) . \end{array}

The proof of Eq. 5 follows now by combining (6) with Eqs. 7–10. □

Remark 1.

With the usual assumption, $s_{i} = x_{i}^{⊤} {(X^{⊤} X)}^{- 1} x_{i} = O (m^{- 1})$ for large m, the bias of the EB estimator ${\hat{λ}}_{i}^{E B}$ , $E ({\hat{λ}}_{i}^{E B} - λ_{i}) = O (m^{- 1})$ .

Remark 2.

We can estimate the bias in Eq. 5 by replacing the B by $\hat{B}$ , An unbiased estimator of the bias is

\hat{b i a s} = \frac{2 - \hat{B}}{4} (s_{i} + \frac{2 (1 - s_{i})}{m_{0}}) .

Thus, from Eq. 11, the EB estimator has positive bias, and the bias-corrected estimator of λ_i is

{\hat{λ}}_{i}^{C E B} = {\hat{λ}}^{E B} - \hat{b i a s} .

MSE of ${\hat{λ}}^{E B}$

The following theorem provides an exact expression for the MSE of ${\hat{λ}}^{E B}$ .

Theorem 2.

Suppose $z_{i} | 𝜃_{i} \overset{i n d}{\sim} N (𝜃_{i}, 1 / 4)$ with priors $𝜃_{i} \overset{i n d}{\sim} N (x_{i}^{⊤} β, A)$ and m₀ > 4. Then MSE of the EB estimator ${\hat{λ}}_{i}^{E B}$ in Eq. 2 for $λ_{i} = 𝜃_{i}^{2}$ is given by

\begin{array}{rcl} M S E ({\hat{λ}}_{i}^{E B}) & = & E {({\hat{λ}}_{i}^{E B} - λ_{i})}^{2} \\ = & (1 - B) [{(x_{i}^{⊤} β)}^{2} + \frac{(1 - B) (2 - B)}{8 B}] \\ + B s_{i} {(x_{i}^{⊤} β)}^{2} + {(1 - B)}^{2} s_{i} / 4 + s_{i}^{2} (1 / 2 - B / 4 - B^{2} / 16) \\ + \frac{B^{2}}{8 (m_{0} - 4)} + \frac{1 - s_{i}}{2 m_{0}} [4 {(x_{i}^{⊤} β)}^{2} B + s_{i}] \\ + 3 {(1 - s_{i})}^{2} [\frac{2 m_{0}^{2} - 9 m_{0} + 6}{4 m_{0} (m_{0} + 2) (m_{0} - 4)} B^{2} - \frac{B}{m_{0} + 2} + \frac{1}{2 m_{0}}] \\ + \frac{(1 - s_{i}) B}{2 m_{0}} [1 - \frac{B (m_{0} - 3)}{m_{0} - 4}] \\ + \frac{s_{i} (1 - s_{i})}{m_{0}} (2 - 2 B + B^{2} / 4) . \end{array}

The proof of Theorem 2 is given in Appendix A.

Remark 3.

Theorem 2 shows that the MSE of EB estimator ${\hat{λ}}_{i}^{E B}$ , $E {({\hat{λ}}_{i}^{E B} - E (λ_{i}))}^{2} = O (1)$ due to the first term in Eq. 12 for large m.

Estimation of the MSE of ${\hat{λ}}^{E B}$

In this section we estimate the MSE of ${\hat{λ}}_{i}^{E B}$ provided in Theorem 2 up to the order O(m^− 1) for large m. We now assume that $s_{i} = O (m^{- 1})$ . Ignoring the O(m^− 2) terms, we rewrite

\begin{array}{rcl} M S E ({\hat{λ}}_{i}^{E B}) & = & (1 - B) {(x_{i}^{⊤} β)}^{2} + \frac{{(1 - B)}^{2} (2 - B)}{8 B} + s_{i} [B {(x_{i}^{⊤} β)}^{2} + \frac{{(1 - B)}^{2}}{4}] \\ + \frac{B^{2}}{8 m} + \frac{2 B {(x_{i}^{⊤} β)}^{2}}{m} + \frac{3 {(1 - B)}^{2}}{2 m} + \frac{B (1 - B)}{2 m} + O (m^{- 2}) \end{array}

It is easy to see that only first two terms in Eq. 12 do not depend on m and remaining terms are of $O (m^{- 1})$ . Using Lemma 4 we get

(\begin{matrix} E (\frac{1}{\hat{B}} - \frac{2}{m \hat{B}}) = \frac{1}{B} + O (m^{- 2}) \\ E ({\hat{B}}^{2} - \frac{2}{m} {\hat{B}}^{2}) = B^{2} + O (m^{- 2}) \end{matrix}\}

Using Eq. 12, we find

E [\frac{{(1 - \hat{B})}^{2} (2 - \hat{B})}{\hat{B}} - \frac{4}{m \hat{B}} + \frac{2 {\hat{B}}^{2}}{m}] = [\frac{{(1 - B)}^{2} (2 - B)}{B}] + O (m^{- 2}) .

Now by $x_{i}^{⊤} \hat{β} \sim N (x_{i}^{⊤} β, \frac{s_{i}}{4 B})$ , Lemma 4 and the independence of $\hat{β}$ and $\hat{B}$ , we also have

E [(1 - \hat{B}) ({(x_{i}^{T} \hat{β})}^{2} - s_{i} / 4 \hat{B})] = (1 - B) {(x_{i}^{T} β)}^{2} + O (m^{- 2}) .

Since we are ignoring $O (m^{- 2})$ terms in MSE estimation, we can estimate the $O (m^{- 1})$ terms in Eq. 12 simply by replacing B² by ${\hat{B}}^{2}$ and ${(x_{i}^{⊤} β)}^{2}$ by ${(x_{i}^{⊤} \hat{β})}^{2}$ . By Eqs. 13 and 14, we derive estimator of the MSE of ${\hat{λ}}_{i}^{E B}$ in Theorem 3.

Theorem 3.

Assume conditions of Theorem 2. Then $M S E ({\hat{λ}}_{i}^{E B})$ as given in (13) is estimated by

\begin{array}{rcl} \hat{M S E ({\hat{λ}}_{i}^{E B})} & = & (1 - \hat{B}) {(x_{i}^{⊤} \hat{β})}^{2} + [\frac{{(1 - \hat{B})}^{2} (2 - \hat{B})}{8 \hat{B}}] \\ + s_{i} [\hat{B} {(x_{i}^{⊤} \hat{β})}^{2} + \frac{{(1 - \hat{B})}^{2}}{4} - \frac{(1 - \hat{B})}{4 \hat{B}}] + \frac{3 {\hat{B}}^{2}}{8 m} \\ + \frac{2 \hat{B}}{m} {(x_{i}^{⊤} \hat{β})}^{2} + \frac{3}{2 m} {(1 - \hat{B})}^{2} + \frac{\hat{B} (1 - \hat{B})}{2 m} - \frac{1}{2 m \hat{B}} + O_{p} (m^{- 2}) . \end{array}

The next theorem shows that MSE of the bias-corrected estimator ${\hat{λ}}_{i}^{C E B}$ equals $M S E ({\hat{λ}}_{i}^{E B}) + O (m^{- 2})$ . In other words, bias correction does not lead to any significant improvement over $M S E ({\hat{λ}}_{i}^{E B})$ , at least, when calculated up to $O (m^{- 1})$ .

Theorem 4.

Assume conditions of Theorem 2. Then

\begin{array}{rcl} E [{({\hat{λ}}_{i}^{C E B} - λ_{i})}^{2}] = E [{({\hat{λ}}_{i}^{E B} - λ_{i})}^{2}] + O (m^{- 2}) . \end{array}

Proof 5.

Write $\hat{b i a s} = \frac{(2 - \hat{B})}{4} d_{i}$ , where $d_{i} = s_{i} + 2 \frac{1 - s_{i}}{m_{0}} = O (m^{- 1})$ . We begin with

\begin{array}{rcl} E [{({\hat{λ}}_{i}^{C E B} - λ_{i})}^{2}] & = & V [{\hat{λ}}_{i}^{C E B} - λ_{i}], \\ = & V [{\hat{λ}}_{i}^{E B} - λ_{i}] + V [\hat{b i a s}] - 2 C o v ({\hat{λ}}_{i}^{E B} - λ_{i}, \hat{b i a s}) . \end{array}

It is immediate that $V [{\hat{λ}}_{i}^{E B} - λ_{i}] = E [{({\hat{λ}}_{i}^{E B} - λ_{i})}^{2}] - {(b i a s)}^{2}$ , where ${(b i a s)}^{2} = \frac{1}{16} {(2 - B)}^{2} d_{i}^{2} = O (m^{- 2})$ . Next

\begin{array}{rcl} V [\hat{b i a s}] & = & E [{\hat{B}}^{2}] - B^{2} = E [\frac{{(m_{0} - 2)}^{2}}{16 | | Z - X \hat{β} | |^{4}}] - B^{2} \\ = & \frac{{(m_{0} - 2)}^{2}}{16} \frac{16 B^{2}}{(m_{0} - 2) (m_{0} - 4)} - B^{2} = \frac{2 B^{2}}{m_{0} - 4} . \end{array}

Hence,

V [\hat{b i a s}] = \frac{d_{i}^{2}}{8} \frac{B^{2}}{m_{0} - 4} = O (m^{- 3}) .

Finally, $C o v ({\hat{λ}}_{i}^{E B} - λ_{i}, \hat{b i a s}) = C o v ({\hat{λ}}_{i}^{E B}, \hat{b i a s}) - C o v (λ_{i}, \hat{b i a s})$ , But

\begin{array}{rcl} C o v (λ_{i}, \hat{b i a s}) & = & E [λ_{i} \hat{b i a s}] - E [λ_{i}] E [\hat{b i a s}], \\ = & C o v (\hat{λ_{i}^{B}}, \hat{b i a s}), \\ = & C o v ({(1 - B) z_{i} + B x_{i}^{⊤} β}^{2}, \hat{b i a s}), \\ = & \frac{(1 - s_{i})}{8 m_{0}} {(1 - B)}^{2} d_{i} = O (m^{- 2}) . \end{array}

Thus, $C o v ({\hat{λ}}_{i}^{E B} - λ_{i}, \hat{b i a s}) = C o v ({\hat{λ}}_{i}^{E B}, \hat{b i a s}) + O (m^{- 2})$ . Now,

\begin{array}{rcl} C o v ({\hat{λ}}_{i}^{E B}, \hat{b i a s}) & = & C o v (\frac{1 - \hat{B}}{4} + {(1 - \hat{B}) z_{i} + \hat{B} x_{i}^{⊤} \hat{β}}^{2}, \frac{2 - \hat{B}}{4} d_{i}), \\ = & \frac{d_{i}}{16} V [\hat{B}] - \frac{d_{i}}{4} C o v ({(1 - \hat{B}) z_{i} + \hat{B} x_{i}^{⊤} \hat{β}}^{2}, \hat{B}), \\ = & \frac{B^{2}}{8 (m_{0} - 4)} d_{i} - \frac{d_{i}}{4} C o v () {x_{i}^{⊤} \hat{β}}^{2} + 2 (1 - \hat{B}) (z_{i} - x_{i}^{⊤} \hat{β}) x_{i}^{⊤} \hat{β} \\ + {(1 - \hat{B})}^{2} {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}, \hat{B})) . \end{array}

Due to the independence of $\hat{β}$ and $Z - X \hat{β}$ , $C o v ({x_{i}^{⊤} \hat{β}}^{2}, \hat{B}) = 0$ . Next, again invoking the symmetry of Z − Xβ around 0, and $E (z_{i} - x_{i}^{⊤} \hat{β}) = 0$ ,

\begin{array}{rcl} C o v ((1 - \hat{B}) (z_{i} - x_{i}^{⊤} \hat{β}) x_{i}^{⊤} \hat{β}, \hat{B}) & = & E [{(1 - \hat{B}) (z_{i} - x_{i}^{⊤} \hat{β}) x_{i}^{⊤} \hat{β} \hat{B}] \\ - E [{(1 - \hat{B}) (z_{i} - x_{i}^{⊤} \hat{β}) x_{i}^{⊤} \hat{β}] E [\hat{B}], \\ = & 0. \end{array}

Hence Eq. 18 reduces to

\begin{array}{rcl} C o v ({\hat{λ}}_{i}^{E B}, \hat{b i a s}) & = & (- d_{i} / 4) C o v ((1 - 2 \hat{B} + {\hat{B}}^{2}) {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}, \hat{B}) \\ + O (m^{- 2}), \end{array}

\begin{array}{rcl} C o v ({(z_{i} - x_{i}^{⊤} \hat{β})}^{2}, \hat{B}) & = & E [\frac{(m_{0} - 2)}{4} \frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{2}}{| | Z - X \hat{β} | |^{2}}] - E [{(z_{i} - x_{i}^{⊤} \hat{β})}^{2}] B . \\ = & \frac{1 - s_{i}}{2 m_{0}} . \end{array}

\begin{array}{rcl} C o v (\hat{B} {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}, \hat{B}) = C o v (\frac{(m_{0} - 2)}{4} \frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{2}}{| | Z - X \hat{β} | |^{2}}, \frac{(m_{0} - 2)}{4 | | Z - X \hat{β} | |^{2}}) = 0 \end{array}

Since $\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{2}}{| | Z - X \hat{β} | |^{2}}$ is ancillary, and $| | Z - X \hat{β} | |^{2}$ is a function of the complete sufficient statistic $(\hat{β}, Z - X \hat{β})$ , by Basu’s Theorem,

\begin{array}{rcl} C o v ({\hat{B}}^{2} {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}, \hat{B}) & = & E [{\hat{B}}^{3} {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}] - E [{\hat{B}}^{2} {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}] B \\ = & \frac{{(m_{0} - 2)}^{3}}{64} E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{2}}{| | Z - X \hat{β} | |^{2}} \cdot \frac{1}{| | Z - X \hat{β} | |^{4}}] \\ - \frac{{(m_{0} - 2)}^{2}}{16} E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{2}}{| | Z - X \hat{β} | |^{2}} \cdot \frac{1}{| | Z - X \hat{β} | |^{2}}], \\ = & \frac{(m_{0} - 2) (1 - s_{i}) B^{2}}{2 m_{0} (m_{0} - 4)} \end{array}

Hence, from Eqs. 18–21, $C o v ({\hat{λ}}_{i}^{E B}, \hat{b i a s}) = O (m^{- 2})$ . This along with Eqs. 15–17 proves the theorem. □

Data Analysis

In this section, we now deploy our approach on the 2020 COVID-19 pandemic dataset, which is available at https://usafacts.org/visualizations/coronavirus-covid-19-spread-map/usafacts.org. This example is used mainly for illustration. We are using the figures provided as the sampled estimates. Our study shows that the coefficient of determination (R²) does not increase much if we include other demographic variables such as the population size, number of people over age 60, and income in the linear model for the number of deaths regressing on number of confirmed cases. It suggests that the number of confirmed cases is the most crucial variable in estimation of the number of deaths by Coronavirus than the aforementioned demographic variables. We have also studied a few more county-level data sources1 and we found out that adjusted gross income (AGI)2 of the year 2017 is really relevant for estimating the number of deaths. In our model, we have transformed the number of confirmed cases and adjusted gross income (AGI) by taking the square root. All data are aggregated at the county level. We are interested in estimating the counts of death due to Coronavirus for all counties in Florida. Here m = 57 since Florida has 57 counties. From Section 2 we know that $\sqrt{y_{i}} \overset{i n d}{\sim} N (x_{i}^{⊤} β, A + 0.25)$ and we estimate β by ordinary least square method. Based on our analysis, we get $\hat{β} = {(- 0.2786, 0.0917, 0.0003)}^{⊤}$ , the respective estimates for the intercept, number of confirmed cases and AGI. We have summarized our results based on our model in Table 1 and the shrinkage factor $\hat{B} = 0.3777$ . It seems that our model-based approach seems to pull the direct estimates towards some grand average, as one anticipates in a typical EB analysis. Figure 1 shows that the estimates are higher in south east Florida than the rest of the state.

Table 1.

COVID19in Florida as of December 28, 2020

County name	Confirm	AGI¹	Deaths²	EB³	BIAS	Corrected BIAS EB⁴	RMSE⁵	Death rate⁶	Esti-mated rate⁷
Alachua	15473	7.294066	128	128.55	0.023	128.527	12.82	4.76	4.78
Baker	2414	0.554345	36	36.68	0.025	36.655	9.37	12.32	12.56
Bay	11403	4.571565	203	201.86	0.024	201.836	11.48	11.62	11.55
Bradford	2118	0.478624	23	23.75	0.025	23.725	9.32	8.16	8.42
Brevard	20355	17.959964	513	509.65	0.027	509.623	17.42	8.52	8.47
Broward	133480	68.207212	1828	1832.21	0.092	1832.118	45.89	9.36	9.38
Calhoun	1169	0.207429	28	28.70	0.025	28.675	9.16	19.85	20.35
Charlotte	7128	5.233347	231	229.35	0.023	229.327	11.46	12.23	12.14
Citrus	6467	3.447436	260	257.45	0.024	257.426	10.73	17.37	17.20
Clay	10829	6.127044	183	182.44	0.023	182.417	12.05	8.35	8.32
Collier	22004	27.616972	328	329.86	0.048	329.812	21.46	8.52	8.57
Columbia	5837	1.304030	117	116.71	0.025	116.685	9.86	16.32	16.28
DeSoto	2843	0.527165	52	52.52	0.025	52.495	9.38	13.68	13.82
Dixie	1088	0.540883	11	11.73	0.025	11.705	9.28	6.54	6.97
Duval	58491	28.523854	722	720.17	0.030	720.140	24.32	7.54	7.52
Escambia	21507	8.068327	360	357.02	0.024	356.996	13.52	11.31	11.22
Flagler	3565	3.411798	48	48.81	0.024	48.786	10.53	4.17	4.24
Franklin	878	0.283821	4	4.61	0.025	4.585	9.17	3.30	3.80
Gadsden	3836	0.824111	59	59.48	0.025	59.455	9.56	12.92	13.03
Gilchrist	984	0.328444	23	23.73	0.025	23.705	9.19	12.38	12.77
Glades	762	0.232870	11	11.72	0.025	11.695	9.14	7.96	8.49
Gulf	1273	0.321757	27	27.71	0.025	27.685	9.21	19.80	20.32
Hamilton	1224	0.190548	12	12.73	0.025	12.705	9.15	8.32	8.82
Hardee	2057	0.417308	20	20.75	0.025	20.725	9.29	7.42	7.70
Hendry	3235	0.709752	49	49.57	0.025	49.545	9.48	11.66	11.80
Hernando	7074	3.958244	264	261.49	0.024	261.466	10.96	13.61	13.48
Highlands	4822	1.882105	199	197.30	0.025	197.275	10.02	18.73	18.57
Hillsborough	74788	46.284944	1064	1064.35	0.053	1064.297	32.76	7.23	7.23
Holmes	1603	0.280332	24	24.73	0.025	24.705	9.21	12.23	12.61
Indian	6553	7.607307	156	155.99	0.024	155.966	12.36	9.75	9.75
Jackson	4502	0.819253	115	114.65	0.025	114.625	9.60	24.78	24.70
Jefferson	957	0.309690	16	16.74	0.025	16.715	9.18	11.23	11.75
Lafayette	1414	0.106457	21	21.73	0.025	21.705	9.13	24.93	25.80
Lake	14841	9.534133	293	291.32	0.023	291.297	13.67	7.98	7.94
Lee	39332	27.089733	652	649.84	0.031	649.809	22.42	8.46	8.43
Leon	18742	7.918907	173	173.09	0.023	173.067	13.28	5.89	5.90
Levy	1798	0.721669	19	19.77	0.025	19.745	9.39	4.58	4.76
Liberty	716	0.113382	14	14.73	0.025	14.705	9.09	16.76	17.63
Madison	1516	0.324513	33	33.68	0.025	33.655	9.22	17.84	18.21
Manatee	21539	13.804729	412	409.42	0.023	409.397	15.82	10.22	10.15
Marion	17125	7.971880	456	450.97	0.023	450.947	13.20	12.47	12.34
Martin	7656	9.243197	207	206.49	0.024	206.466	13.08	12.86	12.83
Miami-Dade	290363	92.544722	4155	4160.13	0.427	4159.703	67.08	15.29	15.31
Monroe	4168	5.227991	35	36.04	0.024	36.016	11.27	4.72	4.86
Nassau	4521	3.389901	66	66.65	0.024	66.626	10.58	7.45	7.52
Okaloosa	12407	6.812245	224	222.92	0.023	222.897	12.43	10.63	10.58
Okeechobee	2347	0.720267	51	51.54	0.025	51.515	9.42	12.09	12.22
Orange	73691	41.729653	732	735.09	0.044	735.046	30.80	5.25	5.28
Osceola	24512	7.340821	286	284.32	0.026	284.294	13.43	7.61	7.57
Palm	80865	84.441622	1866	1873.59	0.279	1873.311	49.08	12.47	12.52
Pasco	21222	13.333365	360	358.20	0.023	358.177	15.61	6.50	6.47
Pinellas	43480	35.959433	1035	1029.81	0.049	1029.761	26.35	10.62	10.56
Polk	35942	15.491829	767	758.92	0.024	758.896	17.47	10.58	10.47
Putnam	3819	1.217151	71	71.36	0.025	71.335	9.70	9.53	9.58
St.	12481	12.966328	112	113.41	0.025	113.385	14.88	4.23	4.28
St.	13864	7.683491	393	389.01	0.023	388.987	12.87	11.97	11.85
Santa	10575	5.192317	122	122.24	0.023	122.217	11.67	6.62	6.63
Sarasota	17916	20.149336	502	499.32	0.033	499.287	18.14	11.57	11.51
Seminole	17445	16.199289	311	310.49	0.026	310.464	16.51	6.59	6.58
Sumter	4819	4.709867	118	118.11	0.024	118.086	11.11	8.91	8.92
Suwannee	3913	0.701966	94	93.97	0.025	93.945	9.51	21.16	21.16
Taylor	1914	0.362501	25	25.73	0.025	25.705	9.26	11.59	11.93
Union	1416	0.217377	65	65.30	0.025	65.275	9.18	42.66	42.86
Volusia	21365	14.096377	430	427.17	0.023	427.147	15.93	7.77	7.72
Wakulla	2032	0.710564	21	21.77	0.025	21.745	9.40	6.22	6.45
Walton	4707	3.317614	43	43.86	0.024	43.836	10.57	5.81	5.92
Washington	1907	0.397700	30	30.71	0.025	30.685	9.27	11.78	12.06

Open in a new tab

¹ Calculated in millions

² Number of deaths is our direct estimates

³ Empirical Bayes estimator based on our methodology

⁴ Corrected BIAS EB= EB- BIAS

⁵ RMSE= $\sqrt{M S E}$

⁶ Death rate= Deaths/(Population Size)

⁷ E7stimated rate= EB/(Population Size)

This rate is calculated for every 10,000 population

The top two map compares the EB and BIAS corrected EB for the number of deaths due to COVID-19 in each county of Florida, and the bottom two maps show the BIAS and RMSE of the EB

Simulation

In this section, we will measure the performance of our model via a simulation study. The choice of the auxiliary parameters is guided by the case study of the previous section. For illustration purposes, we have considered only one covariate- the number of confirmed cases to estimate the number of deaths due to COVID 19. This data is available for 3,142 counties of the United States. For simulation purposes, we have taken a random sample from this data without any replacement for each choice of m. The number of small areas (counties), m, is set to be 25, 50, 100, 200, 500, or 1000. For each choice of m, we generated data from the model :

\begin{array}{rcl} z_{i} | 𝜃_{i} \overset{i n d}{\sim} N (𝜃_{i}, 1 / 4), 𝜃_{i} = \sqrt{λ_{i}} \overset{i n d}{\sim} N (x_{i}^{⊤} β, A) \end{array}

The design matrix X includes a column of ones and one explanatory variable. To set the value of the parameter for β and A, we first create a linear regression model for the number of deaths on the number of confirmed cases using entire data for 3,142 counties. The estimated value for regression coefficient vector β is (5.281570,0.000272)^⊤ and mean square residuals is 22.75. For simulation we set β = (5.281570,0.000272)^⊤ and A = 22.75 − 0.25 = 22.50, hence shrinkage factor B = 0.011. Due to this variance stabilizing transformation, the shrinkage factor does not change between counties. Now using Eq. 22 we generate λ_i and z_i for all i = (1,…,m). The explanatory variable is again number of confirmed cases which is simulated randomly without replacement from the entire populations of 3,142 counties in the United States.

Here we will compare the true RMSE and estimated RMSE of ${\hat{λ}}_{i}^{E B}$ . We examine our findings in Theorems 2, 3 and 4 based on six different settings for m. Here we will vary the m and only one dataset is generated for each m, the latter taking values 25,50,100,200,500,1000. We have estimated the true RMSE of ${\hat{λ}}_{i}^{C E B}$ based on 1,000 simulated samples since we do not have exact expression for RMSE of ${\hat{λ}}_{i}^{C E B}$ .

Figure 2 substantiates that the approximations given in Theorems 3 and 4 are fairly close to the true RMSE. In addition, they also point out one particular small area where the MSE is significantly higher than rest of the small areas.

figure compares True RMSE: root of Eq. 12, estimated RMSE: root of Eq. 15 and simulated RMSE of bias corrected estimators. The RMSE of bias corrected EB estimators ( ${\hat{λ}}_{i}^{C E B}$ ) is based on 1,000 simulations provided in Fig. 2 and it also verifies the result in Theorem 4

Conclusion

The paper introduces square root transformation of Poisson count data, and attains approximately both normality of the transformed data as well as variance stabilization. In this way, we obtain explicit estimates of bias and MSE for Poisson means. Based on the simulation, it seems that our estimates closely resemble the truth. Data analysis part tells us that estimates are higher on south-east Florida when the model appropriate.

There are many potential extensions. One that immediately comes to mind is consideration of unit level models with corresponding square root transformation. Gonçalves and Ghosh (2021) have addressed this problem using a pure hierarchical Bayesian framework, but an empirical Bayes approach with all its theoretical properties should also be a topic of future investigation. Even under the present framework, one may add a spatial component and using something like a CAR model (see for example, Ghosh et al., 1999). A final interesting problem is to consider an overdispersed Poisson model, i.e. a negative binomial model for count data with variable transformation as in Yu (2009) which also leads to homoscedasticity.

Acknowledgements

The authors are grateful to the editor and anonymous reviewer(s) for their constructive comments and suggestions which greatly improved an earlier version of this article.

Appendix:

Proof of Theorem 2

Proof 6.

In this section we will do calculate the MSE of the EB estimator ${\hat{λ}}_{i}^{E B}$ . We observe that $E (({\hat{λ}}_{i}^{B} - λ_{i}) ({\tilde{λ}}_{i}^{E B} - {\hat{λ}}_{i}^{B})) = 0$ and $E (({\hat{λ}}_{i}^{B} - λ_{i}) ({\hat{λ}}_{i}^{E B} - {\tilde{λ}}_{i}^{E B})) = 0$ since $E (({\hat{λ}}_{i}^{B} - λ_{i}) | z_{i}) = 0$ . Thus we have

\begin{array}{rcl} E {({\hat{λ}}_{i}^{E B} - λ_{i})}^{2} & = & E {({\hat{λ}}_{i}^{B} - λ_{i})}^{2} + E {({\tilde{λ}}_{i}^{E B} - {\hat{λ}}_{i}^{B})}^{2} + E {({\hat{λ}}_{i}^{E B} - {\tilde{λ}}_{i}^{E B})}^{2} \\ + 2 E (({\hat{λ}}_{i}^{E B} - {\tilde{λ}}_{i}^{E B}) ({\tilde{λ}}_{i}^{E B} - {\hat{λ}}_{i}^{B})) \end{array}

5.11

Each term in the right side of Eq. 5.11 will be computed separately. Since $𝜃_{i} | z_{i} \overset{i n d}{\sim} N ((1 - B) z_{i} + B x_{i}^{⊤} β, (1 - B) / 4)$ , by Lemma 1,

\begin{array}{rcl} E {({\hat{λ}}_{i}^{B} - λ_{i})}^{2} & = & E {(E (λ_{i} | z_{i}) - λ_{i})}^{2} = E [E {{(E (λ_{i}) - λ_{i})}^{2} | z_{i}}] = E [V a r (λ_{i} | z_{i})] \\ = & E [{(1 - B)}^{2} / 8 + u_{i 2}^{2} (1 - B)] \\ = & (1 - B) [(1 - B) / 8 + (A (1 - B) + {(x_{i}^{⊤} β)}^{2})] \\ = & (1 - B) [{(x_{i}^{⊤} β)}^{2} + \frac{(1 - B) (2 - B)}{8 B}] . \end{array}

Next we compute the second term in the right side of Eq. 5.11. By Lemmas 2 and Eq. 7,

\begin{array}{rcl} E {({\tilde{λ}}_{i}^{E B} - {\hat{λ}}_{i}^{B})}^{2} & = & V a r ({\hat{λ}}_{i}^{B}) + V a r ({\tilde{λ}}_{i}^{E B}) - 2 C o v ({\hat{λ}}_{i}^{B}, {\tilde{λ}}_{i}^{E B}) + {(E ({\tilde{λ}}_{i}^{E B} - {\hat{λ}}_{i}^{B}))}^{2} \\ = & V a r (u_{i 1}^{2}) + V a r (u_{i 2}^{2}) - 2 C o v (u_{i 1}^{2}, u_{i 2}^{2}) + {(E (u_{i 1}^{2} - u_{i 2}^{2}))}^{2} . \end{array}

5.12

Also,

V a r (u_{i 1}^{2}) = 2 {(A (1 - B) + (2 - B) s_{i} / 4)}^{2} + (4 A (1 - B) + (2 - B) s_{i}) {(x_{i}^{⊤} β)}^{2}

5.13

V a r (u_{i 2}^{2}) = 2 {(A (1 - B))}^{2} + 4 A (1 - B) {(x_{i}^{⊤} β)}^{2}

5.14

C o v (u_{i 1}^{2}, u_{i 2}^{2}) = 2 {(1 - B)}^{2} {(A + s_{i} / 4)}^{2} + 4 {(x_{i}^{⊤} β)}^{2} (1 - B) (A + s_{i} / 4)

5.15

{[E (u_{i 1}^{2} - u_{i 2}^{2})]}^{2} = {(2 - B)}^{2} s_{i}^{2} / 16

5.16

By Eqs. 5.13–5.16, 5.12 simplifies to

E {({\tilde{λ}}_{i}^{E B} - {\hat{λ}}_{i}^{B})}^{2} = B s_{i} {(x_{i}^{⊤} β)}^{2} + {(1 - B)}^{2} s_{i} / 4 + s_{i}^{2} (1 / 2 - B / 4 - B^{2} / 16)

5.17

Now we evaluate the last expression in right side of Eq. 5.11.

\begin{array}{rcl} {\tilde{λ}}_{i}^{E B} - {\hat{λ}}_{i}^{B} & = & {((1 - B) z_{i} + B x_{i}^{⊤} \hat{β})}^{2} - {((1 - B) z_{i} + B x_{i}^{⊤} β)}^{2} \\ = & B^{2} [{(x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β)}^{2} + 2 x_{i}^{⊤} β (x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β)] \\ + 2 B (1 - B) (x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β) x_{i}^{⊤} \hat{β} \\ + & 2 B (1 - B) (x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β) (z_{i} - x_{i}^{⊤} \hat{β}) \\ = & B_{1} + B_{2} + B_{3} \end{array}

5.18

We define $B_{1} = B^{2} [{(x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β)}^{2} + 2 x_{i}^{⊤} β (x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β)], B_{2} = 2 B (1 - B) (x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β) x_{i}^{⊤} \hat{β}, B_{3} = 2 B (1 - B) (x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β) (z_{i} - x_{i}^{⊤} \hat{β})$ . The expressions A₁,A₂ and A₃ in Eq. 8 are functions of residuals $((z_{i} - x_{i}^{⊤} \hat{β}) ..., \dots, (z_{i} - x_{i}^{⊤} \hat{β}))$ and the expressions B₁ and B₂ in Eq. 5.18 are functions of $\hat{β}$ . Therefore, (A₁,A₂,A₃) is independent of (B₁,B₂) and their covariances are 0. The expectation of B₃ is 0 since $(x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β) is independent of (z_{i} - x_{i}^{⊤} \hat{β})$ . Also,

\begin{array}{rcl} C o v (A_{1}, B_{3}) & = & \frac{2 B (1 - B)}{4} E ((B - \hat{B}) (z_{i} - x_{i}^{⊤} \hat{β})) E (x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β) = 0. \\ C o v (A_{3}, B_{3}) & = & 2 B (1 - B) E [({\hat{B}}^{2} - B^{2} - 2 (\hat{B} - B)) {(z_{i} - x_{i}^{⊤} \hat{β})}^{3}] \\ \times E (x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β) \\ = & 0. \end{array}

Hence, again using part (v) of Lemma 4, Eqs. 8 and 5.18, we have

\begin{array}{rcl} C o v ({\hat{λ}}_{i}^{E B} - {\tilde{λ}}_{i}^{E B}, {\tilde{λ}}_{i}^{E B} - {\hat{λ}}_{i}^{B}) & = & C o v (A_{2}, B_{3}) \\ = & E (2 x_{i}^{⊤} \hat{β} (B - \hat{B}) (z_{n} - x_{n}^{⊤} \hat{β}) 2 B (1 - B) \\ \times (x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β) (z_{i} - x_{i}^{⊤} \hat{β})) \\ = & 4 B (1 - B) E ((B - \hat{B}) {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}) \\ \times E (x_{i}^{⊤} \hat{β} (x_{i}^{⊤} \hat{β} - x_{i}^{⊤} β)) \\ = & 4 B (1 - B) \frac{(1 - s_{i})}{2 m_{0}} V a r (x_{i}^{⊤} \hat{β}) \\ = & 4 B (1 - B) \frac{(1 - s_{i})}{2 m_{0}} (1 / 4 + A) s_{i} \\ = & (1 - B) \frac{s_{i} (1 - s_{i})}{2 m_{0}} . \end{array}

Hence, from Eqs. 7 and 10,

\begin{array}{rcl} E (({\hat{λ}}_{i}^{E B} - {\tilde{λ}}_{i}^{E B}) ({\tilde{λ}}_{i}^{E B} - {\hat{λ}}_{i}^{B})) & = & (1 - B) \frac{s_{i} (1 - s_{i})}{2 m_{0}} \\ + \frac{2 - B}{2} \frac{(1 - s_{i})}{m_{0}} \frac{2 - B}{4} s_{i} \\ = & \frac{s_{i} (1 - s_{i})}{2 m_{0}} (2 - 2 B + B^{2} / 4) . \end{array}

Next we compute the remaining third term in the right side of Eq. 5.11.

\begin{array}{rcl} E {({\hat{λ}}_{i}^{E B} - {\tilde{λ}}_{i}^{E B})}^{2} & = & E {(A_{1} + A_{2} + A_{3})}^{2} \\ = & E {(B - \hat{B})}^{2} / 16 + 4 E {(x_{i}^{⊤} \hat{β} (B - \hat{B}) (z_{i} - x_{i}^{⊤} \hat{β}))}^{2} \\ + E [{({\hat{B}}^{2} - B^{2}) - 2 (\hat{B} - B)}^{2} {(z_{i} - x_{i}^{⊤} \hat{β})}^{4}] \\ + [E (B - \hat{B}) {({\hat{B}}^{2} - B^{2}) - 2 (\hat{B} - B)} {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}] / 2. \end{array}

5.19

In the above calculation, the cross terms E(A₁A₂) and E(A₂A₃) in Eq. 5.19 vanish by part (iii) of Lemma 3. Again, by part (ii) of Lemma 4, we have,

E {(B - \hat{B})}^{2} = B^{2} - 2 B E (\hat{B}) + E ({\hat{B}}^{2}) = - B^{2} + \frac{m_{0} - 2}{m_{0} - 4} B^{2} = \frac{2 B^{2}}{(m_{0} - 4)} .

5.20

Again, applying Lemmas 3 and 4,

\begin{array}{rcl} E {[x_{i}^{⊤} \hat{β} (B - \hat{B}) (z_{i} - x_{i}^{⊤} \hat{β})]}^{2} & = & E [{(x_{i}^{⊤} \hat{β})}^{2}] E [{(B - \hat{B})}^{2} {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}] \\ = & [{(x_{i}^{⊤} β)}^{2} + \frac{s_{i}}{4 B}] \\ \times E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{2}}{| | Z - X \hat{β} | |^{2}} {(B - \hat{B})}^{2} \frac{m_{0} - 2}{4 \hat{B}}] \\ = & [{(x_{i}^{⊤} β)}^{2} + \frac{s_{i}}{4 B}] \frac{(1 - s_{i}) (m_{0} - 2)}{4 m_{0}} \\ \times E [B^{2} / \hat{B} - 2 B + \hat{B}] \\ = & [{(x_{i}^{⊤} β)}^{2} + \frac{s_{i}}{4 B}] \frac{(1 - s_{i}) (m_{0} - 2)}{4 m_{0}} \frac{2 B}{m_{0} - 2} \\ = & \frac{1 - s_{i}}{2 m_{0}} [{(x_{i}^{⊤} β)}^{2} B + \frac{s_{i}}{4}] . \end{array}

Now we calculate the third term, $1 / 2 E [{({\hat{B}}^{2} - B^{2}) - 2 (\hat{B} - B)}^{2} {(z_{i} - x_{i}^{⊤} \hat{β})}^{4}]$ in Eq. 5.19. By Lemma 3 and 4 and recalling $| | Z - X \hat{β} | |^{2} = \frac{m_{0} - 2}{4 \hat{B}}$ , we obtain,

\begin{array}{rcl} E [{({\hat{B}}^{2} - B^{2}) - 2 (\hat{B} - B)}^{2} {(z_{i} - x_{i}^{⊤} \hat{β})}^{4}] \\ = & E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{4}}{| | Z - X \hat{β} | |^{4}}] E [{({\hat{B}}^{2} - B^{2}) - 2 (\hat{B} - B)}^{2}} \frac{{(m_{0} - 2)}^{2}}{16 {\hat{B}}^{2}}] \\ = & \frac{3 {(1 - s_{i})}^{2} {(m_{0} - 2)}^{2}}{16 m_{0} (m_{0} + 2)} E [({\hat{B}}^{2} - 2 B^{2} + B^{4} / {\hat{B}}^{2}) - 4 (\hat{B} - B^{2} / \hat{B} - B + B^{3} / {\hat{B}}^{2}) \\ + 4 (1 - 2 B / \hat{B} + B^{2} / {\hat{B}}^{2})] \\ = & \frac{3 {(1 - s_{i})}^{2} {(m_{0} - 2)}^{2}}{16 m_{0} (m_{0} + 2)} [) B^{2} (\frac{m_{0} - 2}{m_{0} - 4} - 2 + \frac{m_{0} (m_{0} + 2)}{{(m_{0} - 2)}^{2}}) \\ + 4 B (\frac{m_{0}}{m_{0} - 2} - \frac{m_{0} (m_{0} + 2)}{{(m_{0} - 2)}^{2}}) + 4 (1 - 2 \frac{m_{0}}{m_{0} - 2} + \frac{m_{0} (m_{0} + 2)}{{(m_{0} - 2)}^{2}})]) \\ = & \frac{3 {(1 - s_{i})}^{2} {(m_{0} - 2)}^{2}}{16 m_{0} (m_{0} + 2)} [) B^{2} \frac{4 (2 m_{0}^{2} - 9 m_{0} + 6)}{(m_{0} - 4) {(m_{0} - 2)}^{2}} \\ + 4 B \frac{- 4 m_{0}}{{(m_{0} - 2)}^{2}} + 4 \frac{2 (m_{0} + 2)}{{(m_{0} - 2)}^{2}})]) \\ = & 3 {(1 - s_{i})}^{2} [\frac{2 m_{0}^{2} - 9 m_{0} + 6}{4 m_{0} (m_{0} + 2) (m_{0} - 4)} B^{2} - \frac{B}{m_{0} + 2} + \frac{1}{2 m_{0}}] . \end{array}

5.21

Finally, once again by Lemmas 3 and 4, and recalling $| | Z - X \hat{β} | |^{2} = \frac{m_{0} - 2}{4 \hat{B}}$ , we get (5.19),

\begin{array}{rcl} E [(B - \hat{B}) {({\hat{B}}^{2} - B^{2}) - 2 (\hat{B} - B)} {(z_{i} - x_{i}^{⊤} \hat{β})}^{2}] \\ = E [\frac{{(z_{i} - x_{i}^{⊤} \hat{β})}^{2}}{| | Z - X \hat{β} | |^{2}}] [E (B - \hat{B}) {({\hat{B}}^{2} - B^{2}) - 2 (\hat{B} - B)} \frac{m_{0} - 2}{4 \hat{B}}] \\ = \frac{(1 - s_{i}) (m_{0} - 2)}{4 m_{0}} E [B \hat{B} - {\hat{B}}^{2} - B^{3} / \hat{B} + B^{2} - 2 (B - \hat{B}) - 2 B + \frac{2 B^{2}}{\hat{B}}] \\ = \frac{(1 - s_{i}) (m_{0} - 2)}{4 m_{0}} [B^{2} - B^{2} \frac{m_{0} - 2}{m_{0} - 4} - \frac{m_{0} B^{2}}{m_{0} - 2} + B^{2} - 2 B + \frac{m_{0} 2 B}{m_{0} - 2}] \\ = \frac{(1 - s_{i}) (m_{0} - 2)}{4 m_{0}} [\frac{4 B}{m_{0} - 2} - \frac{4 B^{2} (m_{0} - 3)}{(m_{0} - 2) (m_{0} - 4)}] \\ = \frac{(1 - s_{i}) B}{m_{0}} [1 - \frac{B (m_{0} - 3)}{m_{0} - 4}] . \end{array}

5.22

From Eqs. 5.19–5.22, we obtain,

\begin{array}{rcl} E {({\hat{λ}}_{i}^{E B} - {\tilde{λ}}_{i}^{E B})}^{2} & = & \frac{B^{2}}{8 (m_{0} - 4)} + \frac{1 - s_{i}}{2 m_{0}} [4 {(x_{i}^{⊤} β)}^{2} B + s_{i}] \\ + 3 {(1 - s_{i})}^{2} [\frac{2 m_{0}^{2} - 9 m_{0} + 6}{4 m_{0} (m_{0} + 2) (m_{0} - 4)} B^{2} - \frac{B}{m_{0} + 2} + \frac{1}{2 m_{0}}] \\ + \frac{(1 - s_{i}) B}{2 m_{0}} [1 - \frac{B (m_{0} - 3)}{m_{0} - 4}] . \end{array}

Theorem 2 follows from Eqs. 5.11, 5.12, 5.17, 5.19 and 5.23. □

Funding

The third author’s research was partially supported by JSPS KAKENHI grant number 18K12758.

Compliance with Ethical Standards

The Author(s) declare(s) that there is no conflict of interest that are relevant to the content of this article.

Footnotes

US Census Bureau and Statistics of Income Division (SOI) of the IRS

https://www.irs.gov/pub/irs-soi/17incyallagi.csv

The third author’s research was partially supported by JSPS KAKENHI grant number 18K12758.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Malay Ghosh, Email: ghoshm@ufl.edu.

Tamal Ghosh, Email: tamalg@ufl.edu.

Masayo Y. Hirose, Email: masayo@imi.kyushu-u.ac.jp

References

Basu D. On statistics independent of a complete sufficient statistic. Sankhyā,: The Indian Journal of Statistics (1933-1960) 1955;15:377–380. [Google Scholar]
Efron B, Morris C. Stein’s estimation rule and its competitors—an empirical bayes approach. Journal of the American Statistical Association. 1973;68:117–130. [Google Scholar]
Fay RE, Herriot RA. Estimates of income for small places: an application of james-stein procedures to census data. Journal of the American Statistical Association. 1979;74:269–277. doi: 10.1080/01621459.1979.10482505. [DOI] [Google Scholar]
Ghosh M, Natarajan K, Waller LA, Kim D. Hierarchical bayes glms for the analysis of spatial data: an application to disease mapping. Journal of Statistical Planning and Inference. 1999;75:305–318. doi: 10.1016/S0378-3758(98)00150-5. [DOI] [Google Scholar]
Ghosh M, Kubokawa T, Kawakubo Y. Benchmarked empirical bayes methods in multiplicative area-level models with risk evaluation. Biometrika. 2015;102:647–659. doi: 10.1093/biomet/asv010. [DOI] [Google Scholar]
Gonçalves, K. C. and Ghosh, M. (2021). Unit level model for small area estimation with count data under square root transformation. Brazilian Journal of Statistics and Probability In Press.
Hirose, MY, Ghosh, M and Ghosh, T (2021). Arc-sin transformation for binomial sample proportions in small area estimation. Statistica Sinica Preprint No: SS-2020-0446. 10.5705/ss.202020.0446
Slud EV, Maiti T. Mean-squared error estimation in transformed fay–herriot models. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2006;68(2):239–257. doi: 10.1111/j.1467-9868.2006.00542.x. [DOI] [Google Scholar]
Sugasawa S, Kubokawa T. Transforming response values in small area prediction. Computational Statistics & Data Analysis. 2017;114:47–60. doi: 10.1016/j.csda.2017.03.017. [DOI] [Google Scholar]
Yu G. Variance stabilizing transformations of poisson, binomial and negative binomial distributions. Statistics & Probability Letters. 2009;79(14):1621–1629. doi: 10.1016/j.spl.2009.04.010. [DOI] [Google Scholar]

[CR1] Basu D. On statistics independent of a complete sufficient statistic. Sankhyā,: The Indian Journal of Statistics (1933-1960) 1955;15:377–380. [Google Scholar]

[CR2] Efron B, Morris C. Stein’s estimation rule and its competitors—an empirical bayes approach. Journal of the American Statistical Association. 1973;68:117–130. [Google Scholar]

[CR3] Fay RE, Herriot RA. Estimates of income for small places: an application of james-stein procedures to census data. Journal of the American Statistical Association. 1979;74:269–277. doi: 10.1080/01621459.1979.10482505. [DOI] [Google Scholar]

[CR4] Ghosh M, Natarajan K, Waller LA, Kim D. Hierarchical bayes glms for the analysis of spatial data: an application to disease mapping. Journal of Statistical Planning and Inference. 1999;75:305–318. doi: 10.1016/S0378-3758(98)00150-5. [DOI] [Google Scholar]

[CR5] Ghosh M, Kubokawa T, Kawakubo Y. Benchmarked empirical bayes methods in multiplicative area-level models with risk evaluation. Biometrika. 2015;102:647–659. doi: 10.1093/biomet/asv010. [DOI] [Google Scholar]

[CR6] Gonçalves, K. C. and Ghosh, M. (2021). Unit level model for small area estimation with count data under square root transformation. Brazilian Journal of Statistics and Probability In Press.

[CR7] Hirose, MY, Ghosh, M and Ghosh, T (2021). Arc-sin transformation for binomial sample proportions in small area estimation. Statistica Sinica Preprint No: SS-2020-0446. 10.5705/ss.202020.0446

[CR8] Slud EV, Maiti T. Mean-squared error estimation in transformed fay–herriot models. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2006;68(2):239–257. doi: 10.1111/j.1467-9868.2006.00542.x. [DOI] [Google Scholar]

[CR9] Sugasawa S, Kubokawa T. Transforming response values in small area prediction. Computational Statistics & Data Analysis. 2017;114:47–60. doi: 10.1016/j.csda.2017.03.017. [DOI] [Google Scholar]

[CR10] Yu G. Variance stabilizing transformations of poisson, binomial and negative binomial distributions. Statistics & Probability Letters. 2009;79(14):1621–1629. doi: 10.1016/j.spl.2009.04.010. [DOI] [Google Scholar]

PERMALINK