Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2021 Oct 11;84(2):449–471. doi: 10.1007/s13571-021-00269-8

Poisson Counts, Square Root Transformation and Small Area Estimation

Square Root Transformation

Malay Ghosh 1,, Tamal Ghosh 1, Masayo Y Hirose 2
PMCID: PMC8503421  PMID: 34658600

Abstract

The paper intends to serve two objectives. First, it revisits the celebrated Fay-Herriot model, but with homoscedastic known error variance. The motivation comes from an analysis of count data, in the present case, COVID-19 fatality for all counties in Florida. The Poisson model seems appropriate here, as is typical for rare events. An empirical Bayes (EB) approach is taken for estimation. However, unlike the conventional conjugate gamma or the log-normal prior for the Poisson mean, here we make a square root transformation of the original Poisson data, along with square root transformation of the corresponding mean. Proper back transformation is used to infer about the original Poisson means. The square root transformation makes the normal approximation of the transformed data more justifiable with added homoscedasticity. We obtain exact analytical formulas for the bias and mean squared error of the proposed EB estimators. In addition to illustrating our method with the COVID-19 example, we also evaluate performance of our procedure with simulated data as well.

Keywords: COVID19, Empirical Bayes, Fay-Herriot model, Random Effects Model, Stein-type shrinkage estimators.

Introduction

Small area estimation is now a topic of global importance. Methodologies abound, and many of these are finding real-life applications.

Normal theory small area estimation pervades the literature, and the pioneering (Fay and Herriot, 1979) method is most often used in real life applications. The Fay-Herriot model is a normal theory mixed effects area-level model. The model needs to assume known error variances in order to avoid non-identifiability, whereas in reality, these are sample estimates.

The present article deals with the Fay-Herriot model with known constant error variance. The motivation came from an analysis of count data, in particular, COVID-19 data, to find estimates of fatality for all counties in the state of Florida. The Poisson model is used, but we make a square transformation of the original data, and the corresponding mean parameters to attain a closer approximation to normality with added homoscedasticity. This is in contrast to the log-transformation, where also one typically assumes normality of the transformed data. However, transformation of the original data in the log-scale bears the potential hazard of leaving out zero counts, which on most occasions, can affect the conclusion significantly. Further, our approach allows one to develop Stein-type shrinkage estimators for small area means and study their properties analytically.

Variable transformation in the small area context has been addressed before. The logarithmic transformation with the assumption of log-normal distribution of the original data is most commonly used, for example, in modeling income distributions. See for example, Slud and Maiti (2006) and Ghosh et al. (2015). Recently, Hirose et al. (2021) considered an arc-sin transformation of binomial proportions for small area estimation. Sugasawa and Kubokawa (2017) suggested a non-explicit EB estimator and performed an analysis based on the dual power transformation similar to that of Hirose et al. (2021).

The remaining sections are as follows. In Section 2, we introduce the square root transformation, develop Stein-type shrinkage estimators for the transformed data motivated from an empirical Bayes point of view, and then back transform properly to estimate the original parameters of interest. In Sections 3 and 4, we obtain exact expressions for the bias and the mean squared error of our shrinkage estimator. Finally, Section 5 we also obtain an estimator of the mean squared error correct up to order O(m1) where m is the number of small areas. Section 6 contains an illustration of the proposed method to estimate the number of deaths due to COVID-19 in each county. A simulation study is undertaken in Section 7. Some final remarks are made in Section 8.

Empirical Bayes Estimators

Suppose there are m areas with counts yi for i-th area. We assume yi are independently distributed from Poisson(λi) for i-th area. We transform zi=yi and with the usual variance stabilizing square root transformation so that V (zi) is approximately 1/4. We begin with

zi|𝜃iindN(𝜃i,1/4)where𝜃i=λi(i=1,...,m).

Following the customary approach, we consider independent N(xiβ,A) priors for the 𝜃i with p-dimensional auxiliary variables xi and regression parameter βp where m > p + 4. The posterior 𝜃i|ziindN((1B)zi+Bxiβ,(1B)/4) where B=1/41/4+A. Thus, Bayes estimator of λi is

λ^iB=E(λi|zi)=E(𝜃i2|zi)=(1B)/4+{(1B)zi+Bxiβ}2 1

We now turn towards empirical Bayes (EB) estimation of the λi. Writing X = (x1,...,xm), Z = (z1,...,zm) and β^=(XX)1XZ, it follows that marginally ||ZXβ^||214Bχmp2. Here X is a m × p matrix with rank p. Following Efron and Morris (1973), an EB estimator of B is B^=mp24||ZXβ^||2. Thus an EB estimator of λi is

λ^iEB=(1B^)/4+{(1B^)zi+B^xiβ^}2 2

For proving our technical results, we find it convenient also to define

λ~iEB=(1B)/4+{(1B)zi+Bxiβ^}2, 3

which is also an EB estimator of λi if the shrinkage factor B were known. Also, for notational simplicity we write m0 = mp hereafter.

Bias of λ^EB

For both bias and mean squared error (MSE) calculations for λ^iEB we need the following lemmas.

Lemma 1.

Let XYNμ1μ2,σ12σ12σ12σ22. Then

Var(X2)=2σ14+4μ12σ12Var(Y2)=2σ24+4μ22σ22Cov(X2,Y2)=2σ122+4μ1μ2σ12

Lemma 2.

Define si=xi(XX)1xi, ui1=(1B)zi+Bxiβ^ and ui2=(1B)zi+Bxiβ then,

ui1ui2simNxiβxiβ,A(1B)+(2B)si/4(1B)(A+si/4)(1B)(A+si/4)A(1B). 4

Proof 1.

The result follows from the independence of β^ and zixiβ^ and noting that β^N(β,(1/4+A)(XX)1) while zixiβ^N(0,(1/4+A)(1si)). □

Lemma 3.

  • (i)

    ||ZXβ^|| is distributed independently of (z1x1β^)||ZXβ^||,,(zmxmβ^)||ZXβ^||.

  • (ii)

    E(zixiβ^)k||ZXβ^||k=E(zixiβ^)kE(||ZXβ^||k) for all positive integers k.

  • (iii)

    E(zixiβ^)k||ZXβ^||l=0 if k is an odd positive integer and 0 < l < k.

Proof 2.

Marginally, ZN(Xβ,(1/4+A)Im). Hence, (β^,||ZXβ^||) is complete sufficient for (β,A), while (z1x1β^)||ZXβ^||,,(zmxmβ^)||ZXβ^|| is ancillary. This proves (i) by an application of Basu’s Theorem (Basu, 1955).

Now for any positive integer k, and by part (i) in Lemma 3, we have,

E(zixiβ^)k=E||ZXβ^||k(zixiβ^)k||ZXβ^||k=E||ZXβ^||kE(zixiβ^)k||ZXβ^||k.

This leads to E(zixiβ^)k||ZXβ^||k=E(zixiβ^)kE(||ZXβ^||k).

Next noting that ((z1x1β^),,(zmxmβ^))=d((z1x1β^),,(zmxmβ^)) it follows that

(z1x1β^)||ZXβ^||,,(zmxmβ^)||ZXβ^||=d(z1x1β^)||ZXβ^||,,(zmxmβ^)||ZXβ^||

Therefore, (zixiβ^)||ZXβ^|| is symmetric random variable around 0 and its all odd moments are 0. Hence,

E(zixiβ^)k||ZXβ^||l=E||ZXβ^||kl(zixiβ^)k||ZXβ^||k=E(||ZXβ^||klE(zixiβ^)k||ZXβ^||k

We get the second equality in the above equations using part (i) of this Lemma. Proof of (iii) is complete after observing k is an odd integer. □

Lemma 4.

  • (i)

    E(B^)=B if m0 > 2.

  • (ii)

    E(B^2)=m02m04B2 if m0 > 4.

  • (iii)

    E(1/B^)=m0(m02)B if m0 > 2.

  • (iv)

    E(1/B^2)=m0(m0+2)(m02)2B2 if m0 > 2.

  • (v)

    E(zixiβ^)2||ZXβ^||2=(1si)m0.

  • (vi)

    E(zixiβ^)4||ZXβ^||4=3(1si)2m0(m0+2).

Proof 3.

The proof (i) to (iv) follows by noting that B^=m024||ZXβ^||2 and ||ZXβ^||214Bχm02. To prove (v) and (vi) we also need part (ii) of Lemma 3. □

Now we start with calculation of the bias E(λ^iEBλi). The following theorem is proved.

Theorem 1.

Suppose zi|𝜃iindN(𝜃i,1/4) with priors 𝜃iindN(xiβ,A) and m0 > 2. Then bias of the EB estimator λ^iEB in Eq. 2 for λi=𝜃i2 is given by

Bias(λ^iEB)=E(λ^iEBλi)=2B4si+2(1si)m0. 5

Proof 4.

We begin with the partition

E(λ^iEBλi)=E(λ^iBλi)+E(λ~iEBλ^iB)+E(λ^iEBλ~iEB) 6

By Lemmas 1 and 2,

E(λ~iEBλ^iB)=E(ui12ui22)=Var(ui1)+(xiβ)2Var(ui2)(xiβ)2=(2B)si/4 7

Noticing E(λi)=E(E(λi|zi))=E(λ^iB), we have E(λ^iBλi)=0. It is easy to see that

λ^iEBλ~iEB=(BB^)/4+2xiβ^(BB^)(zixiβ^)+{(B^2B2)2(B^B)}(zixiβ^)2=A1+A2+A3, 8

where A1=(BB^)/4,A2=2xiβ^(BB^) and A3={(B^2B2)2(B^B)}(zixiβ^)2. The expectation of A1 is 0 since B^=m024||ZXβ^||2 is unbiased estimator of B. Now,

E(A2)=E(xiβ^)E(BB^)(zixiβ^)=E(xiβ^)E(B(zixiβ^))E(xiβ^)Em024||ZXβ^||2(zixiβ^)=0 9

The first equality in Eq. 9 is by independence of β^ and ((z1x1β^),, (znxnβ^)). The third equality holds by part (iii) of Lemma 3 since (zixiβ^)N(0,(1/4+A)(1si)).

Finally, we simplify A3. By (i) and (iii) of Lemma 3 and Lemma 4,

E(λ^iEBλ~iEB)=E[{(B^2B2)2(B^B)}(zixiβ^)2]=E(zixiβ^)2||ZXβ^||2E{(B^2B2)2(B^B)}m024B^=(1si)m0(m02)4E[B^B2/B^2+2B/B^]=(1si)m0(m02)4BBm0m022+2m0m02=1si2m0(2B). 10

The proof of Eq. 5 follows now by combining (6) with Eqs. 710. □

Remark 1.

With the usual assumption, si=xi(XX)1xi=O(m1) for large m, the bias of the EB estimator λ^iEB, E(λ^iEBλi)=O(m1).

Remark 2.

We can estimate the bias in Eq. 5 by replacing the B by B^, An unbiased estimator of the bias is

bias^=2B^4si+2(1si)m0. 11

Thus, from Eq. 11, the EB estimator has positive bias, and the bias-corrected estimator of λi is

λ^iCEB=λ^EBbias^.

MSE of λ^EB

The following theorem provides an exact expression for the MSE of λ^EB.

Theorem 2.

Suppose zi|𝜃iindN(𝜃i,1/4) with priors 𝜃iindN(xiβ,A) and m0 > 4. Then MSE of the EB estimator λ^iEB in Eq. 2 for λi=𝜃i2 is given by

MSE(λ^iEB)=E(λ^iEBλi)2=(1B)(xiβ)2+(1B)(2B)8B+Bsi(xiβ)2+(1B)2si/4+si2(1/2B/4B2/16)+B28(m04)+1si2m04(xiβ)2B+si+3(1si)22m029m0+64m0(m0+2)(m04)B2Bm0+2+12m0+(1si)B2m01B(m03)m04+si(1si)m0(22B+B2/4).

The proof of Theorem 2 is given in Appendix A.

Remark 3.

Theorem 2 shows that the MSE of EB estimator λ^iEB, E(λ^iEBE(λi))2=O(1) due to the first term in Eq. 12 for large m.

Estimation of the MSE of λ^EB

In this section we estimate the MSE of λ^iEB provided in Theorem 2 up to the order O(m− 1) for large m. We now assume that si=O(m1). Ignoring the O(m− 2) terms, we rewrite

MSE(λ^iEB)=(1B)(xiβ)2+(1B)2(2B)8B+siB(xiβ)2+(1B)24+B28m+2B(xiβ)2m+3(1B)22m+B(1B)2m+O(m2)

It is easy to see that only first two terms in Eq. 12 do not depend on m and remaining terms are of O(m1). Using Lemma 4 we get

E1B^2mB^=1B+O(m2)EB^22mB^2=B2+O(m2) 12

Using Eq. 12, we find

E(1B^)2(2B^)B^4mB^+2B^2m=(1B)2(2B)B+O(m2). 13

Now by xiβ^N(xiβ,si4B), Lemma 4 and the independence of β^ and B^, we also have

E[(1B^)((xiTβ^)2si/4B^)]=(1B)(xiTβ)2+O(m2). 14

Since we are ignoring O(m2) terms in MSE estimation, we can estimate the O(m1) terms in Eq. 12 simply by replacing B2 by B^2 and (xiβ)2 by (xiβ^)2. By Eqs. 13 and 14, we derive estimator of the MSE of λ^iEB in Theorem 3.

Theorem 3.

Assume conditions of Theorem 2. Then MSE(λ^iEB) as given in (13) is estimated by

MSE(λ^iEB)^=(1B^)(xiβ^)2+(1B^)2(2B^)8B^+siB^(xiβ^)2+(1B^)24(1B^)4B^+3B^28m+2B^m(xiβ^)2+32m(1B^)2+B^(1B^)2m12mB^+Op(m2).

The next theorem shows that MSE of the bias-corrected estimator λ^iCEB equals MSE(λ^iEB)+O(m2). In other words, bias correction does not lead to any significant improvement over MSE(λ^iEB), at least, when calculated up to O(m1).

Theorem 4.

Assume conditions of Theorem 2. Then

E[(λ^iCEBλi)2]=E[(λ^iEBλi)2]+O(m2).

Proof 5.

Write bias^=(2B^)4di, where di=si+21sim0=O(m1). We begin with

E[(λ^iCEBλi)2]=V[λ^iCEBλi],=V[λ^iEBλi]+V[bias^]2Cov(λ^iEBλi,bias^). 15

It is immediate that V[λ^iEBλi]=E[(λ^iEBλi)2](bias)2, where (bias)2=116(2B)2di2=O(m2). Next

V[bias^]=E[B^2]B2=E(m02)216||ZXβ^||4B2=(m02)21616B2(m02)(m04)B2=2B2m04.

Hence,

V[bias^]=di28B2m04=O(m3). 16

Finally, Cov(λ^iEBλi,bias^)=Cov(λ^iEB,bias^)Cov(λi,bias^), But

Cov(λi,bias^)=E[λibias^]E[λi]E[bias^],=Cov(λiB^,bias^),=Cov({(1B)zi+Bxiβ}2,bias^),=(1si)8m0(1B)2di=O(m2). 17

Thus, Cov(λ^iEBλi,bias^)=Cov(λ^iEB,bias^)+O(m2). Now,

Cov(λ^iEB,bias^)=Cov1B^4+{(1B^)zi+B^xiβ^}2,2B^4di,=di16V[B^]di4Cov{(1B^)zi+B^xiβ^}2,B^,=B28(m04)didi4Cov{xiβ^}2+2(1B^)(zixiβ^)xiβ^+(1B^)2(zixiβ^)2,B^. 18

Due to the independence of β^ and ZXβ^, Cov({xiβ^}2,B^)=0. Next, again invoking the symmetry of ZXβ around 0, and E(zixiβ^)=0,

Cov((1B^)(zixiβ^)xiβ^,B^)=E[{(1B^)(zixiβ^)xiβ^B^]E[{(1B^)(zixiβ^)xiβ^]E[B^],=0.

Hence Eq. 18 reduces to

Cov(λ^iEB,bias^)=(di/4)Cov((12B^+B^2)(zixiβ^)2,B^)+O(m2), 19
Cov((zixiβ^)2,B^)=E(m02)4(zixiβ^)2||ZXβ^||2E[(zixiβ^)2]B.=1si2m0. 20

Next

Cov(B^(zixiβ^)2,B^)=Cov(m02)4(zixiβ^)2||ZXβ^||2,(m02)4||ZXβ^||2=0

Since (zixiβ^)2||ZXβ^||2 is ancillary, and ||ZXβ^||2 is a function of the complete sufficient statistic (β^,ZXβ^), by Basu’s Theorem,

Cov(B^2(zixiβ^)2,B^)=E[B^3(zixiβ^)2]E[B^2(zixiβ^)2]B=(m02)364E(zixiβ^)2||ZXβ^||21||ZXβ^||4(m02)216E(zixiβ^)2||ZXβ^||21||ZXβ^||2,=(m02)(1si)B22m0(m04) 21

Hence, from Eqs. 1821, Cov(λ^iEB,bias^)=O(m2). This along with Eqs. 1517 proves the theorem. □

Data Analysis

In this section, we now deploy our approach on the 2020 COVID-19 pandemic dataset, which is available at https://usafacts.org/visualizations/coronavirus-covid-19-spread-map/usafacts.org. This example is used mainly for illustration. We are using the figures provided as the sampled estimates. Our study shows that the coefficient of determination (R2) does not increase much if we include other demographic variables such as the population size, number of people over age 60, and income in the linear model for the number of deaths regressing on number of confirmed cases. It suggests that the number of confirmed cases is the most crucial variable in estimation of the number of deaths by Coronavirus than the aforementioned demographic variables. We have also studied a few more county-level data sources1 and we found out that adjusted gross income (AGI)2 of the year 2017 is really relevant for estimating the number of deaths. In our model, we have transformed the number of confirmed cases and adjusted gross income (AGI) by taking the square root. All data are aggregated at the county level. We are interested in estimating the counts of death due to Coronavirus for all counties in Florida. Here m = 57 since Florida has 57 counties. From Section 2 we know that yiindN(xiβ,A+0.25) and we estimate β by ordinary least square method. Based on our analysis, we get β^=(0.2786,0.0917,0.0003), the respective estimates for the intercept, number of confirmed cases and AGI. We have summarized our results based on our model in Table 1 and the shrinkage factor B^=0.3777. It seems that our model-based approach seems to pull the direct estimates towards some grand average, as one anticipates in a typical EB analysis. Figure 1 shows that the estimates are higher in south east Florida than the rest of the state.

Table 1.

COVID19in Florida as of December 28, 2020

County name Confirm AGI1 Deaths2 EB3 BIAS Corrected BIAS EB4 RMSE5 Death rate6 Esti-mated rate7
Alachua 15473 7.294066 128 128.55 0.023 128.527 12.82 4.76 4.78
Baker 2414 0.554345 36 36.68 0.025 36.655 9.37 12.32 12.56
Bay 11403 4.571565 203 201.86 0.024 201.836 11.48 11.62 11.55
Bradford 2118 0.478624 23 23.75 0.025 23.725 9.32 8.16 8.42
Brevard 20355 17.959964 513 509.65 0.027 509.623 17.42 8.52 8.47
Broward 133480 68.207212 1828 1832.21 0.092 1832.118 45.89 9.36 9.38
Calhoun 1169 0.207429 28 28.70 0.025 28.675 9.16 19.85 20.35
Charlotte 7128 5.233347 231 229.35 0.023 229.327 11.46 12.23 12.14
Citrus 6467 3.447436 260 257.45 0.024 257.426 10.73 17.37 17.20
Clay 10829 6.127044 183 182.44 0.023 182.417 12.05 8.35 8.32
Collier 22004 27.616972 328 329.86 0.048 329.812 21.46 8.52 8.57
Columbia 5837 1.304030 117 116.71 0.025 116.685 9.86 16.32 16.28
DeSoto 2843 0.527165 52 52.52 0.025 52.495 9.38 13.68 13.82
Dixie 1088 0.540883 11 11.73 0.025 11.705 9.28 6.54 6.97
Duval 58491 28.523854 722 720.17 0.030 720.140 24.32 7.54 7.52
Escambia 21507 8.068327 360 357.02 0.024 356.996 13.52 11.31 11.22
Flagler 3565 3.411798 48 48.81 0.024 48.786 10.53 4.17 4.24
Franklin 878 0.283821 4 4.61 0.025 4.585 9.17 3.30 3.80
Gadsden 3836 0.824111 59 59.48 0.025 59.455 9.56 12.92 13.03
Gilchrist 984 0.328444 23 23.73 0.025 23.705 9.19 12.38 12.77
Glades 762 0.232870 11 11.72 0.025 11.695 9.14 7.96 8.49
Gulf 1273 0.321757 27 27.71 0.025 27.685 9.21 19.80 20.32
Hamilton 1224 0.190548 12 12.73 0.025 12.705 9.15 8.32 8.82
Hardee 2057 0.417308 20 20.75 0.025 20.725 9.29 7.42 7.70
Hendry 3235 0.709752 49 49.57 0.025 49.545 9.48 11.66 11.80
Hernando 7074 3.958244 264 261.49 0.024 261.466 10.96 13.61 13.48
Highlands 4822 1.882105 199 197.30 0.025 197.275 10.02 18.73 18.57
Hillsborough 74788 46.284944 1064 1064.35 0.053 1064.297 32.76 7.23 7.23
Holmes 1603 0.280332 24 24.73 0.025 24.705 9.21 12.23 12.61
Indian 6553 7.607307 156 155.99 0.024 155.966 12.36 9.75 9.75
Jackson 4502 0.819253 115 114.65 0.025 114.625 9.60 24.78 24.70
Jefferson 957 0.309690 16 16.74 0.025 16.715 9.18 11.23 11.75
Lafayette 1414 0.106457 21 21.73 0.025 21.705 9.13 24.93 25.80
Lake 14841 9.534133 293 291.32 0.023 291.297 13.67 7.98 7.94
Lee 39332 27.089733 652 649.84 0.031 649.809 22.42 8.46 8.43
Leon 18742 7.918907 173 173.09 0.023 173.067 13.28 5.89 5.90
Levy 1798 0.721669 19 19.77 0.025 19.745 9.39 4.58 4.76
Liberty 716 0.113382 14 14.73 0.025 14.705 9.09 16.76 17.63
Madison 1516 0.324513 33 33.68 0.025 33.655 9.22 17.84 18.21
Manatee 21539 13.804729 412 409.42 0.023 409.397 15.82 10.22 10.15
Marion 17125 7.971880 456 450.97 0.023 450.947 13.20 12.47 12.34
Martin 7656 9.243197 207 206.49 0.024 206.466 13.08 12.86 12.83
Miami-Dade 290363 92.544722 4155 4160.13 0.427 4159.703 67.08 15.29 15.31
Monroe 4168 5.227991 35 36.04 0.024 36.016 11.27 4.72 4.86
Nassau 4521 3.389901 66 66.65 0.024 66.626 10.58 7.45 7.52
Okaloosa 12407 6.812245 224 222.92 0.023 222.897 12.43 10.63 10.58
Okeechobee 2347 0.720267 51 51.54 0.025 51.515 9.42 12.09 12.22
Orange 73691 41.729653 732 735.09 0.044 735.046 30.80 5.25 5.28
Osceola 24512 7.340821 286 284.32 0.026 284.294 13.43 7.61 7.57
Palm 80865 84.441622 1866 1873.59 0.279 1873.311 49.08 12.47 12.52
Pasco 21222 13.333365 360 358.20 0.023 358.177 15.61 6.50 6.47
Pinellas 43480 35.959433 1035 1029.81 0.049 1029.761 26.35 10.62 10.56
Polk 35942 15.491829 767 758.92 0.024 758.896 17.47 10.58 10.47
Putnam 3819 1.217151 71 71.36 0.025 71.335 9.70 9.53 9.58
St. 12481 12.966328 112 113.41 0.025 113.385 14.88 4.23 4.28
St. 13864 7.683491 393 389.01 0.023 388.987 12.87 11.97 11.85
Santa 10575 5.192317 122 122.24 0.023 122.217 11.67 6.62 6.63
Sarasota 17916 20.149336 502 499.32 0.033 499.287 18.14 11.57 11.51
Seminole 17445 16.199289 311 310.49 0.026 310.464 16.51 6.59 6.58
Sumter 4819 4.709867 118 118.11 0.024 118.086 11.11 8.91 8.92
Suwannee 3913 0.701966 94 93.97 0.025 93.945 9.51 21.16 21.16
Taylor 1914 0.362501 25 25.73 0.025 25.705 9.26 11.59 11.93
Union 1416 0.217377 65 65.30 0.025 65.275 9.18 42.66 42.86
Volusia 21365 14.096377 430 427.17 0.023 427.147 15.93 7.77 7.72
Wakulla 2032 0.710564 21 21.77 0.025 21.745 9.40 6.22 6.45
Walton 4707 3.317614 43 43.86 0.024 43.836 10.57 5.81 5.92
Washington 1907 0.397700 30 30.71 0.025 30.685 9.27 11.78 12.06

1 Calculated in millions

2 Number of deaths is our direct estimates

3 Empirical Bayes estimator based on our methodology

4 Corrected BIAS EB= EB- BIAS

5 RMSE= MSE

6 Death rate= Deaths/(Population Size)

7 E7stimated rate= EB/(Population Size)

This rate is calculated for every 10,000 population

Figure 1.

Figure 1

The top two map compares the EB and BIAS corrected EB for the number of deaths due to COVID-19 in each county of Florida, and the bottom two maps show the BIAS and RMSE of the EB

Simulation

In this section, we will measure the performance of our model via a simulation study. The choice of the auxiliary parameters is guided by the case study of the previous section. For illustration purposes, we have considered only one covariate- the number of confirmed cases to estimate the number of deaths due to COVID 19. This data is available for 3,142 counties of the United States. For simulation purposes, we have taken a random sample from this data without any replacement for each choice of m. The number of small areas (counties), m, is set to be 25, 50, 100, 200, 500, or 1000. For each choice of m, we generated data from the model :

zi|𝜃iindN(𝜃i,1/4),𝜃i=λiindN(xiβ,A)

The design matrix X includes a column of ones and one explanatory variable. To set the value of the parameter for β and A, we first create a linear regression model for the number of deaths on the number of confirmed cases using entire data for 3,142 counties. The estimated value for regression coefficient vector β is (5.281570,0.000272) and mean square residuals is 22.75. For simulation we set β = (5.281570,0.000272) and A = 22.75 − 0.25 = 22.50, hence shrinkage factor B = 0.011. Due to this variance stabilizing transformation, the shrinkage factor does not change between counties. Now using Eq. 22 we generate λi and zi for all i = (1,…,m). The explanatory variable is again number of confirmed cases which is simulated randomly without replacement from the entire populations of 3,142 counties in the United States.

Here we will compare the true RMSE and estimated RMSE of λ^iEB. We examine our findings in Theorems 2, 3 and 4 based on six different settings for m. Here we will vary the m and only one dataset is generated for each m, the latter taking values 25,50,100,200,500,1000. We have estimated the true RMSE of λ^iCEB based on 1,000 simulated samples since we do not have exact expression for RMSE of λ^iCEB.

Figure 2 substantiates that the approximations given in Theorems 3 and 4 are fairly close to the true RMSE. In addition, they also point out one particular small area where the MSE is significantly higher than rest of the small areas.

Figure 2.

Figure 2

figure compares True RMSE: root of Eq. 12, estimated RMSE: root of Eq. 15 and simulated RMSE of bias corrected estimators. The RMSE of bias corrected EB estimators (λ^iCEB) is based on 1,000 simulations provided in Fig. 2 and it also verifies the result in Theorem 4

Conclusion

The paper introduces square root transformation of Poisson count data, and attains approximately both normality of the transformed data as well as variance stabilization. In this way, we obtain explicit estimates of bias and MSE for Poisson means. Based on the simulation, it seems that our estimates closely resemble the truth. Data analysis part tells us that estimates are higher on south-east Florida when the model appropriate.

There are many potential extensions. One that immediately comes to mind is consideration of unit level models with corresponding square root transformation. Gonçalves and Ghosh (2021) have addressed this problem using a pure hierarchical Bayesian framework, but an empirical Bayes approach with all its theoretical properties should also be a topic of future investigation. Even under the present framework, one may add a spatial component and using something like a CAR model (see for example, Ghosh et al., 1999). A final interesting problem is to consider an overdispersed Poisson model, i.e. a negative binomial model for count data with variable transformation as in Yu (2009) which also leads to homoscedasticity.

Acknowledgements

The authors are grateful to the editor and anonymous reviewer(s) for their constructive comments and suggestions which greatly improved an earlier version of this article.

Appendix:

Proof of Theorem 2

Proof 6.

In this section we will do calculate the MSE of the EB estimator λ^iEB. We observe that E((λ^iBλi)(λ~iEBλ^iB))=0 and E((λ^iBλi)(λ^iEBλ~iEB))=0 since E((λ^iBλi)|zi)=0. Thus we have

E(λ^iEBλi)2=E(λ^iBλi)2+E(λ~iEBλ^iB)2+E(λ^iEBλ~iEB)2+2E((λ^iEBλ~iEB)(λ~iEBλ^iB)) 5.11

Each term in the right side of Eq. 5.11 will be computed separately. Since 𝜃i|ziindN((1B)zi+Bxiβ,(1B)/4), by Lemma 1,

E(λ^iBλi)2=E(E(λi|zi)λi)2=E[E{(E(λi)λi)2|zi}]=E[Var(λi|zi)]=E[(1B)2/8+ui22(1B)]=(1B)[(1B)/8+(A(1B)+(xiβ)2)]=(1B)(xiβ)2+(1B)(2B)8B.

Next we compute the second term in the right side of Eq. 5.11. By Lemmas 2 and Eq. 7,

E(λ~iEBλ^iB)2=Var(λ^iB)+Var(λ~iEB)2Cov(λ^iB,λ~iEB)+(E(λ~iEBλ^iB))2=Var(ui12)+Var(ui22)2Cov(ui12,ui22)+(E(ui12ui22))2. 5.12

Also,

Var(ui12)=2(A(1B)+(2B)si/4)2+(4A(1B)+(2B)si)(xiβ)2 5.13
Var(ui22)=2(A(1B))2+4A(1B)(xiβ)2 5.14
Cov(ui12,ui22)=2(1B)2(A+si/4)2+4(xiβ)2(1B)(A+si/4) 5.15
[E(ui12ui22)]2=(2B)2si2/16 5.16

By Eqs. 5.135.165.12 simplifies to

E(λ~iEBλ^iB)2=Bsi(xiβ)2+(1B)2si/4+si2(1/2B/4B2/16) 5.17

Now we evaluate the last expression in right side of Eq. 5.11.

λ~iEBλ^iB=((1B)zi+Bxiβ^)2((1B)zi+Bxiβ)2=B2[(xiβ^xiβ)2+2xiβ(xiβ^xiβ)]+2B(1B)(xiβ^xiβ)xiβ^+2B(1B)(xiβ^xiβ)(zixiβ^)=B1+B2+B3 5.18

We define B1=B2[(xiβ^xiβ)2+2xiβ(xiβ^xiβ)],B2=2B(1B)(xiβ^xiβ)xiβ^,B3=2B(1B)(xiβ^xiβ)(zixiβ^). The expressions A1,A2 and A3 in Eq. 8 are functions of residuals ((zixiβ^)...,,(zixiβ^)) and the expressions B1 and B2 in Eq. 5.18 are functions of β^. Therefore, (A1,A2,A3) is independent of (B1,B2) and their covariances are 0. The expectation of B3 is 0 since (xiβ^xiβ)is independent of(zixiβ^). Also,

Cov(A1,B3)=2B(1B)4E((BB^)(zixiβ^))E(xiβ^xiβ)=0.Cov(A3,B3)=2B(1B)E(B^2B22(B^B))(zixiβ^)3×E(xiβ^xiβ)=0.

Hence, again using part (v) of Lemma 4, Eqs. 8 and 5.18, we have

Cov(λ^iEBλ~iEB,λ~iEBλ^iB)=Cov(A2,B3)=E(2xiβ^(BB^)(znxnβ^)2B(1B)×(xiβ^xiβ)(zixiβ^))=4B(1B)E((BB^)(zixiβ^)2)×E(xiβ^(xiβ^xiβ))=4B(1B)(1si)2m0Var(xiβ^)=4B(1B)(1si)2m0(1/4+A)si=(1B)si(1si)2m0.

Hence, from Eqs. 7 and 10,

E((λ^iEBλ~iEB)(λ~iEBλ^iB))=(1B)si(1si)2m0+2B2(1si)m02B4si=si(1si)2m0(22B+B2/4).

Next we compute the remaining third term in the right side of Eq. 5.11.

E(λ^iEBλ~iEB)2=E(A1+A2+A3)2=E(BB^)2/16+4E(xiβ^(BB^)(zixiβ^))2+E[{(B^2B2)2(B^B)}2(zixiβ^)4]+[E(BB^){(B^2B2)2(B^B)}(zixiβ^)2]/2. 5.19

In the above calculation, the cross terms E(A1A2) and E(A2A3) in Eq. 5.19 vanish by part (iii) of Lemma 3. Again, by part (ii) of Lemma 4, we have,

E(BB^)2=B22BE(B^)+E(B^2)=B2+m02m04B2=2B2(m04). 5.20

Again, applying Lemmas 3 and 4,

E[xiβ^(BB^)(zixiβ^)]2=E[(xiβ^)2]E[(BB^)2(zixiβ^)2]=(xiβ)2+si4B×E(zixiβ^)2||ZXβ^||2(BB^)2m024B^=(xiβ)2+si4B(1si)(m02)4m0×E[B2/B^2B+B^]=(xiβ)2+si4B(1si)(m02)4m02Bm02=1si2m0(xiβ)2B+si4.

Now we calculate the third term, 1/2E[{(B^2B2)2(B^B)}2(zixiβ^)4] in Eq. 5.19. By Lemma 3 and 4 and recalling ||ZXβ^||2=m024B^, we obtain,

E[{(B^2B2)2(B^B)}2(zixiβ^)4]=E(zixiβ^)4||ZXβ^||4E{(B^2B2)2(B^B)}2}(m02)216B^2=3(1si)2(m02)216m0(m0+2)E[(B^22B2+B4/B^2)4(B^B2/B^B+B3/B^2)+4(12B/B^+B2/B^2)]=3(1si)2(m02)216m0(m0+2)B2m02m042+m0(m0+2)(m02)2+4Bm0m02m0(m0+2)(m02)2+412m0m02+m0(m0+2)(m02)2=3(1si)2(m02)216m0(m0+2)B24(2m029m0+6)(m04)(m02)2+4B4m0(m02)2+42(m0+2)(m02)2)=3(1si)22m029m0+64m0(m0+2)(m04)B2Bm0+2+12m0. 5.21

Finally, once again by Lemmas 3 and 4, and recalling ||ZXβ^||2=m024B^, we get (5.19),

E[(BB^){(B^2B2)2(B^B)}(zixiβ^)2]=E(zixiβ^)2||ZXβ^||2E(BB^){(B^2B2)2(B^B)}m024B^=(1si)(m02)4m0EBB^B^2B3/B^+B22(BB^)2B+2B2B^=(1si)(m02)4m0B2B2m02m04m0B2m02+B22B+m02Bm02=(1si)(m02)4m04Bm024B2(m03)(m02)(m04)=(1si)Bm01B(m03)m04. 5.22

From Eqs. 5.195.22, we obtain,

E(λ^iEBλ~iEB)2=B28(m04)+1si2m04(xiβ)2B+si+3(1si)22m029m0+64m0(m0+2)(m04)B2Bm0+2+12m0+(1si)B2m01B(m03)m04.

Theorem 2 follows from Eqs. 5.115.125.175.19 and 5.23. □

Funding

The third author’s research was partially supported by JSPS KAKENHI grant number 18K12758.

Compliance with Ethical Standards

The Author(s) declare(s) that there is no conflict of interest that are relevant to the content of this article.

Footnotes

1

US Census Bureau and Statistics of Income Division (SOI) of the IRS

The third author’s research was partially supported by JSPS KAKENHI grant number 18K12758.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Malay Ghosh, Email: ghoshm@ufl.edu.

Tamal Ghosh, Email: tamalg@ufl.edu.

Masayo Y. Hirose, Email: masayo@imi.kyushu-u.ac.jp

References

  1. Basu D. On statistics independent of a complete sufficient statistic. Sankhyā,: The Indian Journal of Statistics (1933-1960) 1955;15:377–380. [Google Scholar]
  2. Efron B, Morris C. Stein’s estimation rule and its competitors—an empirical bayes approach. Journal of the American Statistical Association. 1973;68:117–130. [Google Scholar]
  3. Fay RE, Herriot RA. Estimates of income for small places: an application of james-stein procedures to census data. Journal of the American Statistical Association. 1979;74:269–277. doi: 10.1080/01621459.1979.10482505. [DOI] [Google Scholar]
  4. Ghosh M, Natarajan K, Waller LA, Kim D. Hierarchical bayes glms for the analysis of spatial data: an application to disease mapping. Journal of Statistical Planning and Inference. 1999;75:305–318. doi: 10.1016/S0378-3758(98)00150-5. [DOI] [Google Scholar]
  5. Ghosh M, Kubokawa T, Kawakubo Y. Benchmarked empirical bayes methods in multiplicative area-level models with risk evaluation. Biometrika. 2015;102:647–659. doi: 10.1093/biomet/asv010. [DOI] [Google Scholar]
  6. Gonçalves, K. C. and Ghosh, M. (2021). Unit level model for small area estimation with count data under square root transformation. Brazilian Journal of Statistics and Probability In Press.
  7. Hirose, MY, Ghosh, M and Ghosh, T (2021). Arc-sin transformation for binomial sample proportions in small area estimation. Statistica Sinica Preprint No: SS-2020-0446. 10.5705/ss.202020.0446
  8. Slud EV, Maiti T. Mean-squared error estimation in transformed fay–herriot models. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2006;68(2):239–257. doi: 10.1111/j.1467-9868.2006.00542.x. [DOI] [Google Scholar]
  9. Sugasawa S, Kubokawa T. Transforming response values in small area prediction. Computational Statistics & Data Analysis. 2017;114:47–60. doi: 10.1016/j.csda.2017.03.017. [DOI] [Google Scholar]
  10. Yu G. Variance stabilizing transformations of poisson, binomial and negative binomial distributions. Statistics & Probability Letters. 2009;79(14):1621–1629. doi: 10.1016/j.spl.2009.04.010. [DOI] [Google Scholar]

Articles from Sankhya. Series B (2008) are provided here courtesy of Nature Publishing Group

RESOURCES