Abstract
The paper intends to serve two objectives. First, it revisits the celebrated Fay-Herriot model, but with homoscedastic known error variance. The motivation comes from an analysis of count data, in the present case, COVID-19 fatality for all counties in Florida. The Poisson model seems appropriate here, as is typical for rare events. An empirical Bayes (EB) approach is taken for estimation. However, unlike the conventional conjugate gamma or the log-normal prior for the Poisson mean, here we make a square root transformation of the original Poisson data, along with square root transformation of the corresponding mean. Proper back transformation is used to infer about the original Poisson means. The square root transformation makes the normal approximation of the transformed data more justifiable with added homoscedasticity. We obtain exact analytical formulas for the bias and mean squared error of the proposed EB estimators. In addition to illustrating our method with the COVID-19 example, we also evaluate performance of our procedure with simulated data as well.
Keywords: COVID19, Empirical Bayes, Fay-Herriot model, Random Effects Model, Stein-type shrinkage estimators.
Introduction
Small area estimation is now a topic of global importance. Methodologies abound, and many of these are finding real-life applications.
Normal theory small area estimation pervades the literature, and the pioneering (Fay and Herriot, 1979) method is most often used in real life applications. The Fay-Herriot model is a normal theory mixed effects area-level model. The model needs to assume known error variances in order to avoid non-identifiability, whereas in reality, these are sample estimates.
The present article deals with the Fay-Herriot model with known constant error variance. The motivation came from an analysis of count data, in particular, COVID-19 data, to find estimates of fatality for all counties in the state of Florida. The Poisson model is used, but we make a square transformation of the original data, and the corresponding mean parameters to attain a closer approximation to normality with added homoscedasticity. This is in contrast to the log-transformation, where also one typically assumes normality of the transformed data. However, transformation of the original data in the log-scale bears the potential hazard of leaving out zero counts, which on most occasions, can affect the conclusion significantly. Further, our approach allows one to develop Stein-type shrinkage estimators for small area means and study their properties analytically.
Variable transformation in the small area context has been addressed before. The logarithmic transformation with the assumption of log-normal distribution of the original data is most commonly used, for example, in modeling income distributions. See for example, Slud and Maiti (2006) and Ghosh et al. (2015). Recently, Hirose et al. (2021) considered an arc-sin transformation of binomial proportions for small area estimation. Sugasawa and Kubokawa (2017) suggested a non-explicit EB estimator and performed an analysis based on the dual power transformation similar to that of Hirose et al. (2021).
The remaining sections are as follows. In Section 2, we introduce the square root transformation, develop Stein-type shrinkage estimators for the transformed data motivated from an empirical Bayes point of view, and then back transform properly to estimate the original parameters of interest. In Sections 3 and 4, we obtain exact expressions for the bias and the mean squared error of our shrinkage estimator. Finally, Section 5 we also obtain an estimator of the mean squared error correct up to order where m is the number of small areas. Section 6 contains an illustration of the proposed method to estimate the number of deaths due to COVID-19 in each county. A simulation study is undertaken in Section 7. Some final remarks are made in Section 8.
Empirical Bayes Estimators
Suppose there are m areas with counts yi for i-th area. We assume yi are independently distributed from Poisson(λi) for i-th area. We transform and with the usual variance stabilizing square root transformation so that V (zi) is approximately 1/4. We begin with
Following the customary approach, we consider independent priors for the 𝜃i with p-dimensional auxiliary variables xi and regression parameter where m > p + 4. The posterior where . Thus, Bayes estimator of λi is
| 1 |
We now turn towards empirical Bayes (EB) estimation of the λi. Writing X = (x1,...,xm)⊤, Z = (z1,...,zm)⊤ and , it follows that marginally . Here X is a m × p matrix with rank p. Following Efron and Morris (1973), an EB estimator of B is . Thus an EB estimator of λi is
| 2 |
For proving our technical results, we find it convenient also to define
| 3 |
which is also an EB estimator of λi if the shrinkage factor B were known. Also, for notational simplicity we write m0 = m − p hereafter.
Bias of
For both bias and mean squared error (MSE) calculations for we need the following lemmas.
Lemma 1.
Let Then
Lemma 2.
Define , and then,
| 4 |
Proof 1.
The result follows from the independence of and and noting that while . □
Lemma 3.
-
(i)
is distributed independently of .
-
(ii)
for all positive integers k.
-
(iii)
if k is an odd positive integer and 0 < l < k.
Proof 2.
Marginally, . Hence, is complete sufficient for (β,A), while is ancillary. This proves (i) by an application of Basu’s Theorem (Basu, 1955).
Now for any positive integer k, and by part (i) in Lemma 3, we have,
This leads to .
Next noting that it follows that
Therefore, is symmetric random variable around 0 and its all odd moments are 0. Hence,
We get the second equality in the above equations using part (i) of this Lemma. Proof of (iii) is complete after observing k is an odd integer. □
Lemma 4.
-
(i)
if m0 > 2.
-
(ii)
if m0 > 4.
-
(iii)
if m0 > 2.
-
(iv)
if m0 > 2.
-
(v)
.
-
(vi)
.
Proof 3.
The proof (i) to (iv) follows by noting that and . To prove (v) and (vi) we also need part (ii) of Lemma 3. □
Now we start with calculation of the bias . The following theorem is proved.
Theorem 1.
Suppose with priors and m0 > 2. Then bias of the EB estimator in Eq. 2 for is given by
| 5 |
Proof 4.
We begin with the partition
| 6 |
By Lemmas 1 and 2,
| 7 |
Noticing , we have . It is easy to see that
| 8 |
where and . The expectation of A1 is 0 since is unbiased estimator of B. Now,
| 9 |
The first equality in Eq. 9 is by independence of and . The third equality holds by part (iii) of Lemma 3 since .
Finally, we simplify A3. By (i) and (iii) of Lemma 3 and Lemma 4,
| 10 |
The proof of Eq. 5 follows now by combining (6) with Eqs. 7–10. □
Remark 1.
With the usual assumption, for large m, the bias of the EB estimator , .
Remark 2.
We can estimate the bias in Eq. 5 by replacing the B by , An unbiased estimator of the bias is
| 11 |
Thus, from Eq. 11, the EB estimator has positive bias, and the bias-corrected estimator of λi is
MSE of
The following theorem provides an exact expression for the MSE of .
Theorem 2.
Suppose with priors and m0 > 4. Then MSE of the EB estimator in Eq. 2 for is given by
The proof of Theorem 2 is given in Appendix A.
Remark 3.
Theorem 2 shows that the MSE of EB estimator , due to the first term in Eq. 12 for large m.
Estimation of the MSE of
In this section we estimate the MSE of provided in Theorem 2 up to the order O(m− 1) for large m. We now assume that . Ignoring the O(m− 2) terms, we rewrite
It is easy to see that only first two terms in Eq. 12 do not depend on m and remaining terms are of . Using Lemma 4 we get
| 12 |
Using Eq. 12, we find
| 13 |
Now by , Lemma 4 and the independence of and , we also have
| 14 |
Since we are ignoring terms in MSE estimation, we can estimate the terms in Eq. 12 simply by replacing B2 by and by . By Eqs. 13 and 14, we derive estimator of the MSE of in Theorem 3.
Theorem 3.
Assume conditions of Theorem 2. Then as given in (13) is estimated by
The next theorem shows that MSE of the bias-corrected estimator equals . In other words, bias correction does not lead to any significant improvement over , at least, when calculated up to .
Theorem 4.
Assume conditions of Theorem 2. Then
Proof 5.
Write , where . We begin with
| 15 |
It is immediate that , where . Next
Hence,
| 16 |
Finally, , But
| 17 |
Thus, . Now,
| 18 |
Due to the independence of and , . Next, again invoking the symmetry of Z − Xβ around 0, and ,
Hence Eq. 18 reduces to
| 19 |
| 20 |
Next
Since is ancillary, and is a function of the complete sufficient statistic , by Basu’s Theorem,
| 21 |
Hence, from Eqs. 18–21, . This along with Eqs. 15–17 proves the theorem. □
Data Analysis
In this section, we now deploy our approach on the 2020 COVID-19 pandemic dataset, which is available at https://usafacts.org/visualizations/coronavirus-covid-19-spread-map/usafacts.org. This example is used mainly for illustration. We are using the figures provided as the sampled estimates. Our study shows that the coefficient of determination (R2) does not increase much if we include other demographic variables such as the population size, number of people over age 60, and income in the linear model for the number of deaths regressing on number of confirmed cases. It suggests that the number of confirmed cases is the most crucial variable in estimation of the number of deaths by Coronavirus than the aforementioned demographic variables. We have also studied a few more county-level data sources1 and we found out that adjusted gross income (AGI)2 of the year 2017 is really relevant for estimating the number of deaths. In our model, we have transformed the number of confirmed cases and adjusted gross income (AGI) by taking the square root. All data are aggregated at the county level. We are interested in estimating the counts of death due to Coronavirus for all counties in Florida. Here m = 57 since Florida has 57 counties. From Section 2 we know that and we estimate β by ordinary least square method. Based on our analysis, we get , the respective estimates for the intercept, number of confirmed cases and AGI. We have summarized our results based on our model in Table 1 and the shrinkage factor . It seems that our model-based approach seems to pull the direct estimates towards some grand average, as one anticipates in a typical EB analysis. Figure 1 shows that the estimates are higher in south east Florida than the rest of the state.
Table 1.
COVID19in Florida as of December 28, 2020
| County name | Confirm | AGI1 | Deaths2 | EB3 | BIAS | Corrected BIAS EB4 | RMSE5 | Death rate6 | Esti-mated rate7 |
|---|---|---|---|---|---|---|---|---|---|
| Alachua | 15473 | 7.294066 | 128 | 128.55 | 0.023 | 128.527 | 12.82 | 4.76 | 4.78 |
| Baker | 2414 | 0.554345 | 36 | 36.68 | 0.025 | 36.655 | 9.37 | 12.32 | 12.56 |
| Bay | 11403 | 4.571565 | 203 | 201.86 | 0.024 | 201.836 | 11.48 | 11.62 | 11.55 |
| Bradford | 2118 | 0.478624 | 23 | 23.75 | 0.025 | 23.725 | 9.32 | 8.16 | 8.42 |
| Brevard | 20355 | 17.959964 | 513 | 509.65 | 0.027 | 509.623 | 17.42 | 8.52 | 8.47 |
| Broward | 133480 | 68.207212 | 1828 | 1832.21 | 0.092 | 1832.118 | 45.89 | 9.36 | 9.38 |
| Calhoun | 1169 | 0.207429 | 28 | 28.70 | 0.025 | 28.675 | 9.16 | 19.85 | 20.35 |
| Charlotte | 7128 | 5.233347 | 231 | 229.35 | 0.023 | 229.327 | 11.46 | 12.23 | 12.14 |
| Citrus | 6467 | 3.447436 | 260 | 257.45 | 0.024 | 257.426 | 10.73 | 17.37 | 17.20 |
| Clay | 10829 | 6.127044 | 183 | 182.44 | 0.023 | 182.417 | 12.05 | 8.35 | 8.32 |
| Collier | 22004 | 27.616972 | 328 | 329.86 | 0.048 | 329.812 | 21.46 | 8.52 | 8.57 |
| Columbia | 5837 | 1.304030 | 117 | 116.71 | 0.025 | 116.685 | 9.86 | 16.32 | 16.28 |
| DeSoto | 2843 | 0.527165 | 52 | 52.52 | 0.025 | 52.495 | 9.38 | 13.68 | 13.82 |
| Dixie | 1088 | 0.540883 | 11 | 11.73 | 0.025 | 11.705 | 9.28 | 6.54 | 6.97 |
| Duval | 58491 | 28.523854 | 722 | 720.17 | 0.030 | 720.140 | 24.32 | 7.54 | 7.52 |
| Escambia | 21507 | 8.068327 | 360 | 357.02 | 0.024 | 356.996 | 13.52 | 11.31 | 11.22 |
| Flagler | 3565 | 3.411798 | 48 | 48.81 | 0.024 | 48.786 | 10.53 | 4.17 | 4.24 |
| Franklin | 878 | 0.283821 | 4 | 4.61 | 0.025 | 4.585 | 9.17 | 3.30 | 3.80 |
| Gadsden | 3836 | 0.824111 | 59 | 59.48 | 0.025 | 59.455 | 9.56 | 12.92 | 13.03 |
| Gilchrist | 984 | 0.328444 | 23 | 23.73 | 0.025 | 23.705 | 9.19 | 12.38 | 12.77 |
| Glades | 762 | 0.232870 | 11 | 11.72 | 0.025 | 11.695 | 9.14 | 7.96 | 8.49 |
| Gulf | 1273 | 0.321757 | 27 | 27.71 | 0.025 | 27.685 | 9.21 | 19.80 | 20.32 |
| Hamilton | 1224 | 0.190548 | 12 | 12.73 | 0.025 | 12.705 | 9.15 | 8.32 | 8.82 |
| Hardee | 2057 | 0.417308 | 20 | 20.75 | 0.025 | 20.725 | 9.29 | 7.42 | 7.70 |
| Hendry | 3235 | 0.709752 | 49 | 49.57 | 0.025 | 49.545 | 9.48 | 11.66 | 11.80 |
| Hernando | 7074 | 3.958244 | 264 | 261.49 | 0.024 | 261.466 | 10.96 | 13.61 | 13.48 |
| Highlands | 4822 | 1.882105 | 199 | 197.30 | 0.025 | 197.275 | 10.02 | 18.73 | 18.57 |
| Hillsborough | 74788 | 46.284944 | 1064 | 1064.35 | 0.053 | 1064.297 | 32.76 | 7.23 | 7.23 |
| Holmes | 1603 | 0.280332 | 24 | 24.73 | 0.025 | 24.705 | 9.21 | 12.23 | 12.61 |
| Indian | 6553 | 7.607307 | 156 | 155.99 | 0.024 | 155.966 | 12.36 | 9.75 | 9.75 |
| Jackson | 4502 | 0.819253 | 115 | 114.65 | 0.025 | 114.625 | 9.60 | 24.78 | 24.70 |
| Jefferson | 957 | 0.309690 | 16 | 16.74 | 0.025 | 16.715 | 9.18 | 11.23 | 11.75 |
| Lafayette | 1414 | 0.106457 | 21 | 21.73 | 0.025 | 21.705 | 9.13 | 24.93 | 25.80 |
| Lake | 14841 | 9.534133 | 293 | 291.32 | 0.023 | 291.297 | 13.67 | 7.98 | 7.94 |
| Lee | 39332 | 27.089733 | 652 | 649.84 | 0.031 | 649.809 | 22.42 | 8.46 | 8.43 |
| Leon | 18742 | 7.918907 | 173 | 173.09 | 0.023 | 173.067 | 13.28 | 5.89 | 5.90 |
| Levy | 1798 | 0.721669 | 19 | 19.77 | 0.025 | 19.745 | 9.39 | 4.58 | 4.76 |
| Liberty | 716 | 0.113382 | 14 | 14.73 | 0.025 | 14.705 | 9.09 | 16.76 | 17.63 |
| Madison | 1516 | 0.324513 | 33 | 33.68 | 0.025 | 33.655 | 9.22 | 17.84 | 18.21 |
| Manatee | 21539 | 13.804729 | 412 | 409.42 | 0.023 | 409.397 | 15.82 | 10.22 | 10.15 |
| Marion | 17125 | 7.971880 | 456 | 450.97 | 0.023 | 450.947 | 13.20 | 12.47 | 12.34 |
| Martin | 7656 | 9.243197 | 207 | 206.49 | 0.024 | 206.466 | 13.08 | 12.86 | 12.83 |
| Miami-Dade | 290363 | 92.544722 | 4155 | 4160.13 | 0.427 | 4159.703 | 67.08 | 15.29 | 15.31 |
| Monroe | 4168 | 5.227991 | 35 | 36.04 | 0.024 | 36.016 | 11.27 | 4.72 | 4.86 |
| Nassau | 4521 | 3.389901 | 66 | 66.65 | 0.024 | 66.626 | 10.58 | 7.45 | 7.52 |
| Okaloosa | 12407 | 6.812245 | 224 | 222.92 | 0.023 | 222.897 | 12.43 | 10.63 | 10.58 |
| Okeechobee | 2347 | 0.720267 | 51 | 51.54 | 0.025 | 51.515 | 9.42 | 12.09 | 12.22 |
| Orange | 73691 | 41.729653 | 732 | 735.09 | 0.044 | 735.046 | 30.80 | 5.25 | 5.28 |
| Osceola | 24512 | 7.340821 | 286 | 284.32 | 0.026 | 284.294 | 13.43 | 7.61 | 7.57 |
| Palm | 80865 | 84.441622 | 1866 | 1873.59 | 0.279 | 1873.311 | 49.08 | 12.47 | 12.52 |
| Pasco | 21222 | 13.333365 | 360 | 358.20 | 0.023 | 358.177 | 15.61 | 6.50 | 6.47 |
| Pinellas | 43480 | 35.959433 | 1035 | 1029.81 | 0.049 | 1029.761 | 26.35 | 10.62 | 10.56 |
| Polk | 35942 | 15.491829 | 767 | 758.92 | 0.024 | 758.896 | 17.47 | 10.58 | 10.47 |
| Putnam | 3819 | 1.217151 | 71 | 71.36 | 0.025 | 71.335 | 9.70 | 9.53 | 9.58 |
| St. | 12481 | 12.966328 | 112 | 113.41 | 0.025 | 113.385 | 14.88 | 4.23 | 4.28 |
| St. | 13864 | 7.683491 | 393 | 389.01 | 0.023 | 388.987 | 12.87 | 11.97 | 11.85 |
| Santa | 10575 | 5.192317 | 122 | 122.24 | 0.023 | 122.217 | 11.67 | 6.62 | 6.63 |
| Sarasota | 17916 | 20.149336 | 502 | 499.32 | 0.033 | 499.287 | 18.14 | 11.57 | 11.51 |
| Seminole | 17445 | 16.199289 | 311 | 310.49 | 0.026 | 310.464 | 16.51 | 6.59 | 6.58 |
| Sumter | 4819 | 4.709867 | 118 | 118.11 | 0.024 | 118.086 | 11.11 | 8.91 | 8.92 |
| Suwannee | 3913 | 0.701966 | 94 | 93.97 | 0.025 | 93.945 | 9.51 | 21.16 | 21.16 |
| Taylor | 1914 | 0.362501 | 25 | 25.73 | 0.025 | 25.705 | 9.26 | 11.59 | 11.93 |
| Union | 1416 | 0.217377 | 65 | 65.30 | 0.025 | 65.275 | 9.18 | 42.66 | 42.86 |
| Volusia | 21365 | 14.096377 | 430 | 427.17 | 0.023 | 427.147 | 15.93 | 7.77 | 7.72 |
| Wakulla | 2032 | 0.710564 | 21 | 21.77 | 0.025 | 21.745 | 9.40 | 6.22 | 6.45 |
| Walton | 4707 | 3.317614 | 43 | 43.86 | 0.024 | 43.836 | 10.57 | 5.81 | 5.92 |
| Washington | 1907 | 0.397700 | 30 | 30.71 | 0.025 | 30.685 | 9.27 | 11.78 | 12.06 |
1 Calculated in millions
2 Number of deaths is our direct estimates
3 Empirical Bayes estimator based on our methodology
4 Corrected BIAS EB= EB- BIAS
5 RMSE=
6 Death rate= Deaths/(Population Size)
7 E7stimated rate= EB/(Population Size)
This rate is calculated for every 10,000 population
Figure 1.
The top two map compares the EB and BIAS corrected EB for the number of deaths due to COVID-19 in each county of Florida, and the bottom two maps show the BIAS and RMSE of the EB
Simulation
In this section, we will measure the performance of our model via a simulation study. The choice of the auxiliary parameters is guided by the case study of the previous section. For illustration purposes, we have considered only one covariate- the number of confirmed cases to estimate the number of deaths due to COVID 19. This data is available for 3,142 counties of the United States. For simulation purposes, we have taken a random sample from this data without any replacement for each choice of m. The number of small areas (counties), m, is set to be 25, 50, 100, 200, 500, or 1000. For each choice of m, we generated data from the model :
The design matrix X includes a column of ones and one explanatory variable. To set the value of the parameter for β and A, we first create a linear regression model for the number of deaths on the number of confirmed cases using entire data for 3,142 counties. The estimated value for regression coefficient vector β is (5.281570,0.000272)⊤ and mean square residuals is 22.75. For simulation we set β = (5.281570,0.000272)⊤ and A = 22.75 − 0.25 = 22.50, hence shrinkage factor B = 0.011. Due to this variance stabilizing transformation, the shrinkage factor does not change between counties. Now using Eq. 22 we generate λi and zi for all i = (1,…,m). The explanatory variable is again number of confirmed cases which is simulated randomly without replacement from the entire populations of 3,142 counties in the United States.
Here we will compare the true RMSE and estimated RMSE of . We examine our findings in Theorems 2, 3 and 4 based on six different settings for m. Here we will vary the m and only one dataset is generated for each m, the latter taking values 25,50,100,200,500,1000. We have estimated the true RMSE of based on 1,000 simulated samples since we do not have exact expression for RMSE of .
Figure 2 substantiates that the approximations given in Theorems 3 and 4 are fairly close to the true RMSE. In addition, they also point out one particular small area where the MSE is significantly higher than rest of the small areas.
Figure 2.
figure compares True RMSE: root of Eq. 12, estimated RMSE: root of Eq. 15 and simulated RMSE of bias corrected estimators. The RMSE of bias corrected EB estimators () is based on 1,000 simulations provided in Fig. 2 and it also verifies the result in Theorem 4
Conclusion
The paper introduces square root transformation of Poisson count data, and attains approximately both normality of the transformed data as well as variance stabilization. In this way, we obtain explicit estimates of bias and MSE for Poisson means. Based on the simulation, it seems that our estimates closely resemble the truth. Data analysis part tells us that estimates are higher on south-east Florida when the model appropriate.
There are many potential extensions. One that immediately comes to mind is consideration of unit level models with corresponding square root transformation. Gonçalves and Ghosh (2021) have addressed this problem using a pure hierarchical Bayesian framework, but an empirical Bayes approach with all its theoretical properties should also be a topic of future investigation. Even under the present framework, one may add a spatial component and using something like a CAR model (see for example, Ghosh et al., 1999). A final interesting problem is to consider an overdispersed Poisson model, i.e. a negative binomial model for count data with variable transformation as in Yu (2009) which also leads to homoscedasticity.
Acknowledgements
The authors are grateful to the editor and anonymous reviewer(s) for their constructive comments and suggestions which greatly improved an earlier version of this article.
Appendix:
Proof of Theorem 2
Proof 6.
In this section we will do calculate the MSE of the EB estimator . We observe that and since . Thus we have
| 5.11 |
Each term in the right side of Eq. 5.11 will be computed separately. Since , by Lemma 1,
Next we compute the second term in the right side of Eq. 5.11. By Lemmas 2 and Eq. 7,
| 5.12 |
Also,
| 5.13 |
| 5.14 |
| 5.15 |
| 5.16 |
By Eqs. 5.13–5.16, 5.12 simplifies to
| 5.17 |
Now we evaluate the last expression in right side of Eq. 5.11.
| 5.18 |
We define . The expressions A1,A2 and A3 in Eq. 8 are functions of residuals and the expressions B1 and B2 in Eq. 5.18 are functions of . Therefore, (A1,A2,A3) is independent of (B1,B2) and their covariances are 0. The expectation of B3 is 0 since . Also,
Hence, again using part (v) of Lemma 4, Eqs. 8 and 5.18, we have
Next we compute the remaining third term in the right side of Eq. 5.11.
| 5.19 |
In the above calculation, the cross terms E(A1A2) and E(A2A3) in Eq. 5.19 vanish by part (iii) of Lemma 3. Again, by part (ii) of Lemma 4, we have,
| 5.20 |
Again, applying Lemmas 3 and 4,
Now we calculate the third term, in Eq. 5.19. By Lemma 3 and 4 and recalling , we obtain,
| 5.21 |
Finally, once again by Lemmas 3 and 4, and recalling , we get (5.19),
| 5.22 |
From Eqs. 5.19–5.22, we obtain,
Theorem 2 follows from Eqs. 5.11, 5.12, 5.17, 5.19 and 5.23. □
Funding
The third author’s research was partially supported by JSPS KAKENHI grant number 18K12758.
Compliance with Ethical Standards
The Author(s) declare(s) that there is no conflict of interest that are relevant to the content of this article.
Footnotes
US Census Bureau and Statistics of Income Division (SOI) of the IRS
The third author’s research was partially supported by JSPS KAKENHI grant number 18K12758.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Malay Ghosh, Email: ghoshm@ufl.edu.
Tamal Ghosh, Email: tamalg@ufl.edu.
Masayo Y. Hirose, Email: masayo@imi.kyushu-u.ac.jp
References
- Basu D. On statistics independent of a complete sufficient statistic. Sankhyā,: The Indian Journal of Statistics (1933-1960) 1955;15:377–380. [Google Scholar]
- Efron B, Morris C. Stein’s estimation rule and its competitors—an empirical bayes approach. Journal of the American Statistical Association. 1973;68:117–130. [Google Scholar]
- Fay RE, Herriot RA. Estimates of income for small places: an application of james-stein procedures to census data. Journal of the American Statistical Association. 1979;74:269–277. doi: 10.1080/01621459.1979.10482505. [DOI] [Google Scholar]
- Ghosh M, Natarajan K, Waller LA, Kim D. Hierarchical bayes glms for the analysis of spatial data: an application to disease mapping. Journal of Statistical Planning and Inference. 1999;75:305–318. doi: 10.1016/S0378-3758(98)00150-5. [DOI] [Google Scholar]
- Ghosh M, Kubokawa T, Kawakubo Y. Benchmarked empirical bayes methods in multiplicative area-level models with risk evaluation. Biometrika. 2015;102:647–659. doi: 10.1093/biomet/asv010. [DOI] [Google Scholar]
- Gonçalves, K. C. and Ghosh, M. (2021). Unit level model for small area estimation with count data under square root transformation. Brazilian Journal of Statistics and Probability In Press.
- Hirose, MY, Ghosh, M and Ghosh, T (2021). Arc-sin transformation for binomial sample proportions in small area estimation. Statistica Sinica Preprint No: SS-2020-0446. 10.5705/ss.202020.0446
- Slud EV, Maiti T. Mean-squared error estimation in transformed fay–herriot models. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2006;68(2):239–257. doi: 10.1111/j.1467-9868.2006.00542.x. [DOI] [Google Scholar]
- Sugasawa S, Kubokawa T. Transforming response values in small area prediction. Computational Statistics & Data Analysis. 2017;114:47–60. doi: 10.1016/j.csda.2017.03.017. [DOI] [Google Scholar]
- Yu G. Variance stabilizing transformations of poisson, binomial and negative binomial distributions. Statistics & Probability Letters. 2009;79(14):1621–1629. doi: 10.1016/j.spl.2009.04.010. [DOI] [Google Scholar]


