NIHPA Author Manuscript. Available in PMC: 2013 Mar 26.
Published in final edited form as: J Stat Plan Inference. 2012 Nov;142(11):2965–2975. doi: 10.1016/j.jspi.2012.04.017

Two-way model with random cell sizes

Steven F Arnold a, Panagis G Moschopoulos b,*
PMCID: PMC3608410  NIHMSID: NIHMS442817  PMID: 23538487

Abstract

We consider inference for row effects in the presence of possible interactions in a two-way fixed effects model when the numbers of observations are themselves random variables. Let Nij be the number of observations in the (i, j) cell, πij be the probability that a particular observation is in that cell and μij be the expected value of an observation in that cell. We assume that the {Nij} have a joint multinomial distribution with parameters n and {πij}. Then μ̄i. = Σj πijμij/Σj πij is the expected value of a randomly chosen observation in the ith row. Hence, we consider testing that the μ̄i. are equal. With the {πij} unknown, there is no obvious sum of squares and F-ratio computed by the widely available statistical packages for testing this hypothesis. Let Ȳi.. be the sample mean of the observations in the ith row. We show that Ȳi.. is an MLE of μ̄i., is consistent and is conditionally unbiased. We then find the asymptotic joint distribution of the Ȳi.. and use it to construct a sensible asymptotic size α test of the equality of the μ̄i. and asymptotic simultaneous (1 − α) confidence intervals for contrasts in the μ̄i..

Keywords: Two-way model, Main effects, Analysis of variance, Unbalanced data, Multinomial cell sizes

1. Introduction

The two-way fixed-effects model for unbalanced data has received considerable attention in the past 30 years. There has been much discussion of the definition of the effects, the calculation of sums of squares and the definitions of hypotheses. The model is the subject of many texts, prominent among them Searle (1971, 1982), Graybill (1976), Scheffé (1959), and Arnold (1981). Several decompositions of sums of squares are presented in ANOVA tables, and testing the parameters of the model is routinely accomplished by subtracting sums of squares to isolate the contribution of a certain parameter. Searle's (1971) R-notation, for example R(α | μ, β), is usually involved in testing the contribution of including α after μ and β in the model. The R-notation implies orthogonalizations of the design matrix that lead to sums of squares testing a host of hypotheses that depend on the cell sizes. Many of these hypotheses have interpretation problems, and several articles have been written on the hypotheses implied by Searle's R(· | ·) notation; see, e.g., Speed et al. (1978) and Kutner (1974); for an extensive list see Macnaughton (1998).

This paper considers the same two-way model under the assumption that the overall sample size n is fixed but the cell sizes are themselves random variables following the multinomial distribution with unknown parameters πij. The setting was introduced in Moschopoulos and Davidson (1985). Applications of the new setting arise mainly in survey studies. In these studies it is often not possible to sample from the individual cells. Instead, a sample of size n is taken from the overall population and is divided after the fact into the two-way cells according to sample characteristics. The following two examples will help in understanding this setting:

Example 1

A random sample of size n of women is drawn from a metropolitan area for the purpose of studying fertility, measured by the number of children born to a woman (the response y-variable). It is assumed that fertility depends on religion, the row factor (Catholic (1) or Protestant (2)), and educational level, the column factor (Elementary (1), High School (2), College (3)). Sampling 300 women from the overall population results in 2 × 3 = 6 cell sizes nij, i = 1, 2; j = 1, 2, 3, that are random variables following the multinomial distribution (for large n). Note that sampling from each of the six classifications is not possible, since religion and education are only known after the sample is drawn. In this example, let i represent the woman's religion and j represent her educational level. In this situation, π21 represents the proportion of women who are Protestant with Elementary education; μ12 represents the expected number of children of Catholic women with High School education; μ̄2. represents the expected number of children of Protestant women averaged across educational levels. In testing that the μ̄i., i = 1, 2, are equal in this setting, we are testing the equality of the mean number of children for the two religions averaged over educational levels. Testing for no column effects with these weights is testing equality between educational levels averaged over religion. For a real situation related to this, see Groat and Neal (1967).

Example 2

Consider an academic class of size n, viewed as a random sample from a population of similar students, with the response y being the student's performance on an examination. The purpose of the investigation here is to relate class performance to student anxiety, the row factor (High (1), Moderate (2), Low (3)), and student attitude towards the subject, the column factor (Positive (1), Negative (2)). Note that sampling from each of the six classifications is not possible here, as the student classifications are known only after the sample is drawn. Thus, the cell sizes are random, following the multinomial distribution (for large n). As in the first example, testing the equality of the μ̄i., i = 1, 2, 3, is testing the equality of performance of High, Moderate and Low anxiety students averaged over student attitudes. For a real situation on this, see Galassi et al. (1981).

Random cell sizes are also considered in psychology, but from a different perspective: there the random cell sizes arise as a result of the underlying treatment; see Weiss (1999, 2006).

As stated above, the two-way model with fixed effects and unbalanced data is not free of controversies. The main problems are the following: (1) the definition of main effects, i.e., row, column and interaction effects, and (2) the hypotheses tested by typical ANOVA decompositions and the interpretations of such hypotheses. Concerning (1), it turns out that in the case of the two-way model with multinomially distributed cell sizes we can provide a natural definition of main effects. This definition is not arbitrary but is a natural consequence of the sampling scheme, and it is expressed in terms of parameters (i.e., the cell population proportions and the cell population means); see the next section. As for (2), we are concerned here with one particular hypothesis, namely the equality of row (column) means. Let μij, i = 1, …, r; j = 1, …, c, be the mean of cell (i, j), and let πij be the probability of obtaining an observation in cell (i, j). We show that the ith row mean is

\bar{\mu}_{i.} = \sum_{j=1}^{c} \frac{\pi_{ij}}{\pi_{i.}}\,\mu_{ij},  (1.1)

where \pi_{i.} = \sum_{j=1}^{c} \pi_{ij}. The hypothesis of equality of row means is

H_0: \bar{\mu}_{i.} \ \text{equal}, \quad i = 1, \ldots, r.  (1.2)

The hypothesis (1.2) above is equivalent to the hypothesis that the α-main effects (defined in Section 2) are zero. This hypothesis is well known in the literature, but there the means are defined with cell-size weights, i.e. (see Searle, 1971),

\tilde{\mu}_i = \sum_{j=1}^{c} \frac{N_{ij}}{N_{i.}}\,\mu_{ij}.  (1.3)

Obviously, when the cell sizes are random variables, testing the equality of the means in (1.3) is not appropriate. Testing the correct hypothesis (1.2) is the second goal of this paper. However, no test derived from standard ANOVA sums-of-squares decompositions is appropriate for testing (1.2). We develop an asymptotic (n → ∞) test for (1.2) and asymptotic 100(1 − α)% confidence intervals for contrasts of row means. In addition, we provide numerical evaluations of the performance of the proposed test under the null hypothesis. Under the multinomial assumption for the cell sizes a cell size may be zero; the paper rigorously treats the case of empty cells.
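To make the contrast between (1.1) and (1.3) concrete, the following sketch (with illustrative cell probabilities and means of our own choosing, not taken from the paper) computes the fixed population row means of (1.1) and one realization of the random cell-size-weighted means of (1.3):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical 2 x 3 design (illustrative values, not from the paper).
pi = np.array([[0.2, 0.3, 0.1],
               [0.1, 0.1, 0.2]])        # cell probabilities pi_ij (sum to 1)
mu = np.array([[50.0, 15.0, 20.0],
               [35.0, 27.5, 23.75]])    # cell means mu_ij

# Population row means (1.1): fixed parameters, with weights pi_ij / pi_i.
row_means = (pi * mu).sum(axis=1) / pi.sum(axis=1)

# Cell-size-weighted means (1.3): random, because the N_ij are multinomial.
n = 100
N = rng.multinomial(n, pi.ravel()).reshape(pi.shape)
sample_weighted = (N * mu).sum(axis=1) / N.sum(axis=1)

print(row_means)        # here both rows equal 27.5 by construction
print(sample_weighted)  # varies from sample to sample
```

Redrawing N shows (1.3) fluctuating around (1.1), which is why the hypotheses of interest here concern (1.1) rather than (1.3).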

2. The two-way model with random cell sizes

The two-way model is often written in the following over-parameterized form that includes terms for main effects and interactions. We observe independent Yijk,

Y_{ijk} \sim N(\theta + \alpha_i + \beta_j + \gamma_{ij},\, \sigma^2), \quad i = 1, \ldots, r;\ j = 1, \ldots, c;\ k = 1, \ldots, N_{ij},  (2.1)

where θ, {αi}, {βj}, {γij} and σ2 are unknown parameters. In order to make these parameters identifiable, weights wij ≥ 0 are often chosen and it is assumed in addition that

\sum_i\sum_j w_{ij}\alpha_i = 0, \qquad \sum_i\sum_j w_{ij}\beta_j = 0, \qquad \sum_i w_{ij}\gamma_{ij} = 0, \qquad \sum_j w_{ij}\gamma_{ij} = 0.  (2.2)

It is well known that the problem of testing that the interactions γij = 0 does not depend on the weights, nor does the problem of testing that the main effects αi = 0 when it is assumed that the γij = 0 (e.g. see Arnold, 1981, pp. 93–96). Unfortunately, however, the problem of testing that the αi = 0 when the γij may be non-zero depends on the weights chosen. Different weights lead to different hypotheses. Let

\mu_{ij} = \theta + \alpha_i + \beta_j + \gamma_{ij}, \qquad \bar{\mu}_i^W = \sum_j w_{ij}\mu_{ij} \Big/ \sum_j w_{ij}.  (2.3)

Then testing that αi = 0 with the weights wij is the same as testing the equality of the μ¯iW. Therefore, the weights wij represent the relative importance of the observations in the ith row, jth cell out of all the observations in the ith row.

Now, suppose that the individuals observed are themselves a sample of size n (= Σi Σj Nij) from a very large population. In that case, we may assume that the cell sizes {Nij} have a joint multinomial distribution with parameters n and {πij}, written as

\{N_{ij}\} \sim M_{rc}(n, \{\pi_{ij}\}),

where πij is the probability that a randomly chosen individual is in the ith row and the jth column. We assume that the πij are unknown parameters for this model. In this case, naturally, we use the weights

w_{ij} = \pi_{ij}.  (2.4)

Using these weights, we see that the conditional expectation of an observation in row i is

E(Y_{ijk} \mid \text{row } i) = \sum_{j=1}^{c} \Pr(\text{column } j \mid \text{row } i)\, E(Y_{ijk} \mid \text{row } i, \text{column } j) = \sum_{j=1}^{c} \frac{\pi_{ij}}{\pi_{i.}}\,\mu_{ij} = \bar{\mu}_{i.}  (2.5)

since

P(\text{column } j \mid \text{row } i) = \eta_{ij} = \pi_{ij} \Big/ \sum_j \pi_{ij} = \frac{\pi_{ij}}{\pi_{i.}}.  (2.6)

Therefore, in testing the equality of the μ̄i. we are testing that the expected value of the response variable is the same in all the rows.

Note

The expectation in (2.5) is approximate if some cells in row i are empty; the case of empty cells is treated in Section 3. It is now interesting to note that if the artificial parameters θ, αi, βj and γij above are defined using the weights wij = πij, then

\theta = \sum_i\sum_j \pi_{ij}\mu_{ij}, \qquad \alpha_i = \bar{\mu}_{i.} - \theta, \qquad \beta_j = \bar{\mu}_{.j} - \theta, \qquad \gamma_{ij} = \mu_{ij} - \bar{\mu}_{i.} - \bar{\mu}_{.j} + \theta.  (2.7)
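A quick numerical check of (2.7), with illustrative πij and μij of our own choosing: the cell means are recovered exactly, and the α- and β-constraints of (2.2) with wij = πij hold by construction.

```python
import numpy as np

# Illustrative (hypothetical) cell probabilities and cell means.
pi = np.array([[0.2, 0.3, 0.1],
               [0.1, 0.1, 0.2]])
mu = np.array([[50.0, 15.0, 20.0],
               [35.0, 27.5, 60.0]])

pi_row = pi.sum(axis=1)                      # pi_i.
pi_col = pi.sum(axis=0)                      # pi_.j
mu_row = (pi * mu).sum(axis=1) / pi_row      # row means, as in (2.5)
mu_col = (pi * mu).sum(axis=0) / pi_col      # column means

# The parameters of (2.7), defined with weights w_ij = pi_ij.
theta = (pi * mu).sum()
alpha = mu_row - theta
beta = mu_col - theta
gamma = mu - mu_row[:, None] - mu_col[None, :] + theta

# Cell means recovered exactly: mu_ij = theta + alpha_i + beta_j + gamma_ij.
assert np.allclose(theta + alpha[:, None] + beta[None, :] + gamma, mu)
# Alpha- and beta-constraints of (2.2) hold by construction.
assert abs((pi * alpha[:, None]).sum()) < 1e-12
assert abs((pi * beta[None, :]).sum()) < 1e-12
```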

3. Basic results

For simplicity we consider here the cell means μij instead of the over-parameterized model. Let Yijk be independent,

Y_{ijk} \sim N(\mu_{ij}, \sigma^2), \quad i = 1, \ldots, r;\ j = 1, \ldots, c;\ k = 1, \ldots, N_{ij},

where

\{N_{ij}\} \sim M_{rc}(n, \{\pi_{ij}\}),

i.e., the {Nij} are (jointly) multinomially distributed with parameters n and {πij}. We assume that

\pi_{ij} > 0 \quad \text{for all } i \text{ and } j

and that the πij are unknown and must be estimated from the data, but that n is known and fixed in advance. It is important to remember that the Nij are random variables, and may in fact be 0.

Let q be the number of cells (i,j) in which Nij > 0. (Note that q is a random variable dependent on the Nij). Let

\bar{Y}_{ij.} = \sum_k Y_{ijk}/N_{ij} \ \text{if } N_{ij} > 0, \qquad \bar{Y}_{ij.} = 0 \ \text{if } N_{ij} = 0,  (3.1)
S^2 = \sum_i\sum_j\sum_k (Y_{ijk} - \bar{Y}_{ij.})^2 / (n - q).  (3.2)

Then it is easily shown that the sets {Ȳij.}, {Nij} together with S2 form a sufficient statistic for this model and that Ȳij., Nij/n and (n − q)S2/n are the MLE's of μij, πij and σ2 respectively. (Note that the MLE for μij is only unique if Nij > 0.) Furthermore, Nij/n is an unbiased estimator of πij. Unbiased estimators for the μij do not appear possible, due to the possibility that Nij = 0. Finally, note that conditionally on q,

(n-q)S^2/\sigma^2 \mid q \sim \chi^2_{n-q}  (3.3)

so that S2 is consistent and unbiased.
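The estimators of this section are easy to compute from data. A simulation sketch (hypothetical parameter values; the variable names are ours):

```python
import numpy as np

rng = np.random.default_rng(1)

pi = np.array([[0.2, 0.3, 0.1],
               [0.1, 0.1, 0.2]])        # cell probabilities (illustrative)
mu = np.array([[50.0, 15.0, 20.0],
               [35.0, 27.5, 23.75]])    # cell means (illustrative)
sigma, n = 15.0, 200

# Multinomial cell sizes, then normal responses within each cell.
N = rng.multinomial(n, pi.ravel()).reshape(pi.shape)
cells = {(i, j): rng.normal(mu[i, j], sigma, N[i, j])
         for i in range(2) for j in range(3)}

# Cell means (3.1), with the convention Ybar_ij. = 0 for empty cells.
Ybar = np.array([[cells[i, j].mean() if N[i, j] > 0 else 0.0
                  for j in range(3)] for i in range(2)])

q = int((N > 0).sum())                  # number of non-empty cells
S2 = sum(((cells[i, j] - Ybar[i, j]) ** 2).sum()
         for i in range(2) for j in range(3)) / (n - q)     # (3.2)

pi_hat = N / n                          # unbiased MLE of the pi_ij
sigma2_mle = (n - q) * S2 / n           # MLE of sigma^2
```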

Now, with μ̄i. as in (2.5) and πi. and ηij as in (2.6), let Ni = ΣjNij and

\hat{\pi}_{i.} = N_i/n, \qquad \hat{\eta}_{ij} = N_{ij}/N_i \ \text{if } N_i > 0 \quad \text{and} \quad \hat{\eta}_{ij} = 0 \ \text{if } N_i = 0

be MLE’s of πi. and ηij respectively. Further, let

\bar{Y}_{i..} = \sum_j \hat{\eta}_{ij}\bar{Y}_{ij.}.

Note that Ȳi.. is the MLE for μ̄i. by the invariance principle, and is unique as long as Ni > 0. Note also that as long as Ni > 0,

\bar{Y}_{i..} = \sum_j\sum_k Y_{ijk}/N_i.  (3.4)

Finally, note that Ȳi.. has a point mass at 0, so it is not a continuous random variable:

P(\bar{Y}_{i..} = 0) = P(N_i = 0) = (1 - \pi_{i.})^n.  (3.5)
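The point mass in (3.5) vanishes quickly with n. A quick check, with illustrative values of πi.:

```python
# P(Ybar_i.. = 0) = P(N_i = 0) = (1 - pi_i.)^n, per (3.5).
for pi_row in (0.1, 0.3):
    for n in (30, 100):
        print(f"pi_i. = {pi_row}, n = {n}: {(1 - pi_row) ** n:.3e}")
```

Even for a small row probability πi. = 0.1, the point mass is about 4% at n = 30 and negligible for the sample sizes used in Section 4.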

The distribution of Ȳi..

We are primarily interested in inference about the μ̄i.; in particular, in testing that the μ̄i. are all equal. In order to draw inference about the μ̄i., we need to learn about the distribution of Ȳi... For notational convenience we let

\eta_i = (\eta_{i1}, \ldots, \eta_{ic})', \qquad \hat{\eta}_i = (\hat{\eta}_{i1}, \ldots, \hat{\eta}_{ic})', \qquad m_i = (\mu_{i1}, \ldots, \mu_{ic})'.

The following random variables are useful in our development here and represent row-means weighted by cell sizes. For i = 1, …, r, let

\tilde{\mu}_i = \sum_j \frac{N_{ij}}{N_i}\,\mu_{ij} = \sum_j \hat{\eta}_{ij}\mu_{ij} = m_i'\hat{\eta}_i.  (3.6)

We note here that testing for main effects in the presence of interaction using the usual conditional F-test entails testing the hypothesis that the μ̃i are equal; see Searle (1971, pp. 292–231) and Arnold (1981, pp. 93–96). This hypothesis makes little sense when the cell sizes are random variables. Define

e_{ijk} = Y_{ijk} - \mu_{ij}, \qquad \bar{e}_{i..} = \sum_j\sum_k e_{ijk}/N_i \ \text{if } N_i > 0, \qquad \bar{e}_{i..} = 0 \ \text{if } N_i = 0.  (3.7)

If Ni > 0, then

\bar{Y}_{i..} - \tilde{\mu}_i = \sum_j\sum_k (Y_{ijk} - \mu_{ij})/N_i = \bar{e}_{i..}.  (3.8)

When Ni = 0,

\bar{Y}_{i..} = \tilde{\mu}_i = \bar{e}_{i..} = 0.

Therefore, for all Ni,

\bar{Y}_{i..} = \tilde{\mu}_i + \bar{e}_{i..} = \bar{e}_{i..} + m_i'\hat{\eta}_i.  (3.9)

Lemma 1

Conditionally on Ni, ēi.. and η̂i are independent.

Proof

If Ni = 0, then both ēi.. and η̂i are degenerate and hence independent. If Ni > 0, then ēi.. is the average of all Ni errors in the ith row, irrespective of the cells they are in. Since the errors are independent and identically distributed, ēi.. does not depend on the relative proportions in the columns, and hence not on η̂i.

This result implies that conditionally on N⃗ = (N1, N2, …, Nr), the Ȳi.., i = 1, …, r, are not normally distributed: by (3.9), Ȳi.. is the sum of two independent random variables, and if it were normal then both components would have to be normal (by Cramér's theorem), which is obviously not the case. This result is important for our development for the following reasons. The definition of μ̄i. depends only on the μij and ηij, and the distribution of N⃗ depends only on the πi.. Therefore, N⃗ is an ancillary statistic for this problem and it makes sense to work conditionally on N⃗. Note also that the Nij are not ancillary for the μ̄i., so we should not work conditionally on the Nij.

Comment

Working conditionally on N⃗ = (N1, N2, …, Nr) is of course equivalent to making inferences about r means of non-normal sub-populations, the overall population being a mixture of these r sub-populations with mixing probabilities πi.. This leads to an approximate test statistic for large n and essentially reduces the two-way problem to a 'one-way problem'. However, preliminary numerical evaluations showed that the 'heuristic' one-way ANOVA test of equality of row means is totally inappropriate here, because the distribution of the row sample means is not normal. Any test statistic for testing the hypothesis that the μ̄i. in (2.5) are equal must rely on the distribution of the row sample means. We first study the conditional distribution of the Ȳi.., given N⃗.

Theorem 1

  1. In the conditional distribution given N⃗, the Ȳi.., i = 1, …, r are independent.

  2. If Ni = 0, then E(Ȳi.. | N⃗) = Var(Ȳi.. | N⃗) = 0.

  3. If Ni > 0, then
    E(\bar{Y}_{i..} \mid \vec{N}) = \bar{\mu}_{i.}, \qquad \mathrm{Var}(\bar{Y}_{i..} \mid \vec{N}) = \delta_i\sigma^2/N_i,
    where
    \delta_i = 1 + \sum_j \eta_{ij}(\mu_{ij} - \bar{\mu}_{i.})^2/\sigma^2.  (3.10)

Proof

  1. Conditionally on N⃗, the ēi.. are independent, as are the η̂i, i = 1, …, r. Hence, using the lemma, the \bar{Y}_{i..} = \bar{e}_{i..} + m_i'\hat{\eta}_i are also independent.

  2. When Ni = 0, then Ȳi.. is degenerate at 0, so these results follow.

  3. If Ni > 0, then Niη̂i conditional on N⃗ has the following multinomial distribution:
    N_i\hat{\eta}_i \mid \vec{N} \sim M_c(N_i, \eta_i).  (3.11)

Now,

E(\bar{e}_{i..} \mid \vec{N}) = 0, \qquad E(\tilde{\mu}_i \mid \vec{N}) = m_i'E(\hat{\eta}_i \mid \vec{N}) = m_i'\eta_i = \bar{\mu}_{i.}  (3.12)

and therefore,

E(\bar{Y}_{i..} \mid \vec{N}) = E(\bar{e}_{i..} \mid \vec{N}) + E(\tilde{\mu}_i \mid \vec{N}) = 0 + \bar{\mu}_{i.}.  (3.13)

To see the formula for the variance, note that

\mathrm{Var}(\bar{e}_{i..} \mid \vec{N}) = \sigma^2/N_i

and using the variance–covariance matrix of the multinomial proportions

\hat{\eta}_i = (\hat{\eta}_{i1}, \ldots, \hat{\eta}_{ic})'

we obtain

\mathrm{Var}(\tilde{\mu}_i \mid \vec{N}) = m_i'\big(\mathrm{Cov}(\hat{\eta}_i \mid \vec{N})\big)m_i = m_i'V_im_i/N_i,

where Vi has (j, g) element Vijg given by

V_{ijj} = \eta_{ij}(1 - \eta_{ij}), \qquad V_{ijg} = -\eta_{ij}\eta_{ig}, \quad j \neq g.  (3.14)

Since ēi.. and μ̃i are independent (conditionally on N⃗)

\mathrm{Var}(\bar{Y}_{i..} \mid \vec{N}) = \mathrm{Var}(\bar{e}_{i..} \mid \vec{N}) + \mathrm{Var}(\tilde{\mu}_i \mid \vec{N}) = (\sigma^2/N_i)(1 + m_i'V_im_i/\sigma^2) = \sigma^2\delta_i/N_i

and the theorem is proved.
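A Monte Carlo check of Theorem 1(3) for a single row (hypothetical ηij, μij and σ of our own choosing): conditionally on Ni, the column split follows the multinomial in (3.11), and the simulated mean and variance of Ȳi.. should match μ̄i. and δiσ²/Ni.

```python
import numpy as np

rng = np.random.default_rng(2)

eta = np.array([0.5, 0.3, 0.2])    # eta_ij for one row (illustrative)
mu = np.array([50.0, 15.0, 20.0])  # mu_ij for that row (illustrative)
sigma, Ni = 10.0, 40               # condition on the row total N_i

mu_bar = eta @ mu                                    # row mean, as in (2.5)
delta = 1 + eta @ (mu - mu_bar) ** 2 / sigma ** 2    # delta_i of (3.10)

reps = 100_000
draws = np.empty(reps)
for r in range(reps):
    cols = rng.multinomial(Ni, eta)                  # N_ij | N_i, per (3.11)
    y = rng.normal(np.repeat(mu, cols), sigma)       # responses in the row
    draws[r] = y.mean()                              # Ybar_i.. given N_i

print(draws.mean(), mu_bar)                  # conditional unbiasedness
print(draws.var(), delta * sigma ** 2 / Ni)  # conditional variance
```

Here δi ≈ 3.75, so the variance is almost four times the naive σ²/Ni; the extra term reflects the random column split within the row.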

Note from part (3) that Ȳi.. is a conditionally unbiased estimator of μ̄i. when Ni > 0, but it is not unconditionally unbiased. In fact,

E\bar{Y}_{i..} = \bar{\mu}_{i.}P(N_i > 0) = \bar{\mu}_{i.}(1 - (1 - \pi_{i.})^n).

The asymptotic distribution of Ȳi.. is given in the following theorem.

Theorem 2

  1. Ȳi.. is a consistent estimator of μ̄i..

  2. N_i^{1/2}(\bar{Y}_{i..} - \bar{\mu}_{i.}) \mid \vec{N} \xrightarrow{d} U_i \sim N(0, \delta_i\sigma^2) as Ni → ∞.

Proof

(a) Note first that πi. > 0 so that

P(N_i > 0) = 1 - (1 - \pi_{i.})^n \to 1 \quad \text{as } n \to \infty.

Therefore,

E\bar{Y}_{i..} = E(E(\bar{Y}_{i..} \mid \vec{N})) = \bar{\mu}_{i.}P(N_i > 0) \to \bar{\mu}_{i.} \quad \text{as } n \to \infty.

To show consistency we need to show that Var(Ȳi..) → 0. Recall that

\mathrm{Var}(\bar{Y}_{i..}) = \mathrm{Var}(E(\bar{Y}_{i..} \mid \vec{N})) + E(\mathrm{Var}(\bar{Y}_{i..} \mid \vec{N})).  (3.15)

Now,

\mathrm{Var}(E(\bar{Y}_{i..} \mid \vec{N})) = \bar{\mu}_{i.}^2\,P(N_i > 0)(1 - P(N_i > 0)) \to \bar{\mu}_{i.}^2 \cdot 1 \cdot (1 - 1) = 0.

Finally,

E(\mathrm{Var}(\bar{Y}_{i..} \mid \vec{N})) = \delta_i\sigma^2\,E(N_i^{-1};\, N_i > 0) = \delta_i\sigma^2 \sum_{m=1}^{n} m^{-1}P(N_i = m) = \delta_i\sigma^2 \sum_{m=1}^{n} m^{-1}\binom{n}{m}\pi_{i.}^m(1-\pi_{i.})^{n-m} = \frac{\delta_i\sigma^2}{(n+1)\pi_{i.}} \sum_{m=1}^{n}\Big(1+\frac{1}{m}\Big)\binom{n+1}{m+1}\pi_{i.}^{m+1}(1-\pi_{i.})^{n-m} \le \frac{2\,\delta_i\sigma^2}{(n+1)\pi_{i.}} \to 0 \quad \text{as } n \to \infty.

Therefore, Var(Ȳi..) and the bias of Ȳi.. converge to 0, and Ȳi.. is consistent.

(b) By the asymptotic normality of multinomial proportions,

N_i^{1/2}(\hat{\eta}_i - \eta_i) \mid \vec{N} \xrightarrow{d} W_i \sim N_c(0, V_i),

where Vi is given in (3.14). Therefore,

N_i^{1/2}(\tilde{\mu}_i - \bar{\mu}_{i.}) \mid \vec{N} = N_i^{1/2}m_i'(\hat{\eta}_i - \eta_i) \mid \vec{N} \xrightarrow{d} N(0, m_i'V_im_i).

Now, ēi.. is the average of all the errors in the ith row, so that

N_i^{1/2}\bar{e}_{i..} \mid \vec{N} \sim N(0, \sigma^2).

By (3.9) and the lemma above, conditionally on N⃗, N_i^{1/2}(\tilde{\mu}_i - \bar{\mu}_{i.}) and N_i^{1/2}\bar{e}_{i..} are independent. Therefore,

N_i^{1/2}(\bar{Y}_{i..} - \bar{\mu}_{i.}) \mid \vec{N} = \big(N_i^{1/2}\bar{e}_{i..} + N_i^{1/2}(\tilde{\mu}_i - \bar{\mu}_{i.})\big) \mid \vec{N} \xrightarrow{d} N(0, \sigma^2 + m_i'V_im_i).

However,

\sigma^2 + m_i'V_im_i = \sigma^2\delta_i

and this proves the theorem.

Corollary

\vec{Q}_n = n^{1/2}(\bar{Y}_{1..} - \bar{\mu}_{1.}, \ldots, \bar{Y}_{r..} - \bar{\mu}_{r.})' \xrightarrow{d} \vec{Q} \sim N_r(0, \sigma^2 T), as n → ∞, where T is a diagonal matrix whose ith element is δi/πi..

Proof

Note first that π̂i. = Ni/n → πi. a.s. (almost surely). Therefore, Ni → ∞ a.s., so that we can use the previous theorem. From that theorem and the conditional independence of the Ȳi..,

\vec{P}_n = \big(N_1^{1/2}(\bar{Y}_{1..} - \bar{\mu}_{1.}), \ldots, N_r^{1/2}(\bar{Y}_{r..} - \bar{\mu}_{r.})\big)' \mid \vec{N} \xrightarrow{d} \vec{P} \sim N_r(0, \sigma^2 M),

where M is a diagonal matrix whose i-th diagonal element is δi. Since this limiting distribution does not depend on N⃗, unconditionally we have

\vec{P}_n \xrightarrow{d} \vec{P}.

Now, π̂i. is a consistent estimator of πi.. Therefore, by Slutsky's theorem,

\vec{Q}_n = (\hat{\pi}_{1.}^{-1/2}P_{n1}, \ldots, \hat{\pi}_{r.}^{-1/2}P_{nr})' \xrightarrow{d} (\pi_{1.}^{-1/2}P_1, \ldots, \pi_{r.}^{-1/2}P_r)' \sim N_r(0, \sigma^2 T).

Now consider testing the hypothesis

H_0: \bar{\mu}_{i.} \ \text{equal for all } i = 1, \ldots, r.  (3.16)

If the normal distribution in the corollary were exact and the δi/πi. were known, then the model would be a generalized linear model, and this hypothesis would be tested using the following (see (3.3) and recall that q is the number of non-empty cells):

F = \frac{\sum_i n\pi_{i.}(\bar{Y}_{i..} - \tilde{Y})^2/\delta_i}{(r-1)S^2},  (3.17)

where

\tilde{Y} = \sum_i(\pi_{i.}\bar{Y}_{i..}/\delta_i) \Big/ \sum_i(\pi_{i.}/\delta_i)

and under the null hypothesis

F \sim F_{r-1,\,n-q}.  (3.18)

In addition, Scheffe’ simultaneous confidence intervals for the set of contrasts between the μ̄i. (functions Σciμ̄i., Σci = 0) are given by

\sum_i c_i\bar{\mu}_{i.} \in \sum_i c_i\bar{Y}_{i..} \pm S\,[(r-1)F^\alpha]^{1/2}\Big[\sum_i c_i^2\delta_i/(n\pi_{i.})\Big]^{1/2},  (3.19)

where F^α is the upper α point of the F-distribution defined in (3.18). Of course, the δi and πi. are not known in practice, so we estimate them in the obvious way. Let

\hat{\pi}_{i.} = N_i/n, \qquad \hat{\delta}_i = 1 + \sum_j \hat{\eta}_{ij}(\bar{Y}_{ij.} - \bar{Y}_{i..})^2/S^2.

(By the invariance principle, π̂i. and δ̂i are the MLE’s of πi. and δi). We suggest using the test statistic:

\hat{F}_n = \frac{\sum_i n\hat{\pi}_{i.}(\bar{Y}_{i..} - \hat{Y})^2/\hat{\delta}_i}{(r-1)S^2} = \frac{\sum_i N_i(\bar{Y}_{i..} - \hat{Y})^2/\hat{\delta}_i}{(r-1)S^2},  (3.20)

where

\hat{Y} = \sum_i(\hat{\pi}_{i.}\bar{Y}_{i..}/\hat{\delta}_i) \Big/ \sum_i(\hat{\pi}_{i.}/\hat{\delta}_i).  (3.21)

Since π̂i. and δ̂i are consistent (see the proof below), F̂n should have an approximate F_{r−1,n−q} distribution under the null hypothesis. Similarly, we suggest the following approximate confidence intervals for contrasts in the μ̄i.:

\sum_i c_i\bar{\mu}_{i.} \in \sum_i c_i\bar{Y}_{i..} \pm S\,[(r-1)F^\alpha]^{1/2}\Big[\sum_i c_i^2\hat{\delta}_i/N_i\Big]^{1/2}.  (3.22)

We finish this section by showing that the test and confidence intervals given above are at least asymptotically correct.

Theorem 3

  1. Let F̂n be defined in Eq. (3.20). Then under the null hypothesis that the μ̄i., i = 1, …, r, are equal,
    (r-1)\hat{F}_n \xrightarrow{d} H \sim \chi^2_{r-1},
    and hence the test which rejects the null hypothesis when
    \hat{F}_n \ge F^\alpha_{r-1,\,n-q}

    is an asymptotic size α test.

  2. The confidence intervals given in Eq. (3.22) are asymptotic 100(1 − α)% simultaneous confidence intervals for the set of contrasts given above.

Proof

  1. Note first that the π̂ij = Nij/n are consistent estimators of the πij, so that the π̂i. and η̂ij are consistent estimators of the πi. and ηij. We have shown that Ȳi.. and S² are consistent estimators of μ̄i. and σ². Therefore, δ̂i is a consistent estimator of δi. Let Q⃗n and Q⃗ be defined as in the corollary above. Let
    \vec{\pi} = (\pi_{1.}, \ldots, \pi_{r.})', \quad \hat{\vec{\pi}}_n = (\hat{\pi}_{1.}, \ldots, \hat{\pi}_{r.})', \quad \vec{\delta} = (\delta_1, \ldots, \delta_r)', \quad \hat{\vec{\delta}}_n = (\hat{\delta}_1, \ldots, \hat{\delta}_r)',
    h(\vec{Q}, \vec{\pi}, \vec{\delta}, \sigma^2) = \sum_i \pi_{i.}(Q_i - \tilde{Q})^2/(\delta_i\sigma^2), \qquad \tilde{Q} = \sum_i(\pi_{i.}Q_i/\delta_i) \Big/ \sum_i(\pi_{i.}/\delta_i).
    By the usual results on the generalized linear model,
    H = h(\vec{Q}, \vec{\pi}, \vec{\delta}, \sigma^2) \sim \chi^2_{r-1}.
    Therefore, by Slutsky's theorem,
    h(\vec{Q}_n, \hat{\vec{\pi}}_n, \hat{\vec{\delta}}_n, S_n^2) \xrightarrow{d} H \sim \chi^2_{r-1}.
    If the μ̄i. are all equal, then
    (r-1)\hat{F}_n = h(\vec{Q}_n, \hat{\vec{\pi}}_n, \hat{\vec{\delta}}_n, S_n^2) \xrightarrow{d} H \sim \chi^2_{r-1}.
    Now, q ≤ rc, so that n − q → ∞ and hence (r-1)F^\alpha_{r-1,\,n-q} \to \chi^2_{r-1,\alpha}. Therefore, the probability of rejecting under the null hypothesis is
    P\big(h(\vec{Q}_n, \hat{\vec{\pi}}_n, \hat{\vec{\delta}}_n, S_n^2) - (r-1)F^\alpha_{r-1,\,n-q} > 0\big) \to P(H - \chi^2_{r-1,\alpha} > 0) = \alpha.
  2. The simultaneous confidence intervals given in Eq. (3.22) all hold if and only if
    h(\vec{Q}_n, \hat{\vec{\pi}}_n, \hat{\vec{\delta}}_n, S_n^2) \le (r-1)F^\alpha_{r-1,\,n-q}.

    The remainder of the argument is similar to part (a).
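For reference, the test statistic of (3.20)–(3.21) can be implemented in a few lines. This is a sketch in NumPy; the function name and the `cells` dictionary layout are our own, and each row total Ni is assumed positive.

```python
import numpy as np

def row_effects_test(cells, r, c):
    """Compute F_hat of (3.20) for testing equal row means (3.16).
    `cells[i, j]` is the 1-D array of responses in cell (i, j).
    Returns (F_hat, df1, df2); assumes every row total N_i > 0."""
    N = np.array([[len(cells[i, j]) for j in range(c)] for i in range(r)])
    n, q = int(N.sum()), int((N > 0).sum())
    # Cell means (3.1), with the 0 convention for empty cells.
    Ybar = np.array([[cells[i, j].mean() if N[i, j] else 0.0
                      for j in range(c)] for i in range(r)])
    S2 = sum(((cells[i, j] - Ybar[i, j]) ** 2).sum()
             for i in range(r) for j in range(c)) / (n - q)   # (3.2)
    Ni = N.sum(axis=1)
    Yrow = (N * Ybar).sum(axis=1) / Ni                        # Ybar_i..
    eta_hat = N / Ni[:, None]
    delta_hat = 1 + (eta_hat * (Ybar - Yrow[:, None]) ** 2).sum(axis=1) / S2
    w = Ni / delta_hat                                        # N_i / delta_i
    Yhat = (w * Yrow).sum() / w.sum()                         # (3.21)
    F = (w * (Yrow - Yhat) ** 2).sum() / ((r - 1) * S2)       # (3.20)
    return F, r - 1, n - q
```

Reject H0 of (3.16) at level α when the returned F̂n exceeds the upper-α point of F(r − 1, n − q); the intervals (3.22) are built from the same Ni, δ̂i and S².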

Some additional comments

  1. For ease in defining the model, we assumed that all πij > 0. A careful reading of the paper, however, shows that we have only used πi. > 0. If πi. = 0, we can simply drop the ith row from the design, so this is really no restriction.

  2. Again, for simplicity, we have assumed that the Yijk are normally distributed. However, the results hold in greater generality. In particular, the proof of Theorem 1 only uses
    E Y_{ijk} = \mu_{ij}, \qquad \mathrm{var}(Y_{ijk}) = \sigma^2.
    For Theorems 2 and 3, all we need is that
Y_{ijk} = \mu_{ij} + e_{ijk},
    where eijk are independently identically distributed with mean 0 and with finite variance σ2. We have only used the normality to establish that
N_i^{1/2}\bar{e}_{i..} \xrightarrow{d} R \sim N(0, \sigma^2),

    which can be established under these more general conditions using the central limit theorem.

  3. For simplicity, we have assumed that we were sampling from a very large population, so that the distribution of the Nij could be taken to be multinomial. If we sample from a smaller population we should replace the multinomial distribution with a multivariate hypergeometric distribution. Theorem 1 carries over to the multivariate hypergeometric (with population size m) if we replace the covariance matrix Vi of the multinomial with that of the hypergeometric, giving
    \delta_i = 1 + (m-n)\sum_j \eta_{ij}(\mu_{ij} - \bar{\mu}_{i.})^2/((m-1)\sigma^2).

    Theorem 2 can be similarly modified.

  4. As a final generalization, we mention a different sampling scheme. Suppose instead of getting a sample of size n from the whole population, we get independent samples of size Ni from each of the row classifications. As before, let Nij be the number of observations in the (i,j) cell. We assume that
\vec{N}_i = (N_{i1}, \ldots, N_{ic}) \sim M_c(N_i, \eta_i), \qquad \eta_i = (\eta_{i1}, \ldots, \eta_{ic})',
where ηij is the proportion of the ith row class in column class j. Let Ȳij., η̂ij, μ̄i., μ̃i, eijk and ēi.. be defined as in Section 3. The lemma is still true for this model, since it is conditional on the Ni. Similarly, Theorem 1 is conditional on the Ni, so we see that the Ȳi.. are independent (since they come from independent samples) and
E(\bar{Y}_{i..}) = \bar{\mu}_{i.}, \qquad \mathrm{var}(\bar{Y}_{i..}) = \delta_i\sigma^2/N_i.
    (Note that for this model, we do not have to worry about the case Ni = 0.) Hence Ȳi.. is unbiased and consistent for this model. Similarly Theorem 2 is derived conditionally on the Ni, so that we have
N_i^{1/2}(\bar{Y}_{i..} - \bar{\mu}_{i.}) \xrightarrow{d} U_i \sim N(0, \delta_i\sigma^2) \quad \text{as } N_i \to \infty.
    Now suppose that
n = \sum_i N_i, \qquad N_i/n \to \pi_{i.} \quad \text{as } n \to \infty

for some constants πi.. Then Ni/n →p πi. as n → ∞, so that the corollary and Theorem 3 follow as well. Therefore, the procedure derived in Section 3, with πi. replaced by limn→∞(Ni/n), applies equally well to the model considered in this paragraph.

4. Numerical evaluations of the test in (3.20)

The proposed test F̂n in (3.20) was evaluated numerically under the null hypothesis (3.16) of equal row means for several 2 × 2, 2 × 3 and 3 × 4 designs, with 10,000 simulated samples per setting. Tables 1–3 give the models: the cell probabilities, the cell means and the (equal) row means. The assumed interactions were non-zero but non-significant, and the means are plotted in the associated Figs. 1–3. Entries in the tables are the percentage of rejections out of the 10,000 samples for various sample sizes, to be compared with the nominal 5%. As all three tables show, the proposed test, although asymptotic, maintains the correct Type I error rate well.
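These evaluations can be reproduced in outline. The sketch below reruns Table 1, Model I with 2,000 replications (the paper uses 10,000) and, to stay dependency-free, rejects when (r − 1)F̂n exceeds the χ²₁ critical value 3.8415 rather than the F quantile; the design values are from Table 1, while the replication count and critical-value choice are ours.

```python
import numpy as np

rng = np.random.default_rng(3)

# Table 1, Model I: 2 x 2 design with equal row means (30, 30).
pi = np.array([[0.3, 0.4], [0.1, 0.2]])
mu = np.array([[50.0, 15.0], [35.0, 27.5]])
sigma, n, reps = 20.0, 100, 2000
crit = 3.8415                  # chi-square critical value, df = r - 1 = 1

reject = 0
for _ in range(reps):
    N = rng.multinomial(n, pi.ravel()).reshape(2, 2)
    if (N.sum(axis=1) == 0).any():
        continue                               # skip the (rare) empty-row draw
    cells = {(i, j): rng.normal(mu[i, j], sigma, N[i, j])
             for i in range(2) for j in range(2)}
    Ybar = np.array([[cells[i, j].mean() if N[i, j] else 0.0
                      for j in range(2)] for i in range(2)])
    q = int((N > 0).sum())
    S2 = sum(((cells[i, j] - Ybar[i, j]) ** 2).sum()
             for i in range(2) for j in range(2)) / (n - q)   # (3.2)
    Ni = N.sum(axis=1)
    Yrow = (N * Ybar).sum(axis=1) / Ni                        # Ybar_i..
    eta = N / Ni[:, None]
    delta = 1 + (eta * (Ybar - Yrow[:, None]) ** 2).sum(axis=1) / S2
    w = Ni / delta
    Yhat = (w * Yrow).sum() / w.sum()                         # (3.21)
    chi = (w * (Yrow - Yhat) ** 2).sum() / S2                 # (r - 1) F_hat
    reject += chi > crit

print(reject / reps)   # empirical size; should be near the nominal 5%
```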

Table 1.

Type I error rates of the new test for row effects of 2 × 2 factorial designs, with non-significant interaction.

Model | Cell probabilities | Cell means (SD) | Row means | n = 30 | n = 50 | n = 70 | n = 100
I | (.3, .4, .1, .2) | (50, 15, 35, 27.5), SD = 20 | (30, 30) | 4.35 | 4.93 | 4.35 | 4.83
II | (.3, .4, .1, .2) | (50, 30, 5.71, 55), SD = 50 | (38.571, 38.571) | 4.90 | 5.62 | 5.59 | 6.34
III | (.3, .4, .2, .1) | (50, 15, 40, 10), SD = 4 | (30, 30) | 5.23 | 5.92 | 6.35 | 6.52
IV | (.2, .4, .3, .1) | (32, 20, 30, 6), SD = 10 | (24, 24) | 4.95 | 5.84 | 6.24 | 6.71

Table 3.

Type I error rates of the new test, for row effect of 3 × 4 factorial designs, with non-significant interaction.

Model | Cell probabilities | Cell means (SD) | Row means | n = 70 | n = 100 | n = 200
I | (.08, .2, .15, .05, .1, .02, .04, .03, .05, .1, .1, .08) | (50, 15, 20, 10, 25, 27.5, 10, 23.54, 15, 15, 40, 12.11), SD = 40 | (21.88, 21.88, 21.88) | 2.77 | 3.53 | 4.13
II | (.08, .2, .15, .05, .1, .02, .04, .03, .05, .1, .1, .08) | (40, 30, 20, 50, 35, 27.5, 10, 45.63, 30, 15, 20, 63.83), SD = 40 | (30.63, 30.63, 30.63) | 2.81 | 3.46 | 4.07
III | (.08, .05, .15, .05, .1, .16, .04, .03, .05, .11, .1, .08) | (40, 30, 20, 50, 35, 30, 45, 3.33, 20, 15, 30, 60.74), SD = 50 | (30.91, 30.91, 30.91) | 3.17 | 3.65 | 4.11
IV | (.08, .02, .1, .05, .1, .33, .04, .03, .05, .02, .1, .08) | (60, 15, 20, 10, 25, 35, 15, 18.33, 30, 5, 20, 50), SD = 50 | (30.4, 30.4, 30.4) | 3.38 | 3.72 | 4.94

Fig. 1. Graphs of means for 2 × 2 factorial designs.

Fig. 3. Graphs of means for 3 × 4 factorial designs.

Fig. 2. Graphs of means for 2 × 3 factorial designs.

Table 2.

Type I error rates of the new test, for row effect of 2 × 3 factorial designs, with non-significant interaction.

Model | Cell probabilities | Cell means (SD) | Row means | n = 50 | n = 70 | n = 100
I | (.2, .3, .1, .1, .1, .2) | (50, 15, 20, 35, 27.5, 23.75), SD = 15 | (27.5, 27.5) | 4.22 | 4.37 | 5.03
II | (.2, .3, .1, .1, .1, .2) | (40, 35, 20, 45, 45, 23.33), SD = 10 | (34.17, 34.17) | 4.83 | 4.97 | 5.49
III | (.2, .3, .1, .1, .1, .2) | (65, 5, 60, 30, 45, 30.83), SD = 50 | (34.17, 34.17) | 4.08 | 4.30 | 4.95
IV | (.3, .1, .05, .05, .1, .4) | (30, 40, 60, 30, 45, 33.89), SD = 30 | (35.56, 35.56) | 3.98 | 4.23 | 4.38

Acknowledgments

Dr. Moschopoulos’s research was partially supported by Grants from the National Center for Research Resources 5G12RR008124 and the National Institute for Minority Health and Health Disparities Grant G12MD007592 from the National Institutes of Health. We express our thanks to Dr. Julia Bader of the Statistical Consulting Laboratory of UTEP for help with the simulations. Finally, we thank the editors and two anonymous referees for comments and revisions that improved the final form of the paper.

References

  1. Arnold SF. Theory of Linear Models and Multivariate Analysis. Wiley; New York: 1981.
  2. Galassi JP, Frierson HT Jr, Sharer R. Behavior of high, moderate and anxious students during an actual test situation. Journal of Consulting and Clinical Psychology. 1981;49(1):51–62.
  3. Groat HT, Neal AG. Social psychological correlates of urban fertility. American Sociological Review. 1967:945–959.
  4. Graybill FA. Theory and Application of the Linear Model. Duxbury; North Scituate, MA: 1976.
  5. Kutner MH. Hypothesis testing in linear models. The American Statistician. 1974;28:98–100.
  6. Macnaughton D. Which sums of squares are best in unbalanced analysis of variance? Paper presented at the Joint Statistical Meetings, Boston, 1992; revised 1998. 〈http://www.matstat.com/ss/easleaao.pdf〉.
  7. Moschopoulos PG, Davidson MI. Hypothesis testing in ANOVA under multinomial sampling. Sankhyā Series B. 1985;47(3):301–309.
  8. Scheffé H. The Analysis of Variance. Wiley; New York: 1959.
  9. Searle SR. Linear Models. Wiley; New York: 1971.
  10. Searle SR. Linear Models for Unbalanced Data. 1982.
  11. Speed FM, Hocking RR, Hackney OP. Methods of analysis of linear models with unbalanced data. Journal of the American Statistical Association. 1978;73:105–113.
  12. Weiss DJ. An analysis of variance test for random attrition. Journal of Social Behavior and Personality. 1999;14:433–438.
  13. Weiss DJ. Analysis of Variance and Functional Measurement: A Practical Guide. Oxford University Press; New York: 2006.
