Author manuscript; available in PMC 2011 Jan 13.
Published in final edited form as: Statistics (Ber). 2010 Apr 1;44(2):145–153. doi: 10.1080/02331880902986984

Probabilities for separating sets of order statistics

D H Glueck a,*, A Karimpour-Fard b, J Mandel c,d,e, K E Muller f
PMCID: PMC3020799  NIHMSID: NIHMS184247  PMID: 21243084

Abstract

Consider a set of order statistics that arise from sorting samples from two different populations, each with its own, possibly different distribution function. The probability that these order statistics fall in disjoint, ordered intervals and that, of the smallest statistics, a certain number come from the first population is given in terms of the two distribution functions. The result is applied to computing the joint probability of the number of rejections and the number of false rejections for the Benjamini–Hochberg false discovery rate procedure.

Keywords: Benjamini and Hochberg procedure, block matrix, permanent, multiple comparison

1. Introduction

Glueck et al. [1] gave explicit expressions for the probability that arbitrary subsets of order statistics fall in disjoint, ordered intervals on the set of real numbers. In this paper, we extend this work and consider two sets of real-valued, independent but not necessarily identically distributed random variables. We give expressions in terms of cumulative distribution functions for the probability that arbitrary subsets of order statistics fall in disjoint, ordered intervals and that, of the smallest statistics, a certain number come from one set. We have been unable to find any previous papers on this topic. This problem is of interest in calculating probabilities for the Benjamini and Hochberg [2] multiple comparisons procedure.

2. A simple example

Consider the following simple example. Let X1, X2 ∈ [0, 1] be independent random variables. Denote by FX1(x1) and FX2(x2) the marginal cumulative distribution functions and by FX1,X2(x1, x2) the joint cumulative distribution function of X1 and X2. Assume that the cumulative distribution functions are continuous. Let Y1 = min{X1, X2} and let Y2 = max{X1, X2} be the order statistics. For i = 1, 2, write the marginal cumulative distribution function of Yi as FYi(yi) and the joint cumulative distribution function as FY1,Y2(y1, y2), for y1 ≤ y2. This joint cumulative distribution function is also continuous [3, p. 10].

Choose numbers b1 < b2, b1, b2 ∈ (0, 1). We wish to find the probabilities

𝒜 = Pr{(y1 < b1) ∩ (y2 > b2)},  (1)
β = Pr{(y1 < b1) ∩ (y2 > b2) ∩ (x1 < b1)}  (2)

and

γ = Pr{(y1 < b1) ∩ (y2 > b2) ∩ ¬(x1 < b1)}.  (3)

and express them in terms of the distribution functions FX1 and FX2. First, we will find the probabilities directly. So,

β = Pr{(x1 < b1) ∩ (x2 > b2)} = FX1(b1)[1 − FX2(b2)]  (4)

and

γ = Pr{(x1 > b2) ∩ (x2 < b1)} = [1 − FX1(b2)]FX2(b1).  (5)

Equations (4) and (5) follow directly from the independence of the random variables and the definition of the cumulative distribution functions. Since

{(y1 < b1) ∩ (y2 > b2)} = {(y1 < b1) ∩ (y2 > b2) ∩ (x1 < b1)} ∪ {(y1 < b1) ∩ (y2 > b2) ∩ ¬(x1 < b1)}  (6)

and the union is disjoint, it follows that

𝒜=β+γ. (7)

For a problem with more than two order statistics, the number of cases one needs to consider and the number of possible combinations of statistics, subsets, and bounds make a direct approach impractical. An algorithmic approach to obtaining β and γ allows generalization to an arbitrary number of order statistics.

Using the assumption that the distribution functions are continuous, simple set operations, and the definition of distribution function, we obtain that the probability of the union (6) is

𝒜 = Pr{(y1 < b1) ∩ ¬(y2 < b2)}  (8)
 = Pr{y1 < b1} − Pr{(y1 < b1) ∩ (y2 < b2)}  (9)
 = FY1(b1) − FY1,Y2(b1, b2).  (10)

The cumulative distributions of the order statistics can be written [4],

FY1(b1) = FX1(b1)FX2(b1) + FX1(b1)[1 − FX2(b1)] + [1 − FX1(b1)]FX2(b1)  (11)
FY1,Y2(b1, b2) = FX1(b1)[FX2(b2) − FX2(b1)] + [FX1(b2) − FX1(b1)]FX2(b1) + FX1(b1)FX2(b1).  (12)

Then, substituting Equations (11) and (12) into Equation (10), we can write 𝒜 in terms of the distribution functions of X1 and X2,

𝒜 = FX1(b1)[1 − FX2(b1)] + [1 − FX1(b1)]FX2(b1) − FX1(b1)[FX2(b2) − FX2(b1)] − [FX1(b2) − FX1(b1)]FX2(b1)  (13)
 = FX1(b1)[1 − FX2(b2)] + [1 − FX1(b2)]FX2(b1).  (14)

We now interpret the terms in the sum in Equation (14). The term that includes FX1(b1) as a factor is the probability of an event in which x1 < b1 occurs, and the term that includes 1 − FX1(b2) as a factor is the probability of an event in which x1 > b2. Since b1 < b2, the two events are disjoint, and, consequently, Equation (7) follows again.

To summarize, we have expressed the probability in terms of the joint distribution of the order statistics, which was in turn written in terms of the distribution functions of the random variables. Finally, by recognizing terms that corresponded to a partition, we decomposed 𝒜 into a sum of β and γ, the two probabilities of interest.
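These identities are easy to check numerically. The following minimal Python sketch (the Beta distribution for X2 is an arbitrary illustrative choice, not from the paper) evaluates β, γ, and 𝒜 from Equations (4), (5), and (14), checks Equation (7), and confirms 𝒜 by Monte Carlo.

    # Numerical check of Equations (4), (5), (7), and (14).
    # The choice of distributions for X1 and X2 is illustrative only.
    import numpy as np
    from scipy.stats import uniform, beta

    F1, F2 = uniform(0, 1), beta(2, 5)   # continuous distributions on [0, 1]
    b1, b2 = 0.3, 0.6                    # bounds with b1 < b2

    beta_prob = F1.cdf(b1) * (1 - F2.cdf(b2))    # Equation (4)
    gamma_prob = (1 - F1.cdf(b2)) * F2.cdf(b1)   # Equation (5)
    a_prob = F1.cdf(b1) * (1 - F2.cdf(b2)) + (1 - F1.cdf(b2)) * F2.cdf(b1)  # Equation (14)
    assert abs(a_prob - (beta_prob + gamma_prob)) < 1e-12  # Equation (7)

    # Monte Carlo check of A = Pr{(y1 < b1) and (y2 > b2)}.
    rng = np.random.default_rng(0)
    x1 = F1.rvs(10**6, random_state=rng)
    x2 = F2.rvs(10**6, random_state=rng)
    y1, y2 = np.minimum(x1, x2), np.maximum(x1, x2)
    print(a_prob, np.mean((y1 < b1) & (y2 > b2)))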

3. General case

The logic used in this simple two-variable example generalizes to an arbitrary number of random variables. Consider a set of order statistics that arise from sorting samples from two different populations, each with its own, possibly different distribution function. We wish to find the probability that these order statistics fall in a given union of intervals and that, of the smallest statistics, a certain number come from one population.

For this general case, we need to introduce some notation and definitions. Let Xi, i = 1, …, m, be independent but not necessarily identically distributed real-valued random variables with values in the interval [0, 1] and continuous cumulative distribution functions FXi(xi). Partition the set {X1, X2,…,Xm} into two subsets,

S1 = {X1, X2, …, Xn},  S2 = {Xn+1, Xn+2, …, Xm}.  (15)

For example, one can consider measurements for males and females, or for two different populations of breast cancers, slow and fast growing. The order statistics Y1, Y2,…,Ym are random variables defined by sorting the values of the Xi, so that Y1 ≤ Y2 ≤ ⋯ ≤ Ym. Denote the realizations of the order statistics by y1 ≤ y2 ≤ ⋯ ≤ ym.

The arguments of the joint cumulative distribution function of order statistics are customarily written omitting redundant arguments; thus, for 1 ≤ e ≤ m, let 1 ≤ n1 < n2 < ⋯ < ne ≤ m denote the indices of the order statistics of interest. The joint cumulative distribution function of the set {Yn1, Yn2,…,Yne}, which is a subset of the complete set of order statistics, is defined as

FYn1,…,Yne(y1, …, ye) = Pr({Yn1 ≤ y1} ∩ {Yn2 ≤ y2} ∩ ⋯ ∩ {Yne ≤ ye}).  (16)

Suppose we are given s ≤ m disjoint intervals

(cq, dq),  0 = c1 < d1 < c2 < d2 < ⋯ < cs < ds = 1,  (17)

and integers

kq ≥ 0,  ∑_{q=1}^{s} kq = m,  (18)

where k0 = 0 and kq is the number of order statistics that fall in the qth interval. Define wq,1 = 1 + ∑_{i=1}^{q−1} ki and wq,kq = ∑_{i=1}^{q} ki to be the subscripts of the smallest and largest order statistics, respectively, that fall in the qth interval. In the case when kq = 1, we have wq,1 = wq,kq. Using this notation, the event that exactly kq of the order statistics fall in the qth interval is

{c1 < Yw1,1 < ⋯ < Yw1,k1 < d1} ∩ ⋯ ∩ {cs < Yws,1 < ⋯ < Yws,ks < ds},  (19)

or in a more compact notation (21) given below. Now let B be another random event. The following theorem gives the probability of this event intersected with the event (19), in terms of the cumulative distribution functions of the order statistics relative to the event B. This distribution function is defined by

FYn1,…,Yne;B(y1, …, ye) = Pr({Yn1 ≤ y1} ∩ {Yn2 ≤ y2} ∩ ⋯ ∩ {Yne ≤ ye} ∩ B).  (20)

Contrary to the usual convention, we do not require that the indices of the order statistics in the cumulative distribution function (20) be sorted, because that would complicate the notation in the next theorem (it would require additional renumbering of the arguments).
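To make the indexing concrete, the following small helper (a hypothetical illustration, not part of the paper) computes the subscripts wq,1 and wq,kq from the counts k1, …, ks.

    # Hypothetical helper: subscripts of the smallest and largest order
    # statistics falling in each of the s intervals, from the counts k_q.
    def interval_subscripts(k):
        """k[q-1] is the number of order statistics in the qth interval."""
        w_first, w_last, running = [], [], 0
        for kq in k:
            w_first.append(running + 1)   # w_{q,1} = 1 + k_1 + ... + k_{q-1}
            running += kq
            w_last.append(running)        # w_{q,kq} = k_1 + ... + k_q
        return w_first, w_last

    # Example: k = (2, 1, 3) gives w_first = [1, 3, 4] and w_last = [2, 3, 6].
    print(interval_subscripts([2, 1, 3]))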

THEOREM 1 Denote the event

E = ∩_{q=1}^{s} ({cq < Ywq,1} ∩ {Ywq,kq < dq}).  (21)

Then

Pr(E ∩ B) = FYw1,k1,Yw2,k2,…,Yws,ks;B(d1, d2, …, ds)
 − ∑_{q=1}^{s} FYw1,k1,Yw2,k2,…,Yws,ks,Ywq,1;B(d1, d2, …, ds, cq)
 + ∑_{1≤r<t≤s} FYw1,k1,Yw2,k2,…,Yws,ks,Ywr,1,Ywt,1;B(d1, d2, …, ds, cr, ct)
 − ⋯ + (−1)^s FYw1,1,Yw1,k1,Yw2,1,Yw2,k2,…,Yws,1,Yws,ks;B(c1, d1, c2, d2, …, cs, ds).  (22)

Proof By standard set operations,

E = ∩_{q=1}^{s} {cq < Ywq,1} ∩ ∩_{q=1}^{s} {Ywq,kq < dq}  (23)

and

∩_{q=1}^{s} {cq < Ywq,1} = ∩_{q=1}^{s} {Ywq,1 ≤ cq}^C = (∪_{q=1}^{s} {Ywq,1 ≤ cq})^C,  (24)

where C denotes the complement. Therefore,

E ∩ B = (∪_{q=1}^{s} {Ywq,1 ≤ cq})^C ∩ F,  (25)

where the event F is defined by

F = ∩_{q=1}^{s} {Ywq,kq < dq} ∩ B.  (26)

By the additivity of probability, it follows from Equation (25) that

Pr(E ∩ B) = Pr(F) − Pr(∪_{q=1}^{s} {Ywq,1 ≤ cq} ∩ F) = Pr(F) − Pr(∪_{q=1}^{s} Aq),  (27)

where Aq = {Ywq,1 ≤ cq} ∩ F. Using the additivity of probability again, we have

Pr(∪_{q=1}^{s} Aq) = ∑_{q=1}^{s} Pr(Aq) − ∑_{1≤r<t≤s} Pr(Ar ∩ At) + ⋯  (28)
 + (−1)^{s+1} Pr(∩_{q=1}^{s} Aq).  (29)

Now, putting Equations (26)–(29) together and using the continuity of the cumulative distribution functions, we obtain

Pr(E ∩ B) = Pr(∩_{q=1}^{s} {Ywq,kq ≤ dq} ∩ B)
 − ∑_{r=1}^{s} Pr({Ywr,1 ≤ cr} ∩ ∩_{q=1}^{s} {Ywq,kq ≤ dq} ∩ B)
 + ∑_{1≤r<t≤s} Pr({Ywr,1 ≤ cr} ∩ {Ywt,1 ≤ ct} ∩ ∩_{q=1}^{s} {Ywq,kq ≤ dq} ∩ B)
 − ⋯ + (−1)^s Pr(∩_{q=1}^{s} {Ywq,1 ≤ cq} ∩ ∩_{q=1}^{s} {Ywq,kq ≤ dq} ∩ B)
 = FYw1,k1,Yw2,k2,…,Yws,ks;B(d1, d2, …, ds)
 − ∑_{r=1}^{s} FYw1,k1,Yw2,k2,…,Yws,ks,Ywr,1;B(d1, d2, …, ds, cr)
 + ∑_{1≤r<t≤s} FYw1,k1,Yw2,k2,…,Yws,ks,Ywr,1,Ywt,1;B(d1, d2, …, ds, cr, ct)
 − ⋯ + (−1)^s FYw1,1,Yw1,k1,Yw2,1,Yw2,k2,…,Yws,1,Yws,ks;B(c1, d1, c2, d2, …, cs, ds),

which concludes the proof.
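In code, the alternating sum of Equation (22) is a signed sum over subsets of the intervals whose lower bounds cq are appended to the argument list. The following Python sketch implements this computation, assuming a user-supplied function cdf(indices, points) that evaluates the distribution function (20); one brute-force evaluator for the two-population case is sketched after Theorem 2 below.

    # Inclusion-exclusion of Theorem 1 / Equation (22).  `cdf(indices, points)`
    # must evaluate the CDF relative to B defined in Equation (20).
    from itertools import combinations

    def prob_E_and_B(c, d, w_first, w_last, cdf):
        s = len(c)
        total = 0.0
        for size in range(s + 1):
            for subset in combinations(range(s), size):
                indices = list(w_last) + [w_first[q] for q in subset]
                points = list(d) + [c[q] for q in subset]
                total += (-1) ** size * cdf(indices, points)
        return total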

From now on, assume that B is the event that exactly j elements of S1 fall in the interval (0, y1), for a given j ≤ n. This event is shown in Table 1. Thus, to compute the probability of interest, it is enough to evaluate the cumulative distribution functions of the order statistics relative to the event B, given by Equation (20). An efficient method for the computation of cumulative distribution functions of order statistics from two populations was proposed by Glueck et al. [5]. Here we need a slight generalization, involving the event B, which requires a different proof.

Table 1.

Numbers of order statistics from the sets S1 and S2 in the interval (0, d1) and outside the interval (0, d1), in the event B.

        < d1      ≥ d1                    Total
S1      j         n − j                   n
S2      k1 − j    (m − n) − (k1 − j)      m − n
Total   k1        m − k1                  m

THEOREM 2 Denote the index vector i = (i0, i1,…,ie+1) and the summation index set

ℐ = {i : 0 = i0 ≤ i1 ≤ ⋯ ≤ ie ≤ ie+1 = m, and ia ≥ na for all 1 ≤ a ≤ e}.  (30)

Suppose that FXi(x) = F(x) for all 1 ≤ i ≤ n, and FXi(x) = G(x) for all n + 1 ≤ i ≤ m. Then the cumulative distribution function relative to the event B (20) is given by

FYn1,…,Yne;B(y1, …, ye) = ∑_{i∈ℐ} ∑_{λ} n!(m − n)! ∏_{a=1}^{e+1} [F(ya) − F(ya−1)]^{λa} [G(ya) − G(ya−1)]^{ia − ia−1 − λa} / [λa!(ia − ia−1 − λa)!],  (31)

where y0 = 0, ye+1 = 1, and λ = (λ1, λ2,…,λe+1) ranges over all integer vectors such that λ1 = j and

λ1 + λ2 + ⋯ + λe+1 = n,  0 ≤ λa ≤ ia − ia−1.  (32)

Proof Denote by Ai,λ the event that exactly ia − ia−1 of the random variables Xi fall in the interval (ya−1, ya), and that exactly λa of those are elements of S1. When a = 1, (ya−1, ya) = (y0, y1) = (0, y1), so if B occurs, then λ1 = j. Then, by a multinomial argument,

Pr(Ai,λ) = n!(m − n)! ∏_{a=1}^{e+1} [F(ya) − F(ya−1)]^{λa} [G(ya) − G(ya−1)]^{ia − ia−1 − λa} / [λa!(ia − ia−1 − λa)!].  (33)

Since the events Ai,λ for different (i, λ) are disjoint, summing Pr(Ai,λ) over i ∈ ℐ and over the admissible λ yields Equation (31), and the result follows.

The only difference between Theorem 2 and the result by Glueck et al. [1] is the added condition λ1 = j.
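Equation (31) can be evaluated by direct enumeration. The Python sketch below is a brute-force illustration, practical only for small m; it is not the efficient block-permanent algorithm of Glueck et al. [5]. It assumes the arguments y1 ≤ ⋯ ≤ ye are nondecreasing and reads B relative to the first interval (0, y1).

    # Brute-force evaluation of Equation (31): the joint CDF of
    # Y_{n_1}, ..., Y_{n_e} relative to B (exactly j of the n variables
    # with CDF F fall in (0, y_1)).  Assumes y_1 <= ... <= y_e.
    from itertools import accumulate, product
    from math import factorial

    def cdf_relative_to_B(ns, ys, m, n, j, F, G):
        e = len(ns)
        y = [0.0] + list(ys) + [1.0]                 # y_0 = 0, y_{e+1} = 1
        dF = [F(y[a]) - F(y[a - 1]) for a in range(1, e + 2)]
        dG = [G(y[a]) - G(y[a - 1]) for a in range(1, e + 2)]
        total = 0.0
        for gaps in product(range(m + 1), repeat=e + 1):  # increments i_a - i_{a-1}
            if sum(gaps) != m:
                continue
            cums = list(accumulate(gaps))            # i_1, ..., i_{e+1}
            if any(cums[a] < ns[a] for a in range(e)):    # requires i_a >= n_a
                continue
            for lam in product(range(n + 1), repeat=e + 1):
                if lam[0] != j or sum(lam) != n:
                    continue
                if any(lam[a] > gaps[a] for a in range(e + 1)):
                    continue
                term = factorial(n) * factorial(m - n)
                for a in range(e + 1):
                    term *= dF[a] ** lam[a] * dG[a] ** (gaps[a] - lam[a])
                    term /= factorial(lam[a]) * factorial(gaps[a] - lam[a])
                total += term
        return total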

In the case of two random variables, we recover the same results as the direct method in Section 2. With m = 2, n = 1, s = 2, c1 = 0, d1 = b1, c2 = b2, d2 = 1, S1 = {X1}, S2 = {X2}, k1 = 1, k2 = 1, Yw1,1 = Yw1,k1 = Y1, and Yw2,1 = Yw2,k2 = Y2, using Theorems 1 and 2 yields

Pr(E ∩ B) = γ,  (34)

when j = 0, and

Pr(E ∩ B) = β,  (35)

when j = 1.
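The two sketches above reproduce this check numerically. Here make_cdf is a hypothetical wrapper that sorts the (index, argument) pairs before applying Equation (31); both populations are taken uniform on [0, 1] purely for illustration, so that Equations (4) and (5) reduce to b1(1 − b2) and (1 − b2)b1.

    # Recover beta and gamma for m = 2 via Theorems 1 and 2, reusing
    # prob_E_and_B and cdf_relative_to_B from the sketches above.
    def make_cdf(m, n, j, F, G):
        def cdf(indices, points):
            # Equation (20) is symmetric in its (index, point) pairs, so
            # sort by point; B is then read relative to (0, y_1).
            pairs = sorted(zip(points, indices))
            ys = [p for p, _ in pairs]
            ns = [i for _, i in pairs]
            return cdf_relative_to_B(ns, ys, m, n, j, F, G)
        return cdf

    F = G = lambda x: x                 # both populations uniform on [0, 1]
    b1, b2 = 0.3, 0.6
    c, d, w_first, w_last = [0.0, b2], [b1, 1.0], [1, 2], [1, 2]
    beta_prob = prob_E_and_B(c, d, w_first, w_last, make_cdf(2, 1, 1, F, G))
    gamma_prob = prob_E_and_B(c, d, w_first, w_last, make_cdf(2, 1, 0, F, G))
    print(beta_prob, b1 * (1 - b2))     # both 0.12, Equation (4)
    print(gamma_prob, (1 - b2) * b1)    # both 0.12, Equation (5)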

In conclusion, for two sets of real-valued, independent but not necessarily identically distributed random variables, we have now given an expression for the probability that arbitrary subsets of order statistics fall in disjoint, ordered intervals and that, of the smallest statistics, a certain number come from one set.

4. Concluding example

The methods of this paper can be used to calculate the joint probability of the number of rejections and the number of false rejections for the Benjamini–Hochberg [2] procedure. A rejection of a hypothesis for which the null holds is a false rejection. Given a false discovery rate α ∈ (0, 1), hypotheses Hi, i = 1,…,m, p-values Xi, and the corresponding order statistics for the p-values Yi = X(i) (the random variables Xi sorted in nondecreasing order X(1) ≤ X(2) ≤ ⋯ ≤ X(m)), the procedure produces a nondecreasing sequence of numbers bi = iα/m ∈ (0, 1), rejects the hypotheses H(e), e = 1,…, k1, where k1 is the largest number for which yk1 ≤ bk1, and accepts all others. For n ∈ {0, 1,…,m}, assume that the null holds for H1, H2,…, Hn and that the alternative holds for Hn+1, Hn+2,…,Hm. Let S1 = {X1, X2,…,Xn} be the set of p-values that correspond to the null hypotheses, and S2 = {Xn+1, Xn+2,…,Xm} be the set of p-values for which the alternative holds. Then j is the number of null hypotheses that are rejected, which is equal to the number of p-values corresponding to null hypotheses that fall in the interval [0, bk1].
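For reference, a minimal sketch of the step-up rule just described (numpy assumed):

    # Benjamini-Hochberg step-up rule: k1 is the largest index with
    # y_{k1} <= k1 * alpha / m; reject the k1 smallest p-values.
    import numpy as np

    def benjamini_hochberg(pvalues, alpha):
        p = np.sort(np.asarray(pvalues))
        m = len(p)
        below = np.nonzero(p <= alpha * np.arange(1, m + 1) / m)[0]
        k1 = below[-1] + 1 if below.size else 0
        return k1, p[:k1]        # number of rejections, rejected p-values

    print(benjamini_hochberg([0.01, 0.20, 0.03, 0.80], alpha=0.05))  # prints (1, array([0.01]))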

Under the assumption that the p-values for which the alternative holds have the same distribution, one can use the methods of this paper to find the joint distribution of j and k1. For each value of k1 and m, Glueck et al. [6] pointed out that the rejection regions for the Benjamini and Hochberg [2] procedure can be decomposed into disjoint sets of events. These events correspond to certain numbers of order statistics falling into sets of intervals defined by the numbers bi. Details about the decomposition of the rejection regions into these events are given in Glueck et al. [6]. The general case is too complicated to detail here. However, as an example, we calculate the probabilities that, with m = 2 hypotheses and n = 1 null hypothesis, the Benjamini and Hochberg [2] procedure rejects k1 = 1 hypothesis and that j, the number of false rejections, is either 0 or 1.

Suppose we wish to test m = 2 hypotheses, specifically two-sided hypotheses about the locations of the population means. We assume that we have two large populations with known variances (both σ2), and that the variables of interest, say ε1 and ε2, are normally distributed, so that ε1 ~ N(μ1, σ2) and ε2 ~ N(μ2, σ2). We wish to test the two hypotheses H1: μ1 = μ0 and H2: μ2 = μ0, with the same alternative hypothesis for both populations, HA: μ = μA. We draw a random sample of size Ni from each population, say εi1, εi2,…,εiNi. For convenience, we assume that the random sample is of the same size for each hypothesis test, so N1 = N2 = N.

With

ε̄i = N^{−1} ∑_{δ=1}^{N} εiδ,  (36)

the test statistics are given by

Zi = (σ/√N)^{−1}(ε̄i − μ0),  (37)

and the two-sided p-values are [7, p. 244]

Xi = 2Φ(Zi) if Zi ≤ 0, and Xi = 2[1 − Φ(Zi)] if Zi > 0,  (38)

where Φ is the cumulative distribution function of the standard normal (mean = 0 and variance = 1). Let ϕ be the probability density function of the standard normal.
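In code, with Φ taken from scipy (scipy.stats.norm.cdf), the two-sided p-value (38) is:

    # Two-sided p-value of Equation (38).
    from scipy.stats import norm

    def two_sided_pvalue(z):
        return 2 * norm.cdf(z) if z <= 0 else 2 * (1 - norm.cdf(z))

    # Equivalently 2 * norm.sf(abs(z)); e.g. two_sided_pvalue(-1.96) is about 0.05.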

Suppose that in truth we have ε1 ~ N(μ0, σ2), so that the null holds for H1, and ε2 ~ N(μA, σ2), so that the alternative holds for H2. Define S1 = {X1} and S2 = {X2}. Then the number of p-values for which the null holds is n = 1. For H1, the hypothesis for which the null holds, the p-value has a uniform distribution on the interval [0, 1], so for x1 ∈ [0, 1],

FX1(x1)=x1. (39)

For H2, the alternative holds. When we conduct the hypothesis test, we are unaware of the truth. We always calculate the p-value under the null. However, since the alternative actually holds,

Pr[Z2 ≤ z2] = Pr[(ε̄2 − μ0)/(σ/√N) ≤ z2] = Pr[(ε̄2 − μA)/(σ/√N) ≤ z2 + (μ0 − μA)/(σ/√N)] = Φ[z2 + (μ0 − μA)/(σ/√N)].  (40)

Finally,

FX2(x2) = Pr(X2 < x2)
 = Pr({X2 < x2} ∩ {Z2 ≤ 0}) + Pr({X2 < x2} ∩ {Z2 > 0})
 = Pr({2Φ(Z2) < x2}) + Pr({2[1 − Φ(Z2)] < x2})
 = Pr({Z2 < Φ^{−1}(x2/2)}) + 1 − Pr({Z2 ≤ Φ^{−1}(1 − x2/2)})
 = Φ[Φ^{−1}(x2/2) + (μ0 − μA)/(σ/√N)] + 1 − Φ[Φ^{−1}(1 − x2/2) + (μ0 − μA)/(σ/√N)],  (41)

where the last step follows by substitution from Equation (40).
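Equation (41) translates directly into code. A sketch assuming scipy, writing the shift (μ0 − μA)/(σ/√N) as delta:

    # CDF of the p-value when the alternative holds, Equation (41).
    from math import sqrt
    from scipy.stats import norm

    def pvalue_cdf_alternative(x, mu0, muA, sigma, N):
        delta = (mu0 - muA) / (sigma / sqrt(N))
        return (norm.cdf(norm.ppf(x / 2) + delta)
                + 1 - norm.cdf(norm.ppf(1 - x / 2) + delta))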

Now, as a specific example, we fix μ0 = 0, μA = 1, σ2 = 1, N = 5, and α = 0.05. We wish to calculate the probability that k1 = 1 and that j = 0 or j = 1, so that c1 = 0, d1 = α/2, c2 = α, and d2 = 1. This is the probability that, of the two hypotheses, we reject exactly one. When j = 0, the rejection is of the hypothesis for which the alternative holds, and when j = 1, the rejection is of the null hypothesis, a false rejection.

We calculated the probability using our methodology, and by a simulation with 100,000 replications. Recall that k1 is the number of hypotheses rejected, and j is the number of those that correspond to S1, the null hypotheses. The results are shown in Table 2.

Table 2.

Comparison of simulation and theory.

k1   j   Theory       Simulation   Difference
1    0   0.472982     0.47388      0.000898
1    1   0.00978051   0.0095       0.00028051

Note: Recall that k1 is the number of hypotheses that were rejected, and j is the number of null hypotheses that were rejected. We had two hypotheses and one null hypothesis.

Notice that the simulation differs from the theory by less than 0.001 in each case. The theory is exact. Software that implements this method in Mathematica is available from the authors upon request.
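The entries of Table 2 can be reproduced with a short script: the theory column from Equations (4), (5), (39), and (41), and the simulation column by Monte Carlo. The sketch below uses the parameter values fixed above; the seed, and hence the simulated column, will of course differ from the published run.

    # Reproduce Table 2.  Theory: Equations (4) and (5) with FX1(x) = x and
    # FX2 from Equation (41).  Simulation: 100,000 replications of the
    # two-test Benjamini-Hochberg procedure.
    import numpy as np
    from math import sqrt
    from scipy.stats import norm

    mu0, muA, sigma, N, alpha, m = 0.0, 1.0, 1.0, 5, 0.05, 2
    b1, b2 = alpha / m, alpha
    delta = (mu0 - muA) / (sigma / sqrt(N))

    def FX2(x):                                      # Equation (41)
        return (norm.cdf(norm.ppf(x / 2) + delta)
                + 1 - norm.cdf(norm.ppf(1 - x / 2) + delta))

    theory_j0 = (1 - b2) * FX2(b1)                   # gamma, Equation (5)
    theory_j1 = b1 * (1 - FX2(b2))                   # beta, Equation (4)

    rng = np.random.default_rng(1)
    reps = 100_000
    z1 = rng.standard_normal(reps)                   # null true for H1
    z2 = rng.standard_normal(reps) - delta           # alternative true for H2
    p1 = 2 * norm.sf(np.abs(z1))                     # Equation (38)
    p2 = 2 * norm.sf(np.abs(z2))
    y1, y2 = np.minimum(p1, p2), np.maximum(p1, p2)
    reject_one = (y1 <= b1) & (y2 > b2)              # the event k1 = 1
    print(theory_j0, np.mean(reject_one & (p2 <= b1)))   # j = 0
    print(theory_j1, np.mean(reject_one & (p1 <= b1)))   # j = 1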

Acknowledgements

Glueck was supported by NCI K07CA88811. Mandel was supported by NSF-CMS 0325314. Muller was supported by NCI P01 CA47 982-04, NCI R01 CA095749-01A1 and NIAID 9P30 AI 50410. The authors thank Professor Gary Grunwald for his helpful comments.

Footnotes

AMS (2000) Subject Classification: Primary: 62E15, 65C60

References

1. Glueck DH, Karimpour-Fard A, Mandel J, Muller KE. On the probability that order statistics fall in intervals. 2008. In review.
2. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B Stat. Methodol. 1995;57:289–300.
3. David HA. Order Statistics. 2nd ed. New York: Wiley; 1981.
4. Bapat RB, Beg MI. Order statistics for non-identically distributed variables and permanents. Sankhya Ser. A. 1989;51:79–93.
5. Glueck DH, Karimpour-Fard A, Mandel J, Hunter L, Muller KE. Fast computation by block permanents of cumulative distribution functions of order statistics from several populations. Commun. Stat. Theory Methods. 2008. doi: 10.1080/03610920802001896. To appear.
6. Glueck DH, Muller KE, Karimpour-Fard A, Hunter L. Expected power for the false discovery rate with independence. Commun. Stat. Theory Methods. 2008;37(12). doi: 10.1080/03610920801893731.
7. Rosner B. Fundamentals of Biostatistics. 6th ed. New York: Brooks-Cole; 2006.
8. Ross S. A First Course in Probability. 2nd ed. New York: Macmillan Publishing Company; 1984.
