Finite sampling inequalities: an application to two-sample Kolmogorov-Smirnov statistics

Evan Greene; Jon A Wellner

doi:10.1016/j.spa.2016.04.020

. Author manuscript; available in PMC: 2017 Dec 1.

Published in final edited form as: Stoch Process Their Appl. 2016 Dec;126(12):3701–3715. doi: 10.1016/j.spa.2016.04.020

Finite sampling inequalities: an application to two-sample Kolmogorov-Smirnov statistics

Evan Greene ¹, Jon A Wellner ²

PMCID: PMC5218830 NIHMSID: NIHMS786904 PMID: 28070139

Abstract

We review a finite-sampling exponential bound due to Serfling and discuss related exponential bounds for the hypergeometric distribution. We then discuss how such bounds motivate some new results for two-sample empirical processes. Our development complements recent results by Wei and Dudley (2012) concerning exponential bounds for two-sided Kolmogorov - Smirnov statistics by giving corresponding results for one-sided statistics with emphasis on “adjusted” inequalities of the type proved originally by Dvoretzky et al. (1956) and by Massart (1990) for one-sample versions of these statistics.

Keywords: Bennett inequality, finite sampling, Hoeffding inequality, hypergeometric distribution, two-samples, Kolmogorov-Smirnov statistics, exponential bounds

1. Introduction: Serfling’s finite sampling exponential bound

Suppose that {c₁,…, c_N} is a finite population with each c_i ∈ ℝ. For n ≤ N, let Y₁,…, Y_n be a sample drawn from {c₁,…, c_N} without replacement; we can regard the finite population {c₁,…, c_N} as an urn containing N balls labeled with the numbers c₁,…, c_N. Some notation: we let

\begin{array}{l} μ_{N} = N^{- 1} \sum_{i = 1}^{N} c_{i} \equiv {\bar{c}}_{N}, σ_{N}^{2} = N^{- 1} \sum_{i = 1}^{N} {(c_{i} - {\bar{c}}_{N})}^{2}, \\ a_{N} \equiv min_{1 \leq i \leq N} c_{i}, b_{N} \equiv max_{1 \leq i \leq N} c_{i}, \\ f_{n} \equiv \frac{n - 1}{N - 1}, and f_{n}^{*} \equiv \frac{n - 1}{N} . \end{array}

It is well-known (see e.g. Rice (2007), Theorem B, page 208) that ${\bar{Y}}_{n} = n^{- 1} \sum_{i = 1}^{n} Y_{i}$ satisfies E(Ȳ_n) = μ_N and

Var ({\bar{Y}}_{n}) = \frac{σ_{N}^{2}}{n} (1 - \frac{n - 1}{N - 1}) = \frac{σ_{N}^{2}}{n} (1 - f_{n}) .

(1)

Serfling (1974), Corollary 1.1, shows that for all λ > 0

P (\sqrt{n} ({\bar{Y}}_{n} - μ_{N}) \geq λ) \leq exp (- \frac{2 λ^{2}}{(1 - f_{n}^{*}) {(b_{N} - a_{N})}^{2}}) .

(2)

This inequality is an inequality of the type proved by Hoeffding (1963) for sampling with replacement and more generally for sums of independent bounded random variables. Comparing (1) and (2), it seems reasonable to ask whether the factor $f_{n}^{*}$ in (2) can be improved to f_n ≡ (n − 1)/(N − 1)? Indeed Serfling ends his paper (on page 47) with the remark: “(it is) also of interest to obtain (2) with the usual sampling fraction instead of $f_{n}^{*}$ ”. Note that when n = N, Ȳ_n = μ_N, and hence the probability in (2) is 0 for all λ > 0, and the conjectured improvement of Serfling’s bound agrees with this while Serfling’s bound itself is positive when n = N.

Despite related results due to Kemperman (1973a,b,c), it seems that a definitive answer to this question is not yet known.

A special case of considerable importance is the case when the numbers on the balls in the urn are all 1’s and 0’s: suppose that c₁ = ··· = c_D = 1, while c_D₊₁,…, c_N = 0. Then $X \equiv n {\bar{Y}}_{n} = \sum_{i = 1}^{n} Y_{i}$ is well-known to have a Hypergeometric(n, D, N) distribution given by

P (\sum_{i = 1}^{n} Y_{i} = k) = \frac{(\begin{matrix} D \\ k \end{matrix}) (\begin{matrix} N - D \\ n - k \end{matrix})}{(\begin{matrix} N \\ n \end{matrix})}, max {0, D + n - N} \leq k \leq min {n, D} .

In this special case μ_N = D/N, $σ_{N}^{2} = μ_{N} (1 - μ_{N})$ , while b_N = 1 and a_N = 0. Thus Serfling’s inequality (2) becomes

P (\sqrt{n} ({\bar{Y}}_{n} - μ_{N}) \geq λ) \leq exp (- \frac{2 λ^{2}}{1 - f_{n}^{*}}) for all λ > 0,

and the conjectured improvement is

P (\sqrt{n} ({\bar{Y}}_{n} - μ_{N}) \geq λ) \leq exp (- \frac{2 λ^{2}}{1 - f_{n}}) for all λ > 0.

Despite related results due to Chvátal (1979) and Hush and Scovel (2005) it seems that a bound of the form in the last display remains unknown.

We should note that an exponential bound of the Bennett type for the tails of the hypergeometric distribution does follow from results of Vatutin and Mikha ilov (1982) and Ehm (1991); see also Pitman (1997).

Theorem 1

(Ehm, 1991) If 1 ≤ n ≤ D ∧ (N − D), then $\sum_{i = 1}^{n} Y_{i} \overset{d}{=} \sum_{i = 1}^{n} X_{i}$ where X_i ~ Bernoulli(π_i), with π_i ∈ (0, 1), are independent.

It follows from Theorem 1 that

\begin{array}{l} n (D / N) = E (\sum_{1}^{n} Y_{i}) = E (\sum_{1}^{n} X_{i}) = \sum_{i = 1}^{n} π_{i}, \\ n \frac{D}{N} (1 - \frac{D}{N}) (1 - f_{n}) = Var (\sum_{1}^{n} Y_{i}) = Var (\sum_{1}^{n} X_{i}) = \sum_{i = 1}^{n} π_{i} (1 - π_{i}) . \end{array}

Furthermore, by applying Theorem 1 together with Bennett’s inequality (Bennett (1962); see also Shorack and Wellner (1986), page 851), we obtain the following exponential bound for the tail of the hypergeometric distribution:

Corollary 1

If 1 ≤ n ≤ D ∧ (N − D), then for all λ > 0

P (\sqrt{n} ({\bar{Y}}_{n} - μ_{N}) \geq λ) \leq exp (- \frac{λ^{2}}{2 σ_{N}^{2} (1 - f_{n})} ψ (\frac{λ}{\sqrt{n} σ_{N}^{2} (1 - f_{n})}))

where μ_N ≡ D/N, $σ_{N}^{2} \equiv μ_{N} (1 - μ_{N})$ , 1 − f_n ≡ 1 − (n − 1)/(N − 1) is the finite-sampling correction factor, and ψ (y) ≡ 2y⁻²h(1 + y) where h(y) ≡ y(log y − 1) + 1.

Since $σ_{N}^{2} = μ_{N} (1 - μ_{N}) \leq 1 / 4$ , the inequality of the corollary yields a further bound which is quite close to the conjectured Hoeffding type improvement of Serfling’s bound, and which now has the desired finite-sampling correction factor 1 − f_n:

Corollary 2

\begin{array}{l} P (\sqrt{n} ({\bar{Y}}_{n} - μ_{N}) \geq λ) \leq exp (- \frac{2 λ^{2}}{(1 - f_{n})} ψ (\frac{λ}{\sqrt{n} σ_{N}^{2} (1 - f_{n})})) \\ \leq exp (- \frac{2 λ^{2}}{(1 - f_{n})} ψ (\frac{1}{σ_{N}^{2} (1 - f_{n})})) . \end{array}

By considerations related to the work of Talagrand (1994) and León and Perron (2003), the authors of the present paper have succeeded in proving the following exponential bound.

Theorem 2

(Greene and Wellner (2015); Greene (2016)) Suppose that $\sum_{i = 1}^{n} Y_{i} ~ Hypergeometric (n, D, N)$ . Define μ_N = D/N and suppose N > 4 and 2 ≤ n < D ≤ N/2. Then for all $0 < λ < \sqrt{n} / 2$ we have

P (\sqrt{n} ({\bar{Y}}_{n} - μ_{N}) \geq λ) \leq \sqrt{\frac{1}{2 π λ^{2}}} (\frac{1}{2}) \sqrt{(\frac{N - n}{N}) (\frac{\sqrt{n} + 2 λ}{\sqrt{n} - 2 λ}) (\frac{N - n + 2 \sqrt{n} λ}{N - n - 2 \sqrt{n} λ})} \cdot exp (- \frac{2}{1 - \frac{n}{N}} λ^{2}) exp (- \frac{1}{3} (1 + \frac{n^{3}}{{(N - n)}^{3}}) \frac{λ^{4}}{n}) .

The proof of this bound, along with a complete analogue for the hypergeometric distribution of a bound of Talagrand (1994) for the binomial distribution, appears in Greene and Wellner (2015) and in the forthcoming Ph.D. thesis of the first author, Greene (2016).

The bound given in Theorem 2 involves a still better finite-sampling correction factor, namely 1 − f̄_n = 1 − n/N, which has also appeared in Lo (1986) in the context of a Bayesian analysis of finite sampling. Note that as N → ∞, the above bound yields

\underset{N \to \infty}{lim sup} P (\sqrt{n} ({\bar{Y}}_{n} - μ_{N}) \geq λ) \leq \sqrt{\frac{1}{2 π λ^{2}}} (\frac{1}{2}) \sqrt{(\frac{\sqrt{n} + 2 λ}{\sqrt{n} - 2 λ})} \cdot exp (- 2 λ^{2} - \frac{λ^{4}}{3 n}),

a bound which improves slightly on the bound given by León and Perron (2003) in the case of sums of i.i.d. Bernoulli random variables.

Before leaving this section we begin to make a connection to finite-sampling empirical distributions: Now let $F_{n} (t) = n^{- 1} \sum_{i = 1}^{n} 1_{(- \infty, t]} (Y_{i})$ and $F_{N} (t) = N^{- 1} \sum_{i = 1}^{N} 1_{(- \infty, t]} (c_{i})$ . Then it is easily seen that Serfling’s bound yields

P (\sqrt{n} (F_{n} (t) - F_{N} (t)) \geq λ) \leq exp (- \frac{2 λ^{2}}{(1 - (n - 1) / N)})

for each fixed λ > 0 and t ∈ ℝ. Note that since 𝔽_n(t) is equal in distribution to the sample mean of n draws without replacement from an urn containing N F_N(t) 1’s and N (1 − F_N(t)) 0’s, the bound in the last display only involves the hypergeometric special case of Serfling’s inequality. This leads to the following conjecture concerning bounds for the finite sampling empirical process { $\sqrt{n} (F_{n} (t) - F_{N} (t)) : t \in ℝ$ }:

Conjecture

There exist constants C, D > 0 (possibly C = 1 and D = 2?) such that

P (\sqrt{n} sup_{t} (F_{n} (t) - F_{N} (t)) \geq λ) \leq C exp (- \frac{2 λ^{2}}{(1 - f_{n})}),

(3)

P (\sqrt{n} sup_{t} ∣ F_{n} (t) - F_{N} (t) ∣ \geq λ) \leq D exp (- \frac{2 λ^{2}}{(1 - f_{n})})

(4)

for all λ > 0. The possibility that D = 2 is suggested by the corresponding inequality established by Massart (1990) in the case of sampling with replacement.

With these strong indications of the plausibility of an improvement of Serfling’s bound and corresponding improvements in exponential bounds for the uniform-norm deviations of the finite-sampling empirical process, we can now turn to an application of the basic idea in the context of two-sample Kolmogorov-Smirnov statistics.

2. Two-sample tests and finite-sampling connections

To connect this with the two-sample Kolmogorov-Smirnov statistics, suppose that X₁, …, X_m are i.i.d. F and Y₁, …, Y_n are i.i.d. G. Let N = m+n. Then for testing H_c : F = G with F continuous versus K⁺ : F ≥ G (F ≺_s G), K⁻ : G ≥ F, (G ≺_s F), or K : F ≠ G, the classical K-S test statistics are

\begin{array}{l} D_{m, n}^{+} \equiv \sqrt{\frac{m n}{N}} sup_{x} (F_{m} (x) - G_{n} (x)), \\ D_{m, n}^{-} \equiv \sqrt{\frac{m n}{N}} sup_{x} (G_{n} (x) - F_{m} (x)), and \\ D_{m, n} \equiv \sqrt{\frac{m n}{N}} sup_{x} ∣ F_{m} (x) - G_{n} (x) ∣, \end{array}

respectively. It is well-known that under H_c we have

D_{m, n}^{\pm} \to_{d} sup_{0 \leq t \leq 1} U (t), D_{m, n} \to_{d} sup_{0 \leq t \leq 1} ∣ U (t) ∣

if m ∧ n → ∞ where 𝕌 is a standard Brownian bridge process on [0, 1]; see e.g. Hájek and Šidák (1967), pages 189–190, Hodges (1958), and van der Vaart and Wellner (1996), pages 360–366.

Note that with λ_N ≡ m/N and

ℍ_{N} \equiv λ_{N} F_{m} + (1 - λ_{N}) G_{n} = N^{- 1} \sum_{i = 1}^{N} 1_{(- \infty, \cdot]} (Z_{(i)})

where Z₍₁₎ ≤ ··· ≤ Z₍_N₎ are the order statistics of the pooled sample, we have

\begin{array}{l} F_{m} - ℍ_{N} = F_{m} - λ_{N} F_{m} - (1 - λ_{N}) G_{n} = (1 - λ_{N}) (F_{m} - G_{n}), and \\ G_{n} - ℍ_{N} = G_{n} - λ_{N} F_{m} - (1 - λ_{N}) G_{n} = λ_{N} (G_{n} - F_{m}), \end{array}

and hence, with λ̄_N = 1 − λ_N,

\begin{array}{l} \sqrt{\frac{m n}{N}} (F_{m} - G_{n}) = \sqrt{N} \sqrt{λ_{N} {\bar{λ}}_{N}} \frac{1}{{\bar{λ}}_{N}} (F_{m} - ℍ_{N}) = \frac{1}{\sqrt{{\bar{λ}}_{N}}} \sqrt{m} (F_{m} - ℍ_{N}), \\ \sqrt{\frac{m n}{N}} (G_{n} - F_{m}) = \sqrt{N} \sqrt{λ_{N} {\bar{λ}}_{N}} \frac{1}{λ_{N}} (G_{n} - ℍ_{N}) = \frac{1}{\sqrt{λ_{N}}} \sqrt{n} (G_{n} - ℍ_{N}) . \end{array}

Thus, using the independence of the ranks R and the order statistics Z (both based on the pooled sample),

P (D_{m, n}^{+} \geq t) = E_{Z} P_{R} (\sqrt{m} {‖ {(F_{m} - ℍ_{N})}^{+} ‖}_{\infty} > t \sqrt{1 - λ_{N}})

and it would follow from (3) that

\begin{array}{l} P (D_{m, n}^{+} \geq t) \leq C exp (- 2 {\bar{λ}}_{N} t^{2} / (1 - f_{m})) \\ \leq C exp (- 2 (n / N) t^{2} / (n / (N - 1))) \\ = C exp (- 2 \frac{N - 1}{N} t^{2}) \end{array}

(5)

for all t > 0. Similarly it would also follow from (3) that

\begin{array}{l} P (D_{m, n}^{-} \geq t) \leq C exp (- 2 λ_{N} t^{2} / (1 - f_{n})) \\ \leq C exp (- 2 (m / N) t^{2} / (m / (N - 1))) = C exp (- 2 \frac{N - 1}{N} t^{2}) \end{array}

for all t > 0. Combining the two one-sided inequalities yields a (conjectured) two-sided inequality:

\begin{array}{l} P (D_{m, n} \geq t) \equiv P (\sqrt{m n / N} {‖ F_{m} - G_{n} ‖}_{\infty} > t) \\ \leq P (D_{m, n}^{+} > t) + P (D_{m, n}^{-} > t) \\ \leq 2 C exp (- 2 \frac{N - 1}{N} t^{2}) . \end{array}

In the next section we will prove that bounds of this type with C = 1 and D = 2 hold in the special case m = n. For some results for the two-side two-sample Kolmogorov-Smirnov statistic in the case m = n and computational results for m ≠ n, see Wei and Dudley (2012). These authors were aiming for a bound of the form C exp(−2t²) both for m = n and m ≠ n. The above heuristics seem to suggest that a bound of the form C exp(−2((N −1)/N)t²) might be a natural goal.

3. An exponential bound for $D_{m, n}^{+}$ when m = n

Throughout this section we suppose that the null hypothesis H_c holds: G = F is a continuous distribution function.

From Hodges (1958), (2.3) on page 473 (together with $t = \sqrt{m n / N d}$ and d = a/n from page 473, line 4), when m = n (so N = 2n),

\begin{array}{l} P (D_{n, n}^{+} \geq t) = P (\sqrt{\frac{m n}{N}} sup_{x} (F_{m} (x) - G_{n} (x)) \geq \sqrt{\frac{m n}{N}} \frac{a}{n}) \\ = P (\sqrt{\frac{n^{2}}{2 n}} sup_{x} (F_{n} (x) - G_{n} (x)) \geq \sqrt{\frac{n^{2}}{2 n}} \frac{a}{n}) \\ = \frac{(\begin{matrix} 2 n \\ n - a \end{matrix})}{(\begin{matrix} 2 n \\ n \end{matrix})} for a = 1, 2, \dots, n . \end{array}

We first compare the exact probability from the last display with the possible upper bounds

\begin{array}{l} P B_{2} (n) = exp (- 2 \frac{2 n - 1}{2 n} \frac{a^{2}}{2 n}); \\ P B_{3} (n) = exp (- 2 \frac{a^{2}}{2 n}) . \end{array}

For n = 3 we find that

a	0	1	2	3
E(xact)	1	.75	0.3	0.05

PB2	1	0.7574	0.3291	0.0821
PB2 - E	0	0.0074	0.0291	0.0321

PB3	1	0.7165	0.2636	0.0498
PB3 - E	0	−0.0335	−0.0364	−0.0002

Open in a new tab

Further comparisons for m = n = 10, 12, 13, 14, 15, 25 support the validity of the bound involving the finite sampling fraction f_n. These comparisons agree with the following theorem:

Theorem 3

A. When m = n (so that N = 2n) the second bound in (5) holds for all n ≥ 1 with C = 1:

P (D_{n, n}^{+} \geq t) = P (\sqrt{\frac{m n}{N}} sup_{x} (F_{m} (x) - G_{n} (x)) \geq t)

(6)

\leq exp (- 2 \frac{N - 1}{N} t^{2}) for all t > 0.

(7)

Equivalently, when m = n,

P (\sqrt{\frac{m n}{N}} \sqrt{\frac{N - 1}{N}} sup_{x} (F_{m} (x) - G_{n} (x)) \geq t) \leq exp (- 2 t^{2})

(8)

for all t > 0.

B. On the other hand, when m = n (so that N = 2n), for all n ≥ 1 we have

P (D_{n, n}^{+} \geq t) > exp (- 2 t^{2}) for all 0 < t < 1.

Proof

A. Since the inequality holds trivially for a = 0, and can be shown easily by numerical computation for a ∈ {1, 2, 3} (see the Table above), it suffices to show that

\frac{(\begin{matrix} 2 n \\ n - a \end{matrix})}{(\begin{matrix} 2 n \\ n \end{matrix})} \leq exp (- 2 \frac{2 n - 1}{2 n} \frac{a^{2}}{2 n})

for a ∈ {1,…, n} and n ≥ 4. Furthermore, we will show that it holds for a = n in a separate argument, and thus it suffices to show that it holds for a ∈ {1,…, n − 1} and n ≥ 4. By rewriting the numerator and denominator on the left side of the last display, the desired inequality can be rewritten as

\frac{n! n!}{(n - a)! (n + a)!} \leq exp (- \frac{2 n - 1}{2 n} \cdot \frac{a^{2}}{n}) .

By taking logarithms we can rewrite this as

log (\frac{n! n!}{(n - a)! (n + a)!}) + \frac{2 n - 1}{2 n} \frac{a^{2}}{n} \leq 0.

(9)

Now by Stirling’s formula with bounds (see e.g. Nanjundiah (1959)) we have

\sqrt{2 π k} {(\frac{k}{e})}^{k} exp (\frac{1}{12 k} - \frac{1}{360 k^{3}}) \leq k! \leq \sqrt{2 π k} {(\frac{k}{e})}^{k} exp (\frac{1}{12 k}) .

(10)

Using these bounds in (9) we find that the left side is bounded above by

\begin{array}{l} - n {(1 - \frac{a}{n}) log (1 - \frac{a}{n}) + (1 + \frac{a}{n}) log (1 + \frac{a}{n})} \\ - \frac{1}{2} (log (1 - \frac{a}{n}) + log (1 + \frac{a}{n})) \\ + {\frac{1}{6 n} - \frac{1}{12 (n - a)} - \frac{1}{12 (n + a)} + \frac{1}{360} (\frac{1}{{(n - a)}^{3}} + \frac{1}{{(n + a)}^{3}})} \\ + \frac{a^{2}}{n} - \frac{a^{2}}{2 n^{2}} \\ \equiv I_{1} + I_{2} + I_{3} + I_{4} . \end{array}

Note that I₁ and I₂ are as defined in Wei and Dudley (2012) page 640, while I₃ and I₄ differ. From Wei and Dudley (2012) page 640,

I_{1} \leq - \frac{a^{2}}{n} - \frac{a^{4}}{6 n^{3}} - \frac{a^{6}}{15 n^{5}} - \frac{a^{8}}{28 n^{7}},

(11)

(which is proved by Taylor expansion of (1 + x) log(1 + x) + (1 − x) log(1 − x) about x = 0), and

I_{2} \leq \frac{a^{2}}{2 n^{2}} + \frac{a^{4}}{4 n^{4}} + \frac{a^{6}}{6 n^{6} (1 - a^{2} / n^{2})} .

(12)

Note that the lead term in the bound (11) for I₁ and lead term of I₄ cancel each other, while the first term of the bound (12) for I₂ cancels the second term of I₄. Adding the bounds yields

\begin{array}{l} I_{1} + I_{2} + I_{3} + I_{4} \leq - \frac{a^{4}}{12 n^{3}} - \frac{a^{4}}{12 n^{3}} - \frac{a^{6}}{15 n^{5}} - \frac{a^{8}}{28 n^{7}} + \frac{a^{4}}{4 n^{4}} + \frac{a^{6}}{6 n^{6} (1 - a^{2} / n^{2})} + I_{3} \\ = - \frac{a^{4}}{n^{3}} (\frac{1}{12} - \frac{1}{4 n}) - \frac{a^{4}}{12 n^{3}} - \frac{a^{6}}{n^{5}} (\frac{1}{15} - \frac{1}{6 n (1 - a^{2} / n^{2})}) - \frac{a^{8}}{28 n^{7}} + I_{3} \\ \leq - \frac{a^{4}}{n^{3}} (\frac{1}{12} - \frac{1}{4 n}) - \frac{a^{4}}{12 n^{3}} - \frac{a^{6}}{n^{5}} (\frac{1}{15} - \frac{1}{6 (2 - 1 / n)}) - \frac{a^{8}}{28 n^{7}} + I_{3} \\ \leq - \frac{a^{4}}{n^{3}} (\frac{1}{12} - \frac{1}{4 n}) - \frac{a^{4}}{12 n^{3}} + \frac{3 a^{6}}{105 n^{5}} - \frac{a^{8}}{28 n^{7}} + I_{3} \\ = - \frac{a^{4}}{n^{3}} (\frac{1}{12} - \frac{1}{4 n}) - \frac{a^{4}}{12 n^{3}} (1 - \frac{36 a^{2}}{105 n^{2}}) - \frac{a^{8}}{28 n^{7}} + I_{3} \\ \leq - \frac{a^{4}}{n^{3}} (\frac{1}{12} - \frac{1}{4 n}) - \frac{a^{4}}{21 n^{3}} - \frac{a^{8}}{28 n^{7}} + I_{3} \\ \equiv R_{12} + I_{3} . \end{array}

Now R₁₂ ≤ 0 for n ≥ 4 and I₃ ≤ 0 for all n ≥ 2 and a ∈ {1, …, n − 1} by the following argument:

\begin{array}{l} I_{3} = \frac{1}{6 n} - \frac{1}{12 (n - a)} - \frac{1}{12 (n + a)} + \frac{1}{360} (\frac{1}{{(n + a)}^{3}} + \frac{1}{{(n - a)}^{3}}) \\ = - \frac{1}{6} \frac{a^{2}}{n (n^{2} - a^{2})} + \frac{2}{360} \frac{n (n^{2} + 3 a^{2})}{{(n^{2} - a^{2})}^{3}} \\ = - \frac{1}{6 n (n^{2} - a^{2})} {a^{2} - \frac{2}{60} \frac{n^{2} (n^{2} + 3 a^{2})}{{(n^{2} - a^{2})}^{2}}} \\ = - \frac{1}{6 n (n^{2} - a^{2})} {a^{2} - \frac{1}{30} \frac{n^{2} (n^{2} - a^{2} + 4 a^{2})}{{(n^{2} - a^{2})}^{2}}} \\ = - \frac{1}{6 n (n^{2} - a^{2})} {a^{2} (1 - \frac{2}{15} \frac{n^{2}}{{(n^{2} - a^{2})}^{2}}) - \frac{n^{2} (n^{2} - a^{2})}{30 {(n^{2} - a^{2})}^{2}}} \\ \leq - \frac{1}{6 n (n^{2} - a^{2})} {a^{2} (1 - \frac{2}{15} \frac{1}{3}) - \frac{n^{2}}{30 (n^{2} - a^{2})}} \\ by using a \leq n - 1, so n^{2} - a^{2} \geq n^{2} - {(n - 1)}^{2} = (2 n - 1), \\ and n^{2} / {(2 n - 1)}^{2} \leq 1 / 3 for n \geq 4, \\ = - \frac{1}{6 n (n^{2} - a^{2})} {a^{2} (1 - \frac{2}{3 \cdot 15}) - \frac{n^{2} - a^{2} + a^{2}}{30 (n^{2} - a^{2})}} \\ = - \frac{1}{6 n (n^{2} - a^{2})} {a^{2} (1 - \frac{2}{3 \cdot 15} - \frac{1}{30 (n^{2} - a^{2})}) - \frac{1}{30}} \\ \leq - \frac{1}{6 n (n^{2} - a^{2})} {a^{2} (1 - \frac{2}{3 \cdot 15} - \frac{1}{30 (2 n - 1)}) - \frac{1}{30}} \\ \leq - \frac{1}{6 n (n^{2} - a^{2})} {a^{2} (1 - \frac{31}{630}) - \frac{1}{30}} \end{array}

for n ≥ 4. This is a decreasing function of a for fixed n, and hence to show that it is < 0 it suffices to check it for a = 1. But when a = 1 the right side above equals

- \frac{1}{6 n (n^{2} - 1^{2})} {1 - \frac{31}{630} - \frac{1}{30}} = - \frac{1}{n (n^{2} - 1)} {\frac{289}{6 \cdot 315}} < - \frac{1}{n (n^{2} - 1)} {\frac{280}{6 \cdot 315}} = - \frac{4}{27 n (n^{2} - 1)} < 0,

so we conclude that I₃ < 0 for a ∈ {1, …, n − 1} and n ≥ 4. It remains only to show that the desired bound holds for a = n; that is we have

\frac{1}{(\begin{matrix} 2 n \\ n \end{matrix})} \leq exp (- (n - 1 / 2)) .

But this can easily be shown via the Stirling formula bounds (10).

Thus

exp (I_{1} + I_{2} + I_{3}) \leq exp (- I_{4}) = exp (- \frac{2 n - 1}{2 n} \frac{a^{2}}{n}),

and the claimed inequality holds for all n ≥ 4. Since the bounds hold for n = 1, 2, 3 by direct numerical computation, the claim follows.

B. We first define

\begin{array}{l} r_{n} (a) \equiv log {\frac{(\begin{matrix} 2 n \\ n - a \end{matrix}) / (\begin{matrix} 2 n \\ n \end{matrix})}{exp (- 2 a^{2} / (2 n))}} \\ = log (\begin{matrix} 2 n \\ n - a \end{matrix}) - log (\begin{matrix} 2 n \\ n \end{matrix}) + \frac{a^{2}}{n} . \end{array}

Since we can take $t = a / \sqrt{2 n}$ , it suffices to show that r_n(a) > 0 for $1 \leq a \leq ⌊ \sqrt{2 n} ⌋$ . We will first show this for n ≥ 31. Then the proof will be completed by checking the inequality numerically for $1 \leq a \leq ⌊ \sqrt{2 n} ⌋$ and n ∈ {1, …, 30}.

By using the Stirling formula bounds of (10) as in the proof of A, but now with upper bounds replaced by lower bounds, we find that

\begin{array}{l} r_{n} (a) = 2 log (n!) - log (n - a)! - log (n + a)! + \frac{a^{2}}{n} \\ \geq - n {(1 - \frac{a}{n}) log (1 - \frac{a}{n}) + (1 + \frac{a}{n}) log (1 + \frac{a}{n})} - \frac{1}{2} {log (1 - \frac{a}{n}) + log (1 + \frac{a}{n})} + \frac{1}{6 n} - \frac{1}{180 n^{3}} - \frac{1}{12 (n - a)} - \frac{1}{12 (n + a)} + \frac{a^{2}}{n} \\ \equiv L_{1} + L_{2} + L_{3} + L_{4} . \end{array}

As in (11) and (12) and the displays following them, we find that

\begin{array}{l} L_{1} & \geq & - n {\frac{a^{2}}{n^{2}} + \frac{a^{4}}{6 n^{4}} + \frac{a^{6}}{15 n^{6}} + \frac{a^{8}}{28 n^{8}} (\frac{n^{2}}{n^{2} - a^{2}})}, \\ L_{2} & \geq & \frac{a^{2}}{2 n^{2}} + \frac{a^{4}}{4 n^{4}} + \frac{a^{6}}{6 n^{6}}, \\ L_{3} & = & - \frac{a^{2}}{6 n (n^{2} - a^{2})} - \frac{1}{180 n^{3}}, \\ L_{4} & = & \frac{a^{2}}{n} . \end{array}

Putting these pieces together and rearranging we find that

r_{n} (a) \geq [\frac{31 a^{2}}{64 n^{2}} + \frac{a^{4}}{4 n^{4}} + \frac{a^{6}}{6 n^{6}} - \frac{a^{4}}{6 n^{3}} - \frac{a^{6}}{15 n^{5}} - \frac{a^{8}}{28 n^{5}} (\frac{1}{n^{2} - a^{2}})] + [\frac{a^{2}}{64 n^{2}} + \frac{1}{6 n} - \frac{1}{180 n^{3}} - \frac{1}{12 (n + a)} - \frac{1}{12 (n - a)}]

(13)

= : K_{1} + K_{2} > 0

(14)

will prove the claim. Note in (13) that the a²/n term cancelled by virtue of the lower bound estimate based on the Taylor expansion of (1 + x) log(1 + x) + (1 − x) log(1 − x). First note that

\begin{array}{l} K_{2} = \frac{a^{2}}{64 n^{2}} + \frac{1}{6 n} - \frac{1}{180 n^{3}} - \frac{1}{12 (n + a)} - \frac{1}{12 (n - a)} \\ = \frac{a^{2} [28 n^{3} - 45 a^{2} n] + a^{2} [16 n^{3} - 480 n^{2}] + [a^{2} n^{3} + 16 a^{2} - 16 n^{2}]}{2880 n^{3} (n - a) (n + a)} \end{array}

The denominator of the right-hand-side is clearly positive for $a \in {1, 2, \dots, ⌊ \sqrt{2 n} ⌋}$ . By inspection, we can see the term a²n³ + 16a² − 16n² in the numerator is increasing in a. Picking a = 1, we then see n³ + 16 − 16n² > 0 for n ≥ 31, and thus a²n³ + 16a² − 16n² > 0 for all admissible a. Next, the polynomial 28n³ − 45a²n is decreasing in the admissible a. For any fixed n, the minimum value it can attain is then larger than 28n³ − 90n². For n ≥ 31, this quantity is positive. Therefore, 28n³ − 45a²n > 0 for all admissible a when n ≥ 31. Finally, note that 16n³ − 480n² = 16n²(n − 30) > 0 for n ≥ 31. Hence we have shown K₂ > 0.

We next have

\begin{array}{l} K_{1} = [\frac{31 a^{2}}{64 n^{2}} - \frac{a^{4}}{6 n^{3}}] + [\frac{a^{4}}{4 n^{4}} - \frac{a^{6}}{15 n^{5}}] + [\frac{a^{6}}{6 n^{6}} - \frac{a^{8}}{28 n^{5}} (\frac{1}{n^{2} - a^{2}})] \\ = [(\frac{a^{2}}{192 n^{3}}) (93 n - 32 a^{2})] + [(\frac{a^{4}}{60 n^{5}}) (15 n - 4 a^{2})] + [(\frac{a^{6}}{84 n^{6} (n^{2} - a^{2})}) (14 n^{2} - 3 a^{2} n - 14 a^{2})] \\ \equiv [(α) (93 n - 32 a^{2})] + [(β) (15 n - 4 a^{2})] + [(γ) (14 n^{2} - 3 a^{2} n - 14 a^{2})] . \end{array}

(15)

Again since $a \in {1, \dots, ⌊ \sqrt{2 n} ⌋}$ , it is clear that α, β, and γ in (15) are positive for all admissible choices of a. Hence, the sign of each bracketed term will be dictated by the remaining polynomial in a. It is also clear from their form that each polynomial is decreasing in a; hence we need only evaluate at the endpoints to determine positivity. But $93 n - 32 {(\sqrt{2 n})}^{2} = 29 n > 0, 15 n - 4 {(\sqrt{2 n})}^{2} = 15 n - 8 n = 7 n > 0$ , and $14 n^{2} - 3 {(\sqrt{2 n})}^{2} n - 14 {(\sqrt{2 n})}^{2} = 14 n^{2} - 6 n^{2} - 28 n = 4 n (2 n - 7) > 0$ with the final inequality following as n ≥ 31. Hence all terms in (15) are positive and so K₁ > 0. Together with K₂ > 0 as proved above, the claim is proved for n ≥ 31.

Since the bound holds for $a \in {1, \dots, ⌊ \sqrt{2 n} ⌋}$ and n ∈ {1, …, 30} by direct numerical computation, the claim follows.

4. Some comparisons and connections

4.1. Comparisons: two-sided tail bounds

Here we compare and contrast our results with those of Wei and Dudley (2012). As in Wei and Dudley (2012) (see also Wei and Dudley (2011)), we say that the DKW inequality holds for given m, n and C if

P (D_{m, n} \geq t) \leq C exp (- 2 t^{2}) for all t > 0,

and we say that the DKWM inequality holds for given m, n if the inequality in the last display holds with C = 2. Wei and Dudley (2012) prove the following theorem:

Theorem 4

(Wei and Dudley, 2012) For m = n in the two sample case:

The DKW inequality always holds with C = e≐2.71828.
For m = n ≥ 4, the smallest n such that H_c can be rejected at level 0.05, the DKW inequality holds with C = 2.16863.
The DKWM inequality holds for all m = n ≥ 458.
For each m = n < 458, the DKWM inequality fails for some t of the form $t = k / \sqrt{2 n}$ .
For each m = n < 458, the DKW inequality holds for C = 2(1 + δ_n) for some δ_n > 0 where, for 12 ≤ n ≤ 457,
$δ_{n} < - \frac{0.07}{n} + \frac{40}{n^{2}} - \frac{400}{n^{3}} .$

For comparison, the following theorem follows from Theorem 3. We say that the modified DKWM inequality holds for given m, n if

P (D_{m, n} \geq t) \leq 2 exp (- 2 (\frac{N - 1}{N}) t^{2}) for all t > 0,

Theorem 5

For m = n in the two sample case:

For all n ≥ 1 the modified DKWM inequality holds.
Alternatively, for the modified Kolmogorov statistic given by
$D_{m, n}^{\mod} \equiv \sqrt{\frac{N - 1}{N}} \sqrt{\frac{m n}{N}} {‖ F_{m} - G_{n} ‖}_{\infty},$

the DKWM inequality holds for all n ≥ 1.

We are not claiming that our “modified” version of the DKWM inequality improves on the results of Wei and Dudley (2012): it is clearly worse for m = n > 458. On the other hand, it may provide a useful clue to the formulation of DKWM type exponential bounds for two-sample Kolmogorov statistics when m ≠ n. In this direction we have the following conjecture:

Conjecture

For any m ≠ n,

P (D_{m, n}^{+} > t) \leq exp (- 2 (\frac{N - 1}{N}) t^{2}) for all t > 0

(16)

P (D_{m, n} > t) \leq 2 exp (- 2 (\frac{N - 1}{N}) t^{2}) for all t > 0.

(17)

That is, we conjecture that the modified DKWM inequality holds for all m, n ≥ 1. This is supported by all the numerical experiments we have conducted so far.

4.2. Comparisons: one-sided tail bounds

Wei and Dudley (2012) do not treat bounds for the one-sided statistics. Here we summarize our results with a theorem which parallels their Theorem 4 above. In analogy with their terminology, we say that the one-sided DKW inequality holds for given m, n and C if

P (D_{m, n}^{+} \geq t) \leq C exp (- 2 t^{2}) for all t > 0,

and we say that the one-sided DKWM inequality holds for given m, n if the inequality in the last display holds with C = 1. Moreover, we say that the modified one-sided DKWM inequality holds for given m, n if

P (D_{m, n}^{+} \geq t) \leq exp (- 2 (\frac{N - 1}{N}) t^{2}) for all t > 0.

Theorem 6

For m = n in the two sample case:

The one-sided DKW inequality holds for all n ≥ 1 with C = e/2≐2.71828/2 = 1.35914. For this range of n, C = e/2 is sharp since equality occurs for n = 1 and $t = 1 / \sqrt{2}$ (or $a = t \sqrt{2 n} = 1$ ).
For m = n ≥ 5, the one-sided DKW inequality holds with C = 2.16863/2 = 1.084315.
The one-sided DKWM inequality fails for all m = n ≥ 1.
The modified one-sided DKWM inequality holds for all m = n ≥ 1.

Proof

To prove (a), we first note that Wei and Dudley (2012) showed that for n ≥ 108 we have

\begin{array}{l} \frac{(\begin{matrix} 2 n \\ n + a \end{matrix})}{(\begin{matrix} 2 n \\ n \end{matrix})} < exp (- a^{2} / n) for \sqrt{3 n} \leq a \leq n \\ < (e / 2) exp (- a^{2} / n) . \end{array}

Thus to prove that the claimed inequality holds for n ≥ 108, it suffices to show that it holds for $t_{0} \sqrt{n} \leq a \leq \sqrt{3} \sqrt{n}$ where $t_{0} \equiv \sqrt{(1 / 2) log (e / 2)}$ is the smallest value of t for which the bound is less than or equal to 1.

Proceeding as in the proof of Theorem 3-A, we find that we want to show that

log \frac{n! n!}{(n + a)! (n - a)!} + \frac{a^{2}}{n} - log (e / 2) < 0 for t_{0} \sqrt{n} \leq a \leq \sqrt{3} \sqrt{n} .

By the same arguments used in the proof of Theorem 3-A, we find that the left side in the last display is bounded above by

- \frac{a^{4}}{6 n^{3}} - \frac{a^{6}}{15 n^{5}} - \frac{a^{8}}{28 n^{7}} + \frac{a^{4}}{4 n^{4}} + \frac{a^{6}}{6 n^{6} (1 - a^{2} / n^{2})} + I_{3} + \frac{a^{2}}{2 n^{2}} - log (e / 2) \equiv K_{1} + K_{2} .

Now K₁ ≤ 0 for n ≥ 4 and a ∈ {1, …, n − 1} by the previous proof, and

K_{2} \equiv \frac{a^{2}}{2 n^{2}} - log (e / 2) < 0 for all a \leq \sqrt{3} \sqrt{n}

\frac{3}{2 n} < log (e / 2), or n > \frac{3}{2 log (e / 2)} ≐ 4.888 \dots .

This completes the proof for n ≥ 108. Numerical computation easily shows that the claim holds for all n ∈ {1, …, 107}.

The proof of (b) is similar upon replacing e/2 by 1.084315, and again computing numerically for n ∈ {1, …, 107}.

Corollary 3

For n ≥ 5 and C = 1.084315,

\begin{array}{l} P (D_{n, n}^{+} \geq t) \leq min {exp (- 2 (1 - 1 / N) t^{2}), C exp (- 2 t^{2})} \\ = {\begin{cases} C exp (- 2 t^{2}), & t \geq t_{0} \equiv \sqrt{n log C} ≐ .285 \sqrt{n}, \\ exp (- 2 (1 - 1 / N) t^{2}), & t \leq t_{0} \equiv \sqrt{n log C} . \end{cases} \end{array}

Figures 1 and 2 illustrate Theorem 6.

Difference between approximations and exact one-sided probabilities $P (D_{n, n}^{+} > t)$ for n = 128 and a ∈ {1, 2, …, 128}. Negative values indicate the exact probability exceeds the approximation. Serfling DKWM is the bound obtained via the heuristic of section 2, using the sampling fraction $1 - f_{n}^{*} = (N - n + 1) / N$ . Modified DKWM uses the sampling fraction 1 − *f_n* = (N − n)/(N − 1). DKWM uses the fraction from Wei and Dudley.

Difference between approximations and exact one-sided probabilities $P (D_{n, n}^{+} > t)$ for n = 23 and a ∈ {1, 2, …, 23}. Negative values indicate the exact probability exceeds the approximation. DKWM6a corresponds to the DKWM bound with the constant e/2, discussed in Theorem 6(a). DKWM6b corresponds to the DKWM bound with the constant 2.16863/2, discussed in Theorem 6(b).

Acknowledgments

The second author owes thanks to Werner Ehm for several helpful conversations and to Martin Wells for pointing out the Pitman reference. We also owe thanks to the referee for a number of helpful comments and suggestions.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Contributor Information

Evan Greene, Department of Statistics, University of Washington, Seattle, WA 98195-4322.

Jon A. Wellner, Department of Statistics, University of Washington, Seattle, WA 98195-4322

References

Bennett G. Probability inequalities for the sum of independent random variables. J Amer Statist Assoc. 1962;57:33–45. [Google Scholar]
Chvátal V. The tail of the hypergeometric distribution. Discrete Math. 1979;25(3):285–287. [Google Scholar]
Dvoretzky A, Kiefer J, Wolfowitz J. Asymptotic minimax character of the sample distribution function and of the classical multinomial estimator. Ann Math Statist. 1956;27:642–669. [Google Scholar]
Ehm W. Binomial approximation to the Poisson binomial distribution. Statist Probab Lett. 1991;11(1):7–16. [Google Scholar]
Greene E. PhD thesis. University of Washington; 2016. Finite sampling exponential bounds with applications to empirical processes. [Google Scholar]
Greene E, Wellner JA. Exponential bounds for the hypergeometric distribution. Tech Rep. 2015 doi: 10.3150/15-BEJ800. arXiv:1507.08298. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hájek J, Šidák Z. Theory of rank tests. Academic Press; New York: 1967. [Google Scholar]
Hodges JL., Jr The significance probability of the Smirnov two-sample test. Ark Mat. 1958;3:469–486. [Google Scholar]
Hoeffding W. Probability inequalities for sums of bounded random variables. J Amer Statist Assoc. 1963;58:13–30. [Google Scholar]
Hush D, Scovel C. Concentration of the hypergeometric distribution. Statist Probab Lett. 2005;75(2):127–132. [Google Scholar]
Kemperman JHB. Moment problems for sampling without replacement. I Nederl Akad Wetensch. Proc Ser A 76=Indag Math. 1973a;35:149–164. [Google Scholar]
Kemperman JHB. Moment problems for sampling without replacement. II Nederl Akad Wetensch. Proc Ser A 76=Indag Math. 1973b;35:165–180. [Google Scholar]
Kemperman JHB. Moment problems for sampling without replacement. III Nederl Akad Wetensch. Proc Ser A 76=Indag Math. 1973c;35:181–188. [Google Scholar]
León CA, Perron F. Extremal properties of sums of Bernoulli random variables. Statist Probab Lett. 2003;62(4):345–354. [Google Scholar]
Lo AY. Bayesian statistical inference for sampling a finite population. Ann Statist. 1986;14(3):1226–1233. [Google Scholar]
Massart P. The tight constant in the Dvoretzky-Kiefer-Wolfowitz inequality. Ann Probab. 1990;18(3):1269–1283. [Google Scholar]
Nanjundiah TS. Note on Stirling’s formula. Amer Math Monthly. 1959;66:701–703. [Google Scholar]
Pitman J. Probabilistic bounds on the coefficients of polynomials with only real zeros. J Combin Theory Ser A. 1997;77(2):279–303. [Google Scholar]
Rice JA. Mathematical Statistics and Data Analysis. 3. Duxbury Press; Belmont, CA: 2007. [Google Scholar]
Serfling RJ. Probability inequalities for the sum in sampling without replacement. Ann Statist. 1974;2:39–48. [Google Scholar]
Shorack GR, Wellner JA. Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics. John Wiley & Sons Inc; New York: 1986. Empirical processes with applications to statistics. [Google Scholar]
Talagrand M. Sharper bounds for Gaussian and empirical processes. Ann Probab. 1994;22(1):28–76. [Google Scholar]
van der Vaart AW, Wellner JA. Springer Series in Statistics. Springer-Verlag; New York: 1996. Weak convergence and empirical processes. [Google Scholar]
Vatutin VA, Mikhaĭlov VG. Limit theorems for the number of empty cells in an equiprobable scheme for the distribution of particles by groups. Teor Veroyatnost i Primenen. 1982;27(4):684–692. [Google Scholar]
Wei F, Dudley RM. Tech rep. MIT, Department of Mathematics; 2011. Dvoretzky-Kiefer-Wolfowitz inequalities for the two-sample case. [Google Scholar]
Wei F, Dudley RM. Two-sample Dvoretzky-Kiefer-Wolfowitz inequalities. Statist Probab Lett. 2012;82(3):636–644. [Google Scholar]

[R1] Bennett G. Probability inequalities for the sum of independent random variables. J Amer Statist Assoc. 1962;57:33–45. [Google Scholar]

[R2] Chvátal V. The tail of the hypergeometric distribution. Discrete Math. 1979;25(3):285–287. [Google Scholar]

[R3] Dvoretzky A, Kiefer J, Wolfowitz J. Asymptotic minimax character of the sample distribution function and of the classical multinomial estimator. Ann Math Statist. 1956;27:642–669. [Google Scholar]

[R4] Ehm W. Binomial approximation to the Poisson binomial distribution. Statist Probab Lett. 1991;11(1):7–16. [Google Scholar]

[R5] Greene E. PhD thesis. University of Washington; 2016. Finite sampling exponential bounds with applications to empirical processes. [Google Scholar]

[R6] Greene E, Wellner JA. Exponential bounds for the hypergeometric distribution. Tech Rep. 2015 doi: 10.3150/15-BEJ800. arXiv:1507.08298. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] Hájek J, Šidák Z. Theory of rank tests. Academic Press; New York: 1967. [Google Scholar]

[R8] Hodges JL., Jr The significance probability of the Smirnov two-sample test. Ark Mat. 1958;3:469–486. [Google Scholar]

[R9] Hoeffding W. Probability inequalities for sums of bounded random variables. J Amer Statist Assoc. 1963;58:13–30. [Google Scholar]

[R10] Hush D, Scovel C. Concentration of the hypergeometric distribution. Statist Probab Lett. 2005;75(2):127–132. [Google Scholar]

[R11] Kemperman JHB. Moment problems for sampling without replacement. I Nederl Akad Wetensch. Proc Ser A 76=Indag Math. 1973a;35:149–164. [Google Scholar]

[R12] Kemperman JHB. Moment problems for sampling without replacement. II Nederl Akad Wetensch. Proc Ser A 76=Indag Math. 1973b;35:165–180. [Google Scholar]

[R13] Kemperman JHB. Moment problems for sampling without replacement. III Nederl Akad Wetensch. Proc Ser A 76=Indag Math. 1973c;35:181–188. [Google Scholar]

[R14] León CA, Perron F. Extremal properties of sums of Bernoulli random variables. Statist Probab Lett. 2003;62(4):345–354. [Google Scholar]

[R15] Lo AY. Bayesian statistical inference for sampling a finite population. Ann Statist. 1986;14(3):1226–1233. [Google Scholar]

[R16] Massart P. The tight constant in the Dvoretzky-Kiefer-Wolfowitz inequality. Ann Probab. 1990;18(3):1269–1283. [Google Scholar]

[R17] Nanjundiah TS. Note on Stirling’s formula. Amer Math Monthly. 1959;66:701–703. [Google Scholar]

[R18] Pitman J. Probabilistic bounds on the coefficients of polynomials with only real zeros. J Combin Theory Ser A. 1997;77(2):279–303. [Google Scholar]

[R19] Rice JA. Mathematical Statistics and Data Analysis. 3. Duxbury Press; Belmont, CA: 2007. [Google Scholar]

[R20] Serfling RJ. Probability inequalities for the sum in sampling without replacement. Ann Statist. 1974;2:39–48. [Google Scholar]

[R21] Shorack GR, Wellner JA. Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics. John Wiley & Sons Inc; New York: 1986. Empirical processes with applications to statistics. [Google Scholar]

[R22] Talagrand M. Sharper bounds for Gaussian and empirical processes. Ann Probab. 1994;22(1):28–76. [Google Scholar]

[R23] van der Vaart AW, Wellner JA. Springer Series in Statistics. Springer-Verlag; New York: 1996. Weak convergence and empirical processes. [Google Scholar]

[R24] Vatutin VA, Mikhaĭlov VG. Limit theorems for the number of empty cells in an equiprobable scheme for the distribution of particles by groups. Teor Veroyatnost i Primenen. 1982;27(4):684–692. [Google Scholar]

[R25] Wei F, Dudley RM. Tech rep. MIT, Department of Mathematics; 2011. Dvoretzky-Kiefer-Wolfowitz inequalities for the two-sample case. [Google Scholar]

[R26] Wei F, Dudley RM. Two-sample Dvoretzky-Kiefer-Wolfowitz inequalities. Statist Probab Lett. 2012;82(3):636–644. [Google Scholar]

PERMALINK

Finite sampling inequalities: an application to two-sample Kolmogorov-Smirnov statistics

Evan Greene

Jon A Wellner

Abstract

1. Introduction: Serfling’s finite sampling exponential bound

Theorem 1

Corollary 1

Corollary 2

Theorem 2

Conjecture

2. Two-sample tests and finite-sampling connections

3. An exponential bound for $D_{m, n}^{+}$ when m = n

Theorem 3

Proof

4. Some comparisons and connections

4.1. Comparisons: two-sided tail bounds

Theorem 4

Theorem 5

Conjecture

4.2. Comparisons: one-sided tail bounds

Theorem 6

Proof

Corollary 3

Figure 1.

Figure 2.

Acknowledgments

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Finite sampling inequalities: an application to two-sample Kolmogorov-Smirnov statistics

Evan Greene

Jon A Wellner

Abstract

1. Introduction: Serfling’s finite sampling exponential bound

Theorem 1

Corollary 1

Corollary 2

Theorem 2

Conjecture

2. Two-sample tests and finite-sampling connections

3. An exponential bound for Dm,n+ when m = n

Theorem 3

Proof

4. Some comparisons and connections

4.1. Comparisons: two-sided tail bounds

Theorem 4

Theorem 5

Conjecture

4.2. Comparisons: one-sided tail bounds

Theorem 6

Proof

Corollary 3

Figure 1.

Figure 2.

Acknowledgments

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

3. An exponential bound for $D_{m, n}^{+}$ when m = n