Abstract
In this article, we propose rigorous sample size methods for estimating the means of random variables, which require no knowledge of the underlying distributions except that the random variables are known to be bounded in a given interval. Our sample size methods can be applied without assuming that the samples are independent and identically distributed. Moreover, our sample size methods involve no approximation. We demonstrate that the sample complexity can be significantly reduced by using a mixed error criterion. We derive explicit sample size formulae to ensure the statistical accuracy of estimation.
1 Introduction
Many problems in engineering and the sciences boil down to estimating the mean value of a random variable [18, 19]. More formally, let X be a random variable with mean μ. It is a frequent problem to estimate μ based on samples X1, X2, ⋯, Xn of X, which are defined on a probability space (Ω, ℱ, ℙμ), where the subscript in the probability measure ℙμ indicates its association with μ. In many situations, no information on the distribution of X is available except that X is known to be bounded in some interval [a, b]. For example, in clinical trials, many quantities under investigation are bounded random variables, such as the biomarkers EGFR, K-Ras, B-Raf, Akt, etc. (see, e.g., [3, 13, 23] and the references therein). Moreover, the samples X1, X2, ⋯, Xn may not be independent and identically distributed (i.i.d.). This gives rise to the significance of estimating μ under the assumption that
| ℙ{a ≤ Xk ≤ b} = 1 for all k ∈ ℕ, | (1) |
| 𝔼[Xk ∣ ℱk−1] = μ almost surely for all k ∈ ℕ, | (2) |
where ℕ denotes the set of positive integers, and {ℱk, k = 0, 1, ⋯} is a sequence of σ-subalgebras such that {∅, Ω} = ℱ0 ⊂ ℱ1 ⊂ ℱ2 ⊂ ⋯ ⊂ ℱ, with ℱk being generated by X1, ⋯, Xk. Our motivation for considering the estimation of μ under the dependency assumption (2) is twofold. First, from a theoretical point of view, we want the results to hold under the most general conditions. Clearly, (2) is satisfied in the special case that X1, X2, ⋯ are i.i.d. Second, from a practical standpoint, we want to weaken the independence assumption so as to cover more applications. For example, in the Monte Carlo estimation technique based on adaptive importance sampling, the samples X1, X2, ⋯ are not necessarily independent. However, as demonstrated on page 6 of [10], it may be shown that the samples satisfy (2). An example of adaptive importance sampling is given in Section 5.8 of [8] on the study of catastrophic failure.
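To make the dependency assumption (2) concrete, here is a minimal sketch in Python; the particular dependence structure is an illustrative choice of ours, not taken from [8] or [10]. It builds a bounded, non-independent sequence whose conditional means all equal μ, so that (1) and (2) hold, and checks numerically that the sample mean still concentrates around μ.

```python
import numpy as np

rng = np.random.default_rng(seed=0)
mu, n = 0.3, 100_000

# Each X_k is uniform on [mu - d_k, mu + d_k], so E[X_k | F_{k-1}] = mu,
# as required by (2).  The half-width d_k is driven by the previous
# sample, so the X_k are bounded in [0, 1] but NOT independent.
x = np.empty(n)
d = min(mu, 1.0 - mu)                 # initial half-width keeps X_1 in [0, 1]
for k in range(n):
    x[k] = rng.uniform(mu - d, mu + d)
    d = min(mu, 1.0 - mu) * (0.5 + 0.5 * x[k])   # dependence on the past

print(abs(x.mean() - mu))             # small: the sample mean concentrates
```

Although each Xk is centered at μ given the past, its spread depends on Xk−1; the samples are therefore far from i.i.d., yet the estimation methods developed below still apply.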
An unbiased estimator for μ can be taken as the sample mean

X̄n = (X1 + X2 + ⋯ + Xn)/n.
Let ɛ ∈ (0, 1) and δ ∈ (0, 1) be pre-specified margin of absolute error and confidence parameter, respectively. Since the probability distributions of X1, X2, ⋯ are usually unknown, one would use an absolute error criterion and seek the sample size, n, as small as possible such that for all values of μ,
| ℙμ{|X̄n − μ| < ɛ} ≥ 1 − δ | (3) |
holds for all distributions having common mean μ. It should be noted that it is difficult to specify a margin of absolute error ɛ, without causing undue conservatism, for controlling the accuracy of estimation if the underlying mean value μ can vary in a wide range. To achieve acceptable accuracy, it is necessary to choose small ɛ for small μ. However, this leads to unnecessarily large sample sizes for large μ.
In addition to the absolute error criterion, a relative error criterion is frequently used for the purpose of error control. Let η ∈ (0, 1) and δ ∈ (0, 1) be pre-specified margin of relative error and confidence parameter, respectively. It is desirable to determine the sample size, n, as small as possible such that for all values of μ,
| ℙμ{|X̄n − μ| < η|μ|} ≥ 1 − δ | (4) |
holds for all distributions having common mean μ. Unfortunately, the determination of the sample size, n, requires a good lower bound for μ, which is usually not available. Otherwise, the required sample size n can be very large, or even infinite.
To overcome the aforementioned difficulties, a mixed criterion may be useful. The reason is that, from a practical point of view, an estimate can be acceptable if either an absolute criterion or a relative criterion is satisfied. More specifically, let ɛ > 0, η ∈ (0, 1) and δ ∈ (0, 1). To control the reliability of estimation, it is desirable to determine the sample size, n, as small as possible, such that for all values of μ,
| ℙμ{|X̄n − μ| < ɛ or |X̄n − μ| < η|μ|} ≥ 1 − δ | (5) |
holds for all distributions having common mean μ.
In the estimation of parameters, a margin of absolute error is usually chosen to be much smaller than the margin of relative error. For instance, in the estimation of a binomial proportion, a margin of relative error η = 0.1 may be good enough for most situations, while a margin of absolute error may be expected to be ɛ = 0.001 or even smaller. In many applications, a practitioner accepting a relative error normally expects a much smaller absolute error, i.e., ɛ ≪ η. On the other hand, one accepting an absolute error ɛ typically tolerates a much larger relative error, i.e., η ≫ ɛ. It will be demonstrated that the required sample size can be substantially reduced by using a mixed error criterion.
Once the measure of precision is chosen, the next task is to determine an appropriate sample size. A conventional method is to determine the sample size by the normal approximation derived from the central limit theorem [5, 7]. Such an approximation method inevitably leads to unknown statistical error, due to the fact that the sample size n must be a finite number [8, 11]. This motivates us to explore rigorous methods for determining sample sizes.
In this paper, we consider the problem of estimating the means of bounded random variables based on a mixed error criterion. The remainder of the paper is organized as follows. In Section 2, we introduce some martingale inequalities. In Section 3, we derive explicit sample size formulae by virtue of concentration inequalities and martingale inequalities. In Section 4, we extend the techniques to the problem of estimating the difference of means of two bounded random variables. Illustrative examples are given in Section 5. Section 6 provides our concluding remarks. Most proofs are given in Appendices.
2 Martingale Inequalities
Under assumption (2), it can be readily shown that {Xk − μ} is actually a sequence of martingale differences (see, e.g., [6, 24] and the references therein). In the sequel, we shall introduce some martingale inequalities which are crucial for the determination of sample sizes to guarantee pre-specified statistical accuracy.
Define the function

φ(ɛ, μ) = (μ + ɛ) ln( μ / (μ + ɛ) ) + (1 − μ − ɛ) ln( (1 − μ) / (1 − μ − ɛ) )

for 0 < ɛ < 1 − μ < 1. Under the assumption that 0 ≤ Xk ≤ 1 almost surely and (2) holds for all k ∈ ℕ, Hoeffding [12] established that

| ℙ{X̄n ≥ μ + ɛ} ≤ exp(n φ(ɛ, μ)) for 0 < ɛ < 1 − μ. | (6) |
To see that this result is due to Hoeffding, see Theorem 1 of his paper [12] and the remarks in the second paragraph on page 18 of that paper. For bounds tighter than Hoeffding's inequality, see the recent paper [4].
To obtain simpler probabilistic inequalities, define the bivariate function

ℳ(ɛ, μ) = − ɛ² / [ 2 (μ + ɛ/3) (1 − μ − ɛ/3) ].

It is shown by Massart [17] that

| φ(ɛ, μ) ≤ ℳ(ɛ, μ) for 0 < ɛ < 1 − μ < 1. | (7) |
By virtue of Hoeffding’s inequality and Massart’s inequality, the following results can be justified.
Theorem 1
Assume that 0 ≤ Xk ≤ 1 almost surely and (2) holds for all k ∈ ℕ. Then,

| ℙ{X̄n ≥ μ + ɛ} ≤ exp(n ℳ(ɛ, μ)) for 0 < ɛ < 3(1 − μ), | (8) |
| ℙ{X̄n ≤ μ − ɛ} ≤ exp(n ℳ(−ɛ, μ)) for 0 < ɛ < 3μ. | (9) |
Proof
To prove Theorem 1, note that
| (10) |
From Hoeffding’s inequality (6) and Massart’s inequality (7), we have
| ℙ{X̄n ≥ μ + ɛ} ≤ exp(n φ(ɛ, μ)) ≤ exp(n ℳ(ɛ, μ)) for 0 < ɛ < 1 − μ. | (11) |
Observe that ℙ{X̄n ≥ z} is a left-continuous function of z and that φ(ɛ, μ) is a continuous function of ɛ. Making use of this observation and (11), we have
| (12) |
Note that
| (13) |
Combining (10), (11), (12) and (13) yields
This proves (8). To show (9), define Yi = 1 − Xi for i = 1, ⋯, n. Define Ȳn = (Y1 + ⋯ + Yn)/n and ν = 1 − μ. Then, Ȳn = 1 − X̄n and 𝔼[Yk ∣ ℱk−1] = ν almost surely for all k ∈ ℕ. Applying (8) to Y1, ⋯, Yn, we have

ℙ{Ȳn ≥ ν + ɛ} ≤ exp(n ℳ(ɛ, ν))

for 0 < ɛ < 3(1 − ν). By the definitions of ν and Ȳn, we can rewrite the above inequality as

ℙ{X̄n ≤ μ − ɛ} ≤ exp(n ℳ(ɛ, 1 − μ))

for 0 < ɛ < 3μ. Observing that ℳ(ɛ, 1 − μ) = ℳ(−ɛ, μ) and that φ(ɛ, 1 − μ) = φ(−ɛ, μ), we have (9). This completes the proof of Theorem 1. □
It should be noted that Theorem 1 extends Massart’s inequality in two aspects. First, the random variables are not required to be i.i.d. Bernoulli random variables. Second, the inequalities hold for wider supports.
3 Explicit Sample Size Formulae
In this section, we shall investigate sample size methods for estimating the mean of a bounded random variable X.
If X1, ⋯, Xn are i.i.d. samples of X bounded in the interval [0, 1], it can be shown by Chebyshev's inequality that (3) holds provided that

| n ≥ 1 / (4 δ ɛ²). | (14) |

Under the assumption that 0 ≤ Xk ≤ 1 and 𝔼[Xk ∣ ℱk−1] = μ almost surely for all k ∈ ℕ, the Azuma–Hoeffding inequality [2, 12] implies that (3) holds for all μ ∈ (0, 1) if

| n ≥ ln(2/δ) / (2ɛ²). | (15) |
Clearly, the ratio of the sample size determined by (14) to that determined by (15) is equal to 1 / (2δ ln(2/δ)), which is substantially greater than 1 for small δ ∈ (0, 1). Despite the significant improvement upon the sample size bound (14), the sample size bound (15) usually leads to a very large sample size, since ɛ is typically a small number in practice. For example, with δ = 0.05, we have n = 1,844,440 and n = 184,443,973 for ɛ = 0.001 and 0.0001, respectively.
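As a quick numerical check of (14) and (15) as stated above, the following sketch evaluates both bounds and reproduces the sample sizes just quoted.

```python
from math import ceil, log

def n_chebyshev(eps, delta):
    # sample size bound (14): n >= 1 / (4 * delta * eps^2)
    return ceil(1.0 / (4.0 * delta * eps**2))

def n_hoeffding(eps, delta):
    # sample size bound (15): n >= ln(2/delta) / (2 * eps^2)
    return ceil(log(2.0 / delta) / (2.0 * eps**2))

delta = 0.05
for eps in (0.001, 0.0001):
    print(eps, n_hoeffding(eps, delta))      # 1,844,440 and 184,443,973
print(n_chebyshev(0.001, delta) / n_hoeffding(0.001, delta))  # about 2.7,
# which agrees with the ratio 1/(2*delta*ln(2/delta)) noted above
```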
To the best of our knowledge, the sample size bound (15) is the tightest one discovered so far under the assumption that 0 ≤ Xk ≤ 1 and 𝔼[Xk ∣ ℱk−1] = μ almost surely for all k ∈ ℕ. In order to reduce the sample complexity, we propose to use the mixed error criterion, which can be viewed as a relaxation of the absolute error criterion. In this direction, we first exploit Chebyshev's inequality to establish the following result.
Theorem 2
If X1, ⋯, Xn are i.i.d. samples of X bounded in the interval [0, 1], then (5) holds for all μ ∈ (0, 1) provided that ɛ ≤ η/2 and that

| n ≥ (η − ɛ) / (ɛ η² δ). | (16) |
See Appendix A for proof.
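Taking the bound (16) at face value, Theorem 2 can also be sanity-checked by simulation: at the sample size (16), the mixed criterion (5) should hold with probability at least 1 − δ for every μ. Below is a minimal sketch for i.i.d. Bernoulli samples with arbitrarily chosen ɛ, η and δ; since Chebyshev's inequality is conservative, the empirical coverage should comfortably exceed 1 − δ.

```python
import numpy as np
from math import ceil

eps, eta, delta = 0.01, 0.1, 0.05                  # note eps <= eta/2
n = ceil((eta - eps) / (eps * eta**2 * delta))     # bound (16); here n = 18,000

rng = np.random.default_rng(1)
for mu in (0.05, 0.1, 0.5, 0.9):
    xbar = rng.binomial(n, mu, size=2000) / n      # 2000 replications of the estimate
    ok = (np.abs(xbar - mu) < eps) | (np.abs(xbar - mu) < eta * mu)
    print(mu, ok.mean())                           # should be >= 1 - delta = 0.95
```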
The sample size formula (16) may be too conservative. To derive tighter sample size formulae, we need to use the martingale inequalities of exponential form presented in the last section. Throughout the remainder of this section, we make the following assumption:
X1, X2, ⋯ are random variables such that a ≤ Xk ≤ b and 𝔼[Xk ∣ ℱk−1] = μ almost surely for all k ∈ ℕ.
In the case that X1, X2, ⋯ are nonnegative random variables, we have the following general result.
Theorem 3
Let 0 ≤ a < b. Assume that 0 < ɛ < b − a and that ηa ≤ ɛ < ηb. Define
and
Then, (5) holds for any μ ∈ (a, b) provided that n > max(N, M).
See Appendix B for the proof. In Theorem 3, our purpose of assuming ɛ < b − a and ηa ≤ ɛ < ηb is to make sure that the absolute error criterion is active for some μ ∈ (a, b) and that the relative error criterion is active for some μ ∈ (a, b). In Table 1, we list sample sizes for b = 1, ɛ = 0.001, η = 0.1, δ = 0.01 and various values of a, where Nmix denotes the sample sizes calculated by virtue of Theorem 3 under the mixed error criterion, and Nabs denotes the sample sizes obtained from the Chernoff–Hoeffding bound. More precisely,
and
| (17) |
where ⌈·⌉ denotes the ceiling function. It can be seen from the table that the sample complexity can be significantly reduced by using a mixed error criterion and our sample size formula.
Table 1.
Table of Sample Sizes
| a | Nmix | Nabs | a | Nmix | Nabs |
|---|---|---|---|---|---|
| 0.001 | 97880 | 499001 | 0.006 | 40765 | 494019 |
| 0.002 | 87393 | 498002 | 0.007 | 34871 | 493025 |
| 0.003 | 76906 | 497005 | 0.008 | 30451 | 492032 |
| 0.004 | 66419 | 496008 | 0.009 | 27013 | 491041 |
| 0.005 | 55932 | 495013 | 0.01 | 24263 | 490050 |
As an immediate application of Theorem 3, we have the following result.
Corollary 1
Let ɛ and η be respectively the margins of absolute and relative error such that
| (18) |
Assume that 0 ≤ Xk ≤ 1 almost surely for all k ∈ ℕ. Then, (5) holds for all μ ∈ (0, 1) provided that
| (19) |
It should be noted that (18) can be readily satisfied in practice, since 0 < ɛ ≪ η < 1 is true in most applications.
An appealing feature of formula (19) is that the resultant sample size is much smaller as compared to that of (15) and (16). Moreover, to apply (19), no approximation is involved and no information of μ is needed. Furthermore, the samples need not be i.i.d.
Under the condition that 0 < ɛ ≪ η ≪ 1, the sample size bound of (19) can be approximated as

n ≈ 2 ln(2/δ) / (ɛη),

which indicates that the required sample size is inversely proportional to the product of the margins of absolute and relative errors. It can be shown that the ratio of the sample size bound of (19) to the bound of (16) converges to 0 as δ decreases to 0, which implies that the bound of (19) is better for small δ.
The comparison of sample size formulae (15) and (19) is shown in Figure 1, where it can be seen that the sample size formula (19) leads to a substantial reduction in sample complexity as compared to (15).
Figure 1.

Comparison of Sample Sizes
To obtain more insight into such a reduction of sample size, we shall investigate the ratio of the sample size given by (15) to that given by (19).
Let ɛ ∈ (0, 1) and η ∈ (0, 1) be such that (18) holds. When no information on μ is available except that μ is known to be bounded in (0, 1), the best known sample size bound is given by (15), which asserts that (3) holds for any μ ∈ (0, 1) provided that (15) holds. According to Corollary 1, we have that (5) holds for any μ ∈ (0, 1) provided that (19) is true. In view of (15) and (19), the ratio of the sample sizes tends to

R(λ) = 1/(4λ), with λ = ɛ/η,

as ɛ → 0 under the restriction that λ is fixed.
From Figure 2, it can be seen that the limiting ratio, R(λ), of the sample sizes is substantially greater than 1 for small λ > 0. For example, if ɛ = 10−5 and η = 0.1, we have λ = 10−4 and R(λ) = 2500. This demonstrates that the required sample size can be significantly reduced by virtue of a mixed error criterion. As mentioned earlier, for small η (e.g., η = 0.1), the requirement (5) can be viewed as a slight relaxation of the requirement (3). Our analysis indicates that such a slight relaxation is well worth the significant reduction in sample complexity.
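The limiting ratio is easy to verify numerically. The sketch below uses the approximation n ≈ 2 ln(2/δ)/(ɛη) for the bound (19) discussed above (our reading of the regime 0 < ɛ ≪ η ≪ 1); under this approximation the logarithmic factors cancel, and the ratio reduces to exactly η/(4ɛ) = 1/(4λ).

```python
from math import log

def n_absolute(eps, delta):
    return log(2.0 / delta) / (2.0 * eps**2)       # bound (15)

def n_mixed_approx(eps, eta, delta):
    return 2.0 * log(2.0 / delta) / (eps * eta)    # approximation of (19)

eps, eta, delta = 1e-5, 0.1, 0.05
lam = eps / eta
print(n_absolute(eps, delta) / n_mixed_approx(eps, eta, delta))  # 2500.0
print(1.0 / (4.0 * lam))                                         # R(lambda) = 2500
```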
Figure 2.

Limit of Ratio of Sample Sizes
In Theorem 3, the random variables X1, X2, ⋯ are assumed to be nonnegative. In light of the fact that, in some situations, the random variables may assume both positive and negative values, we derive an explicit sample size formula for this case in the following result.
Theorem 4
Let a < 0 < b. Assume that 0 < ɛ < b − a and that ɛ < η max(|a|, b). Define
Then, (5) holds for any μ ∈ (a, b) provided that n > M.
See Appendix C for proof.
It should be noted that the advantage of using the mixed error criterion is more pronounced if the interval [a, b] contains 0 and is more asymmetrical about 0. As an illustration, consider the configuration with ɛ = 0.1, η = 0.1 and δ = 0.05. Assume that the lower bound, a, of the interval is fixed at −1 and the upper bound, b, of the interval is treated as a parameter. From formula (15), we know that the sample size required to ensure (3) for any μ ∈ [a, b] can be obtained from (17).
According to Theorem 4, the sample size required to ensure (5) for any μ ∈ [a, b] can be calculated from the definition of M.
Since a is fixed, the ratio of the two sample sizes is a function of b. This function is shown in Figure 3, from which it can be seen that the larger b is, the greater the reduction of sample size that can be achieved by virtue of the mixed error criterion.
Figure 3.

Ratio of Sample Sizes (ɛ = 0.1, η = 0.1, δ = 0.05 and a = −1)
4 Estimating the Difference of Two Population Means
Our method can be extended to the estimation of the difference of the means of bounded random variables. Let Y and Z be two bounded random variables with means μY = 𝔼[Y] and μZ = 𝔼[Z], respectively. Let X = Y − Z and μ = μY − μZ. Let Y1, ⋯, Yn be i.i.d. samples of Y. Let Z1, ⋯, Zn be i.i.d. samples of Z. Assume that the samples of Y and Z are independent. Let Xi = Yi − Zi for i = 1, 2, ⋯, n. Then, X1, ⋯, Xn are i.i.d. samples of X. Clearly, X is a bounded random variable. So are X1, ⋯, Xn. Define

X̄n = (X1 + ⋯ + Xn)/n = Ȳn − Z̄n.

Then, X̄n is an estimator for μ = μY − μZ. We can apply the sample size methods proposed in Section 3 to determine n such that (5) holds.
To illustrate, consider an example with Y bounded in [0, 10] and Z bounded in [0, 1]. Assume that ɛ = 0.1, η = 0.1 and δ = 0.05. Since X = Y − Z is a random variable bounded in the interval [−1, 10], from the discussion in the last section, it can be seen that Theorem 4 can be employed to obtain the minimum sample size as 13,408.
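The construction is straightforward to carry out, as the following sketch shows. The distributions of Y and Z are hypothetical placeholders chosen only so that the code runs; the method itself requires nothing beyond the boundedness of Y and Z.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 13_408                       # sample size from the example above

# Hypothetical bounded variables: Y in [0, 10], Z in [0, 1].
y = 10.0 * rng.beta(2.0, 5.0, size=n)
z = rng.uniform(0.0, 1.0, size=n)

x = y - z                        # X = Y - Z is bounded in [-1, 10]
print(x.mean())                  # estimator for mu = mu_Y - mu_Z
```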
5 Illustrations
In this section, we shall illustrate the applications of our sample size formulae by examples in control and telecommunication engineering.
An extremely important problem of control engineering is to determine the probability that a system will fail to satisfy pre-specified requirements in an uncertain environment. This critical issue has been extensively studied in an area referred to as probabilistic robustness analysis (see, e.g., [14, 15, 21] and the references therein). In general, there is no effective deterministic method for computing such a failure probability except the Monte Carlo estimation method. To estimate the probability of failure, the uncertain environment is modeled by a random variable Δ, which may be scalar or matrix-valued. Hence, a Bernoulli random variable X can be defined as a function of Δ such that X assumes the value 1 if the system associated with Δ fails to satisfy the pre-specified requirements and assumes the value 0 otherwise. Clearly, the failure probability p is equal to the mean of X. That is, p = 𝔼[X]. For estimating the failure probability p, randomized algorithms have been implemented in the widely used software package RACT [22], in which an absolute error criterion is used for estimating p. Specifically, for a priori ɛ, δ ∈ (0, 1), the objective is to obtain an estimator p̂ such that ℙ{|p̂ − p| < ɛ} ≥ 1 − δ holds regardless of the value of p ∈ (0, 1). The estimator p̂ is defined as

p̂ = (X(Δ1) + ⋯ + X(ΔN))/N,
where N is the sample size and Δ1, Δ2, ⋯, ΔN are i.i.d. samples of Δ. In most situations, there is no useful information about the range of the failure probability p, due to the complexity of the system. Therefore, the determination of the sample size N should not depend on the range of p. It is well known that, to make ℙ{|p̂ − p| < ɛ} ≥ 1 − δ for any p ∈ (0, 1), an approximate sample size based on the normal approximation is

| N ≈ 𝒵δ/2² / (4ɛ²), | (20) |

where 𝒵δ/2 is the critical value such that Φ(𝒵δ/2) = 1 − δ/2, with Φ denoting the cumulative distribution function of the standard normal distribution.
The approximate sample size formula (20) will inevitably lead to unknown statistical error, since the formula (20) is based on the central limit theorem, which is an asymptotic result. In view of this drawback, control theorists and practitioners are reluctant to use the approximate formula (20). To rigorously control the statistical accuracy of the estimation, the Chernoff–Hoeffding bound is most frequently used in control engineering for the determination of sample size. To ensure that ℙ{|p̂ − p| < ɛ} ≥ 1 − δ holds for any p ∈ (0, 1), it suffices to take the sample size

| N = ⌈ ln(2/δ) / (2ɛ²) ⌉. | (21) |
The ratio of the sample size (21) to the sample size (20) is approximately equal to 2 ln(2/δ) / 𝒵δ/2², which tends to 1 as δ → 0. It can be shown that this ratio does not exceed 3/2 for sufficiently small δ. This indicates that in most situations, the ratio of the rigorous sample size (21) to the approximate sample size (20) does not exceed 3/2. From this analysis, it can be seen that it is worthwhile to obtain rigorous control of the statistical accuracy by using the sample size (21), at the price of increasing the computational complexity by up to 50%. This explains why the sample size (21) is frequently used in control engineering. As a matter of fact, the sample size formula (21) is implemented in RACT to estimate the failure probability.
In control engineering, the absolute error criterion is widely used. Recall that in Section 3, we have shown that a much smaller sample size is sufficient if a mixed error criterion is used. More specifically, the sample size can be significantly reduced by introducing η ∈ (0, 1) and relaxing the requirement to

ℙ{|p̂ − p| < ɛ or |p̂ − p| < ηp} ≥ 1 − δ.
In many situations, the margin of absolute error ɛ needs to be very small (e.g., ɛ ≪ 0.1), since p is usually a very small number. However, the margin of relative error η does not need to be extremely small. For example, η = 0.1 may be sufficient for most cases.
As a concrete illustrative example, consider an uncertain dynamic system described by the differential equation
where u(t) is the input, y(t) is the output, and q1, q2, q3 are uncertain parameters. Assume that the tuple (q1, q2, q3) is uniformly distributed over the domain
According to control theory, the system is said to be stable if the output is bounded for any bounded input. It can be shown that such a stability criterion is satisfied if and only if all the roots of the polynomial equation
| (22) |
with respect to s in the field of complex numbers have negative real parts (see, e.g., Section 3.6 of [9] for an explanation of the concept of stability). Since the roots of equation (22) are functions of the random variables q1, q2 and q3, a Bernoulli random variable X can be defined in terms of q1, q2 and q3 such that X assumes the value 0 if all the roots have negative real parts, and otherwise X assumes the value 1. For this particular example, we are interested in estimating the probability that the system is unstable. This amounts to estimating the probability that the Bernoulli random variable X assumes the value 1. Since X is bounded in the interval [0, 1], our sample size formula can be useful for the planning of the Monte Carlo experiment. Let δ = 10−3. If the margin of absolute error is ɛ = 10−3, then the sample size is obtained by (21) as 3,800,452. If we use a mixed criterion with η = 0.1 and the same ɛ and δ, then the sample size can be computed by (19) as 155,463, which is only about 4% of the sample size required under the absolute error criterion. The estimate of the probability of instability is obtained as 0.5403.
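A sketch of this Monte Carlo experiment is given below. Since the differential equation, the uncertainty domain and the polynomial (22) do not survive in this version of the manuscript, the third-order characteristic polynomial and the uncertainty box used in the code are hypothetical stand-ins, included only to illustrate the procedure; in particular, the printed estimate refers to the hypothetical system, not to the value 0.5403 reported above.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 155_463                      # mixed-criterion sample size computed by (19)

count_unstable = 0
for _ in range(n):
    # Hypothetical uncertainty box for (q1, q2, q3); the paper's actual
    # domain is not shown in this version.
    q1, q2, q3 = rng.uniform([0.5, 0.5, 0.5], [2.5, 2.5, 2.5])
    roots = np.roots([1.0, q3, q2, q1])    # roots of s^3 + q3*s^2 + q2*s + q1
    if np.any(roots.real >= 0.0):          # unstable: a root outside the open LHP
        count_unstable += 1

print(count_unstable / n)        # Monte Carlo estimate of the instability probability
```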
In wireless data communications, a frequent problem is to evaluate the bit error rate of a data transmission scheme. The bit error rate is the probability that a bit is transmitted incorrectly. In many situations, due to the complexity of the transmission system, the only tool for obtaining the bit error rate is the Monte Carlo simulation method. For example, there is no exact analytical method for computing the bit error rate of a wireless data transmission system employing multiple antennas and space-time block codes. The principle of this transmission system is proposed in [1] (see, e.g., [16] and the references therein for a comprehensive discussion). The wireless data transmission process can be modeled by a sequence of Bernoulli random variables X1, X2, ⋯, where Xi assumes the value 0 or 1 according to whether the i-th bit is transmitted correctly or incorrectly. If X1, X2, ⋯ are independent and identically distributed Bernoulli random variables with the same mean μ ∈ (0, 1), then the bit error rate is μ and its estimator can be taken as X̄n with n being sufficiently large. However, as a consequence of the application of the space-time block codes, the random variables X1, X2, ⋯ are not independent. This gives rise to the following question:
Is it possible to estimate the bit error rate without the independence of the random variables X1, X2, ⋯?
In a wireless data transmission system employing multiple antennas and space-time block codes, the expectation of Xk conditioned upon Xℓ, ℓ < k, is a constant μ with respect to k, since the noise process is stationary and the input data can be treated as a Bernoulli process [1, 16]. This implies that it is reasonable to treat X1, X2, ⋯ as a sequence satisfying the martingale-type condition (2). Hence, despite the lack of independence, the bit error rate can be estimated by X̄n. To control the statistical error, the sample size method proposed in the previous section can be applied to determine the appropriate value of n.
6 Concluding Remarks
In this paper, we have considered the problem of estimating the means of bounded random variables. We have illustrated that in many applications, it may be more appropriate to use a mixed error criterion for quantifying the reliability of estimation. We have demonstrated that, as a consequence of using the mixed error criterion, the sample complexity can be substantially reduced. By virtue of probabilistic inequalities, we have developed explicit sample size formulae for the purpose of controlling the statistical error of estimation. We have attempted to make our results generally applicable by eliminating the need for the i.i.d. assumption on the samples and for knowledge of the form of the underlying distributions.
Research highlights.
A rigorous sample size method for estimating the mean of a bounded random variable.
It requires neither knowledge of the underlying distribution nor an i.i.d. condition on the samples.
It involves no approximation.
Sample complexity can be significantly reduced by using a mixed error criterion.
Explicit sample size formulae to ensure the statistical accuracy of estimation.
Acknowledgments
The author would like to thank the Associate Editor and referees for their time, effort and comments in reviewing this paper.
This research is supported in part by NIH/NCI Grants No. 1 P01 CA116676, P30 CA138292-01, and 5 P50 CA128613.
A Proof of Theorem 2
Note that

| {|X̄n − μ| ≥ ɛ and |X̄n − μ| ≥ ημ} = {|X̄n − μ| ≥ max(ɛ, ημ)}. | (23) |

Since X1, ⋯, Xn are i.i.d. samples of X, it follows from (23) and Chebyshev's inequality that

| ℙμ{|X̄n − μ| ≥ max(ɛ, ημ)} ≤ σ² / (n [max(ɛ, ημ)]²), | (24) |

where σ² denotes the variance of X. Since 0 ≤ X ≤ 1 almost surely and 𝔼[X] = μ, it must be true that

| σ² ≤ μ(1 − μ). | (25) |

Combining (24) and (25) yields

| ℙμ{|X̄n − μ| ≥ max(ɛ, ημ)} ≤ Q(μ) / n, | (26) |

where

Q(μ) = μ(1 − μ) / [max(ɛ, ημ)]²

for μ ∈ (0, 1). Now we investigate the maximum of Q(μ) for μ ∈ (0, 1), with λ = ɛ/η ≤ 1/2, by considering two cases as follows.

Case (i): 0 ≤ μ ≤ λ.

Case (ii): λ < μ ≤ 1.

In Case (i), we have max(ɛ, ημ) = ɛ and

| Q(μ) = μ(1 − μ)/ɛ² ≤ λ(1 − λ)/ɛ², | (27) |

where we have used the fact that μ(1 − μ) is increasing with respect to μ ∈ (0, 1/2]. In Case (ii), we have λ < μ ≤ 1 and

| Q(μ) = (1 − μ)/(η²μ) < (1 − λ)/(η²λ). | (28) |

In view of (27) and (28), we have

| Q(μ) ≤ λ(1 − λ)/ɛ² = (1 − λ)/(η²λ) = (η − ɛ)/(ɛη²) for μ ∈ (0, 1). | (29) |

Making use of (26) and (29), we have

ℙμ{|X̄n − μ| ≥ max(ɛ, ημ)} ≤ (η − ɛ)/(n ɛ η²) ≤ δ whenever n ≥ (η − ɛ)/(ɛη²δ),

from which the theorem immediately follows, since the event in (23) is the complement of the event in (5). This completes the proof of Theorem 2.
Throughout the proofs of Theorems 3 and 4, we shall use the following definitions. Let
Let denote the probability measure associated with θ. Define
where X1, X2, ⋯ are random variables such that a ≤ Xk ≤ b and 𝔼[Xk ∣ ℱk−1] = μ almost surely for all k ∈ ℕ.
B Proof of Theorem 3
To prove the theorem, we need some preliminary results.
Lemma 1
Let ζ ∈ (0, 1). Define
Then, for θ ∈ (0, 1). Moreover, is increasing with respect to and non-increasing with respect to
Proof
For , we have 0 < ζ < 3(1 − θ), it follows from (8) that for . For , we have θ + ζ > 1 and consequently,
Thus, we have shown that for θ ∈ (0, 1). To establish the monotonicity of , it is sufficient to observe that
which is negative for any and positive for any . □
Lemma 2
Let ζ ∈ (0, 1). Define
Then, for θ ∈ (0, 1). Moreover, is non-decreasing with respect to and decreasing with respect to .
Proof
For , we have 0 < ζ < 3θ, it follows from (9) that for . For , we have θ − ζ < 0 and consequently,
Thus, we have shown that for θ ∈ (0, 1). To establish the monotonicity of , it is sufficient to observe that
which is negative for any and positive for any . □
Lemma 3
Let . Define
and
Then, the following assertions hold.
for θ ∈ (0, 1).
If ν* > 0, then is increasing with respect to θ ∈ (0, ν*) and non-increasing with respect to θ ∈ (ν*, 1).
If ν* ≤ 0, then is non-increasing with respect to θ ∈ (0, 1)
Proof
To show assertion (I), note that θ + η(θ − c) > 1 for θ ∈ [r*, 1). Consequently,
for θ ∈ [r*, 1). On the other hand, 0 < η (θ − c) < 3(1 − θ) for θ ∈ (0, r*). Hence, it follows from inequality (8) that
where
with Clearly, 0 < r* < 1 and 0 < ρ(θ) < 1 for θ ∈ (0, r*). This proves assertion (I).
To show assertions (II) and (III), consider the derivative of g(θ) with respect to θ. Let and . Then, ρ(θ) = x − α and
| (30) |
Since θ − c > 0 for θ ∈ (0, 1), it follows from (30) that g′(θ) ≥ 0 if and only if
which is equivalent to θ ≥ ν*. As a consequence of c < 1, we have ν* < r*. It follows that assertions (II) and (III) hold. □
Lemma 4
Define
Then, for all μ ∈ (a, b) provided that n > N.
Proof
For simplicity of notations, define
and for θ ∈ (0, 1). Then, . It suffices to show the lemma for the following three cases.
Case (1):
Case (2):
Case (3):
First, consider Case (1). Clearly, as a consequence of , we have p* ≥ θ*. As a consequence of , we have θ* ≥ ν*. Therefore, it follows from that p* ≥ θ* ≥ ν*. Since , it follows from Lemma 1 that is increasing for θ ∈ (0, θ*]. Hence,
for θ ∈ (0, θ*]. Since θ* ≥ ν*, it follows from Lemma 3 that is decreasing for θ ∈ [θ*, 1). Hence,
for θ ∈ [θ*, 1). Therefore, for θ ∈ (0, 1) provided that . Observing that
we have that for θ ∈ (0, 1) provided that
Next, consider Case (2). As a consequence of , we have θ* < min(p*, ν*). Since , it follows from Lemma 1 that is increasing for for θ ∈ (0, θ*]. Hence,
for θ ∈ (0, θ*]. Since θ* < ν*, it follows from Lemma 3 that is increasing for θ ∈ [θ*, ν*) and is decreasing for θ ∈ [ν*, 1). Hence,
for θ ∈ [θ*, 1). Therefore, for θ ∈ (0, 1) provided that . Since , it follows that for θ ∈ (0, 1) provided that
where we have used the definitions of ν* and c.
Finally, consider Case (3). In this case, we have . Therefore, provided that
This completes the proof of the lemma. □
Lemma 5
Let c ≤ 0. Define
and
Then, the following assertions hold.
for θ ∈ (0, 1).
If ν* ≤ 1, then is non-decreasing with respect to θ ∈ (0, ν*) and decreasing with respect to θ ∈ (ν*, 1).
If ν* > 1, then is non-decreasing with respect to θ ∈ (0, 1).
Proof
Clearly, r* ≥ 0. To show assertion (I), note that θ − η(θ − c) < 0 for θ ∈ (0, r*). It follows that
for θ ∈ (0, r*). On the other hand, 0 < η (θ − c) < 3θ for θ ∈ (r*, 1), it follows from (9) that
where
with . Clearly, ρ(θ) < θ < 1. Since θ > r*, we have ρ(θ) > 0. Hence, 0 < ρ(θ) < 1 for θ ∈ (r*, 1). This establishes assertion (I).
To show assertions (II) and (III), consider the derivative of g(θ) with respect to θ. Tedious computation shows that
| (31) |
Since θ − c > 0 for θ ∈ (0, 1), it follows from (31) that if and only if , which is equivalent to θ ≥ ν*. Direct computation shows that 0 < r* < ν*. It follows that assertions (II) and (III) hold. □
Lemma 6
Define
Then, for all μ ∈ (a, b) provided that n > M.
Proof
For simplicity of notations, define
It can be checked that 0 < θ* < 1. Define for θ ∈ (0, 1). Then, . We need to show the lemma for the following five cases.
Case (i):
Case (ii): and .
Case (iii): and .
Case (iv):
Case (v): Else.
First, consider Case (i). Clearly, as a consequence of , we have q* ≥ θ*. As a consequence of , we have θ* ≥ ν*. It follows from that q* ≥ θ* ≥ ν*. Since q* ≥ θ*, it follows from Lemma 2 that is non-decreasing for θ ∈ (0, θ*]. Hence, for θ ∈ (0, θ*].
Since θ* ≥ ν*, it follows from Lemma 5 that is decreasing for θ ∈ [θ*, 1). Hence,
for θ ∈ [θ*, 1). Hence, for θ ∈ (0, 1) provided that . Observing that , we have that for θ ∈ (0, 1) provided that
Second, consider Case (ii). As a consequence of , we have ν* < 1. Making use of , we have θ* < min(q*, ν*). Since q* > θ*, it follows from Lemma 2 that is non-decreasing for θ ∈ (0, θ*]. Hence, for θ ∈ (0, θ*],
| (32) |
Since θ* < ν*, it follows from Lemma 5 that is increasing for θ ∈ [θ*, ν*) and is decreasing for θ ∈ [ν*, 1). Hence,
| (33) |
for θ ∈ [θ*, 1). Note that
| (34) |
In view of (32), (33) and (34), we have that for θ ∈ (0, 1) provided that Observing that
we have that for θ ∈ (0, 1) provided that the corresponding sample size
where we have used the definitions of ν* and c.
Third, consider Case (iii). As a consequence of
we have r* < 1 < ν* and q* ≥ θ*. Since q* ≥ θ*, it follows from Lemma 2 that is non-decreasing for θ ∈ (0, θ*]. Hence, for θ ∈ (0, θ*],
Since ν* > 1, it follows from Lemma 5 that is non-decreasing for θ ∈ [θ*, 1). Hence,
for θ ∈ [θ*, 1). Note that . Hence, for θ ∈ (0, 1) provided that . Since , it follows that for θ ∈ (0, 1) provided that the corresponding sample size
Now, consider Case (iv). As a consequence of we have r* ≥ 1, which implies that Q−(θ) = 0 for θ ∈ (0, 1). Hence, for θ ∈ (0, 1) for any sample size n ≥ 1.
Finally, consider Case (v). In this case, we have . Therefore, provided that
This completes the proof of the lemma. □
Finally, Theorem 3 can be established by making use of Lemmas 4 and 6.
C Proof of Theorem 4
To prove the theorem, we need some preliminary results.
Lemma 7
Let c ∈ (0, 1). Define and
Then, for θ ∈ (c, 1). Moreover, is non-increasing with respect to θ ∈ (c, 1).
Proof
By the definition of r*, it can be checked that c < r* < 1. Note that θ + η(θ − c) > 1 for θ ∈ (r*, 1). Hence, for θ ∈ (r*, 1). On the other hand, 0 < η(θ − c) < 3(1 − θ) for θ ∈ (c, r*). Thus, it follows from inequality (8) that
where
with . It can be verified that 0 < ρ(θ) < 1 for θ ∈ (c, r*). This shows that for θ ∈ (c, 1).
To show that is non-increasing with respect to θ ∈ (c, 1), consider the derivative of g(θ) with respect to θ. Tedious computation shows that the derivative of g(θ) is given as
| (35) |
We claim that the derivative g′(θ) is positive for θ ∈ (c, 1). In view of (35) and the fact that θ −c > 0 for θ ∈ (c, 1), it is sufficient to show that for θ ∈ (c, 1) in the case that and the case that . By the definition of ρ(θ), we have
| (36) |
for θ ∈ (c, 1). In the case of , using the lower bound of ρ(θ) given by (36), we have for θ ∈ (c, 1). In the case of , using the upper bound of ρ(θ) given by (36), we have
for θ ∈ (c, 1) and η ∈ (0, 1). Thus, we have shown the claim that g′(θ) > 0 in all cases. This implies that is non-increasing with respect to θ ∈ (c, 1). □
Lemma 8
Let c ∈ (0, 1). Define for θ ∈ (c, 1). Then, for θ ∈ (c, 1). Moreover, is decreasing with respect to θ ∈ (c, 1).
Proof
Clearly, 0 < η(θ − c) < 3θ for θ ∈ (c, 1). It follows from inequality (9) that
where
with . Clearly, 0 < ρ(θ) < 1 for θ ∈ (c, 1). This shows that for θ ∈ (c, 1).
To show that is decreasing with respect to θ ∈ (c, 1), consider the derivative of g(θ) with respect to θ. Tedious computation shows that the derivative of g(θ) is given as
| (37) |
We claim that the derivative g′(θ) is positive for θ ∈ (c, 1). In view of (37) and the fact that θ −c > 0 for θ ∈ (c, 1), it is sufficient to show that for θ ∈ (c, 1) in the case that and the case that . By the definition of ρ(θ), we have
| (38) |
for θ ∈ (c, 1). In the case of , using the upper bound of ρ(θ) given by (38), we have for θ ∈ (c, 1). In the case of , using the upper bound of ρ(θ) given by (38), we have
Thus, we have established the claim that g′(θ) > 0 for θ ∈ (c, 1). It follows that is decreasing with respect to θ ∈ (c, 1). This completes the proof of the lemma. □
Lemma 9
Let c ∈ (0, 1). Define for θ ∈ (0, c). Then, for θ ∈ (0, c). Moreover, is increasing with respect to θ ∈ (0, c).
Proof
Note that 0 < η (c − θ) < 3(1 − θ) for θ ∈ (0, c). It follows from inequality (8) that
where
with . Clearly, ρ(θ) > 0 for θ ∈ (0, c). Since c ∈ (0, 1) and η ∈ (0, 3), we have for θ ∈ (0, c). Hence, we have established that for θ ∈ (0, c).
To show that is increasing with respect to θ ∈ (0, c), consider the derivative of g(θ) with respect to θ. Tedious computation shows that the derivative of g(θ) is given as
| (39) |
We claim that g′(θ) is negative for θ ∈ (0, c). In view of (39) and the fact that θ −c < 0 for θ ∈ (0, c), it suffices to show for the case that and the case that . Note that for θ ∈ (0, c). In the case of , we have
In the case of , we have
Therefore, we have established the claim that g′(θ) < 0 for θ ∈ (0, c). This implies that is increasing with respect to θ ∈ (0, c). The proof of the lemma is thus completed. □
Lemma 10
Let c ∈ (0, 1). Define and
Then, for θ ∈ (0, c). Moreover, is non-decreasing with respect to θ ∈ (0, c).
Proof
Clearly, 0 < r* < c. Note that θ + η (θ − c) < 0 for θ ∈ (0, r*). Hence, for θ ∈ (0, r*). On the other hand, it can be checked that 0 < η(c − θ) < 3θ for θ ∈ (r*, c). It follows from inequality (9) that
for θ ∈ (r*, c), where
with . It can be verified that ρ(θ) > 0 for θ ∈ (r*, c). Since c ∈ (0, 1) and η ∈ (0, 3), we have for θ ∈ (r*, c). Thus, we have shown that for θ ∈ (0, c).
To show that is non-decreasing with respect to θ ∈ (0, c), consider the derivative of g(θ) with respect to θ. Tedious computation shows that the derivative of g(θ) is given by
| (40) |
Note that for θ ∈ (0, c). It follows that
| (41) |
for and θ ∈ (0, c). Moreover,
| (42) |
for and θ ∈ (0, c). Making use of (40), (41) and (42), we have g′(θ) < 0 for θ ∈ (r*, c). So, we have established that is non-decreasing with respect to θ ∈ (0, c). This completes the proof of the lemma. □
Lemma 11
Assume that a < 0 < b. Define
| (43) |
Then, for any μ ∈ (0, b) provided that n > N.
Proof
For simplicity of notations, define ,
Define functions
for θ ∈ (c, 1). For μ ∈ (0, b), putting , we have c < θ < 1 and
To prove the lemma, it suffices to consider the following three cases.
Case (I): .
Case (II): .
Case (III): .
First, consider Case (I). As a consequence of , we have θ* < p*. Since p* > θ*, it follows from Lemma 1 that is increasing for θ ∈ (0, θ*]. Moreover, according to Lemma 7, we have that is non-increasing for θ ∈ [θ*, 1). It follows that
| (44) |
and that
| (45) |
Observing that q* > θ* and making use of Lemma 2, we have that is non-decreasing for θ ∈ (c, θ*]. According to Lemma 8, we have that is decreasing for θ ∈ [θ*, 1). It follows that
| (46) |
and that
| (47) |
Combining (44), (45), (46) and (47), we have
for θ ∈ (c, 1). Observing that
and that
we have and consequently,
for θ ∈ (c, 1). It follows that
for μ ∈ (0, b). This implies that provided that the corresponding sample size
Next, consider Case (II). As a consequence of , we have q* < c. Clearly, p* < q* < c < θ*. By Lemma 1, is non-increasing for θ ∈ (c, θ*]. By Lemma 7, is non-increasing for θ ∈ [θ*, 1). It follows that
| (48) |
Moreover,
| (49) |
Similarly, according to Lemma 2, is decreasing for θ ∈ (c, θ*]. By Lemma 8, is decreasing for θ ∈ [θ*, 1). It follows that
| (50) |
Moreover,
| (51) |
Making use of (48), (49), (50) and (51), we have
for θ ∈ (c, 1). Observing that
and that
we have and consequently,
for θ ∈ (c, 1). It follows that
for μ ∈ (0, b). This implies that provided that the corresponding sample size
Finally, consider Case (III). In this case, we have . Therefore, provided that
This completes the proof of the lemma. □
Lemma 12
Assume that a < 0 < b. Define
| (52) |
Then, for any μ ∈ (a, 0) provided that n > M.
Proof
For simplicity of notations, define ,
Define functions
for θ ∈ (0, c). For μ ∈ (a, 0), putting , we have 0 < θ < c and
To prove the lemma, it suffices to consider the following three cases.
Case (I): .
Case (II): .
Case (III): .
First, consider Case (I). As a consequence of , we have q* < ϑ*. Since p* < q* < ϑ*, it follows from Lemma 9 that is increasing for θ ∈ (0, ϑ*]. According to Lemma 1, is non-increasing for θ ∈ [ϑ*, c). Hence,
| (53) |
Moreover,
| (54) |
Since q* < ϑ*, it follows from Lemma 10 that is non-decreasing for θ ∈ (0, ϑ*]. From Lemma 2, is decreasing for θ ∈ [ϑ*, c). Hence,
| (55) |
Moreover,
| (56) |
Making use of (53), (54), (55) and (56), we have
for θ ∈ (0, c). Observing that
and that
we have and consequently,
for θ ∈ (0, c). It follows that
for μ ∈ (a, 0). This implies that provided that the corresponding sample size
Next, consider Case (II). As a consequence of , we have p* > c. Clearly, q* > p* > c > ϑ*. By Lemma 9, is increasing for θ ∈ (0, ϑ*]. By Lemma 1, is increasing for θ ∈ [ϑ*, c). Hence,
| (57) |
Moreover,
| (58) |
Similarly, by Lemma 10, is non-decreasing for θ ∈ (0, ϑ*]. By Lemma 2, is non-decreasing for θ ∈ [ϑ*, c). Hence,
| (59) |
Moreover,
| (60) |
Making use of (57), (58), (59) and (60), we have
for θ ∈ (0, c). Observing that
and that
we have and consequently,
for θ ∈ (0, c). It follows that
for μ ∈ (a, 0). This implies that provided that the corresponding sample size
Finally, consider Case (III). In this case, we have . Therefore, provided that
This completes the proof of the lemma. □
Lemma 13
Let N and M be defined by (43) and (52), respectively. Then,
| (61) |
Proof
To prove the lemma, it suffices to consider two cases as follows.
Case (A):
Case (B):
In Case (A), as a consequence of , we have
and
Therefore, in Case (A), we have
| (62) |
In Case (B), as a consequence of , we have
and
Therefore, in Case (B), we have
| (63) |
Combining (62) and (63), we have
which implies (61). This completes the proof of the lemma. □
Finally, Theorem 4 can be established by making use of Lemmas 11, 12 and 13.
Contributor Information
Dr. Zhengjia Chen, Email: zchen38@emory.edu, Department of Biostatistics and Bioinformatics, Emory University, Atlanta, GA 30322.
Dr. Xinjia Chen, Email: xinjia_chen@subr.edu, Department of Electrical Engineering, Southern University at Baton Rouge, LA 70813.
References
- 1. Alamouti SM. A simple transmit diversity technique for wireless communications. IEEE Journal on Selected Areas in Communications. 1998;16(8):1451–1458.
- 2. Azuma K. Weighted sums of certain dependent random variables. Tôhoku Mathematical Journal. 1967;19(3):357–367.
- 3. Arellano M, Pakkala S, Langston A, Tighiouart M, Pan L, Chen Z, Heffner LT, Lonial S, Winton E, Khoury HJ. Early clearance of peripheral blood blasts predicts response to induction chemotherapy in acute myeloid leukemia. Cancer. 2012;118(21):5278–5282. doi: 10.1002/cncr.27494.
- 4. Bentkus V. On Hoeffding's inequalities. The Annals of Probability. 2004;32(2):1650–1673.
- 5. Chow SC, Shao J, Wang H. Sample Size Calculations in Clinical Trials. 2nd ed. Chapman & Hall; 2008.
- 6. Doob J. Stochastic Processes. Wiley; 1953.
- 7. Desu MM, Raghavarao D. Sample Size Methodology. Academic Press; 1990.
- 8. Fishman GS. Monte Carlo – Concepts, Algorithms and Applications. Springer-Verlag; 1996.
- 9. Franklin GF, Powell JD, Emami-Naeini A. Feedback Control of Dynamic Systems. Pearson Higher Education, Inc.; 2014.
- 10. Gajek L, Niemiro W, Pokarowski P. Optimal Monte Carlo integration with fixed relative precision. Journal of Complexity. 2013;29:4–26.
- 11. Hampel F. Is statistics too difficult? The Canadian Journal of Statistics. 1998;26:497–513.
- 12. Hoeffding W. Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association. 1963;58:13–30.
- 13. Janik M, Hartlage G, Alexopoulos N, Mirzoyev Z, McLean DS, Arepalli CD, Chen Z, Stillman AE, Raggi P. Epicardial adipose tissue volume and coronary artery calcium to predict myocardial ischemia on positron emission tomography-computed tomography studies. Journal of Nuclear Cardiology. 2010;17(5):841–847. doi: 10.1007/s12350-010-9235-1.
- 14. Khargonekar P, Tikku A. Randomized algorithms for robust control analysis and synthesis have polynomial complexity. Proceedings of the IEEE Conference on Decision and Control; 1996.
- 15. Lagoa CM, Barmish BR. Distributionally robust Monte Carlo simulation: a tutorial survey. Proceedings of the IFAC World Congress; 2002.
- 16. Larsson E, Stoica P. Space-Time Block Coding for Wireless Communications. Cambridge University Press; 2003.
- 17. Massart P. The tight constant in the Dvoretzky–Kiefer–Wolfowitz inequality. The Annals of Probability. 1990;18:1269–1283.
- 18. Mitzenmacher M, Upfal E. Probability and Computing. Cambridge University Press; 2005.
- 19. Motwani R, Raghavan P. Randomized Algorithms. Cambridge University Press; 1995.
- 20. Proakis JG. Digital Communications. McGraw-Hill; 2000.
- 21. Tempo R, Calafiore G, Dabbene F. Randomized Algorithms for Analysis and Control of Uncertain Systems. Springer; 2005.
- 22. Tremba A, Calafiore G, Dabbene F, Gryazina E, Polyak BT, Shcherbakov PS, Tempo R. RACT: Randomized Algorithms Control Toolbox for MATLAB. Proceedings of the IFAC World Congress; Seoul, Korea; July 2008.
- 23. Wang D, Müller S, Amin AR, Huang D, Su L, Hu Z, Rahman MA, Nannapaneni S, Koenig L, Chen Z, Tighiouart M, Shin DM, Chen ZG. The pivotal role of integrin β1 in metastasis of head and neck squamous cell carcinoma. Clinical Cancer Research. 2012;18(17):4589–4599. doi: 10.1158/1078-0432.CCR-11-3127.
- 24. Williams D. Probability with Martingales. Cambridge University Press; 1991.
