NONPARAMETRIC GOODNESS-OF-FIT TESTS FOR UNIFORM STOCHASTIC ORDERING

Chuan-Fa Tang; Dewei Wang; Joshua M Tebbs

doi:10.1214/16-AOS1535

Author manuscript; available in PMC: 2018 Dec 15.

Published in final edited form as: Ann Stat. 2017 Dec 15;45(6):2565–2589. doi: 10.1214/16-AOS1535

PMCID: PMC5771311 NIHMSID: NIHMS853643 PMID: 29353943

Abstract

We propose L^p distance-based goodness-of-fit (GOF) tests for uniform stochastic ordering with two continuous distributions F and G, both of which are unknown. Our tests are motivated by the fact that when F and G are uniformly stochastically ordered, the ordinal dominance curve R = FG⁻¹ is star-shaped. We derive asymptotic distributions and prove that our testing procedure has a unique least favorable configuration of F and G for p ∈ [1,∞]. We use simulation to assess finite-sample performance and demonstrate that a modified, one-sample version of our procedure (e.g., with G known) is more powerful than the one-sample GOF test suggested by Arcones and Samaniego (2000, Annals of Statistics). We also discuss sample size determination. We illustrate our methods using data from a pharmacology study evaluating the effects of administering caffeine to prematurely born infants.

Keywords and phrases: Brownian bridge, Hazard rate ordering, Least favorable distribution, Order-restricted inference, Ordinal dominance curve, Star-shaped ordering

MSC 2010 subject classifications: Primary 62G10, secondary 62F30

1. Introduction

Suppose X and Y are continuous random variables with distribution functions F and G, respectively. In many applications, it is of interest to compare F and G. The ordinal dominance curve (ODC), which plots (G(t), F(t)) for −∞ ≤ t≤∞, is a useful graphical tool that facilitates such a comparison (Bamber, 1975; Hsieh and Turnbull, 1996; Carolan and Tebbs, 2005; Davidov and Herman, 2012). The ODC can also be defined as R = FG⁻¹, where G⁻¹(u) = inf{t: G(t) ≥ u} is the quantile function of G. When F = G, the ODC follows the main diagonal of the unit square, the so-called equal distribution line.

We consider order-restricted comparisons of F and G. Define F̄ = 1− F and Ḡ = 1 − G. These are the survivor functions if X and Y are lifetime random variables, although herein we do not require X and Y to be nonnegative. Denote the corresponding densities by f and g, respectively. If F̄ ≤ Ḡ, then X and Y are stochastically ordered; this is written as F ≤_S G and means informally that X “tends to be smaller” than Y. Two stronger orders are the uniform stochastic order and the likelihood ratio order. When F̄/Ḡ is nonincreasing, X and Y satisfy a uniform stochastic order, written F ≤_US G. When f/g is nonincreasing, X and Y satisfy a likelihood ratio order, written F ≤_LR G. It is easy to show these orderings follow the nested structure: F ≤_LR G ⇒ F ≤_US G ⇒ F ≤_S G. A comprehensive account of these and other orderings is given in Shaked and Shanthikumar (2007).

Different stochastic orderings give rise to different functional forms of the ODC. The weakest ordering F ≤_S G holds if and only if R is at least as large as the equal distribution line; i.e., R(u) ≥ u, for 0 ≤ u ≤ 1. The strongest ordering F ≤_LR G holds if and only if R is concave. The intermediate ordering F ≤_US G holds if and only if R is star-shaped (Lehmann and Rojo, 1992). One way to characterize a star-shaped ODC is that the slope of the secant line from the point (1, 1) to (u,R(u)); i.e., r(u) = {1−R(u)}/(1−u), is nonincreasing in u. Figure 1 gives examples of ODCs that correspond to stochastic, uniform stochastic, and likelihood ratio orderings. This figure demonstrates the utility of the ODC in characterizing how two distributions are ordered and how the structure F ≤_LR G ⇒ F ≤_US G ⇒ F ≤_S G manifests itself graphically in the ODC.

Fig 1 — Ordinal dominance curves. Left: F ≤_S G. Middle: F ≤_US G. Right: F ≤_LR G. In each subfigure, the equal distribution line is shown dotted.

This article is motivated by a pharmacology study evaluating the effects of administering caffeine to prematurely born infants in Columbia, South Carolina; see Section 5. Among 404 infants in the study, m = 127 were administered caffeine and n = 277 were not. Each infant was then followed until he or she was discharged from the hospital. All infants were eventually discharged and were alive at the time of discharge; i.e., no discharge times were censored. One of the goals of the study was to understand how the distributions of discharge times F (caffeine) and G (no caffeine) compared for the two groups. In Figure 2 (left), we display the sample ODC for the data, which is defined as $R_{mn}(u) = F_{m}\{G_{n}^{-1}(u)\}$, for 0 ≤ u ≤ 1, where F_m and G_n are the empirical distribution functions and $G_{n}^{-1}(u) = \inf\{t : G_{n}(t) \geq u\}$ is the empirical quantile function. The sample ODC and its large-sample properties were described in Hsieh and Turnbull (1996).
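The definition of the sample ODC can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the function name `sample_odc` is hypothetical, the convention G_n⁻¹(0) = −∞ is an assumption, and a small tolerance guards against floating-point error in the quantile index.

```python
import numpy as np

def sample_odc(x, y, u):
    """Evaluate R_mn(u) = F_m{G_n^{-1}(u)} at the points in u (hypothetical helper)."""
    x = np.sort(np.asarray(x, dtype=float))
    y = np.sort(np.asarray(y, dtype=float))
    u = np.asarray(u, dtype=float)
    n = y.size
    # Empirical quantile: G_n^{-1}(u) = inf{t : G_n(t) >= u} is the ceil(n*u)-th
    # order statistic of y; for u = 0 we take it to be -infinity (assumption).
    k = np.ceil(n * u - 1e-12).astype(int)
    q = np.where(k >= 1, y[np.clip(k, 1, n) - 1], -np.inf)
    # Empirical cdf: F_m(t) = (number of x_i <= t) / m.
    return np.searchsorted(x, q, side="right") / x.size

# Small usage example with made-up data.
odc_values = sample_odc([1.0, 2.0, 3.0], [1.5, 2.5], [0.0, 0.25, 0.5, 0.75, 1.0])
```

Evaluating the function on a fine grid of u values and plotting the result against u gives a step-function picture of the kind described for Figure 2 (left).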

Fig 2 — Premature infant data. Left: The sample ODC $R_{mn}(u) = F_{m}\{G_{n}^{-1}(u)\}$ for the time to discharge (F = caffeine; G = no caffeine). Right: The least star-shaped majorant ℳR_mn is shown in blue. In each subfigure, the equal distribution line is shown dotted.

On the basis of Figure 2, which stochastic ordering, if any, characterizes the true relationship between the discharge time distributions? There is a substantive literature on nonparametric tests for stochastic orderings with two or more distributions; see Davidov and Herman (2012), El Barmi and McKeague (2016), and the references therein. In the two-sample case, most of this literature describes tests where the equal distribution assumption F = G is treated as the null hypothesis and the ordering (i.e., F ≤_S G, F ≤_US G, or F ≤_LR G) is placed in the alternative. A potential drawback with this type of test is that it is constructed assuming a specific order-restricted class of alternatives; if the assumed class is incorrect, the test may lead to misleading or vacuous conclusions. For example, applying tests of this type to the premature infant data, we obtain the following results:

testing F = G versus F ≤_S G: p-value < 0.00002 (Davidov and Herman, 2012)
testing F = G versus F ≤_US G: p-value < 0.00001 (Arcones and Samaniego, 2000)
testing F = G versus F ≤_LR G: p-value < 0.00001 (Carolan and Tebbs, 2005).

Each test clearly dictates that the infant data are not consistent with F = G. However, we are no closer to identifying which specific ordering (if any) holds in this setting.

In this light, we consider goodness-of-fit (GOF) testing procedures instead. By “goodness-of-fit,” we mean the procedure places the ordering in the null hypothesis and attempts to detect departures from the ordering. By comparison, the literature on nonparametric GOF tests with two distributions is more sparse, perhaps because this type of testing problem is more difficult. The primary reason for the added difficulty is that the ordering can hold under different configurations of F and G. Therefore, one must determine the least favorable configuration of the two distributions before the test can be performed; i.e., so that the probability of type I error can be controlled. Carolan and Tebbs (2005) proposed nonparametric GOF tests for likelihood ratio ordering with two continuous distributions by using the least concave majorant of the sample ODC. This work was generalized and improved upon by Beare and Moon (2015) in the econometrics literature, who considered likelihood ratio ordering and its applications in finance.

GOF tests for uniform stochastic ordering have been proposed but only in limited settings. Dardanoni and Forcina (1998) considered likelihood-based tests against uniform stochastic ordering in a two-way contingency table. Park et al. (1998) used a nonparametric maximum likelihood approach to formulate GOF tests with two or more continuous distributions, but only after data from these distributions have been assigned to disjoint intervals in the form of counts. This essentially discretizes the problem and results in testing against uniform stochastic ordering among several multinomial distributions. Furthermore, this formulation gives rise to non-unique least favorable configurations that depend on how the intervals are selected, the number of distributions, and even the significance level used. Finally, in the two-population setting, Arcones and Samaniego (2000) suggested a GOF test for uniform stochastic ordering based on the family of order-restricted estimators in Mukerjee (1996). However, these authors assume that one of the population distributions is known (e.g., G is known) and do not determine the least favorable configuration for their procedure. Instead, the authors use critical values from an upper bound asymptotic distribution which leads to a conservative test.

In this article, we propose a family of GOF tests for uniform stochastic ordering with two continuous distributions F and G; that is, we are interested in testing H₀: F ≤_US G versus H₁: F ≰_US G, where both distributions are unknown. Motivated by the ODC approaches taken in Carolan and Tebbs (2005) and Beare and Moon (2015), we construct test statistics for H₀ versus H₁ based on the L^p difference between the sample ODC and its least star-shaped majorant (defined in Section 2). We then derive asymptotic distributions and prove that our testing procedure has a unique least favorable configuration for p ∈ [1,∞]. Interestingly, this theoretical result is different from the finding in Beare and Moon (2015), who showed that when using L^p distance-based GOF tests for likelihood ratio ordering, the least favorable configuration exists only when p ∈ [1,2]. Furthermore, unlike Park et al. (1998), our approach does not require one to discretize the support of the distributions which can only lead to a loss in power. Finally, we show that the one-sample version of our test (e.g., with G known) is not as conservative as the test proposed by Arcones and Samaniego (2000) and is generally better equipped to detect departures from H₀.

Formulating L^p distance-based GOF tests for uniform stochastic ordering in the two-sample problem is technically challenging. It is not possible to simply modify the proofs in Carolan and Tebbs (2005) and Beare and Moon (2015) under likelihood ratio ordering; see Section 3. At the same time, establishing that such an ordering exists has great practical implications. For example, if X and Y are lifetime random variables (and are absolutely continuous), then F ≤_US G is equivalent to the corresponding hazard rates being ordered. This is an important characterization in reliability and survival analysis applications. Our interest in uniform stochastic ordering is motivated by our collaboration with researchers in the premature infant study discussed earlier. Letting X and Y denote the times to discharge for the caffeine and no-caffeine groups, respectively, uniform stochastic ordering holds if and only if pr(X > t | X > t₀) ≤ pr(Y > t | Y > t₀), for all t, t₀ satisfying t > t₀ ≥ 0. In other words, no matter how much time t₀ ≥ 0 has already passed, administering caffeine is consistent with shorter discharge times. Note that, in this context, stochastic ordering requires that the relationship above hold only initially (i.e., when t₀ = 0). Uniform stochastic ordering guarantees this type of dominance will hold for all t₀ ≥ 0.
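The link between the conditional survival statement and the definition of F ≤_US G in Section 1 can be written out in one step. The short derivation below is a sketch that assumes the survivor functions are strictly positive at the time points involved, so that the ratios are well defined.

```latex
\begin{align*}
\operatorname{pr}(X > t \mid X > t_0) \le \operatorname{pr}(Y > t \mid Y > t_0)
  &\iff \frac{\bar F(t)}{\bar F(t_0)} \le \frac{\bar G(t)}{\bar G(t_0)} \\
  &\iff \frac{\bar F(t)}{\bar G(t)} \le \frac{\bar F(t_0)}{\bar G(t_0)},
  \qquad t > t_0 \ge 0 .
\end{align*}
```

Requiring the last inequality for all t > t₀ ≥ 0 is the statement that F̄/Ḡ is nonincreasing on [0, ∞), which is the defining property of uniform stochastic ordering for lifetime random variables.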

2. Testing procedure

Suppose that X₁,X₂, …, X_m are independent and identically distributed (iid) from F and that Y₁, Y₂, …, Y_n are iid from G. We assume the two samples are independent and that both F and G are unknown. Let R = FG⁻¹ denote the corresponding ODC. For our asymptotic results in Section 3 to hold, as in Hsieh and Turnbull (1996), we assume F and G have continuous densities f and g and that the first derivative of R is bounded over [0, 1]. Throughout this article, we denote the parameter space of R by Θ, the collection of nondecreasing, continuously differentiable functions from [0, 1] to [0, 1]. Under our assumptions, the hypotheses H₀: F ≤_US G and H₁: F ≰_US G can be expressed equivalently as

$H_{0} : R \in \Theta_{0} = \{\theta \in \Theta : \theta \text{ is star-shaped}\} \quad \text{and} \quad H_{1} : R \in \Theta_{1} = \Theta \setminus \Theta_{0}.$

Recall that θ ∈ Θ is star-shaped if and only if {1 − θ(u)}/(1 − u) is nonincreasing in u.

Let $R_{mn} = R_{mn}(u) = F_{m}\{G_{n}^{-1}(u)\}$ denote the sample ODC, defined in Section 1. Informally, our testing procedure is based on measuring the distance between R_mn and an estimate of R subject to the constraint that F ≤_US G. Towards defining this restricted estimator, let l([0, 1]) denote the collection of bounded functions on [0, 1]. For any h ∈ l([0, 1]), its least star-shaped majorant is defined as

$\mathcal{M} h = \inf\{h^{*} \in l([0, 1]) : h \leq h^{*} \text{ and } h^{*} \text{ is star-shaped}\};$

i.e., ℳh is the smallest star-shaped function in l([0,1]) that is at least as large as h. Throughout our work, we call ℳ: l([0, 1]) ↦ l([0, 1]) the least star-shaped majorant operator. Just as R_mn is an estimator of R under no restriction (Hsieh and Turnbull, 1996), the least star-shaped majorant ℳR_mn is an estimator of R under H₀: F ≤_US G. Using Lemma 1 in the supplementary article (Tang et al., 2016), we show that this restricted estimator can be calculated as

$\mathcal{M} R_{mn}(u) = 1 - \min_{\substack{v \in \mathcal{V} \cup \{0\} \\ v \leq u}} \left\{\frac{1 - R_{mn}(v)}{1 - v}\right\} (1 - u),$

for 0 ≤ u < 1, where 𝒱 is the set of discontinuity (jump) points of R_mn and ℳR_mn(1) = 1. Figure 2 (right) shows the least star-shaped majorant of the sample ODC for the premature infant data described in Section 1.
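The running-minimum structure of this formula is easy to see in code. The sketch below is an illustration under stated assumptions: it works with function values on an increasing grid and takes the minimum over grid points v ≤ u, rather than over the exact jump set 𝒱, and the name `star_majorant` is hypothetical.

```python
import numpy as np

def star_majorant(u, h):
    """Grid version of the least star-shaped majorant with kernel point (1, 1).

    u : increasing grid on [0, 1]; h : function values on that grid.
    """
    u = np.asarray(u, dtype=float)
    h = np.asarray(h, dtype=float)
    out = np.ones_like(h)                       # value 1 at u = 1 by convention
    inner = u < 1
    slope = (1.0 - h[inner]) / (1.0 - u[inner])  # r(v) = {1 - h(v)} / (1 - v)
    run_min = np.minimum.accumulate(slope)       # min of r(v) over grid points v <= u
    out[inner] = 1.0 - run_min * (1.0 - u[inner])
    return out

grid = np.linspace(0.0, 1.0, 101)
maj_convex = star_majorant(grid, grid ** 2)     # u^2 is not star-shaped; majorant is u
maj_concave = star_majorant(grid, np.sqrt(grid))  # sqrt(u) is already star-shaped
```

In the first example the secant slope 1 + u is increasing, so the running minimum stays at its initial value and the majorant is the equal distribution line; in the second the slope is already nonincreasing, so the operator leaves the function unchanged.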

Our testing procedure utilizes the sample ODC R_mn and its least star-shaped majorant ℳR_mn. Specifically, we propose the family of test statistics

$M_{mn}^{p} = c_{mn} \|\mathcal{M} R_{mn} - R_{mn}\|_{p},$

where $c_{mn} = \{mn/(m+n)\}^{1/2}$ is a normalizing constant and ||·||_p is the L^p norm with respect to Lebesgue measure. We allow for p ∈ [1,∞]; i.e., $\|h\|_{p} = \{\int_{[0,1]} |h(u)|^{p}\,du\}^{1/p}$ when p < ∞ and $\|h\|_{\infty} = \sup_{u \in [0,1]} |h(u)|$. For example, when p = 1, ||ℳR_mn − R_mn||₁ equals the area between the two estimators; when p = ∞, ||ℳR_mn − R_mn||_∞ equals the largest vertical distance between the estimators. For any p ∈ [1,∞], clearly large values of $M_{mn}^{p}$ are evidence against H₀.
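Putting the pieces of this section together, the statistic can be approximated numerically as follows. This is a rough sketch rather than the authors' implementation: the function name `gof_statistic`, the grid size, and the use of a grid average in place of the exact integral are all assumptions made for illustration.

```python
import numpy as np

def gof_statistic(x, y, p=np.inf, grid_size=2001):
    """Grid approximation to M_mn^p = c_mn * ||M R_mn - R_mn||_p (hypothetical helper)."""
    x = np.sort(np.asarray(x, dtype=float))
    y = np.sort(np.asarray(y, dtype=float))
    m, n = x.size, y.size
    u = np.linspace(0.0, 1.0, grid_size)

    # Sample ODC R_mn(u) = F_m{G_n^{-1}(u)}, taking G_n^{-1}(0) = -infinity.
    k = np.ceil(n * u - 1e-12).astype(int)
    q = np.where(k >= 1, y[np.clip(k, 1, n) - 1], -np.inf)
    r_mn = np.searchsorted(x, q, side="right") / m

    # Least star-shaped majorant on the grid: running minimum of the slope
    # {1 - R_mn(v)} / (1 - v) over grid points v <= u, with value 1 at u = 1.
    inner = u < 1
    slope = (1.0 - r_mn[inner]) / (1.0 - u[inner])
    maj = np.ones_like(r_mn)
    maj[inner] = 1.0 - np.minimum.accumulate(slope) * (1.0 - u[inner])

    # L^p distance: maximum for p = infinity, grid average otherwise.
    gap = np.clip(maj - r_mn, 0.0, None)
    norm = gap.max() if np.isinf(p) else np.mean(gap ** p) ** (1.0 / p)
    return np.sqrt(m * n / (m + n)) * norm

# Toy data: every x lies below every y, so the sample ODC is already star-shaped.
stat_ordered = gof_statistic([0.1, 0.2], [1.0, 2.0], p=np.inf)
# Toy data: every x lies above every y, an extreme departure from the ordering.
stat_reversed = gof_statistic([3.0, 4.0], [1.0, 2.0], p=np.inf)
```

The two toy calls illustrate the direction of the test: the statistic is zero when the sample ODC coincides with its majorant and grows as the sample ODC falls below it.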

3. Theoretical results

In this section, we first describe the asymptotic distribution of $M_{m n}^{p}$ for any star-shaped ODC; i.e., for any R ∈ Θ₀. We then demonstrate that, for any p ∈ [1,∞], all null distributions are dominated stochastically by the asymptotic distribution of $M_{m n}^{p}$ under R(u) = u, that is, when F = G. From this least favorable distribution, we can find the critical value c_α,p that satisfies ${lim}_{m, n \to \infty} pr (M_{m n}^{p} \geq c_{α, p}) = α$ when F = G and ${lim}_{m, n \to \infty} pr (M_{m n}^{p} \geq c_{α, p}) \leq α$ when H₀: F ≤_US G is true. In other words, rejecting H₀ when $M_{m n}^{p} \geq c_{α, p}$ is an asymptotic size α decision rule. Finally, we examine relevant asymptotic distributions when R ∈ Θ₁ and then characterize large-sample power properties. We also discuss sample size calculations to detect departures from H₀. All theorems are proved in Section 7. Additional technical details are provided in the supplementary article (Tang et al., 2016).

3.1. Asymptotic results under H₀

Let ℐ denote the identity operator on l([0, 1]) and define 𝒟 = ℳ−ℐ. When H₀ is true; i.e., when R ∈ Θ₀, note that ℳR = R and

$M_{mn}^{p} = c_{mn} \|\mathcal{M} R_{mn} - R_{mn}\|_{p} = c_{mn} \|\mathcal{D} R_{mn} - \mathcal{D} R\|_{p}.$

At first glance, establishing the limiting distribution of $M_{m n}^{p}$ under H₀ might seem to be straightforward, that is, one could simply start with the asymptotic distribution of c_mn(R_mn − R) described in Hsieh and Turnbull (1996) and apply the functional delta method (see, e.g., Section 3.9 in van der Vaart and Wellner, 1996) and continuous mapping theorem. This was the approach taken by Beare and Moon (2015) with their L^p distance-based GOF test statistics under likelihood ratio ordering. In our setting, this direct approach is not possible because whereas the least concave majorant operator in Beare and Moon (2015) is Hadamard directionally differentiable (Shapiro, 1990, 1991), the least star-shaped majorant operator ℳ (and hence 𝒟) is not always so; see Lemma 5 in the supplementary article (Tang et al., 2016). Fortunately, this does not create insurmountable problems because weak convergence of c_mn(𝒟R_mn − 𝒟R) is not a necessary prerequisite to derive the asymptotic distribution of c_mn||𝒟R_mn −𝒟R||_p.

Before we state the asymptotic distribution of $M_{m n}^{p}$ for any R ∈ Θ₀, we need to describe R precisely because these distributions depend completely on the shape of R. Recall that when R ∈ Θ₀, the slope function r(u) = {1 − R(u)}/(1 − u) is nonincreasing in u. When r(u) is strictly decreasing over [0, 1], we say that R is strictly star-shaped. When R ∈ Θ₀ is not strictly star-shaped, then, analogously to Beare and Moon (2015), there exists a unique collection (finite or countable) of closed, pairwise disjoint intervals of the form [a_k, b_k], 0 ≤ a_k < b_k ≤ 1, where

the slope r(u) is constant over each interval (i.e., R is affine over each interval)
no two intervals possess the same value of r(u).

In this case, we say that R ∈ Θ₀ is non-strictly star-shaped. The reason we bifurcate Θ₀ using “strictly” and “non-strictly” descriptors is that the nondegenerate part of the asymptotic distribution of $M_{m n}^{p}$ depends only on those regions where R is non-strictly star-shaped. If R is strictly star-shaped over [0, 1], the distribution of $M_{m n}^{p}$ collapses to zero in the limit.

To make our description of the asymptotic distributions precise, we therefore introduce the following notation. For 0 ≤ a < b ≤ 1, define

$\mathcal{M}_{[a,b]}^{(1,0)} h = \inf\{h^{*} \in l([0, 1]) : h \leq h^{*} \text{ and } h^{*} \text{ is star-shaped over } [a, b] \text{ with kernel } (1, 0)\}.$

A general definition of what it means for a function h^* to be star-shaped with kernel (c, d) is given directly before Lemma 1 in the supplementary article (Tang et al., 2016). For any h ∈ l([0, 1]), the function $\mathcal{M}_{[a,b]}^{(1,0)} h$ has two defining characteristics. First, $\mathcal{M}_{[a,b]}^{(1,0)} h(u) = h(u)$ whenever u ∉ [a, b]. Second, over [a, b], $\mathcal{M}_{[a,b]}^{(1,0)} h$ is the smallest function (at least as large as h) that is star-shaped with kernel (1, 0); i.e., the slope function $-\mathcal{M}_{[a,b]}^{(1,0)} h(u)/(1 - u)$ over [a, b] is nonincreasing in u. The importance of the functional operator $\mathcal{M}_{[a,b]}^{(1,0)} : l([0, 1]) \mapsto l([0, 1])$ becomes clear as we state our first main result.

Theorem 1

Suppose R ∈ Θ₀ and let ℬ denote a standard Brownian bridge. The asymptotic results below hold when min{m, n} → ∞ and n/(m + n) → λ ∈ (0, 1).

(a) If R is strictly star-shaped over [0, 1], then $M_{mn}^{p} \overset{d}{\to} 0$ for all p ∈ [1,∞].
(b) If R is non-strictly star-shaped, then for p ∈ [1,∞),
$M_{mn}^{p} \overset{d}{\to} \left\{\sum_{k} [\lambda R'(a_{k}) + (1 - \lambda) \{R'(a_{k})\}^{2}]^{p/2} \int_{a_{k}}^{b_{k}} \{\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)} \mathcal{B}(u)\}^{p}\,du\right\}^{1/p};$

when p = ∞,
$M_{mn}^{p} \overset{d}{\to} \sup_{k} \left\{[\lambda R'(a_{k}) + (1 - \lambda) \{R'(a_{k})\}^{2}]^{1/2} \sup_{u \in [a_{k},b_{k}]} \mathcal{D}_{[a_{k},b_{k}]}^{(1,0)} \mathcal{B}(u)\right\}.$

In both asymptotic distributions, R′ is the derivative of R and $\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)} = \mathcal{M}_{[a_{k},b_{k}]}^{(1,0)} - \mathcal{I}$.

From Theorem 1, one can see that when F ≤_US G, the only randomness in the asymptotic distribution of $M_{mn}^{p}$ arises from the non-strictly star-shaped regions [a_k, b_k] and is described probabilistically by the $\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)} \mathcal{B}$ processes. Furthermore, when F = G, the asymptotic distribution of $M_{mn}^{p}$ simplifies to $\|\mathcal{D}_{[0,1]}^{(1,0)} \mathcal{B}\|_{p}$ for all p ∈ [1,∞]. When p = 1, for example, this quantity describes the distribution of the area between the least star-shaped majorant of a standard Brownian bridge ℬ and ℬ itself. When p = ∞, $\|\mathcal{D}_{[0,1]}^{(1,0)} \mathcal{B}\|_{\infty}$ describes the distribution of the sup-norm distance between these two processes. Readers familiar with the GOF tests for likelihood ratio ordering in Carolan and Tebbs (2005) and Beare and Moon (2015) will no doubt recognize the similarity between our Theorem 1 and the corresponding results in these articles. However, as noted earlier, GOF tests for uniform stochastic ordering present their own set of mathematical challenges and different conclusions are reached about the existence of a least favorable configuration.

Theorem 2

Suppose R ∈ Θ₀. For any p ∈ [1,∞], the asymptotic distribution of $M_{mn}^{p}$ is ordinary stochastically smaller than $\|\mathcal{D}_{[0,1]}^{(1,0)} \mathcal{B}\|_{p}$; i.e.,

$\lim_{\substack{m, n \to \infty \\ n/(m+n) \to \lambda}} \operatorname{pr}_{R \in \Theta_{0}} (M_{mn}^{p} \geq t) \leq \operatorname{pr}(\|\mathcal{D}_{[0,1]}^{(1,0)} \mathcal{B}\|_{p} \geq t),$

for all t ∈ ℝ, where λ is defined in Theorem 1.

Theorem 2 establishes that when using $M_{mn}^{p}$ to test H₀: F ≤_US G versus H₁: F ≰_US G, the equal distribution line R(u) = u represents the least favorable configuration of F and G for all p ∈ [1,∞]. Proving this result involves showing that the $\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)} \mathcal{B}$ processes in Theorem 1 are mutually independent, a somewhat startling discovery because each process shares the same Brownian bridge ℬ and each operator $\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}$ shares the same kernel point (1, 0). The practical utility of Theorem 2 is that, for any p ∈ [1,∞], we can determine the critical value corresponding to the configuration that maximizes the probability of type I error over all configurations of F and G in Θ₀. This result differs from the conclusion reached in Beare and Moon (2015), who showed that when testing against likelihood ratio ordering using L^p distance-based statistics involving the least concave majorant of R_mn, R(u) = u is the least favorable configuration when p ∈ [1, 2] and that for p > 2 the least favorable configuration does not exist. Careful inspection of Theorem 1 and some intuition shed light on why this is true. When R is star-shaped, but not strictly star-shaped, each of the derivatives R′(a_k) in Theorem 1 satisfies R′(a_k) ≤ 1. However, when F ≤_LR G, there is no guarantee these derivatives are uniformly bounded for all concave R and hence anomalous limiting behavior can result when p is too large.

For given values of the significance level α and p ∈ [1,∞], denote the 1−α quantile of $\|\mathcal{D}_{[0,1]}^{(1,0)} \mathcal{B}\|_{p}$ by c_α,p; i.e., c_α,p solves $\alpha = \operatorname{pr}(\|\mathcal{D}_{[0,1]}^{(1,0)} \mathcal{B}\|_{p} \geq c_{\alpha,p})$. To approximate the distribution of $\|\mathcal{D}_{[0,1]}^{(1,0)} \mathcal{B}\|_{p}$, we generated 100,000 Brownian bridge paths on a grid of 100,000 equally spaced points in [0, 1], and, for each p ∈ {1, 2, 3, 5, ∞}, we calculated $\|\mathcal{D}_{[0,1]}^{(1,0)} \mathcal{B}\|_{p}$ for each path. For each p, these 100,000 values were used to approximate the density function of $\|\mathcal{D}_{[0,1]}^{(1,0)} \mathcal{B}\|_{p}$ and quantiles c_α,p, for α = 0.01, 0.05, and 0.10. These functions and the selected quantiles c_α,p are provided in the supplementary article (Tang et al., 2016).
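A scaled-down version of this Monte Carlo experiment might look as follows. The sketch is illustrative only: the path and grid counts are far smaller than the 100,000 used above, the bridge is built from a discretized Brownian motion, the function name `simulate_null_norms` is hypothetical, and the resulting quantiles are rough approximations rather than the tabulated critical values.

```python
import numpy as np

def simulate_null_norms(n_paths=2000, n_grid=1000, p=np.inf, seed=0):
    """Simulate ||D B||_p for the kernel-(1, 0) majorant of a Brownian bridge B."""
    rng = np.random.default_rng(seed)
    u = np.linspace(0.0, 1.0, n_grid + 1)
    # Brownian motion on the grid, then B(u) = W(u) - u * W(1).
    steps = rng.normal(scale=np.sqrt(1.0 / n_grid), size=(n_paths, n_grid))
    w = np.concatenate([np.zeros((n_paths, 1)), np.cumsum(steps, axis=1)], axis=1)
    bridge = w - u * w[:, -1:]
    inner = u < 1                      # the gap at u = 1 is zero and is omitted
    # Kernel (1, 0): the majorant is the running maximum of B(v) / (1 - v)
    # over grid points v <= u, multiplied by (1 - u).
    ratio = bridge[:, inner] / (1.0 - u[inner])
    maj = np.maximum.accumulate(ratio, axis=1) * (1.0 - u[inner])
    gap = np.clip(maj - bridge[:, inner], 0.0, None)
    if np.isinf(p):
        return gap.max(axis=1)
    return np.mean(gap ** p, axis=1) ** (1.0 / p)

norms_sup = simulate_null_norms(n_paths=500, n_grid=400, p=np.inf, seed=1)
approx_crit = np.quantile(norms_sup, [0.90, 0.95, 0.99])  # rough c_{alpha, inf}
```

Increasing `n_paths` and `n_grid` toward the values used in the text should stabilize the estimated quantiles, at the cost of memory and run time.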

3.2. Asymptotic results under H₁

The difference between the asymptotic distribution of $M_{mn}^{p}$ under H₀: R ∈ Θ₀ and that under H₁: R ∈ Θ₁ arises from the non-star-shaped regions of R. To characterize a non-star-shaped ODC R ∈ Θ₁, start with ℳR, which is star-shaped, and note that (as in Section 3.1) one can partition the unit interval [0, 1] as [0, 1] = S ∪ (∪_k S_k), where ℳR is strictly star-shaped over S and non-strictly star-shaped over pairwise disjoint intervals of the form S_k = [a_k, b_k], 0 ≤ a_k < b_k ≤ 1, for k = 1, 2, …. One can further partition each S_k as S_k = S_k1 ∪ S_k2, where S_k1 = {u ∈ S_k: ℳR(u) = R(u)} and S_k2 = {u ∈ S_k: ℳR(u) > R(u)}. Each S_k1 must contain a_k so it is never empty, and the non-star-shaped regions of R can be written as ∪_k S_k2. In other words, R ∈ Θ₀ when ∪_k S_k2 is empty and R ∈ Θ₁ otherwise.

In general, these types of regions contribute differently to the limiting distribution of $M_{mn}^{p}$. Over the strictly star-shaped region S, ℳR(u) = R(u) for all u and the L^p norm of c_mn{𝒟R_mn(u) − 𝒟R(u)} converges in distribution to 0, as in Section 3.1. To clearly describe the contribution over the S_k regions, we introduce new notation. For any h ∈ l([0, 1]), define the functional operator $\mathcal{L}_{S_{k}} : l([0, 1]) \mapsto l([0, 1])$ according to

$\mathcal{L}_{S_{k}} h(u) = -\inf_{\substack{v \in S_{k1} \\ v \leq u}} \left\{\frac{-h(v)}{1 - v}\right\} (1 - u)\, I_{S_{k}}(u) + h(u)\, I_{S_{k}^{c}}(u), \quad \text{for } u \in [0, 1),$

where I_A(·) is the indicator function over the set A and A^c denotes the complement of A. When u = 1, $\mathcal{L}_{S_{k}} h(u)$ = max{h(1), 0} or h(1) depending on whether the singleton {1} ∈ S_k1 or not; see Appendix C in the supplementary article (Tang et al., 2016). Using this new operator, we now characterize asymptotic distributions for any ODC R ∈ Θ with those in Θ₁ = Θ\Θ₀ of particular interest. A discussion on the large-sample power properties of our testing procedure follows.

Theorem 3

Suppose R ∈ Θ. Using the notation described in this subsection,

$c_{mn} \|\mathcal{D} R_{mn} - \mathcal{D} R\|_{p} \overset{d}{\to} \left\{\sum_{k} \int_{u \in S_{k}} |\mathcal{L}_{S_{k}} T_{R}^{\lambda}(u) - T_{R}^{\lambda}(u)|^{p}\,du\right\}^{1/p}$

for p ∈ [1,∞); when p = ∞,

$c_{mn} \|\mathcal{D} R_{mn} - \mathcal{D} R\|_{p} \overset{d}{\to} \sup_{k} \sup_{u \in S_{k}} |\mathcal{L}_{S_{k}} T_{R}^{\lambda}(u) - T_{R}^{\lambda}(u)|.$

Both results hold as min{m, n} → ∞ and n/(m + n) → λ ∈ (0, 1). In both cases, $T_{R}^{λ} (u) = λ^{1 / 2} ℬ_{1} (R (u)) + {(1 - λ)}^{1 / 2} R^{'} (u) ℬ_{2} (u)$ , 0 ≤ u ≤ 1, where ℬ₁ and ℬ₂ denote two independent standard Brownian bridges.

Four remarks are in order. First, the process $T_{R}^{\lambda} = \{T_{R}^{\lambda}(u), 0 \leq u \leq 1\}$ in Theorem 3 is well known; as noted earlier, it represents the asymptotic distribution of c_mn(R_mn − R) for any R ∈ Θ; see, e.g., Theorem 2.2 in Hsieh and Turnbull (1996). Second, the asymptotic distributions identified in Theorem 3 apply for any R ∈ Θ, but we show in the supplementary article (Tang et al., 2016) that they quickly reduce to those in Theorem 1 when R ∈ Θ₀. Third, our L^p tests are consistent for p ∈ [1,∞]. To see why, consider the sup-norm (p = ∞) case in Theorem 3 and note that, by the triangle inequality,

$\operatorname{pr}_{R \in \Theta_{1}} (M_{mn}^{\infty} \geq c_{\alpha,\infty}) = \operatorname{pr}_{R \in \Theta_{1}} (c_{mn} \|\mathcal{D} R_{mn}\|_{\infty} \geq c_{\alpha,\infty}) \geq \operatorname{pr}_{R \in \Theta_{1}} (c_{mn} \|\mathcal{D} R_{mn} - \mathcal{D} R\|_{\infty} \leq c_{mn} \|\mathcal{D} R\|_{\infty} - c_{\alpha,\infty}),$

which can be approximated by

$\operatorname{pr}_{R \in \Theta_{1}} \left(\sup_{k} \sup_{u \in S_{k}} |\mathcal{L}_{S_{k}} T_{R}^{\lambda}(u) - T_{R}^{\lambda}(u)| \leq c_{mn} \|\mathcal{D} R\|_{\infty} - c_{\alpha,\infty}\right).$

It is easy to show that $\sup_{k} \sup_{u \in S_{k}} |\mathcal{L}_{S_{k}} T_{R}^{\lambda}(u) - T_{R}^{\lambda}(u)|$ is bounded in probability and that, for any R ∈ Θ₁, c_mn||𝒟R||_∞ → ∞, as min{m, n} → ∞, which establishes our claim. The finite p argument is analogous. Fourth, approximate lower bounds on the power, like the one above in the sup-norm case, can be used for sample size calculations. For an ODC R ∈ Θ₁ deemed to be clinically relevant, one can determine numerically the smallest m and n that solve $\operatorname{pr}_{R \in \Theta_{1}} (\sup_{k} \sup_{u \in S_{k}} |\mathcal{L}_{S_{k}} T_{R}^{\lambda}(u) - T_{R}^{\lambda}(u)| \leq c_{mn} \|\mathcal{D} R\|_{\infty} - c_{\alpha,\infty}) = 1 - \beta$, where β ∈ (0, 1). Because it is based on a lower bound for the power, the resulting solution will tend to be conservative, but it is still potentially useful for planning purposes. We illustrate this approach with examples in Section 4.

We conclude this section with a brief discussion on local power. This discussion is similar to the local power discussion in Beare and Moon (2015) under likelihood ratio ordering. However, our interest in local power arises because we want to compare the one-sample version of our testing procedure to the GOF test suggested by Arcones and Samaniego (2000). This one-sample comparison is given in Section 4.3. The two-sample discussion is given now. Let {R^(r), r = 1, 2, …} denote a sequence of ODCs in Θ₁. For each r ≥ 1, denote the corresponding distributions by F^(r) and G^(r) from which we have independent random samples $X_{1}^{(r)}, X_{2}^{(r)}, \dots, X_{m}^{(r)}$ and $Y_{1}^{(r)}, Y_{2}^{(r)}, \dots, Y_{n}^{(r)}$, respectively. We examine local power properties by letting R^(r) approach Θ₀ in the sense that ||𝒟R^(r)||_p = ||ℳR^(r) − R^(r)||_p → 0 as r → ∞ at different rates. Using the notation in this paragraph, our last theorem summarizes the salient results.

Theorem 4

Suppose the first derivative of R^(r) ∈ Θ₁ is uniformly bounded over [0, 1] for all r. Suppose p ∈ [1,∞]. All limits stated below assume that max{m, n} = O(r) and n/(m + n) → λ ∈ (0, 1), as r → ∞.

(a) If lim c_mn||𝒟R^(r)||_p = ∞, then $\lim \operatorname{pr}_{R^{(r)} \in \Theta_{1}} (M_{mn}^{p} > c_{\alpha,p}) = 1$.
(b) For any β ∈ (0, 1), there exists η_p(β) > 0 such that
$\liminf \operatorname{pr}_{R^{(r)} \in \Theta_{1}} (M_{mn}^{p} > c_{\alpha,p}) \geq 1 - \beta$

whenever lim inf c_mn||𝒟R^(r)||_p ≥ η_p(β).

Part (a) of Theorem 4 indicates that when ||𝒟R^(r)||_p converges to 0 at a rate slower than $c_{mn}^{-1}$, c_mn||𝒟R^(r)||_p diverges and the power of our test converges to 1. Part (b) guarantees that when c_mn||𝒟R^(r)||_p remains bounded away from zero, the power of our test is still nontrivial; i.e., it does not converge to 0. This occurs when the “amount of information” c_mn increases and the “departure” ||𝒟R^(r)||_p decreases, and both do so at the same rate.

4. Simulation evidence

We use simulation to assess the finite-sample performance of our tests. In Section 4.1, we consider fixed ODCs under both H₀: F ≤_US G and H₁: F ≰_US G to estimate type I error probability and power, respectively, and we illustrate the sample size calculations described in Section 3.2. In Section 4.2, we modify our testing procedure to allow for one of the population distributions to be known and compare this modified test to the one-sample GOF test in Arcones and Samaniego (2000). Local power results are provided in Section 4.3.

4.1. Fixed ODC comparisons

We consider four ODCs satisfying R ∈ Θ₀ (R₁, R₂, R₃, and R₄) and four ODCs satisfying R ∈ Θ₁ (R₅, R₆, R₇, and R₈). The H₀ ODCs (Figure 3, left) are each members of a family of star-shaped ODCs that we describe in the supplementary article (Tang et al., 2016). The H₁ ODCs (Figure 3, right) are not star-shaped and are also described in Tang et al. (2016). We also consider R₀ = R₀(u) = u, for u ∈ [0, 1], to examine finite-sample performance under the least favorable configuration F = G. All of our results are based on 10,000 Monte Carlo data sets using independent samples from F and G with sample sizes m and n, respectively. To generate the samples, we let F(u) = R_i(u) and G(u) = u, for u ∈ [0, 1]. We then sample X₁, X₂, …, X_m from F using the inverse cumulative distribution function technique and independently sample Y₁, Y₂, …, Y_n from a uniform(0, 1) distribution. This provides independent samples for each ODC R under consideration.
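The sampling scheme can be sketched as follows. Because the curves R₁ through R₈ are defined only in the supplementary article, the example uses a stand-in ODC, R(u) = 1 − (1 − u)^a with a ≥ 1, which is star-shaped because its secant slope (1 − u)^(a−1) is nonincreasing; this curve, the parameter `a`, and the function name `draw_samples` are assumptions made purely for illustration.

```python
import numpy as np

def draw_samples(m, n, a=2.0, seed=0):
    """Draw X ~ F with F(u) = R(u) = 1 - (1 - u)^a on [0, 1] and Y ~ uniform(0, 1)."""
    rng = np.random.default_rng(seed)
    v = rng.uniform(size=m)
    # Inverse cdf technique: solving v = 1 - (1 - x)^a gives x = 1 - (1 - v)^(1/a).
    x = 1.0 - (1.0 - v) ** (1.0 / a)
    y = rng.uniform(size=n)            # G(u) = u, so Y is uniform(0, 1)
    return x, y

x_sim, y_sim = draw_samples(m=100_000, n=50, a=2.0, seed=0)
```

Since G is the uniform(0, 1) distribution function, the ODC of the simulated pair is R = FG⁻¹ = F, which is why choosing F = R_i reproduces any desired curve.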

Fig 3 — Left: Star-shaped ODCs; i.e., R_i ∈ Θ₀. Right: Non-star-shaped ODCs; i.e., R_i ∈ Θ₁. A description of each curve is given in the supplementary article (Tang et al., 2016).

Table S.2 in the supplementary article (Tang et al., 2016) gives Monte Carlo estimates of the probability of rejecting H₀ : F ≤_US G for different sample sizes, values of p ∈ {1, 2,∞}, and α = 0.05. We experimented with other values of p (i.e., p = 3 and p = 5) but obtained results similar to those when p = 2. Of initial interest is the finite-sample performance when F = G. With 10,000 simulated data sets, the margin of error associated with the size estimates under F = G, assuming a 99 percent confidence level, is approximately 0.006. Therefore, one notes that our tests with p = 1 and p = 2 are slightly anticonservative with small samples and otherwise operate closely to the nominal level. Furthermore, examining the rejection rates for the other star-shaped ODCs (R₁, R₂, R₃, and R₄) supports Theorem 2 which, for p ∈ [1,∞], guarantees the probability of type I error will be at its maximum under F = G. Likewise, powers for the non-star-shaped ODCs (R₅, R₆, R₇, and R₈) all approach unity as m and n become large. This reinforces our consistency claim.
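The quoted margin of error can be checked with a short calculation. The sketch assumes the usual normal approximation for a binomial proportion evaluated at the nominal level 0.05; the variable names are illustrative.

```python
import math
from statistics import NormalDist

alpha = 0.05          # nominal size of the test
n_sims = 10_000       # number of Monte Carlo data sets
confidence = 0.99

# Two-sided normal quantile for the stated confidence level.
z = NormalDist().inv_cdf(1.0 - (1.0 - confidence) / 2.0)
margin = z * math.sqrt(alpha * (1.0 - alpha) / n_sims)
print(round(margin, 4))   # 0.0056, i.e., approximately 0.006
```

Estimated sizes within roughly 0.006 of 0.05 are therefore compatible with the nominal level at this number of replications.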

We also use the non-star-shaped ODCs in Figure 3 to illustrate sample size determination. For p ∈ [1,∞] and for a given R ∈ Θ₁, denote by d_R,β,p the 1 − β quantile of the asymptotic distributions in Theorem 3. Using our lower bound on the asymptotic power from Section 3.2 and taking m = n (for simplicity), we obtain a closed-form expression for the minimum sample size necessary to detect the departure ||𝒟R||_p = ||ℳR−R||_p with probability 1 − β when using an asymptotic size α test; i.e.,

$$m = 2\left(\frac{d_{R,\beta,p} + c_{\alpha,p}}{\|\mathcal{D}R\|_{p}}\right)^{2}, \quad \text{for } p \in [1,\infty].$$

With α = 0.05 and 1−β = 0.8, the supplementary article (Tang et al., 2016) tabulates these solutions for each non-star-shaped ODC in Figure 3 and for each p ∈ {1, 2,∞}. For example, for the R₅ ODC, which corresponds to F and G being stochastically ordered (but not uniformly stochastically ordered), the minimum sample size solutions for p ∈ {1, 2,∞}, respectively, are m = 634, m = 461, and m = 582. Such sample sizes might seem dispiritingly large; however, because they are derived from a lower bound on the asymptotic power, it is not surprising that these solutions are conservative. We describe in Section 6 alternative approaches that should reduce this conservatism.
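For readers who wish to evaluate the sample size expression themselves, a minimal sketch follows. The function name and the numerical inputs are hypothetical; in practice d_R,β,p comes from the asymptotic distribution in Theorem 3, c_α,p from the critical-value tables, and ||𝒟R||_p from the alternative R of interest. Rounding up to the next integer is our choice for the sketch.

```python
from math import ceil

def min_sample_size(d_quantile, c_alpha, departure_norm):
    """Common sample size m = n from m = 2 * ((d + c) / ||DR||_p)^2, rounded up."""
    if departure_norm <= 0:
        raise ValueError("departure_norm must be positive (R must lie in Theta_1)")
    return ceil(2.0 * ((d_quantile + c_alpha) / departure_norm) ** 2)

# Hypothetical inputs for illustration only.
m = min_sample_size(d_quantile=1.0, c_alpha=1.0, departure_norm=0.25)
```

Because the departure enters through its reciprocal squared, halving ||𝒟R||_p quadruples the required sample size.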

4.2. Comparison with Arcones and Samaniego (2000)

We now turn our attention to the special case of testing H₀ : F ≤_US G versus H₁ : F ≰ _US G where G is known. Arcones and Samaniego (2000), who focused largely on optimal estimation of F (with F ≤_US G and G known), also suggested a conservative large-sample procedure to test against H₀. Their proposed test statistic, which we denote by D_m, can be expressed as a function of the one-sample ODC R_m = F_mG⁻¹; specifically,

$$D_{m} = m^{1/2} \sup_{0 \leq v \leq u \leq 1} \left[(1 - v)\{1 - R_{m}(u)\} - (1 - u)\{1 - R_{m}(v)\}\right].$$

However, instead of deriving a least favorable (asymptotic) distribution for inference, the authors proved that the asymptotic distribution of D_m is bounded above by $2 \sup_{u \in [0,1]} |\mathcal{B}(u)|$, where ℬ is a standard Brownian bridge, and selected their critical value $c_{\alpha/2}^{AS}$ to satisfy $\alpha = \mathrm{pr}(\sup_{u \in [0,1]} |\mathcal{B}(u)| \geq c_{\alpha/2}^{AS})$. On the other hand, one-sample versions of our GOF procedure are available and use the test statistics

$$M_{m}^{p} = m^{1/2} \left[\int_{[0,1]} \{\mathcal{D}R_{m}(u)\}^{p}\, du\right]^{1/p} \quad \text{and} \quad M_{m}^{\infty} = m^{1/2} \sup_{u \in [0,1]} \{\mathcal{D}R_{m}(u)\},$$

where 𝒟 is the operator defined in Section 3.1 and R_m(u) = F_m{G⁻¹(u)}. The limiting distributions in Theorem 1 also apply here as m → ∞; in addition, it is straightforward to modify the proof of Theorem 2 to conclude that F = G admits the least favorable configuration for p ∈ [1,∞] in the known G case.
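To make the one-sample statistics concrete, the following sketch approximates D_m, M_m^p, and M_m^∞ on a grid when G is known. It is an illustration under stated assumptions rather than our implementation: the function name and grid size are hypothetical, the suprema and integrals are replaced by grid maxima and averages, and we assume that 𝒟R(u) = sup_{v ≤ u} {(1 − u)/(1 − v)}{R(v) − 1} − {R(u) − 1}, a form patterned on the representation of the (1, 0) operator used in the proof of Theorem 1.

```python
import numpy as np

def one_sample_statistics(x, G, p_values=(1, 2), grid_size=400):
    """Grid approximations to D_m and the one-sample M_m^p statistics (G known)."""
    x = np.asarray(x, dtype=float)
    m = x.size
    u = np.arange(grid_size) / grid_size      # grid on [0, 1); u = 1 is excluded
    # R_m(u) = F_m{G^{-1}(u)} is the proportion of G(X_i) that are <= u.
    gx = np.sort(G(x))
    R = np.searchsorted(gx, u, side="right") / m

    # D_m: sup over v <= u of (1 - v){1 - R_m(u)} - (1 - u){1 - R_m(v)}.
    S = 1.0 - R
    diff = np.outer(S, 1.0 - u) - np.outer(1.0 - u, S)   # rows index u, columns index v
    D_m = np.sqrt(m) * np.tril(diff).max()               # lower triangle keeps v <= u

    # Assumed form of the operator: running supremum of {R(v) - 1} / (1 - v).
    run_max = np.maximum.accumulate((R - 1.0) / (1.0 - u))
    DR = (1.0 - u) * run_max - (R - 1.0)

    out = {"D_m": float(D_m), "M_inf": float(np.sqrt(m) * DR.max())}
    for p in p_values:
        out[f"M_{p}"] = float(np.sqrt(m) * np.mean(DR ** p) ** (1.0 / p))
    return out

rng = np.random.default_rng(0)
x = rng.uniform(size=100)                          # F = G = uniform(0, 1)
stats = one_sample_statistics(x, G=lambda t: t)    # identity is the uniform(0, 1) CDF
```

The grid versions slightly understate the exact suprema because R_m is a step function; a finer grid, or evaluation at the jump points of R_m, would tighten the approximation.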

For different sample sizes m (now corresponding to F only), Table S.3 in the supplementary article (Tang et al., 2016) gives small-sample rejection rates of our one-sample tests and the test from Arcones and Samaniego (2000), both performed using α = 0.05. We used techniques similar to those described in Section 3.1 to approximate the critical value $c_{α / 2}^{AS} = c_{0.025}^{AS} = 1.359$ and performed all simulations in the same way as before except G is now known. Clearly, there is a price to be paid for using the test based on the D_m statistic when F = G; type I error probability estimates remain significantly below the nominal level for all m ≤ 200. On the other hand, our p = 1 and p = 2 tests are only minimally conservative when m ≤ 75, and our sup-norm (p = ∞) test performs nominally even when m = 20. In addition, the sup-norm test can be markedly more powerful at detecting non-star-shaped alternatives with small to moderately sized samples.

4.3. Local power analysis

A consequence of Theorem 3 is that, for any fixed R ∈ Θ₁, our L^p GOF tests are consistent for all p ∈ [1,∞]. To glean additional insight on which values of p might be preferred in practice, we investigate the power associated with local alternatives. Starting in the lower left corner, Figure 4 depicts a sequence of ODCs in Θ₁ that approach Θ₀ (moving from lower left to upper right). Each ODC shown in Figure 4 belongs to a family of ODCs described in the supplementary article (Tang et al., 2016); the defining feature of this family is that it is indexed by a single parameter δ ∈ [0, 0.5]. The δ = 0 member, say R_(0), is the initial ODC in the lower left corner of Figure 4; the δ = 0.5 member R_(0.5), shown in the upper right, is the limiting ODC in Θ₀. ODCs R_(δ) with intermediate values of δ ∈ (0, 0.5) are also identified in Figure 4.

Fig. 4. Local power family of ODCs indexed by δ ∈ [0, 0.5]. The δ = 0 member R_(0) is the initial ODC in Θ₁; the δ = 0.5 member R_(0.5) is the limiting ODC in Θ₀. This family is described in the supplementary article (Tang et al., 2016).

In our testing problem, a local power analysis involves examining a sequence of ODCs {R^(r), r = 1, 2, …} in Θ₁ that converges to Θ₀ at different rates. We do so here by using the family of ODCs just described. Specifically, we consider the rates ζ_r ∈ {log r, r^{2/5}, r^{1/2}}. For each ζ_r, we first choose a sequence of constants δ^(r) such that lim_{r→∞} ζ_r|δ^(r) − 0.5| = c_{ζ_r} > 0 and then select members from our ODC family identified by R^(r) = R_(δ^(r)), for r = 1, 2, …. The resulting sequence R^(r) satisfies ||𝒟R^(r)||_p = ||ℳR^(r) − R^(r)||_p → 0 and $\zeta_{r} \|\mathcal{D}R^{(r)}\|_{p} \to c_{\zeta_{r},p}^{*} > 0$, both as r → ∞. This investigation allows us to learn more about the practical aspects of Theorem 4 (i.e., with both F and G unknown). We also use these ODC sequences, one for each rate ζ_r, to compare the one-sample versions of our tests with the test in Arcones and Samaniego (2000).
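One simple way to construct constants δ^(r) with the required limiting behavior is sketched below. The choice δ^(r) = 0.5 − c/ζ_r and the constant c = 1 are hypothetical; the text requires only that ζ_r|δ^(r) − 0.5| converge to a positive constant, and the sequences actually used may differ.

```python
import math

# The three rates zeta_r considered in the local power analysis.
rates = {
    "log r": lambda r: math.log(r),
    "r^(2/5)": lambda r: r ** 0.4,
    "r^(1/2)": lambda r: math.sqrt(r),
}

def delta_sequence(rate, r, c=1.0):
    """delta^(r) = 0.5 - c / zeta_r, so that zeta_r * |delta^(r) - 0.5| = c for every r."""
    delta = 0.5 - c / rate(r)
    if not 0.0 <= delta <= 0.5:
        raise ValueError("c is too large for this r: delta must lie in [0, 0.5]")
    return delta

r_values = [50, 100, 500, 1000, 5000, 10000]
deltas = {name: [delta_sequence(rate, r) for r in r_values]
          for name, rate in rates.items()}
```

Under this construction, the faster the rate ζ_r grows, the more quickly δ^(r) approaches 0.5 and hence the more quickly R^(r) approaches the limiting ODC in Θ₀.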

For each r ∈ {50, 100, 500, 1000, 5000, 10000}, we simulated 10,000 independent random samples, $X_{1}^{(r)}, X_{2}^{(r)}, \dots, X_{m}^{(r)}$ from F^(r) and $Y_{1}^{(r)}, Y_{2}^{(r)}, \dots, Y_{n}^{(r)}$ from G^(r), where F^(r)(u) = R^(r)(u) and G^(r)(u) = u, 0 ≤ u ≤ 1, and m = n = r. Figure 5 (top row) shows the estimated powers of our α = 0.05 tests associated with each rate: ζ_r = log r (left), ζ_r = r^{2/5} (middle), and ζ_r = r^{1/2} (right). Note that with m = n = r, considering the slower rates ζ_r = log r and ζ_r = r^{2/5} allows us to assess part (a) of Theorem 4, while the fastest rate ζ_r = r^{1/2} allows us to assess part (b). Both parts are supported by our empirical results in Figure 5. For the slower rates, the powers approach unity as expected; however, we find that there is no decisively preferred value of p among p ∈ {1, 2,∞}. On the other hand, when ζ_r = r^{1/2}, the p = 1 powers hover only slightly above 0.3 for all r, while the p = 2 and p = ∞ powers still approach unity.

Fig. 5. Local power results with α = 0.05. Left: ζ_r = log r. Middle: ζ_r = r^{2/5}. Right: ζ_r = r^{1/2}. Top: Two-sample case. Bottom: One-sample case. Our L^p results are shown dotted for p = 1, dashed for p = 2, and dot-dashed for p = ∞. Arcones and Samaniego (2000) results (one-sample case only) are shown using a solid line.

Switching to the one-sample problem, we find quite different results. For each rate ζ_r, Figure 5 (bottom row) displays the estimated powers of our one-sample α = 0.05 tests which use $M_{m}^{1}$, $M_{m}^{2}$, and $M_{m}^{\infty}$. Powers were estimated in the same way as for the two-sample case except now we treat G^(r)(u) = u as known and take m = r. In this setting, the sup-norm test consistently provides the largest power, followed by the p = 2 test and the p = 1 test. In addition, all three distance-based tests outperform the corresponding α = 0.05 Arcones and Samaniego (2000) test in terms of local power, especially at the fastest rate ζ_r = r^{1/2} where $\mathrm{pr}_{R^{(r)} \in \Theta_{1}}(D_{m} > c_{0.025}^{AS})$ appears to decrease towards zero.

5. Premature infant data

Caffeine is commonly used to treat newborn infants for apnea of prematurity (Schmidt et al., 2006) and to prevent the onset of respiratory distress syndrome, bronchopulmonary dysplasia, and extubation failure (Cox et al., 2015). Known as “the silver bullet” in the treatment of prematurely born infants at risk for these and other acute conditions (Aranda et al., 2010), caffeine is widely regarded within the neonatal care community to be safe and cost effective. It has also been approved by the United States Food and Drug Administration for use with preterm infants due to its history of providing beneficial outcomes with no long-term adverse side effects (Dobson and Hunt, 2013).

We now analyze the data from the study described in Section 1; for complete details, see Cox et al. (2015). Because assessing the use of caffeine with premature infants was a central focus of this study, we consider only those infants who were classified as “premature;” i.e., newborns whose gestational age was at or below 37 weeks. With F and G denoting the discharge time distributions for the caffeine and no-caffeine groups, respectively, recall that Figure 2 displays the sample ODC R_mn and its least star-shaped majorant ℳR_mn, calculated from samples of size m = 127 from F and n = 277 from G. As noted in Section 1, we performed the test in Davidov and Herman (2012) with these data and concluded that F ≤_S G was strongly supported over F = G. We also performed the GOF tests in Beare and Moon (2015) and concluded that F ≤_LR G would be rejected at α = 0.05; the L¹ and L² statistics based on the least concave majorant of R_mn are 0.717 and 0.999, respectively, which are larger than the corresponding 0.95 quantiles 0.664 and 0.753 identified by their least favorable distributions.

We therefore assess whether or not the data in Figure 2 are consistent with uniform stochastic ordering. Testing H₀ : F ≤_US G versus H₁ : F ≰ _US G based on the least star-shaped majorant of R_mn, our GOF test statistics are $M_{mn}^{1} = 0.170$, $M_{mn}^{2} = 0.263$, and $M_{mn}^{\infty} = 0.949$, each of which is well below the α = 0.10 critical values identified in the supplementary article (0.496, 0.586, and 1.219, respectively); that is, H₀ cannot be discounted at any reasonable level of significance. Therefore, not only does caffeine therapy provide point-of-care health benefits and improved long-term outcomes for prematurely born infants, our analysis suggests that treating these infants with caffeine may also lead to hospital discharge times that are uniformly stochastically smaller than those for infants not treated with caffeine.
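The decision rule applied here simply compares each statistic with its critical value. The short sketch below restates that comparison using the numbers reported above; the dictionary layout and labels are hypothetical conveniences.

```python
# Reported test statistics and alpha = 0.10 critical values for p = 1, 2, infinity.
statistics = {"p=1": 0.170, "p=2": 0.263, "p=inf": 0.949}
critical_values = {"p=1": 0.496, "p=2": 0.586, "p=inf": 1.219}

# H0 is rejected when the statistic is at least as large as its critical value.
decisions = {p: statistics[p] >= critical_values[p] for p in statistics}
for p, reject in decisions.items():
    print(p, "reject H0" if reject else "do not reject H0")
```

None of the three tests rejects H₀ at α = 0.10.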

6. Concluding remarks

When two distributions F and G satisfy uniform stochastic ordering, F and G when conditioned on the interval [t₀,∞), for any t₀ ∈ ℝ, also satisfy uniform stochastic ordering. This desirable property could be exploited to increase the power of our tests under H₁ and simultaneously reduce the sample sizes necessary to detect departures from H₀. To see how, suppose that uniform stochastic ordering is suspected to be violated when t > t₀, either from historical information or from observing data in related applications. In this situation, one could apply our tests after conditioning to determine if R is non-star-shaped over the smaller region [G(t₀), 1] and calculate sample sizes to detect departures over it instead of over [0, 1]. A similar approach was suggested by Carolan and Tebbs (2005) for detecting departures from likelihood ratio ordering. In the same spirit, Beare and Moon (2015) suggest that bootstrapping samples over departure regions could help to increase the power of GOF tests for likelihood ratio ordering. This strategy may also be fruitful in our setting, allowing one to reduce the conservatism arising from relying on the least favorable distribution over the entire unit interval.

We believe that our GOF tests could be generalized to allow for different types of censored data, but the theory underpinning these extensions would not be trivial. For example, with random right-censored data, there would be nothing to prevent one from simply replacing the empirical survival functions F̄_m and Ḡ_n with Kaplan-Meier estimators of F and G and then calculating R_mn and ℳR_mn using these estimates. However, asymptotic distributions of the corresponding test statistics may depend heavily on the latent censoring distributions, and there is no guarantee that the least favorable configuration of F and G will exist. Future work could investigate censored-data extensions of majorant-based inference, not only with uniform stochastic ordering, but with other orderings as well.

Finally, estimating distributions under a uniform stochastic ordering assumption has received considerable attention for two populations; see, e.g., Rojo and Samaniego (1993), Mukerjee (1996), and Arcones and Samaniego (2000). We view the one- and two-sample tests proposed herein as helpful inference procedures to determine if the uniform stochastic ordering assumption is plausible and hence restricted estimation methods for F and G are warranted. An anonymous referee has suggested that developing pointwise confidence intervals for R(u) under a uniform stochastic ordering constraint may be a worthwhile next step. We agree and comment on this further after Lemma 4 in the supplementary article (Tang et al., 2016). Another interesting avenue for future research would be to generalize our majorant-based tests to more than two populations. Estimation techniques in this setting are available in Dykstra et al. (1991) and El Barmi and Mukerjee (2016).

7. Proofs

In this section, we provide the proofs of Theorems 1–4. Lemmas cited in this section are stated and proved in the supplementary article (Tang et al., 2016), henceforth referred to as “the supplementary article.”

Proof of Theorem 1

We start with the asymptotic distribution of R_mn, suitably centered and scaled. Applying Theorem 2.2 in Hsieh and Turnbull (1996), it follows that c_mn(R_mn–R) converges weakly to $T_{R}^{λ}$ as min{m, n} → ∞ and n/(m + n) → λ ∈ (0, 1), where $T_{R}^{λ}$ satisfies $T_{R}^{λ} (u) = λ^{1 / 2} ℬ_{1} (R (u)) + {(1 - λ)}^{1 / 2} R^{'} (u) ℬ_{2} (u)$ , for 0 ≤ u ≤ 1, and ℬ₁ and ℬ₂ are independent standard Brownian bridges. When R ∈ Θ₀, 𝒟R = 0 and $M_{m n}^{p} = c_{m n} {‖ D R_{m n} - D R ‖}_{p}$ . Define the functional operator d𝒟_R : l([0, 1]) ↦ l([0, 1]) by

$$d\mathcal{D}_{R}h(u) = \begin{cases} \max\{h(1), 0\} - h(1), & \text{if } u = 1 \\ \mathcal{M}_{[a_{k},b_{k}]}^{(1,0)}h(u) - h(u), & \text{if } \exists\, k \text{ such that } a_{k} \leq u \leq b_{k} \\ 0, & \text{otherwise,} \end{cases}$$

for h ∈ l([0, 1]). Denote by C([0, 1]) the collection of all real continuous functions with domain [0, 1]. If 𝒟 is Hadamard directionally differentiable tangentially to C([0, 1]) at R, then d𝒟_R is the Hadamard directional derivative of 𝒟. Applying the functional delta method and continuous mapping theorem yields $M_{m n}^{p} \overset{d}{\to} {‖ d D_{R} T_{R}^{λ} ‖}_{p}$ for p ∈ [1,∞]. Those situations in which 𝒟 is Hadamard directionally differentiable are described in Lemma 5 in the supplementary article.

When 𝒟 is not Hadamard directionally differentiable, the functional delta method and continuous mapping theorem cannot be applied. However, by using Lemma 6 in the supplementary article, we are able to prove that $M_{m n}^{p} \overset{d}{\to} {‖ d D_{R} T_{R}^{λ} ‖}_{p}$ anyway. For convenience, let Z_mn = c_mn(R_mn – R) and $Z = T_{R}^{λ}$ . From Theorem 12.2 in Billingsley (1999) and Skorohod’s representation theorem (see, e.g., Theorem 6.7 in Billingsley, 1999), there exist random elements $Z_{m n}^{'}$ and Z′ defined on a common probability space with $Z_{m n}^{'} \overset{L}{=} Z_{m n}$ and $Z^{'} \overset{L}{=} Z$ such that ${‖ Z_{m n}^{'} - Z^{'} ‖}_{\infty} \to 0$ almost surely. The notation “ $\overset{L}{=}$ ” denotes that two processes are equivalent in distribution. Define $R_{m n}^{'} = c_{m n}^{- 1} Z_{m n}^{'} + R$ . From Lemma 6 in the supplementary article, because $c_{m n}^{- 1}$ decreases to 0 and ${‖ Z_{m n}^{'} - Z^{'} ‖}_{\infty} \to 0$ almost surely, then for all p ∈ [1,∞] we have

$$\lim_{\substack{m,n \to \infty \\ n/(m+n) \to \lambda}} c_{mn}\|\mathcal{D}R_{mn}' - \mathcal{D}R\|_{p} = \|d\mathcal{D}_{R}Z'\|_{p}$$

almost surely. Because $c_{mn}\|\mathcal{D}R_{mn}' - \mathcal{D}R\|_{p} \overset{d}{=} c_{mn}\|\mathcal{D}R_{mn} - \mathcal{D}R\|_{p}$ and also $\|d\mathcal{D}_{R}Z'\|_{p} \overset{d}{=} \|d\mathcal{D}_{R}T_{R}^{\lambda}\|_{p}$, where the notation “$\overset{d}{=}$” means equal in distribution, we have

$$\lim_{\substack{m,n \to \infty \\ n/(m+n) \to \lambda}} c_{mn}\|\mathcal{D}R_{mn} - \mathcal{D}R\|_{p} \overset{d}{=} \|d\mathcal{D}_{R}T_{R}^{\lambda}\|_{p}.$$

This shows that $M_{m n}^{p} \overset{d}{\to} {‖ d D_{R} T_{R}^{λ} ‖}_{p}$ for all p ∈ [1,∞].

When R is strictly star-shaped over [0, 1], it is easy to see that ${‖ d D_{R} T_{R}^{λ} ‖}_{p} = 0$ which quickly establishes part (a). The remainder of the proof focuses on establishing part (b). When R is non-strictly star-shaped,

$$\|d\mathcal{D}_{R}T_{R}^{\lambda}\|_{p} = \left[\sum_{k}\int_{a_{k}}^{b_{k}}\{\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}T_{R}^{\lambda}(u)\}^{p}\,du\right]^{1/p}$$

for p ∈ [1,∞) and ${‖ d D_{R} T_{R}^{λ} ‖}_{p} = {sup}_{k} {{sup}_{u \in [a_{k}, b_{k}]} D_{[a_{k}, b_{k}]}^{(1, 0)} T_{R}^{λ} (u)}$ for p = ∞. Using Lemma 1 in the supplementary article, we write $D_{[a_{k}, b_{k}]}^{(1, 0)} T_{R}^{λ} (u) = {sup}_{v \in [a_{k}, u]} Q_{k} (u, v)$ for u ∈ [a_k, b_k], where

Q_{k} (u, v) = (\frac{1 - u}{1 - v}) T_{R}^{λ} (v) - T_{R}^{λ} (u), for v \in [a_{k}, u] .

In Lemma 8 in the supplementary article, we show that the processes

$$\{Q_{k}(u,v),\ a_{k} \leq v \leq u < b_{k}\}$$

are mutually independent across k. Therefore, { $D_{[a_{k}, b_{k}]}^{(1, 0)} T_{R}^{λ} (u)$ , u ∈ [a_k, b_k]} are also mutually independent. To prove further results, we note that over each non-strictly star-shaped region [a_k, b_k], we can write R(u) as a linear function; i.e., R(u) = 1–R′(a_k)(1–u). Thus, from Lemma 2 in the supplementary article, we have

$$\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}T_{R}^{\lambda}(u) = \mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}\{\mathcal{W}_{R}^{\lambda}(u) - l_{R,k}^{\lambda}(1)\},$$

for all k, where $\mathcal{W}_{R}^{\lambda}(u) = \lambda^{1/2}\mathcal{W}_{1}(R(u)) + (1-\lambda)^{1/2}R'(u)\mathcal{W}_{2}(u)$,

$$l_{R,k}^{\lambda}(u) = \lambda^{1/2}\{1 - R'(a_{k})(1-u)\}\mathcal{W}_{1}(1) + (1-\lambda)^{1/2}R'(a_{k})\,u\,\mathcal{W}_{2}(1),$$

and 𝒲₁ and 𝒲₂ are independent standard Wiener processes; i.e., 𝒲_i satisfies ℬ_i(u) = 𝒲_i(u) − u𝒲_i(1), 0 ≤ u ≤ 1, for i = 1, 2. Based on the properties of a standard Wiener process, it follows that for u ∈ [a_k, b_k],

$$\begin{aligned} \mathcal{W}_{i}(R(u)) - \mathcal{W}_{i}(1) &= \mathcal{W}_{i}(1 - R'(a_{k})(1-u)) - \mathcal{W}_{i}(1) \\ &\overset{L}{=} R'(a_{k})^{1/2}\{\mathcal{W}_{i}(u) - \mathcal{W}_{i}(1)\}, \end{aligned}$$

for i = 1, 2. Furthermore, for u ∈ [a_k, b_k], we have R′(u) = R′(a_k) and

$$\begin{aligned} \mathcal{W}_{R}^{\lambda}(u) - l_{R,k}^{\lambda}(1) &\overset{L}{=} \lambda^{1/2}R'(a_{k})^{1/2}\{\mathcal{W}_{1}(u) - \mathcal{W}_{1}(1)\} + (1-\lambda)^{1/2}R'(a_{k})\{\mathcal{W}_{2}(u) - \mathcal{W}_{2}(1)\} \\ &\overset{L}{=} \{\lambda R'(a_{k}) + (1-\lambda)R'(a_{k})^{2}\}^{1/2}\{\mathcal{W}(u) - \mathcal{W}(1)\}, \end{aligned}$$

where 𝒲 is a standard Wiener process. The last equivalence (in distribution) follows because both right-hand side processes above are Gaussian, they have the same mean $E\{\mathcal{W}_{R}^{\lambda}(u) - l_{R,k}^{\lambda}(1)\} = 0$, for u ∈ [a_k, b_k], and they have the same covariance $\mathrm{cov}\{\mathcal{W}_{R}^{\lambda}(u_{1}) - l_{R,k}^{\lambda}(1), \mathcal{W}_{R}^{\lambda}(u_{2}) - l_{R,k}^{\lambda}(1)\} = \{\lambda R'(a_{k}) + (1-\lambda)R'(a_{k})^{2}\}\min\{1-u_{1}, 1-u_{2}\}$, for u₁, u₂ ∈ [a_k, b_k]. Using Lemma 2 in the supplementary article again, we have

$$\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}\{\lambda R'(a_{k}) + (1-\lambda)R'(a_{k})^{2}\}^{1/2}\{\mathcal{W}(u) - \mathcal{W}(1)\} = \{\lambda R'(a_{k}) + (1-\lambda)R'(a_{k})^{2}\}^{1/2}\,\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}\mathcal{B}(u),$$

where ℬ is a standard Brownian bridge formed by 𝒲; i.e., ℬ(u) = 𝒲(u) − u𝒲(1), for u ∈ [0, 1]. We can therefore write

$$\int_{a_{k}}^{b_{k}}\{\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}T_{R}^{\lambda}(u)\}^{p}\,du \overset{d}{=} \{\lambda R'(a_{k}) + (1-\lambda)R'(a_{k})^{2}\}^{p/2}\int_{a_{k}}^{b_{k}}\{\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}\mathcal{B}(u)\}^{p}\,du,$$

for p ∈ [1,∞), and

$$\sup_{u \in [a_{k},b_{k}]}\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}T_{R}^{\lambda}(u) \overset{d}{=} \{\lambda R'(a_{k}) + (1-\lambda)R'(a_{k})^{2}\}^{1/2}\sup_{u \in [a_{k},b_{k}]}\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}\mathcal{B}(u),$$

for p = ∞. For p ∈ [1,∞), we have shown that $\int_{a_{k}}^{b_{k}}\{\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}T_{R}^{\lambda}(u)\}^{p}\,du$ are mutually independent. One can show that $\int_{a_{k}}^{b_{k}}\{\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}\mathcal{B}(u)\}^{p}\,du$ are also mutually independent by replacing $T_{R}^{\lambda}(\cdot)$ with ℬ(·) in the definition of Q_k(u, v) and repeating the same argument. Therefore, we have

$$\sum_{k}\int_{a_{k}}^{b_{k}}\{\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}T_{R}^{\lambda}(u)\}^{p}\,du \overset{d}{=} \sum_{k}\{\lambda R'(a_{k}) + (1-\lambda)R'(a_{k})^{2}\}^{p/2}\int_{a_{k}}^{b_{k}}\{\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}\mathcal{B}(u)\}^{p}\,du,$$

which completes the proof for p ∈ [1,∞). Completing the proof for the p = ∞ case is analogous.
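As a supplementary check on the covariance identity used in this proof, the calculation can be written out directly. The steps below are our own worked sketch; they assume only that 𝒲₁ and 𝒲₂ are independent standard Wiener processes, that R(u) = 1 − R′(a_k)(1 − u) with R′(a_k) ≥ 0 on [a_k, b_k], and that the centered process decomposes as λ^{1/2}{𝒲₁(R(u)) − 𝒲₁(1)} + (1 − λ)^{1/2}R′(a_k){𝒲₂(u) − 𝒲₂(1)} there. For s, t ∈ [0, 1] and u₁, u₂ ∈ [a_k, b_k]:

```latex
\begin{aligned}
\mathrm{cov}\{\mathcal{W}_i(s) - \mathcal{W}_i(1),\ \mathcal{W}_i(t) - \mathcal{W}_i(1)\}
  &= \min\{s,t\} - s - t + 1 = \min\{1-s,\ 1-t\}, \\
\mathrm{cov}\{\mathcal{W}_1(R(u_1)) - \mathcal{W}_1(1),\ \mathcal{W}_1(R(u_2)) - \mathcal{W}_1(1)\}
  &= \min\{1 - R(u_1),\ 1 - R(u_2)\} = R'(a_k)\min\{1-u_1,\ 1-u_2\}, \\
\mathrm{cov}\{\mathcal{W}_R^{\lambda}(u_1) - l_{R,k}^{\lambda}(1),\ \mathcal{W}_R^{\lambda}(u_2) - l_{R,k}^{\lambda}(1)\}
  &= \{\lambda R'(a_k) + (1-\lambda)R'(a_k)^2\}\min\{1-u_1,\ 1-u_2\}.
\end{aligned}
```

The last line follows from the first two by independence of 𝒲₁ and 𝒲₂, and it matches the covariance of {λR′(a_k) + (1 − λ)R′(a_k)²}^{1/2}{𝒲(u) − 𝒲(1)}.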

Proof of Theorem 2

When F = G, the ODC is R₀ = R₀(u) = u, 0 ≤ u ≤ 1, and $T_{R_{0}}^{λ} \overset{d}{=} ℬ$ . Because R₀ is non-strictly star-shaped over [0, 1], Theorem 1 yields $M_{m n}^{p} \overset{d}{\to} {‖ d D_{R_{0}} T_{R_{0}}^{λ} ‖}_{p} \overset{d}{=} {‖ D_{[0, 1]}^{(1, 0)} ℬ ‖}_{p}$ when F = G for p ∈ [1,∞]. It therefore suffices to show ${‖ D_{[0, 1]}^{(1, 0)} ℬ ‖}_{p} \geq_{S} {‖ d D_{R} T_{R}^{λ} ‖}_{p}$ for p ∈ [1,∞] and for any other R ∈ Θ₀. If R ∈ Θ₀ is strictly star-shaped, then from Theorem 1, ${‖ d D_{R} T_{R}^{λ} ‖}_{p} = 0$ for p ∈ [1,∞] and hence ${‖ D_{[0, 1]}^{(1, 0)} ℬ ‖}_{p} \geq_{S} {‖ d D_{R} T_{R}^{λ} ‖}_{p}$ . If R ∈ Θ₀ is non-strictly star-shaped, then for p ∈ [1,∞),

$$\begin{aligned} \|\mathcal{D}_{[0,1]}^{(1,0)}\mathcal{B}\|_{p} = \left[\int_{0}^{1}\{\mathcal{D}_{[0,1]}^{(1,0)}\mathcal{B}(u)\}^{p}\,du\right]^{1/p} &\geq \left[\sum_{k}\int_{a_{k}}^{b_{k}}\{\mathcal{D}_{[0,1]}^{(1,0)}\mathcal{B}(u)\}^{p}\,du\right]^{1/p} \\ &\geq \left[\sum_{k}\int_{a_{k}}^{b_{k}}\{\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}\mathcal{B}(u)\}^{p}\,du\right]^{1/p}. \end{aligned} \tag{7.1}$$

The first and second inequalities above hold because $D_{[0, 1]}^{(1, 0)} ℬ (u) \geq 0$ and also $D_{[0, 1]}^{(1, 0)} ℬ (u) \geq D_{[a_{k}, b_{k}]}^{(1, 0)} ℬ (u) \geq 0$ , for all u ∈ [0, 1]. Because λ ∈ (0, 1) and R′(a_k) ≤ 1 for all k, λ R′(a_k) + (1–λ)R′(a_k)² ≤ 1 and the rightmost side of (7.1) is greater than or equal to

$$\left[\sum_{k}\{\lambda R'(a_{k}) + (1-\lambda)R'(a_{k})^{2}\}^{p/2}\int_{a_{k}}^{b_{k}}\{\mathcal{D}_{[a_{k},b_{k}]}^{(1,0)}\mathcal{B}(u)\}^{p}\,du\right]^{1/p} \overset{d}{=} \|d\mathcal{D}_{R}T_{R}^{\lambda}\|_{p}.$$

Therefore, for R ∈ Θ₀ non-strictly star-shaped, we have ${‖ D_{[0, 1]}^{(1, 0)} ℬ ‖}_{p} \geq_{S} {‖ d D_{R} T_{R}^{λ} ‖}_{p}$ for p ∈ [1,∞). Showing ${‖ D_{[0, 1]}^{(1, 0)} ℬ ‖}_{\infty} \geq_{S} {‖ d D_{R} T_{R}^{λ} ‖}_{\infty}$ for R ∈ Θ₀ non-strictly star-shaped is analogous.
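Because the least favorable limit is the L^p norm of the (1, 0) operator applied to a Brownian bridge on [0, 1], its quantiles can be approximated by simulation. The sketch below is a rough Monte Carlo illustration and not the procedure used to produce the tabulated critical values: the function names, grid size, number of paths, and seed are hypothetical, and the operator is evaluated on a grid through the representation sup_{v ≤ u} {(1 − u)/(1 − v)}h(v) − h(u) from the proof of Theorem 1 with [a_k, b_k] = [0, 1].

```python
import numpy as np

def brownian_bridge_paths(n_paths, n_steps, rng):
    """Simulate standard Brownian bridges on the grid u_j = j / n_steps."""
    increments = rng.normal(scale=np.sqrt(1.0 / n_steps), size=(n_paths, n_steps))
    W = np.concatenate([np.zeros((n_paths, 1)), np.cumsum(increments, axis=1)], axis=1)
    u = np.linspace(0.0, 1.0, n_steps + 1)
    return u, W - u * W[:, [-1]]            # B(u) = W(u) - u W(1)

def star_majorant_gap(u, h):
    """sup_{v <= u} {(1 - u) / (1 - v)} h(v) - h(u) on the grid, excluding u = 1."""
    u_in, h_in = u[:-1], h[:, :-1]
    run_max = np.maximum.accumulate(h_in / (1.0 - u_in), axis=1)
    return (1.0 - u_in) * run_max - h_in

rng = np.random.default_rng(1535)
u, B = brownian_bridge_paths(n_paths=2000, n_steps=1000, rng=rng)
gap = star_majorant_gap(u, B)
norms = {
    "p=1": gap.mean(axis=1),                      # Riemann approximation to the L^1 norm
    "p=2": np.sqrt((gap ** 2).mean(axis=1)),      # L^2 norm
    "p=inf": gap.max(axis=1),                     # sup-norm
}
alpha = 0.10
critical = {p: float(np.quantile(v, 1.0 - alpha)) for p, v in norms.items()}
```

Discretization and Monte Carlo error mean that values obtained this way will only roughly track the critical values tabulated in the supplementary article; finer grids and more paths reduce both sources of error.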

Proof of Theorem 3

When R ∈ Θ₁, we redefine the functional operator d𝒟_R : l([0, 1]) ↦ l([0, 1]) in Theorem 1 by

$$d\mathcal{D}_{R}h(u) = \begin{cases} -h(1), & \text{if } u = 1,\ R(u) < 1 \\ \max\{h(1), 0\} - h(1), & \text{if } u = 1,\ R(u) = 1 \\ \mathcal{L}_{S_{k}}h(u) - h(u), & \text{if } \exists\, k \text{ such that } u \in S_{k} \setminus \{1\} \\ 0, & \text{otherwise.} \end{cases}$$

The proof proceeds in the same manner as in Theorem 1. If 𝒟 is not Hadamard directionally differentiable, one can use Skorohod’s representation theorem and part (b) of Lemma 7 in the supplementary article to obtain the result.

Proof of Theorem 4

For convenience, all limits stated in this proof assume that max{m, n} = O(r) and n/(m + n) → λ ∈ (0, 1), as r → ∞. We have independent random samples $X_{1}^{(r)}, X_{2}^{(r)}, \dots, X_{m}^{(r)}$ and $Y_{1}^{(r)}, Y_{2}^{(r)}, \dots, Y_{n}^{(r)}$ from F⁽^r⁾ and G⁽^r⁾, respectively. The sample ODC is $R_{m n}^{(r)} = F_{m}^{(r)} {(G_{n}^{(r)})}^{- 1}$ , where $F_{m}^{(r)}$ and ${(G_{n}^{(r)})}^{- 1}$ are the empirical distribution and empirical quantile functions, respectively. Our test statistic is $M_{m n}^{p} = c_{m n} {‖ D R_{m n}^{(r)} ‖}_{p}$ . By the triangle inequality,

$$\mathrm{pr}_{R^{(r)} \in \Theta_{1}}(M_{mn}^{p} \geq c_{\alpha,p}) \geq \mathrm{pr}_{R^{(r)} \in \Theta_{1}}(c_{mn}\|\mathcal{D}R_{mn}^{(r)} - \mathcal{D}R^{(r)}\|_{p} < c_{mn}\|\mathcal{D}R^{(r)}\|_{p} - c_{\alpha,p})$$

for all p ∈ [1,∞]. Therefore, to prove part (a), it suffices to show that $c_{m n} {‖ D R_{m n}^{(r)} - D R^{(r)} ‖}_{p} = O_{P} (1)$ .

From Lemma 3 in the supplementary article, it follows that ${‖ ℳ R_{m n}^{(r)} - ℳ R^{(r)} ‖}_{\infty} \leq {‖ R_{m n}^{(r)} - R^{(r)} ‖}_{\infty}$ , which implies ${‖ D R_{m n}^{(r)} - D R^{(r)} ‖}_{\infty} \leq 2 {‖ R_{m n}^{(r)} - R^{(r)} ‖}_{\infty}$ . Because L^p norms are dominated by the sup-norm, it therefore suffices to show $c_{m n} {‖ R_{m n}^{(r)} - R^{(r)} ‖}_{\infty}$ is O_P (1). To accomplish this, we decompose $c_{m n} (R_{m n}^{(r)} - R^{(r)})$ into two parts:

$$c_{mn}(R_{mn}^{(r)} - R^{(r)}) = c_{mn}\{F_{m}^{(r)}(G_{n}^{(r)})^{-1} - F^{(r)}(G_{n}^{(r)})^{-1}\} + c_{mn}\{F^{(r)}(G_{n}^{(r)})^{-1} - F^{(r)}(G^{(r)})^{-1}\}. \tag{7.2}$$

Define the two independent empirical processes

$$U_{m}(u) = \frac{1}{m}\sum_{i=1}^{m} I\{F^{(r)}(X_{i}^{(r)}) \leq u\}$$

and $V_{n}(u) = n^{-1}\sum_{i=1}^{n} I\{G^{(r)}(Y_{i}^{(r)}) \leq u\}$, for 0 ≤ u ≤ 1. This allows us to rewrite $F_{m}^{(r)}$ as $U_{m}F^{(r)}$ and $F^{(r)}(G_{n}^{(r)})^{-1}$ as $R^{(r)}V_{n}$. Consequently, the two terms on the right-hand side of Equation (7.2) can be written as

$$c_{mn}\{F_{m}^{(r)}(G_{n}^{(r)})^{-1} - F^{(r)}(G_{n}^{(r)})^{-1}\} = c_{mn}\left[U_{m}\{F^{(r)}(G_{n}^{(r)})^{-1}\} - U\{F^{(r)}(G_{n}^{(r)})^{-1}\}\right] \tag{7.3}$$

and

$$c_{mn}\{F^{(r)}(G_{n}^{(r)})^{-1} - F^{(r)}(G^{(r)})^{-1}\} = c_{mn}(R^{(r)}V_{n} - R^{(r)}V), \tag{7.4}$$

where U(·) and V (·) both represent the cumulative distribution function of a uniform distribution on [0, 1]. These expressions allow us to unify all random samples (from different distributions) to be uniformly distributed.

We are now ready to show that the sup-norms of the right-hand sides of Equations (7.3) and (7.4) are uniformly bounded in probability. We begin with the uniform processes. From Theorem 3 in Komlós et al. (1975), there exist versions of independent standard Brownian bridges $ℬ_{1}^{(m)}$ and $ℬ_{2}^{(n)}$ such that, almost surely,

$$\|\sqrt{m}(U_{m} - U) - \mathcal{B}_{1}^{(m)}\|_{\infty} = o(m^{-1/2}(\log m)^{2}) \tag{7.5}$$

$$\|\sqrt{n}(V_{n} - V) - \mathcal{B}_{2}^{(n)}\|_{\infty} = o(n^{-1/2}(\log n)^{2}). \tag{7.6}$$

Because $lim c_{m n} / (λ^{1 / 2} \sqrt{m}) = 1$ , we have ${‖ c_{m n} (U_{m} - U) - λ^{1 / 2} ℬ_{1}^{(m)} ‖}_{\infty} = o (m^{- 1 / 2} {(log m)}^{2})$ from Equation (7.5). Consequently, the sup-norm of the right-hand side of Equation (7.3) is less than or equal to

$$\left\|c_{mn}\left[U_{m}\{F^{(r)}(G_{n}^{(r)})^{-1}\} - U\{F^{(r)}(G_{n}^{(r)})^{-1}\}\right] - \lambda^{1/2}\mathcal{B}_{1}^{(m)}\{F^{(r)}(G_{n}^{(r)})^{-1}\}\right\|_{\infty} + \left\|\lambda^{1/2}\mathcal{B}_{1}^{(m)}\{F^{(r)}(G_{n}^{(r)})^{-1}\}\right\|_{\infty}$$

which is less than or equal to

$$\|c_{mn}(U_{m} - U) - \lambda^{1/2}\mathcal{B}_{1}^{(m)}\|_{\infty} + \|\lambda^{1/2}\mathcal{B}_{1}^{(m)}\|_{\infty} = o(m^{-1/2}(\log m)^{2}) + O_{P}(1).$$

The O_P(1) term arises because $\mathcal{B}_{1}^{(m)}$ is bounded with probability 1. Likewise, the $o(m^{-1/2}(\log m)^{2})$ term comes from Equation (7.5). Therefore, we have shown that the sup-norm of the right-hand side of Equation (7.3) is bounded in probability; that is, $\|c_{mn}\{F_{m}^{(r)}(G_{n}^{(r)})^{-1} - F^{(r)}(G_{n}^{(r)})^{-1}\}\|_{\infty} = O_{P}(1)$.

For the right-hand side of Equation (7.4), we use the mean value theorem to write

$$R^{(r)}V_{n}(u) - R^{(r)}V(u) = \dot{R}^{(r)}(\tau_{u})\{V_{n}(u) - V(u)\},$$

where $\dot{R}^{(r)}$ denotes the derivative of R^(r) and where τ_u is between V_n(u) and V(u). Therefore,

$$\sup_{u \in [0,1]}\left|\sqrt{n}\{R^{(r)}V_{n}(u) - R^{(r)}V(u)\} - \dot{R}^{(r)}(\tau_{u})\mathcal{B}_{2}^{(n)}(u)\right| = \sup_{u \in [0,1]}\left|\dot{R}^{(r)}(\tau_{u})\left[\sqrt{n}\{V_{n}(u) - V(u)\} - \mathcal{B}_{2}^{(n)}(u)\right]\right|$$

which is less than or equal to

$$\|\dot{R}^{(r)}\|_{\infty}\,\|\sqrt{n}(V_{n} - V) - \mathcal{B}_{2}^{(n)}\|_{\infty} = O(1)\,o(n^{-1/2}(\log n)^{2}) = o(n^{-1/2}(\log n)^{2}).$$

The O(1) term above comes from the assumption that the derivative of R^(r) is uniformly bounded for all r over [0, 1]. Likewise, the $o(n^{-1/2}(\log n)^{2})$ term comes from Equation (7.6). Therefore, because $\lim c_{mn}/\{(1-\lambda)^{1/2}\sqrt{n}\} = 1$ and because $\mathcal{B}_{2}^{(n)}$ is bounded with probability 1, we have shown that the sup-norm of the right-hand side of Equation (7.4) is bounded in probability; that is, $c_{mn}\|R^{(r)}V_{n} - R^{(r)}V\|_{\infty} = O_{P}(1)$. Finally, from Equation (7.2), we have $c_{mn}\|R_{mn}^{(r)} - R^{(r)}\|_{\infty} = O_{P}(1) + O_{P}(1) = O_{P}(1)$, which establishes part (a).

To prove part (b), let $q_{β, p}^{(r)}$ denote the 1 – β quantile of the finite-sample distribution of $c_{m n} {‖ D R_{m n}^{(r)} - D R^{(r)} ‖}_{p}$ ; i.e., $q_{β, p}^{(r)}$ solves ${pr}_{R^{(r)} \in Θ_{1}} (c_{m n} {‖ D R_{m n}^{(r)} - D R^{(r)} ‖}_{p} \leq q_{β, p}^{(r)}) = 1 - β$ . We have already shown $c_{m n} {‖ D R_{m n}^{(r)} - D R^{(r)} ‖}_{p} = O_{P} (1)$ , so ${sup}_{r} q_{β, p}^{(r)} \equiv q_{β, p} < \infty$ . Therefore,

$$\liminf\ \mathrm{pr}_{R^{(r)} \in \Theta_{1}}(c_{mn}\|\mathcal{D}R_{mn}^{(r)} - \mathcal{D}R^{(r)}\|_{p} < q_{\beta,p}) \geq 1 - \beta.$$

Set η_p(β) = q_{β,p} + c_{α,p}. Whenever lim inf c_mn||𝒟R^(r)||_p ≥ η_p(β), it follows from the triangle inequality that $\liminf\ \mathrm{pr}_{R^{(r)} \in \Theta_{1}}(M_{mn}^{p} \geq c_{\alpha,p})$ is greater than or equal to

$$\liminf\ \mathrm{pr}_{R^{(r)} \in \Theta_{1}}(c_{mn}\|\mathcal{D}R_{mn}^{(r)} - \mathcal{D}R^{(r)}\|_{p} < c_{mn}\|\mathcal{D}R^{(r)}\|_{p} - c_{\alpha,p}) \geq \liminf\ \mathrm{pr}_{R^{(r)} \in \Theta_{1}}(c_{mn}\|\mathcal{D}R_{mn}^{(r)} - \mathcal{D}R^{(r)}\|_{p} < q_{\beta,p}) \geq 1 - \beta.$$

This completes the proof of part (b).

Acknowledgments

This work was supported by Grant R01 AI121351 from the National Institutes of Health.

We thank the Associate Editor and two anonymous referees for their helpful comments on an earlier version of this article. We also thank Dr. Christina Cox for her collaboration on the premature infant study.

SUPPLEMENTARY MATERIAL

Supplement to “Nonparametric goodness-of-fit tests for uniform stochastic ordering.” In the supplementary article (Tang et al., 2016), we state and prove lemmas that are cited in this manuscript. These lemmas describe theoretical properties of the least star-shaped majorant operator, including Hadamard directional differentiability. We also provide the estimated densities of $\|\mathcal{D}_{[0,1]}^{(1,0)}\mathcal{B}\|_{p}$ and critical values c_{α,p} for our tests. Finally, we describe the families of ODCs used in Section 4 and give finite-sample simulation results and sample size calculations.

References

Aranda J, Beharry K, Valencia G, Natarajan G, Davis J. Caffeine impact on neonatal morbidities. Journal of Maternal-Fetal and Neonatal Medicine. 2010;23:20–23. doi: 10.3109/14767058.2010.517704.
Arcones M, Samaniego F. On the asymptotic distribution theory of a class of consistent estimators of a distribution satisfying a uniform stochastic ordering constraint. Annals of Statistics. 2000;28:116–150.
Bamber D. The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. Journal of Mathematical Psychology. 1975;12:387–415.
Beare B, Moon J. Nonparametric tests of density ratio ordering. Econometric Theory. 2015;31:471–492.
Billingsley P. Convergence of Probability Measures. Wiley; New York: 1999.
Carolan C, Tebbs J. Nonparametric tests for and against likelihood ratio ordering in the two-sample problem. Biometrika. 2005;92:159–171.
Cox C, Hashem N, Tebbs J, Bookstaver B, Iskersky V. Evaluation of caffeine and the development of necrotizing enterocolitis. Journal of Neonatal-Perinatal Medicine. 2015;8:339–347. doi: 10.3233/NPM-15814059.
Dardanoni V, Forcina A. A unified approach to likelihood inference on stochastic orderings in a nonparametric context. Journal of the American Statistical Association. 1998;93:1112–1123.
Davidov O, Herman A. Ordinal dominance curve based inference for stochastically ordered distributions. Journal of the Royal Statistical Society, Series B. 2012;74:825–847.
Dobson N, Hunt C. Caffeine use in neonates: Indications, pharmacokinetics, clinical effects, outcomes. NeoReviews. 2013;14:540–550.
Dykstra R, Kochar S, Robertson T. Statistical inference for uniform stochastic ordering in several populations. Annals of Statistics. 1991;19:870–888.
El Barmi H, McKeague I. Testing for uniform stochastic ordering via empirical likelihood. Annals of the Institute of Statistical Mathematics. 2016;68:955–976.
El Barmi H, Mukerjee H. Consistent estimation of survival functions under uniform stochastic ordering: The k-sample case. Journal of Multivariate Analysis. 2016;144:99–109.
Hsieh F, Turnbull B. Nonparametric estimation of the receiver operating characteristic curve. Annals of Statistics. 1996;24:25–40.
Komlós J, Major P, Tusnády G. An approximation of partial sums of independent RV’s and the sample DF. I. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete. 1975;32:111–131.
Lehmann E, Rojo J. Invariant directional orderings. Annals of Statistics. 1992;20:2100–2110.
Mukerjee H. Estimation of survival functions under uniform stochastic ordering. Journal of the American Statistical Association. 1996;91:1684–1689.
Park C, Lee C, Robertson T. Goodness-of-fit test for uniform stochastic ordering among several populations. Canadian Journal of Statistics. 1998;26:69–81.
Rojo J, Samaniego F. On estimating a survival curve subject to a uniform stochastic ordering constraint. Journal of the American Statistical Association. 1993;88:566–572.
Schmidt B, Roberts R, Davis P, Doyle L, Barrington K, Ohlsson A, Solimano A, Tin W. Caffeine therapy for apnea of prematurity. New England Journal of Medicine. 2006;354:2112–2121. doi: 10.1056/NEJMoa054065.
Shaked M, Shanthikumar J. Stochastic Orders. Springer-Verlag; New York: 2007.
Shapiro A. On concepts of directional differentiability. Journal of Optimization Theory and Applications. 1990;66:477–487.
Shapiro A. Asymptotic analysis of stochastic programs. Annals of Operations Research. 1991;30:169–186.
Tang C, Wang D, Tebbs J. Supplement to “Nonparametric goodness-of-fit tests for uniform stochastic ordering”. 2016. doi: 10.1214/16-AOS1535.
van der Vaart A, Wellner J. Weak Convergence and Empirical Processes. Springer-Verlag; New York: 1996.

Associated Data


Supplementary Materials

Web Supplement

NIHMS853643-supplement-Web_Supplement.pdf (646 KB, PDF)

