Abstract
The exact two-sided likelihood ratio test for testing the equality of two exponential means is proposed and proved to be the uniformly most powerful unbiased test. This exact test has advantages over two alternative approaches in that it is unbiased and more powerful while maintaining the type I error. The use of the proposed test is demonstrated in a non-small cell lung cancer clinical trial design.
Some Keywords: Survival analysis, exponential family, uniformly most powerful unbiased test, power calculation
1 Introduction
The exponential distribution has been shown to have high inference efficiency in survival analysis (Miller (1983) and Meier et al. (2004)). The problem of testing the equality of two exponential means is commonly seen in practice. For example, Bui et al. (2011) compared the survival distributions of sarcoma before and after metastasis as well as for patients with low and high biomarker Cx43 scores, where the survival distribution is believed to be exponential.
There have been existing approaches for comparing two exponential distributions. For example, Lee (1992) stated an asymptotic test. An exact F-test, which was originally given in Cox (1953), has been widely employed by statistical software packages including both noncommercial packages such as STPLAN and commercial packages such as Power Analysis and Sample Size (PASS). These two widely used approaches, however, suffer their weaknesses. As shown in Section 4, the ALRT is an asymptotic test requiring a relatively large sample size to maintain type I error. The F-test is a biased test and with the bias, its power is typically low, which may cause misleading interpretation of the data.
In this paper, we propose a uniformly most powerful unbiased (UMPU) test that has advantages over the existing tests. We discuss a likelihood ratio test in Section 2, and prove it is the UMPU test in Section 3. In Section 4, we compare the proposed test with the two existing tests. Concluding remarks are given in Section 5. Proofs of a remark, two lemmas, and a theorem are provided in the appendix.
2 Testing the Equivalence of Two Exponential Distributions
Suppose we have two groups of observations following exponential distributions. In group 1, we let {t1,i}i=1, …, n1 and {c1,i}i=1, …,, n1 denote the event times and the censoring indicator, respectively, where n1 is the number of observations, c1,i = 1 if the ith observation is a event, and c1,i = 0 if censored. Similarly, we let {t2,i}i=1, …, n2 and {c2,i}i=1, …, n2 denote the event/cesoring times and the censoring status in group 2. We let d1 and d2 denote the numbers of events in groups 1 and 2, respectively. We let {t1,1 ≤ t1,2 ≤ ··· ≤ t1,d1} denote the event times in group 1, and the corresponding c1,1 = c1,2 = ··· = c1,d1 = 1 the indicators of the d1 events. Similarly, {t2,1 ≤ t2,2 ≤, …, ≤ t2,d2} are the event times in group 2, and c2,1 = c2,2 = ··· = c2,d2 = 1 are indicators of d2 events. We assume that the survival times in group 1 and 2 follow exponential distributions with means λ1 and λ2, respectively, and that the censoring mechanisms in two groups are both type II. For m ∈ {1, 2}, the maximum likelihood estimates (MLE) of λm is
| (1) |
where xm is the total time on test (TTOT) between time 0 and time tm,dm
The α level confidence interval of λm is
where is the lower quantile at probability α/2 of the central chi-square distribution with 2dm degrees of freedom (Epstein and Sobel 1954). As discussed in Kalbfleisch and Prentice (1980), page 41, if the survival distribution is exponential in group 1, then x1 is a realization from Gamma(d1, λ1), the gamma distribution with parameters (d1, λ1) and having mean d1λ1 and variance . Similarly, x2 is a realization from Gamma(d2, λ2) if the survival distribution is exponential in group 2. We let X1 and X2 denote two random variables with their distributions X1 ~ Gamma(d1, λ1) and X2 ~ Gamma(d2, λ2).
The null and alternative hypotheses for testing the equivalence between λ1 and λ2 are
| (2) |
Under the null hypothesis, we let λ = λ1 = λ2 denote the model parameter. We derive the likelihood ratio test (LRT) in Remark 1.
Remark 1
The likelihood ratio test statistic for testing hypothesis (2) can be written as
The level α likelihood ratio test rejects H0 if φ (x1, x2) ≤ Cα, where Cα is a real value having P(φ(X1, X2) ≤ Cα|H0) = α for all α ∈ (0, 1).
Next we quantify the p-value of this LRT. Under H0, Y = X1/(X1 + X2) has a beta distribution, Beta(d1,d2), with the probability density function (p.d.f.)
| (3) |
for all 0 < y < 1. Given x1 and x2, the p-value of the LRT is P (Y (1−Y) ≤ φ(x1, x2)|H0). Because the function yd1(1−y)d2 ∞ fβ(y|d1+1, d2+1) is strictly increasing for y ∈ [0, d1/(d1+d2)] and decreasing for y ∈ [d1/(d1+d2), 1], there must exist two real numbers A1 ∈ [0, d1/(d1+d2)] and A2 ∈ [d1/(d1 + d2), 1] satisfying
| (4) |
where 0 ≤ A1 ≤ d1/(d1 + d2) ≤ A2 ≤ 1, and
| (5) |
The p-value of the LRT can be computed as
| (6) |
We illustrate this LRT for d1 = d2 = 1. In this case, the test statistic is φ(x1, x2) = x1x2/(x1 + x2)2 = y(1 − y), where y = x1/(x1 + x2). So φ(x1, x2) ≤ Cα iff
Because Y = X1/(X1+X2) has a uniform distribution on [0, 1], leads to Cα = 0.25 × (2α − α2). If α = 0.05, the LRT rejects H0 if x1/(x1 + x2) ≤ 1/40 or x1/(x1 + x2) ≥ 39/40.
There exist A1 and A2 to achieve any level α ∈ (0, 1) because the function yd1(1 − y)d2 is continuous and monotone for y ∈ [0, d1/(d1+d2)] and y ∈ [d1/(d1+d2), 1], and the cumulative density function (c.d.f.) of Beta(d1, d2) is continuous as well. To compute the p-value of the test (2) given x1 and x2, we search (A1, A2) numerically for arbitrary positive integers d1 and d2 with (5) and (6). Based on (3)–(5), x1/(x1 + x2) = A1 if x1/(x1 + x2) ≤ d1/(d1 + d2), and x1/(x1 + x2) = A2 if x1/(x1 + x2) ≥ d1/(d1 + d2). We implement the bisection search algorithm to determine A2 ∈ [d1/(d1 + d2)] (or A1 ∈ [0, d1/(d1 + d2)]). The exact p-value in (6) can be computed using a beta distribution table or a beta c.d.f. in software packages such as MATLAB (function “betacdf”) and SAS (function “cdf”). Similarly, given a level α, the critical values A1 and A2 can be computed using (5) and the equation
| (7) |
3 UMPU Property of the Likelihood Ratio Test
Lehmann (1986), page 188–192, provided a general form of the UMPU test for the exponential family. For a two parameter model having parameters (θ1, θ2), the level α UMPU test for
| (8) |
is to reject if V (x) ≤ C1 or V (x) ≥ C2 if the following five conditions hold:
-
The likelihood function can be written as
(9) where C(θ1, θ2) is a function of (θ1, θ2) and H(x), T(x), and U(x) are functions of x;
T(x) is sufficient for θ1 and U(x) is sufficient for θ2;
V(X) is independent of T(X) under θ2 = 0;
-
V(x) can be written as
(10) where a(·) and b(·) are two functions with a(T(x)) > 0;
V(x) is continuous in x and L(θ1, θ2|x) is continuous in θ1 and θ2.
The value of (C1, C2) is determined by
| (11) |
and
| (12) |
where I(C1,C2)(·) is an indicator function such that I(C1,C2)(V(X)) = 1 if V(X) ∈ (C1, C2) and 0 otherwise.
Applying this general form to the two sample exponential setting, we can derive Lemma 1.
Lemma 1
The UMPU test of H0 vs. H1 in (2) rejects H0 if y = x1/(x1 + x2) ≤ C1 or y ≥ C2 where C1 and C2 are determined by
| (13) |
and
| (14) |
We propose Lemma 2 to prove Theorem 1.
Lemma 2
Suppose that Xn is a random variable having binomial distribution Bin(n, p), where the probability mass function (p.m.f.) of Xn can be written as
| (15) |
for all x = 0, 1, …, n, and Zn+1 is a random variable with Bin(n+1, p). If Zn+1 is independent with Xn, then
| (16) |
for any positive integer k.
Following Lemma 1 and Lemma 2, Theorem 1 provides the UMPU condition for the test of H0 vs. H1.
Theorem 1
For any level α ∈ (0, 1), the LRT in Remark 1 is the UMPU test.
4 Method Comparison
In this section, we compare two existing tests with the exact LRT. Prior to the comparison in Section 4.3, we introduce the alternative tests in Section 4.1 and derive the power of the three tests in Section 4.2.
4.1 Alternative Tests
The first alternative is an asymptotic likelihood ratio test (ALRT) described in Lee (1992), page 233–236. The α level ALRT rejects H0 in (2) if , where
is the LRT test statistic. Following the notation y = x1/(x1 + x2) and the p.d.f. in (3), the α level ALRT rejects H0 if , where
As discussed in Section 2, fβ(y|d1 + 1, d2 + 1) ∞ yd1 (1 − y)d2 is monotonically increasing for y ∈ [0, d1/(d1 + d2)] and monotonically decreasing for y ∈ [d1/(d1 + d2), 1]. The rejection region of the ALRT for testing the two-sided hypothesis (2) at level α is
| (17) |
where and for all α ∈ [0, 1]. The bisection search algorithm mentioned in Section 2 can be used to compute and numerically. Lee (1992) pointed out three limitations of the ALRT. First, it does not suit one-sided hypotheses. Secondly, this asymptotic test requires a large sample size. Thirdly, the power of ALRT is typically lower than the exact tests.
The second alternative is an F-test originally proposed by Cox (1953) for comparing the rates of occurrence of two Poisson samples, which is the same as comparing the means of two exponential samples of waiting times. Based on the fact that F′ = (X1/d1)/(X2/d2) has an exact F distribution with degrees of freedom 2d1 and 2d2 under H0, the F-test rejects H0 in (2) at level α if f′ ≤ F2d1,2d2,α/2 or f′ ≥ F2d1,2d2,1−α/2, where f′ = (x1/d1)/(x2/d2) and F2d1,2d2,α is the 100 × α% lower quantile of the F distribution with degrees of freedom 2d1 and 2d2. Because of the functional relation Y = X1/(X1 + X2) = 1 − 1/(F′d2 +1), the F-test and the LRT are identical when testing a one-sided hypothesis. For the two-sided hypothesis in (2), the level α F-test has the rejection region
| (18) |
The difference between the F-test and the proposed LRT lies in that the F-test has the equal tail probability rejection region in (18) but the rejection region of the LRT is defined by equations (5) and (7). The F-test and the LRT are identical if and only if A1 = Iβ(α/2; d1, d2) and A2 = Iβ(1 − α/2; d1, d2), which occurs when d1 = d2.
4.2 Rejection Probabilities of the Three Tests
To quantify the power, we first derive the p.d.f. of y = x1/(x1 + x2) given λ1 and λ2. Define z = x2. Then x1 = yz/(1 − y) and x2 = z. Thus,
| (19) |
Thus, given two generic real numbers 0 ≤ g1 ≤ g2 ≤ 1, we can quantify the probability of Y ∈ [0, g1] ∪ [g1, 1] as
| (20) |
Combining (20) with (5), (7), (17), and (18), the rejection probabilities of all three tests at level α has the form (20) with for the ALRT, for the F-test, and (g1, g2) = (A1, A2) for the proposed LRT. Numerical integration methods (e.g., importance sampling, Laplace approximation, and Riemann sum approximation) can be used to approximate the integration in (20). We use Riemann sum approximation for the three examples in Section 4.3.
4.3 Comparing Three Tests
We compare the three tests in Examples 1–3 by investigating the type I error in Example 1 and the power in Examples 2 and 3.
Example 1
Let d1 = d2 = d, λ1 = λ2, and level α = 0.05. Rejection probabilities of the three tests are calculated for d = 1, …, 100 using (20). Figure 1(a) plots the type I error against d1. We can see that both the F-test and the LRT have their type I errors equal to α. The type I error of the ALRT is larger than 0.05 and converges to 0.05 asymptotically as the sample size increases. A general rule given by Lee (1992) suggests that the ALRT works only if d1 + d2 ≥ 25, which can be violated in engineering experiments and medical research if the sample size is small. The rejection probability under α = 0.05 for d = 13 (the smallest value of d to satisfy d1 + d2 ≥ 25) is roughly 0.053, which can be still too high for controlling the type I error.
Figure 1.
(a) Plot of type I error vs. sample size per group for the three tests in Example 1 (d = d1 = d2, α = 0.05); (b) Plot of power minus type I error vs. level α for the F-test and LRT in Example 2 (d1 = 30, d2 = 4, λ1 = 12, λ2 = 11); (c) and (d) Plots of power vs. d1 for the F-test and LRT in Example 3 (d2 = 4 × d1, α = 0.1, (λ1, λ2) = (10, 10.5) in (c), (λ1, λ2) = (10, 15) in (d)).
Example 2
Let (d1, d2) = (30, 4) and (λ1, λ2) = (12, 11). We compare the power of the F-test with the power of the LRT for level α ∈ (0, 1). Following the power computation in (20), we depict in Figure 1(b) the difference between the power and level α. We see that the proposed LRT is unbiased because the solid curve is positive. The F-test, however, is biased for all α ∈ (0, 1). Further, the LRT is more powerful than the F-test. For example, at level α = 0.1, power of the F-test is about 0.097, and the power from the LRT is about 0.104 in this example.
Example 3
We investigate the relation between the power and sample size of the F-test and LRT under two scenarios. In this example, α = 0.1, (λ1, λ2) = (10, 10.5) for scenario 1, and (λ1, λ2) = (10, 15) for scenario 2. We compute the power of the F-test and LRT for d1 = {1, 2, …, 50} and d2 = 4 × d1, which mimics a typical sample size increase when moving from the phase II to the phase III of a clinical trial. Figure 1(c) and (d) plot the power against d1 for scenarios 1 and 2, respectively. We can see that the power is increasing with the sample size d1. The power of the LRT is higher than the power of the F-test for all d1 ∈ {1, 2, … 50}. For instance, 2 more patients are needed for the F-test to be as powerful as the LRT for all d1 ∈ {1, …, 50} in scenario 2. Moreover, bias from the F-test occurs under both scenarios. In scenario 1 where the difference between λ1 and λ2 is small (λ2 = 1.05 × λ1), the F-test is biased for all d1 < 11; for scenario 2 with a larger difference between λ1 and λ2 (λ2 = 1.5 × λ1), the F-test is biased for d1 < 2. This result indicates that the bias in the F-test occurs mostly for small sample size with a moderate difference between λ1 and λ2.
4.4 Application in the Design of a Comparative Lung Cancer Clinical Trial
We implement the F-test and LRT to design a non-small cell lung cancer (NSCLC) clinical trial. Simon, Schell, Begum, Haura, Antonia and Bepler (2011) introduced a personalized NSCLC therapy along with three alternative standard (non-personalized) NSCLC therapies, and showed that the personalized therapy had statistically significant improvements over standard therapies in response rate, overall survival, and progression free survival. We attempt to design a phase III trial to further compare the first 2 year survival distributions of the personalized therapy and that of a standard therapy (recorded as MCC 13303 in Simon et al. (2011)). The time span of first two years was chosen because it was where the difference in survival was found and where exponential assumption was believed to be valid. We assume that more patients can be recruited for the personalized therapy compared with the alternative therapy because 1, the personalized therapy showed a significant improvement in the Phase II trial (Simon et al. 2011), and 2, non-small cell lung cancer patients typically prefer the most promising treatment. Specifically, we assume that 75% of the events are from the personalized therapy and the other 25% are from the alternative therapy.
Figure 2 plots the Kaplan-Meier curves of overall survival corresponding to the two therapies from month 0 to 24. Assuming that λ1 = 24 for personalized therapy group and λ2 = 15 for the alternative group, we applied the exponential test in Hollander and Proschan (1979) to check the exponential assumption with patients alive at 2 years being censored. The p-value of 0.853 (test statistic 0.185) of personalized therapy group and 0.561 (test statistic 0.582) of the other group indicate that exponential assumption was valid for both groups.
Figure 2.
Kaplan-Meier curves of the first 2 year survival data from the personalized therapy and an alternative therapy described in Simon et al. (2011).
We compute the power of the F-test and LRT based on the MLE of (λ1, λ2) that were estimated using the data in Simon et al. (2011), i.e., λ̂1 = 22.25 and λ̂2 = 13.52. Table 1 lists the exact power from the two tests for the two-sided hypothesis in (2) with significance level 0.1. We can see that in the first row, where only 4 patients were recruited, the power of the F-test is less than the significance level 0.1, which indicates that the F-test is biased. The proposed LRT is unbiased. Furthermore, the minimum sample size to achieve 80% of power is 136 for the LRT, less than that for the F-test with difference of 4 patients. Similarly, the minimum sample size to achieve 90% of power is 184 for the LRT, which is 4 patients less than that of the F-test.
Table 1.
Total sample size (left column), power from the LRT (middle column), and power from the F-test (right column) for testing the equivalence of λ1 and λ2 with (λ̂1, λ̂2) = (22.25, 13.52) and proportion of patients being 3: 1 in the two groups.
| Total Sample Size | LRT Power | F-Test Power |
|---|---|---|
| 4 | 0.112 | 0.098 |
| 8 | 0.148 | 0.124 |
| 12 | 0.177 | 0.152 |
| 100 | 0.687 | 0.673 |
| 104 | 0.703 | 0.689 |
| 108 | 0.718 | 0.704 |
| 112 | 0.732 | 0.719 |
| 116 | 0.745 | 0.733 |
| 120 | 0.758 | 0.747 |
| 124 | 0.771 | 0.760 |
| 128 | 0.783 | 0.772 |
| 132 | 0.794 | 0.784 |
| 136 | 0.805 | 0.795 |
| 140 | 0.815 | 0.806 |
| 144 | 0.825 | 0.816 |
| 148 | 0.834 | 0.826 |
| 152 | 0.843 | 0.835 |
| 156 | 0.852 | 0.844 |
| 160 | 0.860 | 0.852 |
| 164 | 0.867 | 0.860 |
| 168 | 0.875 | 0.868 |
| 172 | 0.882 | 0.875 |
| 176 | 0.888 | 0.882 |
| 180 | 0.894 | 0.889 |
| 184 | 0.900 | 0.895 |
| 188 | 0.906 | 0.901 |
5 Summary and Concluding Remarks
The test for comparing two sample exponential distributions is used extensively in survival studies and engineering experiments. A UMPU test is necessary to provide correct interpretation of the two example exponential data in scientific research. In this paper we have proposed a two-sided exact likelihood ratio test to compare two exponential parameters. We have proved the LRT is the UMPU test. This test can also be implemented to compare the occurrence rates of two Poisson samples (Cox 1953).
Because of the UMPU property, the acceptance regions of Y = X1/(X1 + X2) for all level α are the uniformly most accurate unbiased (UMAU) confidence sets under H0 (Lehmann 1986). Similar form of the acceptance region has been discussed in literature. For instance, Tate and Klett (1959) provided the optimal confidence interval of the variance parameter from a normal distribution. Their confidence interval was defined using two conditions similar to (5) and (7).
To implement the proposed LRT (as well as the other two tests), it is crucial to verify the exponential assumption. Given that the focus of this manuscript is rather on the exact UMPU test for comparing two exponential samples, we briefly discuss violation and verification of the exponential assumption next. Violation of the exponential assumption may be due to either non-exponential survival or censoring (e.g., different follow-up time or lost to follow-up). Verifying the exponential distribution with censored observations can be challenging, especially for small samples. Three approaches could be applied to check the exponential assumption: The first approach is the test derived in Hollander and Proschan (1979) that was shown in Section 4.4 for the lung cancer data with censoring. The second is a graphical inspection approach given in Lee (1992). Relying on subjective judgement only, this approach can be difficult to apply in practice. The third approach, which is currently under development, is a reduced piecewise exponential modeling approach that builds on the proposed LRT and a backward elimination procedure to model the failure rate by a piecewise constant function with significant changepoints. Exponential distribution is believed to be valid if no significant changepoint in the failure rate could be identified. According to our findings, this approach is more objective than the graphical inspection, and can be more powerful than the test in Hollander and Proschan (1979).
Current software packages (e.g., PASS and STPLAN) implement the F-test, which, as shown in Section 4.3, may fail to reject significant differences due to its bias and lower power, and thus result in misleading interpretation of the data. The weaknesses of the F-test are more pronounced in situations where the sample size of the test is relatively small, which can be seen in clinical trials of rare diseases and high cost industrial experiments. We suggest that the proposed LRT shall be used instead for comparing two exponential distributions. A software package on the two-sided UMPU test has been developed in MATLAB and is available upon request.
Acknowledgments
The authors would like to thank the editor, Dr. Steven Snapinn, the associate editor, and two referees for their constructive suggestions that improved the quality of this paper. The authors give thanks Gerold Bepler and George R. Simon for providing the non-small cell lung cancer data. This research was sponsored, in part, by the National Institute of Health and the National Cancer Institute, grant 1RC2CA14833201.
APPENDIX. PROOFS
Proof of Remark 1
Proof
The test statistic of the LRT is
So the LRT rejects H0 if (x1/(x1 + x2))d1 (x2/(x1 + x2))d2 ≤ Cα, where P((X1/(X1 + X2))d1 (X2/(X1 + X2))d2 ≤ Cα)= α under H0.
Proof of Lemma 1
Proof
Define θ1 = 1/λ1 + 1/λ2 and θ2 = 1/λ2 − 1/λ1. The likelihood function of (θ1, θ2) can be written in the form of (9), where
Let V (x1, x2) = x1/(x1 + x2) = y. Then the distribution of Y, Beta(d1, d2), is free of θ1 under H0. Thus V (x1, x2) is an ancillary statistic of θ1. For exponential family, T (X1, X2) is minimum sufficient and complete for θ1. (See Lehmann (1986), page 142.) Thus, Basu’s theorem (Basu 1958) guarantees that V (X1, X2) and T (X1, X2) are independent. On the other hand, V (x1, x2) can be written in the form of (10) with a(T (x1, x2)) = −1/[2T (x1, x2)] > 0 and b(T(x1, x2)) = 0.5. Thus, (9) and (10) both hold.
Note that testing the hypothesis in (2) is equivalent to testing vs. . Following the five conditions in Section 3, the UMPU test rejects H0 (or ) if y ≤ C1 or y ≥ C2 where the values of C1 and C2 are determined by (11) and (12). Since Y has Beta(d1, d2) distribution, (11) can be written as (13), and (12) can be written as
| (21) |
By (3), (21) is equivalent to (14). Thus (13) and (14) are equivalent to (11) and (12). The test in Lemma 1 is the UMPU test.
Proof of Lemma 2
Proof
Let and denote two mutually independent random variables distributed Bin(n, p) and Bin(n+1, p), respectively, where {{Si}i=1,…,n, {Rj}j=1,…,n+1} are 2n + 1 mutually independently and identically distributed (i.i.d.) random variables having Bernoulli distribution with parameter p. Using the i.i.d. condition and the definition of Xn and Zn+1,
Equation (16) holds for all positive integer k.
Proof of Theorem 1
Proof
Conditional on (13), (14) is equivalent to , which can be written as Iβ(C2; d1 + 1, d2) − Iβ(C1; d1 + 1, d2) = Iβ(C2; d1, d2) − Iβ(C1; d1, d2), where the function Iβ(·; d1, d2) is the incomplete Beta function with parameters d1 and d2, and Iβ(C1; d1, d2) = P (Y < C1|Y ~ Beta(d1, d2)). Thus, conditional on (13), (14) can be written as
| (22) |
By integrating beta p.d.f. (3) by parts, Iβ(C1; d1, d2) can be written as
| (23) |
Let n = d1 +d2 −1 and p = C1. Following the notation in Lemma 2, Xn ~ Bin(d1 +d2 −1, C1) and Zn+1 ~ Bin(d1 + d2, C1).
By (15) and (23), Iβ(C1; d1, d2) = P (Xn ≥ d1) and Iβ(C1; d1 + 1, d2) = P (Zn+1 ≥ d1 + 1). Using Lemma 2, the right hand side of (22) can be written as
| (24) |
Similarly, the left hand side of (22) can be derived as
| (25) |
By (22), (24), and (25), the joint condition of (13, 14) is equivalent to the joint condition of (13) and
| (26) |
Notice that if (A1, A2) = (C1, C2), the joint condition of (5, 7) can be written as (13, 26). Thus, A1 and A2 in the LRT can guarantee (13) and (14) if we let A1 = C1 and A2 = C2. This completes the proof.
Contributor Information
Gang Han, Email: Gang.Han@moffitt.org.
Michael J. Schell, Email: Michael.Schell@moffitt.org.
Jongphil Kim, Email: Jongphil.Kim@moffitt.org.
References
- Basu D. On Statistics Independent of a Sufficient Statistic. Sankhyā. 1958;20:223–226. [Google Scholar]
- Bui M, Han G, Acs G, Gonzalez R, Reed D, Pasha IL, Zhang P. Connexin 43 Is a Potential Prognostic Biomarker for Ewing Sarcoma (EWS)/Primitive neuroectodermal tumor (PNET) Sarcoma. 2011:Article ID 971050. doi: 10.1155/2011/971050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cox DR. Some Simple Approximate Tests for Poisson Variates. Biometrika. 1953;40:354–360. [Google Scholar]
- Epstein B, Sobel M. Some Theorems Relevant to Life Testing from an Exponential Distribution. Annals of Mathematical Statistics. 1954;25:373–381. [Google Scholar]
- Hollander M, Proschan F. Testing to Determine the Underlying Distribution Using Randomly Censored Data. Biometrics. 1979;35:393–401. [Google Scholar]
- Kalbfleisch JD, Prentice RL. The Statistical Analysis of Failure Time Data. Wiley; New York: 1980. [Google Scholar]
- Lee ET. Statistical Methods for Survival Datat Analysis. Wiley; New York: 1992. [Google Scholar]
- Lehmann EL. Testing Statistical Hypotheses. Springer; New York: 1986. [Google Scholar]
- Meier P, Karrison T, Chappell R, Xie H. The Price of Kaplan-Meier. Journal of the American Statistical Association. 2004;99:890–896. [Google Scholar]
- Miller RG. What Price KaplanMeier? Biometrics. 1983;39:1077–1081. [PubMed] [Google Scholar]
- Simon GR, Schell MJ, Begum M, Haura JKE, Antonia SJ, Bepler G. Preliminary indication of survival benefit from ERCC1 and RRM1-tailored chemotherapy in patients with advanced nonsmall cell lung cancer: Evidence from an individual patient analysis. Cancer. 2011 doi: 10.1002/cncr.26522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tate RF, Klett GW. Optimal Confidence Intervals for the Variance of a Normal Distribution. Journal of the American Statistical Association. 1959;54:674– 682. [Google Scholar]


