Maximum Likelihood Ratio Tests for Comparing the Discriminatory Ability of Biomarkers Subject to Limit of Detection

Albert Vexler; Aiyi Liu; Ekaterina Eliseeva; Enrique F Schisterman

doi:10.1111/j.1541-0420.2007.00941.x

Author manuscript; available in PMC: 2009 Oct 13.

Published in final edited form as: Biometrics. 2007 Nov 19;64(3):895–903. doi: 10.1111/j.1541-0420.2007.00941.x


PMCID: PMC2761038 NIHMSID: NIHMS133597 PMID: 18047527

Summary

In this article, we consider comparing the areas under correlated receiver operating characteristic (ROC) curves of diagnostic biomarkers whose measurements are subject to a limit of detection (LOD), a source of measurement error from instruments’ sensitivity in epidemiological studies. We propose and examine the likelihood ratio tests with operating characteristics that are easily obtained by classical maximum likelihood methodology.

Keywords: Area under curve (AUC), Censoring, Hypothesis testing, Limit of detection (LOD), Maximum likelihood, Receiver operating characteristics (ROC)

1. Introduction

The receiver operating characteristic (ROC) curve is a well-accepted statistical tool for evaluating the discriminatory ability of biomarkers (e.g., Shapiro, 1999). An ROC curve plots the true-positive rates of a biomarker versus its false-positive rates for various thresholds of the test result. It is a convenient way to compare diagnostic biomarkers because the ROC curve places tests on the same scale where they can be compared for accuracy.

The area under the ROC curve (AUC) is a common index of the diagnostic performance of a biomarker. Bamber (1975) showed that AUC = Pr(X > Y), where X and Y represent values of the biomarker from diseased and healthy populations, respectively. Obviously, the closer the AUC is to 1, the better the diagnostic accuracy of the biomarker. In a parametric setting the AUCs can generally be expressed as a function of unknown parameters and thus can be evaluated via estimation of these parameters. Nonparametric estimation of the AUC has also been well addressed in the biostatistical and epidemiological literature. However, the test-scores of biomarkers are frequently associated with measurement error, and in this article we focus on measurement errors due to the limit of detection (LOD).

The LOD is a source of bias in many experiments and is usually caused by the limitation of instruments in measuring very high or low concentrations (e.g., Lyles, Williams, and Chuachoowong, 2001; Lubin et al., 2004; Mumford et al., 2006; Schisterman et al., 2006; Vexler, Liu, and Schisterman, 2006). This inability to accurately determine values of biomarkers introduces bias in the analysis of data from such experiments. For example, biomarkers for polychlorinated biphenyl (PCB), which are associated with endometriosis (Louis et al., 2005), are limited by instrument sensitivity (e.g., Lubin et al., 2004). The LOD issue can be considered as a problem of censored data analysis (e.g., Vexler et al., 2006). Perkins, Schisterman, and Vexler (2007) as well as Mumford et al. (2006) have proposed methods for estimation of ROC curves based on samples with LOD.

Often it is necessary to determine whether a biomarker has satisfactory accuracy in correctly discriminating between cases and controls, for example, testing for AUC = 0.5 (i.e., a biomarker has no discriminatory ability), or whether one biomarker has better diagnostic accuracy than another (e.g., Molodianovitch, Faraggi, and Reiser, 2006). This can be achieved by comparing the AUCs of these biomarkers. The present article addresses these issues when the measurements of the biomarkers are subject to an LOD. We investigate the maximum likelihood ratio test (MLRT), utilizing the likelihood function proposed by Lyles et al. (2001). Operating characteristics of the proposed tests (e.g., significance level and power) can be obtained from classical results of the maximum likelihood method.

The article is organized as follows. Section 2 introduces the MLRT for comparing AUCs. Section 3 presents Monte Carlo simulation results. In Section 4, we apply the proposed tests to data from two studies to evaluate the AUCs of several biomarkers, with some concluding remarks in Section 5. One example is from a study conducted in Birmingham, Alabama, to investigate whether intrauterine inflammation is associated with neuron developmental abnormalities in early childhood, so that certain educational methods for improvement will be utilized. In this example the levels of intrauterine inflammation biomarkers are observed only if they are above the detection limits. Another example uses data from a study of atherosclerotic coronary heart disease to test for discriminatory ability of several biomarkers. This study sampled residents of Niagara and Erie counties in New York who were between the ages of 35 and 79. Adults between the ages of 35 and 65 were randomly selected using the New York State Department of Motor Vehicles drivers’ licenses rolls. Individuals between 65 and 79 years of age were sampled randomly from the Health Care Financing Administration database. A cohort of 942 individuals consisted of 143 people with myocardial infarction (cases) and 799 controls. The purpose of the study was to determine whether biomarkers that measure individuals’ oxidative stress and antioxidant status are good at determining an individual's disease status.

2. Maximum Likelihood Ratio Tests

2.1 Test Based on Complete Data

Let X_k and Y_k represent the values of biomarker k(=1, 2) associated with a diseased (X) and healthy (Y) population, respectively, and {x_k1, . . . , x_kn} and {y_k1, . . . , y_km} be the corresponding test-scores. Suppose the independent vectors (x_1i, x_2i)^T follow a normal distribution

(x_{1i}, x_{2i})^{T} \sim N\left( (μ_{x_1}, μ_{x_2})^{T}, \begin{bmatrix} σ_{x_1}^{2} & ρ_{x} σ_{x_1} σ_{x_2} \\ ρ_{x} σ_{x_1} σ_{x_2} & σ_{x_2}^{2} \end{bmatrix} \right), \quad i = 1, \dots, n,

and similarly,

(y_{1j}, y_{2j})^{T} \sim N\left( (μ_{y_1}, μ_{y_2})^{T}, \begin{bmatrix} σ_{y_1}^{2} & ρ_{y} σ_{y_1} σ_{y_2} \\ ρ_{y} σ_{y_1} σ_{y_2} & σ_{y_2}^{2} \end{bmatrix} \right), \quad j = 1, \dots, m .

Following Bamber (1975), the AUCs of the biomarkers are AUC₁ = P(X₁ > Y₁) and AUC₂ = P(X₂ > Y₂), respectively.

In this section, we formally consider testing the hypothesis

H_{0} : \mathrm{AUC}_{1} = \mathrm{AUC}_{2} \quad \text{versus} \quad H_{1} : \mathrm{AUC}_{1} \neq \mathrm{AUC}_{2} . \qquad (1)

It is clear that $\mathrm{AUC}_{k} = Φ \{ (μ_{x_k} - μ_{y_k}) / \sqrt{σ_{x_k}^{2} + σ_{y_k}^{2}} \}$ , k = 1, 2, and therefore

\mathrm{AUC}_{1} = \mathrm{AUC}_{2} \quad \text{iff} \quad μ_{x_1} = \frac{(μ_{x_2} - μ_{y_2}) (σ_{x_1}^{2} + σ_{y_1}^{2})^{1/2}}{(σ_{x_2}^{2} + σ_{y_2}^{2})^{1/2}} + μ_{y_1} .
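The step behind "it is clear" can be spelled out as follows, assuming (as the model above implies) that the diseased and healthy samples are independent of each other, so that the difference X_k − Y_k is itself normal:

```latex
X_k - Y_k \sim N\left( μ_{x_k} - μ_{y_k},\; σ_{x_k}^{2} + σ_{y_k}^{2} \right),
\qquad
\mathrm{AUC}_k = \Pr(X_k - Y_k > 0)
  = 1 - Φ\left( -\frac{μ_{x_k} - μ_{y_k}}{\sqrt{σ_{x_k}^{2} + σ_{y_k}^{2}}} \right)
  = Φ\left( \frac{μ_{x_k} - μ_{y_k}}{\sqrt{σ_{x_k}^{2} + σ_{y_k}^{2}}} \right).
```

Because Φ is strictly increasing, the two AUCs coincide exactly when the two standardized mean differences coincide, and solving that equality for μ_{x_1} gives the constraint stated above.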

In a simple case, where all the parameters are known and there is no measurement error, that is, (X₁, X₂) and (Y₁, Y₂) are observed completely, we can utilize the classical MLRT for testing H₀. To this end, note that under H₁ and H₀ the likelihood function has the form

\begin{gathered} \prod_{\substack{i = 1, \dots, n \\ j = 1, \dots, m}} f (x_{1i}, x_{2i}, y_{1j}, y_{2j}; ϴ_{X}^{H_{1}}, ϴ_{Y}^{H_{1}}), \\ \prod_{\substack{i = 1, \dots, n \\ j = 1, \dots, m}} f (x_{1i}, x_{2i}, y_{1j}, y_{2j}; ϴ_{X}^{H_{0}}, ϴ_{Y}^{H_{0}}), \end{gathered}

respectively, where the vectors of parameters $ϴ_{X}^{H_{1}}, ϴ_{Y}^{H_{1}}, ϴ_{X}^{H_{0}}, ϴ_{Y}^{H_{0}}$ are

\begin{matrix} ϴ_{X}^{H_{1}} & = (μ_{x_{1}}, μ_{x_{2}}, σ_{x_{1}}^{2}, σ_{x_{2}}^{2}, ρ_{x}), \\ ϴ_{Y}^{H_{1}} & = (μ_{y_{1}}, μ_{y_{2}}, σ_{y_{1}}^{2}, σ_{y_{2}}^{2}, ρ_{y}), \\ ϴ_{X}^{H_{0}} & = (\frac{(μ_{x_{2}} - μ_{y_{2}}) {(σ_{x_{1}}^{2} + σ_{y_{1}}^{2})}^{1 ∕ 2}}{{(σ_{x_{2}}^{2} + σ_{y_{2}}^{2})}^{1 ∕ 2}} + μ_{y_{1}}, μ_{x_{2}}, σ_{x_{1}}^{2}, σ_{x_{2}}^{2}, ρ_{x}), \\ ϴ_{Y}^{H_{0}} & = ϴ_{Y}^{H_{1}}, \end{matrix}

and the density function f is f(x₁, x₂, y₁, y₂; Θ_X, Θ_Y) = ϕ(x₁, x₂; Θ_X) ϕ(y₁, y₂; Θ_Y), where, with Θ = (θ₁, θ₂, θ₃, θ₄, θ₅),

ϕ (u, v; ϴ) = \frac{1}{2 π θ_{3} θ_{4} \sqrt{1 - θ_{5}^{2}}} \exp\left[ - \frac{1}{2} \left( \frac{1}{1 - θ_{5}^{2}} \right) \left\{ \frac{(u - θ_{1})^{2}}{θ_{3}^{2}} - 2 θ_{5} \frac{(u - θ_{1})}{θ_{3}} \frac{(v - θ_{2})}{θ_{4}} + \frac{(v - θ_{2})^{2}}{θ_{4}^{2}} \right\} \right] .

Therefore the classical likelihood ratio test-statistic is

z = \prod_{\substack{i = 1, \dots, n \\ j = 1, \dots, m}} \frac{f (x_{1i}, x_{2i}, y_{1j}, y_{2j}; ϴ_{X}^{H_{1}}, ϴ_{Y}^{H_{1}})}{f (x_{1i}, x_{2i}, y_{1j}, y_{2j}; ϴ_{X}^{H_{0}}, ϴ_{Y}^{H_{0}})} .

Thus, we reject the null hypothesis iff z > z_α, where the threshold z_α corresponds to type I error α. It is clear that this test is the most powerful unbiased test; see, for example, Lehmann (1997).

When the parameters $ϴ_{X}^{H_{1}}, ϴ_{Y}^{H_{1}}, ϴ_{X}^{H_{0}}, ϴ_{Y}^{H_{0}}$ are unknown, Molodianovitch et al. (2006) propose the transformed normal approach by normalizing data through transformation and then applying the parametric test proposed by Wieand et al. (1989) in order to test for hypothesis (1). (This test is based on confidence intervals (CIs) of AUCs [e.g., Reiser and Faraggi, 1997]. We will investigate this method in detail in Section 4.2.) Alternatively, we can apply the maximum likelihood estimation and obtain the test-statistic:

z = \frac{\sup_{\bar{μ}_{x_1}, \bar{μ}_{x_2}, \bar{μ}_{y_1}, \bar{μ}_{y_2}, \bar{σ}_{x_1}^{2}, \bar{σ}_{x_2}^{2}, \bar{σ}_{y_1}^{2}, \bar{σ}_{y_2}^{2}, \bar{ρ}_{x}, \bar{ρ}_{y}} \prod_{\substack{i = 1, \dots, n \\ j = 1, \dots, m}} f (x_{1i}, x_{2i}, y_{1j}, y_{2j}; \bar{ϴ}_{X}^{H_{1}}, \bar{ϴ}_{Y}^{H_{1}})}{\sup_{\bar{μ}_{x_2}, \bar{μ}_{y_1}, \bar{μ}_{y_2}, \bar{σ}_{x_1}^{2}, \bar{σ}_{x_2}^{2}, \bar{σ}_{y_1}^{2}, \bar{σ}_{y_2}^{2}, \bar{ρ}_{x}, \bar{ρ}_{y}} \prod_{\substack{i = 1, \dots, n \\ j = 1, \dots, m}} f (x_{1i}, x_{2i}, y_{1j}, y_{2j}; \bar{ϴ}_{X}^{H_{0}}, \bar{ϴ}_{Y}^{H_{0}})},

where

\begin{aligned} \bar{ϴ}_{X}^{H_{1}} & = (\bar{μ}_{x_1}, \bar{μ}_{x_2}, \bar{σ}_{x_1}^{2}, \bar{σ}_{x_2}^{2}, \bar{ρ}_{x}), \\ \bar{ϴ}_{Y}^{H_{1}} & = (\bar{μ}_{y_1}, \bar{μ}_{y_2}, \bar{σ}_{y_1}^{2}, \bar{σ}_{y_2}^{2}, \bar{ρ}_{y}), \\ \bar{ϴ}_{X}^{H_{0}} & = \left( \frac{(\bar{μ}_{x_2} - \bar{μ}_{y_2}) (\bar{σ}_{x_1}^{2} + \bar{σ}_{y_1}^{2})^{1/2}}{(\bar{σ}_{x_2}^{2} + \bar{σ}_{y_2}^{2})^{1/2}} + \bar{μ}_{y_1}, \bar{μ}_{x_2}, \bar{σ}_{x_1}^{2}, \bar{σ}_{x_2}^{2}, \bar{ρ}_{x} \right), \\ \bar{ϴ}_{Y}^{H_{0}} & = \bar{ϴ}_{Y}^{H_{1}} . \end{aligned}

It is well known (e.g., Lehmann, 1997) that under H₀, the statistic 2 log z asymptotically has a $χ_{1}^{2}$ distribution and therefore the threshold z_α can be easily obtained from Pr(z > z_α) = α, as n, m → ∞. Moreover, this test is asymptotically most powerful (e.g., Choi, Hall, and Schick, 1996).
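As an illustration of how the complete-data statistic 2 log z might be computed numerically, the following Python sketch uses the closed-form bivariate normal estimates under H₁ and a generic optimizer under H₀, where μ_{x_1} is tied to the other parameters. This is a minimal sketch, not the authors' implementation: the function names (`bvn_loglik`, `neg_loglik_h0`, `mlrt_complete`), the log and tanh reparametrization, the BFGS optimizer, and the seed are all assumptions made for the example.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import chi2


def bvn_loglik(data, mu1, mu2, s1, s2, rho):
    """Sum of bivariate normal log-densities over the rows of `data` (shape (N, 2))."""
    u = (data[:, 0] - mu1) / s1
    v = (data[:, 1] - mu2) / s2
    quad = (u ** 2 - 2.0 * rho * u * v + v ** 2) / (1.0 - rho ** 2)
    return np.sum(-np.log(2.0 * np.pi * s1 * s2)
                  - 0.5 * np.log(1.0 - rho ** 2) - 0.5 * quad)


def neg_loglik_h0(q, x, y):
    """Negative log-likelihood under H0; mu_x1 is determined by the other parameters."""
    mx2, my1, my2 = q[0], q[4], q[5]
    # Assumed reparametrization: log for standard deviations, tanh for correlations.
    sx1, sx2, rx = np.exp(q[1]), np.exp(q[2]), np.tanh(q[3])
    sy1, sy2, ry = np.exp(q[6]), np.exp(q[7]), np.tanh(q[8])
    mx1 = (mx2 - my2) * np.sqrt((sx1 ** 2 + sy1 ** 2) / (sx2 ** 2 + sy2 ** 2)) + my1
    return -(bvn_loglik(x, mx1, mx2, sx1, sx2, rx)
             + bvn_loglik(y, my1, my2, sy1, sy2, ry))


def mlrt_complete(x, y):
    """Return (2 log z, asymptotic chi-square(1) p-value) for H0: AUC1 = AUC2."""
    # Under H1 the maximum likelihood estimates are available in closed form.
    mx, my = x.mean(axis=0), y.mean(axis=0)
    sx, sy = x.std(axis=0), y.std(axis=0)          # ddof=0 is the normal MLE
    rx, ry = np.corrcoef(x.T)[0, 1], np.corrcoef(y.T)[0, 1]
    ll1 = (bvn_loglik(x, mx[0], mx[1], sx[0], sx[1], rx)
           + bvn_loglik(y, my[0], my[1], sy[0], sy[1], ry))
    # Under H0 we maximize numerically, starting from the H1 estimates.
    start = np.array([mx[1], np.log(sx[0]), np.log(sx[1]), np.arctanh(rx),
                      my[0], my[1], np.log(sy[0]), np.log(sy[1]), np.arctanh(ry)])
    fit = minimize(neg_loglik_h0, start, args=(x, y), method="BFGS")
    stat = max(2.0 * (ll1 + fit.fun), 0.0)          # clamp numerical noise at zero
    return stat, chi2.sf(stat, df=1)


rng = np.random.default_rng(0)                      # arbitrary seed


def draw(n, mean1, sd1, slope):
    """Correlated pairs: first ~ N(mean1, sd1^2), second = slope * first + N(0, 1)."""
    first = rng.normal(mean1, sd1, n)
    return np.column_stack([first, slope * first + rng.normal(size=n)])


x = draw(150, 1.3, 1.0, 0.5)    # diseased sample
y = draw(150, 1.0, 0.5, -1.5)   # healthy sample
stat, p_value = mlrt_complete(x, y)
print(stat, p_value)
```

The returned statistic is compared with the χ²₁ quantile (3.84 at α = 0.05); the example data are drawn with clearly different AUCs, so a large statistic is expected.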

2.2 Test Based on Data Subject to Limit of Detection

If measurements of the biomarkers are subject to a LOD, then instead of observing x_1i, x_2i, y_1j, y_2j we have

x_{ki}^{'} = \begin{cases} x_{ki}, & \text{if } x_{ki} \geq d_{x}; \\ \text{NA (not available)}, & \text{if } x_{ki} < d_{x}, \end{cases} \qquad y_{kj}^{'} = \begin{cases} y_{kj}, & \text{if } y_{kj} \geq d_{y}; \\ \text{NA}, & \text{if } y_{kj} < d_{y}, \end{cases}

where k = 1, 2, i = 1, . . . , n, j = 1, . . . , m and d_x, d_y are the values of the LOD (e.g., Lynn, 2001; Lubin et al., 2004; Mumford et al., 2006; Schisterman et al., 2006; Vexler et al., 2006). In the present article we assume, without loss of generality, that d_x = d_y = d and d is known (if d is unknown, it can be easily estimated, for example, by $\min_{i,j,k} \{x_{ki}, y_{kj}\}$). We can still obtain the MLRT statistic based on the left-censored data. Following Lyles et al. (2001), write the likelihood functions based on $X^{'} = \{x_{1i}^{'}, x_{2i}^{'}\}_{i = 1}^{n}$ and $Y^{'} = \{y_{1j}^{'}, y_{2j}^{'}\}_{j = 1}^{m}$ as $L (X^{'}; ϴ_{X}^{H_{1}})$ and $L (Y^{'}; ϴ_{Y}^{H_{1}})$ , respectively, which are formally defined in Appendix A.

Thus the MLRT statistic is given by

z^{(LOD)} = \frac{\sup_{\bar{μ}_{x_1}, \bar{μ}_{x_2}, \bar{μ}_{y_1}, \bar{μ}_{y_2}, \bar{σ}_{x_1}^{2}, \bar{σ}_{x_2}^{2}, \bar{σ}_{y_1}^{2}, \bar{σ}_{y_2}^{2}, \bar{ρ}_{x}, \bar{ρ}_{y}} L (X^{'}; \bar{ϴ}_{X}^{H_{1}}) L (Y^{'}; \bar{ϴ}_{Y}^{H_{1}})}{\sup_{\bar{μ}_{x_2}, \bar{μ}_{y_1}, \bar{μ}_{y_2}, \bar{σ}_{x_1}^{2}, \bar{σ}_{x_2}^{2}, \bar{σ}_{y_1}^{2}, \bar{σ}_{y_2}^{2}, \bar{ρ}_{x}, \bar{ρ}_{y}} L (X^{'}; \bar{ϴ}_{X}^{H_{0}}) L (Y^{'}; \bar{ϴ}_{Y}^{H_{0}})} .

Subsequently, the test threshold z_α can be obtained by the MLRT's asymptotic result: $2 \log z^{(LOD)} \sim χ_{1}^{2}$ as n, m → ∞ with d fixed.

Remark 1. Numerical calculations

Note that statistical software such as R or S-PLUS allows us to calculate the test-statistics z and z^(LOD) numerically, without using closed forms of the estimators of the unknown parameters. A schematic example of programming in R is available upon request from the first author.

Remark 2. Transformed normal approach

The proposed method is based on the MLRT technique and hence the parametric assumptions regarding the data points are required. In order to relax the normal distribution assumptions, following Molodianovitch et al. (2006) we can fit the data to a Box–Cox power transformation model to better achieve normality and then test for (1). Note that Molodianovitch et al. (2006) concluded that the transformed normal approach is efficient and robust when AUCs are compared. We present a modification of the proposed test in Appendix B.
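For readers who want to experiment with the transformation, the Box–Cox function T(u, λ) defined in Appendix B can be sketched in Python as below. The function name `box_cox` and the example values are hypothetical. Because T is strictly increasing in u > 0 for every λ, a value falls below the LOD d exactly when its transform falls below T(d, λ), so the detection limit can be carried to the transformed scale in the same way.

```python
import numpy as np


def box_cox(u, lam):
    """Box-Cox power transformation T(u, lambda); u must be positive."""
    u = np.asarray(u, dtype=float)
    if lam == 0:
        return np.log(u)
    return (u ** lam - 1.0) / lam


values = np.array([0.5, 1.0, 2.0, 4.0])   # hypothetical positive biomarker values
d = 0.8                                    # hypothetical LOD on the original scale

# Transform both the data and the detection limit with the same lambda.
print(box_cox(values, 0.5), box_cox(d, 0.5))
```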

3. Simulation

We conducted Monte Carlo simulations to examine the performance of the proposed method. To this end, we generated values of {x_1i, i = 1, . . . , n} from the normal distribution with mean μ_x1 and variance 1, and {x_2i = ax_1i + ε_i, i = 1, . . . , n}, where the independent and identically distributed (i.i.d.) random variables ε_i ∼ N(0, 1). Similarly, y_1j ∼ N(1, 0.5²) and y_2j = by_1j + ε_j(j = 1, . . . , m) were generated. Hence, $ρ_{x} = a σ_{x_{1}} ∕ {(1 + a^{2} σ_{x_{1}}^{2})}^{1 ∕ 2}, σ_{x_{2}} = {(a^{2} σ_{x_{1}}^{2} + 1)}^{1 ∕ 2}, σ_{y_{2}} = {(b^{2} σ_{y_{1}}^{2} + 1)}^{1 ∕ 2}$ , and $ρ_{y} = b σ_{y_{1}} ∕ {(1 + b^{2} σ_{y_{1}}^{2})}^{1 ∕ 2}$ , where a and b are specified below.
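The data-generating scheme just described can be sketched in Python as follows. This is only an illustration under stated assumptions: the helper names (`simulate_pair`, `censor_below`), the seed, and the use of NaN to represent NA are choices made for the example, and the parameter values are those of the significance-level setting given in the next subsection with d = 0.

```python
import numpy as np


def simulate_pair(n, mean1, sd1, slope, rng):
    """Draw n pairs: first ~ N(mean1, sd1^2), second = slope * first + N(0, 1)."""
    first = rng.normal(mean1, sd1, size=n)
    second = slope * first + rng.normal(0.0, 1.0, size=n)
    return np.column_stack([first, second])


def censor_below(data, d):
    """Replace values below the LOD d by NaN (standing in for NA)."""
    out = np.array(data, dtype=float)
    out[out < d] = np.nan
    return out


rng = np.random.default_rng(2007)            # arbitrary seed
a, b, mu_x1, d = 0.7, 0.5, 1.274, 0.0

x = simulate_pair(150, mu_x1, 1.0, a, rng)   # diseased: (x1, x2)
y = simulate_pair(150, 1.0, 0.5, b, rng)     # healthy:  (y1, y2)
x_obs, y_obs = censor_below(x, d), censor_below(y, d)

# Observed fraction of values below d for each marker.
print(np.isnan(x_obs).mean(axis=0), np.isnan(y_obs).mean(axis=0))
```

With a large sample, the empirical correlation of the generated pairs should approach the expression for ρ_x given above.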

Significance level of the test

Setting a = 0.7, b = 0.5, μ_x1 = 1.274, we have AUC₁ = AUC₂ = 0.597. For each value of d = −3, −1, −0.5, 0, 0.5, and 0.75 we generated 10,000 samples of {x_1i, x_2i, i = 1, . . . , n} and {y_1j, y_2j, j = 1, . . . , m}. Based on the generated samples, in each repetition we calculated the values of the test-statistic z^(LOD).
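The AUC values implied by a choice of (μ_x1, a, b), and the theoretical proportions below d reported in Table 1, follow directly from the normal model. The sketch below evaluates them with SciPy; the function names (`design`, `auc`, `prop_below`) are hypothetical, and the means and standard deviations are derived from the simulation model of this section.

```python
from math import sqrt

from scipy.stats import norm


def design(mu_x1, a, b, mu_y1=1.0, sd_x1=1.0, sd_y1=0.5):
    """Means and standard deviations of (x1, x2, y1, y2) under the simulation model."""
    means = {"x1": mu_x1, "x2": a * mu_x1, "y1": mu_y1, "y2": b * mu_y1}
    sds = {"x1": sd_x1, "x2": sqrt(a ** 2 * sd_x1 ** 2 + 1.0),
           "y1": sd_y1, "y2": sqrt(b ** 2 * sd_y1 ** 2 + 1.0)}
    return means, sds


def auc(means, sds, k):
    """AUC_k = Phi((mu_xk - mu_yk) / sqrt(sd_xk^2 + sd_yk^2))."""
    x, y = f"x{k}", f"y{k}"
    return norm.cdf((means[x] - means[y]) / sqrt(sds[x] ** 2 + sds[y] ** 2))


def prop_below(means, sds, d):
    """Theoretical probability that each marker falls below the LOD d."""
    return {name: norm.cdf((d - means[name]) / sds[name]) for name in means}


means, sds = design(mu_x1=1.274, a=0.7, b=0.5)
print(round(auc(means, sds, 1), 3), round(auc(means, sds, 2), 3))
print({k: round(v, 4) for k, v in prop_below(means, sds, -1.0).items()})
```

The same helpers can be pointed at other (μ_x1, a, b) triples to check which pair of AUCs a given design produces.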

In Table 1 we present the Monte Carlo estimation of type I error, where the test thresholds 2 log z_α are 3.84 and 6.63. These thresholds correspond to Pr(ξ > 3.84) = 0.05 and Pr(ξ > 6.63) = 0.01, where $ξ \sim χ_{1}^{2}$ . Table 1 also provides the theoretical proportion of the number of observations of X₁, X₂, Y₁, and Y₂ that are below the LOD value d. As can be seen, asymptotically, the type I error of the proposed test can be obtained from the $χ_{1}^{2}$ distribution of the 2 log z^(LOD) statistic. However, if, for example, n = m = 150 and d = 0.75 (in which case about 60% of Y₂'s are not observed numerically), then this assumption is dubious. (Note that for Type I error α we can assume for this simulation CI = α ± 1.96{α(1 − α)/10,000}^{1/2}.)
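For orientation, the Monte Carlo band in the closing note can be evaluated at the two nominal levels; the rounding to four decimals is ours:

```latex
α = 0.05:\quad 1.96 \sqrt{\frac{0.05 \times 0.95}{10{,}000}} \approx 0.0043
  \;\Rightarrow\; \mathrm{CI} \approx (0.0457,\ 0.0543),
\qquad
α = 0.01:\quad 1.96 \sqrt{\frac{0.01 \times 0.99}{10{,}000}} \approx 0.0020
  \;\Rightarrow\; \mathrm{CI} \approx (0.0080,\ 0.0120).
```

Read against these bands, the n = m = 150 entries of Table 1 for d = −3, −1, and −0.5 are consistent with the nominal levels, whereas those for d = 0, 0.5, and 0.75 fall above them.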

Table 1.

Monte Carlo results for the significance levels of the proposed test. F(u) = Pr(2 log z^(LOD) > u) and F(3.84) ≃ 0.05, F(6.63) ≃ 0.01.

n = m	d	F(3.84)	F(6.63)	P(x₁ < d)	P(y₁ < d)	P(x₂ < d)	P(y₂ < d)
150	−3	0.0504	0.0098	9.5 × 10⁻⁶	6.22 × 10⁻¹⁶	0.0007	0.0003
	−1	0.0510	0.0100	0.0115	3.17 × 10⁻⁵	0.0606	0.0728
	−0.5	0.0535	0.0114	0.0380	0.0013	0.1271	0.1660
	0	0.0573	0.0146	0.1013	0.0228	0.2325	0.3138
	0.5	0.0601	0.0153	0.2194	0.1587	0.3740	0.5000
	0.75	0.0634	0.0241	0.3000	0.3085	0.4537	0.5958
30	−3	0.0510	0.0110
	−0.5	0.0507	0.0120
	0	0.0593	0.0150


Power of the test

Here we examine the power of the test for situations where {AUC₁ = 0.5, AUC₂ = 0.6} and {AUC₁ = 0.6, AUC₂ = 0.9}. For the first case, we set μ_x1 = 1, a = 0.7, b = 0.3, and for the second μ_x1 = 1.3, a = 0.5, b = −1.5. For both cases n = m = 150. Table 2 displays the Monte Carlo estimation of the test's power for different values of d. As expected, the power of the test depends on the proportion of X₁'s, X₂'s, Y₁'s, and Y₂'s below d.

Table 2.

Monte Carlo results for the power of the test. F = Pr(2 log z^(LOD) > 3.84).

d	AUC₁	AUC₂	F	P(x₁ < d)	P(y₁ < d)	P(x₂ < d)	P(y₂ < d)
−3	0.5	0.6	0.8394	3.2 × 10⁻⁵	6.2 × 10⁻¹⁶	1.1 × 10⁻³	5.5 × 10⁻⁴
−1	0.5	0.6	0.8284	2.3 × 10⁻²	3.2 × 10⁻⁵	8.2 × 10⁻²	9.9 × 10⁻²
0	0.5	0.6	0.8374	0.16	2.3 × 10⁻²	0.28	0.38
0.75	0.5	0.6	0.7372	0.40	0.31	0.52	0.67
−3	0.6	0.9	0.9995	8.5 × 10⁻⁶	6.2 × 10⁻¹⁶	5.5 × 10⁻⁴	0.12
−1	0.6	0.9	0.9985	1.1 × 10⁻²	3.2 × 10⁻⁵	0.07	0.66
0	0.6	0.9	0.9973	0.08	0.02	0.28	0.89


Table 2 demonstrates the high values of the power even in the situation where AUC₁ is close to AUC₂ and the proportions of the biomarker values below d are high (d = 0, 0.75).

Robustness

The simulations thus far assume that the samples follow normal distributions. In order to illustrate the robustness of our method, we performed the following Monte Carlo simulations. Suppose that, instead of following normal distributions, the diagnostic markers satisfy $x_{2 i} = 0.7 x_{1 i}^{(df)} + ε_{i}^{(df)}, y_{2 j} = 0.3 y_{1 j}^{(df)} + ε_{j}^{(df)}$ , where $x_{1}^{(df)}, y_{1}^{(df)}$ , and ε^(df) are independent identically t-distributed random variables with df degrees of freedom, mean 0 and variance 1, 1 ≤ i, j ≤ 150. Thus, AUC₁ = AUC₂ = 0.5. Here we ran 10,000 repetitions of the sample (X′, Y′) at each df = 5, 10, 15 and d = −3, −1, 0 (d is the value of LOD). We examined the significance level of the proposed test given the misspecified distributional assumption. Table 3 corresponds to the case when we expect the type I error to be 0.05 (the test threshold 2 log z_α is 3.84).

Table 3.

The Monte Carlo type I error of the proposed test given the misspecified distributional assumption

df	d	P̂r(2 log z^(LOD) > 3.84)
15	−3	0.0506
10	−3	0.0524
5	−3	0.0518
15	−1	0.0590
10	−1	0.0610
5	−1	0.0651
15	0	0.0958
10	0	0.1507
5	0	0.3288


From these results we conclude that the proposed method is reasonable even when the distributional assumptions do not exactly satisfy normality. However, the accuracy of the expected significance level is poor when d = 0 (about 50% of the data are below the detection limit). In contrast, Table 1 indicates that under the correct distributional assumption this proportion of observations below LOD is not critical.

Imputation method

Conventional approaches to dealing with data below LOD include omission, resulting in a truncated data set, and imputation with a constant, such as d or a fraction thereof (e.g., d/2, $d ∕ \sqrt{2}$ ); or the observed values may be used directly or indirectly (e.g., Lubin et al., 2004; Schisterman et al., 2006). Perkins et al. (2007) showed that the imputation method can lead to biased parametric/nonparametric estimation of AUCs. Here we report results of the Monte Carlo simulation corresponding to Table 1, where the test based on CIs (e.g., Wieand et al., 1989; Reiser and Faraggi, 1997; for details, see Section 4.2) has been calculated for observations:

x_{ki}^{″} = \begin{cases} x_{ki}, & x_{ki} \geq d; \\ \mathrm{Imp}, & x_{ki} < d, \end{cases} \qquad y_{kj}^{″} = \begin{cases} y_{kj}, & y_{kj} \geq d; \\ \mathrm{Imp}, & y_{kj} < d, \end{cases}

k = 1, 2, i = 1, . . . , n = 150, j = 1, . . . , m = 150, and Imp = d/2, $d ∕ \sqrt{2}$ .
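The substitution rule above amounts to a one-line replacement. A minimal Python sketch, with a hypothetical function name `impute_below_lod` and NaN standing in for values already recorded as NA, is:

```python
import numpy as np


def impute_below_lod(values, d, rule="half"):
    """Replace entries below d (or already NaN) by Imp = d/2 or d/sqrt(2)."""
    values = np.asarray(values, dtype=float)
    imp = d / 2.0 if rule == "half" else d / np.sqrt(2.0)
    # Comparisons with NaN are False, so NA entries are imputed as well.
    return np.where(values >= d, values, imp)


sample = [0.2, 0.8, np.nan, 1.5]            # hypothetical measurements
print(impute_below_lod(sample, d=0.5))
print(impute_below_lod(sample, d=0.5, rule="sqrt2"))
```

The sketch only reproduces the conventional imputation rule being evaluated here; it is not part of the proposed MLRT.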

In contrast with Table 1, Table 4 demonstrates that when d = 0, −3, 0.75 these conventional approaches should not be recommended. (Investigation of the test based on the samples ignoring the NA values and the nonparametric test [Wieand et al., 1989] based on $x_{k i}^{''}, y_{k j}^{''}$ , k = 1, 2, i = 1, . . . , 150, j = 1, . . . , 150 led to similar conclusions.)

Table 4.

The Monte Carlo type I error of the test based on confidence intervals when the imputation method is applied and the expected significance level is 0.05

d	Imp = d/2	Imp = $d / \sqrt{2}$
−3	0.0549	0.0546
0	0.0689	0.0687
0.5	0.1098	0.1455
0.75	0.1539	0.2273


4. Examples

We exemplify the proposed method with data from the two studies briefly described in the introduction.

4.1 The IQ Study

Here we examine whether biomarker IL8 has the ability to discriminate between low and high levels of IQ. The data include 369 subjects. The IQ indicator full-scale IQ (FSIQ) has values ranging from 46 to 118 with an average equal to 82.57. We split our data into two populations, where population A includes those with IQ less than 82.57 and population B includes those with IQ greater than 82.57. We associate biomarker IL8 with both populations separately. Denote X, Y as biomarker values related to population A and B, respectively. The total number of Xs is 189 and the total number of Ys is 180. According to the instrument manual, the LOD for IL8 is d = 3.2, yielding the numbers of NAs to be 95 and 108 for X and Y, respectively. The logarithmic values of the biomarkers are used in order to better achieve normality. The empirical histograms of the log-transformed biomarker corresponding to high and low levels of IQ are depicted in Figure 1.

Figure 1. Histograms of the log-transformed biomarker of interest corresponding to low (a) and high (b) levels of IQ.

Under the assumption that log X and log Y have normal distributions, applying the maximum likelihood estimation proposed by Vexler et al. (2006) (or estimation based on censored data, see, e.g., Gupta [1952]) yields estimated means of log X and log Y of 1.02 and 0.38, respectively. The corresponding standard deviations are 2.60 and 2.88. In this case the estimated AUC is

Φ \left[ \frac{E \log X - E \log Y}{\{ var (\log X) + var (\log Y) \}^{1/2}} \right] = 0.57 .
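Substituting the reported estimates makes the arithmetic explicit (intermediate values rounded by us):

```latex
\frac{1.02 - 0.38}{\sqrt{2.60^{2} + 2.88^{2}}}
  = \frac{0.64}{\sqrt{6.7600 + 8.2944}}
  = \frac{0.64}{3.88}
  \approx 0.165,
\qquad
Φ(0.165) \approx 0.57 .
```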

Now, we test for AUC = 0.5 under the ROC curve of IL8 (i.e., no discriminatory ability of the biomarker). This is a particular case of the testing procedure considered in Section 2. Specifically, because the AUC = 0.5 iff E log X = E log Y, the test statistic has the form

2 \log z = 2 \left[ \sup_{μ_{x}, μ_{y}, σ_{x}, σ_{y}} \{ l (\log X, n_{x}, k_{x}; μ_{x}, σ_{x}) + l (\log Y, n_{y}, k_{y}; μ_{y}, σ_{y}) \} - \sup_{μ_{x}, σ_{x}, σ_{y}} \{ l (\log X, n_{x}, k_{x}; μ_{x}, σ_{x}) + l (\log Y, n_{y}, k_{y}; μ_{x}, σ_{y}) \} \right],

where

\begin{gathered} l (z, n, k; μ, σ) = - (n - k) \ln (σ) - \sum_{i: z_{i} > \log d} \frac{(z_{i} - μ)^{2}}{2 σ^{2}} + k \ln Φ \left( \frac{\log d - μ}{σ} \right), \\ n_{x} = 189, \quad n_{y} = 180, \quad k_{x} = 95, \quad k_{y} = 108, \end{gathered}

and the function exp(l) is proportional to the likelihood based on censored data. For details regarding this maximum likelihood function see Vexler et al. (2006). The value of the test-statistic is computed to be 2.97. Because the value of z_0.05 corresponding to Pr_H₀(2 log z > z_0.05) ≃ 0.05 (from $χ_{1}^{2}$ distribution) is 3.84, we do not reject H₀. Therefore we conclude that the discriminatory ability of biomarker IL8 is not significant.
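The univariate test above lends itself to a short numerical sketch. The Python code below implements l(z, n, k; μ, σ) and the two maximizations with a generic optimizer. It is an illustration under stated assumptions, not the authors' code: the function names, the Nelder–Mead optimizer, the log parametrization of σ, and the synthetic data (the study data are not reproduced here) are all choices made for the example.

```python
import numpy as np
from scipy.optimize import minimize
from scipy.stats import chi2, norm


def censored_loglik(obs, k, lod, mu, sigma):
    """l(z, n, k; mu, sigma): `obs` holds the n - k observed values, k counts values below lod."""
    return (-obs.size * np.log(sigma)
            - np.sum((obs - mu) ** 2) / (2.0 * sigma ** 2)
            + k * norm.logcdf((lod - mu) / sigma))


def auc_half_test(x_obs, kx, y_obs, ky, lod):
    """Return (2 log z, chi-square(1) p-value) for H0: common mean, i.e. AUC = 0.5."""
    def nll_h1(p):   # separate means and standard deviations
        return -(censored_loglik(x_obs, kx, lod, p[0], np.exp(p[1]))
                 + censored_loglik(y_obs, ky, lod, p[2], np.exp(p[3])))

    def nll_h0(p):   # common mean, separate standard deviations
        return -(censored_loglik(x_obs, kx, lod, p[0], np.exp(p[1]))
                 + censored_loglik(y_obs, ky, lod, p[0], np.exp(p[2])))

    s = [x_obs.mean(), np.log(x_obs.std()), y_obs.mean(), np.log(y_obs.std())]
    opts = {"xatol": 1e-8, "fatol": 1e-8, "maxiter": 4000}
    fit1 = minimize(nll_h1, s, method="Nelder-Mead", options=opts)
    fit0 = minimize(nll_h0, [s[0], s[1], s[3]], method="Nelder-Mead", options=opts)
    stat = max(2.0 * (fit0.fun - fit1.fun), 0.0)   # clamp numerical noise at zero
    return stat, chi2.sf(stat, df=1)


# Synthetic log-scale data, loosely patterned on the sample sizes above.
rng = np.random.default_rng(42)                    # arbitrary seed
lod = np.log(3.2)
x_all = rng.normal(1.0, 2.6, size=189)
y_all = rng.normal(0.4, 2.9, size=180)
x_obs, y_obs = x_all[x_all >= lod], y_all[y_all >= lod]
stat, p_value = auc_half_test(x_obs, 189 - x_obs.size, y_obs, 180 - y_obs.size, lod)
print(stat, p_value)
```

For the statistic reported in the text, `chi2.sf(2.97, df=1)` gives a p-value of roughly 0.085, in line with the decision not to reject H₀ at the 0.05 level.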

4.2 Evaluating Biomarkers for Coronary Heart Disease

For this example, we compare the diagnostic accuracy of two biomarkers, cholesterol and hdl-cholesterol. To normalize the data, we log-transform the values of both biomarkers. It is obvious from a biological standpoint that the levels of cholesterol and hdl-cholesterol are correlated. Denote by X₁, Y₁ the log-transformed values of cholesterol for the cases and controls, respectively, and similarly, X₂, Y₂ the log-transformed hdl-cholesterol levels from cases and controls. The estimated means of X₁, X₂, Y₁, and Y₂ are 5.63, 4.15, 5.47, 4.13, and the estimated standard deviations are 0.18, 0.24, 0.30, 0.25, respectively. The estimators of the correlation between X₁ and X₂, as well as between Y₁ and Y₂ are ${\hat{ρ}}_{x} = 0.06$ and ${\hat{ρ}}_{y} = 0.04$ , respectively. Figure 2 introduces empirical histograms of X₁, X₂, Y₁, and Y₂.

Figure 2. Histograms of the log-transformed biomarkers of interest corresponding to cholesterol cases (a), hdl-cholesterol cases (b), cholesterol controls (c), and hdl-cholesterol controls (d). The vertical bold lines correspond to average values of the biomarkers.

Assume that the values of the log-transformed biomarkers are normally distributed. Simulation studies were conducted for each of d = 0, 3, 3.25, 3.5, 3.75, 4, and 4.25. Table 5 presents estimators of the correlated AUCs and p-values obtained based on values of the test-statistic z for different d (theoretically 2 log z^(LOD) is approximately $χ_{1}^{2}$ distributed). (Note that the situation d = 0 corresponds to no LOD effect.)

Table 5.

Estimation of the AUCs and values of the test-statistic for different d. N_{X_k}, N_{Y_k} are numbers of events {X_k < d} and {Y_k < d}, respectively (k = 1, 2).

d	N_X₂	N_Y₁	N_Y₂	AÛC₁	AÛC₂	2 log z^(LOD)	p-value
0.00	0	0	0	0.671	0.524	24.291	10.20 × 10⁻⁷
3.00	0	0	1	0.671	0.524	24.049	9.39 × 10⁻⁷
3.25	0	0	2	0.671	0.524	22.156	2.51 × 10⁻⁶
3.50	2	1	8	0.671	0.524	21.969	2.77 × 10⁻⁶
3.75	8	1	40	0.672	0.525	22.570	2.03 × 10⁻⁶
4.00	34	2	207	0.672	0.530	22.941	1.67 × 10⁻⁶
4.25	88	5	553	0.673	0.583	21.862	2.93 × 10⁻⁶


From Table 5, for any selected value of d, the null hypothesis H₀ is rejected with p-values increasing as d increases.

Although standard SPSS output gives the asymptotic 95% CI of AUC₁ as (0.628,0.708) and of AUC₂ as (0.481,0.585), in the simple case where d = 0, we cannot conclude that H₀ : AUC₁ = AUC₂ is rejected because the estimators of AUC₁ and AUC₂ are correlated. We utilize a method proposed by Wieand et al. (1989, p. 587). Following these authors, we have, if biomarkers’ values are normally distributed, the test-statistic

z_{CI} = \frac{\hat{δ}_{1} - \hat{δ}_{2}}{\{ (n + m) \, var (T) \}^{1/2}} \sim N (0, 1), \quad n, m \to \infty,

where

\begin{aligned} \hat{δ}_{k} & = \frac{\hat{μ}_{x_k} - \hat{μ}_{y_k}}{(\hat{σ}_{x_k}^{2} + \hat{σ}_{y_k}^{2})^{1/2}}, \quad \hat{μ}_{x_k} = \frac{\sum_{i = 1}^{n} x_{ki}}{n}, \quad \hat{μ}_{y_k} = \frac{\sum_{j = 1}^{m} y_{kj}}{m}, \\ \hat{σ}_{x_k}^{2} & = \frac{\sum_{i = 1}^{n} (x_{ki} - \hat{μ}_{x_k})^{2}}{n - 1}, \quad \hat{σ}_{y_k}^{2} = \frac{\sum_{j = 1}^{m} (y_{kj} - \hat{μ}_{y_k})^{2}}{m - 1}, \\ var ((n + m)^{1/2} T) & = σ_{11} - 2 σ_{12} + σ_{22}, \\ (n + m)^{-1} σ_{ii} & = σ_{i}^{-2} (n^{-1} σ_{x_i}^{2} + m^{-1} σ_{y_i}^{2}) + \frac{1}{2} δ_{i}^{2} σ_{i}^{-4} \{ (n - 1)^{-1} σ_{x_i}^{4} + (m - 1)^{-1} σ_{y_i}^{4} \}, \\ (n + m)^{-1} σ_{12} & = (σ_{1} σ_{2})^{-1} (n^{-1} C_{x} + m^{-1} C_{y}) + \frac{1}{2} δ_{1} δ_{2} (σ_{1} σ_{2})^{-2} \{ (n - 1)^{-1} C_{x}^{2} + (m - 1)^{-1} C_{y}^{2} \}, \\ σ_{i}^{2} & = σ_{x_i}^{2} + σ_{y_i}^{2}, \quad C_{x} = ρ_{x} σ_{x_1} σ_{x_2}, \quad C_{y} = ρ_{y} σ_{y_1} σ_{y_2}, \\ \text{and} \quad δ_{k} & = \frac{μ_{x_k} - μ_{y_k}}{(σ_{x_k}^{2} + σ_{y_k}^{2})^{1/2}}, \quad k = 1, 2 . \end{aligned}

Thus, because the z_CI calculated from the data is 4.71, the p-value of the test |z_CI| > z_α is 0.0021, whereas our proposed method has p-value 10.20 × 10⁻⁷; see Table 5 (d = 0).

5. Discussion

In the present article, we have shown that the maximum likelihood ratio approach serves as a method of testing for the hypothesis regarding the comparison of AUCs. Such an approach yields a powerful test with characteristics that can be obtained by the well-established maximum likelihood theory. We used real data examples to illustrate how easily the MLRT method can be carried out in order to compare two biomarkers and to determine whether a biomarker has discriminatory ability.

The article assumes normal distributions for the values of the biomarkers when LOD is present. However, the proposed approach can be extended to other commonly used distributions, for example, gamma, lognormal, etc. Similarly, we can perform hypothesis testing for AUCs based on right, double-censored, or truncated data. We have focused on comparing paired correlated areas, but the proposed method can be adapted to multivariate cases as well.

Our article presented a method dealing with data subject to LOD with broad validity under a reasonable set of assumptions. Sensitivity analysis, though beyond the scope of the present article, is important to assess these distributional assumptions. This topic can be discussed in a general context of missing data analysis; see Molenberghs and Kenward (2007). However, one must bear in mind that data below LOD are informatively missing, in the sense that they are unobservable only if the actual values are below the detection limit.

We briefly investigated several imputation methods that are commonly applied among epidemiologists in dealing with LOD data. These methods, however, are not statistically justified and should not be confused with the popular method of multiple imputation (e.g., Rubin, 1987) in the missing data analysis literature. The use of multiple imputation in the analysis of LOD data deserves further investigation.

Note that nonparametric distribution function estimation based on censored data can be obtained and hence Kolmogorov–Smirnov-(or Shapiro–Wilk)-type tests for correctness of parametric assumptions can be evaluated (e.g., Verrill and Johnson, 1988). In the context of the ROC curves and Box–Cox power transformation models based on data subject to LOD, we will address nonparametric and semi-parametric methods in a subsequent article.

The proposed approach preserves the efficiency of the MLRT when applied to testing for biomarkers’ diagnostic accuracy subject to the LOD. When an additive measurement error is in effect, the appropriate maximum likelihood approach can also be utilized following a method similar to that of Section 2.

Acknowledgements

This research was supported by the Intramural Research Program of the National Institute of Child Health and Human Development, National Institutes of Health. The opinions expressed are those of the authors and not necessarily of the National Institutes of Health. The authors would like to thank Margaret Hillier for providing the intrauterine inflammation data. We are grateful to the co-editor, associate editor, and referee for their helpful comments that clearly improved this article.

Appendix A

The Likelihood Functions Based on Data Subject to LOD

The likelihood function based on $X^{'} = \{x_{1i}^{'}, x_{2i}^{'}\}_{i = 1}^{n}$ has the form of

L (X^{'}; ϴ_{X}^{H_{1}}) = \left( \prod_{i = 1}^{n_{1}} t_{i1}^{(x)} \right) \times \left( \prod_{i = n_{1} + 1}^{n_{2}} t_{i2}^{(x)} \right) \times \left( \prod_{i = n_{2} + 1}^{n_{3}} t_{i3}^{(x)} \right) \times (t_{4}^{(x)})^{n_{4}},

where

\begin{matrix} t_{i 1}^{(x)} & = {(2 π σ_{x_{1}} σ_{x_{2} ∣ x_{1}})}^{- 1} \\ \times \exp [- 0.5 {\frac{{(x_{2 i}^{'} - μ_{x_{2} ∣ x_{1 i}})}^{2}}{σ_{x_{2} ∣ x_{1}}^{}} + \frac{{(x_{1 i}^{'} - μ_{x_{1}})}^{2}}{σ_{x_{1}}^{2}}}], \\ t_{i 2}^{(x)} & = {(2 π σ_{x_{1}}^{2})}^{- 1 ∕ 2} \exp {- 0.5 \frac{{(x_{1 i}^{'} - μ_{x_{1}})}^{2}}{σ_{x_{1}}^{2}}} Φ (\frac{d - μ_{x_{2} ∣ x_{1 i}}}{σ_{x_{2} ∣ x_{1}}}), \\ t_{i 3}^{(x)} & = {(2 π σ_{x_{2}}^{2})}^{- 1 ∕ 2} \exp {- 0.5 \frac{{(x_{2 i}^{'} - μ_{x_{2}})}^{2}}{σ_{x_{2}}^{2}}} Φ (\frac{d - μ_{x_{1} ∣ x_{2 i}}}{σ_{x_{1} ∣ x_{2}}}), \\ t_{4}^{(x)} & = \int_{- \infty}^{d} Φ [\frac{d - {μ_{x_{1}} + \frac{ρ_{x} σ_{x_{1}} (x_{2} - μ_{x_{2}})}{σ_{x_{2}}}}}{σ_{x_{1}} \sqrt{1 - ρ_{x}^{2}}}] {(2 π σ_{x_{2}}^{2})}^{- 1 ∕ 2} \\ \times \exp {- 0.5 \frac{{(x_{2} - μ_{x_{2}})}^{2}}{σ_{x_{2}}^{2}}} d x_{2}, \\ μ_{x_{2} ∣ x_{1 i}} & = μ_{x_{2}} + (ρ_{x} σ_{x_{2}} ∕ σ_{x_{1}}) (x_{1 i}^{'} - μ_{x_{1}}), σ_{x_{2} ∣ x_{1}}^{2} = σ_{x_{2}}^{2} (1 - ρ_{x}^{2}), \\ μ_{x_{1} ∣ x_{2 i}} & = μ_{x_{1}} + (ρ_{x} σ_{x_{1}} ∕ σ_{x_{2}}) (x_{2 i}^{'} - μ_{x_{2}}), σ_{x_{1} ∣ x_{2}}^{2} = σ_{x_{1}}^{2} (1 - ρ_{x}^{2}), \end{matrix}

and n₁, n₂, n₃, n₄ (n₁ + n₂ + n₃ + n₄ = n) are the numbers of events $\{x_{1i}' \neq \mathrm{NA}, x_{2i}' \neq \mathrm{NA}\}_{i=1}^{n}$, $\{x_{1i}' \neq \mathrm{NA}, x_{2i}' = \mathrm{NA}\}_{i=1}^{n}$, $\{x_{1i}' = \mathrm{NA}, x_{2i}' \neq \mathrm{NA}\}_{i=1}^{n}$, and $\{x_{1i}' = \mathrm{NA}, x_{2i}' = \mathrm{NA}\}_{i=1}^{n}$, respectively. Here, the term $t_{i1}^{(x)}$ corresponds to situations where $x_{1i}$ and $x_{2i}$ are both observed; $t_{i2}^{(x)}$ relates to situations where $x_{1i}$ is observed, whereas $x_{2i}$ is below the detection limit $d$ and thus $x_{2i}'$ is NA (the opposite case, where $x_{1i}$ is unobserved whereas $x_{2i}$ is available, matches $t_{i3}^{(x)}$). Finally, when neither x₁ nor x₂ is observed numerically, we have $t_4^{(x)}$ (i.e., $t_4^{(x)}$ represents the probability that both $x_1'$ and $x_2'$ are NA).

The likelihood function based on $Y' = \{y_{1j}', y_{2j}'\}_{j=1}^{m}$ is defined in a similar manner:

$$
\begin{aligned}
L(Y'; \Theta_Y^{H_1}) = {} & \Big(\prod_{j=1}^{m_1} t_{j1}^{(y)}\Big) \times \Big(\prod_{j=m_1+1}^{m_2} t_{j2}^{(y)}\Big) \times \Big(\prod_{j=m_2+1}^{m_3} t_{j3}^{(y)}\Big) \\
& \times \big(t_4^{(y)}\big)^{m_4}, \qquad \sum_{r=1}^{4} m_r = m.
\end{aligned}
$$
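As an illustration only, the four contributions described in this appendix can be evaluated numerically along the following lines. This is a minimal sketch, not the authors' implementation: the function names, the use of `None` to stand for NA, the requirement that |ρ| < 1, and the evaluation of the integral in $t_4$ by Simpson's rule on a truncated range are all assumptions introduced here.

```python
import math

def norm_cdf(z):
    """Standard normal CDF, Phi(z)."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def norm_logpdf(v, mu, sd):
    """Log density of N(mu, sd^2) at v."""
    return -0.5 * ((v - mu) / sd) ** 2 - math.log(sd * math.sqrt(2.0 * math.pi))

def safe_log(p):
    """Log that tolerates probabilities that underflow to zero."""
    return math.log(max(p, 1e-300))

def prob_both_below(d, mu1, mu2, sd1, sd2, rho, steps=2000):
    """t_4: P(x1 < d, x2 < d), Simpson's rule over x2 in [mu2 - 10*sd2, d]."""
    lo = mu2 - 10.0 * sd2          # truncated lower limit (an assumption)
    if d <= lo:
        return 0.0
    h = (d - lo) / steps           # steps must be even for Simpson's rule
    sd1_cond = sd1 * math.sqrt(1.0 - rho ** 2)

    def integrand(x2):
        mu1_cond = mu1 + rho * sd1 * (x2 - mu2) / sd2
        return norm_cdf((d - mu1_cond) / sd1_cond) * math.exp(norm_logpdf(x2, mu2, sd2))

    total = integrand(lo) + integrand(d)
    for k in range(1, steps):
        total += (4.0 if k % 2 else 2.0) * integrand(lo + k * h)
    return total * h / 3.0

def log_likelihood(pairs, d, mu1, mu2, sd1, sd2, rho):
    """log L for pairs (x1, x2); None marks a value below the LOD d (NA)."""
    sd1_cond = sd1 * math.sqrt(1.0 - rho ** 2)   # sd of x1 given x2
    sd2_cond = sd2 * math.sqrt(1.0 - rho ** 2)   # sd of x2 given x1
    log_t4 = safe_log(prob_both_below(d, mu1, mu2, sd1, sd2, rho))
    total = 0.0
    for x1, x2 in pairs:
        if x1 is not None and x2 is not None:    # t_{i1}: both observed
            mu2_cond = mu2 + rho * sd2 / sd1 * (x1 - mu1)
            total += norm_logpdf(x1, mu1, sd1) + norm_logpdf(x2, mu2_cond, sd2_cond)
        elif x1 is not None:                     # t_{i2}: x2 below d
            mu2_cond = mu2 + rho * sd2 / sd1 * (x1 - mu1)
            total += norm_logpdf(x1, mu1, sd1) + safe_log(norm_cdf((d - mu2_cond) / sd2_cond))
        elif x2 is not None:                     # t_{i3}: x1 below d
            mu1_cond = mu1 + rho * sd1 / sd2 * (x2 - mu2)
            total += norm_logpdf(x2, mu2, sd2) + safe_log(norm_cdf((d - mu1_cond) / sd1_cond))
        else:                                    # t_4: both below d
            total += log_t4
    return total

# Hypothetical data: one pair of each of the four observation patterns.
sample = [(1.2, 0.8), (0.9, None), (None, 0.4), (None, None)]
print(log_likelihood(sample, d=0.3, mu1=1.0, mu2=0.7, sd1=0.5, sd2=0.5, rho=0.4))
```

Working on the log scale avoids underflow when many terms are multiplied; the same function applies to the $Y'$ sample with the $y$ parameters.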

Appendix B

Transformed Normal Approach

We denote the function

$$
T(u, \lambda) = \begin{cases} \dfrac{u^{\lambda} - 1}{\lambda}, & \lambda \neq 0, \\ \log(u), & \lambda = 0 \end{cases}
$$

and extend the likelihoods $L(X'; \Theta_X^{H_1})$ and $L(Y'; \Theta_Y^{H_1})$ of Appendix A to the forms of

$$
\begin{aligned}
L(X'; \Theta_X^{H_1}, \lambda_1, \lambda_2) = {} & \Big(\prod_{i=1}^{n_1} \tilde{t}_{i1}^{(x)}\Big) \times \Big(\prod_{i=n_1+1}^{n_2} \tilde{t}_{i2}^{(x)}\Big) \\
& \times \Big(\prod_{i=n_2+1}^{n_3} \tilde{t}_{i3}^{(x)}\Big) \times \big(\tilde{t}_4^{(x)}\big)^{n_4}, \\
L(Y'; \Theta_Y^{H_1}, \lambda_1, \lambda_2) = {} & \Big(\prod_{i=1}^{m_1} \tilde{t}_{i1}^{(y)}\Big) \times \Big(\prod_{i=m_1+1}^{m_2} \tilde{t}_{i2}^{(y)}\Big) \\
& \times \Big(\prod_{i=m_2+1}^{m_3} \tilde{t}_{i3}^{(y)}\Big) \times \big(\tilde{t}_4^{(y)}\big)^{m_4},
\end{aligned}
$$

where for z = x, y

$$
\begin{aligned}
\tilde{t}_{i1}^{(z)} &= (2\pi \sigma_{z_1} \sigma_{z_2 \mid z_1})^{-1} \exp\Big[-0.5\Big\{\frac{(T(z_{2i}', \lambda_2) - \mu_{z_2 \mid z_{1i}})^2}{\sigma_{z_2 \mid z_1}^2} \\
&\qquad + \frac{(T(z_{1i}', \lambda_1) - \mu_{z_1})^2}{\sigma_{z_1}^2}\Big\}\Big], \\
\tilde{t}_{i2}^{(z)} &= (2\pi \sigma_{z_1}^2)^{-1/2} \exp\Big\{-0.5 \frac{(T(z_{1i}', \lambda_1) - \mu_{z_1})^2}{\sigma_{z_1}^2}\Big\} \\
&\quad \times \Phi\Big(\frac{T(d, \lambda_2) - \mu_{z_2 \mid z_{1i}}}{\sigma_{z_2 \mid z_1}}\Big), \\
\tilde{t}_{i3}^{(z)} &= (2\pi \sigma_{z_2}^2)^{-1/2} \exp\Big\{-0.5 \frac{(T(z_{2i}', \lambda_2) - \mu_{z_2})^2}{\sigma_{z_2}^2}\Big\} \\
&\quad \times \Phi\Big(\frac{T(d, \lambda_1) - \mu_{z_1 \mid z_{2i}}}{\sigma_{z_1 \mid z_2}}\Big), \\
\tilde{t}_4^{(z)} &= \int_{-\infty}^{T(d, \lambda_2)} \Phi\Big[\frac{T(d, \lambda_1) - \big\{\mu_{z_1} + \frac{\rho_z \sigma_{z_1} (z_2 - \mu_{z_2})}{\sigma_{z_2}}\big\}}{\sigma_{z_1} \sqrt{1 - \rho_z^2}}\Big] \\
&\quad \times (2\pi \sigma_{z_2}^2)^{-1/2} \exp\Big\{-0.5 \frac{(z_2 - \mu_{z_2})^2}{\sigma_{z_2}^2}\Big\} \, dz_2,
\end{aligned}
$$

and it is assumed that (T(z₁, λ₁), T(z₂, λ₂)) are jointly normally distributed. Thus, in this case, the MLR test-statistic is

$$
\frac{\sup_{\bar{\mu}_{x_1}, \bar{\mu}_{x_2}, \bar{\mu}_{y_1}, \bar{\mu}_{y_2}, \bar{\sigma}_{x_1}^2, \bar{\sigma}_{x_2}^2, \bar{\sigma}_{y_1}^2, \bar{\sigma}_{y_2}^2, \bar{\rho}_x, \bar{\rho}_y, \lambda_1, \lambda_2} L(X'; \bar{\Theta}_X^{H_1}, \lambda_1, \lambda_2) \, L(Y'; \bar{\Theta}_Y^{H_1}, \lambda_1, \lambda_2)}{\sup_{\bar{\mu}_{x_2}, \bar{\mu}_{y_1}, \bar{\mu}_{y_2}, \bar{\sigma}_{x_1}^2, \bar{\sigma}_{x_2}^2, \bar{\sigma}_{y_1}^2, \bar{\sigma}_{y_2}^2, \bar{\rho}_x, \bar{\rho}_y, \lambda_1, \lambda_2} L(X'; \bar{\Theta}_X^{H_0}, \lambda_1, \lambda_2) \, L(Y'; \bar{\Theta}_Y^{H_0}, \lambda_1, \lambda_2)}.
$$
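To make the role of $T$ concrete, the sketch below applies the transformation to the observed values and to the detection limit, leaving NA entries untouched. It is only an illustration under assumptions introduced here: the names `box_cox` and `transform_sample` are hypothetical, NA is represented by `None`, and all measurements and the LOD are taken to be positive. Because $T(u, \lambda)$ is increasing in $u$ for $u > 0$, a measurement lies below $d$ exactly when its transformed value lies below $T(d, \lambda)$, which is why $T(d, \lambda_1)$ and $T(d, \lambda_2)$ take the place of $d$ in the terms above.

```python
import math

def box_cox(u, lam):
    """T(u, lambda) as defined in this appendix; requires u > 0."""
    if u <= 0:
        raise ValueError("T(u, lambda) is defined here only for u > 0")
    if lam == 0:
        return math.log(u)
    return (u ** lam - 1.0) / lam

def transform_sample(pairs, d, lam1, lam2):
    """Apply T to observed values and to the LOD d; None (NA) stays None."""
    new_pairs = [
        (None if z1 is None else box_cox(z1, lam1),
         None if z2 is None else box_cox(z2, lam2))
        for z1, z2 in pairs
    ]
    # The censoring thresholds on the transformed scale, one per biomarker.
    return new_pairs, box_cox(d, lam1), box_cox(d, lam2)

# Hypothetical data: (z1, z2) pairs with None for values below the LOD.
pairs = [(2.0, 4.0), (3.0, None), (None, None)]
transformed, d1, d2 = transform_sample(pairs, d=1.5, lam1=0.0, lam2=0.5)
print(transformed, d1, d2)
```

In a full analysis, λ₁ and λ₂ would be maximized jointly with the remaining parameters in both the numerator and the denominator of the ratio, as the statistic above indicates.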

References

Bamber D. The area above the ordinal dominance graph and the area below the receiver operating characteristic graph. Journal of Mathematical Psychology. 1975;12:387–415.
Choi S, Hall WJ, Schick A. Asymptotically uniformly most powerful tests in parametric and semi-parametric models. Annals of Statistics. 1996;24:841–861.
Gupta AK. Estimation of the mean and standard deviation of a normal population from a censored sample. Biometrika. 1952;39:260–273.
Lehmann EL. Testing Statistical Hypotheses. 2nd edition. John Wiley and Sons; New York: 1997.
Louis GM, Weiner JM, Whitecomb BW, Sperrazza R, Schisterman EF, Lobdell DT, Crickard K, Greizerstein H, Kostyniak PJ. Environmental PCB exposure and risk of endometriosis. Human Reproduction. 2005;20:279–285. doi: 10.1093/humrep/deh575.
Lubin JH, Colt JS, Camann D, Davis S, Cerhan JR, Severson RK, Bernstein L, Hartge P. Epidemiological evaluation of measurement data in the presence of detection limits. Environmental Health Perspectives. 2004;112:1691–1696. doi: 10.1289/ehp.7199.
Lyles RH, Williams JK, Chuachoowong R. Correlating two viral load assays with known detection limits. Biometrics. 2001;57:1238–1244. doi: 10.1111/j.0006-341x.2001.01238.x.
Lynn HS. Maximum likelihood inference for left-censored HIV RNA data. Statistics in Medicine. 2001;20:33–45. doi: 10.1002/1097-0258(20010115)20:1<33::aid-sim640>3.0.co;2-o.
Molenberghs G, Kenward MG. Missing Data in Clinical Studies. Wiley; Chichester, UK: 2007.
Molodianovitch K, Faraggi D, Reiser B. Comparing the areas under two correlated ROC curves: Parametric and non-parametric approaches. Biometrical Journal. 2006;48:745–757. doi: 10.1002/bimj.200610223.
Mumford SL, Schisterman EF, Vexler A, Liu A. Pooling biospecimens and limits of detection: Effects on ROC curve analysis. Biostatistics. 2006;7:585–598. doi: 10.1093/biostatistics/kxj027.
Perkins NJ, Schisterman EF, Vexler A. Receiver operating characteristic curve inference from a sample with a limit of detection. American Journal of Epidemiology. 2007;165:325–333. doi: 10.1093/aje/kwk011.
Reiser B, Faraggi D. Confidence intervals for the generalized ROC criterion. Biometrics. 1997;53:644–652.
Rubin DB. Multiple Imputation for Nonresponse in Surveys. Wiley; New York: 1987.
Schisterman EF, Vexler A, Whitcomb BW, Liu A. The limitations due to exposure detection limits for regression models. American Journal of Epidemiology. 2006;163:374–383. doi: 10.1093/aje/kwj039.
Shapiro D. The interpretation of diagnostic tests. Statistical Methods in Medical Research. 1999;8:113–134. doi: 10.1177/096228029900800203.
Verrill S, Johnson RA. Tables and large-sample distribution theory for censored-data correlation statistics for testing normality. Journal of the American Statistical Association. 1988;83:1192–1197.
Vexler A, Liu A, Schisterman EF. Efficient design and analysis of biospecimens with measurements subject to detection limit. Biometrical Journal. 2006;48:780–791. doi: 10.1002/bimj.200610266.
Wieand S, Gail MH, James BR, James KL. A family of nonparametric statistics for comparing diagnostic markers with paired or unpaired data. Biometrika. 1989;76:585–592.
