Abstract
Youden index, a linear function of sensitivity and specificity, provides a direct measurement of the highest diagnostic accuracy achievable by a biomarker. It is maximized at the cut-off point that optimizes the biomarker's overall classification rate while assigning equal weight to sensitivity and specificity. In this paper, we consider the problem of estimating the Youden index when only group-tested data are available. The unavailability of individual disease statuses poses a challenge, especially when there is differential false positives and negatives in disease screening. We propose both parametric and nonparametric procedures for estimation of the Youden index, and exemplify our methods by utilizing data from the National Health and Nutrition Examination Survey (NHANES) to evaluate the diagnostic ability of monocyte for predicting chlamydia.
Keywords: Diagnostic accuracy, differential misclassification, group testing, joint model, sensitivity, specificity
1. Introduction
In diagnostic medicine, biomarkers have been widely used to detect the presence of a disease or condition of interest. For example, CD4 and CD8 are commonly used as biomarkers in HIV/AIDs detection,1 and hemoglobin A1C (HbA1C) is considered as a biomarker for the presence and severity of hyperglycemia.2 The Receiver Operating Characteristic (ROC) curve is a widely used graphical tool that illustrates the discriminatory accuracy of continuous diagnostic biomarkers in distinguishing between diseased and healthy individuals. It operates on the principle that an individual is classified as diseased or healthy based on whether the corresponding biomarker value exceeds or falls below a specified threshold value. The effectiveness of any specific threshold value can be assessed through the measurement of the probability of a true positive (sensitivity) and negative (specificity). The ROC curve is a plot of the sensitivity versus 1-specificity over all possible threshold values. Both parametric and nonparametric methods have been used for estimating the ROC curves, see Pepe3 and Nakas et al.4
In addition to the area or partial area under the ROC curve (AUC or pAUC), which are commonly used global indices of diagnostic accuracy, the Youden index provides another useful measure in evaluating the biomarker's predictive capacity. This index is a function of sensitivity and specificity and is defined as , where YI is the Youden index, is the cut-off point, and the maximum is taken over all possible values. Ranging between 0 and 1, a YI with a value close to 1 (0) is indicative of a relatively large (small) diagnostic capacity. Several studies have addressed the problem of estimating the Youden index. Fluss et al.5 compared several estimation procedures (parametric and non-parametric); Molanes-López and Letón6 applied the modified delta method and adjusted empirical likelihood to estimate the Youden index and its associated threshold; Yin and Tian7,8 presented parametric and non-parametric approaches for joint confidence region estimation of sensitivity and specificity at the cut-off point as well as of AUC and Youden index; Yin et al.9 proposed a nonparametric method based on a kernel-smoothed estimate of the cumulative distribution functions. However, these studies focused on the case where individual disease statuses are all available.
Due to resource constraints and/or privacy considerations, individual-level disease data may not be accessible in many instances. In such scenarios, group testing has been recommended as a practical alternative. The approach, initially introduced by Dorfman10 for screening syphilis antigen in U.S. army recruits, not only safeguards patient confidentiality when individual results are not imperative but also enhances statistical efficiency. As a result, group testing has found widespread application in various fields, see Hepworth,11 Hughes-Oliver and Rosenberger,12 McCann and Tebbs,13 Turner et al.,14 Warasi et al.,15 Malinovsky and Albert,16 Malinovsky and Albert,17 Malinovsky et al.,18 Mokalled et al.,19 Bilder et al.20 Recently, Zhang et al.21 considered AUC estimation when only group-based test results on the disease status are available.
In the present paper, we are concerned with estimating Youden index in the setting of group-tested data, where disease statuses exist at group level while biomarkers are available for each individual subjects. Similar to when AUC is of interest, we are faced with challenges arisen from the unavailability of individual disease status and differential miscalssification that may depend on the group size and number of diseased individuals within each group.
The paper is organized as follows. In Section 2, we establish the procedures to obtain the Youden index via Normal, Gamma, Log-normal, and nonparametric estimation with group-tested data, respectively. Simulation studies will be presented in Section 3. In Section 4, we illustrate our methods with data on chlamydia detection from NHANES. Conclusions and further research discussions will be presented in Section 5.
2. Methodology
2.1. Youden index
Suppose is the concentration level of a continuous biomarker with distribution function and probability density function (pdf) . Meanwhile, let have conditional distributions and in non-diseased and diseased populations, respectively, with and the corresponding density functions. Let be the true binary disease status so that prevalence . Further, let be the observed disease status so that the specificity and sensitivity of the lab test are and , respectively. It follows . For any given cut-off point , the specificity and sensitivity of a biomarker can be written as
(1) |
and the Youden index as
(2) |
The value of that achieves this maximum will be considered the optimal threshold and the estimation of YI is carried out by estimating and and substituting them in (1):
(3) |
where and are the estimators of and , respectively.
In this paper, our main interest is to estimate the Youden index in the setting of group-tested data. According to (3), this requires us to estimate and first. To this end, we will introduce three different estimation procedures, which contains both parametric and nonparametric methods.
2.2. Group-tested data
Consider a total of subjects randomly allocated into groups, each with a size denoted as for , where . The continuous variable is observed for each subject, resulting in observations , . Let and denote the group-tested disease results by . Define as the true disease status of the th subject in the th group, and as the true disease status of group . For each group , we assume that the specificity of the lab test remains constant, i.e. . The sensitivity is differential, depending on the group size and the number of diseased subjects in the group, i.e. , as detailed in Hwang,22 Hung and Swallow,23 Haber et al.24
Given the true group disease status , we assume that biomarker of subjects in a group and the group-tested results are independent so that the conditional probability density of can be simplified as
where . Denoting as the joint density function of and and letting be an indicator function, the corresponding likelihood function for the observed data can be written as
(4) |
where
(5) |
For notation brevity, we assume equal group sizes, ; all subsequent results can be extended when group size varies. The log-likelihood function can be written as follows:
(6) |
where is the vector containing all parameters related to and . We use the ‘optim’ function in R to estimate and hence and . Consequently, estimate of Youden index can be obtained according to (2) and (3).
2.3. Estimation
2.3.1. Normal estimation
Assume follows Normal distributions and with means and and standard deviations and for non-diseased and diseased populations, respectively. Here we suppose . For , one may simply switch diseased with non-diseased. and in (1) can then be written as
(7) |
for a given cut-point , where denotes the standard Normal distribution function. When , the optimal cut-off point is the midpoint between diseased and non-diseased means, i.e. . Otherwise, it is given as
(8) |
where is the natural logarithm funtcion, and . Let , then the Youden index YI occurs at if and at otherwise, see Fluss et al.5 Given , , , and consequently and , estimate of the optimal cut-off point can be obtained through (8) and that of the Youden index through (3).
2.3.2. Gamma estimation
Assume follows Gamma distribution and with shapes and and rates and for non-diseased and diseased populations, respectively. The optimal cut-off point can be explicitly expressed as follows.
- When ,
where . When , one may simply switch diseased with non-diseased. - When ,
Otherwise, can be obtained from (2) directly.
The estimation procedure of shapes and rates in and is similar to the Normal estimation case above, and the optimal cut-off point and corresponding estimator of Youden index can be obtained similarly.
2.3.3. Log-normal estimation
Assume follows Log-normal distribution and with log-means and , and log-standard deviation and for non-diseased and diseased populations, respectively. The optimal cut-off point and the corresponding Youden index can be obtained similarity to that in the Normal estimation.
2.3.4. Nonparametric estimation
To estimate and nonparametrically, we follow the estimation procedure proposed by Zhang et al.21 This involves two steps: 1) Estimate the distribution functions and as well as the prevalence , and 2) Estimate and nonparametrically.
Based on Zhang et al.,21 the number of groups that are tested positive, , follows a binomial distribution with size and probability
Let and . It follows that the log-likelihood function based on the group testing results is . Then the MLE of can be obtained by maximizing . Based on asymptotic normality of MLE proposed by van der Vaart,25
where
Consequently, we have
The final nonparametric estimators of and are
where
These estimators of and are step functions. We estimate the optimal cut-off point by locating the value of that maximizes and then obtain the corresponding Youden index estimate.
3. Simulations
We conducted extensive simulation studies to evaluate the performance of our proposed approach. The total number of subjects is and we generated the true disease status for all subjects from a Bernoulli distribution with probability , where the prevalence was set to 0.02 or 0.03. The group size was chosen from {1, 2, 5}, with corresponding to individual testing. The sensitivity and specificity were selected from {0.90, 0.95, 1.00}. We specified using the model of Hung and Swallow23 as , where represents the number of diseased individuals in a group and . For the observed disease status , we randomly divided the subjects into groups of size and generated the group-tested result from a Bernoulli distribution with probability for groups with all and for groups with at least one .
We considered three data generating scenarios for biomarker : (1) Normal data, where has a Normal distribution , with mean and standard deviation in non-diseased population and and in diseased population; (2) Gamma data, where has a Gamma distribution , with shape and rate in non-diseased population, and and in diseased population; (3) Log-normal data, where follows a Log-normal distribution, with log-mean 1 and log-standard deviation 0.3 in non-diseased population, and log-mean 1.4 and log-standard deviation 1 in diseased population. The true values of Youden index in these three scenarios are 0.6590, 0.6430 and 0.4135, respectively.
We examined the proposed four estimation procedures (Normal estimation, Gamma estimation, Log-normal estimation, and nonparametric estimation) across varying group sizes , different prevalence rates , and misclassification rates . Our assessments were conducted using widely used criteria, including bias, standard error (SE), root mean square error (RMSE), 95% coverage probability (CP), and the average length of confidence intervals (ACIL). The bootstrap method was employed to compute CP and ACIL. In total, 300 simulated data sets were generated, and within each simulation, 500 bootstrap replicates were created.
We first report simulation results under the Normal data scenario. In this scenario, Gamma or Log-normal estimation were not considered as can have both positive and negative values. Table 1 shows the performance of the prevalence estimator. It is easy to see that the estimates under the Normal estimation method are all close to the true values, with coverage probabilities close to the nominal level. Across board, the estimator exhibits increased statistical efficiency (indicated by smaller SE) as the misclassification error ( and ) decreases. For example, under Normal estimation, when prevalence is , misclassification rates are both 0.1 , and the group size is , the relative efficiency of the estimator is about 0.85 (0.1983/0.2336) in comparison to . We found similar performance under nonparametric estimation. For example, when , , and , the relative efficiency of the estimator is 0.66 (0.2480/0.3781). In addition, under the same situation, the RMSE under Normal estimation is better than that under nonparametric estimation. For example, when , and , the RMSE is 0.1859 in the scenario of Normal estimation, which is less than 0.2083 in nonparametric estimation.
Table 1.
Simulation results with Normal data for the prevalence estimator based on the group and individual testing approaches: estimate (Est), bias (Bias), standard error (SE), root mean square error (RMSE), coverage probability (CP) and average confidence interval length (ACIL) of the estimator for the biomarker. Entries of Est, Bias, SE are multiplied by 100 for better presentation. is the prevalence, and are specificity and sensitivity, and is the size of each group.
Normal | Nonparametric | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Est | Bias | SE | RMSE | CP(ACIL) | Est | Bias | SE | RMSE | CP(ACIL) | ||
0.90 | 1 | 1.9645 | −0.0355 | 0.2336 | 0.2363 | 93.00%(0.0091) | 1.9997 | −0.0003 | 0.3620 | 0.3620 | 95.67%(0.0143) |
2 | 1.9841 | −0.0159 | 0.2214 | 0.2219 | 94.33%(0.0086) | 2.0087 | 0.0087 | 0.2899 | 0.2901 | 93.33%(0.0111) | |
5 | 1.9886 | −0.0114 | 0.1983 | 0.1986 | 95.00%(0.0078) | 1.9951 | −0.0049 | 0.2227 | 0.2227 | 96.33%(0.0089) | |
0.95 | 1 | 1.9717 | −0.0283 | 0.1910 | 0.1931 | 95.00%(0.0076) | 1.9908 | −0.0092 | 0.2434 | 0.2435 | 97.33%(0.0100) |
2 | 1.9881 | −0.0119 | 0.1859 | 0.1863 | 94.33%(0.0071) | 2.0005 | 0.0005 | 0.2083 | 0.2083 | 94.33%(0.0082) | |
5 | 1.9859 | −0.0141 | 0.1689 | 0.1695 | 96.00%(0.0066) | 1.9883 | −0.0117 | 0.1734 | 0.1738 | 96.33%(0.0070) | |
1.00 | 1 | 2.0045 | 0.0045 | 0.1257 | 0.1258 | 96.33%(0.0050) | 2.0043 | 0.0043 | 0.1263 | 0.1263 | 96.33%(0.0050) |
2 | 2.0058 | 0.0058 | 0.1281 | 0.1283 | 96.67%(0.0051) | 2.0057 | 0.0057 | 0.1278 | 0.1279 | 96.33%(0.0051) | |
5 | 2.0056 | 0.0056 | 0.1308 | 0.1309 | 96.00%(0.0052) | 2.0065 | 0.0065 | 0.1305 | 0.1307 | 96.33%(0.0053) | |
0.90 | 1 | 2.9775 | −0.0225 | 0.2636 | 0.2646 | 93.67%(0.0102) | 3.0117 | 0.0117 | 0.3781 | 0.3783 | 94.67%(0.0147) |
2 | 2.9921 | −0.0079 | 0.2460 | 0.2461 | 95.00%(0.0095) | 3.0154 | 0.0154 | 0.3018 | 0.3022 | 94.67%(0.0118) | |
5 | 2.9881 | −0.0119 | 0.2252 | 0.2255 | 95.33%(0.0089) | 3.0038 | 0.0038 | 0.2480 | 0.2481 | 97.00%(0.0098) | |
0.95 | 1 | 2.9832 | −0.0168 | 0.2115 | 0.2122 | 96.67%(0.0086) | 2.9964 | −0.0036 | 0.2581 | 0.2581 | 96.33%(0.0106) |
2 | 2.9987 | −0.0013 | 0.2041 | 0.2041 | 95.00%(0.0081) | 3.0087 | 0.0087 | 0.2215 | 0.2217 | 95.00%(0.0089) | |
5 | 2.9870 | −0.0130 | 0.1909 | 0.1914 | 95.33%(0.0076) | 2.9952 | −0.0048 | 0.2001 | 0.2002 | 96.00%(0.0081) | |
1.00 | 1 | 3.0107 | 0.0107 | 0.1556 | 0.1559 | 94.67%(0.0061) | 3.0108 | 0.0108 | 0.1555 | 0.1558 | 94.67%(0.0061) |
2 | 3.0130 | 0.0130 | 0.1585 | 0.1591 | 94.33%(0.0062) | 3.0134 | 0.0134 | 0.1595 | 0.1601 | 95.33%(0.0062) | |
5 | 3.0154 | 0.0154 | 0.1613 | 0.1620 | 94.33%(0.0064) | 3.0157 | 0.0157 | 0.1649 | 0.1656 | 96.00%(0.0065) |
Table 2 summarizes the performance of the Youden index estimator under Normal data scenario. As expected, Youden index estimator in group testing has better statistical efficiency than that in the individual testing. For example, under Normal estimation, when prevalence is , misclassification rate are both equal to 0.05 , and the group size is , the relative efficiency of the estimator is 0.85 (0.0387/0.0448) in comparison to . Results are similar using nonparametric estimation method. For example, when prevalence is , , and , the relative efficiency of the estimator is 0.76 (0.0633/0.0831). However, when there is no misclassification , the superiority of group testing disappears, with either larger SE or RMSE compared to individual testing. In addition, under the same situation, the RMSE under Normal estimation is better than that under nonparametric estimation. For example, when , and , the RMSE is 0.0290 in the scenario of Normal estimation, which is less than 0.0556 in nonparametric estimation.
Table 2.
Simulation results with Normal data for the Youden index estimator based on the group and individual testing approaches: estimate (Est), bias (Bias), standard error (SE), root mean square error (RMSE), coverage probability (CP) and average confidence interval length (ACIL) of the estimator for the biomarker. is the prevalence, and are specificity and sensitivity, and is the size of each group.
Normal | Nonparametric | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|
Est | Bias | SE | RMSE | CP(ACIL) | Est | Bias | SE | RMSE | CP(ACIL) | ||
0.90 | 1 | 0.6733 | 0.0143 | 0.0565 | 0.0583 | 91.00%(0.2125) | 0.6995 | 0.0405 | 0.1203 | 0.1270 | 87.33%(0.4282) |
2 | 0.6681 | 0.0091 | 0.0487 | 0.0495 | 94.00%(0.1984) | 0.6911 | 0.0321 | 0.0967 | 0.1019 | 92.33%(0.3599) | |
5 | 0.6658 | 0.0068 | 0.0459 | 0.0465 | 93.33%(0.1770) | 0.6967 | 0.0377 | 0.0916 | 0.0991 | 89.33%(0.3277) | |
0.95 | 1 | 0.6701 | 0.0111 | 0.0448 | 0.0462 | 94.67%(0.1779) | 0.6876 | 0.0287 | 0.0843 | 0.0891 | 93.00%(0.3230) |
2 | 0.6660 | 0.0070 | 0.0421 | 0.0427 | 95.33%(0.1629) | 0.6850 | 0.0260 | 0.0683 | 0.0730 | 94.67%(0.2674) | |
5 | 0.6642 | 0.0052 | 0.0387 | 0.0391 | 94.67%(0.1465) | 0.6887 | 0.0297 | 0.0671 | 0.0734 | 93.00%(0.2567) | |
1.00 | 1 | 0.6590 | 0.0000 | 0.0234 | 0.0234 | 95.00%(0.0914) | 0.6666 | 0.0076 | 0.0289 | 0.0299 | 94.00%(0.1091) |
2 | 0.6590 | 0.0000 | 0.0250 | 0.0250 | 95.67%(0.0964) | 0.6708 | 0.0118 | 0.0328 | 0.0348 | 92.33%(0.1289) | |
5 | 0.6595 | 0.0005 | 0.0281 | 0.0281 | 93.33%(0.1061) | 0.6801 | 0.0211 | 0.0440 | 0.0488 | 93.00%(0.1780) | |
0.90 | 1 | 0.6680 | 0.0090 | 0.0382 | 0.0392 | 94.67%(0.1576) | 0.6848 | 0.0258 | 0.0831 | 0.0870 | 95.33%(0.3251) |
2 | 0.6653 | 0.0063 | 0.0365 | 0.0370 | 93.67%(0.1426) | 0.6826 | 0.0236 | 0.0633 | 0.0676 | 96.33%(0.2682) | |
5 | 0.6653 | 0.0063 | 0.0335 | 0.0341 | 93.67%(0.1306) | 0.6876 | 0.0286 | 0.0616 | 0.0679 | 91.00%(0.2501) | |
0.95 | 1 | 0.6656 | 0.0066 | 0.0317 | 0.0324 | 95.67%(0.1291) | 0.6773 | 0.0183 | 0.0556 | 0.0585 | 94.33%(0.2273) |
2 | 0.6637 | 0.0047 | 0.0298 | 0.0302 | 95.67%(0.1174) | 0.6785 | 0.0195 | 0.0472 | 0.0511 | 94.67%(0.1926) | |
5 | 0.6649 | 0.0059 | 0.0284 | 0.0290 | 93.67%(0.1109) | 0.6828 | 0.0238 | 0.0503 | 0.0556 | 93.67%(0.1981) | |
1.00 | 1 | 0.6600 | 0.0010 | 0.0193 | 0.0193 | 94.00%(0.0745) | 0.6661 | 0.0071 | 0.0238 | 0.0248 | 92.67%(0.0892) |
2 | 0.6599 | 0.0009 | 0.0203 | 0.0203 | 94.67%(0.0789) | 0.6690 | 0.0100 | 0.0280 | 0.0297 | 92.33%(0.1065) | |
5 | 0.6601 | 0.0011 | 0.0232 | 0.0232 | 93.67%(0.0867) | 0.6775 | 0.0185 | 0.0395 | 0.0437 | 92.33%(0.1492) |
Besides that, results of prevalence and Youden index estimators in Gamma/Log-normal data are presented in Appendix A of supplemental. We also conducted additional simulations with a smaller sample size , higher prevalences , and a set of expanded equal and unequal sensitivity and specificity values in Appendix B of supplemental. In addition, a new simulation where the sensitivity and specificity are mis-specified are presented in Appendix C of supplemental.
In summary, the simulations demonstrate these findings: (1) all proposed estimation method based on group testing can be superior (in both accuracy and efficiency) to those based on individual testing; (2) as misclassification rates decrease, the proposed estimator becomes more efficient; (3) as expected, using the parametric distribution that is the same as the generating distribution (e.g. Normal estimation for Normal data) results in the best estimators. Otherwise, Nonparametric estimation performs the best; and (4) it is easy to see that when sample size decreases and prevalence increases, the group testing will have lower precision than the individual testing. This suggests that, if the sample size is small, the group testing will perform well only when the prevalence is relatively low; see Liu et al.26
4. Application
We applied our proposed method to genital chlamydia infections and utilized data from the National Health and Nutrition Examination Survey (NHANES), a comprehensive population study designed to assess the health and nutritional status of individuals throughout the United States. Further details can be found at https://www.cdc.gov/nchs/nhanes/index.htm.
In the NHANES study, urine samples were collected from individuals between the ages of 18 and 39, and tests for genital chlamydia infections were conducted using the DNA strand displacement amplification method. The publicly available dataset includes the assay results of eligible participants. Chlamydia, which is caused by Chlamydia trachomatis, is a common sexually transmitted disease that has the potential to influence the levels of monocyte and erythrocyte sedimentation rate, see Łój et al.,27 Park et al.28 In our analysis, we considered using monocyte as a biomarker for detecting chlamydia infections.
We gathered data on chlamydia and monocyte from six consecutive and independent surveys conducted as part of NHANES, spanning the years 1999–2000, 2001–2002, 2003–2004, 2005–2006, 2007–2008, and 2009–2010. To account for the potential impact of oversampling and the intricate survey design, we implemented a resampling technique on the data from each two-year survey dataset. This resampling process involved replacement and utilized sampling weights proportional to the probabilities, while maintaining the original dataset's sample size. Following this, we merged these resampled datasets to construct a comprehensive sample.
After removing those with missing values of chlamydia and monocyte, independent observations of were included in our final analysis. Among these observations, 220 subjects tested positive for chlamydia. It's worth noting that the NHANES study did not employ group testing to detect chlamydia and we presented a hypothetical scenario using group-tested data. This approach is justifiable because the self-antibody test and the three biomarkers relied on different specimens. We independently generated group-tested outcomes for disease presence. To achieve this, we considered the testing results in the dataset as the true disease statuses of the subjects and randomly assigned the self-antibody test specimens to groups of size . The values of the biomarker for each subject remained unchanged.
Since we do not have a reasonable distributional assumption on the monocyte data, we chose to use the Nonparametric estimation method. For , we estimated the prevalence and Youden index as well as there standard errors and 95% confidence intervals (95% CI) based on individual and group-tested results. We set specificity of and a sensitivity of (see Haugland et al.29) and assumed that with .
The prevalence and Youden index estimators for monocyte, derived from both individual and group-tested results, are presented in Table 3. The standard errors of estimators are computed using 1000 bootstrap replicates. The table reveals that the prevalence and Youden index estimates achieve better efficiency when , with a relative efficiency of 0.88 (0.1331/0.1506) in prevalence and 0.94 (3.8543/4.1126) in Youden index, compared to the individual-tested results .
Table 3.
Nonparametric estimators for the Chlamydia data: estimates (Est), standard error (SE), and 95% Confidence Interval (95% CI) of Youden index for the biomarker monocyte based on individual and group testing approaches. Entries of SE are multiplied 100 for better presentation. is the size of each group.
Youden index | ||||||
---|---|---|---|---|---|---|
Est | SE | 95% CI | Est | SE | 95% CI | |
1 | 0.0155 | 0.1506 | (0.0125, 0.0184) | 0.0871 | 4.1126 | (0.0065, 0.1677) |
2 | 0.0166 | 0.1479 | (0.0137, 0.0195) | 0.1142 | 4.9292 | (0.0176, 0.2109) |
5 | 0.0169 | 0.1331 | (0.0143, 0.0195) | 0.0462 | 3.8543 | (0, 0.1217) |
5. Summary and discussion
Youden index is an important measure of the accuracy of a diagnostic biomarker. In the present paper, we considered the problem of estimating Youden index of a continuous biomarker when only group-based test results on the disease status are available, in order to save cost and/or protect patients' confidentiality. The biomarker values are observed on the individual levels, particularly when the cost of measuring the biomarker is relatively low.
We proposed both parametric and nonparametric estimation procedures and observed promising performance through simulations. As expected, when the distributional assumptions of the biomarker are reasonable, estimators under these distributions perform well. Youden index remains unchanged under monotonic transformation of the data. If the data follow approximately Normal or Gamma distribution after some monotonic transformation (e.g. box-cox transformation), then our methods can be applied. In situations where we lack reasonable distributional insight or when we desire flexible assumptions, the nonparametric approach provides an attractive estimation procedure. Compared with individual testing, group testing strategy in general not only reduces the cost of disease screening but also provides more efficient estimation of the Youden index, when the disease's prevalence is low and the test for screening for the disease is imperfect. Thus, group testing is an appealing alternative to the conventional individual testing.
The conclusion that considering group-tested data would lead to more precise Youden index estimation is somehow counter-intuitive but can be explained. If misclassification error does not exists, the individual testing provides most information on the disease and thus is always more precise than group testing. However, when there is misclassification error, the individual testing requires more tests be performed and could more likely generate higher testing errors (i.e. false positives and false negatives) than the group testing, which requires fewer tests be performed. This could potentially lead to decreased precision of estimation for individual testing. When comparing two different group sizes, it requires even smaller prevalence for group testing with larger group size to perform better than individual testing.
This paper only considered three pre-determined group sizes, i.e. . Exploring the optimal group size could be an interesting avenue for future research. Moreover, the current paper focused on a single biomarker. Consideration of multiple biomarkers in the estimation of Youden index can be beneficial and constitutes another promising future work direction. Finally, non-equal (random) group sizes could be another interesting extension.
Supplementary Material
Supplemental materials for this article are available online.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of this article.
Footnotes
Declaration of conflicting interests
The author(s) declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
References
- 1.Pastor L, Urrea V, Carrillo J, et al. Dynamics of CD4 and CD8 T-cell subsets and inflammatory biomarkers during early and chronic HIV infection in Mozambican adults. Front Immunol 2018; 8. DOI: 10.3389/fimmu.2017.01925. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Lyons TJ and Basu A. Biomarkers in diabetes: hemoglobin A1c, vascular and tissue markers. Transl Res 2012; 159: 303–312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Pepe MS. Statistical evaluation of medical tests for classification and prediction. Oxford: Oxford University Press, 2003. [Google Scholar]
- 4.Nakas C, Bantis L and Gatsonis C. ROC analysis for classification and prediction in practice. Chicago: Chapman & Hall, 2023. [Google Scholar]
- 5.Fluss R, Faraggi D and Reiser B. Estimation of the Youden index and its associated cutoff point. Biom J 2005; 4: 458–472. [DOI] [PubMed] [Google Scholar]
- 6.Molanes-López EM and Letón E. Inference of the Youden index and associated threshold using empirical likelihood for quantiles. Stat Med 2011; 30: 2467–2480. [DOI] [PubMed] [Google Scholar]
- 7.Yin J and Tian L. Joint confidence region estimation for area under ROC curve and Youden index. Stat Med 2013; 33: 985–1000. [DOI] [PubMed] [Google Scholar]
- 8.Yin J and Tian L. Joint inference about sensitivity and specificity at the optimal cut-off point associated with Youden index. Comput Stat Data Anal 2014; 77: 1–13. [Google Scholar]
- 9.Yin J, Samawi H and Linder D. Improved nonparametric estimation of the optimal diagnostic cut-off point associated with the Youden index under different sampling schemes. Biom J 2016; 58: 915–934. [DOI] [PubMed] [Google Scholar]
- 10.Dorfman R The detection of defective members of large populations. Ann Math Stat 1943; 14: 436–440. [Google Scholar]
- 11.Hepworth G Exact confidence intervals for proportions estmated by group testing. Biometrics 1996; 52: 1134–1146. [Google Scholar]
- 12.Hughes-Oliver JM and Rosenberger WF. Efficient estimation of the prevalence of multiple rare traits. Biometrika 2000; 87: 315–327. [Google Scholar]
- 13.McCann MH and Tebbs JM. Pairwise comparisons for proportions estimated by pooled testing. J Stat Plan Inference 2007; 137: 1278–1290. [Google Scholar]
- 14.Turner DW, Stamey JD and Young DM. Classic group testing with cost for grouping and testing. Comput Math Appl 2009; 58: 1930–1935. [Google Scholar]
- 15.Warasi M, Tebbs J, McMahan C, et al. Estimating the prevalence of multiple diseases from two-stage hierarchical pooling. Stat Med 2016; 35: 3851–3864. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Malinovsky Y and Albert PS. A note on the minimax solution for the two-stage group testing problem. Am Stat 2015; 69: 45–52. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Malinovsky Y and Albert PS. Revisiting nested group testing procedures: new results, comparisons, and robustness. Am Stat 2019; 73: 117–125. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Malinovsky Y, Haber G and Albert PS. An optimal design for hierarchical generalized group testing. J R Stat Soc Ser C 2020; 69: 607–621. [Google Scholar]
- 19.Mokalled S, McMahan C, Tebbs J, et al. Incorporating the dilution effect in group testing regression. Stat Med 2021; 40: 2540–2555. [DOI] [PubMed] [Google Scholar]
- 20.Bilder C, Tebbs J and McMahan C. Discussion on “is group testing ready for prime-time in disease identification?”. Stat Med 2021; 40: 3881–3886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Zhang W, Liu A, Li Q, et al. Nonparametric estimation of distributions and diagnostic accuracy based on group-tested results with differential misclassification. Biometrics 2020; 76: 1147–1156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hwang FK. Group testing with a dilution effect. Biometrika 1976; 63: 671–680. [Google Scholar]
- 23.Hung M and Swallow W. Robustness of group testing in the estimation of proportions. Biometrics 1999; 55: 231–237. [DOI] [PubMed] [Google Scholar]
- 24.Haber G, Malinovsky Y and Albert PS. Is group testing ready for prime-time in disease identification? Stat Med 2021; 40: 3865–3880. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.van der Vaart AW. Asymptotic statistics. Cambridge: Cambridge University Press, 1998. [Google Scholar]
- 26.Liu C, Liu A, Zhang Z, et al. Optimality of group testing in the presence of misclassification. Biometrika 2012; 99: 245–251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Łój B, Brodowska A, Ciećwież S, et al. The role of serological testing for Chlamydia trachomatis in differential diagnosis of pelvic pain. Ann Agric Environ Med 2016; 23: 506–510. [DOI] [PubMed] [Google Scholar]
- 28.Park ST, Lee SW, Kim MJ, et al. Clinical characteristics of genital Chlamydia infection in pelvic inflammatory disease. BMC Womens Health 2017; 17: 5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Haugland S, Thune T, Fosse B, et al. Comparing urine samples and cervical swabs for Chlamydia testing in a female population by means of Strand Displacement Assay (SDA). BMC Womens Health 2010; 10: 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.