Informative Dorfman Screening

Christopher S McMahan; Joshua M Tebbs; Christopher R Bilder

doi:10.1111/j.1541-0420.2011.01644.x

. Author manuscript; available in PMC: 2013 Mar 1.

Published in final edited form as: Biometrics. 2011 Jul 15;68(1):287–296. doi: 10.1111/j.1541-0420.2011.01644.x

Informative Dorfman Screening

Christopher S McMahan ¹, Joshua M Tebbs ^1,^*, Christopher R Bilder ²

PMCID: PMC3197971 NIHMSID: NIHMS306267 PMID: 21762119

Summary

Since the early 1940s, group testing (pooled testing) has been used to reduce costs in a variety of applications, including infectious disease screening, drug discovery, and genetics. In such applications, the goal is often to classify individuals as positive or negative using initial group testing results and the subsequent process of decoding of positive pools. Many decoding algorithms have been proposed, but most fail to acknowledge, and to further exploit, the heterogeneous nature of the individuals being screened. In this paper, we use individuals’ risk probabilities to formulate new informative decoding algorithms which implement Dorfman retesting in a heterogeneous population. We introduce the concept of “thresholding” to classify individuals as “high” or “low risk,” so that separate, risk-specific algorithms may be used, while simultaneously identifying pool sizes that minimize the expected number of tests. When compared to competing algorithms which treat the population as homogeneous, we show that significant gains in testing efficiency can be realized with virtually no loss in screening accuracy. An important additional benefit is that our new procedures are easy to implement. We apply our methods to chlamydia and gonorrhea data collected recently in Nebraska as part of the Infertility Prevention Project.

Keywords: Dorfman retesting, Group testing, Infertility Prevention Project, Pooled testing, Sensitivity, Specificity

1. Introduction

Chlamydia and gonorrhea are the two most common sexually transmitted diseases (STDs) in the United States. In addition to the 1.5 million new infections that are diagnosed annually, it has been estimated that an additional 2.1 million cases go unreported each year (Centers for Disease Control and Prevention, ^{CDC, 2009}). Both infections are usually asymptomatic, so positive individuals, perhaps unaware of their status, can spread the infections to others. Left untreated, both infections can lead to serious medical conditions, including pelvic inflammatory disease (PID), ectopic pregnancy, and sterility (Kacena et al., 1998a, 1998b). It also has been suggested that chlamydia and gonorrhea facilitate the transmission of other STDs, including HIV (Farley, Cohen, and Elkins, 2003).

Many countries have developed national programs to screen for chlamydia and gonorrhea infections. In the United States, one of the largest such programs is the Infertility Prevention Project (IPP), which is funded by the CDC. Since its origination in 1988, the primary goals of the IPP have been to screen for chlamydia and gonorrhea in high risk populations and to administer treatment to those who are infected. All 50 states participate in the IPP. The state of Nebraska does so through its Sexually Transmitted Diseases and Infertility Control Program. At clinic sites throughout the state, urine and swab specimens are collected on individuals. These individual specimens are then transported to the Nebraska Public Health Laboratory (NPHL) in Omaha for testing.

With increasing public health costs, our medical colleagues at the NPHL have expressed an interest in adopting group testing (pooled testing) for chlamydia and gonorrhea identification. Group testing dates back to Dorfman (1943), who proposed that it be used to screen World War II soldiers for syphilis. When testing for low prevalence diseases, pooling specimens (e.g., blood, urine, swabs, etc.) through group testing is a novel way to increase screening efficiency, and, when compared to individual testing, there is overwhelming evidence that group testing can maintain high levels of classification accuracy. Since Dorfman’s seminal work, group testing has been used to screen individuals for various STDs (see, e.g., Kacena et al., 1998a, 1998b; Mine et al., 2003; Pilcher et al., 2005) and also for other infectious agents including West Nile Virus (Alter, 2004) and the avian influenza virus H5N1 (Hourfar et al., 2007). In addition to blood/plasma donation screening in the United States and elsewhere (Tabor and Epstein, 2002; Mine et al., 2003), group testing has been utilized in screening individuals for drug use (Gastwirth and Johnson, 1994), in preventing the potential spread of bioterrorist agents (Schmidt et al., 2005), in genetics (Gastwirth, 2000), and in drug discovery (Xie et al., 2001; Remlinger et al., 2006).

The group testing literature contains a myriad of classification (decoding) algorithms which vary in their level of complexity. Kim et al. (2007) provide an excellent review of existing algorithms and derive their operating characteristics in the presence of testing error. In practice, testing errors (false positives/negatives) can occur when diagnostic tests do not have perfect sensitivity and specificity. Kim et al. (2007) consider array-based and hierarchical algorithms. Array testing involves placing individual specimens in a square or rectangular array and testing its rows and columns. These responses give information about the location of positive individuals, although further retesting is needed if there are ambiguities. A hierarchical algorithm involves retesting non-overlapping subsets of individuals from positive pools, possibly in multiple stages, until each individual is classified as positive or negative. Dorfman’s original procedure, where each individual is retested separately, is an example of a two-stage hierarchical algorithm. Higher-stage hierarchical algorithms often improve efficiency but are generally more difficult to implement. Likely because of its simplicity, Dorfman’s procedure is the most commonly used classification algorithm in practice.

Statistical research in the group testing classification problem has generally proceeded under the assumption that each individual has the same probability of infection; see, e.g., Kim et al. (2007). However, in most screening situations, including the Nebraska IPP, there are covariates available that offer information as to which individuals have a higher risk of infection. For example, it is well known that age, race, gender, socioeconomic status, and the number of sexual partners are excellent predictors of both chlamydia and gonorrhea positivity (CDC, 2009). Therefore, if the goal is to develop efficient screening protocols for the Nebraska IPP and elsewhere, one should exploit this available information.

Among all group testing algorithms in the literature, however, very few acknowledge population heterogeneity. Hwang (1975) generalized Dorfman’s original procedure to allow individuals to have different levels of risk and proposed an optimal grouping strategy that minimizes the number of tests needed to classify all individuals. Unfortunately, Hwang’s approach assumes that the probability of testing error is zero, which is unreasonable in the Nebraska IPP and in most screening applications involving human subjects. More recently, Bilder, Tebbs, and Chen (2010) have proposed a framework to incorporate covariate information using modifications of Sterrett’s (1957) algorithm. However, while the “informative retesting” procedures in Bilder et al. (2010) are effective at reducing the number of tests, Sterrett-type algorithms are inherently complex and may not be logistically feasible in some screening environments. This is especially true for the NPHL, where the use of complicated decoding algorithms could considerably lengthen the time needed to screen all individuals.

In this paper, we propose new hierarchical screening procedures that utilize Dorfman retesting in a heterogeneous population. Unlike Hwang’s approach, our algorithms allow for imperfect diagnostic testing, and when compared to the higher-stage Sterrett-type algorithms in Bilder et al. (2010), ours are easier to implement. In Section 2, we describe a general informative Dorfman algorithm that uses individual risk probabilities to assign individuals to pools, and we derive the operating characteristics of this algorithm in the presence of testing error. In Section 3, we propose three versions of this algorithm which incorporate two important criteria: the determination of optimal pool sizes and the use of risk thresholds (cutoffs) to classify subjects according to their level of risk. In Section 4, we provide simulation evidence to demonstrate the effectiveness of our new procedures, and, in Section 5, we illustrate their use with data from the Nebraska IPP. In Section 6, we provide a summary discussion.

2. Informative Dorfman Algorithm

2.1 Preliminaries

Suppose that N individuals are to be screened for the presence of a binary trait, such as chlamydia status, and denote by p_i the (true) probability of positivity for the ith individual $I_{i}$ , i = 1, 2, …, N. The true status of $I_{i}$ is a binary random variable denoted by ${\tilde{g}}_{i}$ , where $pr ({\tilde{g}}_{i} = 1) = p_{i}$ . We assume throughout that the statuses ${\tilde{g}}_{i}$ are independent random variables. To derive the operating characteristics of our algorithms, we initially assume that the true probabilities p_i are known (this assumption is relaxed later; see Section 5). We start by ordering the N individuals from low to high in terms of their risk probabilities, producing $I_{(1)}, I_{(2)}, \dots, I_{(N)}$ corresponding to $p_{(1)} \leq p_{(2)} \leq \dots \leq p_{(N)}$ . Ordering the probabilities isolates those individuals who are at the highest risk for infection. This allows us to determine optimal pool sizes and formulate the use of thresholding; see Section 3.

Let c_j denote the pool size for the jth pool, j = 1, 2, …, J. Dorfman’s (1943) original strategy begins by assigning each individual, at random, to exactly one pool. Those pools which test negative are declared to contain all negative individuals and are not examined further. Positive pools are decoded by retesting each subject individually. Instead of random assignment, our Informative Dorfman (ID) algorithm specifies that individuals are assigned to pools based on their ordered risk probabilities. In particular, pool $P_{1}$ consists of individuals $I_{(1)}, I_{(2)}, \dots, I_{(c_{1})}$ , pool $P_{2}$ consists of individuals $I_{(c_{1} + 1)}, I_{(c_{1} + 2)}, \dots, I_{(c_{1} + c_{2})}$ , and so on; in general, pool $P_{j}$ consists of the c_j lowest risk subjects which remain after constructing the first j − 1 pools; i.e., $P_{j} = {I_{(i)} : i = \sum_{j^{'} = 1}^{j - 1} c_{j^{'}} + k, k = 1, 2, \dots, c_{j}}$ , for j = 2, 3, …, J. Positive pools are then decoded in the same way as in Dorfman’s (1943) algorithm.

2.2 Operating Characteristics

To streamline our notation, let $I_{j (k)}$ denote the kth ordered individual in the jth pool and let p_j(k) denote the corresponding risk probability, for j = 1, 2, …, J and k = 1, 2, …, c_j. To derive the operating characteristics of ID, we assume that the test sensitivity S_e and specificity S_p are known constants which do not depend on c_j. Previous research has proceeded under this assumption (Vansteelandt, Goetghebeur, and Verstraeten, 2000; Kim et al., 2007; Kim and Hudgens, 2009), and numerous empirical studies have shown this to be reasonable for sensible values of c_j. For example, a number of chlamydia and gonorrhea studies have shown that assay tests based on nucleic acid technology (NAT) possess high sensitivity and specificity with negligible dilution effects for groups up to size c_j = 10 when pooling urine or swabs (Kacena et al., 1998a, 1998b; Butylkina et al., 2007; Shipitsyna et al., 2007). Similar findings have been documented for other infectious diseases when pooling blood or serum samples. For example, Mine et al. (2003) use c_j = 50 for hepatitis B/C screening in Japan. Pilcher et al. (2005) use pools of size c_j = 90 for HIV screening in North Carolina.

Let ${\tilde{G}}_{j} = 1$ if the jth pool is truly positive; i.e., $P_{j}$ contains at least one positive individual, ${\tilde{G}}_{j} = 0$ otherwise, for j = 1, 2, …, J. Similarly, let G_j = 1 if $P_{j}$ tests positive, G_j = 0 otherwise. Let the random variable T denote the total number of tests needed to decode all N individuals. For the ID algorithm, we show in Web Appendix A that

E (T) = J + \sum_{j = 1}^{J} c_{j} I (c_{j} > 1) pr (G_{j} = 1)

var (T) = \sum_{j = 1}^{J} c_{j}^{2} I (c_{j} > 1) pr (G_{j} = 1) pr (G_{j} = 0),

where I(·) denotes the usual indicator function, $pr (G_{j} = 1) = S_{e} + (1 - S_{e} - S_{p}) \prod_{k = 1}^{c_{j}} (1 - p_{j (k)})$ , and pr(G_j = 0) = 1 − pr(G_j = 1). In general, we refer to E(T) as the efficiency, as is common in the group testing literature.

The quality of a decoding algorithm is often measured through its efficiency. However, it is also important to characterize how well an algorithm correctly classifies individuals as being positive or negative. To explore this, $let {\tilde{g}}_{j (k)} = 1$ , if the kth individual in the jth pool is truly positive, so that $pr ({\tilde{g}}_{j (k)} = 1) = p_{j (k)}$ , and $let {\tilde{g}}_{j (k)} = 0$ , otherwise. We define the pooling sensitivity $P S_{e}^{I_{j (k)}}$ and the pooling specificity $P S_{p}^{I_{j (k)}}$ as

P S_{e}^{I_{j (k)}} = pr (I_{j (k)} classified positive ∣ {\tilde{g}}_{j (k)} = 1)

P S_{p}^{I_{j (k)}} = pr (I_{j (k)} classified negative ∣ {\tilde{g}}_{j (k)} = 0)

and derive expressions for $P S_{e}^{I_{j (k)}}$ and $P S_{p}^{I_{j (k)}}$ under ID when c_j > 1. Let g_j(k) = 1, if the kth individual in the jth pool tests positive, g_j(k) = 0, otherwise. As in Kim et al. (2007), we assume that diagnostic test results are independent, conditional on the true status of the pool (individual) being tested; see Litvak, Tu, and Pagano (1994) for discussion. This implies that the testing result for any pool (individual), given its true status, is independent of previous testing results. For ID, individual $I_{j (k)}$ is classified as positive when its group test and its subsequent individual test are both positive. Therefore,

\begin{matrix} P S_{e}^{I_{j (k)}} & = pr (G_{j} = 1, g_{j (k)} = 1 ∣ {\tilde{g}}_{j (k)} = 1) \\ = pr (G_{j} = 1 ∣ {\tilde{g}}_{j (k)} = 1) pr (g_{j (k)} = 1 ∣ G_{j} = 1, {\tilde{g}}_{j (k)} = 1) = S_{e}^{2} . \end{matrix}

This is identical to the pooling sensitivity of Dorfman’s strategy in a homogeneous population; see Kim et al. (2007). This equivalence would not result if an assay test’s performance was somehow related to an individual’s risk probability p_j(k). However, our medical colleagues have found no plausible reason for this to be true with any of the commonly used assays. To derive the pooling specificity $P S_{p}^{I_{j (k)}}$ , note that

P S_{p}^{I_{j (k)}} = pr (G_{j} = 0 ∣ {\tilde{g}}_{j (k)} = 0) + pr (G_{j} = 1, g_{j (k)} = 0 ∣ {\tilde{g}}_{j (k)} = 0) .

From the Law of Total Probability, the first term

\begin{matrix} pr (G_{j} = 0 ∣ {\tilde{g}}_{j (k)} = 0) & = \sum_{s = 0}^{1} pr (G_{j} = 0 ∣ {\tilde{G}}_{j} = 0) pr ({\tilde{G}}_{j} = s ∣ {\tilde{g}}_{j (k)} = 0) \\ = (1 - S_{e}) - (1 - S_{e} - S_{p}) \prod_{k^{'} \in A_{j}^{(k)}} (1 - p_{j (k^{'})}), \end{matrix}

where $A_{j}^{(k)} = {1, \dots, k - 1, k + 1, \dots, c_{j}}$ . Similarly, the second term

\begin{matrix} pr (G_{j} = 1, g_{j (k)} = 0 ∣ {\tilde{g}}_{j (k)} = 0) & = pr (G_{j} = 1 ∣ {\tilde{g}}_{j (k)} = 0) pr (g_{j (k)} = 0 ∣ G_{j} = 1, {\tilde{g}}_{j (k)} = 0) \\ = \sum_{s = 0}^{1} pr (G_{j} = 1 ∣ {\tilde{G}}_{j} = s) pr ({\tilde{G}}_{j} = s ∣ {\tilde{g}}_{j (k)} = 0) \times S_{p} \\ = S_{e} S_{p} + (1 - S_{e} - S_{p}) S_{p} \prod_{k^{'} \in A_{j}^{(k)}} (1 - p_{j (k^{'})}) . \end{matrix}

Combining the terms and simplifying, we obtain

P S_{p}^{I_{j (k)}} = 1 - (1 - S_{p}) {S_{e} + (1 - S_{e} - S_{p}) \prod_{k^{'} \in A_{j}^{(k)}} (1 - p_{j (k^{'})})} .

Unlike the homogeneous case, $P S_{p}^{I_{j (k)}}$ is potentially different for different individuals. When p_j(k) = p; i.e., each individual has the same probability of infection, our expression for $P S_{p}^{I_{j (k)}}$ reduces to equation (7) in Kim et al. (2007), the pooling specificity for Dorfman retesting in a homogeneous population.

Two additional classification measures are commonly used in the group testing literature. The pooling positive predictive value, $P P V^{I_{j (k)}}$ , is the probability that the kth individual in the jth pool is truly positive, given that it is classified as positive. Similarly, the pooling negative predictive value, $N P V^{I_{j (k)}}$ , is the probability that the kth individual in the jth pool is truly negative, given that it is classified as negative. By Bayes Rule,

P P V^{I_{j (k)}} = pr ({\tilde{g}}_{j (k)} = 1 ∣ I_{j (k)} classified positive) = \frac{p_{j (k)} S_{e}^{2}}{p_{j (k)} S_{e}^{2} + (1 - p_{j (k)}) (1 - P S_{p}^{I_{j (k)}})}

N P V^{I_{j (k)}} = pr ({\tilde{g}}_{j (k)} = 0 ∣ I_{j (k)} classified negative) = \frac{(1 - p_{j (k)}) P S_{p}^{I_{j (k)}}}{(1 - p_{j (k)}) P S_{p}^{I_{j (k)}} + p_{j (k)} (1 - S_{e}^{2})} .

Like $P S_{p}^{I_{j (k)}}$ , these measures are individual-specific; thus, they can provide substantial information about the true statuses on a per-individual basis. On the other hand, treating the population as homogeneous provides constant values of PPV and NPV, offering no knowledge about which individuals are more likely to be misclassified. We would envision this additional per-individual information to be especially useful in developing informative back-end screening protocols (Gastwirth and Johnson, 1994; Johnson and Pearson, 1999), not only in chlamydia and gonorrhea surveillance, but also in blood screening, where Dorfman retesting is used extensively.

3. Optimized Informative Dorfman Procedures

Having developed the operating characteristics of ID, our goal is to now construct sets of heterogeneous pools to maximize testing efficiency; i.e., to minimize E(T). As stated in Section 1, Hwang (1975) finds the optimal solution to this problem when S_e = S_p = 1, a simpler and perhaps unrealistic setting. Because an extension of Hwang’s method to the imperfect testing case appears to be intractable, we propose three specific versions of ID which minimize E(T) subject to given pooling constraints. Our new algorithms are “greedy” in nature, forming optimal sets of pools in accordance with the specified constraints.

3.1 Optimal Dorfman

Consider using the ID algorithm with a common pool size c_j = c, for j = 1, 2, …, J (with c_J < c if N < cJ), and let T^(c) denote the number of tests needed to decode all N individuals when pools of size c are used. We call the ID procedure that uses c = c_opt, where

c_{o p t} = \arg \min_{c \geq 1} {J + c I (c > 1) \sum_{j = 1}^{J - 1} pr (G_{j} = 1) + c_{J} I (c_{J} > 1) pr (G_{J} = 1)},

the Optimal Dorfman (OD) algorithm. In other words, OD is a special case of ID where the common pool size c = c_opt minimizes E(T^(c)). Computing c_opt can be done by performing a search over values of c deemed acceptable by the investigator. For example, if one is worried about dilution effects for pool sizes larger than M, say, the minimization procedure can be carried out over {c : c = 1, 2, …, M}. In the objective function above, we point out that J = [N/c], a function of c. If c = 1 (individual testing), then J = N.

3.2 Thresholding

The goal of OD is to maximize testing efficiency by identifying the best common size for all pools. We now consider a different approach to accomplish the same goal.

Suppose, hypothetically, that the N individuals to be screened are partitioned into the two distinct classes $C_{1} = {I_{i} : {\tilde{g}}_{i} = 0, i = 1, 2, \dots, N}$ and $C_{2} = {I_{i} : {\tilde{g}}_{i} = 1, i = 1, 2, \dots, N}$ ; that is, $C_{1} (C_{2})$ consists entirely of negative (positive) individuals. In the absence of testing error, the best possible decoding algorithm, in terms of minimizing E(T), would be to test all individuals in $C_{1}$ in one pool and each individual in $C_{2}$ separately. Of course, one never gets to construct this partition in practice; however, the salient point is that efficiency can always be increased if positive subjects are removed and are tested individually. With this in mind, suppose, more realistically, that one partitions the N individuals into the two classes $C_{1}^{*} = {I_{i} : p_{i} \leq p^{*}, i = 1, 2, \dots, N}$ and $C_{2}^{*} = {I_{i} : p_{i} > p^{*}, i = 1, 2, \dots, N}$ , where p* ∈ [0, 1] is a thresholding value; i.e., a value that distinguishes “low risk” individuals (in $C_{1}^{*}$ ) from those that are “high risk” (in $C_{2}^{*}$ ). By classifying individuals as high risk in $C_{2}^{*}$ before testing begins, one is more likely to a priori isolate those who are positive. Our Optimal Dorfman with Threshold (TOD) procedure specifies that individuals in $C_{1}^{*}$ are tested in pools of optimal equal size and that those in $C_{2}^{*}$ are tested individually.

The threshold p* can be specified by the investigator; however, it is not necessary to do so, as our TOD algorithm identifies its optimal value. To explain this point, $T_{P_{j}}$ denote the number of tests needed to decode the jth pool. When a common pool size is specified, say c₀, the per-pool efficiencies $E (T_{P_{j}})$ satisfy $E (T_{P_{J}}) \geq E (T_{P_{J - 1}}) \geq \dots \geq E (T_{P_{1}})$ whenever S_e + S_p ≥ 1; see Web Appendix B. Therefore, starting with the highest risk individuals, the TOD algorithm seeks to find the value of j, say j*, where $E (T_{P_{j^{*}}}) \geq c_{0}$ and $E (T_{P_{j^{*} - 1}}) < c_{0}$ . For j = j*, j* + 1, …, J (the high risk pools), the per-pool efficiency $E (T_{P_{j}})$ is larger than or equal to c₀; that is, it is more (as) costly to pool than (as) it is to screen individually. Subsequently, p* is taken to be $p^{*} = \frac{1}{2} (p_{j^{*} - 1 (c_{0})} + p_{j^{*} (1)})$ , the average of the risk probabilities for the highest risk subject in $P_{j^{*} - 1}$ and the lowest risk subject in $P_{j^{*}}$ . Subjects in pools $P_{j^{*}}, P_{j^{*} + 1}, \dots, P_{J}$ are classified as “high risk,” placed into $C_{2}^{*}$ , and are tested individually. Subjects in pools $P_{1}, P_{2}, \dots, P_{j^{*} - 1}$ are classified as “low risk,” placed into $C_{1}^{*}$ , and are decoded using OD with $c = c_{o p t} (C_{1}^{*})$ , the optimal common pool size for those individuals in $C_{1}^{*}$ .

Small details regarding TOD warrant brief remarks. First, if N < c₀J, then the lowest risk pool $P_{1}$ will contain c₁ < c₀ individuals, but the ordering $E (T_{P_{J}}) \geq E (T_{P_{J - 1}}) \geq \dots \geq E (T_{P_{1}})$ will still hold. Second, we have found through simulation that the choice of c₀ does not have a large effect on the efficiency of TOD. Generally, as c₀ increases, the number of individual tests from $C_{2}^{*}$ also increases. However, this increase is typically offset by a fewer number of tests required to decode “low risk” individuals in $C_{1}^{*}$ . Third, if $E (T_{P_{j}}) < c_{0}$ for all j = 1, 2, …, J, then TOD reduces to OD. Intuitively, this is more likely to occur when the overall prevalence is very low. On the other hand, if $E (T_{P_{j}}) \geq c_{0}$ for all j = 1, 2, …, J, then one would classify all individuals as “high risk.” This rarely occurs when c₀ is chosen sensibly. We recommend choosing c₀ ≈ c_opt, the optimal pool size identified by OD for all N individuals.

3.3 Pool-Specific Optimal Dorfman

The OD and TOD algorithms share a common characteristic; namely, each uses a common pool size. OD uses a common size for all pools, while TOD uses a common pool size for all individuals who are “low risk.” Therefore, with regards to implementation, OD and TOD are simple procedures, requiring only individual testing and testing pools of common size (with the possible exception of one remainder pool). However, using a common pool size may not be the best way to exploit individual heterogeneity, especially if the individual probabilities exhibit a large amount of variability. With this in mind, we propose a final procedure that determines optimal sizes for each pool and call this the Pool-Specific Optimal Dorfman (PSOD) procedure. The idea behind PSOD is motivated by the fact that E(T) can be expressed as a sum of the efficiencies on a per-individual basis. If we can reduce the expected testing expenditure for each individual, we simultaneously reduce E(T).

Specifically, the goal of PSOD is to identify the pool sizes c_j, j = 1, 2, …, J, that minimize the expected per-individual testing expenditure on a pool by pool basis, starting with the lowest risk pool $P_{1}$ and continuing until the highest risk pool $P_{J}$ has been formed. To describe how PSOD determines the jth pool size c_j, $let N_{j}^{P}$ denote the total number of individuals in the j − 1 lowest risk pools combined; i.e., $N_{1}^{P} = 0$ and $N_{j}^{P} = \sum_{j^{'} = 1}^{j - 1} c_{j^{'}}$ , for j = 2, 3, …, J. $Let N_{j}^{R} = N - N_{j}^{P}$ and define $T_{P_{j}; c_{j}}$ to be the number of tests needed to decode $P_{j}$ , where $P_{j}$ consists of the c_j lowest risk subjects remaining after $P_{1}, P_{2}, \dots, P_{j - 1}$ have been formed. For Dorfman retesting, the expected per-individual testing expenditure is the same for each individual belonging to a common pool. Therefore, PSOD identifies $c_{j}^{*} = \arg \min_{1 \leq c_{j} \leq N_{j}^{R}} {c_{j}^{- 1} E (T_{P_{j}; c_{j}})}$ , where

\begin{matrix} E (T_{P_{j}; c_{j}}) & = 1 + c_{j} I (c_{j} > 1) pr (G_{j} = 1) \\ = 1 + c_{j} I (c_{j} > 1) {S_{e} + (1 - S_{e} - S_{p}) \prod_{k = 1}^{c_{j}} (1 - p_{(N_{j}^{P} + k)})}, \end{matrix}

and defines $c_{j} = c_{j}^{*}$ as the optimal size for the jth pool. The full algorithm for PSOD is given in Web Appendix B.

Because individuals are ordered a priori in terms of their risk, the pool sizes c_j identified by PSOD, like those identified by Hwang’s method, are guaranteed to satisfy c₁ ≥ c₂ ≥ ⋯ ≥ c_J. This is a characteristic that makes PSOD and Hwang’s method attractive because higher risk pools should use smaller pool sizes. It is straightforward to amend PSOD to guard against excessively large pool sizes. One could simply minimize the expected per-individual testing expenditure $c_{j}^{- 1} E (T_{P_{j}; c_{j}})$ over ${c_{j} : c_{j} = 1, 2, \dots, {\hat{N}}_{j}^{R}}$ , where ${\hat{N}}_{j}^{R} = \min {N_{j}^{R}, M}$ and M is the maximum allowable pool size. This guarantees that each pool will not contain more than M individuals. Hwang’s method can be amended similarly. It is important to note that even when S_e = S_p = 1, the situation in which Hwang’s solution is globally optimal, PSOD and Hwang’s method are different procedures, because PSOD is a greedy algorithm that makes pool size selections sequentially. Overall, PSOD and Hwang’s method do acknowledge heterogeneity more directly than OD or TOD. However, this also means that PSOD and Hwang’s method may be more difficult to implement, as different pool sizes are needed to complete the identification process.

4. Simulation Evidence

To assess the impact of incorporating heterogeneity, we first compare Dorfman’s original procedure (D), where individuals are assigned to pools at random, to OD, TOD, PSOD, and Hwang’s method (H). We generate true probabilities p_i, i = 1, 2, …, N, from a beta(1, θ) distribution; note that θ determines the mean prevalence, p = 1/(1 + θ), and the amount of heterogeneity in the population. We consider values of p = 0.0001, 0.0002, …, 0.0099 to investigate rare infections and also p = 0.01, 0.02, …, 0.50 to examine higher prevalence infections. Our decision to consider larger values of p may seem unsuited because group testing is typically used when the infection rate is small. However, as we demonstrate, the optimized Dorfman procedures consistently confer savings over individual testing even when p is large. We use values of S_e ∈ {0.80, 0.90, 0.91, …, 1} and S_p ∈ {0.80, 0.90, 0.95, 0.99, 1} and let the maximum allowable pool size M = 10, 20, or 30. Recall that using M = 10 is consistent with the chlamydia and gonorrhea screening literature; see Section 2. Also, recall that Hwang’s grouping method depends on neither S_e nor S_p.

Figure 1 displays a representative subset of the results; more extensive comparisons are in Web Appendix C. In Figure 1, we display the overall expected per-individual testing expenditure, E(T)/N, when M = 10 and N = 1000. For each (p, S_e, S_p) combination, OD, TOD, PSOD, and H use their own procedure-specific pool size(s). To ensure the fairest possible comparison, we use the optimal pool size for D throughout; i.e., the pool size c that minimizes the expected number of tests per pool, 1+c{S_e+(1−S_e−S_p)(1−p)^c}. The results in Figure 1, along with those in Web Appendix C, show that the gains in efficiency from OD, TOD, and PSOD (over D) are generally very small when p < 0.01 but increase notably as p increases. When p > 0.01, we have consistently seen that PSOD is more efficient than TOD, that TOD is more efficient than OD, and that OD remains more efficient than D as long as the prevalence is not too large (roughly p < 0.35; see Web Appendix C). Figure 1 also shows that PSOD and H are nearly identical when S_e and S_p are close to unity. However, for lower values of S_e and S_p, OD, TOD, and PSOD each can outperform H, especially when p is larger. For very large p, we have found that TOD, PSOD, and H each consistently outperform individual testing even when the prevalence is as high as 50%. This prevalence is likely not to be encountered when screening for sexually transmitted diseases, but it could be of interest in other applications where the use of pooling for identification has not been previously envisioned.

Comparison of Dorfman algorithms. Expected per-individual efficiency, with M = 10 and N = 1000, for p = 0.01, 0.05, 0.10, 0.20, and 0.30. The optimal pool size for D has been used. A more comprehensive comparison among D, OD, TOD, PSOD, and H is in Web Appendix C.

Using the same values of p, S_e, S_p, and M, we also compare OD, TOD, and PSOD to noninformative array testing (A). Like Dorfman screening, A is a two-stage classification procedure; that is, rows and columns are tested initially followed by individual testing if needed. Higher stage procedures are included for comparison purposes in Section 5. The salient features of A were summarized in Section 1; we restrict attention to two-dimensional square arrays (Phatarfod and Sudbury, 1994; Kim et al., 2007; Hudgens and Kim, 2011) in the comparison. We make this comparison using the optimal array size for A; that is, for given values of p, S_e, and S_p, we compare OD, TOD, and PSOD with the c×c array procedure that minimizes the expected number of tests; see equation (13) in Kim et al. (2007).

Figure 2 shows how OD, TOD, and PSOD compare with A in terms of efficiency when M = 10 and N = 1000. For given values of p, S_e, and S_p, we first compute $E (T ∣ D) - E (T ∣ A)$ , where $D \in (OD, TOD, PSOD)$ and E(T|·) is the efficiency for a given procedure. Values of $E (T ∣ D) - E (T ∣ A)$ that are negative (positive) indicate that the informative Dorfman procedure $D$ is more (less) efficient than A; this is represented in Figure 2 using dark (light) grey coloring. Heat maps of scaled values of $E (T ∣ D) - E (T ∣ A)$ are in Web Appendix C. From Figure 2, one sees that OD, TOD, and PSOD are all preferred to A, for nearly all (S_e, S_p) combinations, when the mean prevalence p is larger (e.g., p > 0.10). The optimized Dorfman procedures also outperform A for very small p when M = 10 is used as a maximum allowable pool size, although additional results in Web Appendix C show that A outperforms OD, TOD, and PSOD for most values of p < 0.08 (roughly) when larger arrays are allowed (e.g., M = 20, 30). Of course, as stated in Section 2, using larger pool sizes may not be possible with some diagnostic tests for fear of dilution effects. In addition, array testing generally requires more resources than Dorfman screening; this may make its use prohibitive in some screening applications (Westreich et al., 2008).

Comparison of Dorfman algorithms (OD, TOD, and PSOD) with A when M = 10 and N = 1000. Top: OD versus A; Middle: TOD versus A; Bottom: PSOD versus A. Regions in dark grey denote those where the Dorfman procedures are more efficient. The optimal (square) array size for A has been used for each (*p, S_e, S_p*) configuration.

Finally, we have also compared D, OD, TOD, PSOD, H, and A in terms of the screening accuracy measures from Section 2. To summarize, when compared to D, there are no substantial changes in screening accuracy that arise from using our new informative Dorfman algorithms and H in realistic settings. Furthermore, in regions where A is more efficient, Dorfman procedures can improve overall pooling sensitivity and pooling negative predictive value. Complete details are given in Web Appendix C.

5. Nebraska IPP Data

We now apply our new Dorfman screening algorithms to chlamydia and gonorrhea data collected during 2008-2009 as part of the Nebraska IPP. For each infection, we create four strata by cross-classifying each individual according to gender and specimen type (swab/urine). A complete summary of the data, including the number of individuals screened per stratum, values of S_e and S_p (provided by the NPHL), and individual covariates, is given in Table 1. Across strata, there were 23,146 individuals screened in 2008 and 27,551 individuals screened in 2009. All individuals were screened for both infections. The NPHL estimates that individual swab (urine) tests cost about $11 ($16) each.

Table 1.

Nebraska IPP data summary. Numbers of individuals screened and overall prevalence levels are provided for years 2008 and 2009. The values of S_e and S_p are provided by the NPHL. The individual covariates available are age, gender, race, clinic type, location of clinic, reason for visit (family planning, prenatal, STD screening), symptoms, clinical observations (cervical friability, PID, cervicitis, urethritis), and risk history (multiple partners in last 90 days, new partner in last 90 days, contact to STD).

					Number screened		Mean prevalence

Infection	Gender	Specimen	S_e	S_p	2008	2009	2008	2009
Chlamydia	Female	Urine	0.805	0.96	2338	4972	0.092	0.080
	Female	Swab	0.928	0.96	14441	14530	0.072	0.069
	Male	Urine	0.930	0.95	3541	6139	0.077	0.081
	Male	Swab	0.925	0.95	2826	1910	0.137	0.157

Gonorrhea	Female	Urine	0.849	0.98	2338	4972	0.024	0.017
	Female	Swab	0.966	0.98	14441	14530	0.013	0.013
	Male	Urine	0.970	0.96	3541	6139	0.012	0.021
	Male	Swab	0.985	0.96	2826	1910	0.068	0.070

Open in a new tab

Up until now, we have assumed that individuals’ risk probabilities are known. While this is not realistic, in most screening situations, investigators will often have access to data recorded from previous periods of screening. These data can be used to estimate the risk levels for new individuals to be screened. For example, in the Nebraska IPP, using a “training” data set is realistic, since testing is performed daily at the NPHL. For purposes of illustration, we treat the 23,146 individual diagnoses in 2008 and the individual covariates (see Table 1) as training data. For purposes of illustration, we fit a first-order logistic regression model using all of the available covariates for each infection and specimen type.

We regard the 2009 individual diagnoses as the true responses and, for OD, TOD, PSOD, and H, assign the 2009 individuals to pools based on their estimated probabilities ${\hat{p}}_{i}$ from the 2008 model fits. For noninformative D (A), we assign the 2009 individuals to pools (arrays) chronologically based on the specimen arrival date using optimal pool (array) sizes determined from the 2008 estimated mean prevalence levels; see Table 1. A maximum pool size M = 10 is used for all procedures. Simulated pool diagnoses are then determined using the S_e and S_p levels in Table 1. For OD, TOD, PSOD, and H, we first create “blocks” of individuals of size N = 50, 100, and 200, ordered chronologically by the specimen arrival date; decoding is then completed within each block (the last block formed in 2009 is potentially of smaller size). We choose to implement retesting in this manner to acknowledge the sequential nature in which specimens are tested at the NPHL; it is not realistic to wait until all individual specimens from 2009 have been collected to start the screening process. Note that with noninformative D, the last pool formed is potentially of smaller size; for A, the last array formed is potentially smaller.

Table 2 displays the number of tests expended when screening individuals for chlamydia and gonorrhea in 2009. Because the 2009 diagnoses are simulated for each infection, we implement each procedure 1000 times for each gender-infection-specimen type configuration; values in Table 2 are averages over these 1000 simulations. First, the informative Dorfman procedures always provide a reduction in the number of tests when compared to the best noninformative D procedure, and this reduction can be substantial. For OD, TOD, PSOD, and H, it is difficult to identify a relationship between the blocking size (50, 100, 200) and efficiency, although this relationship likely depends on the order in which individual specimens arrive at the NPHL for testing. Second, although H is more efficient than PSOD in a small majority of the cases examined, the differences are often minor and could perhaps be explained by the fact that testing responses are simulated; recall also from Section 4 that PSOD and H are nearly identical for smaller prevalence levels. Not surprisingly, PSOD does outperform H for the lowest S_e cohort (female-chlamydia-urine). Finally, the informative Dorfman procedures perform on par with the best noninformative A procedure. OD, TOD, and PSOD are each more efficient (regardless of blocking size) in four of the eight gender-infection-specimen type strata; A is more efficient in three of the strata, and the remaining stratum (male-gonorrhea-urine) offers mixed results.

Table 2.

Nebraska IPP screening results for 2009. Mean number of tests, averaged over 1000 implementations, for the 8 strata created by infection, specimen type, and gender. Blocks of size N = 50, 100, and 200 are used for OD, TOD, PSOD, and H. Gender/specimen individual stratum counts and values of S_e and S_p are in Table 1. Optimal pool sizes for D and A have been used; see Web Appendix D.

			Male				Female

	Method		N = 50	N = 100	N = 200		N = 50	N = 100	N = 200
Chlamydia Urine	D	3417.3				2483.6
	OD		3301.6	3267.4	3271.5		2482.0	2469.4	2451.3
	TOD		3301.2	3267.6	3271.5		2482.6	2468.1	2453.1
	PSOD		3221.1	3210.4	3218.1		2458.7	2445.7	2434.5
	H		3218.3	3218.0	3240.8		2472.5	2450.7	2470.0
	A	3136.4				2124.6

Chlamydia Swab	D	1395.7				7354.9
	OD		1299.1	1295.3	1311.4		7214.6	7168.6	7189.1
	TOD		1290.8	1288.7	1301.2		7213.4	7167.8	7190.6
	PSOD		1308.4	1285.4	1316.3		7076.9	7107.6	7091.1
	H		1289.1	1303.5	1309.1		7035.3	7103.3	7054.2
	A	1385.4				6550.9

Gonorrhea Urine	D	1956.3				1225.2
	OD		1767.9	1771.4	1704.5		1200.5	1194.4	1178.6
	TOD		1768.6	1767.7	1704.7		1203.8	1197.6	1177.8
	PSOD		1716.7	1672.8	1624.8		1212.3	1176.9	1169.6
	H		1691.7	1681.0	1618.3		1170.4	1155.4	1176.5
	A	1630.5				1229.2

Gonorrhea Swab	D	1025.3				3401.8
	OD		829.4	811.4	806.3		3179.8	3279.1	3215.3
	TOD		787.6	812.8	816.9		3184.8	3279.0	3216.7
	PSOD		750.0	752.7	766.3		3189.4	3127.9	3125.9
	H		741.7	746.1	742.8		3128.7	3127.2	3108.5
	A	963.8				3467.2

Open in a new tab

Web Appendix D contains further analyses of the Nebraska IPP data, including a summary of the screening accuracy measures, histograms of the estimated individual probabilities in 2009, and additional comparisons which include three-stage halving and the best informative Sterrett procedure from Bilder et al. (2010). As expected, higher stage procedures can reduce the number of tests expended for the Nebraska IPP, but at the cost of increased complexity and, in some cases, screening accuracy.

6. Discussion

In this paper, we have proposed new Dorfman identification algorithms which use optimal pool sizes and thresholding in a heterogeneous population. Our informative algorithms account for imperfect diagnostic tests, compete strongly with other available screening procedures in terms of efficiency and accuracy, and preserve the simplicity of Dorfman retesting. To disseminate this work, we have written R programs to implement all of the algorithms in this paper. We are happy to provide these programs to those who request them.

We have illustrated the use of our new informative Dorfman procedures using chlamydia and gonorrhea data collected as part of the Nebraska IPP. We believe that our informative algorithms could also find successful application in blood and plasma donation screening, mainly because (noninformative) Dorfman retesting is already widely used to screen donations in the United States and elsewhere (Dodd, Notari, and Stramer, 2002; Tabor and Epstein, 2002; Mine et al., 2003; Seed, Kiely, and Keller, 2005). Our procedures are only minimally more involved than classical Dorfman retesting, and they can provide a substantial reduction in the number of tests without sacrificing classification accuracy. The development of enhanced back-end screening procedures is also possible as predictive probabilities can be estimated on a per-individual basis.

To implement any informative classification algorithm, it is necessary to estimate the levels of risk for individuals entering the screening procedure. In our collaboration with researchers from the NPHL and the American Red Cross (ARC), we have found that training data from past individuals are usually plentiful. For example, the NPHL screens around 25,000 individuals per year for chlamydia and gonorrhea, and the ARC screens about 6 million donations annually for a variety of infectious diseases (Zou et al., 2004). When training data are not available, one could use the regression methods of Vansteelandt et al. (2000) or Xie (2001) to estimate individual infection probabilities using responses from the initial pools, regroup individuals in positive pools based on these estimates, and then decode using our informative Dorfman techniques. We have found that this approach provides about the same level of benefit when compared to the results found in this paper. Of course, suitable diagnostics should always be performed to assess the fit of the model used to produce the risk estimates.

In closing, we believe that group testing in heterogeneous populations is a fertile area of research. We are currently developing informative extensions of the array testing procedures outlined in Kim et al. (2007) and the (hierarchical) halving algorithms proposed in Litvak et al. (1994). We also believe that developing informative procedures to screen for the presence of multiple infections simultaneously would be worthwhile.

Supplementary Material

Supp Material S1

NIHMS306267-supplement-Supp_Material_S1.pdf^{(998.5KB, pdf)}

Acknowledgements

The authors are grateful to the Editor, the Associate Editor, and the two referees for their helpful comments. The authors also thank Dr. Peter Iwen, Dr. Steven Hinrichs, Philip Medina, and Jeri Weberg-Bryce for their consultation on the IPP and Dr. Brandon Bookstaver for his additional insight on screening practices. This research is funded by Grant R01 AI067373 from the National Institutes of Health.

Footnotes

Supplementary Materials

The Web Appendices referenced in Sections 2-5 are available under the Paper Information link at the Biometrics website http://www.biometrics.tibs.org.

References

Alter H. Emerging, re-emerging, and submerging infectious threats to the blood supply. Vox Sanguinis. 2004;87:56–61. doi: 10.1111/j.1741-6892.2004.00496.x. [DOI] [PubMed] [Google Scholar]
Bilder C, Tebbs J, Chen P. Informative retesting. Journal of the American Statistical Association. 2010;105:942–955. doi: 10.1198/jasa.2010.ap09231. [DOI] [PMC free article] [PubMed] [Google Scholar]
Butylkina R, Juseviciute V, Kasparaviciene G, Vagoras A, Pagirskas E, Unemo M, Domeika M. Pooling of urine specimens allows accurate and cost-effective genetic detection of Chlamydia Trachomatis in Lithuania and other low-resource countries. Scandinavian Journal of Infectious Diseases. 2007;39:209–212. doi: 10.1080/00365540600978914. [DOI] [PubMed] [Google Scholar]
Centers for Disease Control . Trends in reportable sexually transmitted diseases in the United States, 2007. 2009. Available at http://www.cdc.gov/STD/stats07/trends.htm. [Google Scholar]
Dodd R, Notari E, Stramer S. Current prevalence and incidence of infectious disease markers and estimated window-period risk in the American Red Cross donor population. Transfusion. 2002;42:975–979. doi: 10.1046/j.1537-2995.2002.00174.x. [DOI] [PubMed] [Google Scholar]
Dorfman R. The detection of defective members of large populations. Annals of Mathematical Statistics. 1943;14:436–440. [Google Scholar]
Farley T, Cohen D, Elkins W. Asymptomatic sexually transmitted diseases: The case for screening. Preventative Medicine. 2003;36:502–509. doi: 10.1016/s0091-7435(02)00058-0. [DOI] [PubMed] [Google Scholar]
Gastwirth J. The efficiency of pooling in the detection of rare mutations. American Journal of Human Genetics. 2000;67:1036–1039. doi: 10.1086/303097. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gastwirth J, Johnson W. Screening with cost effective quality control: Potential applications to HIV and drug testing. Journal of the American Statistical Association. 1994;89:972–981. [Google Scholar]
Hourfar M, Themann A, Eickmann M, Puthavathana P, Laue T, Seifried E, Schmidt M. Blood screening for influenza. Emerging Infectious Diseases. 2007;13:1081–1083. doi: 10.3201/eid1307.060861. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hudgens M, Kim H. Optimal configuration of a square array group testing algorithm. Communications in Statistics: Theory and Methods. 2011;40:436–448. doi: 10.1080/03610920903391303. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hwang F. A generalized binomial group testing problem. Journal of the American Statistical Association. 1975;70:923–926. [Google Scholar]
Johnson W, Pearson L. Dual screening. Biometrics. 1999;55:867–873. doi: 10.1111/j.0006-341x.1999.00867.x. [DOI] [PubMed] [Google Scholar]
Kacena K, Quinn S, Hartman S, Quinn T, Gaydos C. Pooling of urine samples for screening for Neisseria gonorrhoeae by ligase chain reaction: Accuracy and application. Journal of Clinical Microbiology. 1998a;36:3624–3628. doi: 10.1128/jcm.36.12.3624-3628.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kacena K, Quinn S, Howell M, Madico G, Quinn T, Gaydos C. Pooling urine samples for ligase chain reaction screening for genital Chlamydia trachomatis infection in asymptomatic women. Journal of Clinical Microbiology. 1998b;36:481–485. doi: 10.1128/jcm.36.2.481-485.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kim H, Hudgens M. Three-dimensional array-based group testing algorithms. Biometrics. 2009;65:903–910. doi: 10.1111/j.1541-0420.2008.01158.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kim H, Hudgens M, Dreyfuss J, Westreich D, Pilcher C. Comparison of group testing algorithms for case identification in the presence of testing error. Biometrics. 2007;63:1152–1163. doi: 10.1111/j.1541-0420.2007.00817.x. [DOI] [PubMed] [Google Scholar]
Litvak E, Tu X, Pagano M. Screening for the presence of a disease by pooling sera samples. Journal of the American Statistical Association. 1994;89:424–434. [Google Scholar]
Mine H, Emura H, Miyamoto M, Tomono T, Minegishi K, Murokawa H, Yamanaka R, Yoshikawa A, Nishioka K. High throughput screening of 16 million serologically negative blood donors for hepatitis B virus, hepatitis C virus, and human immunodeficiency virus type-1 by nucleic acid amplification testing with specific and sensitive multiplex reagent in Japan. Journal of Virological Methods. 2003;112:145–151. doi: 10.1016/s0166-0934(03)00215-5. [DOI] [PubMed] [Google Scholar]
Phatarfod R, Sudbury A. The use of a square array scheme in blood testing. Statistics in Medicine. 1994;13:2337–2343. doi: 10.1002/sim.4780132205. [DOI] [PubMed] [Google Scholar]
Pilcher C, Fiscus S, Nguyen T, Foust E, Wolf L, Williams D, Ashby R, O’Dowd J, McPherson J, Stalzer B, Hightow L, Miller W, Eron J, Cohen M, Leone P. Detection of acute infections during HIV testing in North Carolina. New England Journal of Medicine. 2005;352:1873–1883. doi: 10.1056/NEJMoa042291. [DOI] [PubMed] [Google Scholar]
Remlinger K, Hughes-Oliver J, Young S, Lam R. Statistical design of pools using optimal coverage and minimal collision. Technometrics. 2006;48:133–143. [Google Scholar]
Schmidt M, Roth W, Meyer H, Seifried E, Hourfar M. Nucleic acid test screening of blood donors for orthopoxviruses can potentially prevent dispersion of viral agents in case of bioterrorism. Transfusion. 2005;45:399–403. doi: 10.1111/j.1537-2995.2005.04242.x. [DOI] [PubMed] [Google Scholar]
Seed C, Kiely P, Keller A. Residual risk of transfusion transmitted human immunodeficiency virus, hepatitis B virus, hepatitis C virus, and human T lymphotrophic virus. Internal Medicine Journal. 2005;35:592–598. doi: 10.1111/j.1445-5994.2005.00926.x. [DOI] [PubMed] [Google Scholar]
Shipitsyna E, Shalepo K, Savicheva A, Unemo M, Domeika M. Pooling samples: The key to sensitive, specific and cost-effective genetic diagnosis of Chlamydia trachomatis in low-resource countries. Acta Dermato-Venerologica. 2007;87:140–143. doi: 10.2340/00015555-0196. [DOI] [PubMed] [Google Scholar]
Sterrett A. On the detection of defective members of large populations. Annals of Mathematical Statistics. 1957;28:1033–1036. [Google Scholar]
Tabor E, Epstein J. NAT screening of blood and plasma donations: Evolution of technology and regulatory policy. Transfusion. 2002;42:1230–1237. doi: 10.1046/j.1537-2995.2002.00183.x. [DOI] [PubMed] [Google Scholar]
Vansteelandt S, Goetghebeur E, Verstraeten T. Regression models for disease prevalence with diagnostic tests on pools of serum samples. Biometrics. 2000;56:1126–1133. doi: 10.1111/j.0006-341x.2000.01126.x. [DOI] [PubMed] [Google Scholar]
Westreich D, Hudgens M, Fiscus S, Pilcher C. Optimizing screening for acute human immunodeficiency virus infection with pooled nucleic acid amplification tests. Journal of Clinical Microbiology. 2008;46:1785–1792. doi: 10.1128/JCM.00787-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
Xie M. Regression analysis of group testing samples. Statistics in Medicine. 2001;20:1957–1969. doi: 10.1002/sim.817. [DOI] [PubMed] [Google Scholar]
Xie M, Tatsuoka K, Sacks J, Young S. Group testing with blockers and synergism. Journal of the American Statistical Association. 2001;96:92–102. [Google Scholar]
Zou S, Notari E, Stramer S, Wahab F, Musavi F, Dodd R. Patterns of age- and sex-specific prevalence of major blood-borne infections in United States blood donors, 1995 to 2002: American Red Cross blood donor study. Transfusion. 2004;44:1640–1647. doi: 10.1111/j.0041-1132.2004.04153.x. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supp Material S1

NIHMS306267-supplement-Supp_Material_S1.pdf^{(998.5KB, pdf)}

[R1] Alter H. Emerging, re-emerging, and submerging infectious threats to the blood supply. Vox Sanguinis. 2004;87:56–61. doi: 10.1111/j.1741-6892.2004.00496.x. [DOI] [PubMed] [Google Scholar]

[R2] Bilder C, Tebbs J, Chen P. Informative retesting. Journal of the American Statistical Association. 2010;105:942–955. doi: 10.1198/jasa.2010.ap09231. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] Butylkina R, Juseviciute V, Kasparaviciene G, Vagoras A, Pagirskas E, Unemo M, Domeika M. Pooling of urine specimens allows accurate and cost-effective genetic detection of Chlamydia Trachomatis in Lithuania and other low-resource countries. Scandinavian Journal of Infectious Diseases. 2007;39:209–212. doi: 10.1080/00365540600978914. [DOI] [PubMed] [Google Scholar]

[R4] Centers for Disease Control . Trends in reportable sexually transmitted diseases in the United States, 2007. 2009. Available at http://www.cdc.gov/STD/stats07/trends.htm. [Google Scholar]

[R5] Dodd R, Notari E, Stramer S. Current prevalence and incidence of infectious disease markers and estimated window-period risk in the American Red Cross donor population. Transfusion. 2002;42:975–979. doi: 10.1046/j.1537-2995.2002.00174.x. [DOI] [PubMed] [Google Scholar]

[R6] Dorfman R. The detection of defective members of large populations. Annals of Mathematical Statistics. 1943;14:436–440. [Google Scholar]

[R7] Farley T, Cohen D, Elkins W. Asymptomatic sexually transmitted diseases: The case for screening. Preventative Medicine. 2003;36:502–509. doi: 10.1016/s0091-7435(02)00058-0. [DOI] [PubMed] [Google Scholar]

[R8] Gastwirth J. The efficiency of pooling in the detection of rare mutations. American Journal of Human Genetics. 2000;67:1036–1039. doi: 10.1086/303097. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] Gastwirth J, Johnson W. Screening with cost effective quality control: Potential applications to HIV and drug testing. Journal of the American Statistical Association. 1994;89:972–981. [Google Scholar]

[R10] Hourfar M, Themann A, Eickmann M, Puthavathana P, Laue T, Seifried E, Schmidt M. Blood screening for influenza. Emerging Infectious Diseases. 2007;13:1081–1083. doi: 10.3201/eid1307.060861. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] Hudgens M, Kim H. Optimal configuration of a square array group testing algorithm. Communications in Statistics: Theory and Methods. 2011;40:436–448. doi: 10.1080/03610920903391303. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] Hwang F. A generalized binomial group testing problem. Journal of the American Statistical Association. 1975;70:923–926. [Google Scholar]

[R13] Johnson W, Pearson L. Dual screening. Biometrics. 1999;55:867–873. doi: 10.1111/j.0006-341x.1999.00867.x. [DOI] [PubMed] [Google Scholar]

[R14] Kacena K, Quinn S, Hartman S, Quinn T, Gaydos C. Pooling of urine samples for screening for Neisseria gonorrhoeae by ligase chain reaction: Accuracy and application. Journal of Clinical Microbiology. 1998a;36:3624–3628. doi: 10.1128/jcm.36.12.3624-3628.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] Kacena K, Quinn S, Howell M, Madico G, Quinn T, Gaydos C. Pooling urine samples for ligase chain reaction screening for genital Chlamydia trachomatis infection in asymptomatic women. Journal of Clinical Microbiology. 1998b;36:481–485. doi: 10.1128/jcm.36.2.481-485.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] Kim H, Hudgens M. Three-dimensional array-based group testing algorithms. Biometrics. 2009;65:903–910. doi: 10.1111/j.1541-0420.2008.01158.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] Kim H, Hudgens M, Dreyfuss J, Westreich D, Pilcher C. Comparison of group testing algorithms for case identification in the presence of testing error. Biometrics. 2007;63:1152–1163. doi: 10.1111/j.1541-0420.2007.00817.x. [DOI] [PubMed] [Google Scholar]

[R18] Litvak E, Tu X, Pagano M. Screening for the presence of a disease by pooling sera samples. Journal of the American Statistical Association. 1994;89:424–434. [Google Scholar]

[R19] Mine H, Emura H, Miyamoto M, Tomono T, Minegishi K, Murokawa H, Yamanaka R, Yoshikawa A, Nishioka K. High throughput screening of 16 million serologically negative blood donors for hepatitis B virus, hepatitis C virus, and human immunodeficiency virus type-1 by nucleic acid amplification testing with specific and sensitive multiplex reagent in Japan. Journal of Virological Methods. 2003;112:145–151. doi: 10.1016/s0166-0934(03)00215-5. [DOI] [PubMed] [Google Scholar]

[R20] Phatarfod R, Sudbury A. The use of a square array scheme in blood testing. Statistics in Medicine. 1994;13:2337–2343. doi: 10.1002/sim.4780132205. [DOI] [PubMed] [Google Scholar]

[R21] Pilcher C, Fiscus S, Nguyen T, Foust E, Wolf L, Williams D, Ashby R, O’Dowd J, McPherson J, Stalzer B, Hightow L, Miller W, Eron J, Cohen M, Leone P. Detection of acute infections during HIV testing in North Carolina. New England Journal of Medicine. 2005;352:1873–1883. doi: 10.1056/NEJMoa042291. [DOI] [PubMed] [Google Scholar]

[R22] Remlinger K, Hughes-Oliver J, Young S, Lam R. Statistical design of pools using optimal coverage and minimal collision. Technometrics. 2006;48:133–143. [Google Scholar]

[R23] Schmidt M, Roth W, Meyer H, Seifried E, Hourfar M. Nucleic acid test screening of blood donors for orthopoxviruses can potentially prevent dispersion of viral agents in case of bioterrorism. Transfusion. 2005;45:399–403. doi: 10.1111/j.1537-2995.2005.04242.x. [DOI] [PubMed] [Google Scholar]

[R24] Seed C, Kiely P, Keller A. Residual risk of transfusion transmitted human immunodeficiency virus, hepatitis B virus, hepatitis C virus, and human T lymphotrophic virus. Internal Medicine Journal. 2005;35:592–598. doi: 10.1111/j.1445-5994.2005.00926.x. [DOI] [PubMed] [Google Scholar]

[R25] Shipitsyna E, Shalepo K, Savicheva A, Unemo M, Domeika M. Pooling samples: The key to sensitive, specific and cost-effective genetic diagnosis of Chlamydia trachomatis in low-resource countries. Acta Dermato-Venerologica. 2007;87:140–143. doi: 10.2340/00015555-0196. [DOI] [PubMed] [Google Scholar]

[R26] Sterrett A. On the detection of defective members of large populations. Annals of Mathematical Statistics. 1957;28:1033–1036. [Google Scholar]

[R27] Tabor E, Epstein J. NAT screening of blood and plasma donations: Evolution of technology and regulatory policy. Transfusion. 2002;42:1230–1237. doi: 10.1046/j.1537-2995.2002.00183.x. [DOI] [PubMed] [Google Scholar]

[R28] Vansteelandt S, Goetghebeur E, Verstraeten T. Regression models for disease prevalence with diagnostic tests on pools of serum samples. Biometrics. 2000;56:1126–1133. doi: 10.1111/j.0006-341x.2000.01126.x. [DOI] [PubMed] [Google Scholar]

[R29] Westreich D, Hudgens M, Fiscus S, Pilcher C. Optimizing screening for acute human immunodeficiency virus infection with pooled nucleic acid amplification tests. Journal of Clinical Microbiology. 2008;46:1785–1792. doi: 10.1128/JCM.00787-07. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] Xie M. Regression analysis of group testing samples. Statistics in Medicine. 2001;20:1957–1969. doi: 10.1002/sim.817. [DOI] [PubMed] [Google Scholar]

[R31] Xie M, Tatsuoka K, Sacks J, Young S. Group testing with blockers and synergism. Journal of the American Statistical Association. 2001;96:92–102. [Google Scholar]

[R32] Zou S, Notari E, Stramer S, Wahab F, Musavi F, Dodd R. Patterns of age- and sex-specific prevalence of major blood-borne infections in United States blood donors, 1995 to 2002: American Red Cross blood donor study. Transfusion. 2004;44:1640–1647. doi: 10.1111/j.0041-1132.2004.04153.x. [DOI] [PubMed] [Google Scholar]

PERMALINK

Informative Dorfman Screening

Christopher S McMahan

Joshua M Tebbs

Christopher R Bilder

Summary

1. Introduction

2. Informative Dorfman Algorithm

2.1 Preliminaries

2.2 Operating Characteristics

3. Optimized Informative Dorfman Procedures

3.1 Optimal Dorfman

3.2 Thresholding

3.3 Pool-Specific Optimal Dorfman

4. Simulation Evidence

Figure 1.

Figure 2.

5. Nebraska IPP Data

Table 1.

Table 2.

6. Discussion

Supplementary Material

Acknowledgements

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Informative Dorfman Screening

Christopher S McMahan

Joshua M Tebbs

Christopher R Bilder

Summary

1. Introduction

2. Informative Dorfman Algorithm

2.1 Preliminaries

2.2 Operating Characteristics

3. Optimized Informative Dorfman Procedures

3.1 Optimal Dorfman

3.2 Thresholding

3.3 Pool-Specific Optimal Dorfman

4. Simulation Evidence

Figure 1.

Figure 2.

5. Nebraska IPP Data

Table 1.

Table 2.

6. Discussion

Supplementary Material

Acknowledgements

Footnotes

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases