Summary
Background
The ability of individuals to use HIV self-tests correctly is debated. To inform the 2016 WHO recommendation on HIV self-testing, we assessed the reliability and performance of HIV rapid diagnostic tests when used by self-testers.
Methods
In this systematic review and meta-analysis, we searched PubMed, PopLine, and Embase, conference abstracts, and additional grey literature between Jan 1, 1995, and April 30, 2016, for observational and experimental studies reporting on HIV self-testing performance. We excluded studies evaluating home specimen collection because patients did not interpret their own test results. We extracted data independently, using standardised extraction forms. Outcomes of interest were agreement between self-testers and health-care workers, sensitivity, and specificity. We calculated κ to establish the level of agreement and pooled κ estimates using a random-effects model, by approach (directly assisted or unassisted) and type of specimen (blood or oral fluid). We examined heterogeneity with the I2 statistic.
Findings
25 studies met inclusion criteria (22 to 5662 participants). Quality assessment with QUADAS-2 showed studies had low risk of bias and incomplete reporting in accordance with the STARD checklist. Raw proportion of agreement ranged from 85·4% to 100%, and reported κ ranged from fair (κ 0·277, p<0·001) to almost perfect (κ 0·99, n=25). Pooled κ suggested almost perfect agreement for both types of approaches (directly assisted 0·98, 95% CI 0·96–0·99 and unassisted 0·97, 0·96–0·98; I2=34·5%, 0–97·8). Excluding two outliers, sensitivity and specificity was higher for blood-based rapid diagnostic tests (4/16) compared with oral fluid rapid diagnostic tests (13/16). The most common error that affected test performance was incorrect specimen collection (oral swab or finger prick). Study limitations included the use of different reference standards and no disaggregation of results by individuals taking antiretrovirals.
Interpretation
Self-testers can reliably and accurately do HIV rapid diagnostic tests, as compared with trained health-care workers. Errors in performance might be reduced through the improvement of rapid diagnostic tests for self-testing, particularly to make sample collection easier and to simplify instructions for use.
Funding
The Bill & Melinda Gates Foundation and Unitaid.
Introduction
Interest in HIV self-testing—an approach to increase access to HIV testing—has increased since 2014.1 As a discreet and convenient approach, HIV self-testing might be most useful in reaching people who are reluctant or unable to access existing HIV testing services because of concerns about privacy, stigma, discrimination, and, in some contexts, criminalisation. According to various studies,2, 3, 4, 5, 6, 7 HIV self-testing is highly acceptable among many different population groups, including those with low testing coverage and who report barriers to and low uptake of existing HIV testing services. Despite this, some policy makers and users have raised concerns that self-testers might not be able to do the test or interpret the test results correctly.1
We did a systematic review to assess the reliability and performance of HIV rapid diagnostic tests when used by self-testers, compared with health-care workers. Although previous reviews assessed the accuracy of rapid diagnostic tests for self-testing,8, 9 they primarily focused on sensitivity and specificity and did not consider the validity of the reference standard. Thus, we systematically measure and report test concordance between self-testers and health-care workers to account for imperfect reference standards to establish the reliability and performance of rapid diagnostic tests used for self-testing.
Research in context.
Evidence before this study
To diagnose HIV, at least two or three tests, depending on the HIV prevalence among the population being tested, are needed. The validity of using a single test as a reference standard is imperfect. Previous systematic reviews focused on sensitivity and specificity of HIV rapid diagnostic tests for self-testing. An initial search of PubMed, for studies published from Jan 1, 1995, to Jan 26, 2016, with the search terms “HIV self-testing” and “review”, indicated that the validity of the reference standard has not been considered previously.
Added value of this study
To inform a WHO recommendation, we assessed the reliability and performance of HIV rapid diagnostic tests used by self-testers compared with health-care workers, by calculating statistics on test concordance to account for the imperfect reference standard. We included studies that used products designed for self-testing in diverse country settings. Previous reviews were done when HIV self-testing was emerging; these reviews primarily drew from US and European studies that used professional-use products or prototypes that have since been adapted for HIV self-testing.
Implications of all the available evidence
Self-testers could reliably and accurately do an HIV rapid diagnostic test, whether assistance was provided or not, when compared with a trained health-care worker. Errors in the test procedure might be reduced by refinement of the design of rapid diagnostic tests for self-testing, improvement of manufacturer labelling and instructions for use, and provision of additional support with instructional videos. Modifications should always be the responsibility of the manufacturer.
Methods
Search strategy and selection criteria
This systematic review and meta-analysis followed the PRISMA standards (appendix pp 1–2). We searched PubMed, PopLine, and Embase for studies published from Jan 1, 1995, to April 30, 2016. We also reviewed six electronic HIV/AIDS conference databases (ie, Conference on Retroviruses and Opportunistic Infections, International AIDS Conference, International AIDS Society, American Public Health Association, National HIV Prevention Conference, and the HIV Diagnostics Conference) for all available years (appendix p 3).
We searched for grey literature through Google Scholar (first 100 titles of 201 results). We screened bibliographies of included articles and purposely selected and contacted experts (ie, academic researchers with ongoing studies on HIV self-testing) to identify additional sources. We contacted authors of relevant studies (up to two attempts) to retrieve relevant study information. We placed no language, age, study type, or geographical limitations on the search.
We included studies reporting the performance of rapid diagnostic tests by self-testers and those reporting the concordance or the sensitivity and specificity of rapid diagnostic tests compared with the results of testing done by a health-care worker. Two reviewers (CF, CJ) screened records independently and resolved disagreements through discussion and consensus.
We defined HIV self-testing as a process where an individual collects his or her specimen, does a test, and interprets their own test result.1 In the directly assisted approach, self-testers received an in-person demonstration of how to do the test or to interpret the test result; in the unassisted approach, self-testers were provided only with manufacturers' instructions for use included in the kit. All self-testers, irrespective of type of approach used, could access or receive assistance over the phone, through the internet, or with additional instructions (eg, videos, animations, or diagrams).10 We did not consider HIV counselling, linkage to care, and referral information as part of HIV self-testing assistance.10 We considered observed studies when participants were directly observed or video recorded to evaluate their HIV self-testing performance.
We excluded studies reporting on home specimen collection, concordance or sensitivity and specificity of self-testing, or self-monitoring devices for conditions other than HIV.
We defined the testing strategy used to establish the reference result as any testing sequence used to identify HIV infection (appendix pp 4–7). We classified testing strategies as aligned or not aligned with WHO guidance on the basis of the 2015 Consolidated guidelines on HIV testing services.11
Data analysis
We defined measures of concordance (inter-reader reliability) as the percentage agreement and Cohen's κ12 between the health-care worker and the self-tester.
We defined measurements of accuracy as specificity and sensitivity. HIV positivity among participants was based on the number of HIV-positive participants with known status or who received an HIV-positive diagnosis during the study. HIV positivity was then categorised as high (≥5%) or low (<5%).11
Given the imperfect or absent reference standards among studies, to evaluate performance of HIV rapid diagnostic tests used by self-testers we first assessed whether the result of the index and the reference test agreed or disagreed.13 We then calculated the raw estimates of sensitivity and specificity.
We extracted data for true reactive, true non-reactive, false reactive, and false non-reactive results to calculate κ and raw estimates of sensitivity and specificity, and explored the effect for oral fluid and blood-based rapid diagnostic tests separately, by type of assistance (direct assistance or no assistance) and type of observation. We calculated raw estimates of sensitivity and specificity with Meta-DiSc software;14 we did not consider invalid or indeterminate values to avoid skewness of results.
We determined quality of studies using the Standards for Reporting Studies of Diagnostic Accuracy (STARD) checklist,15 and the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2).16 We considered high risk of partial verification bias if more than 10% of study participants did not have their HIV test results and status confirmed, and if the selection of patients to receive the reference standard was not randomised. We considered a study to have a high risk of differential verification bias if more than 10% of patients received testing with a different reference standard. CF scored studies for quality in terms of risk of bias and concerns regarding applicability.
Given the high study variability and the inclusion of multiple reference standards, we pooled κ estimates using a random-effects model with the R package metaphor, version 3.4.4.13 We assessed heterogeneity by visual inspection of forest plots and calculation of the I2 statistic (>25–50% moderate).17
Role of the funding source
The funder of the study had no role in study design, data collection, data analysis, data interpretation, writing of the report, or the decision to submit for publication. The corresponding author had full access to all study data and final responsibility for the decision to submit for publication.
Results
After screening and removing duplicates, we included 25 studies4, 6, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 in the review (figure 1). All studies4, 6, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40 evaluated concordance, 15 studies4, 6, 18, 19, 20, 22, 26, 27, 28, 30, 31, 32, 34, 36, 38 evaluated sensitivity and specificity, and one25 only evaluated sensitivity. 15 studies4, 6, 18, 20, 24, 26, 28, 30, 31, 32, 33, 34, 35, 36, 38 used oral fluid-based rapid diagnostic tests, six21, 22, 25, 27, 37, 40 used blood-based rapid diagnostic tests, and four19, 23, 29, 39 used both.
13 studies4, 19, 22, 23, 24, 25, 26, 27, 28, 31, 32, 33, 39 reported on unassisted HIV self-testing, 116, 18, 20, 29, 30, 34, 35, 36, 37, 38, 40 reported on directly assisted HIV self-testing, and one21 reported on both approaches (table 1). 23 of 25 studies were observational in design (three cohort,4, 6, 19 18 cross-sectional,21, 22, 23, 24, 25, 26, 27, 28, 29, 32, 33, 34, 35, 36, 37, 38, 39, 40 and two cross-sectional and qualitative20, 30), and two were randomised controlled trials.18, 31 Sample size varied from 22 to 5662 participants. HIV positivity among participants was available in 28 reports from 22 studies; 19 (68%) of 28 reports had a high HIV positivity,4, 6, 18, 20, 22, 25, 26, 27, 28, 29, 30, 31, 32, 36, 39 and eight (29%) had low HIV positivity.4, 19, 23, 24, 33, 34, 35, 38 Reference test strategy was not available in five of 25 studies, or was not aligned with WHO testing guidance in another five studies (appendix pp 4–7). 16 (64%) of 25 studies were considered to be at low risk of bias and applicability across all key domains for QUADAS-2 (appendix p 8). 17 (68%) of 25 studies also failed to fulfil at least 60% of the STARD criteria, with a mean of 16·2 available items out of 34 (appendix pp 9–34).
Table 1.
Setting | Type of RDT specimen | Sample size | Male participants | Type of population | Age | Participants ever tested for HIV | Education | Study design | |
---|---|---|---|---|---|---|---|---|---|
Directly assisted studies | |||||||||
Prazuck et al (2016)37 | France, urban | Blood based | 411* | 54·5% | GP (100%) | .. | 78·6% (367/411) | .. | Cross-sectional |
Majam et al (2016)40 | South Africa, urban | Blood based | 60 | 46·7% | GP (100%) | .. | .. | 33% primary, 34% secondary, 33% tertiary | Cross-sectional |
MacGowan et al (2014)29 | USA, urban | Both | 22 | 100% | KP (100%) | .. | .. | 45% (10/22) college graduate or higher, 41% (9/22) some college, 14% (3/22) less than college | Cross-sectional |
Choko et al (2015)6 | Malawi, urban | Oral fluid based | 1649 | .. | GP (91·4%), PLHIV (8·5%) | .. | .. | .. | Cohort |
Choko et al (2011)20 | Malawi, urban | Oral fluid based | 283 | 48·1% | GP (92·7%), PLHIV (7·3%) | 27 years (IQR 22–32) | 62% (175/283) | 40·3% (114/283) primary or less, 59·7% (169/283) higher than primary education | Cross-sectional and qualitative |
Marley et al (2014)30 | China, urban | Oral fluid based | 229 | .. | GP (100%), VCT clients | .. | .. | .. | Cross-sectional and qualitative |
Martínez Pérez et al (2016)36 | South Africa, rural | Oral fluid based | 2198 | 33·7% | GP (84·7%), PLHIV (15·3%) | 27·5 years (IQR 22–36) | 94·1% (2068/2198) | .. | Cross-sectional |
Sarkar et al (2016)38 | India, rural | Oral fluid based | 202 | 0 | Pregnant women (100%) | .. | .. | .. | Cross-sectional |
Pant Pai et al (2013)34 | South Africa, urban | Oral fluid based | 251 | 21·1% | HCW (100%) | .. | 86·8% (218/251) | 59·8% (150/251) high school or less, 24·3% (61/251) college or technical school, 10·3% (26/251) university or higher, 4·8% (12/251) other | Cross-sectional |
Pant Pai et al (2014)35 | Canada, urban | Oral fluid based | 145 | 38·6% | GP (100%) | 22 years | 49·4% (124/145) | College 20·6% (30/145), vocational or trade school 13·1% (19/145), university or higher 66·2% (96/145) | Cross-sectional |
Asiimwe et al (2014; observed arm)18 | Uganda, rural | Oral fluid based | 123 | 62·6% | GP (100%) | 27 years (IQR 22–32) | 78·1% (96/123) | 70·7% (87/123) less than primary, 21·1% (26/123) primary complete, 8·1% (10/123) secondary or higher | Randomised controlled trial |
Asiimwe et al (2014; unobserved arm)18 | Uganda, rural | Oral fluid based | 123 | 52·1% | GP (100%) | 28 years (IQR 23–32) | 78·9% (97/123) | 75·6% (93/123) less than primary, 13·8% (17/123) primary complete, 10·6% (13/123) secondary or higher | Randomised controlled trial |
Unassisted studies | |||||||||
Lee et al (2007)27 | Singapore, urban | Blood based | 350 | 89·4% | GP (74·9%), PLHIV (25·1%) | 33 years (IQR 27–41) | 74·8% (262/350) | 12% (40/350) primary, 28% (97/350) secondary, 60% (210/350) at least tertiary education | Cross-sectional |
Gras et al (2014)25 | France, urban | Blood based | 40 | 75·0% | PLHIV (100%) | .. | .. | 32·5% (13/40) primary, 35% (14/40) secondary, 32·5% (13/40) tertiary education | Cross-sectional |
Dong et al (2014)22 | South Africa, rural | Blood based | 233 | 28·8% | GP (100%) | .. | 89·3% (208/233) | Less than high school 63·5% (148/233), high school 29·2% (68/233), some tertiary education 7·3% (17/233) | Cross-sectional |
Chavez et al (2016; oral fluid arm)19 | USA, urban | Both | 818† | 100% | .. | 27 years (range 18–54) | 82% (671/818) | 86% some college | Cohort |
Gaydos et al (2011; oral fluid arm)23 | USA, urban | Oral fluid based | 433 | 41·3% | GP (100%) | 38·5 years (12·7) | .. | .. | Cross-sectional |
Gaydos et al (2011; blood-based arm)23 | USA, urban | Blood based | 45 | 42·2% | GP (100%) | 37·2 years (13·0) | .. | .. | Cross-sectional |
Spielberg et al (2003)39 | USA, urban | Both | 340 | .. | PLHIV (100%) | .. | .. | .. | Cross-sectional |
Gaydos et al (2013)24 | USA, urban | Oral fluid based | 467 | 40·4% | GP (100%) | 41 years | .. | .. | Cross-sectional |
Kurth et al (2016)26 | Kenya, urban | Oral fluid based | 239 | 67·4% | GP (100%) | 35·9 years (9·7) | 90·7% (217/239) | 12·04 years of education (3·13) | Cross-sectional |
Li et al (2016)28 | China, urban | Oral fluid based | 200 | 100% | KP (100%) | 29·6 years (8·66) | 10% (10/200) primary or less; 44·5% (89/200) secondary, 45·5% (91/200) tertiary education | Cross-sectional | |
Nour et al (2012)33 | USA, urban | Oral fluid based | 249 | 42·2% | GP (100%) | 41 years | 0 (0/249)‡ | .. | Cross-sectional |
Mavedzenge et al (2015; urban arm)31 | Zimbabwe, urban | Oral fluid based | 172 | 47·0% | GP (91·1%), PLHIV (8·9%) | 30 years (range 18–70) | 80% (138/172) | .. | Randomised controlled trial |
Mavedzenge et al (2015; urban arm)31 | Zimbabwe, rural | Oral fluid based | 62 | 47·0% | GP (91·1%), PLHIV (8·9%) | 29 years (range 18–70) | 89% (55/62) | .. | Randomised controlled trial |
Ng et al (2012)32 | Singapore, urban | Oral fluid based | 994 | 88·5% | GP (63·7%), PLHIV (20%), KP (6·3%) | 32·4 years (IQR 27·1–40·5) | .. | 32·8% (326/994) less than high school, 29·8% (296/994) high school, 37·4% (372/994) at least college | Cross-sectional |
FDA phase 2b (2012; observed arm)4 | USA, urban | Oral fluid based | 1031 | 66·9% | GP (42·4%), PLHIV (51·3%) KP (6·3%) | .. | .. | 19·1% (197/1031) low literate; 45·6% (470/1031) high school or less | Cohort |
FDA phase 3 (2012; unobserved arm)4 | USA, urban | Oral fluid based | 5662§ | 51·3% | GP (86·9%), KP (13·1%) | .. | .. | Low literate 28·0% (1624/5662); high school or less 54·9% (3113/5662) | Cohort |
Directly assisted and unassisted studies | |||||||||
de la Fuente et al (2012; directly assisted arm)21 | Spain, urban | Blood based | 208 | 58·2% | GP (63·8%), KP (36·2%) | .. | 39·9% (83/208) | 57·2% (119/208) at least university, 41·3% (86/208) less than university | Cross-sectional |
de la Fuente et al (2012; unassisted arm)21 | Spain, urban | Blood based | 313 | 70·5% | GP (63·8%), KP (36·2%) | .. | 51·1% (160/313) | 48·5% (150/313) at least university, 51·5% (159/313) less than university | Cross-sectional |
Data are n, %, mean (SD), median (IQR), median, mean (range), or % (n/N). RDT=rapid diagnostic test. GP=general population. KP=key population (men who have sex with men, sex workers, people who inject drugs, transgender people, and people in prisons or closed settings). PLHIV=people living with HIV. VCT=voluntary counselling and testing. HCW=health-care worker.
Study was divided into two substudies: 264 participants performed the self-test, and 147 participants interpreted contrived pictures.
515 participants had all three results (both self-tests and dried blood home collection), 622 reported the oral fluid-based result, 565 reported the blood-based result, and 548 had the dried blood spot cards processed.
In the previous 6 months.
163 participants had no self-test results.
Of the 25 studies evaluating concordance between the result of an HIV rapid diagnostic test used by a self-tester compared with a result obtained by a health-care worker, 184, 6, 18, 19, 20, 21, 22, 23, 24, 28, 29, 31, 33, 34, 35, 37, 39, 40 reported raw percentage of agreement, three26, 27, 32 reported a κ statistic, and four23, 30, 36, 38 reported both (table 2).
Table 2.
HIVST concordance* | Reasons for disagreement† | HIV positivity | Type of observation | Errors in performance | Invalid results (invalid result/tests performed) | Reasons for invalid result | |
---|---|---|---|---|---|---|---|
Directly assisted studies | |||||||
Prazuck et al (2016)‡37 | 97·1% (142/147) | Non-reactive as reactive 2·7% (4/147), invalid as reactive 2·7% (4/147) or non-reactive 2·7% (4/147), non-reactive as invalid 1·4% (2/147) | .. | Observed | .. | 1% (2/264) | .. |
Majam et al (2016)40 | 88% (53/60) | Non-reactive as reactive 1·7% (1/60), non-reactive as invalid 1·7% (1/60), reactive as non-reactive 1·7% (1/60), invalid 6·7% (4/60) as reactive or non-reactive | .. | Observed | 20 participants made mistakes; common errors were with blood collection and transferring and use of buffer | .. | .. |
MacGowan et al (2014; oral fluid arm)29 | 95% (21/22) | Reactive as non-reactive 4·5% (1/22) | 22·7% (5/22) | Observed | 13·6% (3/22) participants made mistakes, common errors were spilling buffer and incorrect time to read the results | 4·5% (1/22) | .. |
MacGowan et al (2014; blood-based arm)29 | 95% (20/21) | One HIV-positive participant with an invalid result interpreted his result as reactive 4·8% (1/21) | 19% (4/21) | 23·8% (5/21) participants made mistakes; common errors were incorrectly pushing the device into test holder and incorrect timing to read the results; one participant broke the device | 9·5% (2/21) | Operational error | |
Choko et al (2015)§6 | 99·4%, 98·9%–99·7% (1639/1649) | Reactive as non-reactive 0·5% (9/1649), non-reactive as reactive 0·06% (1/1649) | 8·6% (141/1649) | Observed | .. | .. | .. |
Choko et al (2011)¶20 | 99·2%, 97–100% (256/258)‖ | One HIV-positive participant with a faint reactive result interpreted his result as uncertain, one HIV-positive participant had an invalid result | 16·9% (48/283) | Non-observed | Common errors were touching collection pad, incorrect or incomplete swabbing, removing kit from developer too early, buffer spills, reading incorrectly, and fumbling vial or cap when opening developer fluid | 0·4% (1/260) | .. |
Marley et al (2014)30 | 93·9% (215/229), κ 0·551, p=0·012 | Reactive as invalid 3·1% (7/229), non-reactive as reactive 1·3% (3/229), invalid as non-reactive 1·3% (3/229), non-reactive as invalid 0·4% (1/229) | 5·6% (13/229) | Observed | Common errors were unpreparedness before start 42% (94/229), inability to swab correctly 10% (23/229), buffer 15·9% (36/229), testing and reading test results 7·5% (17/229) | 3·5% (8/229) | Six participants used test paper incorrectly |
Martínez Pérez et al (2016)36 | 99·4% (2184/2198), κ 0·99** | Reactive as non-reactive 0·2% (4/2181) | 15·3% (337/2198) | Observed | Two participants had to repeat the self-test, they accidentally spilled buffer vial; excluding known people living with HIV, 0·18% (4/2181) interpreted their tests as negative whereas the HCW interpreted the result as positive | 0·5% (11/2198) | .. |
Sarkar et al (2016)38 | 98%, κ 0·566, p<0·001 | Invalid as non-reactive 0·5% (1/202), non-reactive as invalid 0·9% (2/202) | 0·9% (2/202) | Observed | .. | 0·9% (2/202) | .. |
Pant Pai (2013)34 | 98·8% (248/251) | Reactive as non-reactive 1·2% (3/251), two of which had a faint reactive line | 3·6% (9/251) | Non-observed | Errors were in conducting and interpreting results | .. | .. |
Pant Pai et al (2014)35 | 100% (145/145) | No difference between self-tester and HCW interpretation | 0 | Non-observed | .. | .. | .. |
Asiimwe et al (2014; observed arm)18 | 99·2% (122/123) | Non-reactive as invalid 0·8% (1/123) | 10·6% (13/123) | Observed | 19·5% (24/123) participants made mistakes; common errors were incorrect swabbing of gums, touching the collection pad and buffer spills | 0·8% (1/123) | .. |
Asiimwe et al (2014; unobserved arm)18 | 94·3% (116/123) | .. | 16·3% (20/123) | Non-observed | No errors were reported | 0·8% (1/117)†† | |
Unassisted studies | |||||||
Lee et al (2007)27 | κ 0·277, p<0·001 | Invalid as non-reactive 50·1% (176/350), invalid as reactive 4·6% (16/350) and reactive as non-reactive 0·3% (1/350) | 25% (88/350) | Observed | .. | 56·3% (197/350) | 85% failed to perform all steps correctly |
Gras et al (2014)25 | 100% | No difference between self-tester and HCW interpretation | 100% (40/40) | Observed | Common errors were insufficient blood, wrong lancet utilisation, and mixing of samples | 5·7% (2/35) | .. |
Dong et al (2014)22 | 98·7% (230/233) | Reactive as non-reactive 0·5% (1/195), invalid as non-reactive 0·5% (1/195), non-reactive as invalid 0·5% (1/195) | 18·9% (44/233) | Observed (video recorded) | .. | 0·4% (1/233) | .. |
Chavez et al (2016; oral fluid arm)‡‡19 | 98% (500/511) | Non-reactive as reactive 1·4%, non-reactive as invalid 0·8% | 2% (11/622) | Non-observed | .. | .. | .. |
Chavez et al (2016; blood-based arm)‡‡19 | 99% (506/511) | Non-reactive as reactive 0·6%, non-reactive as invalid 0·4% | 1% (7/565) | 4·6% (26/565) | Operational error | ||
Gaydos et al (2011)§§23 | 99·6%, 0·41–1·00 (476/478) weighted κ 0·75 | Reactive as non-reactive 0·2% (1/478) | 0·8% (4/478) | Observed | Difficulties were interpreting results, reading result chart, reading or following instructions, swabbing or pricking properly, or both, and opening the kit | 0·2% (1/478) | Insufficient blood |
Spielberg et al (2003; oral fluid arm)39 | 95% | .. | 100% (340/340) | Non-observed | Difficulties performing test decreased through changes made to instructions and labelling from 4·3% to 4% | 4·1% (14/340) | Failure to put the test device in the vial with developer solution |
Spielberg et a; (2003; blood-based arm)39 | 97% | 100% (340/340) | Difficulties performing test decreased through changes made to instructions and labelling from 14% to 9% | 7·9% (27/340) | |||
Gaydos et al (2013)24 | 100% | No difference between self-tester and HCW interpretation | 0·2% (1/467) | Observed | .. | .. | .. |
Kurth et al (2016)26 | κ 0·92 (0·84–0·99) | Non-reactive as invalid 12·5% (30/239), reactive as non-reactive 1·2% (3/239), non-reactive as reactive 0·4% (1/239) | 14·6% (35/239) | Observed (video recorded) | Common errors were difficulty opening bottle, incorrect or incomplete swab of gums, and incorrect time to read the results; some individuals could have made multiple errors | 15·1% (36/239) | All individuals recognised something went wrong with their test |
Li et al (2016)28 | 95% (190/200) | Non-reactive as invalid 2·5% (5/200), reactive as non-reactive 1·5% (3/200), non-reactive as reactive 0·5% (1/200) | 27·5% (55/200) | Observed | Common errors were incorrect or incomplete swab of gums, incorrect time to read the results, touching the collection pad, and buffer spills | 3% (6/200) | .. |
Nour et al (2012)33 | 100% | No difference between self-tester and HCW interpretation | 1·6% (4/249) | Observed | .. | .. | .. |
Mavedzenge et al (2015; urban arm)¶¶31 | 93% (160/172) | Non-reactive as invalid 2% (3/172) | 9% (16/172) | Observed (video recorded) | Common errors were confusion with desiccant, buffer spills, dipping test device in developer before collecting sample, incorrect sampling, and incorrect time to read the results. | 2·9% (5/172) | Participants with invalid results typically did not follow instructions |
Mavedzenge et al (2015; rural arm)31 | 90% (56/62) | Non-reactive as reactive 4·8% (3/62) | 8% (5/62) | 3·2% (2/62) | |||
Ng et al (2012)32 | κ 0·97, 0·95–0·99 | Reactive as non-reactive 2·6% (5/983), reactive as invalid 0·5% (1/983), non-reactive as invalid 0·3% (2/983) and non-reactive as reactive 0·1% (1/983) | 19·3% (192/994) | Observed | Common errors were incorrect or incomplete swab of gums, touching the collection pad during removal from packaging, or buffer spills | 0·3% (3/983) | .. |
FDA phase 2b (2012; observed arm)‖‖4 | 93% (942/1013) | Reactive as non-reactive 0·9% (10/1013), non-reactive as reactive 0·1% (1/1013) | 2·1% (120/5662) | Observed | Common errors were interpreting results (11/986), dipping device in developer prior to swabbing gums (11/986), buffer spills (4/986), incorrect swabbing (5/986), and could not find developer (2/986) | 3·3% (33/986) | Operational errors |
FDA phase 3 (2012; unobserved arm)***4 | 99·8% (5490/5499) | Reactive as non-reactive 0·1% (8/5499), non-reactive as reactive 0·01% (1/5499) | 51% (526/1031) | Non-observed | Not understanding where to place the test stick after sample collection (1/4999) | 0·6% (31/4999) | .. |
Directly assisted and unassisted studies | |||||||
de la Fuente et al (2012; directly assisted arm)21 | 85·4% (445/521) | Invalid as reactive 2·8%, non-reactive as reactive 2·7%, non-reactive as invalid 2%, reactive as invalid 1·9%, invalid as non-reactive 1·5% and reactive as non-reactive 1·1% | .. | Observed | .. | 0·9% (2/208) | Most difficult step was obtaining blood and depositing it in the correct place |
de la Fuente et al (2012; directly assisted arm)21 | 85·4% (445/521) | Invalid as reactive 2·8%, non-reactive as reactive 2·7%, non-reactive as invalid 2·1%, reactive as invalid 1·9%, invalid as non-reactive 1·5% and reactive as non-reactive 1·1% | .. | Observed | .. | 0·9% (2/208) | Most difficult step was obtaining blood and depositing it in the correct place |
Data are % (n/N); %, 95% CI, (n/N), κ, p value; or κ (95%CI). HIVST=HIV self-test. HCW=health-care worker. FDA=Food and Drug Administration.
Reported as percentage of agreement or κ.
Reason for disagreement assumes the self-tester perspective compared with the HCW.
The study was divided into two substudies: 264 participants performed the self-test, and 147 participants interpreted contrived pictures.
Four participants were on antiretrovirals.
260 of 283 participants self-tested.
Two participants had no confirmatory results.
17 known people living with HIV were not considered to calculate the κ.
Six participants had no results.
515 participants had all three results (both self-tests and dried blood home collection [dried blood spot]), 622 reported the oral fluid-based result, 565 reported the blood-based result, and 548 had the dried blood spot cards processed.
Disaggregated results by type of specimen were not available. One participant was on antiretrovirals with undetectable viral load.
One participant in the urban arm was on antiretrovirals.
1013 of 1031 participants completed the study.
18 positives and 482 negatives were excluded from the accuracy analysis.
Reported κ ranged from fair (κ 0·277, p<0·001) to almost perfect (κ 0·99).23, 26, 27, 30, 32, 36, 38 The raw proportion of agreement was high, ranging from 85·4% to 100%. Overall, our estimates of pooled agreement across studies were almost perfect for both types of approaches (directly assisted κ 0·98, 95% CI 0·96 to 0·99; unassisted κ 0·97, 0·96 to 0·98; I2=34·5%, 0 to 97·8; figure 2). Pooled estimates according to whether HIV self-testing was observed or not also had almost perfect agreement (observed 0·98, 0·96 to 0·99; unobserved 0·96, 0·94 to 0·99; I2=43·0%, 38·8 to 98·4; figure 2). The lowest estimated agreement (κ 0·47, −0·04 to 0·97) was in rural Zimbabwe; the study investigators attributed poor performance to low literacy in the population tested, and verbose instructions that needed further optimisation.31
The proportion of disagreements, assuming the self-tester perspective compared with health-care worker, ranged from 0% to 14·6%. Across 29 reports from 25 studies, four reports24, 25, 33, 35 found no difference in interpretation between self-testers and health-care workers. Most reported differences resulted from the interpretation of a reactive result as non-reactive (0·01–4·8%, 13 of 29 reports), a reactive result as invalid (2·7–6·7%, five of 29), a non-reactive result as reactive (0·1–4·5%, 14 of 29), an invalid result as reactive (0·5–3·1%, three of 29), an invalid result as non-reactive (0·3–12·5%, 13 of 29), or a non-reactive result as invalid (0·5–50%, seven of 29).
Reasons for disagreements were higher in directly assisted studies (2·7–6·7%) than unassisted studies (4·6%) when interpreting a reactive result as invalid, and were higher in unassisted studies (0·01–4·8%) than directly assisted studies (0·06–2·7%) when interpreting a reactive result as non-reactive.
Across 20 reports from 16 studies,4, 6, 18, 19, 20, 22, 25, 26, 27, 28, 30, 31, 32, 34, 36, 38 16 (80%) of 20 reports had specificity of more than 98%. Sensitivity varied substantially; 18 (90%) of 20 reports had sensitivity of at least 80%. Two studies reported sensitivity of less than 80%: one34 had insufficient information on how to interpret faint positive lines, and the other31 suggested lengthy instructions were a barrier to participants in the rural arm, in which literacy levels were lower than the urban arm. Excluding these two studies,31, 34 sensitivity estimates were higher for blood-based rapid diagnostic tests (96·2–100%)19, 22, 25, 27 than oral fluid-based rapid diagnostic tests (80–100%),4, 6, 18, 20, 26, 28, 30, 31, 32, 36, 38 as were specificity estimates (blood-based 99·5–100% vs oral fluid 95·1–100%). Studies4, 6, 18, 22, 25, 26, 27, 28, 30, 31, 32, 36, 38 in which testing was observed reported a modest difference in sensitivity (80–100%) compared with unobserved studies4, 18, 19, 20, 34 (88·9–100%; table 3).
Table 3.
Sensitivity | TR/(TR+FN) | Specificity | TN/(TN+FR) | HIV positivity | Type of population | |
---|---|---|---|---|---|---|
Unobserved studies | ||||||
Pant Pai et al (2013)*34 | 66·7% (29·9–92·5) | 6/(6+3) | 100% (98·5–100) | 242/(242+0) | 3·6% (9/251) | HCW (100%) |
Asiimwe et al (2014; unobserved arm)*18 | 90·0% (68·3–98·8) | 18/(18+2) | 95·1% (89·0–98·4) | 98/(98+5) | 17·2% (20/116) | GP (100%) |
Chavez et al (2016; blood-based arm)†19 | 100% (54·1–100) | 6/(6+0) | 100% (99·2–100) | 486/(486+0) | 1·7% (9/515) | KP (100%) |
Chavez et al (2016; oral fluid arm)*19 | 88·9% (51·8–99·7) | 8/(8+1) | 100% (99·3–100) | 501/(501+0) | 1·7% (9/515) | KP (100%) |
FDA phase 3 (2012)*4 | 91·7% (84·2–96·3) | 88/(88+8) | 100% (99·9–100) | 4902/(4902+1) | 1·9% (96/4903) | GP (86·9%), KP (13·1%) |
Choko et al (2011)*20 | 97·9% (88·9–99·9) | 47/(47+1) | 100% (98·3–100) | 210/(210+0) | 16·9% (48/283) | GP (100%) |
Observed studies | ||||||
Gras et al (2014)†25 | 96·2% (80·4–99·9) | 25/(25+1) | .. | .. | 100% (26/26) | PLHIV (100%) |
Lee (2007)†27 | 98·8% (93·5–100) | 83/(83+1) | 99·6% (97·9–100) | 260/(260+1) | 24·3% (84/345) | GP (90%), KP (10%) |
Dong et al (2014)†22 | 97·7% (88·0–99·9) | 43/(43+1) | 99·5% (97·1–100) | 186/(186+1) | 19·0% (44/231) | GP (100%) |
Sarkar et al (2016)*38 | 100% (15·8–100) | 2/(2+0) | 100% (98·1–100) | 197/(197+0) | 0·9% (2/202) | Pregnant women (100%) |
Marley et al (2014)‡30 | 100% (54·1–100) | 6/(6+0) | 98·6% (95·9–99·7) | 209/(209+3) | 5·8% (13/222) | GP (100%), VCT clients |
Choko et al (2015)§6 | 93·6% (88·2–97·0) | 132/(132+9) | 99·9% (99·6–100) | 1507/(1507+1) | 8·6% (141/1649) | GP (100%) |
Asiimwe et al (2014; observed arm)*18 | 100% (75·3–100) | 13/(13+0) | 99·1% (95·0–100) | 109/(109+4) | 10·6% (13/122) | GP (100%) |
Martínez Pérez et al (2016)*36 | 98·8% (96·9–99·7) | 323/(323+4) | 100% (99·8–100) | 1860/(1860+0) | 14·9% (327/2187) | GP (100%) |
Li et al (2016)*28 | 94·4% (84·6–98·8) | 51/(51+3) | 99·3% (96·1–100) | 139/(139+1) | 28·9% (55/190) | KP (100%) |
Kurth et al (2016)*26 | 89·7% (72·6–97·8) | 26/(26+3) | 99·4% (96·8–100) | 173/(173+1) | 14·3% (29/203) | GP (100%) |
Mavedzenge et al (2015; rural arm)*31 | 66·7% (9·4–99·2) | 2/(2+1) | 94·7% (85·4–98·9) | 54/(54+3) | 8% (5/62) | GP (100%) |
Mavedzenge et al (2015; urban arm)*‡31 | 80·0% (28·4–99·5) | 4/(4+1) | 97·8% (88·5–99·9) | 145/(145+1) | 9% (16/172) | GP (100%) |
Ng et al (2012)*32 | 97·4% (94·0–99·1) | 186/(186+5) | 99·9% (99·3–100) | 791/(791+1) | 19·3% (192/994) | GP (63·7%), PLHIV (20%), KP (16·3%) |
FDA phase 2b (2012)*4 | 97· 9% (96·2–99·0) | 470/(470+10) | 99·8% (98·8–100) | 472/(472+1) | 51·9% (526/1013) | GP (42·4%), PLHIV (513%), KP (6·3%) |
Directly assisted studies | ||||||
Pant Pai et al (2013)*34 | 66·7% (29·9–92·5) | 6/(6+3) | 100% (98·5–100) | 242/(242+0) | 3·6% (9/251) | HCW (100%) |
Sarkar et al (2016)*38 | 100% (15·8–100) | 2/(2+0) | 100% (98·1–100) | 197/(197+0) | 0·9% (2/202) | Pregnant women (100%) |
Choko et al (2011)*20 | 97·9% (88·9–99·9) | 47/(47+1) | 100% (98·3–100) | 210/(210+0) | 16·9% (48/283) | GP (100%) |
Choko et al (2015)*§6 | 93·6% (88·2–97·0) | 132/(132+9) | 99·9% (99·6–100) | 1507/(1507+1) | 8·6% (141/1649) | GP (100%) |
Marley et al (2014)*‡30 | 100% (54·1–100) | 6/(6+0) | 98·6% (95·9–99·7) | 209/(209+3) | 5·8% (13/222) | GP (29%) |
Asiimwe et al (2014; observed arm) *18 | 100% (75·3–100) | 13/(13+0) | 99·1% (95·0–100) | 109/(109+4) | 10·6% (13/122) | GP (100%) |
Asiimwe et al (2014; unobserved arm)*18 | 90·0% (68·3–98·8) | 18/(18+2) | 95·1% (89·0–98·4) | 98/(98+5) | 17·2% (20/116) | GP (100%) |
Martínez Pérez et al (2016)*36 | 98·8% (96·9–99·7) | 323/(323+4) | 100% (99·8–100) | 1860/(1860+0) | 14·9% (327/2187) | GP (100%) |
Unassisted studies | ||||||
Gras et al (2014)†25 | 96·2% (80·4–99·9) | 25/(25+1) | .. | .. | 100% (26/26) | PLHIV (100%) |
Lee et al (2007)†27 | 98·8% (93·5–100) | 83/(83+1) | 99·6% (97·9–100) | 260/(260+1) | 24·3% (84/345) | GP (90%), KP (10%) |
Dong et al (2014)†22 | 97·7% (88·0–99·9) | 43/(43+1) | 99·5% (97·1–100) | 186/(186+1) | 19·0% (44/231) | GP (100%) |
Chavez et al (2016; blood-based arm)†19 | 100% (54·1–100) | 6/(6+0) | 100% (99·2–100) | 486/(486+0) | 1·7% (9/515) | KP (100%) |
Chavez et al (2016; oral fluid arm) *19 | 88·9% (51·8–99·7) | 8/(8+1) | 100% (99·3–100) | 501/(501+0) | 1·7% (9/515) | KP (100%) |
Li et al (2016)*28 | 94·4% (84·6–98·8) | 51/(51+3) | 99·3% (96·1–100) | 139/(139+1) | 28·9% (55/190) | KP (100%) |
Kurth et al (2016)*26 | 89·7% (72·6–97·8) | 26/(26+3) | 99·4% (96·8–100) | 173/(173+1) | 14·3% (29/203) | GP (100%) |
FDA phase 3 (2012)*4 | 91·7% (84·2–96·3) | 88/(88+8) | 100% (99·9–100) | 4902/(4902+1) | 1·9% (96/4903) | GP (86·9%), KP (13·1%) |
Mavedzenge et al (2015; rural arm)31 | 66·7% (9·4–99·2) | 2/(2+1) | 94·7% (85·4–98·9) | 54/(54+3) | 8% (5/62) | GP (100%) |
Mavedzenge et al (2015; urban arm)*¶31 | 80·0% (28·4–99·5) | 4/(4+1) | 97·8% (88·5–99·9) | 45/(45+1) | 9% (16/172) | GP (100%) |
Ng et al (2012)*32 | 97·4% (94·0–99·1) | 186/(186+5) | 99·9% (99·3–100) | 791/(791+1) | 19·3% (192/994) | GP (63·7%), PLHIV (20%), and KP (16·3%) |
FDA phase 2b (2012)*4 | 97·9% (96·2–99·0) | 470/(470+10) | 99·8% (98·8–100) | 472/(472+1) | 51·9% (526/1013) | GP (42·4%), PLHIV (51·3%), and KP (6·3%) |
Data are % (95% CI) or n/(n+n). TR=true reactive result. FR=false reactive result. FN=false non-reactive result. TN=true non-reactive result. HCW=health-care worker. GP=general population. KP=key population. FDA=US Food and Drug Administration. PLHIV=people living with HIV. VCT=voluntary counselling and testing
Oral fluid-based.
Finger stick-based or whole blood-based.
This study assessed accuracy in a subsample of participants (229/800).
Four participants were on antiretrovirals; they tested negative via self-test and positive in confirmatory testing.
A study31 from Zimbabwe with oral fluid-based rapid diagnostic tests, with data disaggregated by setting, found that urban populations had higher sensitivity (80%, 95% CI 28·4–99·5) than rural populations with lower literacy (66·7%, 9·4–99·2), and that this was also the case for specificity (urban 97·8%, 88·5–99·9 vs rural 94·7%, 85·4–98·9).
All studies6, 18, 20, 30, 34, 36, 38 addressing directly assisted HIV self-testing used oral fluid-based rapid diagnostic tests. The estimated sensitivity was similar to that in studies4, 19, 26, 28, 31, 32 with oral fluid rapid diagnostic tests within the unassisted approach (table 3).
Three studies included some participants taking antiretroviral drugs. In two studies,6, 31 participants had non-reactive test results, but later received confirmatory testing and were diagnosed or disclosed their HIV statuses afterward. In the third study,23 self-testers and health-care workers both obtained non-reactive results because they used the same oral test.
We identified 25 reports from 20 studies with information on invalid results: seven reports20, 29, 30, 31, 32, 36, 40 used the directly assisted approach, six23, 25, 26, 28, 34, 39 used the unassisted approach and two4, 18 used both approaches. Invalid results ranged from one (0·2%) of 478 tests to 197 (56·3%) of 350 tests (table 2).4, 18, 19, 20, 21, 22, 23, 25, 26, 27, 28, 29, 30, 31, 32, 36, 37, 38, 39
Although most participants were able to obtain a correct result, user errors among self-testers were noted in 15 reports. Of these reports, two found a high proportion of user error: one27 reported most users were unable to take blood samples or transfer blood specimens correctly (197 invalid results from 350 tests; 56·3%), and the other26 reported users were aware of making mistakes (36 invalid results from 239 tests; 15·1%). Excluding these studies, the proportion of invalid results was similar in studies20, 29, 30, 31, 32, 36, 37, 38, 40 of the directly assisted approach (0·3–9·5%) and studies19, 22, 23, 25, 28, 39 of the unassisted approach (0·2–7·9%). The proportion of invalid results was higher in studies4, 18, 21, 22, 23, 25, 28, 29, 30, 31, 32, 36, 37, 38 in which testing was observed (0·2–9·5%) when compared with unobserved studies4, 18, 19, 20, 39 (0·4–7·9%).
The proportion of studies reporting invalid results among self-testers was greater in studies19, 21, 22, 25, 27, 29, 37, 39, 40 using blood-based rapid diagnostic tests (0·4–9·5%) than studies4, 18, 20, 23, 26, 28, 30, 31, 32, 36, 38 using oral fluid-based rapid diagnostic tests (0·2–4·5%). Excluding studies21, 25, 26, 27, 29, 39 with feasibility of less than 60%, the proportion of invalid results was less than 5% (0·2–4·6%), regardless of the approach or specimen.
User errors described in studies of the directly assisted approach were incorrect or incomplete specimen collection (finger prick or oral swab),20, 21, 30, 31, 40 incorrect use or spillage of buffer,20, 29, 30, 31, 36, 40 incorrect transfer of blood specimen, and problems with the interpretation of results.4, 20, 23, 30, 34, 39, 40 Reported errors in studies of the unassisted approach included specimen collection (finger prick or oral swab),23, 26, 28 misinterpretation of test results,23, 34 incorrect time to read the results,26, 28 test kit opened incorrectly,23, 26 incorrect use or spillage of buffer,28 instructions not followed or read,23 or incorrect transfer of the blood specimen.25
In general, reported errors in performance were similar by type of specimen; however, studies using oral fluid rapid diagnostic tests reported errors in the interpretation of test results and studies using blood-based rapid diagnostic tests reported errors in transfer of the blood specimen.
Two studies4, 25 found that people with known HIV status had a higher proportion of errors (ie, when collecting the specimen) when self-testing compared with people with unknown HIV status (0·8% vs 0·2%), whereas a third study27 found that known HIV-positive participants were more likely to do the test correctly.
Discussion
Self-testers can achieve the same results as health-care workers when using HIV rapid diagnostic tests and diagnostic accuracy of rapid diagnostic tests for self-testing is high. Application of the estimated ranges of sensitivity (80–100%) and specificity (95·1–100%) to a hypothetical group of 100 000 people with 1% of HIV prevalence would result in 0–200 HIV-positive cases being missed, and 0–4851 HIV-negative individuals being misidentified with a reactive result, excluding two outliers.31, 34
This systematic review and meta-analysis suggests that in the hands of self-testers, the sensitivity and specificity of blood-based rapid diagnostic tests were higher than oral fluid rapid diagnostic tests, although fewer studies used blood-based rapid diagnostic tests. The reduced sensitivity is probably explained by the lower quantity of HIV antibodies in oral fluid compared with whole blood, as observed in professional-use assessments.41 Although blood-based rapid diagnostic tests might have the potential to deliver more accurate results, more invalid results might occur because the greatest number of user errors was related to standard procedures when capillary tubes and pipettes were used.
Most studies had a high HIV positivity among participants where tests are expected to have a higher positive-predictive value than lower prevalence populations. Furthermore, imperfect reference standards might also decrease the degree of accuracy. We found wide variability in sensitivity estimates, which could be explained by the use of adapted rapid diagnostic tests not specifically designed for self-test use,21, 22, 23, 25, 27, 37, 39, 40 or used test kits before approval by national regulatory authorities, in 23 of 25 included studies.4, 19, 20, 24, 26, 28, 29, 30, 31, 32, 33, 34, 35, 36, 38
When we excluded six studies21, 25, 26, 27, 29, 39 with low feasibility, the proportion of invalid results met the minimum acceptable criteria for rapid diagnostic test performance (<5%);42 however, we still found no significant differences in the proportion of invalid results by type of approach, suggesting that use of a rapid diagnostic tests for self-testing without assistance will not increase the possibility of obtaining an invalid result.
Most invalid results and errors in performance reported by studies included in this review related to user errors and manufacturing defects. These invalid results and errors can be mitigated with instructions for use because the complexity of the test procedure or the complexity of the instructions can increase the possibility of failure in performance and incorrect interpretation of a result.
Recommendations include use of simple and clear language and well designed pictorial instructions, especially for the steps related to specimen collection, buffer use, and interpretation of results;43 easily identifiable kit components; reduction of the volume of specimen needed to do the test; and intuitive single-step test kits with controlled and automatic specimen collection, transfer, and buffer use.
In some settings, instructions could be adapted and validated for the cultural context and for less-skilled users, including individuals with low literacy or visual impairments. This could include translation in local languages, clear and large print instructions for use, detailed images and descriptions, or electronic documents or audio instructions. To improve performance of less-skilled users, instructions for use could be coupled with in-person or video demonstrations on how to do the test and interpret the result.
Product labelling should clearly state that people with reactive or invalid test results should seek further testing at a health facility. The labelling should also include information on test limitations in detecting HIV infection during the window period, for people taking pre-exposure prophylaxis, or in people with a suppressed immune response, such as people on antiretroviral drugs. This is a crucial issue because reports show that people with HIV on antiretrovial therapy might be using HIV self-testing kits to check and reconfirm their HIV status, and could obtain a false-negative result.44, 45
Strengths of this study include completeness of the search strategy, explicit inclusion criteria, a systematic approach to data collection, and independent assessment of each included study. Among the limitations were that most included studies used oral fluid-based rapid diagnostic tests, and studies used different and imperfect reference standard tests to identify HIV-positive individuals. Most studies did not compare approaches or specimens head-to-head. Results were considered biased in studies where the reference test strategy was not aligned with WHO testing guidance. Our last search was done on April 30, 2016, and since then 11 studies reporting on HIV self-testing performance have been published,46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56 of which eight are abstracts. These studies reached the same conclusion as we did, reporting that most participants were able to do the self-test correctly, and, where reported, the raw proportion of agreement was also high, ranging from 84% to 99%.45, 46, 48, 49, 55 Six of 11 studies would not have met our inclusion criteria, and the four that did meet the inclusion criteria might not have influenced our findings. Furthermore, most studies used adapted test kits that were not specifically designed or packaged for self-testing, and in some studies participants did not interpret their own results, but interpreted contrived devices or pictures or photographs.
No study provided information on people recently or acutely infected with HIV, and no study disaggregated data by individuals taking antiretroviral drugs. Because little data were available, we could not explore the influence of HIV prevalence, type of reference used, or study design. Selection bias is likely because most studies carefully selected participants; some studies included only known HIV-positive individuals. We did not assess publication bias because analytical methods are not well suited for testing observational data.57 Finally, although most studies were judged to be at low risk of bias, concerns remained about studies with small samples and the extent to which the findings can be generalised.
In summary, self-testers can achieve a high level of agreement with the results obtained by a health-care worker when using an HIV rapid diagnostic test for self-testing, whether or not assistance was provided. Errors in performance of the test procedure might be reduced through improvement of the design of rapid diagnostic tests for self-testing, clearer product labels, inclusion of simple instructions for use, and provision of additional support, such as instructional videos.
Acknowledgments
Acknowledgments
The Bill & Melinda Gates Foundation, in collaboration with the STAR Consortium, which is supported by Unitaid (grant number PO#10140-0-600), funded the study. We are grateful for the technical input from the WHO Technical Working Group on HIV self-testing. Special thanks to the investigators who provided additional data and shared their studies with us. Thanks also to Mary Henderson for language editing and proofreading.
Contributors
CF, CJ, and RB developed the initial study design. CF and CJ searched for and extracted data. CF analysed the findings and wrote the first draft of the manuscript. All authors collaboratively discussed key decisions throughout the course of the review, provided critical feedback on preliminary drafts and interpretation of results, and approved the final manuscript.
Declaration of interests
We declare no competing interests.
Supplementary Material
References
- 1.WHO . World Health Organization; Geneva: 2014. March 2014 supplement to the 2013 consolidated guidelines on the use of antiretroviral drugs for treating and preventing HIV infection: recommendations for a public health approach.http://www.who.int/hiv/pub/guidelines/arv2013/arvs2013upplement_march2014/en/ [PubMed] [Google Scholar]
- 2.Kalibala S, Tun W, Cherutich P, Nganga A, Oweya E, Oluoch P. Factors associated with acceptability of HIV self-testing among health care workers in Kenya. AIDS Behav. 2014;18:S405–S414. doi: 10.1007/s10461-014-0830-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Desmond N, Corbett E. Introducing HIV self-testing technologies for couples in resource-poor contexts: a case study in urban Malawi. International AIDS Conference; Melbourne, VIC, Australia; July 20–25, 2014. TUSA14.
- 4.US Food and Drug Administration; Rockville, MD: 2012. Summary of safety and effectiveness.https://www.fda.gov/downloads/BiologicsBloodVaccines/BloodBloodProducts/ApprovedProducts/PremarketApprovalsPMAs/UCM312534.pdf [Google Scholar]
- 5.Ngure K, Renee H, Mugo N. Uptake of HIV self-testing among people receiving PrEP in Kenya. AIDS Res Hum Retroviruses. 2014;30:A70. [Google Scholar]
- 6.Choko AT, MacPherson P, Webb EL. Uptake, accuracy, safety, and linkage into care over two years of promoting annual self-testing for HIV in Blantyre, Malawi: a community-based prospective study. PLoS Med. 2015;12:e1001873. doi: 10.1371/journal.pmed.1001873. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Figueroa C, Johnson C, Verster A, Baggaley R. Attitudes and acceptability on HIV self-testing among key populations: a literature review. AIDS Behav. 2015;19:1949–1965. doi: 10.1007/s10461-015-1097-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Pant Pai N, Sharma J, Shivkumar S. Supervised and unsupervised self-testing for HIV in high- and low-risk populations: a systematic review. PLoS Med. 2013;10:e1001414. doi: 10.1371/journal.pmed.1001414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Krause J, Subklew-Sehume F, Kenyon C, Colebunders R. Acceptability of HIV self-testing: a systematic literature review. BMC Public Health. 2013;13:735. doi: 10.1186/1471-2458-13-735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.WHO . World Health Organization; Geneva: 2016. Guidelines on HIV self-testing and partner notification. Supplement to the consolidated guidelines on HIV testing services. [PubMed] [Google Scholar]
- 11.WHO . World Health Organization; Geneva: 2015. Consolidated Guidelines on HIV Testing Services. [PubMed] [Google Scholar]
- 12.Cohen J. A coefficient of agreement for nominal scales. Edus Psychol Meas. 1960;20:37–46. [Google Scholar]
- 13.Trikalinos TA, Balion CM. Chapter 9: options for summarizing medical test performance in the absence of a “gold standard”. J Gen Intern Med. 2012;27(suppl 1):S67–S75. doi: 10.1007/s11606-012-2031-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Zamora J, Abraira V, Muriel A, Khan K, Coomarasamy A. Meta-DiSc: a software for meta-analysis of test accuracy data. BMC Med Res Methodol. 2006;6:31. doi: 10.1186/1471-2288-6-31. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Bossuyt PM, Reitsma JB, Bruns DE. STARD 2015: an updated list of essential items for reporting diagnostic accuracy studies. BMJ. 2015;351:h5527. doi: 10.1136/bmj.h5527. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Whiting PF, Rutjes AW, Westwood ME. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155:529–536. doi: 10.7326/0003-4819-155-8-201110180-00009. [DOI] [PubMed] [Google Scholar]
- 17.Higgins JP, Thompson SG. Quantifying heterogeneity in a meta-analysis. Stat Med. 2002;21:1539–1558. doi: 10.1002/sim.1186. [DOI] [PubMed] [Google Scholar]
- 18.Asiimwe S, Oloya J, Song X, Whalen CC. Accuracy of un-supervised versus provider-supervised self-administered HIV testing in Uganda: a randomized implementation trial. AIDS Behav. 2014;18:2477–2484. doi: 10.1007/s10461-014-0765-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Chavez P, Wesolowski L, Owen M, Gravens L, Sullivan P, MacGowan R. Perceptions and performance of self-administered rapid HIV tests conducted by untrained users in real world settings. HIV Diagnostics Conference; Atlanta, GA, USA; March 21– 24, 2016. E4.
- 20.Choko AT, Desmond N, Webb EL. The uptake and accuracy of oral kits for HIV self-testing in high HIV prevalence setting: a cross-sectional feasibility study in Blantyre, Malawi. PLoS Med. 2011;8:e1001102. doi: 10.1371/journal.pmed.1001102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.de la Fuente L, Rosales-Statkus ME, Hoyos J. Are participants in a street-based HIV testing program able to perform their own rapid test and interpret the results? PLoS One. 2012;7:e46555. doi: 10.1371/journal.pone.0046555. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Dong M, Regina R, Hlongwane S, Ghebremichael M, Wilson D, Dong K. Can laypersons in high prevalence South Africa perform an HIV self-test accurately? International AIDS Conference. Melbourne, VIC, Australia; July 20–25, 2014. WEPE034.
- 23.Gaydos CA, Hsieh YH, Harvey L. Will patients “opt in” to perform their own rapid HIV test in the emergency department? Ann Emerg Med. 2011;58:S74–S78. doi: 10.1016/j.annemergmed.2011.03.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Gaydos CA, Solis M, Hsieh YH, Jett-Goheen M, Nour S, Rothman RE. Use of tablet-based kiosks in the emergency department to guide patient HIV self-testing with a point-of-care oral fluid test. Int J STD AIDS. 2013;24:716–721. doi: 10.1177/0956462413487321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Gras G, Le Bret P, Dailloux JF. Low feasibiliy rate of self testing with a finger-stick whole blood test. Top Antivir Med. 2014;22:512. [Google Scholar]
- 26.Kurth AE, Cleland CM, Chhun N. Accuracy and acceptability of oral fluid HIV self-testing in a general adult population in Kenya. AIDS Behav. 2016;20:870–879. doi: 10.1007/s10461-015-1213-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Lee VJ, Tan SC, Earnest A, Seong PS, Tan HH, Leo YS. User acceptability and feasibility of self-testing with HIV rapid tests. J Acquir Immune Defic Syndr. 2007;45:449–453. doi: 10.1097/QAI.0b013e318095a3f3. [DOI] [PubMed] [Google Scholar]
- 28.Li YF, Wang YM, Zhang RR. Analysis on accuracy and influencing factors of oral fluid-based rapid HIV self-testing among men who have sex with men. Zhonghua Liu Xing Bing Xue Za Zhi. 2016;37:72–75. doi: 10.3760/cma.j.issn.0254-6450.2016.01.015. (in Chinese). [DOI] [PubMed] [Google Scholar]
- 29.MacGowan R, Chavez P, Freeman A, et al. Pilot evaluation of men who have sex with men's ability to self-administer rapid HIV tests and interpret test results. American Public Health Association Conference; Atlanta, GA, USA; Nov 15–19, 2014. 298247.
- 30.Marley G, Kang D, Wilson EC. Introducing rapid oral-fluid HIV testing among high risk populations in Shandong, China: feasibility and challenges. BMC Public Health. 2014;14:422. doi: 10.1186/1471-2458-14-422. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Mavedzenge SN, Sibanda E, Mavengere Y. Supervised HIV self-testing to inform implementation and scale up of self-testing in Zimbabwe. J Int AIDS Soc. 2015;18:96. [Google Scholar]
- 32.Ng OT, Chow AL, Lee VJ. Accuracy and user-acceptability of HIV self-testing using an oral fluid-based HIV rapid test. PLoS One. 2012;7:e45168. doi: 10.1371/journal.pone.0045168. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Nour S, Hsieh YH, Rothman RE. Patients can accurately perform their own rapid HIV point-of-care test in the emergency department. Point Care. 2012;11:176–179. doi: 10.1097/POC.0b013e3182666eb7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Pant Pai N, Behlim T, Abrahams L. Will an unsupervised self-testing strategy for HIV work in health care workers of South Africa? A cross sectional pilot feasibility study. PLoS One. 2013;8:e79772. doi: 10.1371/journal.pone.0079772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Pant Pai N, Bhargava M, Joseph L. Will an unsupervised self-testing strategy be feasible to operationalize in Canada? Results from a pilot study in students of a large canadian university. AIDS Res Treat. 2014;2014:747619. doi: 10.1155/2014/747619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Martínez Pérez G, Steele SJ, Govender I. Supervised oral HIV self-testing is accurate in rural KwaZulu Natal, South Africa. Trop Med Inter Health. 2016;21:759–767. doi: 10.1111/tmi.12703. [DOI] [PubMed] [Google Scholar]
- 37.Prazuck T, Karon S, Gubavu C. A finger-stick whole-blood HIV self-test as an HIV screening tool adapted to the general public. PLoS One. 2016;11:e0146755. doi: 10.1371/journal.pone.0146755. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Sarkar A, Mburu G, Shivkumar PV. Feasibility of supervised self-testing using an oral fluid-based HIV rapid testing method: a cross-sectional, mixed method study among pregnant women in rural India. J Int AIDS Soc. 2016;19:20993. doi: 10.7448/IAS.19.1.20993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Spielberg F, Camp S, Ramachandra E. HIV home self-testing: can it work? National HIV Prevention Conference; Atlanta, GA, USA; July 27–30, 2003. 03-A-1007-NHPC.
- 40.Majam M. HIV self-testing in South Africa: the current landscape. Treatment optimization continuing medical education meeting; Durban, South Africa; Apr 8, 2017.
- 41.Stekler JD, O'Neal JD, Lane A. Relative accuracy of serum, whole blood and oral fluid HIV tests among Seattle men who have sex with men. J Clin Virol. 2013;58(suppl 1):e19–e22. doi: 10.1016/j.jcv.2013.09.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.WHO . World Health Organization; Geneva: 2015. HIV assays: laboratory performance and other operational characteristics: rapid diagnostic tests (combined detection of HIV-1/2 antibodies and discriminatory detection of HIV-1 and HIV-2 antibodies): report 18. [Google Scholar]
- 43.Ibitoye M, Frasca T, Giguere R, Carballo-Diéguez A. Home testing past, present and future: lessons learned and implications for HIV home tests. AIDS Behav. 2014;18:933–949. doi: 10.1007/s10461-013-0668-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kumwenda M, Munthali A, Phiri M. Factors shaping initial decision-making to self-test amongst cohabiting couples in urban Blantyre, Malawi. AIDS Behav. 2014;18(suppl 4):S396–S404. doi: 10.1007/s10461-014-0817-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Sibanda E, Neuman M, Tumushime M, et al. Linkage to care after HIV self-testing in Zimbabwe: a cluster randomised trial. Conference on Retroviruses and Opportunistic Infections; Boston, MA, USA; March 4–7, 2018. 150LB.
- 46.Saunders J, Brima N, Orzol M. Prospective observational study to evaluate the performance of the BioSure HIV Self-Test in the hands of lay users. Sex Transm Infect. 2017 doi: 10.1136/sextrans-2017-053231. DOI:10.1136/sextrans-2017-053231 published online Sept 18. [DOI] [PubMed] [Google Scholar]
- 47.Tonen-Wolyec S, Batina-Agasa S, Muwonga J, Fwamba N'kulu F, Mboumba Bouassa RS, Belec L. Evaluation of the practicability and virological performance of finger-stick whole-blood HIV self-testing in French-speaking sub-Saharan Africa. PLoS One. 2018;13:e0189475. doi: 10.1371/journal.pone.0189475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Ortblad K, Kibuuka D, Ngabirano T, et al. The impact of HIV self-testing on recent testing, status knowledge, and linkage to care among female sex workers in Kampala, Uganda: a randomized controlled trial. 9th International AIDS Society Conference on HIV Science. Paris, France; July 23–26, 2017. MOLBPEC27.
- 49.Kapaku K, Neuman M, Maluzi K, et al. Is OraQuick HIV-self-testing valid among intended users? Analysis from a clinical performance study in Lusaka, Zambia. 9th International AIDS Society Conference on HIV Science. Paris, France; July 23–26, 2017. TUPEC0842.
- 50.Green K, Vu B, Phan H, et al. How acceptable and feasible is HIV self-testing among key populations in Vietnam? Preliminary results from an intervention evaluation study. 9th International AIDS Society Conference on HIV Science. Paris, France; July 23–26, 2017. TUPEC0844.
- 51.Ortblad K, Kibuuka D, Ngabirano T, et al. An assessment of oral HIV self-testing process accuracy and results interpretation among female sex worker peer educators in Kampala, Uganda. 9th International AIDS Society Conference on HIV Science. Paris, France; July 23–26, 2017. TUPEC0846.
- 52.Mwau M, Achieng L, Bwana P. Performance and usability of INSTI, a blood-based rapid HIV self test for qualitative detection of HIV antibodies in intended use populations in Kenya. 9th International AIDS Society Conference on HIV Science. Paris, France; July 23–26, 2017. MOAX0106LB.
- 53.Nanfuka M, Choko A, Birungi J, et al. Accelerating progress towards the first 90 among men: a trial of the peer-based distribution of HIV self-test kits in Bulisa, Uganda. 9th International AIDS Society Conference on HIV Science. Paris, France; July 23–26, 2017. WEPDD0106LB.
- 54.Indravudh P, Kumwenda M, Neuman M, et al. Providing user support for HIV self-testing beyond instructions-for-use in Malawi. Conference on Retroviruses and Opportunistic Infections. Seattle, WA, USA; Feb 13–16, 2017. 896.
- 55.Hong S, Amaambo T, Mukoroli M, et al. Validation of OraQuick HIV self-testing kit among key populations for use in Namibia. 19th International Conference on AIDS and STIs in Africa. Abidjan, Côte d'Ivoire; Dec 4–7, 2017. TUAE0101.
- 56.Gresenguet G, Longo JD, Tonen-Wolyec S, Mboumba Bouassa RS, Belec L. Acceptability and usability evaluation of finger-stick whole blood HIV self-test as an HIV Screening tool adapted to the general public in the Central African Republic. Open AIDS J. 2017;11:101–118. doi: 10.2174/1874613601711010101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Tang JL, Liu JL. Misleading funnel plot for detection of bias in meta-analysis. J Clin Epidemiol. 2000;53:477–484. doi: 10.1016/s0895-4356(99)00204-8. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.