Skip to main content
Wellcome Open Research logoLink to Wellcome Open Research
. 2023 May 4;8:14. Originally published 2023 Jan 10. [Version 2] doi: 10.12688/wellcomeopenres.18737.2

Visual diagnosis of female genital schistosomiasis in Zambian women from hand-held colposcopy: agreement of expert image review and association with clinical symptoms

Amy Sturt 4,5,a, Henrietta Bristowe 6, Emily Webb 7, Isaiah Hansingo 8, Comfort Phiri 9, Maina Mudenda 8, Joyce Mapani 8, Tobias Mweene 9, Bruno Levecke 10, Piet Cools 10,11, Govert van Dam 12, Paul Corstjens 13, Helen Ayles 9,14, Richard Hayes 7, Suzanna Francis 7, Lisette van Lieshout 12, Bellington Vwalika 1, Eyrun Kjetland 2,3, Amaya Bustinduy 14
PMCID: PMC9971661  PMID: 36864924

Version Changes

Revised. Amendments from Version 1

The revised version of this manuscript states the study objectives more clearly in the introduction. In response to reviewers, the authors also provide additional information regarding how the subset of participants tested for sexually transmitted infection were chosen and the standard protocol the midwives used to capture cervicovaginal images using hand-held colposcopy.

Abstract

Background: Female genital schistosomiasis (FGS) can occur in  S. haematobium infection and is caused by egg deposition in the genital tract. Confirming a diagnosis of FGS is challenging due to the lack of a diagnostic reference standard. A 2010 expert-led consensus meeting proposed visual inspection of the cervicovaginal mucosa as an adequate reference standard for FGS diagnosis. The agreement of expert human reviewers for visual-FGS has not been previously described.

Methods: In two Zambian communities, non-menstruating, non-pregnant, sexually-active women aged 18-31 years participating in the HPTN 071 (PopART) Population-Cohort were enrolled in a cross-sectional study. Self-collected genital swabs and a urine specimen were collected at a home visit; trained midwives performed cervicovaginal lavage (CVL) and hand-held colposcopy at a clinic visit.  S. haematobium eggs and circulating anodic antigen (CAA) were detected from urine. Two senior physicians served as expert reviewers and independently diagnosed visual-FGS as the presence of sandy patches, rubbery papules or abnormal blood vessels in cervicovaginal images obtained by hand-held colposcopy. PCR-FGS was defined as  Schistosoma DNA detected by real-time PCR in any genital specimen (CVL or genital swab). 

Results: Of 527 women with cervicovaginal colposcopic images, 468/527 (88.8%) were deemed interpretable by Reviewer 1 and 417/527 (79.1%) by Reviewer 2. Visual-FGS was detected in 35.3% (165/468) of participants by expert review of colposcopic images by Reviewer 1 and in 63.6% (265/417) by Reviewer 2. Cohen’s kappa statistic for agreement between the two reviewers was 0.16, corresponding to "slight" agreement. The reviewers made concordant diagnoses in 38.7% (204/527) participants (100 negative, 104 positive) and discordant diagnoses in 31.8% (168/527) participants.

Conclusions: The unexpectedly low level of correlation between expert reviewers highlights the imperfect nature of visual diagnosis for FGS based on cervicovaginal images. This finding is a call to action for improved point-of-care diagnostics for female genital schistosomiasis.

Keywords: Female genital schistosomiasis, Schistosoma haematobium, hand-held colposcopy

Introduction

Female genital schistosomiasis (FGS), primarily caused by S. haematobium infection, is a neglected tropical disease associated with poverty, inadequate sanitation, and limited access to safe drinking water 1, 2 . FGS occurs when schistosome eggs destined for excretion via the urinary bladder are deposited in the female genital tract. These tissue-deposited eggs can be associated with characteristic genital mucosal lesions and can present with genitopelvic findings including contact bleeding 3 , abnormal vaginal discharge 4 , and in some cases, infertility 5 . Visual-FGS refers to the identification of these characteristic mucosal changes, such as sandy patches (grainy and homogeneous), rubbery papules, and abnormal blood vessels by visual inspection of the cervicovaginal mucosa 3 . The visual detection of FGS-associated lesions requires the insertion of a vaginal speculum, a good light source, and a lens providing adequate magnification 6 . A standard colposcope has traditionally been used in research settings for visual-FGS diagnosis, but the bilharzia and HIV (BILHIV) study demonstrated recently that hand-held colposcopy could also be used to decentralize colposcopy services 69 .

Confirming a diagnosis of FGS is challenging as there is not a widely accepted diagnostic reference standard for research, diagnosis, and screening 2 . A 2010 expert-led consensus meeting proposed visual inspection of the cervicovaginal mucosa as an adequate reference standard for FGS diagnosis 10 . However, the mucosal changes in visual-FGS are non-specific and have also been associated with herpes simplex virus, human papillomavirus (HPV) infection, and cervical precancer 3 . Diagnostic methods that are not adequately specific for FGS diagnosis may lead to over-treatment with praziquantel and may overlook the diagnosis and treatment of sexually transmitted infection (STI) and cervical cancer. Although there is little evidence of praziquantel resistance in humans 11 , indiscriminate treatment may theoretically increase the risk of the development of praziquantel resistance 12 . Since cervicovaginal visualization is widely promoted 13 for FGS screening and diagnosis, we aimed to use BILHIV study data to further evaluate the agreement of human expert reviewers for the diagnosis of visual-FGS. Secondary aims were to evaluate the association between visual-FGS and abdominal, genitourinary, and reproductive manifestations as well as evaluating Schistosoma diagnostic methods for their association with the presence of visual-FGS.

Methods

Study setting and participants

The cross-sectional bilharzia and HIV (BILHIV) study 9 was nested in the HPTN 071 (PopART) cluster randomized trial in Zambia 14 . The S. haematobium is endemic in Zambia, and while more data are needed, prevalences ranging between 14 – 76% were reported in a recent systematic review 15 . The HPTN 071 (PopART) trial was a cluster randomized trial assessing the impact of an HIV-1 combination prevention package including “universal test and treat” 14 . As previously described, after the 36-month HPTN 071 (PopART) visit, community workers made home visits to women expressing interest in the BILHIV study 9 . Between January and August 2018, eligible women who were 18–31 years old, not pregnant, sexually active, and resident in one of two urban communities that participated in HPTN 071 (PopART) in Livingstone, Zambia were enrolled in the BILHIV study. The primary aim of the BILHIV study was to compare the performance of genital self-sampling (cervical and vaginal swabs) to clinic-based cervicovaginal lavage (CVL) for the detection of Schistosoma DNA by quantitative PCR (qPCR) as previously described 9 . A specific pre-specified BILHIV study objective (the subject of the current manuscript) was to compare agreement of expert review of images obtained through hand-held colposcopy for the diagnosis of visual-FGS.

Home and clinic-based sample collection

As previously described, the home visit included written informed consent, a questionnaire, genital self-sampling (cervical and vaginal), and collection of a urine specimen 9 . There were no restrictions on the timing of urine self-sample collection, and 69.5% (419/603) of the total BILHIV study samples were performed between 9:00 and 14:00 9 . Enrolled women who were not currently menstruating were then invited to attend Livingstone Central Hospital cervical cancer clinic, where midwives collected CVL. After speculum insertion, normal saline (10 mL) was flushed across the cervix and vaginal walls for one minute with a bulb syringe and CVL fluid was collected from the posterior fornices.

Hand-held colposcopy and image review

At the clinic, cervicovaginal images were captured with a portable colposcope (EVA System, MobileODT, Tel Aviv, Israel) according to a standardized protocol. Per the protocol, trained midwives evaluated the cervix, anterior fornix, posterior fornix, left and right lateral cervix and vaginal walls and captured images of each location using the zoom and lighting functions in the Mobile ODT colposcope. Two senior physicians who have training and expertise in colposcopy and FGS served as expert reviewers. Digital images were independently evaluated by the expert reviewers for any of the four recognized FGS cervicovaginal manifestations: grainy sandy patches, homogenous yellow sandy patches, rubbery papules, and abnormal blood vessels 16 . At their discretion, expert reviewers could exclude images that they felt could not be evaluated due to technical issues, image quality, or limited cervical visualization. If any of the four recognized FGS cervicovaginal manifestations was present, the participant was categorized as “visual-FGS”. If none of the four cervicovaginal manifestations were present the participant was categorized as “visual-FGS not detected” 16 . The expert reviewers were both senior practicing physicians at the Professor level, who have training and expertise in standard colposcopy. Reviewer 1 (EFK) is full-time FGS researcher and an infectious diseases physician and Reviewer 2 (BV) is an obstetrician and gynecologist who regularly analyses images for cervical cancer. Both reviewers have extensive practical and research-based expertise in evaluating and diagnosing FGS in endemic settings. Additionally, both reviewers contributed as authors of the 2015 WHO FGS Pocket Atlas 16 . Each reviewer was informed of the study setting and methods, but both were blinded to the study participants’ FGS and Schistosoma status.

Women with at least one of the visual manifestations of FGS 3, 16 or with any positive urine or genital Schistosoma diagnostic result were treated free-of-charge with 40 mg/kg praziquantel. Testing for STIs was not performed at the point-of-care and participants with suspected STIs were offered syndromic management, as per local guidelines 17 . In line with national and local clinic protocols adapted to real-world resource limitations, human papillomavirus (HPV) testing was not performed.

In parallel with BILHIV study procedures, participants could choose to engage in free cervical cancer screening using the visual inspection with acetic acid (VIA) technique. In the subset of women who engaged in cervical cancer screening, midwives applied 3-5% acetic acid to the cervix after CVL collection, as previously described 18 . An opaque white reaction was classified as positive and no change as negative 19 . Images for FGS analyses were taken before application of acetic acid. Images for cervical cancer screeing were taken after application of acetic acid.

Urine microscopy, and circulating anodic antigen

Up to 60mL of fresh urine was centrifuged and examined by microscopy for S. haematobium eggs. The participant was considered to have urinary schistosomiasis if a pellet contained at least one S. haematobium egg 9 . All study specimens were stored at -80°C. A lateral flow assay utilizing up-converting reporter particles for the quantification of circulating anodic antigen (CAA) was performed on urine samples, as previously described 9, 20 . Analyzing the equivalent of 417 μL urine (wet reagent, UCAA hT 417), a test result indicating a CAA value >0.6 pg/mL was considered positive 21 .

qPCR for detection of Schistosoma DNA

DNA extraction, amplification and detection of the Schistosoma-specific internal-transcribed-spacer-2 (ITS-2) target by real-time (qPCR) was performed at Leiden University Medical Center, as previously described, using 200 µL of CVL, cervical or vaginal swab fluid 9, 22 .

Other infections

Due to budgetary constraints, a subset of participants was evaluated for STI. As previously described, all participants with FGS and all participants with probable FGS were selected for characterization of the cervicovaginal microbiota and STI by qPCR on cervical swabs 23 . Three FGS-negative participants were selected for every FGS and probable FGS participant using a random number generator. The FGS-negative participants were frequency-matched by age to participants with FGS 23 . Laboratory-based fourth-generation HIV-1 testing (Abbott Architect HIV Ag/Ab Combo Assay) was performed for HPTN 071 (PopART) Population Cohort participants at each study visit 14 . STIs were quantified among a subset of participants by qPCR using the S-DiaCTNG TM (for C. trachomatis and N. gonorrhea) and S-DiaMGTV TM kits (for M. genitalium and T. vaginalis) (Diagenode Diagnostics, Seraing, Belgium) on DNA from cervical swabs at Ghent University (Ghent, Belgium) according to the manufacturer’s instructions.

Consent

The study was approved by the University of Zambia Biomedical Research Ethics Committee (011-08-17), the Zambia National Health Research Authority and the London School of Hygiene and Tropical Medicine Ethics Committee (14506). Permission to conduct the study was given by Livingstone District Health Office and the Livingstone Central Hospital superintendent.

Statistical methods

The planned sample size of the BILHIV study was based on calculations related to the primary BILHIV study objective, as previously described 9 . Participant characteristics were summarized by median and interquartile range (IQR) for continuous variables, and by frequency and percentage for categorical variables. Participants missing data for a specific variable were excluded from analysis involving that variable. The primary analysis evaluated the agreement between the two expert reviewers using Cohen’s kappa statistic. A secondary analysis evaluated the association between visual-FGS (exposure) and abdominal, genitourinary, and reproductive manifestations (outcomes). Crude associations were evaluated using chi-squared tests, and logistic regression was used to calculate crude and adjusted odds ratios (OR) for the association of visual-FGS with clinical manifestations; this was done separately for each expert reviewer’s diagnosis of visual-FGS. In this study we employed various diagnostic tests to evaluate urinary Schistosoma infection (CAA and urine microscopy), and FGS (portable colposcopy, and Schistosoma DNA on CVL and genital swabs) as previously described 2325 . Another secondary analysis evaluated each diagnostic method for its association with the presence of visual-FGS, separately for each expert reviewer. Due to small numbers, for evaluating the association of visual-FGS with PCR-FGS, we used a composite definition of PCR-FGS or “any positive genital PCR”, defined as any positive cervical or vaginal swab or CVL specimen. Chi-squared tests were used to assess crude associations, and logistic regression was used to calculate crude and age-adjusted odds ratio (OR) of the various Schistosoma and FGS diagnostics with the presence or absence of visual-FGS. We were unable to adjust for other potential confounders due to small numbers, particularly for STI and cervical pre-cancer status which were collected on a sub-set of participants. For both secondary analyses, exact logistic regression was used for analyses where 5 or fewer participants in a particular exposure category had the outcome. Due to the exploratory nature of these analyses, we did not adjust for multiple comparisons. Data were analyzed using STATA 15.1 (Stata Corporation, College Station, TX).

Results

Baseline characteristics and demographics

The BILHIV study enrolled 603 eligible women, 527 (87.4%) of whom had cervicovaginal images captured by portable colposcopy. Of the 527 women with images, 468 (88.8%) were deemed interpretable by Reviewer 1 and 418 (79.3%) by Reviewer 2 ( Figure 1). Each reviewer designated a proportion of images uninterpretable, leading to differences in denominators. The median age of the participants was 24 years (range 22 – 28) and 323 (61.3%) had attended some secondary school ( Table 1). The majority of participants were married, had previously been pregnant, and had been sexually active within the last six months. There was no association between visual FGS, as identified by any expert review, and current or childhood water contact.

Figure 1. Flowsheet of cervicovaginal image review after hand-held colposcopy in BILHIV study participants.

Figure 1.

Table 1. Demographics and reproductive health characteristics of the BILHIV study participants who underwent portable colposcopy (n=527).

Participant Characteristics Study population (n=527)
Age in years – Median (IQR) 24 (22 – 28)
Marital status Single 213 (40.4)
Married or cohabitating 292 (55.4)
Divorced or separated 22 (4.2)
Education (highest level) None or any primary school 155 (29.4)
Any secondary school 323 (61.3)
Trade, degree or higher 49 (9.3)
District Community A 290 (55.0)
Community B 237 (45.0)
Household members 1-3 162 (30.7)
4-5 210 (39.9)
6+ 155 (29.4)
Employment status Unemployed 363 (68.9)
Employed 164 (31.1)
Current water contact None 447 (84.8)
Any 80 (15.2)
Childhood water contact None 151 (28.7)
Any 376 (71.3)
Reproductive Health Characteristics
Age at sexual debut (years) * 8-16 221 (42.0)
17-19 228 (43.4)
20-24 77 (14.6)
Lifetime sexual partners 1 145 (27.5)
2 134 (25.4)
3 108 (20.5)
4+ 140 (26.6)
Prior pregnancy * No 74 (14.1)
Yes 452 (85.9)
Currently sexually active ** No 65 (12.4)
Yes 460 (87.6)
Condom use with last sex No 381 (73.7)
Yes 136 (26.3)
HIV-1 infection †† Not detected 407 (77.8)
Detected 116 (22.2)
Any STI Not detected 138 (63.8)
Detected 73 (34.6)
Any Hormonal Contraception ∼∼ No 201 (38.1)
Yes 326 (61.9)
VIA Negative 213 (89.9)
Positive 24 (10.1)

*Participants who responded with “no answer” (n=1) are not shown in the table

**Defined as any sexual activity in the last 6 months; Participants who responded with “no answer” (n=2) are not shown in the table

†Participants who responded with “no answer” (n=10) are not shown in the table

††Participants with missing data (n=4) are not shown in the table

∼STI were evaluated in a sub-set of women from the BILHIV study, missing values (n=316) are not shown in the table)

∼∼Any hormonal contraception is defined as use of injectable agents, implants, or oral contraceptive pills

‡VIA results were not collected in the BILHIV study and were not available for all participants, participants with missing data (n=366) are not shown in the table

Prevalence of visual-FGS and interrater agreement

Visual-FGS was detected in 35.3% (165/468) of participants by expert review of digital images from hand-held colposcopy by Reviewer 1 and in 63.6% (265/417) by Reviewer 2. The Cohen’s kappa statistic for interrater agreement between the two expert reviewers was 0.16, corresponding to "slight" agreement ( Table 2). The reviewers made concordant diagnoses in 38.7% (204/527) participants (100 concordant negative, 104 concordant positive) and discordant diagnoses in 31.8% (168/527) cases (reviewer 1 positive, reviewer 2 negative in 32; reviewer 2 positive and reviewer 1 negative in 136). Both reviewers agreed 14 images were unevaluable. A further 26.7% (141/527) images were discordant in evaluability by the expert reviewers (Reviewer 1, n=45; Reviewer 2, n=96).

Table 2. Agreement of expert reviewers for the presence or absence of visual-FGS.

Reviewer 1 FGS not
detected
Visual
FGS
Not
evaluable
Total
Reviewer 2 Cohen’s Kappa
(+ SE) **
Interpretation
Visual FGS not
detected
100 (19.0) 32 (6.1) 20 (3.8) 152 (28.8) *
Visual FGS 136 (25.8) 104 (19.7) 25 (4.7) 265 (50.3) * 0.16 (0.04) None to slight
Not evaluable 67 (12.7) 29 (5.5) 14 (2.7) 110 (20.9)
Total 303 (57.5) * 165 (31.3) * 59 (11.2) 527 (100.0)

FGS – female genital schistosomiasis, SE – standard error

*total evaluable for Reviewer 1: visual FGS not detected 303/468 (64.7), visual FGS detected 165/468 (35.3); total evaluable for Reviewer 2: visual FGS not detected 152/417 (36.5), visual FGS detected 265/417 (63.5);

**Cohen’s kappa is restricted to those participants where both Reviewer 1 and Reviewer 2 provided a diagnosis

Visual FGS and Schistosoma laboratory tests

Of the 527 participants, 6.1% (32/527) had urinary S. haematobium infection, as diagnosed by urine microscopy, and 14.9% (78/525) had a detectable urine CAA. There was no association between S. haematobium egg-positive urine microscopy or urine CAA and visual-FGS, as defined by Reviewer 1 or Reviewer 2’s assessmen ( Table 3).

Table 3. Associations of Schistosoma diagnostics with visual-FGS.

Participant
Characteristics
Visual FGS
not detected
Visual FGS
detected
Crude OR P-
value *
Adjusted OR ** P-
value
Schistosoma diagnostics
(Reviewer 1)
n=303 n=165
Eggs on urine microscopy Not detected 287 (94.7) 151 (91.5) reference 0.3 reference 0.2
Detected 16 (5.3) 14 (8.5) 1.66 (0.73 – 3.74) 1.68 (0.74 – 3.81)
Circulating anodic antigen ~ Not detected 261 (86.1) 131 (80.4) reference 0.1 reference 0.1
Detected 42 (13.9) 32 (19.6) 1.52 (0.92 – 2.52) 1.49 (0.91 – 2.51)
Any positive genital PCR DNA not
detected
288 (95.1) 155 (93.9) reference 0.8 reference 0.7
DNA detected 15 (4.9) 10 (6.0) 1.24 (0.54 – 2.82) 1.27 (0.55 – 2.94)
Schistosoma Diagnostics
(Reviewer 2)
n=152 n=265
Eggs on urine microscopy †† Not detected 147 (96.7) 243 (91.7) reference 0.06 reference 0.07
Detected 5 (3.3) 22 (8.3) 2.65 (0.95 – 9.17) 2.65 (0.95 – 9.19)
Circulating anodic antigen ~ Not detected 131 (86.7) 215 (81.4) reference 0.2 reference 0.2
Detected 20 (13.3) 49 (18.6) 1.49 (0.85 – 2.62) 1.49 (0.85 – 2.63)
Any positive genital PCR DNA not
detected
146 (96.1) 248 (93.6) reference 0.3 reference 0.3
DNA detected 6 (3.9) 17 (6.4) 1.69 (0.64 – 4.33) 1.67 (0.64 – 4.36)

*Chi squared p-value unless otherwise noted

**Adjusted for age

†Likelihood ratio test p-value

Missing values not included in the table: ~ (n=2)

††Odds Ratios and p-values obtained through exact logistic regression in both crude and adjusted analyses

PCR-FGS, defined as any positive Schistosoma qPCR from a genital sample, was diagnosed in 5.0% (30/603) of participants [3.4% (18/527) cervical swab, 2.7% (14/527) vaginal swab, and 2.7% (14/527) CVL]. Further details regarding the operating characteristics of these tests have previously been described 9 . There was no association between visual-FGS and PCR-FGS, 10 of the 165 women (6.0%) identified by Reviewer 1 as having visual-FGS had PCR-FGS and 17 of the 265 women (6.4%) identified by Reviewer 2 as having visual-FGS had PCR-FGS, compared to 4.9% and 3.9% among women identified by Reviewer 1 and 2 as not having visual-FGS respectively ( Table 3).

Symptoms

The association between abdominal, genitourinary, and reproductive manifestations and visual-FGS is shown in Table 4. Neither vaginal discharge, vaginal bleeding after sex, the presence of external genital sores, dysuria nor abdominal pain was associated with the presence of visual FGS as diagnosed by either expert reviewer ( Table 4). Self-reported delay in conception was associated with the presence of visual-FGS, as assessed by Reviewer 1, both in crude analysis and after adjusting for age and community of residence (aOR 2.74 [1.29 – 5.83], p<0.01). Visual-FGS, as defined by Reviewer 2’s assessment of hand-held colposcopy images, was associated with hematuria (aOR 4.44 [1.00 – 40.63] p=0.05) and dyspareunia (aOR 1.71 [0.99 – 2.95], p=0.05), albeit with weak evidence of an association ( Table 4).

Table 4. Associations of genitourinary and abdominal symptoms with visual-FGS.

Participant
Characteristics
Visual
FGS not
detected
Visual
FGS
detected
P-value * Crude OR P-value Adjusted OR ** P-
value
Signs & Symptoms
(Reviewer 1)
n= 303 n=165
Vaginal discharge Not present 273 (90.1) 139 (84.2) 0.06 reference 0.06 reference 0.1
Present 30 (9.9) 26 (15.8) 1.70 (0.97 – 2.99) 1.58 (0.90 – 2.80)
Dyspareunia Not present 239 (78.9) 139 (84.2) 0.2 reference 0.2 reference 0.1
Present 64 (21.1) 26 (15.8) 0.70 (0.42 – 1.15) 0.68 (0.41 – 1.12)
Vaginal bleeding after
sex
Not present 289 (95.4) 153 (92.7) 0.2 reference 0.2 reference 0.3
Present 14 (4.6) 12 (7.3) 1.62 (0.73 – 3.59) 1.48 (0.66 – 3.31)
Vaginal sores Not present 280 (92.4) 154 (93.3) 0.7 reference 0.7 reference 0.6
Present 23 (7.6) 11 (6.7) 0.87 (0.41 – 1.83) 0.84 (0.40 – 1.77)
Dysuria Not present 259 (85.5) 140 (84.9) 0.9 reference 0.9 reference 0.8
Present 44 (14.5) 25 (15.1) 1.05 (0.62 – 1.79) 0.92 (0.53 – 1.59)
Hematuria Not present 295 (97.4) 155 (93.9) 0.07 reference 0.07 reference 0.1
Present 8 (2.6) 10 (6.1) 2.38 (0.92 – 6.20) 2.18 (0.84 – 5.67)
Abdominal pain Not present 217 (71.6) 123 (74.5) 0.5 reference 0.5 reference 0.4
Present 86 (28.4) 42 (25.5) 0.86 (0.56 – 1.33) 0.82 (0.53 – 1.26)
Delay in conception †† No 235 (94.4) 111 (86.7) 0.01 reference 0.01 reference <0.01
Yes 14 (5.6) 17 (13.3) 2.57 (1.22 – 5.40) 2.74 (1.29 – 5.83)
Signs & Symptoms
(Reviewer 2)
n=152 n=265
Vaginal discharge Not present 133 (87.5) 236 (89.1) 0.6 reference 0.6 reference 0.6
Present 19 (12.5) 29 (10.9) 0.86 (0.46 – 1.60) 0.84 (0.45 – 1.57)
Dyspareunia Not present 131 (86.2) 208 (78.5) 0.05 reference 0.05 reference 0.05
Present 21 (13.8) 57 (21.5) 1.71 (0.99 – 2.95) 1.71 (0.99 – 2.95)
Vaginal bleeding after
sex
Not present 144 (94.7) 251 (94.7) 1.0 reference 1.0 reference 1.0
Present 8 (5.3) 14 (5.3) 1.00 (0.41 – 2.45) 0.99 (0.40 – 2.42)
Vaginal sores Not present 146 (96.1) 244 (92.1) 0.1 reference 0.1 reference 0.1
Present 6 (3.9) 21 (7.9) 2.09 (0.83 – 5.31) 2.10 (0.83 – 5.31)
Dysuria Not present 132 (86.8) 231 (87.2) 0.9 reference 0.9 reference 0.9
Present 20 (13.2) 34 (12.8) 0.97 (0.54 – 1.76) 0.95 (0.52 – 1.75)
Hematuria ††† Not present 150 (98.7) 250 (94.3) 0.03 reference 0.05 reference 0.05
Present 2 (1.3) 15 (5.7) 4.49 (1.02 – 40.99) 4.44 (1.00 – 40.63)
Abdominal pain Not present 108 (71.0) 192 (72.5) 0.8 reference 0.8 reference 0.7
Present 44 (28.9) 73 (27.5) 0.93 (0.60 – 1.45) 0.92 (0.59 – 1.44)
Delay in conception No 112 (91.8) 193 (91.9) 1.0 reference 1.0 reference 1.0
Yes 10 (8.2) 17 (8.1) 0.99 (0.44 – 2.22) 1.01 (0.45 – 2.92)

*Chi squared p-value

**Adjusted for age and district of residence

†Likelihood ratio test p-value

††(Reviewer 1) Declined to answer (n=33) and not applicable (n=95) are not included in the table (54 visual FGS not detected; 37 FGS; 37 missing)

†††Odds Ratios and p-values obtained through exact logistic regression in both crude and adjusted analyses

‡(Reviewer 2) Declined to answer (n=33) and missing (n=95) are not included in the table; (30 visual FGS not detected; 55 FGS detected; 43 missing)

Discussion

Diagnostics for neglected tropical diseases should be accurate, accessible, and affordable, with specimen collection that is easy 26 . Making a diagnosis of FGS is challenging as there is currently not a widely accessible, sensitive and non-invasive reference standard for either diagnosis or screening which confirms Schistosoma genital involvement at the point-of-care. In a 2010 expert-led consensus meeting, visual imaging of the vagina and cervix with photocolposcopic methods was proposed as an adequate reference standard for FGS visual diagnosis 27 . Imaging is currently the only widely available point-of-care diagnostic tool for FGS diagnosis outside of the research setting and the BILHIV study sought to use hand-held colposcopy to enable community-based FGS diagnosis 9 . Visual imaging can be useful in the assessment of Schistosoma-related morbidity, praziquantel treatment response, and defining the natural history of visual-FGS. Additionally, hand-held and traditional colposcopy have the logistical advantage that they can be integrated with existing cervical cancer screening programmes 28 . However, visual imaging has important limitations. Firstly, interpretation of visual imaging is subjective. Secondly, visual imaging lacks specificity as the characteristic sandy patches can also be associated with STI and the abnormal blood vessels can also be associated with cervical precancer 3 . This study shows “slight” agreement between senior, highly experienced expert reviewers, highlighting the imperfect nature of human expert review of images for FGS.

Visual FGS-diagnosis is a widely accepted diagnostic tool for evaluating Schistosoma-associated genital morbidity. However, visual-FGS screening is often centralized in settings with access to traditional colposcopy and is invasive, requiring vaginal speculum insertion and trained medical professionals (physicians, nurses, or midwives) to visualize the cervix and vagina at high resolution 9 . Additionally, visual-FGS diagnosis requires a full inspection of the mucosal surfaces of the vagina and cervix. If metal specula are used, post-examination autoclaving and appropriate disinfection further constrains the settings in which this diagnostic strategy can be seamlessly implemented. Disposable specula have risks and benefits. While hygienic and convenient, disposable plastic specula may not be sturdy enough when rotated to inspect the anterior and posterior vaginal walls and may contribute to missed visual-FGS diagnoses 6 . A good light source is needed for optimal cervicovaginal visualization 6 , as well as a device which can provide adequate magnification, ideally a colposcope, hand-held colposcope, or digital camera 8 . Thus, colposcopy, whether hand-held or traditional, for visual-FGS diagnosis is not readily scalable for use as a population-based screening technique.

In this current work, without complete STI and HPV testing or cervicovaginal biopsy on each participant, it is challenging to assess the significance of the sandy patches and abnormal blood vessels identified by the clinical expert reviewers. Notably, researchers in Tanzania performed macroscopic cervicovaginal examinations comparing S. haematobium endemic and non-endemic areas, finding 75% of participants in endemic areas had cervical lesions (including sandy patches, edema, erosions and petechiae) compared with 36% of women in non-endemic areas (although their travel and medical history were not described) 29 . The Tanzanian study illustrates the limited specificity of visual techniques, since one-third of the women had cervical lesions in communities where S. haematobium is not endemic.

Other diagnostic approaches such as PCR-based methods, have been implemented in research settings but are not yet field-deployable 9 . Antigen, antibody, and pathogen-based diagnostics (such as microscopy) are useful diagnostic adjuncts for Schistosoma infection, but do not confirm the involvement of genital tissue. Future diagnostic algorithms may be optimized by first performing a microbiologic S. haematobium diagnosis prior to performing screening for genital involvement 9 . Promising pathogen detection strategies that can be implemented at the point-of-care include isothermal DNA amplification methods 30, 31 . These field-deployable molecular assays should be further developed for use at the point-of-care to identify Schistosoma DNA in self-collected genital swabs 31 .

Our study did not show a consistent association between expert diagnosis of visual-FGS and abdominal, genitourinary and reproductive symptoms. Reviewer 1’s evaluation suggested an association between self-reported delay in conception and visual-FGS and Reviewer 2’s evaluation suggested a weak association with hematuria and dyspareunia in participants with visual-FGS. A retrospective study from Tanzania evaluating histopathology reported tubal schistosomiasis in 4 patients reporting with infertility 32 and a cross-sectional study from Zimbabwe found strong evidence that the presence of S. haematobium in pap smear was associated with infertility in women aged 20 – 49 years, after adjusting for age and HIV status 5 . While alluring to consider the association of delayed conception identified by one reviewer with visual-FGS in isolation, the association would have been strengthened by consistency of the findings across reviewers. Additionally, in interpreting this result, it is important to consider the possibility of a type 1 error when large numbers of statistical tests are performed.

Previous work on visual-FGS has compared visual imaging to other diagnostic standards 33 or have used computerized algorithms 34, 35 , or a combination of human reviewers and a digital gridding technique to evaluate visual-FGS 7 . A recent Madagascan study utilized human reviewers together with a digital image gridding technique to review images of women with known FGS-associated clinical lesions and found a Fleiss kappa of 0.55 (“moderate” agreement) for detecting rubbery papules. Reviewers in that study achieved a higher agreement than that described in our study, potentially by undergoing an initial consensus rating exercise to reach agreement on uniform rating of images. Our approach in the BILHIV study illustrates a real-world scenario where expert reviewers may not necessarily have the opportunity for consensus agreement prior to consultation. This is the first study to assess the agreement of human expert reviewers for diagnosing visual-FGS with hand-held colposcopy, where both reviewers were blinded to the participants’ FGS and Schistosoma diagnostic status. In this study, both expert reviewers are experienced clinical Professors who have expertise in diagnosing FGS in endemic settings and contributed as authors to the 2015 WHO FGS Pocket Atlas 16 .

While our approach is unique, this work has some limitations. The prevalence of urinary schistosomiasis and PCR-FGS were low, thus limiting precision in effect sizes and power to detect association when comparing PCR-FGS and urinary schistosomasis with visual-FGS. Additionally, the urban setting, relatively narrow age range of the participants and low urinary S. haematobium prevalence may limit generalizability. Future additional work in a setting with higher schistosomiasis prevalence would be needed to definitively exclude an association between symptoms, standard Schistosoma, and FGS diagnostics and visual-FGS. To replicate real-world conditions, standardized equipment on which to perform image review was not provided to reviewers. Thus, we cannot exclude that differences in color, brightness, contrast, or saturation of images on the reviewers’ computers contributed to differences between reviewers. Additionally, future work could incorporate artificial intelligence, such as computer algorithms to detect the characteristic color change caused by involvement of the genital mucosa with FGS 35 or the use of digital gridding techniques 7 . Additionally, a initial consensus rating exercise could be incorporated into future work with human expert review for FGS-associated lesions. The presence or absence of the specific FGS lesion (sandy patch, rubbery papule, abnormal blood vessels) was not consistently documented along with the presence or absence of visual-FGS, limiting analysis by lesion type. Study participants self-reported their time-to-conception status, thus results may be subject to recall bias. STI testing was only performed on a subset of the study population and visual inspection with acetic acid data were not obtained within the BILHIV study, thus data on these variables are incomplete 18 . Without complete STI and HPV testing or cervicovaginal biopsy on each participant, it is challenging to assess the significance of the sandy patches and abnormal blood vessels identified by the clinical expert reviewers. Thus, we cannot exclude residual or unmeasured confounding.

In conclusion, with only “slight” agreement between experienced expert reviewers who identified visual-FGS from digital images obtained during point-of-care colposcopy, we suggest caution when visual imaging is used as a stand-alone FGS diagnostic. While we await field-deployable molecular methods to supplement FGS diagnosis, further studies could evaluate if combining colposcopy for visual-FGS with point-of-care STI diagnostics might improve specificity. Our findings highlight the imperfect nature and challenges of visual diagnosis of FGS as a research and clinical endpoint and is a call to action for improved point-of-care diagnostics and diagnostic pathways for female genital schistosomiasis.

Acknowledgements

We wish to recognize the BILHIV study participants, their contribution made this study possible. We thank the BILHIV study community workers: Namakau Chola, Ethel Mwansa, Mwiingana Lukonga, Ruth Mwanza, Mervis Kantukaleza, and Judith Lungu. We recognize Mr. Clement Mwakamui (Zambart) for his enduring administrative support. We also gratefully acknowledge Eric A.T. Brienen (LUMC) for performing the Schistosoma PCR analysis, Pytsje T. Hoekstra (LUMC) and Claudia J. de Dood for performing the CAA analysis.

Funding Statement

Professor A Bustinduy received funding from the Wellcome Trust (Award 205954/Z/17/Z). Professor P Cools received funding from the Research Foundation – Flanders (BOFSTG2019010101) and was financially supported by a Bill and Melinda Gates Foundation Project (OPP1120972). Professor E Webb and Professor R Hayes received funding from MRC Grant Reference MR/R010161/1, and Dr SC Francis received salary from MRC Grant Reference MR/N023692/1. These awards are jointly funded by the UK Medical Research Council (MRC) and the UK Department for International Development (DFID) under the MRC/DFID Concordat agreement and are also part of the EDCTP2 program supported by the European Union. HPTN 071 (PopART) was supported by the National Institute of Allergy and Infectious Diseases (NIAID) under Cooperative Agreements UM1-AI068619, UM1-AI068617, and UM1-AI068613, with funding from the U.S. President’s Emergency Plan for AIDS Relief (PEPFAR); the International Initiative for Impact Evaluation with support from the Bill and Melinda Gates Foundation; the NIAID, the National Institute on Drug Abuse, and the National Institute of Mental Health, all part of the National Institutes of Health. Professor Eyrun Kjetland was supported by South-Eastern Regional Health Authority, Norway project #2016055 and from the European Research Council under the European Union's Horizon 2020 /ERC Grant agreement no. 101057853 (DUALSAVE-FGS).

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 2; peer review: 6 approved]

Author contributions

Amy S. Sturt – data curation, formal analysis, BILHIV project administration, visualization, original manuscript preparation, manuscript editing and revision

Henrietta Bristowe – formal analysis, original manuscript preparation, manuscript editing and revision

Emily L. Webb – data curation, formal analysis, supervision, original manuscript preparation, manuscript editing and revision

Isaiah Hansingo – resources, supervision, writing – review and editing

Comfort R. Phiri – BILHIV project administration, writing – review and editing

Maina Mudenda – investigation, writing – review and editing

Joyce Mapani – investigation, writing – review and editing

Tobias Mweene – investigation, writing – review and editing

Bruno Levecke – resources, manuscript editing and revision

Piet Cools – resources, investigation, manuscript editing and revision

Govert J. van Dam – investigation, writing – review and editing

Paul L.A.M. Corstjens – investigation, writing – review and editing

Helen Ayles – resources, writing – review and editing

Richard J. Hayes – supervision, resources, writing – review and editing

Suzanna C. Francis – supervision, writing – review and editing

Lisette van Lieshout – investigation, writing – review and editing

Bellington Vwalika - investigation, writing – review and editing

Eyrun F. Kjetland – investigation, writing – review and editing

Amaya L. Bustinduy – funding acquisition, supervision, original manuscript preparation, manuscript editing and revision

References

  • 1. WHO: Schistosomiasis. Geneva: World Health Organization;2022; [accessed November 10, 2022]. Reference Source [Google Scholar]
  • 2. Bustinduy AL, Randriansolo B, Sturt AS, et al. : An update on female and male genital schistosomiasis and a call to integrate efforts to escalate diagnosis, treatment and awareness in endemic and non-endemic settings: The time is now. Adv Parasitol. 2022;115:1–44. 10.1016/bs.apar.2021.12.003 [DOI] [PubMed] [Google Scholar]
  • 3. Kjetland EF, Ndhlovu PD, Mduluza T, et al. : Simple clinical manifestations of genital Schistosoma haematobium infection in rural Zimbabwean women. Am J Trop Med Hyg. 2005;72(3):311–9. 10.4269/ajtmh.2005.72.311 [DOI] [PubMed] [Google Scholar]
  • 4. Kjetland EF, Kurewa EN, Ndhlovu PD, et al. : Female genital schistosomiasis--a differential diagnosis to sexually transmitted disease: genital itch and vaginal discharge as indicators of genital Schistosoma haematobium morbidity in a cross-sectional study in endemic rural Zimbabwe. Trop Med Int Health. 2008;13(12):1509–17. 10.1111/j.1365-3156.2008.02161.x [DOI] [PubMed] [Google Scholar]
  • 5. Kjetland EF, Kurewa EN, Mduluza T, et al. : The first community-based report on the effect of genital Schistosoma haematobium infection on female fertility. Fertil Steril. 2010;94(4):1551–3. 10.1016/j.fertnstert.2009.12.050 [DOI] [PubMed] [Google Scholar]
  • 6. Norseth HM, Ndhlovu PD, Kleppa E, et al. : The colposcopic atlas of schistosomiasis in the lower female genital tract based on studies in Malawi, Zimbabwe, Madagascar and South Africa. PLoS Negl Trop Dis. 2014;8(11):e3229. 10.1371/journal.pntd.0003229 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Arenholt LTS, Aaroe KK, Norderud K, et al. : Cervical lesion proportion measure using a digital gridded imaging technique to assess cervical pathology in women with genital schistosomiasis. PLoS Negl Trop Dis. 2022;16(7): e0009995. 10.1371/journal.pntd.0009995 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Søfteland S, Sebitloane MH, Taylor M, et al. : A systematic review of handheld tools in lieu of colposcopy for cervical neoplasia and female genital schistosomiasis. Int J Gynaecol Obstet. 2021;153(2):190–9. 10.1002/ijgo.13538 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Sturt AS, Webb EL, Phiri CR, et al. : Genital self-sampling compared with cervicovaginal lavage for the diagnosis of female genital schistosomiasis in Zambian women: The BILHIV study. PLoS Negl Trop Dis. 2020;14(7):e0008337. 10.1371/journal.pntd.0008337 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Kjetland EF, Leutscher PD, Ndhlovu PD: A review of female genital schistosomiasis. Trends Parasitol. 2012;28(2):58–65. 10.1016/j.pt.2011.10.008 [DOI] [PubMed] [Google Scholar]
  • 11. Tushabe JV, Lubyayi L, Sserubanja J, et al. : Does Intensive Treatment Select for Praziquantel Resistance in High-Transmission Settings? Parasitological Trends and Treatment Efficacy Within a Cluster-Randomized Trial. Open Forum Infect Dis. 2020;7(4):ofaa091. 10.1093/ofid/ofaa091 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Summers S, Bhattacharyya T, Allan F, et al. : A review of the genetic determinants of praziquantel resistance in Schistosoma mansoni: Is praziquantel and intestinal schistosomiasis a perfect match? Front Trop Dis. 2022;3:933097. 10.3389/fitd.2022.933097 [DOI] [Google Scholar]
  • 13. Wells N, Chappuis F, Beran D: Spotlight on experiences of medicine unavailability: access to medicines challenges for NCDs and NTDs - the contrasting cases of insulin and praziquantel. Expert Rev Clin Pharmacol. 2020;13(4):341–53. 10.1080/17512433.2020.1740589 [DOI] [PubMed] [Google Scholar]
  • 14. Hayes RJ, Donnell D, Floyd S, et al. : Effect of Universal Testing and Treatment on HIV Incidence - HPTN 071 (PopART). N Engl J Med. 2019;381(3):207–18. 10.1056/NEJMoa1814556 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Kalinda C, Chimbari MJ, Mukaratirwa S: Schistosomiasis in Zambia: a systematic review of past and present experiences. Infect Dis Poverty. 2018;7(1):41. 10.1186/s40249-018-0424-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. World Health Organization: Female genital schistosomiasis: a pocket atlas for clinical health-care professionals. World Health Organization,2015; Accessed April 30, 2021. Reference Source [Google Scholar]
  • 17. Zambian Ministry of Health: Guidelines for the Etiological and Clinical Management of Sexually Transmitted Infections in Zambia.2017. [Google Scholar]
  • 18. Rafferty H, Sturt AS, Phiri CR, et al. : Association between cervical dysplasia and female genital schistosomiasis diagnosed by genital PCR in Zambian women. BMC Infect Dis. 2021;21(1):691. 10.1186/s12879-021-06380-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Sankaranarayanan R, Wesley R, Thara S, et al. : Test characteristics of visual inspection with 4% acetic acid (VIA) and Lugol's iodine (VILI) in cervical cancer screening in Kerala, India. Int J Cancer. 2003;106(3):404–8. 10.1002/ijc.11245 [DOI] [PubMed] [Google Scholar]
  • 20. Corstjens PL, De Dood CJ, Kornelis D, et al. : Tools for diagnosis, monitoring and screening of Schistosoma infections utilizing lateral-flow based assays and upconverting phosphor labels. Parasitology. 2014;141(14):1841–55. 10.1017/S0031182014000626 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Corstjens P, de Dood CJ, Knopp S, et al. : Circulating Anodic Antigen (CAA): A Highly Sensitive Diagnostic Biomarker to Detect Active Schistosoma Infections-Improvement and Use during SCORE. Am J Trop Med Hyg. 2020;103(1_Suppl):50–7. 10.4269/ajtmh.19-0819 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Obeng BB, Aryeetey YA, de Dood CJ, et al. : Application of a circulating-cathodic-antigen (CCA) strip test and real-time PCR, in comparison with microscopy, for the detection of Schistosoma haematobium in urine samples from Ghana. Ann Trop Med Parasitol. 2008;102(7):625–33. 10.1179/136485908X337490 [DOI] [PubMed] [Google Scholar]
  • 23. Sturt AS, Webb EL, Himschoot L, et al. : Association of Female Genital Schistosomiasis With the Cervicovaginal Microbiota and Sexually Transmitted Infections in Zambian Women. Open Forum Infect Dis. 2021;8(9):ofab438. 10.1093/ofid/ofab438 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Sturt AS, Webb EL, Patterson C, et al. : Cervicovaginal Immune Activation in Zambian Women With Female Genital Schistosomiasis. Front Immunol. 2021;12(181):620657. 10.3389/fimmu.2021.620657 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Sturt AS, Webb EL, Phiri CR, et al. : Female Genital Schistosomiasis and HIV-1 Incidence in Zambian Women: A Retrospective Cohort Study. Open Forum Infect Dis. 2021;8(7):ofab349. 10.1093/ofid/ofab349 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Land KJ, Boeras DI, Chen XS, et al. : REASSURED diagnostics to inform disease control strategies, strengthen health systems and improve patient outcomes. Nat Microbiol. 2019;4(1):46–54. 10.1038/s41564-018-0295-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Kjetland EF, Norseth HM, Taylor M, et al. : Classification of the lesions observed in female genital schistosomiasis. Int J Gynaecol Obstet. 2014;127(3):227–8. 10.1016/j.ijgo.2014.07.014 [DOI] [PubMed] [Google Scholar]
  • 28. Engels D, Hotez PJ, Ducker C, et al. : Integration of prevention and control measures for female genital schistosomiasis, HIV and cervical cancer. Bull World Health Organ. 2020;98(9):615–24. 10.2471/BLT.20.252270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Poggensee G, Kiwelu I, Weger V, et al. : Female genital schistosomiasis of the lower genital tract: prevalence and disease-associated morbidity in northern Tanzania. J Infect Dis. 2000;181(3):1210–3. 10.1086/315345 [DOI] [PubMed] [Google Scholar]
  • 30. Gandasegui J, Fernández-Soto P, Carranza-Rodríguez C, et al. : The Rapid-Heat LAMPellet Method: A Potential Diagnostic Method for Human Urogenital Schistosomiasis. PLoS Negl Trop Dis. 2015;9(7):e0003963. 10.1371/journal.pntd.0003963 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Archer J, Patwary FK, Sturt AS, et al. : Validation of the isothermal Schistosoma haematobium Recombinase Polymerase Amplification (RPA) assay, coupled with simplified sample preparation, for diagnosing female genital schistosomiasis using cervicovaginal lavage and vaginal self-swab samples. PLoS Negl Trop Dis. 2022;16(3):e0010276. 10.1371/journal.pntd.0010276 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Swai B, Poggensee G, Mtweve S, et al. : Female genital schistosomiasis as an evidence of a neglected cause for reproductive ill-health: a retrospective histopathological study from Tanzania. BMC Infect Dis. 2006;6:134. 10.1186/1471-2334-6-134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Galappaththi-Arachchige HN, Holmen S, Koukounari A, et al. : Evaluating diagnostic indicators of urogenital Schistosoma haematobium infection in young women: A cross sectional study in rural South Africa. PLoS One. 2018;13(2): e0191459. 10.1371/journal.pone.0191459 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Holmen SD, Kleppa E, Lillebø K, et al. : The first step toward diagnosing female genital schistosomiasis by computer image analysis. Am J Trop Med Hyg. 2015;93(1):80–6. 10.4269/ajtmh.15-0071 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Holmen SD, Kjetland EF, Taylor M, et al. : Colourimetric image analysis as a diagnostic tool in female genital schistosomiasis. Med Eng Phys. 2015;37(3):309–14. 10.1016/j.medengphy.2014.12.007 [DOI] [PubMed] [Google Scholar]
Wellcome Open Res. 2023 Jul 3. doi: 10.21956/wellcomeopenres.21450.r56976

Reviewer response for version 2

Soledad Colombe 1

I thank the authors for the changes brought to the manuscript. I have no further comments.

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Infectious Diseases ; Epidemiology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Wellcome Open Res. 2023 Jun 6. doi: 10.21956/wellcomeopenres.21450.r56977

Reviewer response for version 2

Vanessa Christinet 1

Thank you for the answers.

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Partly

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Partly

Are sufficient details of methods and analysis provided to allow replication by others?

Partly

Reviewer Expertise:

Sexual health, HIV, communicable disease, tropical medicine, public health, epidemiology.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Wellcome Open Res. 2023 May 16. doi: 10.21956/wellcomeopenres.21450.r56974

Reviewer response for version 2

Louise Arenholt 1,2

I find the response and revision sufficient. I therefore recommend the manuscript for indexing.

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Partly

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Partly

Reviewer Expertise:

Gynecology and obstetrics. Urogynecology.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Wellcome Open Res. 2023 Feb 21. doi: 10.21956/wellcomeopenres.20776.r54085

Reviewer response for version 1

Soledad Colombe 1

I thank the editor and authors for the opportunity to review this article, which brings to light relevant information for those working on female genital schistosomiasis. In this study, the authors tested the validity of the recommended visual diagnosis of FGS by assessing the concordance in visual diagnosis from two independent senior physicians. They found very low agreement between the independent reviewers, highlighting the complexity of using visual imaging for diagnosis of female genital schistosomiasis. The article was overall very sound and well written. I have made a few comments below on both format and scientific content, which I hope will be useful to the authors.

Comments on format:

  • Please define CVL in the abstract before using the accronym.

  • I would quickly define in the abstract what is meant by “expert reviewers”, and would also define it sooner in the methods in the main text.

Comments on scientific content:

I am missing more in-depth description and discussion of the discordance in visual diagnosis between the two expert reviewers.

  • a) I noticed that the discordance was already quite large for the number of images that could not be interpreted (36 vs 106). What does “images inaccessible” in Figure 1 mean? Could you explain this discordance?

  • b) Regarding the discordance in visual diagnosis of FGS: I understand that the presence or absence of a specific FGS lesion was not systematically recorded but did you notice a systematicity in the discordance? Would this discordance have been solved in the majority of cases after discussions between the two expert reviewers?   

  • c) Does the conclusion of “diagnostic uncertain” (n=23) of reviewer 1 overlap with the images discarded by reviewer 2 for “poor cervical visibility”?

  • d) I would suggest to add pictures, if allowed by the journal, of images leading to concordant positive, concordant negative and discordant diagnoses.

It seems that in a number of participants, visual-FGS was “positive” whereas PCR/CAA/Microscopy were negative. In addition to the fact that the lesions thought during visual inspection can be non-specific, could it also be that visual-FGS in some cases is rather the sign of past infection? And in that case can the authors discuss the added-value of visual-FGS diagnostic?

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Infectious Diseases ; Epidemiology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Wellcome Open Res. 2023 Apr 5.
Amy Sturt 1

"Comments on format: -please define CVL in the abstract before using the acronym Thank you for noticing this, this has been amended.   -I would quickly define in the abstract what is meant by expert reviewers and would also define it sooner in the methods in the main text."

Thank you for this suggestion. We inserted “senior physicians who have training and expertise in FGS and standard colposcopy served as expert reviewers” in the abstract at line 64. As you have suggested, we also included an enhanced definition of the expert reviewers in the methods at lines 152-153.  

"Comments on the scientific section: I am missing a more in-depth description and discussion of the discordance in visual diagnosis between the two expert reviewers. -a) I noticed that the discordance was already quite large for the number of images that could not be interpreted (36 vs 106). What does images inaccessible in Figure 1 mean? Could you explain this discordance?"   

Thank you for the opportunity to clarify. In lines 156-157 we explain that, at their discretion, expert reviewers could exclude images that they felt could not be evaluated due to technical issues, image quality, or limited cervical visualization. This number was 36 for Reviewer 1 and 106 for Reviewer 2. Reviewer 1 had difficulty opening 10 of the images. In Figure 1 ‘images inaccessible’ has been changed to “technical difficulty opening images”.  

"-b) Regarding the discordance in visual diagnosis of FGS: I understand that the presence or absence of a specific FGS lesion was not systematically recorded but did you notice a systematicity in the discordance? Would this discordance have been solved in the majority of cases after discussions between the two expert reviewers?"   

We did not observe a systematicity in the discordance. It is difficult to say whether a discussion between experts could have resolved the discordance. We did not include an initial consensus rating agreement in our study as we wanted to simulate real-world conditions in our study, where consultant physicians may not have time to perform a consensus review on a large number of images. We did note in the discussion at lines 407-408 that an initial consensus rating exercise could be incorporated into future work with human expert review for FGS-associated lesions.  

"-c) Does the conclusion of “diagnostic uncertain” (n=23) of reviewer 1 overlap with the images discarded by reviewer 2 for “poor cervical visibility”?"   

Thank you for this question. The images discarded by Reviewer 2 seem to be rejected at random as there were diagnoses of FGS negative, sandy patches and abnormal blood vessels among the discarded images. Within these images there were a minority (n=6) that were also rejected by Reviewer 1.  

"-d) I would suggest to add pictures, if allowed by the journal, of images leading to concordant positive, concordant negative and discordant diagnoses."   

Wellcome Open Research specifies that “Any photographs must be accompanied by written consent to publish from the individuals involved”. We did not obtain consent from the participants to publish cervicovaginal images so unfortunately cannot include these.  

"It seems that in a number of participants, visual-FGS was “positive” whereas PCR/CAA/miscropscopy were negative. In addition to the fact that lesions though during the visual inspection can be non-specific, could it also be that visual-FGS in some cases is rather the sign of past-infection? And in that case can the authors discuss the added value of visual-FGS diagnostic?"   

Thank you for this observation. We agree that it is possible that visual-FGS may be representative of past infection and agree that in this setting, its utility as a FGS diagnostic is limited. We feel strongly that the way forward in FGS diagnosis is through the further development, implementation, and scale-up of molecular diagnostics for current FGS, as outlined in lines 421-423.

Wellcome Open Res. 2023 Feb 21. doi: 10.21956/wellcomeopenres.20776.r54091

Reviewer response for version 1

Susannah Colt 1

Summary of the article:

Female genital schistosomiasis (FGS) is an underreported and debilitating illness affecting women and girls and is widespread among Schistosoma haematobium-endemic regions throughout Africa. There are significant challenges to accurate FGS diagnosis relating to lack of a consensus method, equipment availability, technical expertise, and stigma surrounding women’s reproductive health. While the proposed reference-standard for FGS diagnosis is visual inspection by colposcopy (visual-FGS), this method is subjective to expert interpretation and can be further confounded by some sexually transmitted infections. The authors report secondary findings of the cross-sectional bilharzia and HIV (BILHIV) study, in which, N=527 women from Zambia were enrolled; provided urine, cervical and vaginal self-sampled tissue swabs; and underwent clinical cervicovaginal lavage sampling and portable colposcope examination and imaging. The primary aim was to evaluate the agreement of visual-FGS by two independent experts, and the secondary aims were to compare visual-FGS with other FGS and S. haematobium diagnostic techniques as well as self-reported FGS sequelae. While the proportion of FGS cases—by any diagnostic method—was relatively low in this cohort, the authors report only slight agreement between the two visual-FGS experts. These results further emphasize the challenges associated with FGS diagnosis and the need for more reliable and accessible diagnostic methods. This study provides an important contribution to the FGS literature and calls attention to barriers to providing reproductive health care for African women.

Introduction:

  1. The authors should provide some background epidemiological info on burden of FGS among women and girls in Africa, associated morbidities (relating to both physical and mental health as some of these are a feature of the secondary analysis), and emphasize that cases are underdiagnosed due to a myriad of reasons (equipment, expertise, access to women's health services, etc). If available, include any info on the burden of urogenital schistosomiasis in Zambia or offer context for why Zambia was chosen as the study site. 

  2. Please describe techniques for identifying FGS in the introduction (biopsy of genital tissue, visual inspection by colposcopy, PCR performed on cervicovaginal lavage material) and include the advantages and disadvantages of each. 

  3. It would be helpful to clarify in the introduction that this investigation reports findings of the BILHIV cohort study.

  4. Suggestion to state the primary and two secondary objectives of the present study in the introduction section.

Methods:

  1. The study benefits from the selection of two highly qualified visual-FGS experts, inclusion of advanced techniques in FGS diagnoses including qPCR of CVL material, and application of the Mobile ODT hand-held colposcope which is one of the devices recommended for FGS diagnosis in schistosomiasis-endemic areas in a 2021 systematic review of hand-held colposcopy equipment.

  2. It would be helpful for the authors to clarify which participants were included for STI evaluation and the criteria for how this subset was selected.

  3. Please provide detail about the regression model building approach and explanation for the inclusion for age or age and district as covariates. Have the authors considered additional potential confounding factors, such as co-infection with HIV or other STIs, other co-morbidities, sexual history variables, or pregnancy history?

Results:

  1. There is a typo in the word “assessment” at the end of the "Visual FGS and Schistosoma laboratory tests” paragraph.

  2. It would be interesting for authors to report the agreement between tissue-specific qPCR results (cervical swab, vaginal swab, and CVL).

Discussion:

  1. A strong case is presented for the disadvantages of visual-FGS diagnosis.

  2. The authors discuss the limited feasibility for PCR-based diagnosis in schistosomiasis-endemic areas and make reference to promising developments in Loop-Mediated Isothermal Amplification (LAMP) methods, which offer added accessibility in resource-limited settings. The authors might consider incorporating discussion or reference to demonstrated LAMP sensitivity for S. japonicum and S. haematobium (Wang et al. 2011 1 and Gandasegui et al., 2015 2 ).

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Epidemiology and biostatistics; schistosomiasis; infectious disease immunology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

References

  • 1. : Application of DNA-based diagnostics in detection of schistosomal DNA in early infection and after drug treatment. Parasit Vectors .2011;4: 10.1186/1756-3305-4-164 164 10.1186/1756-3305-4-164 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. : The Rapid-Heat LAMPellet Method: A Potential Diagnostic Method for Human Urogenital Schistosomiasis. PLoS Negl Trop Dis .2015;9(7) : 10.1371/journal.pntd.0003963 e0003963 10.1371/journal.pntd.0003963 [DOI] [PMC free article] [PubMed] [Google Scholar]
Wellcome Open Res. 2023 Apr 5.
Amy Sturt 1

"Introduction – The authors should provide some background epidemiological info on burden of FGS among women and girls in Africa, associated morbidities (relating to both physical and mental health as some of these are a feature of the secondary analysis), and emphasize that cases are underdiagnosed due to a myriad of reasons (equipment, expertise, access to womens health services, etc). If available, include any info on burden of urogenital schistosomiasis in Zambia or offer context for why Zambia was chosen as a study site."  Thank you for your input. In lines 89-90 we have added some associated morbidities as you have suggested. Schistosomiasis is endemic in Zambia as outlined in a recent systematic review: https://pubmed.ncbi.nlm.nih.gov/29706131/ . A sentence regarding the S. haematobium prevalence in Zambia has been inserted at lines 122 – 124, as you have suggested. The BILHIV study was nested within the HPTN 071 (PopART) trial, which was enrolling in Zambia, providing an opportunity to further evaluate urogenital schistosomiasis in this region. We have added additional information about HPTN in Zambia in lines 124-125. We agree 100% that FGS is underdiagnosed due to the reasons you have outlined above. We had a more complete discussion on the challenges of making a FGS diagnosis in an initial draft of the manuscript. Ultimately, we removed these details as we felt the detail detracted from the main focus of this manuscript - visual FGS. We discuss the challenges inherent in making a FGS diagnosis in the main BILHIV manuscript, which we have now referenced earlier at line 96.  

"2) Please describe techniques for identifying FGS in the introduction (biopsy of genital tissue, visual inspection by colposcopy, PCR performed on cervicovaginal lavage material) and include the advantages and disadvantages of each."   

Thank you for this suggestion, we initially had a more complete discussion of FGS methods in an earlier draft of the manuscript, but felt the depth (and length) detracted from the main focus. Instead we have summarized the challenges of making a FGS diagnosis in lines 98 -99 by saying “Confirming a diagnosis of FGS is challenging as there is not a widely accepted diagnostic reference standard for research, diagnosis, and screening.” Additionally, we have included a more complete review of the advantages and disadvantages of the available FGS screening methods in the discussion.  

"3) It would be helpful to clarify in the introduction that this investigation reports findings of the BILHIV cohort study."   

Thank you for this input, this is now clearly stated in the introduction at line 113.  

"4) Suggestion to state the primary and two secondary objectives of the present study in the introduction section."   

Thank you for this suggestion. The primary and secondary objectives of the analysis are now stated in the introduction from lines 114 – 117.  

"Methods: 1.The study benefits from the selection of two highly qualified visual-FGS experts, inclusion of advanced techniques in FGS diagnosis including qPCR of CVL material, and application of the Mobile ODT hand-held colposcope which is one of the devices recommended for FGS diagnosis in schistosomiasis-endemic areas in a 2021 systematic review of hand-held colposcopy equipment."   

We thank the reviewer from highlighting this strength.  

"2.It would be helpful for authors to clarify which participants were included for STI evaluation and the criteria for how this subset was selected."   

Thank you for the opportunity to provide additional information. We have added some information about this subset and their selection criteria in the Methods section “other infections” at lines 205-211.  

"3.Please provide detail about the regression model building approach and explanation for the inclusion of age or age and district as covariates. Have the authors considered additional potential confounding factors such as co-infection with HIV or other STIs, other co-morbidities, sexual history variables or pregnancy history?"   

The regression model was built after construction of causal conceptual frameworks with age and district included a priori . District of residence may potentially be a surrogate for socioeconomic status and/or probability of schistosomiasis exposure (one district was closer to a local river). Additional variables (including prior history of pregnancy) were included in the model in a step-wise fashion and retained if they produced a 10% change in Odds Ratio. No additional variables produced a change in OR and thus were not retained in the model. Since HIV has not been consistently associated with FGS across studies, this variable was not included. STI and cervical pre-cancer were considered for inclusion in the model. Given the large amount of missing data in the STI and cervical pre-cancer variables, these were not included in the regression analysis. We also investigated the effect of adjusting for HIV status and found it made no difference to our findings.  STI and cervical cancer testing were performed on a subset of well under half of the participants. Including these variables as potential confounders in the regression would have led to further data sparsity. Despite this, we conducted an additional exploratory analysis where we controlled for STI (and separately, VIA) when assessing the association between FGS diagnosis and symptoms, and compared the ORs with those from adjusting for only age and district, restricted to participants who had STI (or VIA) information available (i.e. in order to compare ORs on the same subset of participants). We found no suggestion that STI or VIA act as confounders, and, as already noted, power to detect associations was substantially reduced in this analysis.  

"Results: 1.There is a typo in the word assessment at the end of the “visual FGS and schistosoma laboratory tests paragraph”." 

Thank you, this has been amended.  

"2.It would be interesting for authors to report the agreement between tissue specific qPCR results." 

Thank you for this suggestion, the full details regarding agreement between tissue-specific qPCR results are reported in a previous manuscript. We have clarified that further details regarding the operating characteristics of these diagnostics have been previously described and have added in lines at 284-285 referring the reader to the previous manuscript. Full details can be found at https://journals.plos.org/plosntds/article?id=10.1371/journal.pntd.0008337  

"Discussion: 1.A strong case is presented for the disadvanges of visual-FGS diagnosis." 

Thank you for this feedback.

"2.The authors discuss the limited feasibility for PCR-based diagnosis in schistosomiasis-endemic areas and make a reference to promising developments in LAMP methods, which offer added accessibility in resource limited settings. The atuhors might consider incorporating discussion or reference to demonstrated LAMP sensitivity for S. japonicum and S. haematobium (wang 2011- Parasit Vectors and Gandasegui 2015 - PNTD)." 

Thank you for this helpful reference. Gandasegui 2015 was entered at line 354.

Wellcome Open Res. 2023 Feb 8. doi: 10.21956/wellcomeopenres.20776.r54089

Reviewer response for version 1

Vanessa Christinet 1

Thank you for inviting me to review this article on a comparative analysis of visual diagnosis of FGS between two experts in visual diagnosis of cervical images. The authors chose to explore the relationship between the diagnosis based on image analysis of the two reviewers and several objective measurements of S. Heamatobium infection diagnosis and also with symptoms described by the patients.

This study and its results are very relevant and interesting as there is little produced in this field of research. It is indeed very important to evaluate diagnosis by visual inspection of the cervix as this is the recommended method of reference but there is a lack of data on the reliability of the diagnosis.

As this study is nested within the HPTN071 study, I would recommend to briefly describe the HPTN071 study in a single sentence, even if it has already been described in other publications, in order to fully understand the profile of the participants and the general context of the study.

Indeed, the question of HIV diagnosis and the level of immunosuppression (if known) should be addressed as it is certainly an important confounding factor for the interpretation of the visualized lesions and could induce an important heterogeneity of cervical lesions making them more difficult to interpret (association between HIV and FGS, association between HIV and other STIs, association between HIV-HPV-dysplasia).

In the description of the method, it is stated that STIs were not screened for POC but later in the text it is written that a whole panel of STIs was tested. Considering that the authors state that microbiological STI diagnosis can be a confounding factor in the diagnosis and symptoms associated with FGS, it seems important to me that they are taken into account in the analysis. As this is not the case here, it would be relevant to justify why.

Cervical cancer screening with visualization of dysplasia is described in the methodology but is not included in the comparative analysis between the two reviewers despite the fact that it is described as a confounder in terms of vascularization.

In the statistical analysis section it is described that the OR was adjusted for age. I would like the authors to describe the rationale for this adjustment and why other confounding factors were not included in this adjustment (STI, HIV, cervical dysplasia). Age seems to me to be relevant for the adjustment of the association with delay of conception, but I do not understand the interest of using it for the other variables (it should be noted that the unadjusted and adjusted ORs are not very different, which speaks to a moderate interest for the inclusion of this adjustment factor).

In the method, additional detail on the number of photos per patient sent to the reviewers would be an enhancement as this could give more information on what they based their evaluation on. It would also be interesting to have more detailed information on the quality of the images to help understand the discrepancy between the two reviewers' analysis. Was a precise description of the lesions made by each of the reviewers which would allow a better understanding of the level of disagreement?

It is noteworthy that although not reaching statistical significance, both reviewers consistently identified a greater proportion of FGS lesions in individuals with objective markers of schistosome infection. If the sample size had been larger, statistical significance would probably have been reached for some variables. If the data are available, I think it would be very interesting to exploit the STI diagnoses here to see how they probably contributed to the heterogeneity of the analysis.

I find the presentation of the results not very easy to understand. The authors refer several times to the non-association without describing precisely the significance of the analysis which is actually the lack of a statistically significant difference between patients diagnosed by positive or negative visual inspection by the reviewers in relation to objective parameters of schistosome infection. Again, it should be noted that even if the differences are not statistically significant, the ORs are still positive in favour of the objective measurements of schistosomiasis. I suggest rewording the text below Table 3.

It is interesting to note that the symptom with the highest OR for both reviewers is for the association with haematuria, which is the most specific symptom of Schistosoma Heamatobium infection.

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Partly

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Partly

Are sufficient details of methods and analysis provided to allow replication by others?

Partly

Reviewer Expertise:

Sexual health, HIV, communicable disease, tropical medicine, public health, epidemiology.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Wellcome Open Res. 2023 Apr 5.
Amy Sturt 1

"As this study is nested within the HPTN study, I would recommend to briefly describe the HPTN study in a single sentence, even if it has already been described in other publications, in order to fully understand the profile of the participants and the general context of the study."    Thank you for this feedback. The HPTN 071 (PopART) trial was a cluster randomized trial assessing the impact of an HIV-1 combination prevention package including “universal testing and treatment”. We have added this information in lines 124-125.  

"Indeed, the question of HIV diagnosis and the level of immunosuppression (if known) should be addressed as it is certainly a confounding factor for the interpretation of the visualized lesions and could induce an important heterogeneity of cervical lesions making them more difficult to interpret (association between HIV and FGS, association between HIV and other STI, association between HIV-HPV-dysplasia)."   

Data were not collected from participants regarding CD4 count or HIV viral load for this study. We agree that HIV-1 status should be assessed in future manuscripts which would evaluate associations between HIV and STI or HIV, HPV and cervical dysplasia. For the primary outcome of this study (agreement between expert reviewers), we acknowledge that HIV could induce heterogeneity in the lesions. However since the participants’ HIV-1 diagnosis in this study was concordant between reviewers, and our primary purpose was to describe agreement between expert reviewers (irrespective of other participant characteristics), we believe that it is appropriate to describe agreement in visual-FGS diagnosis without adjustment for HIV or other factors that could influence the nature of the lesions.   

"In the description of the method, it is stated that STIs were not screened for POC but later in the test it is written that a whole panel of STI was tested. Considering that the authors state that microbiological STI diagnosis can be a confounding factor in the diagnosis and symptoms associated with FGS, it seems important to me that they are taken into account in the analysis. As this is not the case here it would be relevant to justify why."   

Thank you for this enquiry. The expert image review was performed and completed prior to STI testing. Thus, STI test results were not available at the time of expert image review to supplement the reviewers’ findings. The primary aim of this study was to evaluate the agreement of human expert reviewers for the diagnosis of visual-FGS. Due to study design and budgetary constraints, STI testing was only performed on a subset of study participants. This has been further clarified in the manuscript from lines 205-211. Overall, 40% (211/527) of participants in this study had STI data available, limiting power to evaluate associations between STI, visual-FGS, and symptoms. Thus the numbers were not large enough to meaningfully control for STI. Despite this, we have conducted an exploratory analysis where we additionally controlled for STI when assessing the association between FGS diagnosis and symptoms, restricted to the sub-group of participants who had STI information available (i.e. in order to assess the potential confounding affect of STI among the same subset of participants, comparing like with like). In this exploratory analysis we found no suggestion that STI acts as a confounder.  

"Cervical cancer screening with visualization of dysplasia is described in the methodology but is not included in the comparative analysis between the two reviewers despite the fact that it is described as a confounder in terms of vascularization."   

Thank you for this enquiry. Cervical cancer screening was not performed as part of the BILHIV study. Cervical cancer screening results were extracted from participant medical records after expert image review was performed and completed. Thus, cervical pre-cancer status was not available at the time of expert image review to supplement the reviewers’ findings. For the primary outcome of this study (agreement between expert reviewers), we acknowledge that cervical pre-cancer could induce heterogeneity in the lesions. However since the participants’ cervical pre-cancer diagnosis in this study was concordant between reviewers, and our primary purpose was to describe agreement between expert reviewers (irrespective of other participant characteristics), we believe that it is appropriate to describe agreement in visual-FGS diagnosis without adjustment for cervical pre-cancer or other factors that could influence the nature of the lesions.   

"In the statistical analysis section it is described that the OR was adjusted for age. I would like the authors to describe the rationale for this adjustment and why other confounding factors were not included in the adjustment (STI, HIV, cervical dysplasia). Age seems to be to be relevant for the association with delay of conception but I do not understand the interest of using it for the other variables (it should be noted that the unadjusted and adjusted ORs are not different which speaks to a moderate interest for the inclusion of this adjustment factor)."  Thank you for the opportunity to clarify the statistical methods. Age was prioritized as a confounder to include since it has been associated in the literature with both FGS and presence of symptoms. We also investigated the effect of adjusting for HIV status and found it made no difference to our findings.  As described above, STI and cervical cancer testing were performed on a subset of well under half of the participants. Including these variables as potential confounders in the regression would have led to further data sparsity. Despite this, we conducted an additional exploratory analysis where we controlled for STI (and separately, VIA) when assessing the association between FGS diagnosis and symptoms, restricted to the sub-group of participants who had VIA (or STI) information available (i.e. in order to assess the potential confounding affect of VIA among the same subset of participants, comparing like with like). In this exploratory analysis we found no suggestion that VIA acts as a confounder. As already noted, power to detect associations was reduced in this analysis.  

"In the method, additional data on the number of photos per patient sent to reviewers would be an enhancement as this could give more information on what they based their evaluation on. It would also be interesting to have more detailed information on the quality of the images to help understand the the discrepancy between the two reviewers’ analysis. Was a precise description of the lesions made by each of the reviewers which would allow a better understanding of the level of disagreement. It is noteworthy that although not reaching statistical significance, both reviewers consistently identified a greater proportion of FGS lesions in individuals with objective markers of schistoomiasis. If the sample size has been larger, statistical significance would probably be reached for some variables. If the data are available, I think it would be very interesting to exploit the STI diagnoses here to see how they probably contributed to the heterogeneity of the analysis."   

Thank you for this question. Please see lines 149-153 where we have added in additional information about the standardized protocol used to evaluate each participant. The standardized protocol specified that trained midwives evaluated the cervix, anterior fornix, posterior fornix, left and right lateral cervix and vaginal walls for each participant and captured images of each location. In lines 154-155 we describe that each reviewer noted the presence or absence of one of four clinical manifestations associated with FGS. The reviewers did not note the location or further descriptive information regarding the clinical manifestations. It would have been ideal to have a larger sample size, however it is not possible to extrapolate our findings to what they might have been had the sample size been larger. Unfortunately, due to STI only being evaluated on a sub-set of participants, we are not able to adjust associations for STI among the full study population. However, as noted in our response above, in the subset of participants who did have STI information, we adjusted for STI when assessing associations between FGS-image positivity and symptoms and found that it made little difference to the odds ratios, in the subset of participants who had STI information available.  

"I find the presentation of the results not very easy to understand. The authors refer several times to the non-association without describing precisely the significance of the analysis which is actually the lack of a statistically significant difference between patients diagnosed by positive or negative visual inspection by the reviewers in relation to objective parameters of schistosome infection. Again it should be noted that even if the difference is not statistically significant, the ORs are still positive in favor of thee objective measurements of schistosomiasis. I suggest rewording the text below Table 3. It is interesting to note that the symptom with the highest OR for both reviewers is for the association with hematuria, which is the most specific symptoms of Shaematobium infection."     

Thank you for this input and we apologize for any lack of clarity in the presentation of the results. In lines 292-295 we describe that vaginal discharge, vaginal bleeding after sex, the presence of external genital sores, dysuria and abdominal pain were not associated with the presence of visual FGS as diagnosed by either expert reviewer. In regards to the OR in Table 4 being in favor of an association between Schistosoma diagnostics and visual-FGS, the confidence intervals include 1, suggesting that there is not evidence of a difference between groups.   Thank you, we agree this is an interesting finding.

Wellcome Open Res. 2023 Feb 3. doi: 10.21956/wellcomeopenres.20776.r54092

Reviewer response for version 1

Daniela Fusco 1

The manuscript deals with a very relevant topic in need for urgent solutions. The methods and design are adequate for the study and the results are properly described. 

Some minor recommendations to the manuscript:

  • Title: In the title exclusively the aspect of the agreement of experts emerges, while in the results there is a long description of a regression analysis to associate factors (i.e. symptoms) to visual FGS. This aspect should emerge in the title. Additionally in the abstract it is stated that "The agreement of expert human reviewers for visual-FGS has not been previously described". In this view, the hand-held colposcopy doesn't play a central role in the study hence it should be considered to be removed from the title. 

  • Methods: In the paragraph "Home and clinic-based sample collection" the type health care professionals performing the sampling should be specified. In the paragraphs "qPCR for detection of Schistosoma DNA" and "Other infections" samples storage conditions should be specified. 

  • Discussion: As per in the title, in this statement "This study shows “slight” agreement between senior, highly experienced expert reviewers, highlighting the imperfect nature of human expert review of images obtained with hand-held colposcopy for FGS." the reader can have the feeling that the agreement might be different if another type of colposcope would be used. It would be advisable to reconsider the statement. From the reference 26 the statement "The Tanzanian study illustrates the limited specificity of visual techniques, since one-third of the women had cervical lesions in communities where S. haematobium is not endemic.” cannot be really deduced since, i.e., travel or medical history of women is not described. In this view this statement doesn't seem to reflect the message of the reference and it should be re-considered. The conclusion statement "...we suggest caution when visual imaging is used as a stand-alone FGS diagnostic." should bring some recommendations i.e. on how to interpret or obtain better results through colposcopy since, so far, colposcopy is still the diagnostic standard for the disease and alternatives, even if urgent and needed, are not really available. 

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Partly

Are sufficient details of methods and analysis provided to allow replication by others?

Yes

Reviewer Expertise:

Public Health, schistosomiasis, FGS, women's health, diseases prevention, vaccinology

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

Wellcome Open Res. 2023 Apr 5.
Amy Sturt 1

"1)In the title exclusively the aspect of the agreement of experts emerges, while in the results there is a long description of regression analysis to associate factors (ie symptoms) to visual FGS. This aspect should emerge in the title. Additionally, in the abstract it is stated that “the agreement of expert reviewers for visual-FGS has not been previously described”. In this view the hand-held colposcopy doesn’t play a central role in the study hence it should be considered to be removed from the title."   

Thank you for this input. We have added “and association with clinical symptoms” to the title to tie in the results from the regression analysis which evaluated the association of visual-FGS with clinical manifestations. We appreciate your perspective on omitting “hand-held colposcopy” from the title. Since hand held colposcopy is a unique and scalable technology, we feel it is important to highlight its use in the title.  

"2) Methods – In the paragraph “home and clinic based sample collection” the type health care professional performing the sampling should be specified. In the paragraphs “qPCR for detection of schistosoma DNA” and “other infections” sample storage conditions should be specified."   

Thank you for the opportunity to clarify. At line 143 we explain that trained midwives performed sample collection. Additionally, all study specimens were stored at -80C. This has been clarified at line 193.  

"3) Discussion: As per the title, in this statement “This study shows slight agreement between senior, highly experienced expert reviewers, highlighting the imperfect nature of human expert review of images obtrained with hand-held colposcopy for FGS”. The reader can have the feeling that the agreement might be different if another type of colposcope would be used. It would be advisable to reconsider the statement. From the reference 26 the statement “the Tanzanian study illustrates the limited specificity of visual techniques since 1/3 of the women had cervical lesions in communities where S. haematobium is not endemic” cannot really be deduced since i.e. travel or medical history of women is not described. In this view this statement doesn’t seem to reflect the message of the reference and it should be reconsidered. The conclusion statement, “we suggest cauthion when visual imaging is used as a stand-alone FGS diagnostic” should bring some recommendations on how to interpret of obtain better results through colposcopy, since, so far, colposcopy is still the diagnostic standard for the disease and alternatives, even if urgent and needed, are not really available."   

Thank you for your input on the discussion. As you have suggested, we have removed “obtained with hand-held colposcopy” from the referenced excerpt in the discussion and the abstract. Thank you for your perspective on the Tanzanian study (reference 26). On page 1210 of the referenced manuscripts the manuscript authors (Poggensee et al) note that “schistosomiasis is not endemic in this area” and used this group as their control group (against which to compare the participants who lived in areas endemic for schistosomiasis), and on page 1211 the authors refer to the participants who lived in non-endemic locations as “women living in schistosomiasis free areas”. However, it is true that travel and medical history of women is not described, and we now acknowledge this in the discussion. Thank you for the opportunity to provide further recommendations about how to obtain better results through colposcopy. This is difficult given the low specificity of visual imaging for FGS. Further studies are needed to evaluate if combining colposcopy for visual-FGS diagnosis with point-of-care STI testing might improve specificity. As you have suggested, this has been added to the manuscript’s discussion at lines 342-343.

Wellcome Open Res. 2023 Jan 31. doi: 10.21956/wellcomeopenres.20776.r54086

Reviewer response for version 1

Alexander B Odaibo 1

The authors evaluated the agreement of human expert reviewers on visual-FGS diagnosis with hand-held colposcopy. Two clinical experts in the diagnosis of FGS in endemic settings were engaged in the study. They assessed 527 women cervicovaginal colposcopy images independently. The study recorded a "slight" agreement between the expert reviewers of the digital images obtained during point-of-care colposcopy. The design was good and adequate but there is a need to improve on the details of some aspects of the methods to allow replication by others. 

  1. Urine sample was collected during home visit - the authors should provide specific time of urine collection, as this is important in the diagnosis of schistosomiasis. Was the time of collection the same for all participants?

  2. Up to 50 mL of urine was centrifuged and examined - the authors should provide the specific amount of urine centrifuged, and why 50 mL and not 10 mL as recommended by WHO (2022 1 ), and for ease of comparison with other studies.

  3. Figure 1. Flowsheet of cervicovaginal image review - the authors should explain what "Images inaccessible" means, particularly as it was among those who attended clinic.

  4. The authors use the phrase "water use" and "water contact" interchangeably. I think "water contact" is best for this study.

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

I cannot comment. A qualified statistician is required.

Are all the source data underlying the results available to ensure full reproducibility?

Yes

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

No

Reviewer Expertise:

Parasitologist/Epidemiologist

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.

References

  • 1. : WHO guideline on control and elimination of human schistosomiasis. World Health Organization .2022; Reference source [PubMed]
Wellcome Open Res. 2023 Apr 5.
Amy Sturt 1

The design was good and adequate but there is a need to improve on the details of some of the aspects of the methods to allow replication by others.  

  1. Urine sample was collected during home visit – the authors should provide specific time of urine collection, as this is important in the diagnosis of schistosomiasis. Was the time of collection the same for all participants?

  Thank you for the opportunity to clarify. Since visits were performed in the participant’s home, there was not a restriction on the timing of urine sample self-collection and 69.5% (419/603) were performed between 9:00 and 14:00. This has been added from lines 140-142.  

  1. Up to 50mL of urine was centrifuged and examined – the authors should provide the specific amount of urine centrifuged and why 50mL and not 10mL as recommended by WHO (2022), and for ease of comparison with other studies.

  Thank you for this question. We have noted a typo in the manuscript, up to 60mL of urine was centrifuged. This has been corrected at line 191. The full details of the urine collection are reported in a previous manuscript. We have referenced this manuscript at line 184 and full details can be found at https://journals.plos.org/plosntds/article?id=10.1371/journal.pntd.0008337  

  1. Figure 1. Flowsheet of cervicovaginal image review – the authors should explain what “images inaccessible” means, particularly as it was among those who attended clinic.

  We apologize for any confusion. Due to technical difficulties, Reviewer 1 was not able to open 10 images. In the flowsheet, “images inaccessible” has been changed to “technical difficulty opening images”.  

  1. The authors use the phrase ‘water use’ and ‘water contact’ interchangeably. I think water contact is best for this study.

  Thank you for this input and all instances of “water use” have been changed to “water contact”.

Wellcome Open Res. 2023 Jan 27. doi: 10.21956/wellcomeopenres.20776.r54087

Reviewer response for version 1

Louise Arenholt 1,2

Thank you for the opportunity to review this thorough and well written paper.

The paper aims to evaluate the agreement on the diagnosis of visual FGS lesions between two expert reviewers. Furthermore, the association between visual presentation of FGS and symptoms of FGS/PCR detection of FGS were evaluated.

The topic is highly relevant and only a few studies have evaluated the accuracy of visual diagnosis of FGS – even though the visual inspection is the golden standard as also highlighted by the authors.

Introduction:

No comments.

Methods:

I would recommend more detailed information on the hand-held colposcopy and image review. There is no information on how the image was captured and if they had a standard/protocol on which areas to include in the image portfolio (was it only the cervix or also the vaginal walls?). Did they have any criteria for image quality (for example the lighting or zoom level) when capturing the image? How many images were captured per woman? It seems like the reviewers have excluded images on slightly different basis – did you before the review guide the reviewers on how to include or exclude the images? For example, did the reviewers review the images on a computer and were the computers similar (for example resolution on the screen, color setting, brand etc.)? Did they have the opportunity to zoom the image or change color/brightness/contrast/saturation? My experience is that an image can look completely different if you evaluate it on two different computers. A small thing as day light or artificial light can change the image. Could it be different settings/equipment that account for some of the difference between reviewers?

In the methods section I can’t find information on how women reported the abdominal-genitourinary, and reproductive manifestations (table 4) – questionnaire, interview etc.?

Results:

I am not surprised that you do not find an association between visual-FGS and PCR-FGS. I believe that cevicovaginal lesions are results of chronic inflammation and therefore other tests should be used if an association should be found.

Discussion:

You discuss the study from Madagascar, where gridded image technique was used. And you are correct that all included women in that study were to have FGS lesions. I just need to address two things 1) the presence of rubbery papules was not an inclusion criterion for the study. Therefore, the uncertainty about whether a lesion was present or not was indeed an issue. Some of the reviewers in that study found, that more than 20% of the women had no rubbery papules. 2) the images used for the study were not from the inclusion visit, but from later visits where treatment with Praziquantel had been initiated. For that reason, some of the images could be without FGS lesions.

I agree with you that consensus on how to evaluate the images is very important. I believe that if consensus had been reached before the reviewing procedure, your results would have been very much different.

Is the work clearly and accurately presented and does it cite the current literature?

Yes

If applicable, is the statistical analysis and its interpretation appropriate?

Yes

Are all the source data underlying the results available to ensure full reproducibility?

Partly

Is the study design appropriate and is the work technically sound?

Yes

Are the conclusions drawn adequately supported by the results?

Yes

Are sufficient details of methods and analysis provided to allow replication by others?

Partly

Reviewer Expertise:

Gynecology and obstetrics. Urogynecology.

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

Wellcome Open Res. 2023 Apr 5.
Amy Sturt 1

"I would recommend more detailed information on the hand-held colposcopy and image review. There is no information on how the image was captured and if they had a standard/protocol on which areas to include in the image portfolio (was it only the cervix or also the vaginal walls?). Did they have any criteria for image quality (for example the lighting or zoom level) when capturing the image? How many images were captured per woman?"

Thank you for the opportunity to clarify the colposcopy methods. Images were captured according to a standard protocol and the cervix, anterior fornix, posterior fornix, right lateral cervix, left lateral cervix and adjacent cervical trunk were inspected routinely. The clinician was instructed to take a photograph of each location as well as the cervix. The clinician then examined the anterior and posterior vaginal walls. Per the protocol, the clinician was instructed to capture a minimum of 5 images per participant. All participants were examined in the same clinic room with the overhead lights on. The midwives were allowed to choose to use the in-device lighting or zoom level on the MobileODT colposcope. These details have been added to the manuscript at lines 149-153.  

"It seems like the reviewers have excluded images on slightly different basis – did you before the review guide the reviewers on how to include or exclude the images? For example, did the reviewers review the images on a computer and were the computers similar (for example resolution on the screen, color setting, brand etc.)? Did they have the opportunity to zoom the image or change color/brightness/contrast/saturation? My experience is that an image can look completely different if you evaluate it on two different computers. A small thing as day light or artificial light can change the image. Could it be different settings/equipment that account for some of the difference between reviewers?"

At their discretion, expert reviewers could exclude images that they felt could not be evaluated due to technical issues, image quality, or limited cervical visualization. The reviewers independently evaluated the images on desktop computers. The reviewers were able to zoom and make adjustments to the digital images as needed. We feel it is important to consider that in real-world conditions, if a number of technical factors are required to ensure successful image review, the successful roll-out and scale up of this method of diagnosis will be constrained. To replicate real-world conditions, standardized equipment was not provided to the reviewers by the BILHIV study. However, you are correct that we cannot exclude that differences in settings and equipment may theoretically contribute for some differences between reviewers. We have added this as a limitation in the discussion of the manuscript at lines 392 -395.

"In the methods section I can’t find information on how women reported the abdominal-genitourinary, and reproductive manifestations (table 4) – questionnaire, interview etc.?"    Women reported symptoms in a structured questionnaire with a community health worker, this is described in line 139.  

"Results:

I am not surprised that you do not find an association between visual-FGS and PCR-FGS. I believe that cevicovaginal lesions are results of chronic inflammation and therefore other tests should be used if an association should be found."

We agree that visual-FGS represents a different phenotype than PCR-FGS. Thank you for your input on the results.

"Discussion:

You discuss the study from Madagascar, where gridded image technique was used. And you are correct that all included women in that study were to have FGS lesions. I just need to address two things 1) the presence of rubbery papules was not an inclusion criterion for the study. Therefore, the uncertainty about whether a lesion was present or not was indeed an issue. Some of the reviewers in that study found, that more than 20% of the women had no rubbery papules. 2) the images used for the study were not from the inclusion visit, but from later visits where treatment with Praziquantel had been initiated. For that reason, some of the images could be without FGS lesions."

Thank you for the helpful clarifications regarding your outstanding work from Madagascar. In light of the further information you have provided, we have removed the sentences in question in the discussion. These formerly read “However, it is notable that in the Madagascan study, all images were thought to contain FGS lesions, removing the burden of uncertainty and highlighting that in settings where images are known to contain FGS lesions, agreement between reviewers was at best “moderate”. I agree with you that consensus on how to evaluate the images is very important. I believe that if consensus had been reached before the reviewing procedure, your results would have been very much different. Thank you for this input and we included the absence of an initial consensus review as a limitation in the discussion at line 407.


Articles from Wellcome Open Research are provided here courtesy of The Wellcome Trust

RESOURCES