Skip to main content
PLOS Digital Health logoLink to PLOS Digital Health
. 2023 Oct 16;2(10):e0000358. doi: 10.1371/journal.pdig.0000358

Effects of digital chatbot on gender attitudes and exposure to intimate partner violence among young women in South Africa

Alexandra De Filippo 1, Paloma Bellatin 1, Neville Tietz 2, Eli Grant 3, Alexander Whitefield 4,5, Puseletso Nkopane 2, Camilla Devereux 4, Kaitlyn Crawford 2, Benjamin Vermeulen 2, Abigail M Hatcher 6,7,*
Editor: Laura M König8
PMCID: PMC10578594  PMID: 37844088

Abstract

Background

South Africa has among the highest rates of intimate partner violence (IPV) globally, with young women at heightened risk due to inequitable gender roles, limited relationship skills, and inadequate social support. Despite an urgent need for violence prevention in low- and middle-income settings, most efficacious approaches are time-intensive and costly to deliver. Digital, interactive chatbots may help young women navigate safer relationships and develop healthier gender beliefs and skills

Methods

Young women (18–24 years old) across South Africa were recruited via Facebook for participation in an individually randomised controlled trial (n = 19,643) during the period of June 2021-September 2021. Users were randomly allocated, using a pipeline algorithm, to one of four trial arms: Pure Control (PC) had no user engagement outside of study measures; Attention Treatment (T0) provided didactic information about sexual health through a text-based chatbot; Gamified Treatment (T1) was a behaviourally-informed gamified text-based chatbot; Narrative Treatment (T2) was a behaviourally-informed drama delivered through pre-recorded voice notes. All chatbots were delivered in WhatsApp, through which users were invited to complete brief “quizzes” comprising adapted versions of validated scales. Primary outcomes were short-form adaptations of scales for gender attitudes (Gender Relations Scale) and past-month IPV (WHO Multi-country Study Instrument). Secondary outcomes were identification of unhealthy relationship behaviours (Intimate Partner Violence Attitudes Scale) and brief screener for depressive symptoms (Patient Health Questionnaire). A direct chat link to a trained counsellor was a safety measure (accessed by 4.5% of the sample). We estimated treatment effects using ordinary least squares and heteroskedasticity robust standard errors

Findings

The trial retained 11,630 (59.2%) to the primary endpoint of gender attitudes. Compared to control, all treatments led to moderate and significant changes in attitudes towards greater gender equity (Cohen’s D = 0.10, 0.29, 0.20 for T0, T1, and T2, respectively). The gamified chatbot (T1) had modest but significant effects on IPV: 56% of young women reported past-month IPV, compared to 62% among those without treatment (marginal effects = -0.07, 95%CI = -0.09to-0.05). The narrative treatment (T2) had no effect on IPV exposure. T1 increased identification of unhealthy relationship behaviours at a moderate and significant level (Cohen’s D = 0.25). Neither T1 nor T2 had a measurable effect on depressive symptoms as measured by the brief screener. Interpretation: A behaviourally-informed, gamified chatbot increased gender equitable attitudes and was protective for IPV exposure among young women in South Africa. These effects, while modest in magnitude, could represent a meaningful impact given potential to scale the low-cost intervention.

Author summary

Intimate partner violence (IPV) affects one in three women worldwide and harms health and well-being. Yet, there are few low-cost ways proven to prevent IPV. We tested if digital, interactive chatbots could help young women across South Africa stay safer by changing their attitudes and skills. We assigned similar young women to one of two chatbots. The first was gamified (with emojis and small tasks through Whatsapp chat). The second was narrative (with a story pre-recorded using Whatsapp voice notes). Their responses were compared to young women randomly assigned to a control group that received information-only or no chatbot at all. Both gamified and narrative chatbots improved young women’s attitudes and increased skills in identifying unhealthy relationship. The gamified chatbot only reduced IPV after three months. This is one of the first digital approaches to decrease IPV exposure.

Introduction

Exposure to intimate partner violence [IPV] among South African women is higher than most settings globally. One in five women report lifetime IPV exposure [1], and rates of femicide are six times the global average [2]. Younger women report higher rates of IPV compared to older peers [3], and in one study 53% of young women aged 14–19 years reported physical and/or sexual IPV [4]. Subtler forms of unhealthy relationship behaviours are also prominent for young women, including age-disparate relationships, early unintended pregnancies, and relationships characterised by controlling partners [1,5,6].

Extant cross-sectional and qualitative literature points to factors associated with IPV exposure among young women. Inequitable gender attitudes [7,8], inadequate social support [9], and limited knowledge of harmful relationship behaviours [10], are associated with higher IPV exposure. Building skills around autonomy and agency in relationships is tied to lower IPV exposure among young women [11,12].

Experimental data from sub-Saharan Africa include SASA! in Uganda in which critical discussions around gender reduced IPV exposure among women [13], and IMAGE South Africa and Tanzania, where group-based gender workshops combined with microfinance reduced adult women’s IPV [14,15]. Yet, of 36 African trials reviewed in 2022, only one-third reduced IPV among women [16]. Similarly, few rigorously tested approaches have been shown to reduce men’s perpetration of IPV in African trials. Group-based gender training with younger men may decrease reported IPV perpetration [17], particularly if combined with economic livelihoods [18]. However, other trials have had null findings for men’s IPV perpetration [13,19,20].

Mobile devices offer a promising channel to deliver targeted edutainment at scale through mHealth, a strategy shown to effectively encourage health-seeking behaviours related to diabetes, weight loss, physical activity, smoking cessation, and medication adherence [21]. Chatbots are a type of mHealth strategy that builds interactivity into digital interventions. By enabling two-way communications, behaviourally-informed chatbots offer a dynamic format to deliver strategies known to encourage behavioural change more effectively. However, few digital interventions for violence prevention have been tested to date in African settings [16].

Edutainment is another promising approach to reduce young women’s violence exposure, though this has been understudied to date. Through relatable characters or influential messengers, edutainment can convey educational messages, relevant information [such as where to seek help], or promote healthful social norms [22].

The evidence for whether edutainment can reduce IPV exposure or improve gender attitudes is promising, but inconclusive. Soul City, a large-scale edutainment campaign in South Africa, led to knowledge about IPV, improved self-efficacy, and increased helpline calls [23]. The MyPlan app in Kenya improved women’s safety strategies but had null effects on violence exposure [24]. In Uganda, a media campaign in 112 villages featuring video vignettes during community film festivals significantly reduced IPV but had null effects on attitudes towards IPV [25]. In Egypt, short videos delivered via Whatsapp or Facebook detected no improvements in violence exposure or gender beliefs but did increase women’s knowledge and intention to access resources in future cases of IPV [26].

We conducted a trial testing whether two versions of a behaviourally-informed chatbot could improve gender attitudes in order to reduce IPV exposure among young women in South Africa.

Materials & methods

We conducted a digital, individually randomised controlled trial with 4 arms, consisting of:

  • Pure Control: quizzes measuring outcomes only

  • Attention Treatment: interactive chatbot featuring information about sexual health

  • T1: gamified interactive chatbot delivering behaviourally-informed content

  • T2: narrative-based interactive chatbot delivering behaviourally-informed content

The target audience for ChattyCuz was young women aged 18 to 24-years-old. Our primary hypotheses are as follows: If chatbots can be effective at improving attitudes and relationship safety among young South African women, compared to pure control T1 and T2 will demonstrate:

  1. higher mean score on gender attitudes scale, and;

  2. lower women’s past-month self-reported IPV exposure

We secondarily hypothesised that identification of unhealthy relationship behaviours would be higher and depressive symptoms would be lower for T1 and T2 compared to control.

Inclusion criteria

We targeted a sample of South African young women who responded to targeted advertising on Facebook, inviting them to engage with a WhatsApp chatbot. Advertising prompts were sent only to individuals who self-identify on their Facebook profiles as female between 18 and 24 years; and were living in South Africa (though they may be national or non-national residents).

Treatment arms

Pure control arm exposed eligible participants only to specific evaluation quizzes. The quiz pertaining to gender attitudes was delivered directly after eligibility screening. Follow-up quizzes three months later evaluated IPV exposure and the brief depressive symptoms screener.

Attention treatment arm (T0) was modelled on an existing chatbot, Big Sis. It featured the ChattyCuz brand and persona, but did not include any of the behaviourally-informed content modules. Instead, the Attention Treatment chatbot addressed the basics of sex, reproduction and puberty, and allowed users to browse topics by thematic areas, such as “love and relationships” (e.g. “Do relationships = sex?”) “sex basics” (e.g. “What is virginity?”, “Why might sex hurt?”) and “protecting yourself” (e.g. “How to use a condom”). The content comprised overlapping concepts and thematic areas, which enabled the delivery of the same evaluation quizzes as the treatment arms.

ChattyCuz-Gamified (T1) is a WhatsApp chatbot that aims to support young women in navigating intimate relationships. ChattyCuz is designed as a safe, private space. It offers simple, symbolic rewards to guide and motivate users throughout their interactions. T1 follows an interactive script that adapts depending on responses provided by the user. Regardless of whether the user answers a question correctly, ChattyCuz gives feedback on the response. T1 was designed to appear as a unique personality based on the characteristics of an older cousin (approachable, relatable, and supportive of the user). She takes users through a series of different topics, offering a critical reflection on power in relationships; skills practice to identify unhealthy relationship behaviours; learn healthy communication and coping mechanisms; affirming values to build self-efficacy; and planning for safety. Users are encouraged to collect six Squad Members to become a Relationship Superstar (Table 1).

Table 1. Module content for gamified chatbot and narrative chatbot.

Module Challenge Module aim
I have… I can…
Module 1 Limited awareness of unhealthy relationship behaviors Identify unhealthy relationship behaviors
Module 2 Limited social support Identify a trusted person to speak to about an unhealthy relationship
Module 3 Low autonomy and self-efficacy Improve self-efficacy by engaging in self-affirmation
Module 4 Limited understanding of healthy communication skills in my intimate relationships Practise healthy communication skills for relationships
Module 5 An action-intention gap preventing safety seeking Make a safety plan
Module 6 Limited psychological support during difficult experiences Identify practical coping mechanisms for difficult experiences
All Modules Limited awareness of power imbalance and control in relationships Reflect critically on power imbalance and control in relationships
Gamified Chatbot rationale Gamification–enhancing a service or activity with a motivational reward to provoke a specific knowledge-based, behavioral, or attitudinal outcome [39]—has promise for participant engagement and propensity to learn from their experiences. There is some evidence to suggest that timely, structured feedback can improve one’s ability to successfully complete logical tasks, such as mathematics tests [40]. Game designs have been successfully applied to digital sexual and reproductive health interventions, such as interactive computer programs aimed to enhance adolescents’ sense of self-efficacy, and digital games designed to improve adolescents’ knowledge [41].
Narrative Chatbot rationale Narrative-based edutainment can also have positive effects on sex and relationship behaviors. A study of the phased rollout of a soap opera promoting family planning in Brazil points to the show’s effect on people’s choices: areas where the show was broadcast recorded lower fertility rates, and changes in naming conventions.[42] In Nigeria, exposure to MTV Shuga –a drama featuring educational storylines about HIV/AIDS–has been shown to improve viewers’ knowledge and attitudes towards HIV and reduce women’s prevalence of sexually transmitted infections [43]. In a quasi-experimental study in Brazil, the multimedia narrative Program H have been shown to improve knowledge, attitudes, practices related to sexual health, and reduce gender-based violence [44].

ChattyCuz-Narrative (T2) uses storytelling to connect with audiences and delivers the themes through WhatsApp voice notes (short digital audio clips) and short sections of text. The narrative follows the life of Sandiswa, a young woman engaged in an embattled relationship with a man named Jabu. Each voice clip opens with a short, ambiance-rich scene (two characters arguing; a character at a family party with her sister-in-law; a character playing a song). The scene then fades and Sandiswa speaks directly to the user to summarise what happened, how she feels and the dilemma she faces. By ‘helping’ Sandiswa to make life choices (who she should trust, how to communicate with Jabu, and how to plan for safety) ChattyCuz-Narrative’s design aims to improve users’ knowledge and relationship skills in line with module aims. The aim of this format is to make the user feel as if she is a trusted person that the narrator wants to speak to during her daily life. To maximise accessibility of the narrative, the user is presented with the choice of hearing the story through voice clips, or reading it as text.

Additional information about each chatbot can be found in the Online Supplemental Appendix.

Measures

The outcome variables for this trial are adapted versions of existing validated scales translated into quizzes (mimicking those in teen magazines) through an iterative, user-centred process. We conducted cognitive interviews to test quiz questions with end users, gathering their feedback on question phrasing and level of comprehension. In October 2020, we validated multiple, short-form adapted surveys (n = 1,088 young women), and invited a sub-sample to retest in December 2020 (n = 231).

Gender attitudes were measured 4 days after allocation using an adapted Gender Relations Scale (aGRS) [27]. The aGRS is comprised of 6 items scored on Likert-type responses and had an acceptable internal consistency in the pre-trial validation with n=1,088 participants (Cronbach’s ⍺ = 0.75). An example item is “It’s natural for a guy to have girls on the side while he’s dating you.” We analysed aGRS as a continuous outcome with higher scores denoting more gender equitable beliefs.

IPV exposure was assessed 3 months after allocation using an adapted WHO Multi-country Study Instrument (aWHO) [28]. comprising two physical and one sexual violence measure shown to be sensitive and specific in the South African setting. An example item is “In the past month, did your bae ever slap, hit you or throw something at you?”. In addition, we added two technological violence items. An example is “If you sent your bae a sexy photo of yourself, are you afraid they might share it?” In the validation assessment (n = 67), internal consistency of the aWHO was acceptable (Cronbach’s ⍺ = 0.75).

Identification of unhealthy relationship behaviors was measured 3 days after allocation using an adapted Intimate Partner Violence Attitudes Scale (aVAS) [29]. This scale comprises abuse and control subscales to measure psychological violence identification. An example item is “My bae never hurts me but I understand why sometimes he threatens to do it so that I get in line.” Validation with 74 participants found that internal consistency was high (Cronbach’s ⍺ = 0.82).

Depressive symptoms were assessed using a brief self-reported symptom screener: the two-item Patient Health Questionnaire (PHQ-2). A score of 3 or greater is suggestive of probable depression.

Socio-demographic variables were ascertained in single item form at baseline. Age was self-completed. Gender was self-reported as either female, male, or non-binary. Sexual orientation was ascertained through a single item asking users whether they were attracted to men or women, “it is complicated” or “prefer not to say”. Relationship status was assessed through a single item asking whether users were seeing “someone special”. The state of participants’ mental distress was assessed through one item asking if, in the past month, they had been mostly “energised and hopeful”, “just getting along”, or “down and hopeless”. Food security level was assessed by an item asking if users had been hungry in the last two months because “there was not enough food in their house”. Household composition was measured using a single item asking how many people lived in the household with the user. Participants were also asked whether they had any children. Province was self-reported.

Procedures

Facebook advertisements targeted mobile devices (to be viewed on mobile news feeds only) and seamlessly directed interested participants to WhatsApp. We conducted pipeline randomisation using a custom-built randomisation algorithm implemented as an R application and run as a separate web service. After consenting to participate in the study, users were randomised into either Pure Control (20%), Attention Control (20%), ChattyCuz-Gamified (T1), or ChattyCuz-Narrative (T2).

The trial was set up on RapidPro (open source jointly owned by UNICEF and Nyaruka), an open-sourced user management system that records user states and manages inbound and outbound messages between users and the chatbot [30]. The trial was set up with content stored in the form of RapidPro “flows”, a series of steps with built-in logic to direct users through the experience based on their interactions with the chatbot.

When users were allocated into the different trial arms using the randomisation algorithm, users’ records were stored in a PostgreSQL database, along with user data and message interactions exchanged between users and ChattyCuz. To support chatbot users, careful monitoring of the database was required as it grew throughout the trial (possibly due to the design of the program, which involved a high ratio of outbound to inbound messages).

Users participating in the trial were remunerated with ZAR 30 (approximately $2.00) on completion of all components of the respective trial arms, and ZAR 10 (approximately $0.70) on completion of a follow-up quiz three months later. Remuneration was carried out by connecting RapidPro to a DT One account with pre-loaded airtime. This integration with DT One is built into RapidPro and requires only the DT One account and API key to be specified to connect the account. The airtime remunerations were defined based on local research standards and triggered automatically when users reached the end of specified RapidPro flows within the different trial arms.

Data privacy and protection

Personal data collected in the trial was covered by Praekelt.org’s data protection policy. Data handling, storage and transfers comply with national and international guidelines, standards and legal frameworks, notably the General Data Protection Regulation of the European Union. ChattyCuz complies with the record management in the Protection of Personal Information Act 4 of 2013.

User data recorded during the trial included age, gender and telephone number, and responses to quizzes, polls or surveys, or questions asked by the user. This data was stored in a secure PostgreSQL database. Users were exposed to Terms and Conditions as part of the consent procedure (S1 Table). The Terms and Conditions explained, in simple terms, that should they choose to participate, their data would be collected and used for statistical or research purposes. This data was visible to both Praekelt.org and BIT throughout the trial. All personally identifiable information was removed from the data (i.e. the data was anonymised) before we transferred it from Praekelt.org to BIT for analysis.

Consent procedures

During the onboarding process, users received information about the intervention and the research study. After users clicked on a Facebook advertisement on their device, they were directed to WhatsApp. On WhatsApp, users were exposed to a brief description of the trial with all the key information required for consent. The user then received a PDF explaining the nature of the trial in plain terms. The user was asked if they wanted to participate in the trial; if they agreed, they were allocated to a trial arm at random using a built-in algorithm. The user was automatically remunerated with preloaded airtime after the completion of the primary outcome quiz (S1 Fig). Users who could not receive automated airtime remunerations were sent money vouchers, redeemable as cash or airtime at any bank ATM, manually via SMS.

Analysis

For primary analysis 1, we estimated the following model:

γι=α+β1T1ι+β2T2ι+β3BAUι+Xιβ4+μι

Where:

  • γi is Primary Outcome 1; a score of attitudes and beliefs on power equity in relationships;

  • T1 is a dummy variable = 1 if individual i was allocated to T1 (Gamified bot);

  • T2 is a dummy variable = 1 if individual i was allocated to T2 (Narrative bot);

  • BAU is a dummy variable = 1 if individual i was allocated to Attention control;

  • Xi is vector of control variables, which includes:
    • ○ A measure of Baseline Attitudes and Beliefs on Power equity in the relationships (taken at the beginning of the chatbot experience).
    • ○ Region;
    • ○ Gender;
    • ○ Age;
    • ○ Mental health score;
    • ○ Socioeconomic status—food security;
    • ○ Household composition;
    • ○ Children;
    • ○ Partnership status;
    • ○ Province.
  • ui is the error term.

The coefficients of interest are β1, which measures the impact of being allocated to T1 compared to Control; β2, which measures the impact of being allocated to T2 compared to Control; and β3, which measures the impact of being allocated to Attention Control compared to Control, on the outcome measure of interest.

We estimated the model using ordinary least squares and used heteroskedasticity robust standard errors. We modelled the primary outcome with a series of three regressions: (1) an unadjusted treatment-only model, (2) a model adjusted for covariates gender and region of recruitment, (3) a model adjusted for a full set of collected covariates. The models will be compared using a goodness-of-fit statistic, AIC. If there are qualitative differences between the results of these models, we will focus our reporting on the results of the model with the lowest AIC.

We adopted a strategy of using multiple comparisons adjustments for the primary analysis of primary and secondary outcomes, employing the Benjamini-Hochberg step-up procedure. In brief, this uses the traditional p<0.05 threshold as a marker of statistical significance, but reduces it each time an additional comparison is made. The primary analysis adjusts for 2 comparisons: T1 vs. Pure Control; T2 vs. Pure Control. The primary analysis for secondary outcomes adjusted for 2 comparisons: T1 vs Attention, and T2 vs Attention.

Ethical considerations

Our trial received ethical approval by Pharma-Ethics (Protocol number 2019063; 21 September 2020). We pre-registered our trial on the Open Science Framework (16 March 2021).

Given the sensitive nature of this study, the safety for the study population informed the design of the trial and the platforms through which it was delivered. Participants in the trial were 18 or older, the age when young persons are able to consent on their own to research in South Africa. For participants requiring additional support, access to text-based individual counselling services was provided by an external counselling service provider, MobieG. This non-governmental organisation already offered text-based counselling, and our trial provided additional training specific to responding to IPV.

We established a platform for the trained MobieG counsellors to use WhatsApp for individual counselling. Reply messages from counsellors were channelled through RapidPro without interrupting the active conversations between users and counsellors. A benefit of this approach is that it allowed us to keep the interactions with the counsellors within WhatsApp, rather than directing users to an external platform that would have disrupted the user experience. A total of n = 631 users took up the counselling (4·5% of the sample who were followed to midline). The quality of the counselling experience and moderation of trial outcomes will be the subject of a separate paper.

Results

We recruited 19,643 women in 31 days. As expected, 20.0% were allocated to Pure Control, 20.0% to Attention Control, 30.0% to ChattyCuz-Gamified and 30.0% ChattyCuz-Narrative (Fig 1).

Fig 1. Trial flow diagram.

Fig 1

As Table 2 indicates, baseline characteristics are very similar for all these conditions. Across all arms, the average age was 21.2 and 70.7% were in the relationship. Overall, 31.7% of participants reported to have symptoms consistent with probable depression, with similar rates across arms.

Table 2. Participant socio-demographics by trial arm.

Baseline Midline (4 days after allocation) Endline (3 months after allocation)
Pure Control Attention T0 Gamified T1 Narrative T2 Control T0 T1 T2 Control T0 T1 T2
Proportion followed - - - - 98.2% 24.0% 60.6% 56.5% 38.1% 39.4% 32.0% 31.4%
Participants 3,930 3,929 5,891 5,893 3,867 928 3,517 3,318 1,474 366 1127 1043
Average age 21.2 21.2 21.2 21.2 21.2 21.2 21.1 21.1 21.4 21.3 21.2 21.3
In current relationship 70.7% 70.7% 70.7% 70.7% 70.9% 73.2% 70.0% 70.5% 75.1% 80.1% 75.9% 77.1%
Mental distress at baseline 31.7% 31.8% 31.7% 31.7% 31.7% 28.8% 30.5% 30.8% 30.9% 29.0% 29.3% 30.5%
Gender attitudes at baseline 1* - - - - 0.00 0.00 0.04 0.07 -0.20 -0.10 -0.10 -0.10
Gender attitudes at baseline 2 * - - - - 0.00 0.08 0.05 0.08 0.00 -0.10 0.00 0.00
Food insecure - - - - - - - - 54.2% 55.2% 51.2% 52.3%
Members in household - - - - - - - - 3.9 4.0 3.9 4.0
Has 1+ child - - - - - - - - 41.8% 38.0% 43.7% 39.4%

* Gender attitudes at baseline normalized by subtracting the Pure Control mean from the mean, and dividing by the control group standard deviation; higher scores denote stronger beliefs in power equity

Bold text denotes balance checks indicate greater than 10% variation in socio-demographic

Of the 19,643 participants we recruited, 11,630 (59.2%) completed the Adapted GRS quiz, our first primary outcome. We refer to these participants as midline participants. As expected, almost all (98%) of participants in the Pure Control completed the midline quiz as this was asked on the same day as recruitment in this trial arm. There were differences between trial arms regarding retention of participants, with T0 having notably high attrition. The baseline covariates are similar across all trial arms. However, in terms of statistical significance, the treatment groups are slightly different. Descriptive statistics for all groups and highlights statistically significant imbalances, where groups differ on attributes at the 10% level confidence level, in red (Table 2). These differences are due to different rates of attrition and perhaps different types of attrition across the treatment groups.

Primary outcome: Equitable attitudes

We found that all three chatbots improved beliefs in power equity in relationships compared to receiving no intervention (“pure control”). The effect sizes for the Attention Control, Narrative and Gamified bot were 0.10, 0.20 and 0.29 (Cohen’s D, 2.s.f.) respectively. While these effect sizes are modest, they are substantial in terms of real-world significance given the potential scale at which chatbots can be delivered.

Effect sizes for the experimental chatbots (Narrative and Gamified) were found to be relatively stable after adding important controls to the models (gender attitudes, age, mental health, and partnership status at baseline). Effect sizes for the Attention Control, however, decreased as more controls were included, confirming the suspicion that attrition and differences in observed variables could account for the treatment effects observed in this trial arm. As indicated in Table 3, we find the effect of all chatbots are significant at 5% level across specifications, after correcting for multiple comparisons.

Table 3. Primary trial outcomes, by randomization arm: Gender beliefs and partner violence.

Gender attitudes Past-month IPV
Unadjusted Adjusted^ Adjusted^ Unadjusted Adjusted^
OLS OLS CEM OLS OLS
Attention Control (T0) 0.70** 0.50** 0.43* - -
-0.2 -0.2 -0.2
ChattCuz
Gamified (T1)
1.66** 1.49** 1.50** -0.06** -0.07**
-0.1 -0.1 -0.1 -0.02 -0.02
ChattCuz
Narrative (T2)
1.08** 0.99** 0.96** -0.02 -0.02
-0.1 -0.1 -0.1 -0.02 -0.02
Observations N = 11,630 N = 11,519 N = 11,447 N = 4139 N = 3955

+ p < 0.10

* p < 0.05

** p < 0.01

^ additional controls are: baseline gender attitudes, age, mental health at baseline, partnership status, food security, household composition, number of children

Fig 2 presents the outcome for regression 3, the prespecified model.

Fig 2. Gender relations scale (T1, T2, and Attention Control compared to Control).

Fig 2

Primary outcome: IPV Exposure

We found small but significant effects for T1 on overall past-month IPV exposure at three-month follow-up (Table 3). The 1043 women who completed the follow-up quiz from the ChattyCuz-Gamified arm reported lower rates of IPV exposure (55%) than young women in the pure control group (62%, Fig 3).

Fig 3. Past-month IPV exposure (T1 and T2 compared to Control).

Fig 3

T2 demonstrated no effects on IPV exposure. Estimates for Attention Control were not robust to additional controls, suggesting differential attrition based on observed variables account fully for treatment effects observed in this arm. We therefore omit T0 in the presentation of data.

In supplemental analysis we examine IPV by type and see that all types of violence are reduced, with psychological and technological forms of IPV significantly reducing this exposure by 8% (S2 Table). Physical and/or sexual IPV was reduced by 2% and this point estimate did not reach statistical significance (S3 Table).

Secondary outcomes

Identification of unhealthy relationship behaviours

We found that both T1 and T2 increased users’ ability to correctly identify unhealthy relationship behaviours compared to the Attention Control. On a scale of 6–18 the score of the Attention Control group was 14.80 while the score of the T1 and T2 chatbots were 15.51 and 15.22 respectively (Fig 4).

Fig 4. Identification of unhealthy relationship behaviors (T1 and T2 compared to Attention Control).

Fig 4

Mental health

T1 was slightly protective for depressive symptoms, but this was not statistically significant (S2 Fig).

The proportion of users who expressed interest in receiving counselling was much higher in the control conditions (above 17%), compared to 8.4% of users in the ChattyCuz-Narrative group and 6.9% in the ChattyCuz-Gamified group. Similarly, users in the control conditions accessed counselling at more than twice the rate of the treatment arms.

Discussion

In an experimental trial among young women in South Africa, a gamified digital chatbot improved equitable gender beliefs and made a small, but significant, reduction in past-month IPV exposure. This gamified chatbot is among the first digital interventions to effectively reduce IPV exposure among young women. The chatbot significantly improved skills around identifying unhealthy relationships, but did not alter depressive symptoms compared to the control. These primary results are robust across specifications and robustness checks, including comparison to an attention control condition.

Ours is the first study, to our knowledge, to compare two digital communication approaches in relation to actual IPV exposure that offers depth to the growing edutainment field. We hypothesise that the observed reduction in IPV is due to the gamified chatbot offering tailored, immediate feedback on gender beliefs, identifying unhealthy relationship behaviours, and implementing skills to stay safe–all shown to be precursors for staying safe in relationships [13,31,32]. The MyPlan app in Kenya had promising results around safety skills and intentions, but no effect on violence exposure [24].

While the narrative chatbot did improve gender equitable attitudes, it had no measurable effect on IPV exposure. Two hypotheses may frame these null findings. Firstly, the narrative chatbot was less directive, allowing users to reach their own conclusions from the experience of a fictional character. In India, “explicit” calls to action were more impactful than “implicit” narratives in terms of changing beliefs about IPV [33]. Similarly, the effective intervention using mass media in Uganda reduced IPV not by shifting attitudes [as these remained largely unchanged] but by increasing women’s willingness to report violence [25]. Secondly, since many young South Africans already have a high saturation of narrative programming through Soul City [23], which may offer a ceiling effect in this particular setting.

Overall, both behaviourally-informed chatbots were significantly better at encouraging user retention and engagement, compared to information-focused chatbots. Technological solutions may improve youth’s access to care and could complement existing in-person services if young people feel more engaged or less threatened with this medium of communication [34]. Only 24% of Attention Treatment participants were retained (compared to 56% and 60% for T1 and T0, respectively). This could indicate that the behaviourally-informed chatbots are more engaging than typical information-only digital approaches. On the other hand, the high attrition limits our ability to assess impact on IPV exposure for the Attention Treatment arm.

The study was not powered to detect distinctions by IPV type, but this sub-analysis is important for the field. The gamified chatbot most effectively decreased reported exposure to technological and psychological safety. This has relevance for young women’s wellbeing, since technological and psychological forms of IPV may be more common, and have demonstrated preliminary causal associations with mental and physical health [35,36]. Physical and/or sexual IPV trended towards decreased exposure (p=0.02), suggesting this may require more targeted or in-person approaches.

Prevalence of depressive symptoms among this relatively large sample of young women was high, with 32% reporting symptoms consistent with probable depression on a brief screener. This rate is similar to studies nationally, which find up to 41% of young women exhibit elevated depressive symptoms [37]. Neither experimental chatbot was effective at reducing depressive symptoms, nor were they associated with increased distress. This offers an important confirmatory finding for the IPV field, since one ethical consideration for working with younger populations or intervening through low-cost digital interventions is the risk that disclosure of IPV could cause additional distress or even psychological harm. Our findings refute this claim and suggest that—with appropriate links to online counselling services—technologies like chatbots may neither increase depressive symptoms, nor be protective.

Implications for research and practice

The WhatsApp application is a popular internet protocol messaging platform, providing an opportunity to deliver chatbots at greater scale as the market expands. There are, however, inherent risks of digital IPV interventions, particularly if phones are not private or if male partners use technological violence or other controlling behaviours. Our opt-in, live individual counselling delivered via WhatsApp was an important safety consideration of the trial. This safety enhancement was able to draw upon a South African organisation with expertise in chat-based counselling for young people–a partnership that may be harder to identify in other LMIC settings. Providing this counselling service at-scale could be cost-effective (if it serves to identify and refer roughly 4.5% of those at highest need for individualised care), but requires a more integrated approach to de-identifying user data.

Several potential avenues for future research exist. Primary prevention with younger adolescent women, male peers, and adolescent pregnant women seems most promising given the high exposure to violence victimisation and perpetration among these groups. ChattyCuz was designed to reach girls at a young age, though South African ethical constraints mandate that research working directly with participants occur over the age of 18. Future studies should adapt and pilot the intervention for adolescents aged 17 or younger, finding creative methods of working with ethics boards to waive the need for parental informed consent [which negate the chatbot potential to reach young persons directly and may itself lead to safety concerns]. It would be important to evaluate whether this intervention could have a similar, or even more significant, effect among younger girls if we reach them prior to sexual debut or while forming initial intimate relationships.

Limitations

Findings should be viewed in light of study limitations. We recruited young women who were already on Facebook with access to a mobile phone with internet connection. While Facebook is one of the most widely used platforms, we are unable to generalise findings to young women with no private phone access or who do not use WhatsApp or Facebook. This is an important sampling limitation, since those young people who are highly vulnerable to violence exposure may have limited access to digital technologies. The heteronormative framing of the intervention is imprecise and deserves increased attention in South Africa, where sexual and gender minorities are at higher risk of IPV exposure than heterosexual counterparts [38].

Several design characteristics of digital intervention research are worth noting as potential limitations. Low literacy may have inhibited the ability of some participants to take part in the intervention or quizzes. We chose not to measure education as a socio-demographic covariate, but future studies should include this. It is possible that participants completing the quizzes were different from those who filled out baseline surveys, but we chose not to ask for South African identity numbers or private details because of confidentiality risks. Attrition can only be partially accounted for in our analytical approach. While retention was better than for most digital interventions, it is plausible that those in the endline sample are systematically different across arms. Indeed, we find that in the endline sample, there are statistically significant differences in observable characteristics between subgroups. However, because our models are able to control for these measurable differences, differential attrition due to observed characteristics should not affect the overall results. However, we are unable to control for unobserved characteristics and our main analysis requires a missing at random assumption. We conduct additional “robustness checks” in response to concerns that data is missing not at random, including a sensitivity analysis.

Conclusion

Digital solutions represent an important approach to scale efforts to reduce IPV among young women. A behaviourally-informed chatbot improved gender equitable attitudes and reduce exposure to IPV in the short-term. When coupled with tailored support for a small number who would benefit from live online counsellors, behaviourally-informed chatbots have potential to reach a large subset of population at low-cost. Chatbots may offer a promising strategy to support healthy norms and safe relationships.

Supporting information

S1 Fig. Consent flow.

(DOCX)

S2 Fig. Mental health by trial arm, n = 5,022.

(DOCX)

S1 Table. Intervention modules.

(DOCX)

S2 Table. Gender equitable beliefs regression.

(DOCX)

S3 Table. Any IPV exposure regression.

(DOCX)

S4 Table. Technological and/or psychological IPV.

(DOCX)

S5 Table. Physical and/or sexual IPV.

(DOCX)

Acknowledgments

We are appreciative in the many young women who took part in user testing, cognitive interviews, the pilot, formative co-creation workshops, and the Community Advisory Board. We thank Nkeletseng Tsesane for supporting the Community Advisory Board recruitment and Lele van Eck for facilitating their group discussions. We thank the MobieG team for direct counselling support for participants who required additional help. Girl Effect was monumental to the design of the gamified chatbot and the attention condition was composed of their Big Sis chatbot. The research team at BIT supported design and analysis.

Data Availability

Data is fully available without restriction at our trial registry: https://osf.io/jcu4n.

Funding Statement

This work was supported by Wellspring Philanthropic Fund to BV and National Institute of Mental Health (K01MH121185 to AMH). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.SA S. South Africa Demographic and Health Survey 2016: Key Indicator Report. Pretoria: Stats SA, 2018. [Google Scholar]
  • 2.Abrahams N, Mathews S, et al. Intimate partner femicide in South Africa in 1999 and 2009. PLoS Med. 2013;10(4):e1001412. doi: 10.1371/journal.pmed.1001412 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Groves AK, Gebrekristos LT, et al. Describing Relationship Characteristics and Postpartum HIV Risk Among Adolescent, Young Adult, and Adult Women in South Africa. Journal of Adolescent Health. 2020. doi: 10.1016/j.jadohealth.2019.12.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Shamu S, Gevers A, et al. Prevalence and risk factors for intimate partner violence among Grade 8 learners in urban South Africa: baseline analysis from the Skhokho Supporting Success cluster randomised controlled trial. Int Health. 2016;8(1):18–26. doi: 10.1093/inthealth/ihv068 [DOI] [PubMed] [Google Scholar]
  • 5.Closson K, Hatcher A, et al. Gender role conflict and sexual health and relationship practices amongst young men living in urban informal settlements in South Africa. Culture, Health & Sexuality. 2020;22(1):31–47. doi: 10.1080/13691058.2019.1568578 [DOI] [PubMed] [Google Scholar]
  • 6.Hatcher AM, Gibbs A, et al. Gendered syndemic of intimate partner violence, alcohol misuse, and HIV risk among peri-urban, heterosexual men in South Africa. Soc Sci Med. 2022;295:112637. doi: 10.1016/j.socscimed.2019.112637 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Mpondo F, Ruiter RAC, et al. Intimate Partner Violence and Its Association With Self-Determination Needs and Gender-Power Constructs Among Rural South African Women. J Interpers Violence. 2019;34(14):2975–95. doi: 10.1177/0886260516664316 [DOI] [PubMed] [Google Scholar]
  • 8.Stockl H, March L, et al. Intimate partner violence among adolescents and young women: prevalence and associated factors in nine countries: a cross-sectional study. BMC Public Health. 2014;14:751. doi: 10.1186/1471-2458-14-751 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Russell M, Cupp PK, et al. Intimate partner violence among adolescents in Cape Town, South Africa. Prev Sci. 2014;15(3):283–95. doi: 10.1007/s11121-013-0405-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Makongoza M, Nduna M. Awareness and rejection accounts of intimate partner violence by young women in Soweto, Johannesburg, South Africa. Journal of interpersonal violence. 2021;36(1–2):7–32. doi: 10.1177/0886260517726413 [DOI] [PubMed] [Google Scholar]
  • 11.Zembe YZ, Townsend L, et al. Intimate Partner Violence, Relationship Power Inequity and the Role of Sexual and Social Risk Factors in the Production of Violence among Young Women Who Have Multiple Sexual Partners in a Peri-Urban Setting in South Africa. PLoS One. 2015;10(11):e0139430. doi: 10.1371/journal.pone.0139430 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Willan S, Ntini N, et al. Exploring young women’s constructions of love and strategies to navigate violent relationships in South African informal settlements. Cult Health Sex. 2019;21(11):1225–39. doi: 10.1080/13691058.2018.1554189 [DOI] [PubMed] [Google Scholar]
  • 13.Abramsky T, Devries K, et al. Findings from the SASA! Study: a cluster randomized controlled trial to assess the impact of a community mobilization intervention to prevent violence against women and reduce HIV risk in Kampala, Uganda. BMC medicine. 2014;12(1):122. doi: 10.1186/s12916-014-0122-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kapiga S, Harvey S, et al. A social empowerment intervention to prevent intimate partner violence against women in a microfinance scheme in Tanzania: findings from the MAISHA cluster randomised controlled trial. Lancet Glob Health. 2019;7(10):e1423–e34. doi: 10.1016/S2214-109X(19)30316-X [DOI] [PubMed] [Google Scholar]
  • 15.Pronyk PM, Hargreaves JR, et al. Effect of a structural intervention for the prevention of intimate-partner violence and HIV in rural South Africa: a cluster randomised trial. Lancet. 2006;368(9551):1973–83. doi: 10.1016/S0140-6736(06)69744-4 [DOI] [PubMed] [Google Scholar]
  • 16.Keith T, Hyslop F, et al. A Systematic Review of Interventions to Reduce Gender-Based Violence Among Women and Girls in Sub-Saharan Africa. Trauma Violence Abuse. 2022:15248380211068136. doi: 10.1177/15248380211068136 [DOI] [PubMed] [Google Scholar]
  • 17.Jewkes R, Nduna M, et al. Impact of stepping stones on incidence of HIV and HSV-2 and sexual behaviour in rural South Africa: cluster randomised controlled trial. BMJ. 2008;337:a506. doi: 10.1136/bmj.a506 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Gibbs A, Washington L, et al. Stepping Stones and Creating Futures Intervention to Prevent Intimate Partner Violence Among Young People: Cluster Randomized Controlled Trial. J Adolesc Health. 2019. doi: 10.1016/j.jadohealth.2019.10.004 [DOI] [PubMed] [Google Scholar]
  • 19.Hossain M, Zimmerman C, et al. Working with men to prevent intimate partner violence in a conflict-affected setting: a pilot cluster randomized controlled trial in rural Cote d’Ivoire. BMC public health. 2014;14:339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Pettifor A, Lippman SA, et al. Community mobilization to modify harmful gender norms and reduce HIV risk: results from a community cluster randomized trial in South Africa. J Int AIDS Soc. 2018;21(7):e25134. doi: 10.1002/jia2.25134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hall AK, Cole-Lewis H, et al. Mobile text messaging for health: a systematic review of reviews. Annu Rev Public Health. 2015;36:393–415. doi: 10.1146/annurev-publhealth-031914-122855 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Paluck EL, Ball L, et al. Social norms marketing aimed at gender based violence: A literature review and critical assessment. New York: International Rescue Committee, 2010. [Google Scholar]
  • 23.Scheepers E, Christofides NJ, et al. Evaluating health communication–a holistic overview of the impact of Soul City IV. Health Promotion Journal of Australia. 2004;15(2):121–33. [Google Scholar]
  • 24.Decker MR, Wood SN, et al. Safety decision-making and planning mobile app for intimate partner violence prevention and response: randomised controlled trial in Kenya. BMJ Glob Health. 2020;5(7). doi: 10.1136/bmjgh-2019-002091 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Green DP, Wilke AM, et al. Countering violence against women by encouraging disclosure: A mass media experiment in rural Uganda. Comparative Political Studies. 2020;53(14):2283–320. [Google Scholar]
  • 26.Fotini C, Larreguy H, et al. Can media campaigns empower women facing gender-based violence amid COVID-19? Toulouse School of Economics, 2022. [Google Scholar]
  • 27.Stephenson R, Bartel D, et al. Constructs of power and equity and their association with contraceptive use among men and women in rural Ethiopia and Kenya. Global public health. 2012;7(6):618–34. doi: 10.1080/17441692.2012.672581 [DOI] [PubMed] [Google Scholar]
  • 28.Ellsberg M, Jansen HA, et al. Intimate partner violence and women’s physical and mental health in the WHO multi-country study on women’s health and domestic violence: an observational study. Lancet. 2008;371(9619):1165–72. doi: 10.1016/S0140-6736(08)60522-X [DOI] [PubMed] [Google Scholar]
  • 29.Smith BA, Thompson S, et al. Development of the intimate partner violence attitude scales (IPVAS) with a predominantly Mexican American college sample. Hispanic Journal of Behavioral Sciences. 2005;27(4):442–54. [Google Scholar]
  • 30.UNICEF. Case Study: RapidPro. 2019. [Google Scholar]
  • 31.Parker EM, Gielen AC. Intimate partner violence and safety strategy use: Frequency of use and perceived effectiveness. Women’s health issues. 2014;24(6):584–93. doi: 10.1016/j.whi.2014.08.001 [DOI] [PubMed] [Google Scholar]
  • 32.Boyer C, Levy Paluck E, et al. Religious leaders can motivate men to cede power and reduce intimate partner violence: Experimental evidence from Uganda. Proceedings of the National Academy of Sciences. 2022;119(31):e2200262119. doi: 10.1073/pnas.2200262119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Donati D, Orozco-Olvera V, et al. Using Social Media to Change Gender Norms: An Experimental Evaluation Within Facebook Messenger in India. Available at SSRN 4158065. 2022. [Google Scholar]
  • 34.Storer HL, Nyerges EX, et al. Technology “Feels Less Threatening”: The processes by which digital technologies facilitate youths’ access to services at intimate partner violence organizations. Children and youth services review. 2022;139:106573. [Google Scholar]
  • 35.Henry N, Powell A. Technology-facilitated sexual violence: A literature review of empirical research. Trauma, violence, & abuse. 2018;19(2):195–208. doi: 10.1177/1524838016650189 [DOI] [PubMed] [Google Scholar]
  • 36.Sanz-Barbero B, Barón N, et al. Prevalence, associated factors and health impact of intimate partner violence against women in different life stages. PLoS one. 2019;14(10):e0221049. doi: 10.1371/journal.pone.0221049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Das-Munshi J, Lund C, et al. Mental health inequalities in adolescents growing up in post-apartheid South Africa: cross-sectional survey, SHaW study. PloS one. 2016;11(5):e0154478. doi: 10.1371/journal.pone.0154478 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Hatcher AM, de Kadt J, et al. Experiences of violence and safety. In: de Kadt J, Hamann C, Mkhize SP, Parker AE, editors. Quality of Life Survey 6 (2020/21):Overview Report. Johannesburg: Gauteng City-Region Observatory (GCRO); 2021. [Google Scholar]
  • 39.Hamari J, Koivisto J, et al., editors. Does gamification work?—a literature review of empirical studies on gamification. 2014 47th Hawaii international conference on system sciences; 2014: Ieee.
  • 40.Butler M, Pyzdrowski L, et al. The effects of feedback on online quizzes. International Journal for Technology in Mathematics Education. 2008;15(4). [Google Scholar]
  • 41.Datta S, Burns J, et al. Risking it all for love? Resetting beliefs about HIV risk among low-income South African teens. Journal of Economic Behavior & Organization. 2015;118:184–98. [Google Scholar]
  • 42.La Ferrara E, Chong A, et al. Soap operas and fertility: Evidence from Brazil. American Economic Journal: Applied Economics. 2012;4(4):1–31. [Google Scholar]
  • 43.Banerjee A, La Ferrara E, et al. The entertaining way to behavioral change: Fighting HIV with MTV. National Bureau of Economic Research, 2019. [Google Scholar]
  • 44.Pulerwitz J, Barker G, et al. Promoting more gender-equitable norms and behaviors among young men as an HIV/AIDS prevention strategy. 2006. [Google Scholar]
PLOS Digit Health. doi: 10.1371/journal.pdig.0000358.r001

Decision Letter 0

Padmanesan Narasimhan, Laura M König

9 May 2023

PDIG-D-22-00354

Effects of digital chatbot on gender beliefs and exposure to intimate partner violence among young women in South Africa

PLOS Digital Health

Dear Dr. Hatcher,

Thank you for submitting your manuscript to PLOS Digital Health. After careful consideration, we feel that it has merit but does not fully meet PLOS Digital Health's publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript within 30 days Jun 08 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at digitalhealth@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pdig/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

We look forward to receiving your revised manuscript.

Kind regards,

Laura M. König

Academic Editor

PLOS Digital Health

Journal Requirements:

1. Please amend your detailed Financial Disclosure statement. This is published with the article. It must therefore be completed in full sentences and contain the exact wording you wish to be published.

a. State the initials, alongside each funding source, of each author to receive each grant.

b. State what role the funders took in the study. If the funders had no role in your study, please state: “The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.”

c. If any authors received a salary from any of your funders, please state which authors and which funders.

2. We ask that a manuscript source file is provided at Revision. Please upload your manuscript file as a .doc, .docx, .rtf or .tex.

3. Please provide separate figure files in .tif or .eps format.

For more information about figure files please see our guidelines: LINK

https://journals.plos.org/digitalhealth/s/figures

https://journals.plos.org/digitalhealth/s/figures#loc-file-requirements

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article's retracted status in the References list and also include a citation and full reference for the retraction notice.

Additional Editor Comments (if provided):

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Does this manuscript meet PLOS Digital Health’s publication criteria? Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe methodologically and ethically rigorous research with conclusions that are appropriately drawn based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

--------------------

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

--------------------

3. Have the authors made all data underlying the findings in their manuscript fully available (please refer to the Data Availability Statement at the start of the manuscript PDF file)?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception. The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

--------------------

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS Digital Health does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

--------------------

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: I would like to thank the publisher and the journal for involving me in the peer review of this very interesting study on the “Effects of digital chatbot on gender beliefs and exposure to intimate partner violence among young women in South Africa”.

The study is conducted with great scientific attention and deals with a level of prevention and promotion of crucial importance for people's health and particularly for young women who may be victims of IPV. However, in order for the study to be even more accessible and interesting, some points of reflection are suggested (Note: in the absence of the number of lines, the name of the chapter, paragraph, any SUB, and the line number within the cited chapter, paragraph or sub are indicated: for example, "Abstract", paragraph "Methods", LINE 2 does not refer to line 2 of the entire document or page, but to line 2 of the "Methods" paragraph):

"Abstract", paragraph "Methods", LINE 2, in addition to the number indicated as recruited people "(n = 19,643)" it would be interesting to know, in principle, also the number of people who actually participated in the clinical study (n = 11,630) and the relationship that exists between these two numbers (for example: were they recruited but did not participate or was there abandonment after the initial recruitment?);

With reference to the same lemma, and to ensure greater accessibility in reading for a possible international reader, it would be of great help to use the official rule for punctuation in English, which provides for a comma to separate thousands and a period to separate decimal numbers. Just for example, the following cases are cited that could generate confusion:

"Abstract", paragraph "Methods", PAGE 2, LINE 2 "(n = 19,643)";

"Abstract", paragraph "Methods", PAGE 2, LINE 9 "(accessed by 4·5% of the sample)": for some people who refer to international numbering rules, it could be difficult to understand that it refers to a decimal number, which could be indicated by the notation "4.5%", same problem in "Abstract", "Findings", PAGE 2 LINE 2;

"PROCEDURES", PAGE 6, LINE 15: "Users participating in the trial were remunerated with ZAR 30 (approximately £1·50)": for some people who refer to international numbering rules, it could be difficult to understand that it refers to a decimal number, which could be indicated by the notation "£1.50%";

"RESULTS", LINE 1, "We recruited 19,643 women in 31 days. As expected, 20·0%": the same correction as the previous points is proposed for similar numbering;

"Abstract", paragraph "Methods" PAGE 2, LINE 9 "Primary outcomes were adapted (to?) validated scales of gender beliefs and past-month IPV. Secondary outcomes were identification of unhealthy relationship behaviors and depressive symptoms" would seem to be in contrast with some subsequent sentences in the rest of the document, such as: "We conducted a trial testing whether two versions of a behaviorally-informed chatbot could improve gender attitudes and skills in order to reduce IPV exposure among young women in South Africa." (INTRODUCTION, PAGE 3, LINE 37) or "Digital solutions represent an important approach to scale efforts to reduce IPV among young women. A behaviorally-informed chatbot helped women identify unhealthy relationship behaviors, improve gender equitable norms, and reduce exposure to IPV in the short-term." ("DISCUSSION", "Conclusion", PAGE 11, LINE 1) which instead would seem more consistent with the rest of the document. If the first hypothesis indicated in "Methods" is correct, perhaps it could be written in a way that leaves no doubt for the reader;

"Abstract", "Findings" paragraph, PAGE 2, LINE 1: a reader may be interested in, at this point, knowing which tool was used to evaluate "gender beliefs";

"Abstract", "Findings'' paragraph, PAGE 2, LINE 6: "Neither T1 nor T2 had a measurable effect on mental health" refers to a measurement of the absence of depression (PHQ-2). The extension of the concept of "absence of depression/anhedonia" generalized to the concept of "mental health," may lead the reader to believe that specific tools were used to evaluate "mental health," while instead, a brief screening tool was used to evaluate depression symptoms;

"INTRODUCTION", PAGE 3: the review section is treated very interestingly, a reader may be interested, in addition to the rightly cited information, if interventions on perpetrators/abusers have also been carried out in the same territories;

"INTRODUCTION", PAGE 3 LINE 5: "reported physical or sexual IPV" WHO defines IPV as "behaviour within an intimate relationship that causes physical, sexual or psychological harm, including acts of physical aggression, sexual coercion, psychological abuse and controlling behaviours." As can be seen from this definition, IPV already includes physical partner violence. It is understood that the citation in the text (4) reports exactly the wording of the results in table 2 (4), but probably a more effective wording can be chosen (also among those chosen in the research used as a source), to avoid that the term IPV is interpreted as exclusive for sexual violence and separate from physical violence, if both are practiced by the partner;

"MATERIALS & METHODS", SUB "Treatment Arms" SUB "Pure Control arm", PAGE 4 LINE 2 "The quiz pertaining to power equity was delivered directly after onboarding and eligibility questions.": from this sentence it seems that a questionnaire on "power equity" was administered. It would be interesting to read the questionnaire: is it a previously validated questionnaire or was it constructed ad hoc? What variables does it measure? Is it one of the tools mentioned in the "Measures" paragraph?

"MEASURES", PAGE 5, SUB "Gender equitable beliefs" LINE 3: What is the sample size in the validation assessment?

"MEASURES", PAGE 5, SUB "Mental Health" LINE 1: "Mental health was assessed using the brief screener of depressive symptoms, the two-item Patient Health Questionnaire (PHQ-2): as already noted in point 5, a reader may have incomplete information if it is not specified that the indices used do not measure "mental health" but the absence of depression/anhedonia;

Also in SUB "Socio-demographic," PAGE 5 LINE 5: "The state of users' mental health was assessed through one item asking if, in the past month, they had been mostly "energised and hopeful," "just getting along," or "down and hopeless." " can lead the reader to believe that tools for the evaluation of "mental health" were used;

"PROCEDURES", PAGE 6, LINE 6: "The trial was set up on RapidPro": an inexperienced reader may be interested in the reference to the software used and/or the organization that developed it;

“PROCEDURES”, PAGE 6, LINE 12: “To support 18,000 users on the chatbot…”: It could be indicated that the number is approximate or the precise number of people supported by the chatbot could be provided;

“RESULTS”, PAGE 8, LINE 6: “participants reported to have symptoms consistent with probable depression or anxiety…”: No instruments were indicated for evaluating anxiety;

“DISCUSSION”, PAGE 10, LINE 4 “The chatbot significantly improved skills around identifying unhealthy relationships, but did not alter mental health compared to the control.” and “DISCUSSION”, PAGE 10, LINE 35 “Prevalence of mental health symptoms among this relatively large sample of young women was high…”: As observed in points 5 and 10, the concept of "mental health" is broader than the absence of elements of depression/anhedonia, which makes it complex to refer to this construct without a reader wondering how it was measured;

“DISCUSSION”, “Limitations”: It is suggested to further reflect on the mode of data collection (online mode), which does not guarantee:

that the person who initially declared themselves is actually the one who completed the courses;

that the person actually completes the activities;

it is also possible that the activity is promoted/maintained by the possible receipt of financial compensation;

Thank you for your valuable contribution to research, which touches on a very important and sensitive area for the health of all people. I hope these observations can be useful for a more specific reflection on some of the issues identified.

Best wishes for your work.

Reviewer #2: the Participants in the trial were 18 or older (24yr), what their education background or education level? In this study, the literacy level could play a crucial role. In this study, mental health was assessed using the brief screener of depressive symptoms in this study, there are more complicated to assess and identify depression.

--------------------

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

Do you want your identity to be public for this peer review? If you choose “no”, your identity will remain anonymous but your review may still be made public.

For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Andrea Moi

Reviewer #2: No

--------------------

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLOS Digit Health. doi: 10.1371/journal.pdig.0000358.r003

Decision Letter 1

Padmanesan Narasimhan, Laura M König

25 Aug 2023

Effects of digital chatbot on gender attitudes and exposure to intimate partner violence among young women in South Africa

PDIG-D-22-00354R1

Dear Dr Hatcher,

We are pleased to inform you that your manuscript 'Effects of digital chatbot on gender attitudes and exposure to intimate partner violence among young women in South Africa' has been provisionally accepted for publication in PLOS Digital Health.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow-up email from a member of our team. 

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they'll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact digitalhealth@plos.org.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Digital Health.

Best regards,

Laura M. König

Academic Editor

PLOS Digital Health

***********************************************************

Reviewer Comments (if any, and for reference):

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Does this manuscript meet PLOS Digital Health’s publication criteria? Is the manuscript technically sound, and do the data support the conclusions? The manuscript must describe methodologically and ethically rigorous research with conclusions that are appropriately drawn based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: N/A

**********

4. Have the authors made all data underlying the findings in their manuscript fully available (please refer to the Data Availability Statement at the start of the manuscript PDF file)?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception. The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS Digital Health does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: We thank the editor and the authors of this interesting contribution and congratulate them for all the changes made that make the work very clear, understandable and interesting to the reader.

Reviewer #2: agree to acceptation

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

Do you want your identity to be public for this peer review? If you choose “no”, your identity will remain anonymous but your review may still be made public.

For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Andrea Moi

Reviewer #2: No

**********

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Consent flow.

    (DOCX)

    S2 Fig. Mental health by trial arm, n = 5,022.

    (DOCX)

    S1 Table. Intervention modules.

    (DOCX)

    S2 Table. Gender equitable beliefs regression.

    (DOCX)

    S3 Table. Any IPV exposure regression.

    (DOCX)

    S4 Table. Technological and/or psychological IPV.

    (DOCX)

    S5 Table. Physical and/or sexual IPV.

    (DOCX)

    Attachment

    Submitted filename: Response to reviewer comments 20.docx

    Data Availability Statement

    Data is fully available without restriction at our trial registry: https://osf.io/jcu4n.


    Articles from PLOS Digital Health are provided here courtesy of PLOS

    RESOURCES