Abstract
Background
The potential of chatbots for screening and monitoring COVID-19 has been envisioned since the outbreak of the disease. Chatbots can help disseminate up-to-date and trustworthy information, promote healthy social behavior, and support the provision of health care services safely and at scale. In this scenario and in view of its far-reaching postpandemic impact, it is important to evaluate user experience with this kind of application.
Objective
We aimed to evaluate the quality of user experience with a COVID-19 chatbot designed by a large telehealth service in Brazil, focusing on usability as assessed by real users and on the strengths and shortcomings of the chatbot, as reported by participants in simulated scenarios.
Methods
We examined a chatbot that was developed by a multidisciplinary team and used as a component within the workflow of a local public health care service. The chatbot had 2 core functionalities: assisting web-based screening of COVID-19 symptom severity and providing evidence-based information to the population. From October 2020 to January 2021, we conducted a mixed methods, 2-fold evaluation of user experience with our chatbot: a posttask usability Likert-scale survey presented to all users after they concluded their interaction with the bot and an interview with volunteer participants who engaged in a simulated interaction with the bot guided by the interviewer.
Results
Usability assessment with 63 users revealed very good scores for chatbot usefulness (4.57), likelihood of being recommended (4.47), ease of use (4.44), and user satisfaction (4.38). Interviews with 15 volunteers provided insights into the strengths and shortcomings of our bot. Comments on the positive aspects and problems reported by users were analyzed in terms of recurrent themes and organized in 2 categories: usability of the chatbot (how users can interact with it) and health support offered by it (the chatbot’s goal of supporting people during the pandemic through the screening process and informative content). We found 6 themes accounting for what people liked most about our chatbot and why they found it useful: 3 themes pertaining to the usability domain and 3 regarding health support. We also identified 15 types of problems producing a negative impact on users: 10 related to the usability of the chatbot and 5 related to the health support it provides.
Conclusions
Our results indicate that users had an overall positive experience with the chatbot and found the health support relevant. Nonetheless, qualitative evaluation of the chatbot indicated challenges and directions to be pursued in improving not only our COVID-19 chatbot but also health chatbots in general.
Keywords: user experience, chatbots, telehealth, COVID-19, human-computer interaction, HCI, empirical studies in human-computer interaction, empirical studies in HCI, health care information systems
Introduction
The burden on health systems during the COVID-19 pandemic reached unprecedented levels in both high- and low-income countries. The increase in demand for the provision of care across successive COVID-19 pandemic waves required global public health responses and challenged health care systems’ capacity as well as health units’ resilience [1]. Concomitantly, there was a sudden, unprecedented demand for information alongside the widespread circulation of unreliable and fake information, an “infodemic” [2], which put lives at risk by prompting the population to try unproven medications in the hope of preventing the disease or finding a “cure” [3]. In this context, telehealth and digital health solutions, including chatbots, emerged as a quick and viable response, acting as symptom checkers in digital triage approaches [1,4,5].
Chatbots are conversational agents that interact with people using a text-based interface or spoken natural language [6]. They are usually deployed through website widgets or instant messaging apps and have been increasingly adopted in several different fields such as finance, commerce, marketing, and fitness [7]. They have only recently started to expand into health care [8]. Their method of communication makes them suitable for a variety of target populations; various health conditions; and a broad range of purposes such as patient triage, clinical decision support, and self-management [9-12].
The potential of chatbots for screening and monitoring COVID-19 has been envisioned since the disease outbreak as a strategy not only to disseminate up-to-date and trustworthy information but also to promote healthy social behavior and to support the provision of health care services safely and at scale [13]. For the purpose of pandemic management, chatbots might teach people about social distancing and other prevention measures; clarify doubts about symptoms, treatments, and vaccines; and help screen patients remotely, avoiding unnecessary visits to health care centers that could lead to crowding and take up valuable time of health care professionals [14].
In this scenario and in view of its far-reaching postpandemic impact, it is critically important to evaluate user experience with this type of technology. Despite the World Health Organization (WHO) recommendation regarding the assessment of user interaction for the adoption of digital technologies in health care, evidence on chatbot assessment in the context of the COVID-19 pandemic and other conditions is still scarce [4,5,15]. Such assessment is of utmost importance not only as a way to assess and enhance users’ experiences but also to improve the technology itself, so that it can fulfill its ultimate goal of promoting public health and saving lives even in a scenario marked by uncertainty due to the lack of evidence and by ethical risks. In addition, assessment can provide insights for the development of chatbots for other conditions. The better the quality of user experience, the greater the chances of adoption and benefits for most users.
Therefore, this paper sought to evaluate the quality of user interaction with a chatbot developed to respond to the COVID-19 pandemic by a large telehealth service in Brazil to assess users’ overall experiences, including strengths and shortcomings, as reported by participants.
Methods
Chatbot Development and Implementation
The planning and development of our COVID-19 chatbot were described in detail previously [16,17]. The bot was developed in March 2020 at the beginning of the first wave of the COVID-19 pandemic in Brazil to provide 2 core functionalities. The first was assisting web-based screening of COVID-19 symptom severity based on a decision tree that considered available evidence and recommendations from the Brazilian Ministry of Health [18] and the WHO [19]. This functionality was meant to (1) advise the population whether and when to seek care, with people with no warning signs advised to stay home; and (2) queue patients for teleconsultation, prioritizing those with severe warning signs and comorbidities [20]. Figure 1 shows a flowchart of the stages the user traverses guided by the chatbot questions. Colors are used to classify cases: (1) red (user advised to seek immediate, emergency care); (2) orange (user advised to seek urgent care at the hospital); (3) yellow (user advised to seek care in reference centers); and (4) green (user advised to stay at home unless new warning signs appear).
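The color-coded screening and the queue prioritization described above can be illustrated with a minimal sketch. The four outcomes and their advice come from the text; the boolean flags in `Answers`, the function names, and the branch order are hypothetical placeholders and do not reproduce the clinical criteria of the actual decision tree (Figure 1).

```python
from dataclasses import dataclass
from enum import IntEnum


class TriageColor(IntEnum):
    """Screening outcomes; a higher value means a more severe case."""
    GREEN = 1
    YELLOW = 2
    ORANGE = 3
    RED = 4


# Advice per outcome, paraphrased from the description of the four colors.
ADVICE = {
    TriageColor.RED: "Seek immediate, emergency care.",
    TriageColor.ORANGE: "Seek urgent care at the hospital.",
    TriageColor.YELLOW: "Seek care in a reference center.",
    TriageColor.GREEN: "Stay at home unless new warning signs appear.",
}


@dataclass
class Answers:
    # Hypothetical flags standing in for the chatbot's clinical questions.
    emergency_signs: bool = False
    urgent_signs: bool = False
    needs_assessment: bool = False
    comorbidities: bool = False


def screen(a: Answers) -> TriageColor:
    """Walk the assumed tree from the most severe branch to the least severe."""
    if a.emergency_signs:
        return TriageColor.RED
    if a.urgent_signs:
        return TriageColor.ORANGE
    if a.needs_assessment:
        return TriageColor.YELLOW
    return TriageColor.GREEN


def queue_key(a: Answers):
    """Sort key for the teleconsultation queue: severity first, then comorbidities."""
    return (-int(screen(a)), not a.comorbidities)


if __name__ == "__main__":
    patients = [Answers(), Answers(urgent_signs=True),
                Answers(urgent_signs=True, comorbidities=True)]
    for p in sorted(patients, key=queue_key):
        color = screen(p)
        print(color.name, "-", ADVICE[color])
```

Ordering the enum by severity keeps the queue logic to a single sort key; a real deployment would replace the placeholder flags with the questions of the validated tree.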
The second functionality aimed to supply evidence-based information to the population at a time of uncertainty, misinformation, and widespread dissemination of fake news. Misleading information can be created and spread unintentionally or intentionally to cause harm (misinformation vs disinformation vs malinformation) [21]. In addition, some misleading information stemmed from the lack of consistent evidence regarding many aspects of this new disease, which demanded continuous revision of the scientific basis of the chatbot. The information was provided as question and answer (Q&A) pairs based on frequently asked questions in the database at the Telehealth Center of the University Hospital at Universidade Federal de Minas Gerais [22]. The questions were initially grouped into 11 topics (general information, transmission, symptoms, advice for suspected cases, treatment, home care, hygiene, lifestyle, mask use, pregnancy, and pet care) and later expanded to include diagnosis. A group of health care professionals at the Telehealth Center selected 85 Q&A pairs based on the best available evidence and following the Brazilian Ministry of Health [18] and WHO [19] recommendations.
Our chatbot, which had a female identity and the name Ana, was developed using BLiP [23], a proprietary platform offered as a service for the development of conversational agents. The chatbot was available via different channels, namely as an app on WhatsApp (Meta Platforms Inc); as a webchat on the websites maintained by the Telehealth Center [24] (Figure 2), the city of Teófilo Otoni [25], and the University of São João del Rei in Divinópolis [26]; and as an “embedded” app hosted by the Divinópolis municipal health department [20]. A version of the chatbot with a male identity and the name Pedro was also made available on the website maintained by Universidade Federal de Minas Gerais for students and personnel to queue for teleconsultations and have access to frequently asked questions. For the purposes of our study, we focused solely on the chatbot Ana.
Study Design
A mixed methods approach was used, and user experience with the chatbot was evaluated through (1) a posttask usability survey administered to users who resorted to the bot for symptom checking, to gather their impressions immediately after they concluded their interaction with the bot, and (2) an interview with volunteer participants who engaged in simulated interaction with the bot guided by the interviewer. We adopted a convergent parallel mixed methods design [27], in which data were collected and analyzed separately, and the results were presented side by side and then related at the end. Both studies address the same overarching research question regarding user experience with the chatbot. The quantitative study is meant to indicate a broad trend, whereas the qualitative study is meant to provide deeper insight into the user experience.
Ethics Approval
The study protocol was approved by the Brazilian National Commission for Research Ethics (CAAE 35953620.9.0000.5149). Individual informed consent was obtained from all participants.
Usability Survey
A brief usability survey was used to assess users’ overall impressions after they had concluded using the chatbot. The survey was intended to evaluate chatbot usability at scale. Because respondents were symptomatic users who were concerned about their health condition and unlikely to spend much time answering a questionnaire, we opted for a small set of 4 questions drawing on the classic criteria for usability assessment [28,29]. The questions asked about 4 usability aspects, namely ease of use, usefulness, satisfaction, and likelihood of recommending the bot to other users. Answers were collected using a 5-point Likert scale, ranging from 1 (worst score) to 5 (best score), representing the strength with which the respondent agreed or disagreed with each question. All users were invited to reply to the survey, but replying was optional: users could accept the invitation or conclude their chatbot session without answering. From October 2020 to January 2021, 622 complete interactions with the chatbot were recorded. In total, 63 out of 622 users agreed to fill in our usability survey (response rate of 10.1%). Table 1 shows the sociodemographic data of respondents and nonrespondents.
Table 1.
Users | Age (years), mean (SD) | Women (n=380), n (%) | Men (n=237), n (%) | Not declared (n=5), n (%) | Total (n=622), n (%) |
Respondents | 36.1 (14.4) | 46 (12.1) | 17 (7.2) | 0 (0) | 63 (10.1) |
Nonrespondents | 34.5 (14.1) | 334 (87.9) | 220 (92.8) | 5 (100) | 559 (89.9) |
Total | 34.7 (14.1) | 380 (100) | 237 (100) | 5 (100) | 622 (100) |
^a Information was recorded during interaction as informed by users.
The respondents had a mean age of 36.1 (SD 14.4) years and were predominantly women (46 out of 63 respondents, 73%).
Descriptive statistics were used to summarize the characteristics of the users and their responses to the usability questions. Quantitative variables were summarized using means and SDs, medians, minimum and maximum values, or IQRs, depending on the data distribution. Qualitative variables were presented as absolute values and percentages. Box plots were used to visualize the grades assigned by users on each assessment criterion.
Qualitative Assessment: Users’ Interviews and Analysis
To capture users’ assessment of the chatbot interface and core functionalities (screening and educational session), we conducted remote teleconference sessions with 15 asymptomatic volunteers of different ages, sexes, and occupations, recruited by the research team. Each participant received a scenario describing a situation that would prompt their interaction with the chatbot. The researcher observed and recorded their interaction. The session was followed by a semistructured interview to gather insights on their experience with the chatbot and their perceptions of its strengths and shortcomings.
The evaluation was conducted through a teleconference system and took place between November 2020 and January 2021 as the second wave of the pandemic started in Brazil. The interviews were transcribed, and a thematic analysis was performed [30].
Among the 15 participants, 53% (n=8) were female, with ages ranging from 18 to 62 (mean 38.1, SD 15.7; median 37; minimum=18, Q1=25, Q3=51, maximum=62) years, and 73% (n=11) had a higher education degree. Out of the participants, 33% (n=5) were engaged in teaching or research at the university, 27% (n=4) were students and 40% (n=6) were regular or self-employed workers. With regard to the device used to interact with the chatbot, 80% (n=12) used a desktop or laptop computer, whereas 20% (n=3) used a smartphone. The participants’ data are detailed in Table 2.
Table 2.
Participant | Age (years) | Sex | Education | Occupation | Device used |
P01 | 37 | Male | Graduate degree^a | University lecturer | Desktop or laptop computer |
P02 | 48 | Female | Graduate degree^a | University lecturer | Desktop or laptop computer |
P03 | 25 | Male | Graduate degree^a | Attorney | Desktop or laptop computer |
P04 | 40 | Male | Graduate degree^a | IT or user experience designer | Smartphone |
P05 | 25 | Female | Bachelor’s degree in linguistics | Student pursuing a master’s degree | Desktop or laptop computer |
P06 | 58 | Female | Bachelor’s degree in nutrition science | Credentialed dietitian and undergraduate student in psychology | Smartphone |
P07 | 27 | Female | Bachelor’s degree in veterinary studies | Undergraduate student in linguistics | Desktop or laptop computer |
P08 | 52 | Female | Graduate degree^a | Lecturer | Desktop or laptop computer |
P09 | 33 | Male | Graduate degree^a | Sociologist | Desktop or laptop computer |
P10 | 50 | Female | Graduate degree^a | Psychologist | Desktop or laptop computer |
P11 | 20 | Female | High school degree | Undergraduate student in psychology | Desktop or laptop computer |
P12 | 18 | Female | High school degree | Student | Desktop or laptop computer |
P13 | 59 | Male | Bachelor’s degree in computer science | IT analyst | Desktop or laptop computer |
P14 | 18 | Male | High school degree | Undergraduate student | Desktop or laptop computer |
P15 | 62 | Male | High school degree | Insurance broker | Smartphone |
^a Master’s or doctoral degree.
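As a worked check, the age summary reported for the 15 interview participants can be recomputed from the ages listed in Table 2. The sketch below uses only the Python standard library; the function name `summarize_ages` is illustrative, and the choice of the "inclusive" quartile method is an assumption (the paper does not state which method was used), although it reproduces the reported Q1 and Q3.

```python
import statistics

# Ages of participants P01..P15, copied from Table 2.
AGES = [37, 48, 25, 40, 25, 58, 27, 52, 33, 50, 20, 18, 59, 18, 62]


def summarize_ages(ages):
    """Return the descriptive statistics reported in the text for participant age."""
    # Assumption: quartiles by linear interpolation over the sample ("inclusive").
    q1, _, q3 = statistics.quantiles(ages, n=4, method="inclusive")
    return {
        "mean": statistics.mean(ages),
        "sd": statistics.stdev(ages),      # sample SD (n - 1 denominator)
        "median": statistics.median(ages),
        "min": min(ages),
        "q1": q1,
        "q3": q3,
        "max": max(ages),
    }


if __name__ == "__main__":
    for name, value in summarize_ages(AGES).items():
        print(f"{name}: {value:.1f}")
```

Under these assumptions, the computed values agree with the figures given in the text (mean 38.1, SD 15.7, median 37, Q1 25, Q3 51).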
In the evaluation session, the participants received a scenario describing a situation that would prompt their interaction with the chatbot. A set of 10 different scenarios was prepared to cover different chatbot interactive paths in the screening functionality, from severe to mild symptoms, with and without comorbidities (Multimedia Appendix 1). Participants were assigned to scenarios according to their actual profiles to make the interaction as realistic as possible. For example, an adult woman in her 30s was assigned the scenario of a pregnant woman, and a participant in their 60s was assigned the scenario of a person with some comorbidity. Similarly, each scenario included 3 topics to assess the educational functionality of the chatbot: 2 preassigned topics and a third one freely chosen by the participant. Most sessions lasted between 30 minutes and 1 hour. During the sessions, the participants interacted with the chatbot while the researcher observed and recorded their interactions. Afterward, they were interviewed about their experience with the chatbot (the interview script used is available in Multimedia Appendix 2). The evaluation was conducted through a teleconference system chosen by the participant, in individual sessions held in Portuguese (the participants’ mother tongue), between November 2020 and January 2021.
The interviews were recorded and included screen recordings of participants’ interactions with the chatbot. The interviews were transcribed by the research team. Thematic analysis [30] of the interview transcripts was carried out to find recurrent themes in participants’ interviews that could be matched to the research questions guiding our study as follows:
What are the strengths and shortcomings of our bot as perceived by users?
What particular insights can be drawn from our study to inform prospective chatbot design?
Our thematic analysis was conducted in an inductive way, that is, a bottom-up approach, where the analysis is not driven by a preexisting framework or theory, but the researchers search for codes and themes in a data-driven way [30]. This approach is applicable for qualitative analysis of interview data [31] and is more suitable for broad rather than specific research questions, as was our case [30].
We applied triangulation as a typical strategy to improve the quality and reliability of our qualitative results [32]. In particular, the data were analyzed by multiple researchers (investigator triangulation [33,34]) and the outcome of their analysis was discussed until consensus was reached. The transcripts were coded by 2 senior and 2 junior researchers, with a set of at least 5 transcripts being assigned to each one for analysis and coding. Thus, every interview was analyzed by at least 2 different researchers. The interview transcripts were analyzed using the qualitative data analysis software QDA Miner Lite (Provalis Research) [35]. Finally, the codes were presented to peers, refined, and organized as per this report in discussions with other senior researchers from the team.
Results
Usability Questionnaire
Table 3 shows the questions asked and the number of users who assigned each grade to each criterion. The bot obtained high grades on all evaluation criteria. App usefulness obtained the highest mean (4.57), whereas satisfaction attained the lowest mean (4.38). Figure 3 shows a box plot of the grades assigned by users as per quartile distribution in Table 4, clearly indicating predominance of grades 4 and 5 with few outliers.
Table 3.
Question | Grade 1, n | Grade 2, n | Grade 3, n | Grade 4, n | Grade 5, n | Total, n | Values, mean (SD) |
1. Was this app easy to use? | 4 | 3 | 1 | 8 | 47 | 63 | 4.44 (1.16) |
2. Was this app useful to you? | 2 | 0 | 6 | 6 | 47 | 61 | 4.57 (0.92) |
3. Was this app satisfactory to use? | 4 | 1 | 6 | 6 | 43 | 60 | 4.38 (1.17) |
4. Would you recommend this app to other people? | 4 | 2 | 2 | 5 | 46 | 59 | 4.47 (1.16) |
Table 4.
Statistic | Ease of use, n | Usefulness, n | Satisfaction, n | Recommendation, n |
Minimum value | 1 | 1 | 1 | 1 |
Quartile 1 (25%) | 4 | 5 | 4 | 5 |
Quartile 2 (50%): median | 5 | 5 | 5 | 5 |
Quartile 3 (75%) | 5 | 5 | 5 | 5 |
Maximum value | 5 | 5 | 5 | 5 |
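The means and SDs in Table 3 can be recomputed directly from the grade counts, which makes the summary easy to audit. The following is a minimal sketch using the Python standard library; the dictionary keys and the function names are illustrative, and the use of the sample SD (n - 1 denominator) is an assumption that happens to reproduce the reported values.

```python
import statistics

# Number of users who assigned grades 1..5 to each question (from Table 3).
COUNTS = {
    "ease_of_use":    [4, 3, 1, 8, 47],
    "usefulness":     [2, 0, 6, 6, 47],
    "satisfaction":   [4, 1, 6, 6, 43],
    "recommendation": [4, 2, 2, 5, 46],
}


def expand(counts):
    """Turn a frequency table into the list of individual grades."""
    return [grade for grade, c in enumerate(counts, start=1) for _ in range(c)]


def summarize(counts):
    """Return n, mean, sample SD, and median for one question."""
    grades = expand(counts)
    return {
        "n": len(grades),
        "mean": statistics.mean(grades),
        "sd": statistics.stdev(grades),   # assumed: sample SD
        "median": statistics.median(grades),
    }


if __name__ == "__main__":
    for question, counts in COUNTS.items():
        s = summarize(counts)
        print(f"{question}: n={s['n']}, mean={s['mean']:.2f}, "
              f"SD={s['sd']:.2f}, median={s['median']}")
```

Quartiles are left out on purpose: different interpolation rules can give different first quartiles for such skewed data, and the rule behind Table 4 is not stated.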
Qualitative Assessment
Initially, excerpts were annotated with the following tags: positive feedback (aspects reported as positive by the participants regarding interaction, interface, and content of the chatbot); negative feedback (points considered negative by the participants pertaining to interaction, interface, and content of the chatbot); and neutral (comments that did not qualify as either ostensibly positive or negative). Different themes emerged in each broad category (positive or negative).
As a next step, we analyzed each of the themes and organized them based on whether they were related to the usability of the chatbot or to the health support it offered. Themes associated with the usability of the chatbot pertained to issues related to the system’s interface, the users’ perspective of the effectiveness and efficiency of the proposed functionalities, and users’ perceptions and responses to the use of the system. Health support included all themes that addressed aspects of how the chatbot achieved its goal to offer support regarding COVID-19 screening and education to users.
Our classification allowed us to reveal and point to problems related to different sources in our COVID-19 chatbot—design decisions of the technology itself and how it supports users’ needs for health information in the context of COVID-19 screening and education. Next, we present the results of our analysis and describe each identified theme. We present both the positive and negative aspects that emerged from our analysis. Nonetheless, we examined the negative aspects in more detail, as they point to the aspects that still need to be improved and dealt with in health chatbots.
Positive Feedback
On the basis of our analysis of the positive comments from participants, 6 different themes emerged—3 related to the usability of the chatbot and the other 3 pertaining to health support.
Regarding the chatbot’s usability (Table 5), participants in general found the interface esthetically pleasing and with good usability (C1). They reported an overall positive experience with the chatbot, mostly because of its ease of use (C2). Finally, some participants showed a reasonable level of understanding about the chatbot’s underlying logic, which is positive in the sense that the interaction improves as the user understands how the technology works (C3).
Table 5.
Code | Description | Occurrences, n | Examples |
C1. Chatbot interface design and functionalities | Comments on chatbot graphic interface design, including font-size and text display on screen, chatbot esthetics, and use of button-limited options | 4 | |
C2. Positive user experience | General positive comments on overall experience of interacting with the chatbot, for example, ease of use | 26 | |
C3. Understanding chatbot underlying rationale | Comments on user perception and understanding of rationale behind the chatbot operation | 3 | |
As for the themes associated with health support (Table 6), participants valued the fact that the screening process was simple and straight to the point, helping users understand the action they should take (C4). Furthermore, they found that there was a broad range of topics in the Q&As, including content related to fake news that had been circulating at the time, and considered the answers concise and easy to understand, a very frequent comment in their interviews (C5). Finally, participants found that the chatbot was useful and valuable, especially considering the circumstances they were living in at the time—it was trustworthy and allowed them to obtain reliable information without the risk of getting infected (C6).
Table 6.
Code | Description | Occurrences, n | Examples |
C4. Patient screening session: process and guidelines | Considerations about directions given, color system used in the triage phase, and chatbot guidance during the screening session | 18 | |
C5. Question and answer session: range of topics and trustworthiness of information provided | Number and content of answers considered satisfactory as well as effective in expanding knowledge about disease | 67 | |
C6. Reported advantages of chatbot use during COVID-19 pandemic | Motivations and advantages of using a chatbot during the COVID-19 pandemic | 18 | |
Negative Feedback
On the negative side, although we obtained approximately the same number of excerpts as in positive feedback, our analysis led to a larger set of categories, 10 of them related to usability and 5 related to health support.
Regarding usability (Table 7), different types of problems emerged, ranging from technical problems to interaction and interface design problems to unmet expectations of better communicative capability (which would require artificial intelligence [AI] support). Technical problems were reported by participants who faced difficulties when sharing their location with the chatbot (C7) or who found that the app sometimes became slow or unresponsive on their mobile phones (C8).
Table 7.
Code | Description | Occurrences, n | Examples |
C7. Difficulty in sharing location | Participant unable to share device GPS location when requested by chatbot | 10 | |
C8. Technical problem in mobile app or phone | Chatbot stops responding or gets slow; interaction is interrupted | 4 | |
C9. Need for an option to go back and make a different choice during interaction | Participant needs to start over and repeat entire session, as the chatbot does not have an option for backtracking or choosing a different path during conversation | 2 | |
C10. User choice repeatedly prompted by option menu and at high pace | Participant complaint about being prompted to make a choice in option menu and finding it too fast to be able to read the whole answer provided by the bot | 6 | |
C11. Conversation flow management | Participant does not succeed in keeping the conversation flowing with the chatbot owing to unperceived feedback or lack of it from the chatbot (eg, turn taking management) | 4 | |
C12. Better interface resources | Additional features in chatbot interface to enhance interaction | 12 | |
C13. Insufficient directions on how to interact with the chatbot | Participant requesting directions or help from the interviewer | 22 | |
C14. Chatbot language needs to be adapted to meet different user profiles | Language used by the chatbot needs to be adapted to be understood by users with a low literacy level | 2 | |
C15. Chatbot fails to understand unexpected user responses | Chatbot does not successfully process information entered by the participant | 11 | |
C16. Participants’ expectations exceed chatbot’s actual communicative ability | Participant tries to interact in a way not supported by the chatbot, for example, by trying to speak to the chatbot by voice | 5 | |
Although participants considered their overall experience with the chatbot to be good, they commented on many issues that could improve the interaction if solved. Regarding the flow of the conversation, some participants had difficulties when trying to go back after typing a wrong option (eg, C9). Some participants complained about the menu being displayed too quickly and hindering their ability to read the chatbot’s (previous) response (C10). Another problem related to conversation flow was observed when participants did not understand how their interaction with the chatbot evolved. For instance, a participant missed the cue indicating that the chatbot was answering and did not wait for it to respond before sending another message (C11).
The lack of graphical interactive resources (eg, clickable menu options) was also an issue for some participants (C12). Another problem we observed in some sessions was that participants did not know how to start the conversation with the chatbot and asked the interviewer for guidance owing to the absence of basic initial directions (C13). Some participants also commented that the chatbot language may not be adequate for users with lower levels of literacy (ie, they may not be able to understand it), which indeed can be an issue in Brazil (C14).
Finally, we also observed some interaction problems related to our chatbot technology limitations. Some participants entered unexpected text inputs into the chatbot that it was not prepared to handle (C15). Similarly, others tried to interact with the chatbot by typing or even speaking in natural language (C16). Both issues could be addressed by applying better support for natural language processing and understanding using AI, which was already commonly found in several conversational systems at the time.
Regarding health support (Table 8), participants commented on some outdated or missing information they noticed in some answers (C17 and C18). It is worth pointing out that interviews took place at the end of 2020 when there was still much to be learned about COVID-19. Furthermore, this was around the time when the vaccine was underway and the chatbot did not have any information about it yet. In some cases, participants reported dismay with the briefness of the clinical evaluation during the screening session (a participant, for instance, expected a more detailed and thorough evaluation of her symptoms before the chatbot gave her instructions) and the lack of mechanisms to mitigate responses for severe symptoms (C19). Finally, the last themes have to do with the need, mentioned by some participants, for more practical and situated guidance or information both in the screening section (C20) and in the Q&A section (C21).
Table 8.
Code | Description | Occurrences, n | Examples |
C17. Outdated information or answer | Participants noticed some outdated information or questioned whether the information presented in the Q&A^a session was updated | 4 | |
C18. Missing information or explanation | Participants suggested a topic in the Q&A session that should be included or further explained | 5 | |
C19. Unfulfilled expectations during the screening session | Participants mentioned interesting insights and broken expectations during the screening process | 13 | |
C20. Need or demand for actionable orientation during the screening session | Participants expected to receive more practical instructions at the end of the screening session | 5 | |
C21. Demand for situation-oriented answers to questions | Participants expected to find answers that could be more directly applied to a particular situation in the question and answer session | 7 | |
^a Q&A: question and answer.
As is the case with any qualitative analysis, numeric information should be interpreted cautiously and is presented here for the sake of transparency. It should be noted that the number of tagged occurrences of a code is not a general indicator of relevance or importance, because our analysis was not based on frequency or other statistical metrics. Thus, this information is not meaningful to discuss codes’ validity [31] and was included as an index of the overall analysis process, not to indicate any validation of the analysis. Table 9 shows that the number of negative codes is greater than the number of positive codes. This is expected because we analyzed the negative aspects more thoroughly, as stated earlier, leading to individually less frequent and more fine-grained negative codes. In the category level, frequency of codes can be an approximate indicator of the distribution of positive and negative aspects. In Table 9, we can see that 54.8% (136/248) of the excerpts were identified as positive, and the remaining 45.2% (112/248) were identified as negative.
Table 9.
Category | Codes, n | Tagged occurrences (n=248), n (%) |
Positive | 6 | 136 (54.8) |
Positive: User experience | 3 | 33 (13.3) |
Positive: Health support | 3 | 103 (41.5) |
Negative | 15 | 112 (45.2) |
Negative: User experience | 10 | 78 (31.5) |
Negative: Health support | 5 | 34 (13.7) |
Total | 21 | 248 (100) |
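The percentages reported in Table 9 can be re-derived from the raw counts of tagged occurrences. The following Python sketch is illustrative only and was not part of the study's analysis; the names `occurrences`, `share`, `category_shares`, and `domain_shares` are assumptions introduced here for the example.

```python
# Counts transcribed from Table 9: (category, domain) -> tagged occurrences.
occurrences = {
    ("Positive", "User experience"): 33,
    ("Positive", "Health support"): 103,
    ("Negative", "User experience"): 78,
    ("Negative", "Health support"): 34,
}


def share(count, total):
    """Return count as a percentage of total, rounded to 1 decimal place."""
    return round(100 * count / total, 1)


# Overall number of tagged excerpts across all codes.
total = sum(occurrences.values())

# Sum the two domains within each category (Positive, Negative).
category_totals = {}
for (category, _domain), count in occurrences.items():
    category_totals[category] = category_totals.get(category, 0) + count

# Percentage of all tagged excerpts per category and per (category, domain) pair.
category_shares = {c: share(n, total) for c, n in category_totals.items()}
domain_shares = {key: share(n, total) for key, n in occurrences.items()}

print(total, category_shares)
```

Consistent with the caution stated above, these shares describe only how the tagged excerpts are distributed; they do not indicate the relevance or validity of any code.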
Discussion
Overall Findings
The WHO guidelines point out that, given the potential impact of interface and interaction issues on health care services and even on clinical practice [15], it is essential to evaluate user experience in health care systems. Despite the increased use of chatbots in a range of fields, this form of technology has yet to be robustly assessed, and the literature on the acceptability, safety, and effectiveness of these conversational agents is still incipient [7]. Moreover, the lack of standardization and paucity of objective measures make it difficult to compare the performance of health chatbots [36].
In this paper, we present users’ evaluations of a chatbot developed specifically for screening cases and supplying information regarding COVID-19. We performed a brief, quantitative assessment with actual chatbot users and an in-depth evaluation with participants through simulated scenarios (volunteers who were asymptomatic and engaged in chatbot interactions as guided by the interviewer).
Although our quantitative analysis indicated that overall users were satisfied with the chatbot, our qualitative analysis allowed us to identify participants’ perspectives of positive and negative aspects regarding usability and health support, as described in sections Positive Feedback and Negative Feedback. The positive comments from the qualitative study corroborate the quantitative results we found, as positive comments represented approximately 55% of all comments, and the most frequent codes emphasized an overall positive experience (C1) and the usefulness of the provided health support during the pandemic context (C4-C6). At the same time, the negative comments in the qualitative study are not in conflict with the overall positive experience from the quantitative study. All volunteers from the qualitative study reported having an overall positive experience with our chatbot during the interviews. The negative comments should be interpreted as opportunities for improvement that did not compromise the overall experience. In the subsequent sections, we discuss some of the main issues based on our analysis.
Updated Chatbot Information
The results indicate that the pandemic context created 2 specific circumstances that led participants to assign value to having a chatbot available: fake news dissemination about COVID-19 and the disease’s high transmission rate. Participants welcomed the possibility of having access to reliable information at a time when plenty of fake news about COVID-19 was circulating in Brazil, presumably connected to political interests and governmental sources as well as misinformation and infoxication from inappropriate scientific papers [37]. Furthermore, knowledge about COVID-19 was rapidly evolving, and the population was seeking sources of trustworthy information. Participants also felt that obtaining directions as to how to proceed in case of symptoms, without having to risk exposure to the virus, was a positive factor. On the other hand, because information evolved so quickly, participants noticed that information provided by the chatbot was not fully up to date (eg, about vaccines, which were underway). This was perceived as a negative aspect that could undermine the reliability assigned to the chatbot, and it points to the challenge of keeping information constantly updated in conversational agents. This includes deciding which pieces of new information are relevant to be included and how to best translate new scientific evidence for the lay population, a challenge also faced by decision support systems in general [38]. As previously stated, developing a high-quality COVID-19 chatbot is critical but not enough for widespread adoption. It is fundamental to demonstrate and emphasize that chatbots are able to deliver the same quality service as human agents [39].
Universal Usability
Another aspect that emerged in our analysis, which is very relevant to the Brazilian context and may also be relevant to other resource-limited countries, is the need for universal usability [40], that is, to provide access to technology to all citizens. In the case of our chatbot, issues related to access quality were observed by the interviewer or directly reported by the participants in our evaluation. Some participants used their own cell phones to assess the chatbot. However, in one case, the participant had technical problems that were not software bugs strictly speaking but seemed to be associated with limitations of the user’s device related to the operating system and to hardware resources. Although currently there are more smartphones in use in the country than citizens [41], owing to the inequalities in our country, the chatbot may not be universally accessible through all smartphone models in use. Furthermore, one participant specifically raised the issue of the educational level of other Brazilian citizens and pointed out the need to adapt the chatbot’s language to a larger variety of user profiles. These results corroborate not only the need to assess user experience of health care technology in general but also the need to address issues brought about by local conditions of technology use in a country or region.
Besides quality of access to technology, literacy and accessibility issues also need to be tackled in chatbot development. Srivastava [42] reviewed gaps found in using chatbots during COVID-19, and one of them was “inaccessible information,” that is, most of the chatbots created assumed that the users were literate, experienced with digital technology, and did not have any disabilities. These assumptions prevented a considerable part of society from benefiting from chatbot technology. In the case of our chatbot Ana, our team performed several updates to the chatbot language, aiming at making the language more accessible to less-literate users and enhancing user experience.
Expected Communicative Abilities
Analyzing the negative aspects of interacting with the chatbot, we noticed that most of them (6 out of 10—C9, C10, C11, C13, C15, and C16) were related to the conversational paradigm adopted in such technologies. Participants reported on interactive breakdowns (ie, problems they had as they interacted with the system) that were generated by many different causes—from not knowing exactly how to interact with it (C13) to expecting too much of the bot’s communicative abilities (C16). Although these challenges are mainly related to chatbots in general [43-47] and not only in the health domain, they emerged as hindering users’ interaction with the system and impacted (negatively) their experience with it, which could lead them to not fully embrace or adopt the technology.
Complete Health Care Information
It is important to understand the negative aspects related to the health support offered by the chatbot, as this is the technology’s main goal. Out of the 5 themes describing these negative aspects, 2 of them, as mentioned, were related to the need to keep information updated and complete when knowledge of the disease was continuously evolving during the beginning of the pandemic. A third one was the system not fulfilling users’ expectations regarding a more thorough clinical evaluation or more careful instructions to patients. The decision-making framework of a chatbot is crucial to address this issue. Models based on user-initiated solutions are usually easier to deploy; however, this type of solution may be insufficient in some scenarios and may lead to situations in which a high-risk person or a person with issues regarding specific conditions or contexts would rather seek an in-person assessment in a health care facility, because they did not feel safe or could not follow the recommendations. On the other hand, models based on provider-initiated solutions allow providers to “close the loop” and properly address more specific conditions [11].
Contextual Information and Adaptive Ability
Finally, the last 2 themes (C20 and C21) point to the expectation, highlighted by some users, of having more practical orientation and situation-oriented information. These kinds of features would demand more sophisticated technologies that might be able to handle contextual information from users to identify contextual needs and adaptively respond to them. To achieve this type of goal, we would need at least a richer data set comprising a reasonable set of different situations and Q&A pairs and more sophisticated technologies able to detect and handle users’ contexts appropriately. Context could be inferred from isolated conversations but would probably be better constructed by technologies that combine external variables (historical data, location, etc), as search engines and advertisement technologies do. We believe that adaptive AI capabilities such as recommender systems can be included in the chatbot to provide more specific instructions that would take into consideration the users’ specific condition (eg, comorbidities) or context (eg, location) in the answers and advice given.
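To make the idea of situation-oriented answers more concrete, the following Python sketch selects, from a set of candidate answers, the most specific one that applies to a user's condition and location. It is a simple rule-based filter, not a recommender system and not the implementation of our chatbot; all names (`UserContext`, `Answer`, `pick_answer`) and all answer texts are hypothetical placeholders introduced for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class UserContext:
    """Hypothetical user context: known comorbidities and a location label."""
    comorbidities: set = field(default_factory=set)
    location: str = ""


@dataclass
class Answer:
    """Candidate answer; an empty requirement means it applies to everyone."""
    text: str
    requires_comorbidity: str = ""
    requires_location: str = ""


def matches(answer, ctx):
    """True if every requirement stated by the answer is met by the context."""
    if answer.requires_comorbidity and answer.requires_comorbidity not in ctx.comorbidities:
        return False
    if answer.requires_location and answer.requires_location != ctx.location:
        return False
    return True


def specificity(answer):
    """Number of contextual requirements the answer declares (0, 1, or 2)."""
    return bool(answer.requires_comorbidity) + bool(answer.requires_location)


def pick_answer(candidates, ctx):
    """Return the applicable answer with the most requirements, or None."""
    applicable = [a for a in candidates if matches(a, ctx)]
    if not applicable:
        return None
    return max(applicable, key=specificity)


# Placeholder texts only; real content would come from clinical protocols.
candidates = [
    Answer("GENERIC_GUIDANCE"),
    Answer("GUIDANCE_FOR_DIABETES", requires_comorbidity="diabetes"),
    Answer("GUIDANCE_FOR_DIABETES_IN_CITY_A",
           requires_comorbidity="diabetes", requires_location="city-a"),
]

ctx = UserContext(comorbidities={"diabetes"}, location="city-b")
print(pick_answer(candidates, ctx).text)
```

Falling back to the generic answer when no contextual rule applies keeps the behavior of a non-adaptive chatbot as the default, so missing context never leaves the user without a response.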
An adaptive approach can also be used as a strategy to address users’ diversity in skills and preferences. We observed conflicts between the participants’ comments and opinions, such as C1 contradicting C9, C10, and C12 and C3 contradicting C13. As mentioned before, most participants in the qualitative study had an overall positive experience with our chatbot, and negative comments should be interpreted as opportunities for improvement and not as a conflict of results. However, the participants were bothered by our chatbot’s problems when interacting with the system in different ways, depending on their profile, background, patience at the time, etc. Fulfilling the goals and needs of a large diversity of users with different profiles, backgrounds, and preferences is also a goal at the core of the universal usability principle [40] and one of the major interaction design challenges. We believe that adaptive chatbots should be investigated as a promising technology to help in this regard.
Directions for Future Research
The construction and deployment of a chatbot for COVID-19 is a dynamic project that demands collaboration among professionals from multiple disciplines, such as health professionals, linguists, technology designers, and developers [16]. The results of our qualitative analysis and discussions provide directions for multidisciplinary teams approaching prospective chatbot projects and are expected to help organize the problem space regarding interaction decisions and issues, thereby helping teams understand users’ needs and expectations in such endeavors.
Limitations
Although our qualitative evaluation of the chatbot included a small sample of 15 participants, they were distributed across genders (7 female and 8 male participants) and ages, which varied from 18 to 62 years. Nonetheless, as the assessment was performed during the pandemic through teleconference, it required participants who had access to computers and good internet bandwidth. Thus, the sample does not represent the variety of educational or economic groups in Brazil. In the future, our goal is to broaden our evaluation to include other groups of our population who represent potential users of the system. Moreover, we did not investigate the perceptions of physicians, nurses, and caregivers regarding the use of this COVID-19 chatbot, including its benefits, challenges, and risks to patients.
This qualitative study was designed to allow the collection of rich, in-depth data containing participants’ thoughts and insights about their experience of using our chatbot. At that time, using chatbots for health purposes was not common in Brazil, and the interviewer provided clarifications during sessions to unblock participants from dead ends, thus enabling us to collect richer and more useful data. All such cases were annotated and considered in the analysis.
Our quantitative assessment was completed by 10.1% (63/622) of the COVID-19 chatbot users, namely those who chose to participate in the evaluation process. Further analysis is needed to test the statistical significance of these results. As the system continues to be used, we expect more users to willingly participate and more data to be collected regarding their attitudes toward the system.
Conclusions
This study evaluated the quality of user experience with a chatbot designed in response to the COVID-19 pandemic by a large telehealth service in Brazil through an analysis of usability with real users and an exploration of strengths and shortcomings of the chatbot, as revealed in reports by participants in simulated scenarios. Our results indicate that overall, users had a positive experience with the chatbot and found the health support relevant. Nonetheless, the qualitative evaluation of the chatbot indicated challenges and directions to be pursued in improving not only our COVID-19 chatbot but also health chatbots in general.
Acknowledgments
The authors thank all the participants of the chatbot assessment as well as chatbot users who took the time to evaluate the chatbot for their contribution to this research. The authors also thank Fernanda Rocha Gonçalves for her contribution to the study design.
This study is supported in part by the Brazilian research agencies Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (grant 88887.516155/2020-00), Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG, grant RED-00081-16), and Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)/Instituto de Avaliação de Tecnologias em Saúde (grant 465518/2014-1).
ALPR was supported in part by Brazilian research agencies CNPq (310790/2021-2 and 465518/2014-1) and FAPEMIG (PPM-00428-17 and RED-00081-16). MSM was supported in part by CNPq (grant 310561/2021-3). ASP has a CNPq (313103/2021-6) and a FAPEMIG (APQ-01.461-14) grant; BAC was supported by a Pró-Reitoria de Pesquisa, Universidade Federal de Minas Gerais, Secretaria de Educação Superior, and Ministério da Educação grant (23072.211119/2020-10); TMP received scholarships from Pró-Reitoria de Extensão from the Federal University of Minas Gerais. ALPR and MSM are members of the National Institute of Science and Technology for Health Technology. The sponsors had no role in the design of this study, its execution, analyses, interpretation of the data, or decision to submit the results.
Abbreviations
- AI: artificial intelligence
- Q&A: question and answer
- RCT: randomized controlled trial
- WHO: World Health Organization
Scenarios for in-depth chatbot evaluation (asymptomatic users).
Script for the Interview conducted with asymptomatic (healthy) participants following their interaction with the chatbot.
Complete version of Table 7.
Complete version of Table 8.
Data Availability
Data are available on reasonable request.
Footnotes
Authors' Contributions: All authors reviewed and edited the manuscript and approved the final version. AB, ASP, BAC, CRAO, ECP, HV, KF, MSM, ROP, and TMP drafted the manuscript. ASP, ALPR, BAC, MSM, ROP, and ZSNR were responsible for the research protocol. ASP, BAC, MSM, and ROP coordinated the study. BAC and ROP were responsible for the application of the questionnaire. BAC, ECP, HV, and KF performed data analysis. CRAO, LBR, MSM, and ZSNR participated in chatbot development and testing. CRAO, LBR, MSM, and ZSNR participated in chatbot implementation.
Conflicts of Interest: None declared.
References
- 1. Al Knawy B, Adil M, Crooks G, Rhee K, Bates D, Jokhdar H, Klag M, Lee U, Mokdad AH, Schaper L, Al Hazme R, Al Khathaami AM, Abduljawad J. The Riyadh Declaration: the role of digital health in fighting pandemics. Lancet. 2020 Nov 14;396(10262):1537–9. doi: 10.1016/S0140-6736(20)31978-4. https://europepmc.org/abstract/MED/32976771
- 2. Zarocostas J. How to fight an infodemic. Lancet. 2020 Feb 29;395(10225):676. doi: 10.1016/S0140-6736(20)30461-X. https://europepmc.org/abstract/MED/32113495
- 3. During this coronavirus pandemic, ‘fake news’ is putting lives at risk: UNESCO. United Nations Office for the Coordination of Humanitarian Affairs (OCHA). 2020 Apr 13. [2022-09-30]. https://reliefweb.int/report/world/during-coronavirus-pandemic-fake-news-putting-lives-risk-unesco
- 4. Morse KE, Ostberg NP, Jones VG, Chan AS. Use characteristics and triage acuity of a digital symptom checker in a large integrated health system: population-based descriptive study. J Med Internet Res. 2020 Nov 30;22(11):e20549. doi: 10.2196/20549. https://www.jmir.org/2020/11/e20549/
- 5. Lai L, Wittbold KA, Dadabhoy FZ, Sato R, Landman AB, Schwamm LH, He S, Patel R, Wei N, Zuccotti G, Lennes IT, Medina D, Sequist TD, Bomba G, Keschner YG, Zhang HM. Digital triage: novel strategies for population health management in response to the COVID-19 pandemic. Healthc (Amst). 2020 Dec;8(4):100493. doi: 10.1016/j.hjdsi.2020.100493. https://europepmc.org/abstract/MED/33129176
- 6. Rapp A, Curti L, Boldi A. The human side of human-chatbot interaction: a systematic literature review of ten years of research on text-based chatbots. Int J Human Comput Stud. 2021 Jul;151:102630. doi: 10.1016/j.ijhcs.2021.102630.
- 7. Tudor Car L, Dhinagaran DA, Kyaw BM, Kowatsch T, Joty S, Theng YL, Atun R. Conversational agents in health care: scoping review and conceptual analysis. J Med Internet Res. 2020 Aug 07;22(8):e17158. doi: 10.2196/17158. https://www.jmir.org/2020/8/e17158/
- 8. Palanica A, Flaschner P, Thommandram A, Li M, Fossat Y. Physicians' perceptions of chatbots in health care: cross-sectional web-based survey. J Med Internet Res. 2019 Apr 05;21(4):e12887. doi: 10.2196/12887. https://www.jmir.org/2019/4/e12887/
- 9. Golinelli D, Boetto E, Carullo G, Nuzzolese AG, Landini MP, Fantini MP. Adoption of digital technologies in health care during the COVID-19 pandemic: systematic review of early scientific literature. J Med Internet Res. 2020 Nov 06;22(11):e22280. doi: 10.2196/22280. https://www.jmir.org/2020/11/e22280/
- 10. Luo B, Lau RY, Li C, Si Y. A critical review of state-of-the-art chatbot designs and applications. Data Min Knowl Discov. 2022;12(1):e1434. doi: 10.1002/widm.1434.
- 11. Espinoza J, Crown K, Kulkarni O. A guide to chatbots for COVID-19 screening at pediatric health care facilities. JMIR Public Health Surveill. 2020 Apr 30;6(2):e18808. doi: 10.2196/18808. https://publichealth.jmir.org/2020/2/e18808/
- 12. Hauser-Ulrich S, Künzli H, Meier-Peterhans D, Kowatsch T. A smartphone-based health care chatbot to promote self-management of chronic pain (SELMA): pilot randomized controlled trial. JMIR Mhealth Uhealth. 2020 Apr 03;8(4):e15806. doi: 10.2196/15806. https://mhealth.jmir.org/2020/4/e15806/
- 13. Miner AS, Laranjo L, Kocaballi AB. Chatbots in the fight against the COVID-19 pandemic. NPJ Digit Med. 2020 May 4;3:65. doi: 10.1038/s41746-020-0280-0.
- 14. Almalki M, Azeez F. Health chatbots for fighting COVID-19: a scoping review. Acta Inform Med. 2020 Dec;28(4):241–7. doi: 10.5455/aim.2020.28.241-247. https://europepmc.org/abstract/MED/33627924
- 15. World Health Organization. Monitoring and evaluating digital health interventions: a practical guide to conducting research and assessment. Geneva, Switzerland: World Health Organization; 2016. [2021-05-01]. https://apps.who.int/iris/handle/10665/252183
- 16. Azevedo Chagas B, Ferreguetti K, Ferreira TC, Prates RO, Ribeiro LB, Pagano AS, Reis ZS, Meira Jr W, Ribeiro AL, Marcolino MS. Chatbot as a telehealth intervention strategy in the COVID-19 pandemic: lessons learned from an action research approach. CLEI Electron J. 2021 Dec 13;24(3):6. doi: 10.19153/cleiej.24.3.6. http://www.clei.org/cleiej/index.php/cleiej/article/view/515/421
- 17. Alkmim MB, Marcolino MS, de Oliveira CR, Borges IN, Cardoso CS, Rocha GM, Ribeiro LB, De Sousa LA, Mendes MS, Da Paixão MC, Figueira RM, Ribeiro AL. TeleCOVID-19: a multifaceted strategy from a public Brazilian telehealth service during the COVID-19 pandemic. Stud Health Technol Inform. 2021;277:1–10. doi: 10.3233/SHTI210022.
- 18. Secretaria de Atenção Primária à Saúde (SAPS). Protocolo de Manejo Clínico do Coronavírus (COVID-19) na Atenção Primária à Saúde - Versão 9. Brazil DF: SAPS, Ministério da Saúde; 2020 May. [2021-05-31]. https://www.unasus.gov.br/especial/covid19/pdf/37
- 19. World Health Organization. Clinical care for severe acute respiratory infection: toolkit: COVID-19 adaptation. Geneva, Switzerland: World Health Organization; 2020. [2022-09-30]. https://apps.who.int/iris/handle/10665/331736
- 20. Marcolino MS, Diniz CS, Chagas BA, Mendes MS, Prates R, Pagano A, Ferreira TC, Alkmim MB, Oliveira CR, Borges IN, Raposo MC, Reis ZS, Paixão MC, Ribeiro LB, Rocha GM, Cardoso CS, Ribeiro AL. Synchronous teleconsultation and monitoring service targeting COVID-19: leveraging insights for postpandemic health care. JMIR Med Inform. 2022 Dec 22;10(12):e37591. doi: 10.2196/37591. https://medinform.jmir.org/2022/12/e37591/
- 21. Wardle C. Information disorder: toward an interdisciplinary framework for research and policy making. Council of Europe. 2017. [2022-12-31]. https://edoc.coe.int/en/media/7495-information-disorder-toward-an-interdisciplinary-framework-for-research-and-policy-making.html
- 22. Soriano Marcolino M, Minelli Figueira R, Pereira Afonso Dos Santos J, Silva Cardoso C, Luiz Ribeiro A, Alkmim MB. The experience of a sustainable large scale Brazilian telehealth network. Telemed J E Health. 2016 Nov;22(11):899–908. doi: 10.1089/tmj.2015.0234.
- 23. Take Blip | Automated communication platform. 2022. [2022-09-30]. https://www.take.net/
- 24. Centro de Telessaúde - Hospital das Clínicas UFMG. 2022. [2022-09-30]. https://telessaude.hc.ufmg.br/
- 25. PMTO | Prefeitura de Teófilo Otoni. 2022. [2022-09-30]. https://teofilootoni.mg.gov.br/
- 26. UFSJ | Universidade Federal de São João del-Rei. [2022-09-30]. https://www.ufsj.edu.br/covid19/
- 27. Creswell JW. Research Design: Qualitative, Quantitative and Mixed Methods Approaches. 4th edition. Thousand Oaks, CA, USA: Sage Publications; 2014.
- 28. Nielsen J. Usability Engineering. Cambridge, MA, USA: Elsevier Academic Press; 1994.
- 29. Ergonomics of human-system interaction - Part 110: Interaction principles. ISO 9241-110:2020. International Organization for Standardization. 2020. [2022-12-31]. https://www.iso.org/cms/render/live/en/sites/isoorg/contents/data/standard/07/52/75258.html
- 30. Braun V, Clarke V. Using thematic analysis in psychology. Qual Res Psychol. 2006 Jan;3(2):77–101. doi: 10.1191/1478088706qp063oa.
- 31. Ranney ML, Meisel ZF, Choo EK, Garro AC, Sasson C, Morrow Guthrie K. Interview-based qualitative research in emergency care part II: data collection, analysis and results reporting. Acad Emerg Med. 2015 Sep;22(9):1103–12. doi: 10.1111/acem.12735. https://europepmc.org/abstract/MED/26284572
- 32. Denzin NK, Lincoln YS. Handbook of Qualitative Research. 2nd edition. Thousand Oaks, CA, USA: Sage Publications; 2000.
- 33. Turner P, Turner S. Triangulation in practice. Virtual Real. 2009 May 12;13(3):171–81. doi: 10.1007/s10055-009-0117-2.
- 34. Denzin NK. The Research Act: A Theoretical Introduction to Sociological Methods. 2nd edition. New York, NY, USA: McGraw-Hill; 1978.
- 35. QDA Miner Lite - Free Qualitative Data Analysis Software. Provalis Research. [2021-06-30]. https://provalisresearch.com/products/qualitative-data-analysis-software/freeware/
- 36. Abd-Alrazaq A, Safi Z, Alajlani M, Warren J, Househ M, Denecke K. Technical metrics used to evaluate health care chatbots: scoping review. J Med Internet Res. 2020 Jun 05;22(6):e18301. doi: 10.2196/18301. https://www.jmir.org/2020/6/e18301/
- 37. de Barcelos TN, Muniz LN, Dantas DM, Cotrim Jr DF, Cavalcante JR, Faerstein E. Analysis of fake news disseminated during the COVID-19 pandemic in Brazil. Rev Panam Salud Publica. 2021 May 13;45:e65. doi: 10.26633/RPSP.2021.65. https://europepmc.org/abstract/MED/34007263
- 38. Kilsdonk E, Peute LW, Jaspers MW. Factors influencing implementation success of guideline-based clinical decision support systems: a systematic review and gaps analysis. Int J Med Inform. 2017 Feb;98:56–64. doi: 10.1016/j.ijmedinf.2016.12.001.
- 39. Dennis AR, Kim A, Rahimi M, Ayabakan S. User reactions to COVID-19 screening chatbots from reputable providers. J Am Med Inform Assoc. 2020 Nov 01;27(11):1727–31. doi: 10.1093/jamia/ocaa167. https://europepmc.org/abstract/MED/32984890
- 40. Shneiderman B. Universal usability. Commun ACM. 2000 May;43(5):84–91. doi: 10.1145/332833.332843.
- 41. Julião F. Brasil tem mais smartphones que habitantes, aponta FGV. CNN Brasil. 2022 May 26. [2021-06-30]. https://www.cnnbrasil.com.br/business/brasil-tem-mais-smartphones-que-habitantes-aponta-fgv/
- 42. Srivastava B. Did chatbots miss their "Apollo Moment"? Potential, gaps, and lessons from using collaboration assistants during COVID-19. Patterns (N Y). 2021 Aug 13;2(8):100308. doi: 10.1016/j.patter.2021.100308. https://linkinghub.elsevier.com/retrieve/pii/S2666-3899(21)00151-3
- 43. Følstad A, Brandtzæg PB. Chatbots and the new world of HCI. Interactions. 2017;24(4):38–42. doi: 10.1145/3085558.
- 44. Brandtzaeg PB, Følstad A. Chatbots: changing user needs and motivations. Interactions. 2018;25(5):38–43. doi: 10.1145/3236669.
- 45. Følstad A, Araujo T, Law EL, Brandtzaeg PB, Papadopoulos S, Reis L, Baez M, Laban G, McAllister P, Ischen C, Wald R, Catania F, Meyer von Wolff R, Hobert S, Luger E. Future directions for chatbot research: an interdisciplinary research agenda. Computing. 2021 Oct 19;103(12):2915–42. doi: 10.1007/s00607-021-01016-7.
- 46. Moore RJ, Arar R. Conversational UX design: an introduction. In: Moore RJ, Szymanski MH, Arar R, Ren GJ, editors. Studies in Conversational UX Design. Cham, Switzerland: Springer; 2018. pp. 1–16.
- 47. Piccolo LS, Mensio M, Alani H. Chasing the chatbots. Proceedings of the 2018 International Conference on Internet Science; INSCI '18; October 24-26, 2018; St. Petersburg, Russia. 2018. pp. 157–69.