Abstract
Background
Online medical education often faces challenges related to communication and comprehension barriers, particularly when the instructional language differs from the healthcare providers' and caregivers' native languages. Our study addresses these challenges within pediatric healthcare by employing generative language models to produce a linguistically tailored, multilingual curriculum that covers the topics of team training, surgical procedures, perioperative care, patient journeys, and educational resources for healthcare providers and caregivers.
Methods
An interdisciplinary group formulated a video curriculum in English, addressing the nuanced challenges of pediatric healthcare. Subsequently, it was translated into Spanish, primarily emphasizing Latin American demographics, utilizing OpenAI's GPT-4. Videos were enriched with synthetic voice profiles of native speakers to uphold the consistency of the narrative.
Results
We created a collection of 45 multilingual video modules, each ranging from 3 to 8 min in length and covering essential topics such as teamwork, how to improve interpersonal communication, “How I Do It” surgical procedures, as well as focused topics in anesthesia, intensive care unit care, ward nursing, and transitions from hospital to home. Through AI-driven translation, this comprehensive collection ensures global accessibility and offers healthcare professionals and caregivers a linguistically inclusive resource for elevating standards of pediatric care worldwide.
Conclusion
This development of multilingual educational content marks a progressive step toward global standardization of pediatric care. By utilizing advanced language models for translation, we ensure that the curriculum is inclusive and accessible. This initiative aligns well with the World Health Organization's Digital Health Guidelines, advocating for digitally enabled healthcare education.
Keywords: pediatric, global health, generative language models, digital health, education, artificial intelligence (AI), curriculum, healthcare
1 Introduction
In pediatric healthcare, each stage of the patient journey—from initial assessments to post-treatment convalescence to eventual safe discharge home—poses its own set of unique challenges. These complexities are universal, transcending geographical and cultural boundaries (1). The COVID-19 pandemic has further spotlighted the crucial impact of cultural and linguistic differences on various aspects of healthcare (2). From determining patient access (3) to affecting continuity of care (4), these elements have been empirically shown to significantly influence health outcomes (5–10), especially during critical health events such as global pandemics (11–13).
Building on the understanding of the role of cultural and linguistic elements, a particularly pressing challenge arises in the domain of healthcare education (1, 14). Caregivers, parents, and healthcare professionals often face communication and comprehension barriers when the predominant language of instruction is not their native language (15). This issue becomes especially acute in pediatric care, where accuracy and in-depth understanding are imperative.
Considering the well-documented gaps in communication within healthcare settings (15, 16) and their subsequent ramifications (4, 17), there is a clear and pressing need for multimedia resources that are not just comprehensive but also culturally and linguistically tailored. In addressing this need, CareWays Collaborative was founded in 2022 with a mission to partner with health teams around the globe to transform the culture and delivery of surgical care so that all patients can achieve the most favorable outcome. This group of healthcare professionals based in Boston and affiliated with Massachusetts Eye and Ear (MEE) and Massachusetts General Hospital (MGH), aims to create and disseminate educational materials that bridge communication gaps and distribute knowledge effectively across diverse healthcare settings. This effort aligns with the World Health Organization's Digital Health Guidelines, which underscores the importance of leveraging digital interventions for health worker training and education (18). In this vein, CareWays Collaborative created video-based educational content aided with artificial intelligence (AI) to ensure both clarity and cultural relevance for various healthcare provider and caregiver demographics.
AI is playing an increasing role in bridging gaps in healthcare education (19, 20). With the emergence of diverse techniques, from computer vision (21, 22) and data analytics (23) to Natural Language Processing (NLP) (24, 25), AI offers promising solutions to longstanding challenges related to language and cultural barriers. Notably, innovative tools such as Nvidia's Riva (26) and state-of-the-art generative language models—including OpenAI's ChatGPT 4 (27), Meta's LLaMA 2 (28), and Microsoft's PaLM 2 (29)—are enabling real-time translation. This advancement not only makes medical education more accessible but also emphasizes the essential role of AI in enhancing these resources.
Drawing on a decade of empirical research, a curriculum has been developed using artificial intelligence and voice cloning technologies to address communication gaps in pediatric healthcare. It offers insights into effective healthcare practices and highlights strategies to mitigate potential adverse events. In the form of an educational video series, the curriculum extends beyond recorded lectures or textbook material by including live demonstrations of important techniques and procedures used in operating rooms and at patients' bedsides. The videos focus on very subspecialized critical care and surgical techniques geared toward critical care nurses, physicians and respiratory therapists, pediatric anesthesiologists and surgeons, and pediatric speech-language pathologists—all crucial members of the specialized team caring for pediatric patients with airway pathology and/or critical illness. Each of these videos has been organized into sub-specialty-specific chapters within the Canvas educational platform to serve as a resource. For example, there are 19 instructional videos for Pediatric Intensivists, including video instruction on “how to tape and secure an endotracheal tube,” “how to dress, access and care for a central venous catheter,” and “how to prevent ventilator-related pneumonias.” These chapters were developed after qualitative interviews with intensivists in low/middle-resource countries (30). By dividing the content into these targeted volumes, we believe that healthcare providers can access the in-depth material they need, tailored to their specific specialty, to address any knowledge gaps they wish to fill. The curriculum complements what you can read in a book and offers greater accessibility than a standard textbook—users can scan a QR code and watch a video for immediate guidance before diagnosing and assessing a patient, after an initial diagnosis when specific questions arise due to a knowledge or educational gap, or before performing a specific procedure.
This study undertakes an exploratory effort to leverage Generative Language Models (GLMs) for translation and voice synthesis to create a more inclusive and accessible educational environment in pediatric healthcare. The integration of software tools with AI functionalities forms a strategic blueprint for creating medical content with various languages and cultural backgrounds. While we initially targeted Spanish due to its global prevalence, the same methodology can be applied to diverse languages. This study primarily aims to provide an in-depth, sub-specialty-specific educational curriculum for critical care nurses, physicians, respiratory therapists, pediatric anesthesiologists, surgeons, and pediatric speech-language pathologists. It is designed to bridge educational gaps identified by their peers and to help improve the quality and care they can provide to their patients.
2 Materials and methods
Guided by the results of a learning behavior assessment survey administered to providers in a Central America-based public children's hospital within a low-resource setting (30), a narrated online video format was selected to ensure learning continuity even after the departure of a mission team. During two recent missions, first to Guatemala and then to Colombia, a comprehensive needs assessment was undertaken by an interdisciplinary team of experienced nurses (15), respiratory therapists (4), speech-language pathologists (2), anesthesiologists (10), intensivists (10), general practitioners (2), and surgeons (8) to identify gaps and process maps that could be addressed through video content (30). Using these insights, concise narrated videos were crafted at Mass General Brigham sites. These videos encapsulate key aspects of pediatric care, such as surgical procedures, perioperative care, patient journeys, and educational resources with best-practice guidelines for patients and caregivers. With durations ranging from 3 to 8 min, these high-quality videos, embedded with QR codes, were made freely available on learning platforms such as Canvas.
To improve accessibility, English narratives embedded in the videos were transcribed utilizing the “Dictate” function of Microsoft Word 365, prioritizing linguistic accuracy and contextual fidelity. Subsequently, OpenAI's Generative Pre-trained Transformer (GPT-4) was employed to facilitate the translation of content into Spanish. Native Spanish-speaking medical professionals, who were selected from the missions and well-versed in the challenges of low-resource settings, meticulously reviewed the translated content to ensure that the tone and sentiment aligned with the original material.
Building on the contextual translation, we focused on transmitting an authentic and relatable auditory experience. We collected voice samples from five candidates, each reciting the “Rainbow Passage” for 2 min. The Rainbow Passage is a standard reading passage used in speech-language pathology and voice assessment. It contains many sounds and intonations found in the English language, which improves the quality of the AI-driven synthetic voices. These recordings, captured via a mobile phone in lossless quality, were processed using the ElevenLabs Python library (elevenLabs_multilingual_v1). The model is derived from an extensive dataset of audio recordings and audiobooks designed to generate synthetic voices that closely emulate human speech characteristics. Voice synthesis was calibrated with tunable parameters emphasizing clarity (60%) and stability (30%). These parameters can be adjusted to fine-tune the synthetic speech output. Once synthesized, the audio was rendered as an mp3 file with a 44.1 KHz sampling rate. This synthesized audio track was integrated into the original video using Apple's iMovie software (v.10.3.8). To achieve optimal synchronization between visual and auditory elements, the playback speed of the synthesized audio was occasionally adjusted using the original English audio track as a reference.
Figure 1 outlines the procedure for creating a 5-min educational video, which typically requires about an hour to produce. A significant part of this time is allocated to achieving narrative consistency. Our rationale for selecting the software tools used in this study was guided by their widespread accessibility, simplicity, and advanced capabilities. Additionally, many offer APIs that facilitate the adaptation of these tools to suit diverse workflows.
3 Results
We have developed a series of videos aimed at enhancing pediatric care, covering crucial areas from surgical interventions to nursing protocols. To enhance accessibility, these videos were translated into Spanish using state-of-the-art generative models, focusing on the primary demographic of North and Latin America, regions where our non-profit CareWays Collaborative frequently operates. Figure 2 displays snapshots from various curriculum chapters. For a detailed breakdown, refer to Supplementary Figure S1.
Subsequent sections outline the objectives and content of individual chapters.
Introduction: Provides an overview of the curriculum, highlighting the motivations and emphasizing the patient's journey across various stages of their care, including initial evaluation, surgery, intensive care, swallowing therapy, safe discharge planning, the transition from hospital to home, including caregiver responsibilities, and patient-caregiver interactions. The introduction is showcased in the English version (Supplementary Video S1) and in the machine-learning-enabled Spanish version (Supplementary Video S2). Although these examples are provided in English and Spanish, the methodology allows the creation of similar content with any other supported languages.
Ward nursing: Discusses transitional patient care from an intensive care unit to a general ward, emphasizing parent-centric care and individualized insights to improve patient outcomes. The nursing introduction here is presented both in the English version (Supplementary Video S3) and the machine-learning-enabled Spanish version (Supplementary Video S4).
Surgery: Offers a curriculum for specific aspects of global health surgery, covering various surgical techniques.
Intensive care unit: Presents a comprehensive guide for intensive care training, detailing best practices for respiratory management, intravenous procedures, and patient transition from the operating room to the intensive care unit (ICU).
Speech therapy: Introduces the role of Speech-Language Pathology in patient care, covering preoperative and postoperative dysphagia screening.
Anesthesia: Focuses on the preparation and execution of bronchoscopy cases, stressing equipment familiarity and best practices.
Patient or caregiver: Provides a tailored educational approach for caregivers and patients, covering the entire journey of surgery and postoperative care, aiming to prepare them emotionally and intellectually for the medical process and for parental care responsibilities after discharge.
4 Discussion
Global healthcare presents universal challenges that go beyond language and cultural boundaries. This study aims to address these challenges by creating a comprehensive curriculum tailored for a true global collaborative where education can be shared across various languages. The curriculum is built on a decade of research into effective healthcare practices. With a particular focus on Latin America, it emphasizes the importance of a holistic, patient-centered approach to global healthcare. It covers various topics, from surgery and intensive care to guidance for parents and caregivers. This broad scope ensures that the educational material has the potential to significantly impact healthcare professionals and caregivers by providing them with tailored information.
A key innovation of this study is the use of artificial intelligence, specifically generative language models, to surmount existing language and cultural barriers. While our initial focus is on Spanish translations, integrating GLMs allows an easy transition to other languages and cultures. We plan to extend our curriculum to include French, Portuguese, and Arabic translations, aiming for an even greater linguistic and cultural footprint. Additionally, our animated content is designed to mirror global diversity, creating a universal connection that is relatable for learners worldwide. However, incorporating AI into the curriculum development requires oversight due to the potential for inaccuracies and the lack of validation mechanisms (ground truth). In this study, we use native speakers to maintain linguistic precision and cultural nuances.
To evaluate the potential of translations produced by these GLMs, we randomly selected three surgical videos translated by AI and had their original English versions reviewed by medical translators. The translations from the medical professionals and the AI were then assessed by a Spanish-speaking surgeon in Colombia. She reported that both sets of translated videos would provide equal value in supporting her in treating her patients. The Spanish translations completed by the medical professionals were subsequently re-translated into English using ChatGPT and evaluated by a surgeon in Boston. He determined that both the re-translated videos and the original English source material, which had undergone translation and back-translation, were equally informative and offered the same level of support for treating his patients.
There are several potential avenues for enhancing the current workflow. Firstly, the utilization of GLMs for translation can be refined by parallelizing the process and incorporating efficient speech-to-text tools, such as open-source solutions like SpeechBrain, DeepSpeech, or APIs like AWS Transcribe, known for its expertise with medical content. Secondly, the text-to-speech translation can be accelerated by leveraging solutions like ElevenLabs or IBM Watson TTS API.
In conclusion, generative language models can potentially transform global healthcare education, provided they are applied with the necessary safeguards, including data privacy measures and ethical use (31). Our initiative leverages machine learning to create educational resources that are both globally relevant and locally sensitive.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
FB: Formal analysis, Methodology, Visualization, Writing—original draft. EH: Formal analysis, Methodology, Visualization, Writing—review & editing. EZ: Formal analysis, Project administration, Validation, Visualization, Writing—review & editing. CHe: Formal analysis, Resources, Visualization, Writing—review & editing. KC: Methodology, Validation, Visualization, Writing—review & editing. PV: Formal analysis, Methodology, Visualization, Writing—review & editing. PY: Conceptualization, Formal analysis, Methodology, Supervision, Visualization, Writing—review & editing. CHa: Conceptualization, Formal analysis, Methodology, Supervision, Validation, Visualization, Writing—review & editing.
Funding Statement
The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpubh.2024.1337395/full#supplementary-material
References
- 1.Marmot M, Friel S, Bell R, Houweling TA, Taylor S, Commission Commission on Social Determinants of Health . Closing the gap in a generation: health equity through action on the social determinants of health. Lancet. (2008) 372:1661–69. 10.1016/S0140-6736(08)61690-6 [DOI] [PubMed] [Google Scholar]
- 2.Blumenthal D, Fowler EJ, Abrams M, Collins SR. COVID-19—implications for the health care system. N Engl J Med. (2020) 383:1483–8. 10.1056/NEJMsb2021088 [DOI] [PubMed] [Google Scholar]
- 3.Newacheck PW, Hughes DC, Stoddard JJ. Children's access to primary care: differences by race, income, and insurance status. Pediatrics. (1996) 97:26–32. [PubMed] [Google Scholar]
- 4.Stevens GD, Seid M, Mistry R, Halfon N. Disparities in primary care for vulnerable children: the influence of multiple risk factors. Health Serv Res. (2006) 41:507–31. 10.1111/j.1475-6773.2005.00498.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Smedley BD, Stith AY, Nelson AR. Unequal Treatment: Confronting Racial and Ethnic Disparities in Health Care. Washington, DC: National Academies Press; (2003). 10.17226/12875 [DOI] [PubMed] [Google Scholar]
- 6.Chan KS, Keeler E, Schonlau M, Rosen M, Mangione-Smith R. How do ethnicity and primary language spoken at home affect management practices and outcomes in children and adolescents with asthma? Arch Pediatr Adolesc Med. (2005) 159:283–9. 10.1001/archpedi.159.3.283 [DOI] [PubMed] [Google Scholar]
- 7.Muñoz-Price LS, Nattinger AB, Rivera F, Hanson R, Gmehlin CG, Perez A, et al. Racial disparities in incidence and outcomes among patients with COVID-19. JAMA Netw Open. (2020) 3:e2021892. 10.1001/jamanetworkopen.2020.21892 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Al Shamsi H, Almutairi AG, Al Mashrafi S, Al Kalbani T. Implications of language barriers for healthcare: a systematic review. Oman Med J. (2020) 35:1–7. 10.5001/omj.2020.40 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Cohen S, Moran-Ellis J, Smaje C. Children as informal interpreters in GP consultations: pragmatics and ideology. Sociol Health Illn. (1999) 21:163–86. 10.1111/1467-9566.00148 [DOI] [Google Scholar]
- 10.Jacobs EA, Shepard DS, Suaya JA, Stone E-L. Overcoming language barriers in health care: costs and benefits of interpreter services. Am J Public Health. (2004) 94:866–9. 10.2105/AJPH.94.5.866 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ryan JM. COVID-19. London: Routledge; (2021). 10.4324/9781003155911 [DOI] [Google Scholar]
- 12.Lemkow–Tovías G, Lemkow L, Cash-Gibson L, Teixidó-Compaño E, Benach J. Impact of COVID-19 inequalities on children: an intersectional analysis. Sociol Health Illn. (2022) 45:145–62. 10.1111/1467-9566.13557 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Mishra V, Seyedzenouzi G, Almohtadi A, Chowdhury T, Khashkhusha A, Axiaq A, et al. Health inequalities during COVID-19 and their effects on morbidity and mortality. J Healthc Leadersh. (2021) 13:19–26. 10.2147/JHL.S270175 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.MacLachlan, M. Culture and Health: A Critical Perspective towards Global Health, 2nd ed. Hoboken, NJ: John Wiley and Sons Ltd; (2006). [Google Scholar]
- 15.Cohen AL, Christakis DA. Primary language of parent is associated with disparities in pediatric preventive care. J Pediatr. (2006) 148:254–8. 10.1016/j.jpeds.2005.10.046 [DOI] [PubMed] [Google Scholar]
- 16.Becker Herbst R, Margolis KL, Millar AM, Muther EF, Talmi A. Lost in translation: identifying behavioral health disparities in pediatric primary care. J Pediatr Psychol. (2016) 41:481–91. 10.1093/jpepsy/jsv079 [DOI] [PubMed] [Google Scholar]
- 17.Flores G, Tomany-Korman SC. The language spoken at home and disparities in medical and dental health, access to care, and use of services in US children. Pediatrics. (2008) 121:e1703–14. 10.1542/peds.2007-2906 [DOI] [PubMed] [Google Scholar]
- 18.World Health Organization . WHO Guideline: Recommendations on Digital Interventions for Health System Strengthening. Geneva World Health Organization (2019). [PubMed] [Google Scholar]
- 19.Seneviratne M, Shah N, Chu L. Bridging the implementation gap of machine learning in healthcare. BMJ Innov. (2019) 6:bmjinnov-2019. 10.1136/bmjinnov-2019-000359 [DOI] [Google Scholar]
- 20.Amisha F, Malik P, Pathania M, Rathaur V. Overview of artificial intelligence in medicine. J Family Med Prim Care. (2019) 8:2328. 10.4103/jfmpc.jfmpc_440_19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Esteva A, Chou K, Yeung S, Naik N, Madani A, Mottaghi A, et al. Deep learning-enabled medical computer vision. NPJ Digit Med. (2021) 4:5. 10.1038/s41746-020-00376-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Khan S, Naseer M, Hayat M, Zamir SW, Khan FS, Shah M. Transformers in vision: a survey. arXiv. (2021). 10.1145/3505244 [DOI] [Google Scholar]
- 23.Lauritsen SM, Kristensen M, Olsen MV, Larsen MS, Lauritsen KM, Jørgensen MJ, et al. Explainable artificial intelligence model to predict acute critical illness from electronic health records Nat Commun. (2020) 11:3852. 10.1038/s41467-020-17431-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Murff HJ, FitzHenry F, Matheny ME, Gentry N, Kotter KL, Crimin K, et al. Automated identification of postoperative complications within an electronic medical record using natural language processing. JAMA. (2011) 306:848–55. 10.1001/jama.2011.1204 [DOI] [PubMed] [Google Scholar]
- 25.Yim W, Yetisgen M, Harris WP, Kwan SW. Natural language processing in oncology: a review. JAMA Oncol. (2016) 2:797–804. 10.1001/jamaoncol.2016.0213 [DOI] [PubMed] [Google Scholar]
- 26.Li J, Lavrukhin V, Ginsburg B, Leary R, Kuchaiev O, Cohen JM, Nguyen H, et al. Jasper: an end-to-end convolutional neural acoustic model. arXiv [Preprint]. arXiv: abs/1904.03288 (2019). [Google Scholar]
- 27.Achiam J, Adler S, Agarwal S, Ahmad L, Akkaya I, Aleman FL, et al. GPT-4 technical report. arXiv [Preprint]. arXiv: abs/2303.08774 (2023). [Google Scholar]
- 28.Touvron H, Martin L, Stone K, Albert P, Almahairi A, Babaei Y, et al. Llama 2: open foundation and fine-tuned chat models. arXiv [Preprint].arXiv: abs/2307.09288 (2023). [Google Scholar]
- 29.Anil R, Dai AM, Firat O, Johnson M, Lepikhin D, Passos A, et al. PaLM 2 technical report. arXiv [Preprint].arXiv: abs/2305.10403 (2023). [Google Scholar]
- 30.Jayawardena ADL, Ghersin ZJ, Mirambeaux M, Bonilla JA, Quiñones E, Zablah E, et al. A sustainable and scalable multidisciplinary airway teaching mission: the operation airway 10-year experience. Otolaryngol Head Neck Surg. (2020) 163:971–8. 10.1177/0194599820935042 [DOI] [PubMed] [Google Scholar]
- 31.Doo FX, Cook TS, Siegel EL, Joshi A, Parekh V, Elahi A, et al. Exploring the clinical translation of generative models like ChatGPT: promise and pitfalls in radiology, from patients to population health. J Am Coll Radiol. (2023) 20:877–85. 10.1016/j.jacr.2023.07.007 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.