Abstract
Objective
The objective of our research is to conduct a comprehensive review that aims to systematically map, describe, and summarize the current utilization of artificial intelligence (AI) in the recruitment and retention of participants in clinical trials.
Materials and Methods
A comprehensive electronic search was conducted using the search strategy developed by the authors. The search encompassed research published in English, without any time limitations, which utilizes AI in the recruitment process of clinical trials. Data extraction was performed using a data charting table, which included publication details, study design, and specific outcomes/results.
Results
The search yielded 5731 articles, of which 51 were included. All the studies were designed specifically for optimizing recruitment in clinical trials and were published between 2004 and 2023. Oncology was the most covered clinical area. Applying AI to recruitment in clinical trials has demonstrated several positive outcomes, such as increasing efficiency, cost savings, improving recruitment, accuracy, patient satisfaction, and creating user-friendly interfaces. It also raises various technical and ethical issues, such as limited quantity and quality of sample size, privacy, data security, transparency, discrimination, and selection bias.
Discussion and Conclusion
While AI holds promise for optimizing recruitment in clinical trials, its effectiveness requires further validation. Future research should focus on using valid and standardized outcome measures, methodologically improving the rigor of the research carried out.
Keywords: artificial intelligence, recruitment and retention, clinical trials, scoping review, benefits and drawbacks
Introduction
Any computer program that shows characteristics indicative of intelligent behavior, such as problem-solving, learning, perception, and reasoning aimed at emulating, augmenting, and expanding human intelligence is considered to be a form of artificial intelligence (AI).1 It encompasses a wide spectrum of realms, such as knowledge discovery, data mining, natural language processing (NLP), computer vision systems, speech recognition, and expert systems.1 Collectively, these technologies form the backbone of AI, driving innovation and transformation across a multitude of industries and applications.2 In recent years, modern AI technology has achieved notable advancements and is actively deployed across the medical domains.3 Medical services stand to gain significant benefits from the integration of AI across various domains such as patient administration, clinical decision support, patient monitoring, and healthcare interventions, thereby fostering the development of AI-augmented health systems.4 The latest frontier in AI’s positive impact on medicine lies in its application in the design, implementation, and analysis of clinical trials. AI technologies are becoming increasingly attractive to the clinical trial industry due to their automated nature, predictive power, and consequent expected increases in efficiency.5 AI has the capability to automatically identify meaningful patterns in large datasets, thereby expediting clinical trials through efficient participant match and comprehensive data analysis. The impact of AI technology on clinical trials has been steadily growing, leading to improved efficiency and reduced costs.6 AI also plays a crucial role in analyzing biomedical information and enhancing the decision-making process for patient recruitment in clinical trials.7 AI has the potential to address the shortcomings in traditional clinical trial design by harnessing the power of knowledge discovery, data mining, and NLP. These methods can be applied to diverse sources, such as electronic health records (EHRs) and medical literature, to improve patient-test matching and recruitment prior to trial initiation, and to enable automated and continuous patient monitoring throughout the trial process. Some recent tools have already shown promising results in terms of system information extraction. For example, researchers used NLP and other scalable methods to create the Clinical Trial Knowledge Base which summarized and standardized the free-text eligibility criteria from more than 350,000 trials registered in ClinicalTrials.gov.8 AI could provide a fast and facilitated approach to internal investigator ranking of patient eligibility, which would speed up site initiation and consequently positively impact recruitment.9 By leveraging AI in clinical trials, researchers can enhance their ability to identify and manage risks, thereby improving overall trial outcomes.
While significant progress is being made in the field of AI for clinical trials, it is crucial to acknowledge the responsibility that accompanies the opportunity to transform the clinical trial cycle through AI.10 The review of relevant regulatory documents indicates that AI is welcomed by regulators as an innovative technology with broad applications and disruptive potential but requires strategic planning for implementation.11 Furthermore, the integration of AI in the recruitment and retention process of clinical trials can give rise to potential technical and ethical concerns.12 For instance, incomplete data or selection bias in their data sources may hinder the accuracy of the AI tools and introduce bias into trial recruitment efforts.13 In light of these considerations, we have developed a comprehensive scoping review that aims to provide a detailed overview, description, and summary of the current status of AI implementation in recruitment and retention in clinical trials. Additionally, we aim to identify the benefits and drawbacks, ethical and technical issues with the application of AI in optimizing recruitment and retention in clinical trials. The ultimate goal of this review is to provide valuable insights and recommendations for the future development of AI in clinical trials.
Materials and methods
A comprehensive scoping review was conducted, encompassing both peer-reviewed and grey literature, to explore the use of AI in optimizing recruitment and retention in clinical trials. The review followed the methodological framework provided by Joanna Briggs Institute (JBI).14 To ensure transparency and adherence to reporting standards, the preferred reporting items for systematic reviews and meta-analyses extension for scoping reviews (PRISMA-ScRs) served as a guide for reporting the review15 (see Additional file S1).
Protocol registration
The study protocol was registered in Open Science Framework, an international platform for registration of reviews (https://osf.io/c9yhk/), and a paper was published in BMJ Open (doi: 10.1136/bmjopen-2023-080032).16
Stage 1: identifying the research question
Based on the objectives of this scoping review, we have formulated the following questions.
What specific types of clinical trials are AI tools focused on recruitment and retention predominantly applied to, and what are the design details (eg, study design) of those clinical trials?
What are the different types or names of AI tools that are currently available for assisting in optimizing the recruitment and retention process in the studies reviewed?
What are the outcomes, benefits, and drawbacks associated with the use of AI tools in assisting in optimizing recruitment and retention in clinical trials?
What are the ethical and technical issues encountered in the utilization of AI tools to assist in optimizing the recruitment and retention process in clinical trials?
Stage 2: identifying relevant studies
An electronic search was conducted in leading medical and computer science databases, including PubMed, Embase, Scopus, IEEE Xplore, and Web of Science Core Collection (SCI-EXPANDED, SSCI, A&HCI, and ESCI). We developed a comprehensive search strategy in the above databases (see Additional file S2). A search was performed on January 14, 2024 by X.L. English papers published with no time limits were included in this study. The search terms included artificial intelligence, AI, clinical trials, and recruitment and retention. Search terms were different according to different databases. Boolean terms were used to separate the keywords. To compile all relevant evidence sources, identify, remove duplicate records, and screen literature, Covidence was employed.
Stage 3: study selection
We established eligibility criteria to ensure that the selected studies included specific information relevant to our research question.
Inclusion criteria
Studies were included when they met the following criteria:
The study focused on the application of AI technology.
The study specifically addressed the use of AI technology to assist in recruitment or retention in clinical trials.
The study was original research, descriptive study, randomized controlled trial, pilot study, or feasible or acceptable study.
Exclusion criteria
Studies were excluded when they were:
Not available in English.
Did not include the details of AI technology used.
Just simply mentioned AI applications in recruitment and retention.
Focused on clinical trials conducted on AI, rather than the application of AI to clinical trials.
Not application of AI in recruitment and retention in clinical trials, but in other areas, such as AI in clinical trial design
Stage 4: charting the evidence
Covidence was used to identify and remove all duplicate articles. All publications identified according to these criteria went through title and abstract screening and then full-text screening, both of which were conducted independently by X.L. and C.Y. In case of disagreements or discrepancies, a consensus was reached through group discussions involving X.L., C.Y., and L.L.
Stage 5: extracting the data
A data extraction form was developed to systematically gather relevant information from the included articles. C.Y. and L.L. independently extracted the data from the selected studies using a pilot-tested form. In cases of discrepancies, X.L. was consulted to reach a consensus. Characteristics of included studies, including year of publication, country, scope of application, type of AI tools, study design, and aim of the study were extracted using the template in Additional file S3. We also extracted the funding sources of the included studies, presented in Additional file S4.
Stage 6: collating, summarizing, and reporting the results
The data collected were collated and summarized using a basic descriptive approach.17 The content analysis categorized the main outcomes, as well as the technical and ethical issues, benefits, and drawbacks that influence their development, implementation, and assessment.
Results
Selection of resources
The search yielded 5731 articles. A total of 1419 publications were found to be duplicates and were discarded, thus leaving 4312 articles, of which 4176 were excluded after evaluation of the title and abstract. After a full-text reading of the 136 selected articles, a total of 51 were included in the review (see Additional file S4 for key data extracted for each of the 51 studies and funding information), with a PRISMA flow diagram presented in Figure 1.
Figure 1.
The PRISMA-ScR flow diagram shown was used to illustrate the processes involved in conducting this scoping review. These processes mainly consisted of (1) identification: stated the number of articles identified and their respective sources, and provided the number of duplicates found and removed; (2) screening: provided the total number of articles screened and detailed information on the number of full-text articles that have been evaluated as eligible, the number of eligible studies, and the number of excluded studies; and (3) included: the number of studies provided in this section included following quality assessment, data extraction, and thematic analysis, respectively. PRISMA-ScR, preferred reporting items for systematic reviews and meta-analyses extension for scoping review.
Characteristics of included studies
The vast majority of publications had primary authors in the United States (n = 37),9,13,18–52 others were conducted in China (n = 3),53–55 Australia (n = 2),56,57 United Kingdom (n = 2),58,59 Canada (n = 2),60,61 France (n = 2),62,63 followed by Singapore (n = 1),64 Japan (n = 1),65 and Italy (n = 1),66 and the most frequent clinical areas covered were oncology studies (n = 14), diversified clinical trials (n = 10) and Alzheimer’s disease (n = 9). While our literature search yielded publications between 2004 and 2023, most of them were published after 2020 (n = 35). The most frequent study designs were cohort studies (n = 14) and non-randomized controlled trials (n = 14). We classified case-control study, simulation study, comparative study, cross-sectional study, etc as other studies because the number of studies applying these methods is relatively small.
We categorized AI applications into the following 4 main types: (1) Medical tools, which were used here to refer to using a mature AI tool to assist recruitment in clinical trials. Medical tools like Watson for Clinical Trial Matching (WCTM) and Mendel.ai leveraged AI and machine learning (ML) to analyze diverse medical data sources for tasks such as patient selection. They offered features like predictive analytics and personalized medicine recommendations, thereby enhancing clinical trial recruitment workflows; (2) Traditional model-based approaches, which rely on established mathematical or statistical models, such as linear regression or decision trees, to make predictions or decisions. These approaches involved manually selecting features and fitting the model to the data; (3) Deep model-based approaches, modern deep approaches use complex neural network architectures to automatically learn patterns and relationships directly from the data. It does not require manual feature engineering and can capture intricate patterns that traditional models may miss; (4) Other, mixed methods, unknown methods, and unmentioned methods were grouped into this category.
The term “medical tools” in our article refers to commercially available tools like WCTM and Mendel.ai. These tools typically include a user interface designed for end-users, such as clinicians or researchers. They are used not only for clinical trials but also for a variety of other purposes in the medical field, such as patient data analysis and decision support. On the other hand, the “traditional and deep model-based” algorithms mentioned are primarily derived from recent academic research. These models are often in the experimental stage and have been tested in limited settings, such as small-scale pilot studies or controlled environments, rather than being widely deployed in real-world applications. Traditional approaches, deep methods, and medical tools all occasionally leveraged NLP to analyze clinical data. Four traditional methods utilized NLP in our review, where NLP extracted key information from unstructured clinical text, such as EHR, which was then used in statistical or ML models. Two deep methods employed NLP, with NLP being integrated into deep learning (DL) architectures to directly process raw text data, allowing the model to learn representations of the text’s meaning and context. Seven medical tools employed NLP to enhance various aspects of recruitment processes, such as eligibility screening and data extraction. These tools exploited NLP to automatically process and interpret unstructured clinical text from EHR and other medical documents, facilitating more efficient identification of potential trial participants. In all cases, NLP played a crucial role in extracting valuable insights from clinical text.
According to our research, all the AI applications included are solely used for recruitment and not for retention. The deployment of AI applications in real-world settings encompasses various types, including: (1) Implementing AI-based tools to enhance patients’ access to clinical trials. For instance, Jordan et al36 utilized NLP to restructure trial information for efficient search and automated identification of relevant trials for individual patients. Patients interacted with the novel search tool through a free online website and completed a targeted questionnaire customized for each cancer type, focusing on key eligibility criteria that differentiated between trials to facilitate quick matching with relevant trials; (2) Utilizing AI tools for optimizing eligibility screening for clinical trials. For instance, WCTM consists of 3 main components: a trial data intake process, a patient data intake process, and a matching process. The matching process compares trial eligibility and exclusion criteria with patient data to generate a list of qualifying trials for patients. WCTM’s trial and patient data intake components use NLP to process unstructured information. In brief, the sponsor or client provides the trial protocol initially, which is ingested by the system. The output produced from WCTM’s trial intake process details criteria for the trial and provides additional information related to clinical concepts and values with associated WCTM attributes22; (3) Enhancing patient identification. For example, Tissot et al59 applied an algorithm to identify eligible patients using a moving 1-hour time window, they simulated the review of all patients in the intensive care unit (ICU) every hour, looking back on clinical data collected during the previous 24 hours to classify whether the patient had new onset septic shock, the patient was marked as eligible at the earliest timepoint that they fulfilled the selection criteria, and they were not considered eligible for future time points.
Table 1 lists a summary of the key characteristics of the included studies.
Table 1.
Key characteristics of the included studies.
| Characteristics of included studies (N = 51) | ||
|---|---|---|
| Element in study | Categories | Frequency (%) |
| Publication year | ||
| 2023 | 2 (4%) | |
| 2022 | 8 (16%) | |
| 2021 | 15 (29%) | |
| 2020 | 11 (21%) | |
| 2019 | 5 (10%) | |
| 2018 | 1 (2%) | |
| 2017 | 1 (2%) | |
| 2016 | 2 (4%) | |
| 2015 | 3 (6%) | |
| 2011 | 1 (2%) | |
| 2005 | 1 (2%) | |
| 2004 | 1 (2%) | |
| Country | ||
| United States | 37 (72%) | |
| China | 3 (6%) | |
| Australia | 2 (4%) | |
| Canada | 2 (4%) | |
| France | 2 (4%) | |
| United Kingdom | 2 (4%) | |
| Italy | 1 (2%) | |
| Japan | 1 (2%) | |
| Singapore | 1 (2%) | |
| Scope of application | ||
| Alzheimer’s disease | 8 (15%) | |
| Diversified clinical trials | 7 (14%) | |
| Oncology studies | 14 (27%) | |
| Neuroprotective trials | 1 (2%) | |
| Stroke trial | 2 (4%) | |
| Acute respiratory distress syndrome | 1 (2%) | |
| Age-related macular degeneration | 1 (2%) | |
| Diabetes | 1 (2%) | |
| Cardiovascular trial | 2 (4%) | |
| Heart disease | 3 (6%) | |
| Knee osteoarthritis | 1 (2%) | |
| Critical care | 1 (2%) | |
| Non-alcoholic steatohepatitis (NASH) clinical trials | 1 (2%) | |
| Patients diagnosis of RA | 1 (2%) | |
| Emergency department | 2 (4%) | |
| People living with HIV (PLWH) | 1 (2%) | |
| Response-adaptive randomization (RAR) design | 1 (2%) | |
| Schizophrenia | 2 (4%) | |
| Transgender population | 1 (2%) | |
| Type of AI tools | ||
| Medical tools | 10 (20%) | |
| Traditional model-based approaches | 22 (43%) | |
| Deep model-based approaches | 5 (10%) | |
| Other | 14 (27%) | |
| NLP | 13 (25%) | |
| Study design | ||
| Cohort study | 14 (27%) | |
| Non-randomized controlled trial | 14 (27%) | |
| Other | 14 (27%) | |
| Descriptive research | 4 (8%) | |
| Pilot study | 3 (7%) | |
| Randomized controlled trial | 2 (4%) | |
Abbreviations: AI, artificial intelligence; NLP, natural language processing.
Benefits and drawbacks
Table 2 summarizes the benefits and drawbacks that arise during the application of AI. AI systems can be integrated into recruitment in clinical trials in a variety of ways and bring convenience and benefits along with many shortcomings.
Table 2.
Description of benefits and drawbacks.
| Main theme | Subtheme |
|---|---|
| Benefits of AI tools | Increasing efficiency, cost savings, and improving recruitment9,13,19,20–24,26,28–31,33–45,48,49,52–54,56–61,65,66 |
| Improving accuracy and predictive power9,19,20–29,31,33,36,38,39,41,42,44,48,49,51,57,59,61 | |
| Create user-friendly interfaces and improve patient satisfaction58,60 | |
| Allow remote access and monitoring45,55 | |
| Drawbacks of AI tools | Unreliability of results under data constraints9,13,19,22,24–27,29,31,32,34,35,37,39,43,44,47,51,56,57,59,64 |
| Limited generalizability13,18,21–24,29–31,33,35,37,42,44,45,48,49,54,57,61 | |
| The complexity of algorithms and blind trust issues9,28,36,39,43,44,47,48,52,53,56,60,62,64 | |
| Insufficient external validation data and lack of universal guidelines22,27,29,35,40,51,56 | |
| Adds additional costs21,62 |
Abbreviations: AI, artificial intelligence.
One of the main drawbacks of implementing AI was the reliability of results under data constraints. Errors and incompleteness in the data sources used to train the AI algorithms and in the available data for potentially eligible individuals can lead to problematic results. Limited generalization capabilities were also a significant drawback; AI tools developed for specific purposes (eg, breast cancer lexicon31) may perform poorly in dealing with different populations or situations, and the lack of generalizability and transferability will limit the scalability and use of AI tools in clinical trials. It is also important to identify that the complexity of algorithms and blind trust in AI tools could hinder the recruitment process. Incomplete algorithm data training sets may lead to bias in patient inclusion selection,37 and excessive trust tends to overlook the possibility of AI making mistakes. Insufficient external validation and lack of universal guidelines for AI tools, as well as additional costs, are also important considerations. Lack of content quality assessment or task-based evaluation could lead to inaccuracies in AI systems, and the lack of uniform standards for the application of AI to clinical trials may lead to nonstandard application of AI tools and confusion in clinical trial transformation, such as nonstandard informed consent notification.
On a positive note, we have identified some benefits for the effective implementation of AI tools. Increasing efficiency, cost savings, and improving recruitment were essential benefits that can help promote recruitment in clinical trials. We define efficiency as completing tasks with fewer resources,67 which implies spending less time and incurring lower costs to obtain more patients who meet the enrollment criteria.68 For instance, in an ongoing oncology trial, the utilization of Mendel.ai led to a 24%-50% increase in the number of patients accurately identified as potentially eligible compared to the standard practice. Furthermore, using Mendel.ai to determine potential eligibility only takes minutes, while it takes an average of 19 days for standard prescreening of breast cancer patients and 263 days for lung cancer patients.21 Improving accuracy and predictive power was also crucial for involving communities for better acceptance. Creating user-friendly interfaces and improving patient satisfaction were crucial for researchers and other team members in the recruitment process. Allowing remote access and monitoring could provide valuable improvements for clinical trials.
Technical issues
Facilitators
The AI system itself possesses numerous advantages that can promote its application in clinical trial recruitment, and we consider these advantages as its facilitators. Firstly, Mutasa et al30 showed that AI systems allow the computer to automatically construct predictive statistical models, tailored to solve a specific problem subset. For example, the performance of Convolutional Neural Networks (CNNs) has been demonstrated to logarithmically increase with larger datasets, thereby enhancing its effectiveness in clinical trial recruitment. Additionally, utilizing tools like CogStack can potentially reduce the number of patient notes that need manual review by 85%, achieve rapid search and filtering, reducing the workload of manual audit.59 Secondly, Cai et al40 mentioned that by incorporating data from EHR, the algorithm significantly reduced the number of patients requiring manual chart review by 40.5%, while not excluding eligible patients, thereby allowing for a more accurate screening process. Thirdly, Kehl et al39 showed that the heterogeneity of the training and testing data significantly strengthens the model to identify patients. Tomaszewski et al49 showed similar results, by identifying an enriched subset (52%) of patients who had a significantly longer overall survival in cohorts treated with doxorubicin. Finally, the use of algorithms to identify eligible patients from EHR can streamline the recruitment process and reduce the necessity for manual chart review. Conducting consent, intervention, and follow-up remotely enables greater accessibility and participation, particularly for patients who are unable to travel to enroll sites.51
Barriers
However, AI itself also has several shortcomings that could hinder its use in clinical trial recruitment, such as the black box problem, and we consider these shortcomings as its barriers. For example, it should be considered that although using these methods may increase the efficient use of resources dedicated to clinical trial recruitment, unexpected biases may be introduced into the clinical trial cohort.13 Sato et al65 mentioned that AI tools could limit the applicability of the A4-fitted models to younger participants. The limited quantity and quality of sample size (training data) was also an important issue. For example, Lanera et al66 mentioned that the predictive performance of the proposed tool was not optimal for the case study considered due to the limited sample size. Besides, because training a CNN is an end-to-end process, it did not clearly reveal the reasoning behind the final result in a deterministic manner (ie, black box operation)30 and Fink et al32 found that the system did not keep track of temporal changes in the data. Finally, integrating the framework into existing systems and ensuring its usability was also a challenge since it has not been applied in a real-world setting and consumer-oriented search engines for clinical trial registries and/or patient-trial matching systems, and the generalizability of the approach to other common diseases has not been investigated.45
Ethical risks
Despite its many advantages and facilitators, the use of AI in clinical trial recruitment inevitably raised a host of ethical issues.
Privacy and data security were the main ethical issues associated with the use of AI in clinical trial recruitment; 15 studies included in this review mentioned them. For example, AI platforms often require access to sensitive patient data, such as medical images. It is crucial to protect the privacy and confidentiality of these data, ensuring that they are securely stored and only accessed by authorized personnel.25
Informed consent was another ethical issue related to the use of AI tools. Kirshner et al50 mentioned that ethical concerns arise regarding informed consent from patients for the use of their EHR data in research studies. Patients should be adequately informed about how their data will be used and have the option to opt-out if they do not wish to participate. Schwager et al26 showed that EHRs could raise questions about who owns and controls the data. Patients should have control over their own health information and be able to make decisions about how it is used and shared. When using EHR or other health data sources, researchers should consider the ethical implications of data sharing and secondary use. They should obtain necessary permissions and adhere to data protection regulations to ensure that data are used responsibly and for legitimate purposes.59
Transparency was also an important ethical consideration of AI. Studies by Vazquez et al13 and Chen et al18 have shown that it is crucial to ensure that AI-driven patient enrichment is implemented in a fair and responsible manner. Hassan et al25 and 3 other studies13,18,53 showed that the use of AI in clinical trial recruitment should be transparent, with clear communication about how the technology works and the role it plays in the decision-making process. There should also be mechanisms in place to address any concerns or complaints related to transparency and accountability of the AI system.
Eleven studies mentioned fairness, discrimination, and selection bias problems in AI tools. For example, ML models may inadvertently perpetuate biases present in the training data, leading to unfair or discriminatory outcomes.27 Kirshner et al50 mentioned that EHRs rely on accurate and up-to-date information to provide quality care. Ethical concerns arise when there are errors or discrepancies in the data, which can lead to incorrect diagnoses or treatments. Healthcare providers have an ethical responsibility to ensure the accuracy and integrity of the information entered into EHR.
Discussion
Predominant trends of AI application in clinical trial recruitment
The results of our scoping review of studies of the application of AI for optimizing recruitment and retention in clinical trials reveal several predominant trends. Firstly, we have observed a notable increase in publications since 2020, with 11 in 2020 compared to 5 in 2019, and over 80% of articles were published between 2019 and 2023. This indicates an increasing interest in using AI as an adjunct for clinical trial recruitment and this surge can be attributed to 2 main factors. First, the field of AI itself has undergone transformative advancements, particularly in ML and related DL methods, facilitated by hardware enhancements and extensive training datasets.69 Additionally, medical data have become increasingly available in digital format due to technological advancements and public policy initiatives. The large-scale application of medical data, such as retrieval of patient information buried in the EHR, has facilitated the transformation of clinical trials to intelligence. For instance, medical data can support automatic or semi-automatic clinical trial screening, either through a patient-centered approach or in the form of decision support for clinicians, such as point-of-care alerts.59 Second, the outbreak of COVID-19 has driven significant digital transformation within clinical trials.70 There is a heightened interest in leveraging AI and big data analytics across clinical trials, especially for recruitment. This is evidenced by the increasing number of startups operating within this domain, rising agreements for drug development activities, and record levels of funding.71
Secondly, an AI technology needs to be working well before real-world testing and implementation. Good efficiency, accuracy, and patient satisfaction and so on have been demonstrated in our research findings. Based on our review, in comparison to other features, studies have frequently focused on the development of AI for identifying patients who are more suitable or interested in participating in clinical trials. This emphasis may stem from the potential simplification of patient identification through algorithmic automation, which can identify optimal trial participants and input their medical history and details into trial databases.72 Research by Chopra et al71 also demonstrates that AI’s application in patient identification is particularly prominent, especially in the analysis of social media information. AI has the capability to analyze online discussions from patient support groups to detect clusters of illnesses within specific regions. By analyzing population data, AI can aid in identifying individuals who would derive maximum benefit from participating in clinical trials.73 Through evaluating hospital medical data and informing both physicians and patients about potential clinical trial opportunities, AI holds promise for expediting the process of finding suitable participants.71
Interaction between benefits, drawbacks, technical issues, and ethical issues
Cascini et al74 conducted a scoping review of the current landscape of AI-based applications in clinical trials. Their study presented encouraging results on the implementation of AI-based applications to the development of clinical trials, showing that AI-based applications have a lot of potential. In comparison to their review, our research raised new questions, such as technical issues, ethical issues, benefits and drawbacks of AI tools, and so on, each of which were both interdependent and mutually reinforcing. Discrimination and selection bias, for instance, were noted ethical concerns when utilizing AI for recruitment in clinical trials, while also recognized drawbacks in limiting the generalizability because of potential data bias. The security of confidential patient data, in turn, was identified as critical for extending generalizability in the use of AI technology to recruitment and in clinical trials. In addition, the quality and diversity of training data significantly impact the accuracy and reliability of AI models, and insufficient data pertaining to uncommon diseases or conditions may impede the effectiveness of AI models within clinical trial environments.71 One suggestion offered to combat the threat of discrimination and selection bias in AI is the larger sample size and other cohorts, and a longer duration of follow-up or using different subsets.
Furthermore, acquiring informed consent from patients during the deployment of AI algorithms can pose challenges due to their limited comprehension regarding operational mechanisms and data utilization practices. Ensuring transparency and comprehensibility is essential for fostering patient confidence and upholding rigorous ethical standards in relation to these advanced technologies.71 Nevertheless, numerous types of AI such as DL neural networks are often labeled as “black box” models owing to their intricate internal processes that defy easy comprehension or explanation. This lack of interpretability could hinder widespread adoption given that explanations for specific predictions or recommendations made by an AI model are frequently ambiguous.71 It is therefore clear from our review that the aforementioned themes cannot be considered in isolation, but rather must be viewed in relation to one another issues when considering the application of AI in clinical trials.
Lack of application of AI in retention
None of the studies identified addressed retention of participants. An analysis involving 95 clinical trials revealed that close to 40% of patients discontinued their prescribed medication within the initial year.75 The lack of adequate technical infrastructure to manage trial complexities without reliable adherence control and patient monitoring may result in underpowered trials,10 while elevated nonadherence rates pose substantial challenges for sponsors and researchers. Despite the seriousness of the retention problem, our review found no research on applying AI to it. Abiodun et al76 developed a conceptual framework of remote health monitoring in a clinical trial using ML techniques, with various classifications to determine if a participant should be allowed to continue in the trial or not, but did not validate the effect of this framework.
In a recent review article by Blaschke et al,77 various applications of AI were discussed in clinical trial settings. These applications encompass utilizing historical medical data for predicting potential dropouts among patients in order for medical professionals to provide timely intervention and employing AI technology for analyzing video recordings of patients while taking medications aimed at ensuring accurate dosages. Currently, there are few startups focusing specifically on preventing clinical trial dropouts as well as engaging participants, Brite Health stands out as a company actively addressing clinical trial dropout rates through its innovative product which processes both structured and unstructured patient data before delivering tailored messages and alerts designed to sustain participants’ involvement in trials while also identifying individuals at risk of dropping out for prompt intervention.78 Additionally, solutions such as emocha Mobile Health and AiCure aim at enhancing medication adherence through digital methods known as directly observed therapy (DOT), where an AI system monitors patients’ drug intake.78 Notably complex strategies have been necessary even for minor enhancements whereas simple measures often yield limited impact. Furthermore, without continuous reinforcement tactics, all intervention effects tend to diminish over time,79 highlighting the need for multifaceted retention approaches incorporating patient-centric care along with education, dosage monitoring, counseling, prompt packaging, and reminders.80 As patient dropout will likely continue to be a significant problem in the foreseeable future, this area has plenty of opportunities for innovation.
Classical models as dominant subfields, alongside a general transition toward deep models, medical tools, and NLP
The dominant subfields of AI identified by our review mirror trends in AI advances and align with other characteristics of the included studies. More than 40% of the studies reviewed used traditional model-based approaches, but the use of medical tools (20% of studies) and deep model-based approaches (5%) is increasing, reflecting a general shift in AI transitioning from smaller and structured datasets to massive and unstructured one. This trend also aligns with the current development of AI technology. Classical, or “non-deep” ML is more dependent on human intervention to learn; human experts determine the set of features to understand the knowledge within input data, usually requiring more expertise knowledge and structured data to learn.81 However, the driving force behind the maturity of AI research is the availability and quality of data, particularly after the shift toward data-driven deep models.82 Despite the potential for AI to revolutionize clinical trials, it still encounters numerous challenges, with unstructured data being a significant obstacle. For instance, in many clinical studies, faxed requests for patient records remain common, and hospitals often respond with PDFs or photos.83 When a spreadsheet is faxed or converted to a read-only document, much of its original structure is lost. Researchers conducting clinical trials struggle to gather precise information required to assess a patient’s eligibility using the current manual approach.84 Deep models enable systems to learn and integrate feedback on the quality of their analytic output into adapted underlying algorithms. Assistive systems utilizing these AI techniques or subsets thereof can automatically analyze EHR and clinical trial eligibility databases, identify matches between specific patients and recruiting trials, and recommend these matches to doctors and patients.85
Additionally, NLP, a theory-driven range of computational techniques for automatic analysis and representation of human language from various unstructured data types,86 has demonstrated its value in clinical trials. For instance, the use of the IBM Watson Health’s Clinical Trials Matching (CTM) system with screening coordinators facilitated an increase in clinical trial enrollment and promoted awareness of clinical trial opportunities within the lung oncology practice.38 This system employs NLP to abstract patient and trial data from unstructured sources, aiding in matching patients to trials. It has shown positive results in increasing enrollment in clinical trials and raising awareness of clinical trial opportunities within the field of lung oncology. Other studies have demonstrated that the utilization of automated EHR text-mining for identifying suitable participants for cardiovascular trials could result in a 79.9% reduction in the number of patients needed to be screened.87 Additionally, an information extraction approach to the EHR has proven effective in efficiently identifying and subphenotyping patients with heart failure with preserved ejection fraction and other disorders, achieving a sensitivity and positive predictive value of 0.95 and 0.86, respectively.88 NLP integrates computational linguistics to empower computers to understand human language in both text and voice formats, comprehending its full meaning, including the speaker or writer’s intent and sentiment.
The utilization of AI-powered medical tools in clinical trial recruitment, such as WCTM and Mendel.ai, has been increasing, paralleling the trends in DL and NLP. These tools leverage AI to enhance various aspects of trial recruitment, including patient selection and eligibility screening, driven by the growing availability of large-scale digital medical data and the demand for more efficient and accurate recruitment processes. Advances in AI have enabled the development of more sophisticated tools capable of processing extensive datasets with higher precision. Combined with the digitization of healthcare records, regulatory acceptance, and the digital transformation accelerated by the COVID-19 pandemic, it has further fueled their adoption. As these trends continue, the deployment of AI-driven tools is expected to expand, contributing to the broader integration of advanced AI medical tools in clinical trial recruitment process.
Limitations
Although 51 studies were included in our research which described the brief current status of AI application in recruitment in clinical trials, the results of the interventions were too heterogeneous to perform a meta-analysis. By using search terms only in the English language, we might have not been able to identify articles in the national languages of different countries. In our review, no manual search was conducted and grey literature was not included. Due to the nature of this review, there were also heterogeneous populations, variable outcome measures, variable study methods, and methodological limitations, so we were not able to do a formal assessment of the predictive validity of the different AI models. The study found that current studies lack evidence of AI benefits and drawbacks, and it is difficult to draw conclusions and synthesize studies with inconsistent outcome measures, so there is an urgent need for a standardized measurement method to standardize measurement results. Finally, as most of the included studies were conducted in the United States, it is unclear whether the findings of these studies can be generalized to other countries, especially developing countries. However, our work could inform systematic reviews aimed at answering more focused research questions about the use of AI for specific diseases in clinical trials, including which AI techniques are most appropriate given different contexts.
Conclusion
To our knowledge, this review is the first comprehensive, interdisciplinary summary of research on AI application in recruitment and retention in clinical trials. Its 2 fundamental aims were to provide a comprehensive overview of the current status of the use of AI tools in the recruitment and retention process of clinical trials and to identify the benefits and drawbacks and ethical and technical issues that have influenced the application of these tools. Overall, the application of AI in clinical trials is in the early stages of maturity with not a lot of tools ready for widespread implementation. Its effectiveness needs to be further tested, requiring more obvious, higher-quality research evidence and clearer reporting than identified in this scoping review. The application of AI innovation can foster clinical trial transformation and improve clinical trial systems’ efficacy and quality. Future research should focus on using valid and standardized outcome measures, improving the methodological rigor of studies, formulating relevant policies, and issuing corresponding guidance documents to facilitate the engagement of clinical researchers and ethicists in AI-based clinical trials.
Supplementary Material
Acknowledgments
We are most grateful to Kate Nyhan from Yale University for her assistance in designing the search strategy. We are also extremely grateful to Mingan Chen, Zhuolin Lu, and Xiaoting Shi for their assistance in developing the study protocol.
Contributor Information
Xiaoran Lu, Department of Philosophy, School of the Art, University of Liverpool, Liverpool L69 3BX, United Kingdom.
Chen Yang, Department of Philosophy, School of Humanities, Central South University, Changsha, Hunan 410075, P.R. China.
Lu Liang, Department of Philosophy, School of Humanities, Central South University, Changsha, Hunan 410075, P.R. China.
Guanyu Hu, School of Computer Science and Technology, Xi’an Jiaotong University, Xi’an, Shanxi 710049, P.R. China; School of Electronic Engineering and Computer Science, Queen Mary University of London, London E1 4NS, United Kingdom.
Ziyi Zhong, Institute of Life Course and Medical Sciences, University of Liverpool, Liverpool L69 3BX, United Kingdom.
Zihao Jiang, School of Marxism, Shenzhen Polytechnic University, Shenzhen, Guangdong 518055, P.R. China.
Author contributions
Xiaoran Lu led the drafting and writing of the manuscript, Chen Yang contributed to preparing the additional files and reviewing the article. Lu Liang assisted in developing the methodology of the protocol, designing the search strategy, and reviewing the article. Guanyu Hu and Ziyi Zhong provided feedback and critically revised the manuscript for important intellectual content. Zihao Jiang, the senior author, formulated the study conception with Xiaoran Lu, assisted in drafting the protocol, and reviewed the article. All authors read and approved the submitted copy.
Supplementary material
Supplementary material is available at Journal of the American Medical Informatics Association online.
Funding
This work was supported by the Central South University Graduate Independent Innovation Project (2022ZZTS0333). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Conflicts of interest
The authors declare no conflict of interest.
Data availability
The data underlying this article are available in the article and in its online supplementary material.
References
- 1. Lamberti MJ, Wilkinson M, Donzanti BA, et al. A study on the application and use of artificial intelligence to support drug development. Clin Ther. 2019;41:1414-1426. [DOI] [PubMed] [Google Scholar]
- 2. Copeland BJ. Artificial intelligence. Encyclopedia Britannica; 2023. Accessed February 25, 2024. https://www.britannica.com/technology/artificial-intelligence
- 3. Walczak S. The role of artificial intelligence in clinical decision support systems and a classification framework. Int J Comput Clin Pract. 2018;3:31-47. [Google Scholar]
- 4. Reddy S, Fox J, Purohit MP. Artificial intelligence-enabled healthcare delivery. J R Soc Med. 2019;112:22-28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Kolluri S, Lin J, Liu R, Zhang Y, Zhang W. Machine learning and artificial intelligence in pharmaceutical research and development: a review. AAPS J. 2022;24:19. 10.1208/s12248-021-00644-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Mak KK, Pichika MR. Artificial intelligence in drug development: present status and future prospects. Drug Discov Today. 2019;24:773-780. [DOI] [PubMed] [Google Scholar]
- 7. Miao ZQ, Zhang QH, Qian YH, et al. From human intelligence to its implementation model-granular computing theory and method. J Intell Syst. 2016;11:743-757. [Google Scholar]
- 8. Liu H, Chi Y, Butler A, Sun Y, Weng C. A knowledge base of clinical trial eligibility criteria. J Biomed Inform. 2021;117:103771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Gligorijevic J, Gligorijevic D, Pavlovski M, et al. Optimizing clinical trials recruitment via deep learning. J Am Med Inform Assoc. 2019;26:1195-1202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Harrer S, Shah P, Antony B, Hu J. Artificial intelligence for clinical trial design. Trends Pharmacol Sci. 2019;40:577-591. [DOI] [PubMed] [Google Scholar]
- 11. Askin S, Burkhalter D, Calado G, El Dakrouni S. Artificial intelligence applied to clinical trials: opportunities and challenges. Health Technol (Berl). 2023;13:203-213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Oliveira AL. Biotechnology, big data and artificial intelligence. Biotechnol J. 2019;14:e1800613. [DOI] [PubMed] [Google Scholar]
- 13. Vazquez J, Abdelrahman S, Byrne LM, et al. Using supervised machine learning classifiers to estimate likelihood of participating in clinical trials of a de-identified version of ResearchMatch. J Clin Transl Sci. 2020;5:e42. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Peters MDJ, Godfrey C, McInerney P, et al. Chapter 11: scoping reviews (2020 version). In: JBI Manual for Evidence Synthesis. 2020. Accessed February 25, 2024. https://jbi-globalwiki.refined.site/space/MANUAL/4687342/Chapter+11%3A+Scoping+reviews
- 15. Tricco AC, Lillie E, Zarin W, et al. PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation. Ann Intern Med. 2018;169:467-473. 10.7326/M18-0850 [DOI] [PubMed] [Google Scholar]
- 16. Lu X, Chen M, Lu Z, Shi X, Liang L. Artificial intelligence tools for optimising recruitment and retention in clinical trials: a scoping review protocol. BMJ Open. 2024;14:e080032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Peters MDJ, Marnie C, Tricco AC, et al. Updated methodological guidance for the conduct of scoping reviews. JBI Evid Synth. 2020;18:2119-2126. [DOI] [PubMed] [Google Scholar]
- 18. Chen A, Montesano G, Lu R, et al. Visual field endpoints for neuroprotective trials: a case for AI-driven patient enrichment. Am J Ophthalmol. 2022;243:118-124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Ezzati A, Lipton RB, Alzheimer’s Disease Neuroimaging Initiative. Machine learning predictive models can improve efficacy of clinical trials for Alzheimer’s disease. J Alzheimers Dis. 2020;74:55-63. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Miotto R, Weng C. Case-based reasoning using electronic health records efficiently identifies eligible patients for clinical trials. J Am Med Inform Assoc. 2015;22:e141-e150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Calaprice-Whitty D, Galil K, Salloum W, Zariv A, Jimenez B. Improving clinical trial participant prescreening with artificial intelligence (AI): a comparison of the results of AI-assisted vs standard methods in 3 oncology trials. Ther Innov Regul Sci. 2020;54:69-74. [DOI] [PubMed] [Google Scholar]
- 22. Beck JT, Rammage M, Jackson GP, et al. Artificial intelligence tool for optimizing eligibility screening for clinical trials in a large community cancer center. JCO Clin Cancer Inform. 2020;4:50-59. [DOI] [PubMed] [Google Scholar]
- 23. Bieganek C, Aliferis C, Ma S. Prediction of clinical trial enrollment rates. PLoS One. 2022;17:e0263193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Lin M, Gong P, Yang T, et al. Big data analytical approaches to the NACC dataset: aiding preclinical trial enrichment. Alzheimer Dis Assoc Disord. 2018;32:18-27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Hassan AE, Ravi S, Desai S, et al. An artificial intelligence (AI)-based approach to clinical trial recruitment: the impact of Viz RECRUIT on enrollment in the EMBOLISE trial. Interv Neuroradiol. 2023;6:15910199231184604. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Schwager E, Jansson K, Rahman A, et al. Utilizing machine learning to improve clinical trial design for acute respiratory distress syndrome. NPJ Digit Med. 2021;4:133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Maharjan J, Ektefaie Y, Ryan L, et al. Enriching the study population for ischemic stroke therapeutic trials using a machine learning algorithm. Front Neurol. 2021;12:784250. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Reith FH, Mormino EC, Zaharchuk G. Predicting future amyloid biomarkers in dementia patients with machine learning to improve clinical trial patient selection. Alzheimers Dement (N Y). 2021;7:e12212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Yaghy A, Lee AY, Keane PA, et al. Artificial intelligence-based strategies to identify patient populations and advance analysis in age-related macular degeneration clinical trials. Exp Eye Res. 2022;220:109092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Mutasa S, Chang P, Van Sant EP, et al. Potential role of convolutional neural network based algorithm in patient selection for DCIS observation trials using a mammogram dataset. Acad Radiol. 2020;27:774-779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Jung E, Jain H, Sinha AP, Gaudioso C. Building a specialized lexicon for breast cancer clinical trial subject eligibility analysis. Health Informatics J. 2021;27:1460458221989392. [DOI] [PubMed] [Google Scholar]
- 32. Fink E, Kokku PK, Nikiforou S, et al. Selection of patients for clinical trials: an interactive web-based system. Artif Intell Med. 2004;31:241-254. [DOI] [PubMed] [Google Scholar]
- 33. Meystre SM, Heider PM, Cates A, et al. Piloting an automated clinical trial eligibility surveillance and provider alert system based on artificial intelligence and standard data models. BMC Med Res Methodol. 2023;23:88. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Jiang Y, Guarino P, Ma S, et al. Bayesian accrual prediction for interim review of clinical studies: open source R package and smartphone application. Trials. 2016;17:336. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Pakhomov SV, Buntrock J, Chute CG. Prospective recruitment of patients with congestive heart failure using an ad-hoc binary classifier. J Biomed Inform. 2005;38:145-153. [DOI] [PubMed] [Google Scholar]
- 36. Jordan ER, Jahreiss L, Kasi PM. Deploying an artificial intelligence-based online search tool to increase patients’ access to and understanding of solid tumor gastrointestinal clinical trials. J Gastrointest Oncol. 2021;12:2045-2051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Jering KS, Campagnari C, Claggett B, et al. Improving clinical trial efficiency using a machine learning-based risk score to enrich study populations. Eur J Heart Fail. 2022;24:1418-1426. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Leventakos K, Helgeson J, Mansfield A, et al. Effects of an artificial intelligence (AI) system on clinical trial enrollment in lung cancer. J Thorac Oncol. 2019;14:S592. [Google Scholar]
- 39. Kehl KL, Groha S, Lepisto EM, et al. Clinical inflection point detection on the basis of EHR data to identify clinical trial-ready patients with cancer. JCO Clin Cancer Inform. 2021;5:622-630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Cai T, Cai F, Dahal KP, et al. Improving the efficiency of clinical trial recruitment using an ensemble machine learning to assist with eligibility screening. ACR Open Rheumatol. 2021;3:593-600. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Haddad T, Helgeson JM, Pomerleau KE, et al. Accuracy of an artificial intelligence system for cancer clinical trial eligibility screening: retrospective pilot study. JMIR Med Inform. 2021;9:e27767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Ni Y, Beck AF, Taylor R, et al. Will they participate? Predicting patients’ response to clinical trial invitations in a pediatric emergency department. J Am Med Inform Assoc. 2016;23:671-680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Ni Y, Wright J, Perentesis J, et al. Increasing the efficiency of trial-patient matching: automated clinical trial eligibility pre-screening for pediatric oncology patients. BMC Med Inform Decis Mak. 2015;15:28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Ni Y, Kennebeck S, Dexheimer JW, et al. Automated clinical trial eligibility prescreening: increasing the efficiency of patient identification for clinical trials in the emergency department. J Am Med Inform Assoc. 2015;22:166-178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Zhang K, Demner-Fushman D. Automated classification of eligibility criteria in clinical trials to facilitate patient-trial matching for specific patient populations. J Am Med Inform Assoc. 2017;24:781-787. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Wang Y, Carter BZ, Li Z, Huang X. Application of machine learning methods in clinical trials for precision medicine. JAMIA Open. 2022;5:ooab107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Mellem MS, Kollada M, Tiller J, Lauritzen T. Explainable AI enables clinical trial patient selection to retrospectively improve treatment effects in schizophrenia. BMC Med Inform Decis Mak. 2021;21:162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Podichetty JT, Silvola RM, Rodriguez-Romero V, et al. Application of machine learning to predict reduction in total PANSS score and enrich enrollment in schizophrenia clinical trials. Clin Transl Sci. 2021;14:1864-1874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Tomaszewski MR, Fan S, Garcia A, et al. AI-radiomics can improve inclusion criteria and clinical trial performance. Tomography. 2022;8:341-355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Kirshner J, Cohn K, Dunder S, et al. Automated electronic health record-based tool for identification of patients with metastatic disease to facilitate clinical trial patient ascertainment. JCO Clin Cancer Inform. 2021;5:719-727. [DOI] [PubMed] [Google Scholar]
- 51. Yao X, Attia ZI, Behnken EM, et al. Batch enrollment for an artificial intelligence-guided intervention to lower neurologic events in patients with undiagnosed atrial fibrillation: rationale and design of a digital clinical trial. Am Heart J. 2021;239:73-79. [DOI] [PubMed] [Google Scholar]
- 52. Chen L, Gu Y, Ji X, et al. Clinical trial cohort selection based on multi-level rule-based natural language processing system. J Am Med Inform Assoc. 2019;26:1218-1226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Xiong Y, Shi X, Chen S, et al. Cohort selection for clinical trials using hierarchical neural network. J Am Med Inform Assoc. 2019;26:1203-1208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Chen B, Jin H, Yang Z, et al. An approach for transgender population information extraction and summarization from clinical trial text. BMC Med Inform Decis Mak. 2019;19:62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Dai HJ, Wang FD, Chen CW, et al. Cohort selection for clinical trials using multiple instance learning. J Biomed Inform. 2020;107:103438. [DOI] [PubMed] [Google Scholar]
- 56. Alexander M, Solomon B, Ball DL, et al. Evaluation of an artificial intelligence clinical trial matching system in Australian lung cancer patients. JAMIA Open. 2020;3:209-215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Hassanzadeh H, Karimi S, Nguyen A. Matching patients to clinical trials using semantically enriched document representation. J Biomed Inform. 2020;105:103406. [DOI] [PubMed] [Google Scholar]
- 58. Escudero J, Zajicek JP, Ifeachor E. Selection of subjects for clinical trials in Alzheimer’s disease and mild cognitive impairment with machine learning analysis of MRI and CSF biomarkers. Trials. 2011;12:A18. [DOI] [PubMed] [Google Scholar]
- 59. Tissot HC, Shah AD, Brealey D, et al. Natural language processing for mimicking clinical trial recruitment in critical care: a semi-automated simulation based on the LeoPARDS trial. IEEE J Biomed Health Inform. 2020;24:2950-2959. [DOI] [PubMed] [Google Scholar]
- 60. Tam A, Laurent C, Gauthier S, Dansereau C. Prediction of cognitive decline for enrichment of Alzheimer’s disease clinical trials. J Prev Alzheimers Dis. 2022;9:400-409. [DOI] [PubMed] [Google Scholar]
- 61. Arcia N, Dulawat MPS, Sommerford N, Ghosh B. Identifying appropiate hospitals for patient recruitment in clinical trials for Canada using machine learning algorithms. Value Health. 2020;23:S104-S105. [Google Scholar]
- 62. Delorme J, Charvet V, Wartelle M, et al. Natural language processing for patient selection in phase I or II oncology clinical trials. JCO Clin Cancer Inform. 2021;5:709-718. [DOI] [PubMed] [Google Scholar]
- 63. Ansart M, Epelbaum S, Gagliardi G, et al. ; Alzheimer’s Disease Neuroimaging Initiative and the INSIGHT-preAD study. Reduction of recruitment costs in preclinical AD trials: validation of automatic pre-screening algorithm for brain amyloidosis. Stat Methods Med Res. 2020;29:151-164. [DOI] [PubMed] [Google Scholar]
- 64. Leow WQ, Bedossa P, Liu F, et al. An improved qFibrosis algorithm for precise screening and enrollment into non-alcoholic steatohepatitis (NASH) clinical trials. Diagnostics (Basel). 2020;10:643. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Sato K, Ihara R, Suzuki K, et al. Predicting amyloid risk by machine learning algorithms based on the A4 screen data: application to the Japanese trial-ready cohort study. Alzheimers Dement (N Y). 2021;7:e12135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Lanera C, Berchialla P, Lorenzoni G, et al. A SuperLearner approach to predict run-in selection in clinical trials. Comput Math Methods Med. 2022;2022:4306413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Jongsma KR, Sand M, Milota M. Why we should not mistake accuracy of medical AI for efficiency. npj Digit Med. 2024;7:57. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Ismail A, Al-Zoubi T, El Naqa I, Saeed H. The role of artificial intelligence in hastening time to recruitment in clinical trials. BJR Open. 2023;5:20220023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69. Wang F, Casalino LP, Khullar D. Deep learning in medicine-promise, progress, and challenges. JAMA Intern Med. 2019;179:293-294. [DOI] [PubMed] [Google Scholar]
- 70.COVID-19 digital transformation & technology. McKinsey & Company. Accessed July 28, 2024. https://www.mckinsey.com/capabilities/strategy-and-corporate-finance/our-insights/how-covid-19-haspushed-companies-over-the-technology-tipping-point-and-transformed
- 71. Chopra H, Shin DK, Munjal K, et al. Revolutionizing clinical trials: the role of AI in accelerating medical breakthroughs. Int J Surg. 2023;109:4211-4220. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72. Hardman TC, Aitchison R, Scaife R, Edwards J, Slater G. The future of clinical trials and drug development: 2050. Drugs Context. 2023;12:2023-2-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Woo M. An AI boost for clinical trials. Nature. 2019;573:S100-S102. [DOI] [PubMed] [Google Scholar]
- 74. Cascini F, Beccia F, Causio FA, et al. Scoping review of the current landscape of AI-based applications in clinical trials. Front Public Health. 2022;10:949377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Hutson M. How AI is being used to accelerate clinical trials. Nature. 2024;627:S2-S5. [DOI] [PubMed] [Google Scholar]
- 76. Abiodun TN, Okunbor D, Osamor VC. Remote health monitoring in clinical trial using machine learning techniques: a conceptual framework. Health Technol (Berl). 2022;12:359-364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77. Blaschke TF, Osterberg L, Vrijens B, Urquhart J. Adherence to medications: insights arising from studies on the unreliable link between prescribed and actual drug dosing histories. Annu Rev Pharmacol Toxicol. 2012;52:275-301. [DOI] [PubMed] [Google Scholar]
- 78. Kaufman J. The innovative startups improving clinical trial recruitment, enrollment, retention, and design. MobiHealthNews [published online November 30, 2018]. Accessed July 28, 2024. https://www.mobihealthnews.com/news/innovative-startups-improving-clinical-trial-recruitment-enrollment-retention-and-design
- 79. Murray MD, Young J, Hoke S, et al. Pharmacist intervention to improve medication adherence in heart failure: a randomized trial. Ann Intern Med. 2007;146:714-725. [DOI] [PubMed] [Google Scholar]
- 80. Christensen A, Osterberg LG, Hansen EH. Electronic monitoring of patient adherence to oral antihypertensive medical treatment: a systematic review. J Hypertens. 2009;27:1540-1551. [DOI] [PubMed] [Google Scholar]
- 81. Coursera Staff. What is machine learning: definition, types, applications and examples. 2023. Accessed February 25, 2024. https://www.potentiaco.com/what-is-machine-learning-definition-types-applications-and-examples/
- 82. Kueper JK, Terry AL, Zwarenstein M, Lizotte DJ. Artificial intelligence and primary care research: a scoping review. Ann Fam Med. 2020;18:250-258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83. Reisman M. EHRs: the challenge of making electronic data usable and interoperable. P T. 2017;42:572-575. [PMC free article] [PubMed] [Google Scholar]
- 84. Chaudhari N, Ravi R, Gogtay NJ, Thatte UM. Recruitment and retention of the participants in clinical trials: challenges and solutions. Perspect Clin Res. 2020;11:64-69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85. Helgeson J, Rammage M, Urman A, et al. Clinical performance pilot using cognitive computing for clinical trial matching at Mayo Clinic. JCO. 2018;36:e18598. [Google Scholar]
- 86. Young T, Hazarika D, Poria S, Cambria E. Recent trends in deep learning based natural language processing. IEEE Comput Intell Mag. 2018;13:55-75. [Google Scholar]
- 87. Van Dijk WB, Fiolet ATL, Schuit E, et al. Text-mining in electronic healthcare records can be used as an efficient tool for screening and data collection in cardiovascular trials: a multicenter validation study. J Clin Epidemiol. 2021;132:97-105. [DOI] [PubMed] [Google Scholar]
- 88. Jonnalagadda SR, Adupa AK, Garg RP, et al. Text mining of the electronic health record: an information extraction approach for automated identification and subphenotyping of HFpEF patients for clinical trials. J Cardiovasc Transl Res. 2017;10:313-321. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data underlying this article are available in the article and in its online supplementary material.

