Abstract
Background
Artificial intelligence (AI)–based therapeutics, devices, and systems are vital innovations in cancer control; particularly, they allow for diagnosis, screening, precise estimation of survival, informing therapy selection, and scaling up treatment services in a timely manner.
Objective
The aim of this study was to analyze the global trends, patterns, and development of interdisciplinary landscapes in AI and cancer research.
Methods
An exploratory factor analysis was conducted to identify research domains emerging from abstract contents. The Jaccard similarity index was utilized to identify the most frequently co-occurring terms. Latent Dirichlet Allocation was used for classifying papers into corresponding topics.
Results
From 1991 to 2018, the number of studies examining the application of AI in cancer care has grown to 3555 papers covering therapeutics, capacities, and factors associated with outcomes. Topics with the highest volume of publications include (1) machine learning, (2) comparative effectiveness evaluation of AI-assisted medical therapies, and (3) AI-based prediction. Noticeably, this classification has revealed topics examining the incremental effectiveness of AI applications, the quality of life, and functioning of patients receiving these innovations. The growing research productivity and expansion of multidisciplinary approaches are largely driven by machine learning, artificial neural networks, and AI in various clinical practices.
Conclusions
The research landscapes show that the development of AI in cancer care is focused on not only improving prediction in cancer screening and AI-assisted therapeutics but also on improving other corresponding areas such as precision and personalized medicine and patient-reported outcomes.
Keywords: scientometrics, cancer, artificial intelligence, global, mapping
Introduction
Background
Every year, over 200 million healthy life years are lost because of cancer, making it one of the highest health care burden causing disability and mortality among men and women [1]. Fortunately, many types of cancers can be prevented or effectively treated if patients are diagnosed in a timely manner and offered optimal therapies. In many parts of the world, however, programs for cancer control and prevention are facing multiple barriers because of limited health service infrastructure, availability of treatment options, and health worker capacities.
Artificial intelligence (AI) is considered a disruptive innovation in health and medicine. Over the past six decades, AI has been widely applied to many areas of medical research and clinical practice. The number of published papers on AI and its impacts has been rapidly growing within the research community over the past decade. A bibliometric study has shown that the number of studies on AI applications in medicine has tripled in the past 3 years, with the highest interest in cancer research [2]. Various techniques, such as robotics, machine learning, and artificial neural networks, have been applied to the study of cancer, showing promising improvements in clinical prediction, treatment, and diagnosis. For instance, machine learning techniques in the application of proteomics and genomics could increase precision in estimating survival and inform the selection of therapies [3]. In large populations, the development and application of AI also holds potential in screening for cancer and scaling up treatment services in a timely manner.
Literature Review
Many approaches and products have been developed to support cancer treatment and for prevention at health facilities and within communities. However, the synthesis of resulting evidence from these efforts is necessary to inform decision making. Some authors have conducted systematic reviews of the performance and effectiveness of AI techniques and products in specific cancers [3-10]. Overall, these reviews found that almost all AI-assisted interventions led to greater effectiveness than conventional approaches. However, insights from these efforts have raised some important points for further exploration. Lisboa et al reviewed predictive models using artificial neural networks and suggested the need for rigorous evaluation of results [4]. In addition, Spelt et al emphasized the importance of justifying the complex structure of datasets and individual factors in these models [5]. Ray et al reviewed the wearable systems for cancer detection and found that cloud computing and long-range communication paradigms are still lacking, and that AI and machine learning should be applied to current products [8]. Other authors affirmed the greater performance of image-based AI applications to breast cancer diagnosis, but few studies have been supported by a high level of evidence. Conducting further clinical research and health technology assessment is recommended.
Objectives
With the rapid development of technologies, AI-based therapeutics, devices, and systems will be vital innovations in cancer control. To accelerate research and development, it is critical to understand current approaches in the applications of AI in cancer care, multiple disciplines involved, and the trends and establishment of the research landscapes. To our knowledge, none of the previous studies have systematically quantified the development of AI in the bibliographic literature of cancer studies. This study analyzes the global trends, patterns, and development of interdisciplinary landscapes in AI and cancer studies.
Methods
Search Strategy
We searched and retrieved all papers related to AI in cancer care on the Web of Science (WOS) that is a Web-based database covering the largest proportion of peer-reviewed literature in this field. The full search strategy has been presented elsewhere [2]. In short, we used a set of predefined search terms related to artificial intelligence and health and medicine to search the WOS for publications (inclusion step) and then excluded those that did not satisfy our eligibility criteria of publication year from 1991 to 2018 and publications other than articles and reviews (exclusion step). In this analysis, we selected all the documents of retrieved data on AI applications related to cancer care.
Data Extraction
We downloaded all data from the WOS database in .txt format, including all information such as author names, paper title, journals, keywords, affiliations of institutions, the prevalence of citation, categories, and abstracts. All of these data were converted to an Excel file (Microsoft Excel, Microsoft Corporation) for checking the data error. A process of standardization was carried out by 2 researchers to bring together the different names of an author. Then, we filtered all downloaded data using the following criteria: (1) not original articles and reviews, (2) not about cancer and AI, and (3) not in English. Any conflict was solved by discussion (Figure 1). The combined dataset was transferred into Stata (version 14.0, STATA Corporation) for further analysis.
Data Analysis
Data were resolved based on basic indicators of publication (number of authors, publication years, and main categories), keywords (most common keywords and co-occurrence keywords), citations, usages, and abstracts. After downloading and extracting the data, we applied the descriptive statistical analysis using Stata to calculate country citations and intercountry collaboration. A network graph illustrating the network of countries by sharing the co-authorships was created, along with the author keyword co-occurrence network and countries network. VOSviewer (version 1.6.8, Center for Science and Technology, Leiden University) was used to establish a co-occurrence network and a countries network. The principles of underlying algorithms used by the software for clustering have been documented elsewhere [11-14] For content analysis of the abstracts, we applied the exploratory factor analysis to identify research domains emerging from all content of the abstracts, loadings of 0.4 [15]. The Jaccard similarity index was utilized to identify research topics or terms most frequently co-occurring with each other [16]. Latent Dirichlet Allocation (LDA) was used for classifying papers into corresponding topics [17-21]. The summary of analytical techniques for each data type is presented in Table 1.
Table 1.
Type of data | Unit of analysis | Analytical methods | Presentations of results |
Authors, keywords, countries | Words | Frequency of co-occurrence | Map of authors keywords clusters |
Abstracts | Words | Exploratory factors analyses | Top 50 constructed research domains; clustering map of the landscapes constructed by these domains |
Abstracts | Papers | Latent Dirichlet Allocation | 10 classifications of research topics |
WOSa classification of research areas | WOS research areas | Frequency of co-occurrence | Dendrogram of research disciplines (WOS classification) |
aWOS: Web of Science.
Results
The Number of Published Items and Publication Trend
There has been a rapid increase in the number of studies applying AI to cancer research from 1991 to 2018. In particular, the research productivity of the past 10 years has accounted for over 90.66% (3223/3555) of the total papers. Rates of citation and usage are also growing fast. The mean usage (downloads) in the past 6 months of papers published in the past 1 to 2 years was twice that of those published in the past 3 to 4 years (Table 2).
Table 2.
Year published | Total number of papers | Total citations | Mean citation rate per yeara | Total usage in the last 6 monthsb | Total usage in the last 5 yearsb | Mean use rate for the last 6 monthsc | Mean use rate for the last 5 yearsd |
2018 | 661 | 809 | 1.22 | 2489 | 3858 | 3.77 | 1.17 |
2017 | 503 | 3206 | 3.19 | 994 | 4663 | 1.98 | 1.85 |
2016 | 435 | 3680 | 2.82 | 431 | 5529 | 0.99 | 2.54 |
2015 | 349 | 4524 | 3.24 | 304 | 3713 | 0.87 | 2.13 |
2014 | 284 | 4131 | 2.91 | 140 | 2914 | 0.49 | 2.05 |
2013 | 268 | 5167 | 3.21 | 118 | 2893 | 0.44 | 2.16 |
2012 | 202 | 4642 | 3.28 | 66 | 1511 | 0.33 | 1.50 |
2011 | 173 | 4706 | 3.40 | 64 | 1150 | 0.37 | 1.33 |
2010 | 146 | 5474 | 4.17 | 51 | 881 | 0.35 | 1.21 |
2009 | 114 | 3550 | 3.11 | 55 | 729 | 0.48 | 1.28 |
2008 | 88 | 3671 | 3.79 | 36 | 478 | 0.41 | 1.09 |
2007 | 68 | 2480 | 3.04 | 24 | 388 | 0.35 | 1.14 |
2006 | 58 | 2324 | 3.08 | 18 | 238 | 0.31 | 0.82 |
2005 | 45 | 1885 | 2.99 | 14 | 219 | 0.31 | 0.97 |
2004 | 26 | 1582 | 4.06 | 6 | 134 | 0.23 | 1.03 |
2003 | 39 | 3115 | 4.99 | 22 | 399 | 0.56 | 2.05 |
2002 | 17 | 3208 | 11.10 | 32 | 297 | 1.88 | 3.49 |
2001 | 15 | 964 | 3.57 | 2 | 75 | 0.13 | 1.00 |
2000 | 18 | 2040 | 5.96 | 11 | 192 | 0.61 | 2.13 |
1999 | 13 | 1043 | 4.01 | 5 | 51 | 0.38 | 0.78 |
1998 | 12 | 548 | 2.17 | 4 | 31 | 0.33 | 0.52 |
1997 | 9 | 420 | 2.12 | 5 | 28 | 0.56 | 0.62 |
1996 | 2 | 52 | 1.13 | 0 | 4 | 0.00 | 0.40 |
1995 | 2 | 297 | 6.19 | 5 | 28 | 2.50 | 2.80 |
1994 | 4 | 172 | 1.72 | 0 | 9 | 0.00 | 0.45 |
1993 | 0 | 0 | 0 | 0 | 0 | 0.00 | 0.00 |
1992 | 3 | 105 | 1.30 | 2 | 6 | 0.67 | 0.40 |
1991 | 1 | 2 | 0.07 | 0 | 2 | 0.00 | 0.40 |
aMean citation rate per year=total citations/(total citations×[2018−that year]).
bTotal usage: total downloads.
cMean use rate for the last 6 months=total usage in the last 6 months/total number of papers.
dMean use rate for the last 5 years=total usage in the last 5 years/(total number of papers×5).
In Table 3, we examine the study settings mentioned in the abstracts of publications. The bibliography included country settings 749 times, and in those, the United States was mentioned 46.5% of the times. Over 90% of the total settings were in developed countries. Noticeably, 2 countries with large populations, China and India, accounted for 3.3% and 4.4%, respectively.
Table 3.
Rank | Country settings | Frequency, n (%) |
1 | United States | 348 (46.5) |
2 | Ireland | 47 (6.3) |
3 | Taiwan | 44 (5.9) |
4 | Japan | 41 (5.5) |
5 | United Kingdom | 37 (4.9) |
6 | India | 33 (4.4) |
7 | China | 25 (3.3) |
8 | Australia | 22 (2.9) |
9 | Italy | 14 (1.9) |
10 | Mali | 12 (1.6) |
11 | Sweden | 11 (1.5) |
12 | Wallis and Futuna | 10 (1.3) |
13 | Germany | 9 (1.2) |
14 | Netherlands | 9 (1.2) |
15 | Poland | 9 (1.2) |
16 | France | 7 (0.9) |
17 | Spain | 7 (0.9) |
18 | Denmark | 6 (0.8) |
19 | Hong Kong | 6 (0.8) |
20 | Canada | 5 (0.7) |
21 | Finland | 5 (0.7) |
22 | Iran | 5 (0.7) |
23 | Singapore | 4 (0.5) |
24 | Belgium | 3 (0.4) |
25 | Brazil | 3 (0.4) |
26 | Egypt | 3 (0.4) |
27 | Israel | 3 (0.4) |
28 | Malaysia | 3 (0.4) |
29 | Turkey | 3 (0.4) |
30 | New Zealand | 2 (0.3) |
31 | Norway | 2 (0.3) |
32 | Antarctica | 1 (0.1) |
33 | Austria | 1 (0.1) |
34 | Georgia | 1 (0.1) |
35 | Greece | 1 (0.1) |
36 | Iceland | 1 (0.1) |
37 | Indonesia | 1 (0.1) |
38 | Jersey | 1 (0.1) |
39 | Jordan | 1 (0.1) |
40 | Pakistan | 1 (0.1) |
41 | Saint Pierre | 1 (0.1) |
42 | Saudi Arabia | 1 (0.1) |
Figure 1 presents the global network among 53 countries having at least five co-authorships with other countries. The range of nodes represents the contribution of each country to the total number of publications, and the thickness of lines indicates the proportion of the volume of collaborations. These countries were classified into 8 clusters depending on their level of international collaborations.
Analyses of keywords and abstract contents provide us with a better understanding of the scopes of studies and development of the research landscapes. Figure 2 describes the co-occurrence of keywords with the most frequent groups of terms. There were 8 major clusters emerging from 180 most frequent keywords with a co-occurrence of 30 times and higher. Some major clusters included the following: Cluster 1 (red) refers to surgery and treatment outcomes; Cluster 2 (green) focuses on the applications of AI techniques in some specific cancers; Cluster 3 (yellow) describes the therapies for colorectal cancers; and Cluster 4 (blue) illustrates applications of chemotherapy and radiotherapy. The colors of the nodes indicate principal components of the data structure; the node size was scaled to the keyword occurrences; and the thickness of the lines is based on the strength of the association between 2 keywords.
As for the content analysis of abstracts, the top 50 emerging research domains are listed in Table 4. AI techniques have been applied to various aspects of cancer research, including therapies (radiotherapy, chemotherapy, and surgery), capacities (prediction, screening, and treatment), and factors associated with outcomes (physical, social, and economic).
Table 4.
Number | Name | Keywords | Eigen value | Cases, n (%) |
1 | Classification; feature selection | Classification; feature; proposed; features; selection; breast; performance; algorithm; diagnosis; classifier; paper; accuracy; machine | 4.71 | 6104 (58.09) |
2 | Disease-free survival | Survival; free; recurrence; follow; disease; local | 2.94 | 3413 (52.69) |
3 | Medical; processing | Medical; processing; information; system; systems | 1.85 | 2218 (42.31) |
4 | Blood loss; hospital stay | Loss; blood; stay; operative; complications; length; hospital; min; postoperative; conversion; perioperative; complication; safe; intraoperative; feasible | 17.16 | 5050 (41.13) |
5 | Prostate; assisted radical | Prostatectomy; prostate; radical; RARPa; localized; men; assisted; radical prostatectomy; robot; Gleason; RALPb; PSAc | 3.97 | 4050 (41.04) |
6 | Gy; radiation dose | Gy; dose; SBRTd; radiation; radiotherapy; therapy; local; body; treated | 5.18 | 2495 (37.16) |
7 | Predict; prediction | Predict; prediction; predictive; models; predicting; prognostic; variables; validation | 2.34 | 2562 (36.71) |
8 | Machine learning | Learning; machine; accuracy | 2.27 | 2398 (36.68) |
9 | Cohort; risk | Cohort; risk; outcome; retrospective | 1.44 | 1759 (36.54) |
10 | PSA; Gleason | PSA; Gleason; specific; biopsy; serum; prostate | 1.38 | 1970 (34.04) |
11 | Early stage | Early; cervical; stage; hysterectomy | 1.48 | 1700 (33.47) |
12 | Evaluate | Evaluate; evaluated; according | 1.37 | 1185 (28.16) |
13 | Training set | Training; set; test; sets; validation | 1.60 | 1573 (27.74) |
14 | Adjuvant chemotherapy | Chemotherapy; adjuvant; therapy; advanced | 1.51 | 1323 (27.34) |
15 | Tumor | Tumor; tumors; size | 1.46 | 1253 (27.12) |
16 | Morbidity and mortality | Mortality; morbidity; rate | 1.36 | 1194 (26.69) |
17 | Staging for endometrial; hysterectomy | Endometrial; hysterectomy; laparotomy; staging; lymphadenectomy; pelvic; cervical; laparoscopy; women | 3.09 | 1759 (25.26) |
18 | Sensitivity and specificity | Specificity; sensitivity; serum; detection; diagnostic | 2.17 | 1405 (23.74) |
19 | Plans; planning | Plans; planning; target; mm; volume; dose; average | 1.53 | 1267 (23.57) |
20 | Cystectomy; bladder | Cystectomy; bladder; RARCe; urinary; radical | 2.83 | 1235 (22.53) |
21 | Case | Cases; case | 1.26 | 913 (22.50) |
22 | Artificial neural | Neural; artificial; network; ANNf; networks | 3.32 | 2060 (22.17) |
23 | Image | Images; image; imaging; deep; CTg; MRIh | 2.59 | 1363 (21.77) |
24 | Quality of life | Life; quality; health; sexual | 2.09 | 1148 (21.55) |
25 | Lymph node | Lymph; node; dissection; nodes; pelvic; lymphadenectomy | 2.47 | 1909 (21.24) |
26 | Safe and feasible | Safe; feasible; procedure | 1.25 | 1020 (20.39) |
27 | Decision support | Support; SVMi; decision; classifier | 1.45 | 1062 (19.86) |
28 | Rectal resection | Rectal; colorectal; resection; conversion | 1.57 | 926 (19.16) |
29 | Oncological and functional; sexual function | Functional; function; sexual; oncological | 1.31 | 903 (19.10) |
30 | Gene expression | Gene; expression; genes; molecular; protein; samples; mutations | 3.24 | 1369 (18.90) |
31 | Purpose | Purpose; materials | 1.51 | 815 (18.71) |
32 | Pathology; reports | Pathology; reports; processing; report | 1.41 | 824 (18.54) |
33 | Women diagnosed | Diagnosed; screening; women | 1.29 | 728 (17.64) |
34 | Transoral; TORSj | Transoral; tors; oropharyngeal; neck; head; HPVk; carcinoma | 3.48 | 1183 (16.48) |
35 | Margin; PTl | Margin; PT; margins; pathologic; Gleason; RALP | 1.80 | 884 (16.15) |
36 | Cost | Cost; costs; care | 1.93 | 712 (16.06) |
37 | Surgeon experience | Experience; surgeons; surgeon | 1.70 | 690 (15.22) |
38 | Small cell lung | Lung; small | 1.29 | 628 (14.91) |
39 | Body mass | Mass; index; body | 1.89 | 823 (14.91) |
40 | Multi drug | Multiple; multi; drug | 1.56 | 588 (14.68) |
41 | Operating curve | Curve; operating; AUCm | 1.68 | 649 (12.52) |
42 | Benign and malignant | Malignant; benign; lesions | 1.59 | 579 (11.14) |
43 | Normal tissue | Tissue; normal | 1.40 | 460 (11.11) |
44 | Imaging (MRI) | MRI; imaging | 1.39 | 435 (10.63) |
45 | Metastases | Metastases; metastasis; liver | 1.39 | 416 (9.51) |
46 | Renal | Renal; partial; sparing | 1.74 | 376 (8.27) |
47 | HPV-negative | Negative; HPV | 1.33 | 312 (8.07) |
48 | Trials | Trials | 1.34 | 239 (6.72) |
49 | Biomarkers | Biomarkers | 1.25 | 171 (4.81) |
50 | Gastrectomy for gastric | Gastric; gastrectomy | 1.77 | 205 (3.63) |
aRARP: robotic-assisted radical prostatectomy.
bRALP: robot assisted laparoscopic prostatectomy.
cPSA: prostate specific antigen.
dSBRT: stereotactic body radiation therapy.
eRARC: remittance advice remark code.
fANN: artificial neural network.
gCT: computed tomography.
hMRI: magnetic resonance imaging.
iSVM: support vector machine.
jTORS: transoral robotic surgery.
kHPV: human papilloma virus.
lPT: prothrombin time.
mAUC: area under the curve.
Figure 3 illustrates the classification of the co-occurrence of research domains into principal components. Primarily, we have the following major landscapes: (1) robotic surgery (blue), (2) AI techniques for detection and prediction (gray), (3) chemotherapy (jade), and (4) radiotherapy (yellow).
In Table 5, we present the research topics that were constructed using LDA. The labels of the topics were manually annotated by scrutinizing the most frequent words and titles for each topic. Topics with the highest volume of publications included (1) machine learning, (2) comparative effectiveness evaluation of AI-assisted medical therapies, and (3) AI-based prediction. Noticeably, this classification has revealed topics examining the incremental effectiveness of AI applications (Topic 2) and, more interestingly, the quality of life outcomes and functioning of patients receiving these innovations. The changes in research productivity over time are illustrated in Figure 4, which shows the rapid growth of Topics 1, 2, 3, and 4, especially in recent years.
Table 5.
Topics | Research areas | Frequency (N=3555), n (%) |
Topic 1 | Machine learning | 824 (23.18) |
Topic 2 | Comparative effectiveness evaluation of AIa-assisted medical therapies | 513 (14.43) |
Topic 3 | AI-based prediction | 456 (12.83) |
Topic 4 | Multidisciplinary care, precision, and personalized medicine | 371 (10.44) |
Topic 5 | Quality of life outcomes, physical and mental health, and functioning | 312 (8.78) |
Topic 6 | Enhanced radiotherapy | 270 (7.59) |
Topic 7 | Robotic surgery | 229 (6.44) |
Topic 8 | AI-assisted imaging and signals | 215 (6.05) |
Topic 9 | Data mining and natural language processing | 183 (5.15) |
Topic 10 | AI and robotic-assisted cancer diagnosis and therapies | 182 (5.12) |
aAI: artificial intelligence.
Figure 5 presents the hierarchical clustering of research disciplines used in AI and cancer research. The horizontal axis of the dendrogram represents the distance or dissimilarity between clusters. The vertical axis represents the research disciplines. It shows that AI applications in cancer care are rooted in the following disciplines: robotics, multidisciplinary engineering, and multidisciplinary sciences. Imaging science and photography was very close to oncology, obstetrics and gynecology, dentistry, radiology, and optics. Those biomedical and clinical aspects account for the major areas of AI application; meanwhile, health service–focused areas, for example, operations and management, are rather distant.
Discussion
Principal Findings
By systematically synthesizing and analyzing the bibliography of AI applications in cancer studies, we have characterized the development of its research landscape over the period from 1991 to 2018. The findings illustrate the rapidly growing research productivity and expansion of multidisciplinary approaches, largely driven by machine learning, artificial neural networks, and AI in various clinical practices. Our analysis highlights the most frequent areas of research and the paucity of research in other areas. The research topics and landscapes constructed show that the development of AI in cancer care is focused on improving prediction in cancer screening and AI-assisted therapeutics and corresponding areas of precision and personalized medicine. Our findings show the rapid growth in these areas over the past decade. Although cancer outcomes of interest covering clinical and physical functioning and mental and quality of life measures are on the rise, our analysis indicates the relative paucity of research focusing on cancer outcomes and survivorship. This is of special relevance, considering the continuously growing cancer survivor population [22].
Comparison With Past Work
This study supplements the previous global mapping on AI in medicine by analyzing the content and characteristics of studies of specific applications of AI in cancer research and clinical practice [2]. Compared with previous reviews, this study is more comprehensive in describing the research trends by applying content analysis and topic modeling [4-10]. Therefore, the findings are helpful to inform the design and priority of the settings of future studies. Classifying information sources and content in corresponding topics to identify priorities for interventions has been widely applied in many studies. For example, previous authors have analyzed newspaper and social media content to understand topics of interest related to breast cancer and secondhand smoking [23-28]. However, none of the previous studies have analyzed the scientific bibliography to determine the development of research landscapes in AI applied in cancer care. Li et al proposed a text-mining framework using LDA to construct topics that were helpful for supporting systematic reviews [29]. In this study, we applied this approach to classify topics that a paper belongs to. Moreover, we further analyzed the frequency of concurrence of terms and their associated clusters using factor analysis. These clusters of terms enrich the understanding of scopes of each topic, especially for diseases involving the development of multidisciplinary research.
The findings from this study help inform the future development of AI applications in cancer research and clinical practices of cancer control and management. First, the difference in citation rates between very recent articles and older articles demonstrates the speed of knowledge accumulation in this area. Understanding the scope of research landscapes helps inform the selection of variables and topics to develop an application or conduct a study. Moreover, the previous bibliometric analysis could only distinguish and determine trends in the applications of AI techniques in cancer care, whereas this study showed that research trends have also expanded to encompass the comparative effectiveness of these innovations compared with traditional practices [2]. In addition, research landscapes have expanded beyond clinics to evaluate the functioning and performance of the patients being treated, in addition to their mental well-being and quality of life. To support this research topic, there should be more exploration of different study settings and incorporation of individual characteristics to improve the validity of AI techniques. One important question is how to integrate and scale-up AI-based applications in cancer care into clinical practice and community prevention. Currently, little is known on the adaptation and integration of AI applications into health systems and communities; future implementation research should be conducted.
Limitations
One of the shortcomings of this study is that we used only WOS databases. Although the WOS covers the greatest proportion of the literature in the field of AI research, it might not be fully representative of all databases. Another limitation is that only documents in English were selected for this study. Finally, the content analysis included only abstracts instead of full texts. Nonetheless, this topic modeling serves to expand, improve, and supplement previous systematic reviews in this field.
Conclusions
In conclusion, AI applications have been rapidly growing in cancer clinical practices, including prediction, diagnosis, enhanced therapeutics, and optimal selection. As interest in AI in medicine continues to grow, it will be increasingly critical to better understand the incremental effectiveness of these innovations and their validities in supporting the performance and quality of life of individuals after getting treated.
Abbreviations
- AI
artificial intelligence
- ANN
artificial neural network
- AUC
area under the curve
- CT
computed tomography
- HPV
human papilloma virus
- LDA
Latent Dirichlet Allocation
- MRI
magnetic resonance imaging
- PSA
prostate specific antigen
- PT
prothrombin time
- RALP
robot assisted laparoscopic prostatectomy
- RARC
remittance advice remark code
- RARP
robotic-assisted radical prostatectomy
- SBRT
stereotactic body radiation therapy
- SVM
support vector machine
- TORS
transoral robotic surgery
- WOS
Web of Science
Footnotes
Conflicts of Interest: None declared.
References
- 1.GBD 2017 DALYs and HALE Collaborators Global, regional, and national disability-adjusted life-years (DALYs) for 359 diseases and injuries and healthy life expectancy (HALE) for 195 countries and territories, 1990-2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018 Nov 10;392(10159):1859–922. doi: 10.1016/S0140-6736(18)32335-3. https://linkinghub.elsevier.com/retrieve/pii/S0140-6736(18)32335-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Tran BX, Vu GT, Ha GH, Vuong QH, Ho MT, Vuong TT, La VP, Ho MT, Nghiem KC, Nguyen HL, Latkin CA, Tam WW, Cheung NM, Nguyen HK, Ho CS, Ho RC. Global evolution of research in artificial intelligence in health and medicine: a bibliometric study. J Clin Med. 2019 Mar 14;8(3):E360. doi: 10.3390/jcm8030360. http://www.mdpi.com/resolver?pii=jcm8030360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Bashiri A, Ghazisaeedi M, Safdari R, Shahmoradi L, Ehtesham H. Improving the prediction of survival in cancer patients by using machine learning techniques: experience of gene expression data: a narrative review. Iran J Public Health. 2017 Feb;46(2):165–72. http://europepmc.org/abstract/MED/28451550. [PMC free article] [PubMed] [Google Scholar]
- 4.Lisboa PJ, Taktak AF. The use of artificial neural networks in decision support in cancer: a systematic review. Neural Netw. 2006 May;19(4):408–15. doi: 10.1016/j.neunet.2005.10.007. [DOI] [PubMed] [Google Scholar]
- 5.Spelt L, Andersson B, Nilsson J, Andersson R. Prognostic models for outcome following liver resection for colorectal cancer metastases: a systematic review. Eur J Surg Oncol. 2012 Jan;38(1):16–24. doi: 10.1016/j.ejso.2011.10.013. [DOI] [PubMed] [Google Scholar]
- 6.Jalalian A, Mashohor SB, Mahmud HR, Saripan MI, Ramli AR, Karasfi B. Computer-aided detection/diagnosis of breast cancer in mammography and ultrasound: a review. Clin Imaging. 2013;37(3):420–6. doi: 10.1016/j.clinimag.2012.09.024. [DOI] [PubMed] [Google Scholar]
- 7.Tucker SR, Speer SA, Peters S. Development of an explanatory model of sexual intimacy following treatment for localised prostate cancer: a systematic review and meta-synthesis of qualitative evidence. Soc Sci Med. 2016 Aug;163:80–8. doi: 10.1016/j.socscimed.2016.07.001. [DOI] [PubMed] [Google Scholar]
- 8.Ray PP, Dash D, De D. A systematic review of wearable systems for cancer detection: current state and challenges. J Med Syst. 2017 Oct 2;41(11):180. doi: 10.1007/s10916-017-0828-y. [DOI] [PubMed] [Google Scholar]
- 9.Sadoughi F, Kazemy Z, Hamedan F, Owji L, Rahmanikatigari M, Azadboni TT. Artificial intelligence methods for the diagnosis of breast cancer by image processing: a review. Breast Cancer (Dove Med Press) 2018;10:219–30. doi: 10.2147/BCTT.S175311. doi: 10.2147/BCTT.S175311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Marka A, Carter JB, Toto E, Hassanpour S. Automated detection of nonmelanoma skin cancer using digital images: a systematic review. BMC Med Imaging. 2019 Feb 28;19(1):21. doi: 10.1186/s12880-019-0307-7. https://bmcmedimaging.biomedcentral.com/articles/10.1186/s12880-019-0307-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Waltman L, van Eck NJ, Noyons EC. A unified approach to mapping and clustering of bibliometric networks. J Informetr. 2010 Oct;4(4):629–35. doi: 10.1016/j.joi.2010.07.002. [DOI] [Google Scholar]
- 12.van Eck NJ, Waltman L. Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics. 2010 Aug;84(2):523–38. doi: 10.1007/s11192-009-0146-3. http://europepmc.org/abstract/MED/20585380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Waltman L, van Eck NJ. A smart local moving algorithm for large-scale modularity-based community detection. Eur Phys J B. 2013 Nov 13;86(11):471. doi: 10.1140/epjb/e2013-40829-0. [DOI] [Google Scholar]
- 14.van Eck NJ, Waltman L. Visualizing bibliometric networks. In: Ding Y, Rousseau R, Wolfram D, editors. Measuring Scholarly Impact: Methods and Practice. New York City: Springer; 2014. pp. 285–320. [Google Scholar]
- 15.de Araújo CC, Pedron CD, Picoto WN. What's behind CRM research? A bibliometric analysis of publications in the CRM research field. J Relatsh Mark. 2018 Apr 2;17(1):29–51. doi: 10.1080/15332667.2018.1440139. [DOI] [Google Scholar]
- 16.Huang A. Similarity Measures for Text Document Clustering. Proceedings of the New Zealand Computer Science Research Student Conference; NZCSRSC'08; April 28-29, 2008; Christchurch, New Zealand. 2008. [Google Scholar]
- 17.Li Y, Rapkin B, Atkinson TM, Schofield E, Bochner BH. Leveraging latent dirichlet allocation in processing free-text personal goals among patients undergoing bladder cancer surgery. Qual Life Res. 2019 Jun;28(6):1441–55. doi: 10.1007/s11136-019-02132-w. http://europepmc.org/abstract/MED/30798421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Valle D, Albuquerque P, Zhao Q, Barberan A, Fletcher Jr RJ. Extending the latent dirichlet allocation model to presence/absence data: a case study on North American breeding birds and biogeographical shifts expected from climate change. Glob Chang Biol. 2018 Nov;24(11):5560–72. doi: 10.1111/gcb.14412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Chen C, Zare A, Trinh HN, Omotara GO, Cobb JT, Lagaunne TA. Partial membership latent dirichlet allocation for soft image segmentation. IEEE Trans Image Process. 2017 Dec;26(12):5590–602. doi: 10.1109/TIP.2017.2736419. [DOI] [PubMed] [Google Scholar]
- 20.Lu HM, Wei CP, Hsiao FY. Modeling healthcare data using multiple-channel latent Dirichlet allocation. J Biomed Inform. 2016 Apr;60:210–23. doi: 10.1016/j.jbi.2016.02.003. https://linkinghub.elsevier.com/retrieve/pii/S1532-0464(16)00025-3. [DOI] [PubMed] [Google Scholar]
- 21.Gross A, Murthy D. Modeling virtual organizations with latent Dirichlet allocation: a case for natural language processing. Neural Netw. 2014 Oct;58:38–49. doi: 10.1016/j.neunet.2014.05.008. [DOI] [PubMed] [Google Scholar]
- 22.Miller KD, Nogueira L, Mariotto AB, Rowland JH, Yabroff KR, Alfano CM, Jemal A, Kramer JL, Siegel RL. Cancer treatment and survivorship statistics, 2019. CA Cancer J Clin. 2019 Jun 11; doi: 10.3322/caac.21565. doi: 10.3322/caac.21565. (epub ahead of print)(forthcoming)(forthcoming) [DOI] [PubMed] [Google Scholar]
- 23.Liu Q, Chen Q, Shen J, Wu H, Sun Y, Ming WK. Data analysis and visualization of newspaper articles on thirdhand smoke: a topic modeling approach. JMIR Med Inform. 2019 Jan 29;7(1):e12414. doi: 10.2196/12414. https://medinform.jmir.org/2019/1/e12414/ [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Tang C, Zhou L, Plasek J, Rozenblum R, Bates D. Comment topic evolution on a cancer institution's Facebook page. Appl Clin Inform. 2017 Aug 23;8(3):854–65. doi: 10.4338/ACI-2017-04-RA-0055. http://europepmc.org/abstract/MED/28832069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Nzali MD, Bringay S, Lavergne C, Mollevi C, Opitz T. What patients can tell us: topic analysis for social media on breast cancer. JMIR Med Inform. 2017 Jul 31;5(3):e23. doi: 10.2196/medinform.7779. https://medinform.jmir.org/2017/3/e23/ [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Westmaas JL, McDonald BR, Portier KM. Topic modeling of smoking- and cessation-related posts to the american cancer society's cancer survivor network (CSN): implications for cessation treatment for cancer survivors who smoke. Nicotine Tob Res. 2017 Aug 1;19(8):952–9. doi: 10.1093/ntr/ntx064. [DOI] [PubMed] [Google Scholar]
- 27.Thackeray R, Burton SH, Giraud-Carrier C, Rollins S, Draper CR. Using Twitter for breast cancer prevention: an analysis of breast cancer awareness month. BMC Cancer. 2013 Oct 29;13:508. doi: 10.1186/1471-2407-13-508. https://bmccancer.biomedcentral.com/articles/10.1186/1471-2407-13-508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Huang Z, Dong W, Ji L, Gan C, Lu X, Duan H. Discovery of clinical pathway patterns from event logs using probabilistic topic models. J Biomed Inform. 2014 Feb;47:39–57. doi: 10.1016/j.jbi.2013.09.003. https://linkinghub.elsevier.com/retrieve/pii/S1532-0464(13)00144-5. [DOI] [PubMed] [Google Scholar]
- 29.Li D, Wang Z, Wang L, Sohn S, Shen F, Murad MH, Liu H. A text-mining framework for supporting systematic reviews. Am J Inf Manag. 2016 Nov;1(1):1–9. doi: 10.11648/j.infomgmt.20160101.11. http://europepmc.org/abstract/MED/29071308. [DOI] [PMC free article] [PubMed] [Google Scholar]