Abstract
The advent of radiomics has revolutionized medical image analysis, affording the extraction of high dimensional quantitative data for the detailed examination of normal and abnormal tissues. Artificial intelligence (AI) can be used for the enhancement of a series of steps in the radiomics pipeline, from image acquisition and preprocessing, to segmentation, feature extraction, feature selection, and model development. The aim of this review is to present the most used AI methods for radiomics analysis, explaining the advantages and limitations of the methods. Some of the most prominent AI architectures mentioned in this review include Boruta, random forests, gradient boosting, generative adversarial networks, convolutional neural networks, and transformers. Employing these models in the process of radiomics analysis can significantly enhance the quality and effectiveness of the analysis, while addressing several limitations that can reduce the quality of predictions. Addressing these limitations can enable high quality clinical decisions and wider clinical adoption. Importantly, this review will aim to highlight how AI can assist radiomics in overcoming major bottlenecks in clinical implementation, ultimately improving the translation potential of the method.
Keywords: radiomics, quantitative imaging biomarkers, artificial intelligence, machine learning, reproducibility, standardization, deep learning, deep radiomics, large language models, convolutional neural networks
Introduction
Radiomics, a rapidly growing field in medical imaging, offers a promising avenue for extracting quantitative features from radiographic images to inform clinical decision-making. Since radiomics endeavours to become integral to personalized medicine, the application of artificial intelligence (AI) holds immense potential in augmenting its capabilities. This article delves into the integration of AI techniques within radiomics workflows, elucidating their roles in enhancing data extraction, preprocessing, model development, and clinical implementation.
Historically, radiomics have faced challenges related to the standardization of imaging protocols, variability in image acquisition, the selection of segmentation methodology, the standardization of extracted features, the methods used for feature selection and the rigorous development and testing of predictive models using these features.1 This is the reason why quality scores such as the Radiomics Quality Score (RQS) was developed in 2017 to assist authors in avoiding methodological errors that can reduce the quality of radiomics analysis.1 Due to the lack of reproducibility of RQS,2 the European Society of Medical Imaging Informatics (EuSoMII) proposed in 2024 the METhodological RadiomICs Score (METRICS)3 as a tool that can assess the quality of radiomics studies and guide the development of high quality radiomics models. The integration of AI into radiomics holds the potential to address the issues and provide more robust and accurate analysis through advanced machine learning (ML) algorithms. Several steps within the process of radiomics analysis can benefit from AI integration, from image segmentation to feature selection4 and model development. Within the realm of data extraction and preprocessing, AI algorithms automate laborious tasks such as region of interest (ROI) segmentation and dimensionality reduction, while also facilitating the exploration of deep radiomics techniques. In model development, AI offers diverse methodologies tailored to the nuances of radiomics data, including traditional ML models, deep learning architectures, and Large Language Models (LLMs). Moreover, as radiomics advances towards clinical implementation, AI-driven approaches can automate processes address critical bottlenecks surrounding validation, reproducibility, and generalizability, thereby accelerating the integration of radiomics into routine clinical practice. Through a comprehensive exploration of AI integration in radiomics research, this article aims to present the transformative potential of AI in reshaping the landscape of radiomics in medical imaging and personalized healthcare and overcoming current barriers in the clinical translation of radiomics.
Steps of radiomics analysis where AI can be applied
The pipeline of radiomics analysis consists of a series of steps that allow the extraction of quantitative data from medical images and the subsequent use of the data for the characterization of disease processes. Radiomics analysis has certain peculiarities which have to do with the type of data used (radiomics features) which are different to the data used for deep learning (whole images) or for applications in other domains of medicine which may utilize clinical data. These peculiarities pertain to the types of features extracted, the methodology for feature extraction, the preprocessing of the images to get quality features, the segmentation of regions of interest to extract features, the selection of relevant features for model building and the use of suitable algorithms for this type of data.5 These issues are unique to radiomics analysis and cannot be found in other types of AI applications encountered in other domains of medicine dealing with different types of data. Radiomics analysis starts from the construction of a suitable dataset of medical images and the preprocessing of these images to ensure image standardization and homogenization prior to radiomics data extraction. Once the dataset has been constructed and the images have been adequately prepared, a ROI needs to be segmented to mark the area where quantitative radiomics data will be extracted from. Radiomics features are then extracted from the selected ROIs and the dataset is subjected to a series of preparation steps to ensure that robust and informative features are used for further analysis. This data preparation step includes the exclusion of collinear features, the evaluation of feature robustness to segmentation and other sources of variability and the selection of the most informative features for model building. Finally, the dataset can be used for predictive model development. AI methods can be used in the majority of the aforementioned radiomics pipeline steps.
Standardizing the acquisition and reconstruction of medical images can be achieved with deep learning. Generative adversarial networks (GANs) can be used for image translation, resolution enhancement, and contrast synthesis while preserving texture for radiomics analysis.6 Such models can enhance dataset homogeneity by generating synthetic images that mimic variations observed in real-world medical imaging datasets. Deep learning can be used for the extraction of radiomics features from medical images (deep radiomics)7 and for semi-automated or automated ROI segmentation.8 ML models can also be used for efficient feature selection in an attempt to reduce the risk of overfitting9 and to ensure robust model generalization.10 Finally, AI can be used for the development of predictive classification11 or regression models using radiomics features. The following section with describe in detail how AI can enhance each of these steps of the radiomics pipeline. An overview of these steps and the contribution of AI can be seen on Figure 1.
Figure 1.
Steps of the radiomics pipeline where AI can be applied. From image acquisition to model training and validation all steps where AI can be used are explained and examples of potential uses are given in the bubbles below each step (created with biorender.com).
AI for data acquisition and preprocessing
AI algorithms for data acquisition and preparation
In radiomics studies, the precision of initial steps is fundamental to the reliability and applicability of outcomes. The process begins with the acquisition of medical images using standardized protocols to maintain consistency across studies and ensure comparability.12 There’s a preference for single modality radiomics to prevent the complexities and potential overfitting associated with multi-modality data, unless multi-modality approaches demonstrate clear superiority.3 It is crucial that acquisition parameters, like slice thickness, contrast agent use, and imaging resolution, align with current clinical practices for the findings to be practically useful.
Biomedical images, obtained through diverse modalities such as MRI, CT, and PET scans, inherently exhibit variability due to differences in acquisition protocols and equipment characteristics and patient factors.13 Standardizing the acquisition and reconstruction processes is paramount to mitigate this variability. Here, AI interventions can prove instrumental: deep neural networks (DNNs) facilitate the generation of synthetic images, thereby ameliorating the effects of MRI acquisition variations. DNNs, particularly convolutional neural networks (CNNs), can learn complex patterns and variations within medical images. By training on diverse datasets, CNNs can adapt to variations in acquisition techniques, equipment, or patient-related, thus mitigating inherent variability in biomedical images. Architectures like GANs and UNet are adept at image translation,14 resolution enhancement, and contrast synthesis.15 It has been shown that denoising of CT images can increase the reproducibility of radiomics features.16 Additionally, these models enhance dataset homogeneity by generating synthetic images that mimic variations observed in real-world medical imaging datasets.
AI algorithms for the automated ROI segmentation
Convolutional neural networks used for segmentation, such as U-Net, have emerged as powerful tools for semantic segmentation tasks in medical imaging.17 U-Net architectures demonstrate remarkable accuracy and reproducibility in ROI segmentation, comparable to or even surpassing manual delineation methods. Moreover, CNNs facilitate accurate contour propagation during image registration, enhancing longitudinal analysis by ensuring consistency across multiple time points. V-Net excels in semantic segmentation and detection tasks, particularly in volumetric image segmentation on CT and PET images. The V-Net architecture, a three-dimensional (3D) CNN, exhibits superior performance in capturing intricate anatomical structures and lesion boundaries,18 crucial for precise ROI delineation. Moreover, cascading 2 V-Nets to form a W-Net architecture enhances segmentation accuracy, particularly for bone-specific lesions, by leveraging the strengths of multiple networks.19
In essence, ML techniques, particularly segmentation CNNs like UNet and V-Net architectures, can revolutionize ROI segmentation in radiomics analysis. By automating segmentation processes, these algorithms streamline workflow, reduce interobserver variability, and enable robust feature selection for comprehensive characterization of imaging data.20 The integration of ML in ROI segmentation underscores its significance in advancing medical imaging and facilitating accurate diagnosis and treatment planning in clinical practice. Deep learning based organ segmentation with tools such as the TotalSegmentator21 and Multiple-Organ Objective Segmentation22 can be used for the construction of open access large scale radiomics datasets.23
AI algorithms for dimensionality reduction and feature selection
Feature selection is the process to strategically balance the extracted features ability to tackle a specific task, while maintaining model simplicity and effectiveness.24 Effective feature selection is essential for avoiding overfitting, enhancing model interpretability, and ensuring the practicality of AI models in radiology. First of all, determining the appropriate number of features is a crucial, often overlooked initial step. This determination could be based on achieving similar performance across training and test datasets, employing statistical methods to identify the optimal feature count, consulting relevant literature. Alternatively, one heuristic rule suggests having 10 samples for each feature25 or n2 cases in the minority class for every n features considered, which can help in maintaining a balance between model complexity and predictive performance.
The application of unsupervised learning techniques, such as clustering or Principal Component Analysis (PCA), in feature selection is nuanced. While these methods can reveal data structure or reduce dimensionality,26 they do not directly align with the predictive or classification goals typical of supervised learning tasks. Their contributions, although valuable for insight, need careful consideration against the specific aims of a radiomics project.
Filter methods evaluate features based on their statistical characteristics, like their correlation with the target variable. The Pearson correlation coefficient is a widely used filter method that identifies features with a strong linear relationship to the target, discarding those with weak correlations. In contrast, the Minimum Redundancy Maximum Relevance method offers greater versatility. It is well-suited for multivariate analyses and accommodates various data types, unlike Pearson, which is tailored for bivariate analyses of continuous variables.27
Wrapper methods, such as Recursive Feature Elimination and Stepwise Selection, assess subsets of features based on a model’s performance. They provide insights into the interactions of features within the context of a predictive model. Although computationally demanding, they can identify the most effective feature combinations due to their model-centric approach. Innovative ML approaches like Boruta are also emerging.28 Boruta enhances the wrapper concept by incorporating the Random Forest (RF) algorithm with shadow features. This ML method evaluates the significance of actual features against random ones, offering a comprehensive and reliable feature selection mechanism.29
Embedded methods on the other hand integrate feature selection within the model training process, with Least Absolute Shrinkage and Selection Operator (LASSO) regression and Elastic Net being prime example. These methods penalize irrelevant features, thereby simplifying the model and reducing overfitting risk. LASSO, for instance, prunes the feature space by eliminating less important coefficients, retaining only impactful ones.30 Decision trees and their ensembles, like RF, inherently perform feature selection by selecting the most informative features for splitting data at each decision node, advantageous for managing complex datasets.
AI algorithms to deal with class imbalance
Class imbalance is an important problem that can significantly affect AI algorithm results. This is even more important in small sample sizes where the low prevalence of one of the examined diseases can be more prominent, creating a bias of the predictions towards the more prevalent disease.31 To address this problem ML and more advanced AI algorithms can be employed. One of the most commonly used methods is Synthetic Minority Oversampling Technique (SMOTE).32 SMOTE creates synthetic samples of the minority class, increasing the number of instances of the minority class in the data, by creating new samples on the line between minority samples that are close to each other in the feature space.32 This strategy has been shown to alleviate bias related to class imbalance. Another approach that is commonly used is Adaptive Synthetic Sampling (ADASYN) which adaptively synthesizes minority samples by calculating a weighted distribution of various minority classes. ADASYN automatically decides how many synthetic cases to create in parts of the dataset where classification is difficult, compensating for scattered data, thus taking into account the peculiarities of the dataset. Ideally these techniques should not be applied to the whole dataset prior to the analysis but should be separately applied to the training set.31 Both SMOTE and ADASYN have been used in radiomics studies.33–36
Modern generative AI methods can be also used for data augmentation, alleviating problems related to class imbalance. GANs can be used for this purpose generating synthetic images to augment a dataset and have been shown to be used for state-of-the-art oversampling.37 Conditional GANs have been used to tackle class imbalance38 and has been also used in radiomics.39
Deep radiomics as an alternative to handcrafted radiomics features
Deep radiomics integrate deep learning with radiomics to automate the extraction of feature representations directly from medical imaging data, thus eliminating the need for manual feature design. This approach relies on CNNs to analyse medical images and identify higher level abstractions that are predictive of various outcomes such as disease characteristics,40 treatment responses,41,42 and overall survival rates.43 The process begins with the use of a deep learning model, most commonly CNNs to extract features that are tailored to the specific dataset and clinical problem. ML techniques are then applied to select the most representative features, which are subsequently used in classification or regression models to address the question at hand.
CNNs are characterized by their ability to process data with grid patterns, such as medical images, and discern spatial hierarchies of features.44 Composed of convolution, pooling, and fully connected layers, CNNs are trained to minimize differences between outputs and ground truth labels through backpropagation and gradient descent. Unlike handcrafted radiomics, CNNs can be used to extract deep radiomics features eliminating the need for hand-crafted feature selection or expert segmentation.45 CNNs trained end-to-end have demonstrated superior performance over ImageNet pre-trained advanced networks in predicting radiomic features, particularly those related to lesion size and maximum/mean intensities, highlighting their potential for advancing diagnostic accuracy and clinical decision-making in radiomics.46 Deep radiomics spans several medical fields, including oncology (specifically GI, liver, and lung cancers), neurology,47 and cardiology,48 to support diagnosis, prognosis, and treatment planning. The goal is to advance personalized medicine by enabling more detailed and accurate patient assessments based on imaging data.
While deep radiomics share some limitations with handcrafted radiomics, such as the predominance of retrospective and uni-institutional studies and the potential lack of feature generalizability, it is distinguished by its robustness. Nonetheless, they require substantial data and computational resources, often necessitating GPUs for efficient processing. Additionally, deep radiomics lack in interpretability when compared to handcrafted radiomics. Such interpretability problems are extremely important for the clinical adoption of radiomics methods.20 To prevent overfitting, deep radiomics requires careful feature selection and can be further improved by incorporating clinical data, other omics data, or handcrafted radiomics, leading to models with enhanced performance.49
AI for model development using radiomics data
Traditional machine learning models utilizing tabular radiomics data
Traditional ML algorithms are commonly used in radiomics research, to build models using radiomics features. This section will focus on some of the most commonly used of these algorithms, presenting advantages and disadvantages of their use.
Support vector machines
Traditional ML models have emerged as powerful tools for analysing radiomics data in medical research and clinical practice. Among these models, Support Vector Machines (SVM) have demonstrated high stability and accuracy,50 outperforming linear regression, k-nearest neighbours, multilayer perceptron (MLP), and light gradient boosting machines in various metrics.51 SVMs adopt a nonparametric strategy, dynamically adjusting their parameters in response to the intricacies of the dataset they are trained on.52 SVM constructs decision boundaries (hyperplanes) between classes in an n-dimensional space (n represents the number of features for every sample in the database) and maximizes margins between these boundaries. In cases where the data is not linearly separable in the original feature space, SVMs use a kernel function to map the data into a higher-dimensional space where it becomes linearly separable. Common kernel functions include linear, polynomial, radial basis function, and sigmoid kernels.53 SVM classification models have been proposed in different clinical scenarios utilizing radiomics such as to distinguish between high-grade gliomas and low-grade gliomas, to distinguish between transient osteoporosis and avascular necrosis of the hip, to identify high risk endoleaks after aortic aneurysm repair, to detect lymph node metastases and many others.54–56
Random forests
Random forests are composed of multiple tree-structured classifiers, where each tree is built independently using a random vector k.57 The random vector k introduces variability into the tree construction process, ensuring that each tree in the forest is unique. During classification, each tree casts a unit vote for the most popular class at a given input. After generating a large number of trees, they collectively vote to determine the final class prediction. These models are effective in discovering interactions and non-linear effects of predictors, making them well-suited for analysing radiomics data.58 In fact, a study of tumour heterogeneity and angiogenesis properties on MRI, using a RF model has demonstrated high AUC and showed promise for non-invasive prediction of prognostic factors in breast cancer.59 However, RFs with deep trees and either no subsampling or excessive subsampling can be inconsistent.60
Gradient boosting
Gradient Boosting operates by sequentially adding weak learners, usually decision trees, to an ensemble, with each tree aiming to correct errors made by the previous ones.61 The model starts with a simple estimator representing the average of target values for regression or the most common class for classification. The weak learner’s predictions are combined with the ensemble’s predictions using a weight determined by a learning rate parameter. This process continues iteratively, with each new weak learner enhancing the overall predictive power of the ensemble by addressing the remaining errors. Ultimately, the final prediction is made by aggregating the predictions of all weak learners, weighted by their corresponding learning rates. This algorithmic framework has been applied to various radiomics scenarios such as the prediction of cardiovascular events and hospital readmissions.62 Regularization techniques such as subsampling, shrinkage, and early stopping mitigate overfitting. Tools like relative variable influence and partial dependence plots offer insights into variable importance and their effects on response, augmenting gradient boosting’s potential in extracting valuable insights from medical data.63 Gradient Boosting algorithms like Extreme Gradient Boosting (XGBoost), LightGBM, and CatBoost are widely used due to their effectiveness in capturing complex relationships in the data.
Extreme Gradient Boosting is an implementation of gradient boosting that is highly optimized and efficient. It constitutes an ensemble learning method for tabular datasets, combines decision trees sequentially using gradient boosting. It allows configuration of hyperparameters such as the number of estimators, maximum tree depth, and minimum child weight.64 XGBoost models that integrate radiomic features and clinical information have shown exceptional performance, surpassing other prediction models like SVM, Generalized Linear Model, and k-nearest neighbour (k-NN).65 XGBoost has been used in a wide variety of clinical scenarios such as the prediction of IDH1 mutation status in gliomas66 and in cases of multi-omics integration scenarios, such as in radio-transcriptomics67 and radio-metabolomics analyses.36
Light Gradient Booster, particularly LightGBM, stands out as a fast, high-performance gradient boosting framework. LightGBM optimizes efficiency by utilizing a decision tree algorithm based on histograms and Gradient-based One-side Sampling, which reduces time and memory usage compared to XGBoost. LightGBM’s leaf-wise algorithm with depth constraints enhances efficiency while preventing overfitting, contrasting with the level-wise decision tree growth strategies commonly used. Its ability to find meaningful node splits and impose a maximum depth limit leaf-wise further enhances efficiency and prevents overfitting.68 A study has shown that among 6 ML algorithms implemented (Linear Regression, RF, SVM, Linear discriminant analysis, LightGBM, and XGBoost.), LightGBM, alongside RF and XGBoost (XGB), emerged as optima ensemble learning techniques.69 The implementations of LightGBM in radiomics range from predicting papillary thyroid microcarcinoma,70 to estimating the haemorrhagic transformation risk in patients who suffered a stroke.71
CatBoost, proposed by Prokhorenkova et al., is an advanced refinement of the Gradient Boosted Decision Trees (GBDT) algorithm.72 Unlike traditional GBDT methods, CatBoost introduces innovations like the “Ordered Target Statistic” technique to efficiently handle high cardinality categorical variables.73 This method encodes categorical variables without risking target leakage, ensuring that no specific training example influences the encoding process. CatBoost excels in heterogeneous and categorical data settings although it may not be optimal for homogeneous or purely numerical datasets, as observed in biometric identification and activity recognition tasks. CatBoost is faster than alternatives like LightGradientBooster (LightGBM) and XGBoost and its automatic handling of categorical values and competitive performance make it a suitable choice for many applications, especially when considering the trade-off between speed and performance.74 Radiomics implementations of CatBoost include predicting fragility fractures75 and prostate cancer.76
In conclusion, traditional ML models, including SVM, RFs, and Gradient Boosting, hold significant potential for analysing radiomics data in medical research and clinical practice. These models offer high accuracy, stability, and flexibility in handling complex datasets, making them valuable tools for predicting prognostic factors and evaluating treatment outcomes. However, careful consideration of model parameters and interpretation of results are crucial for optimizing performance and ensuring clinical relevance.
Deep learning models using radiomics data
Neural networks are increasingly being used in radiomics research. Compared to traditional ML algorithms, these approaches usually require more data to yield satisfactory results. The most important of these networks are presented in this section to demonstrate how deep learning can enhance radiomics analyses.
Artificial neural networks
Neural networks and deep learning have revolutionized medical image analysis, offering nuanced insights into complex patterns and features. Artificial neural networks (ANNs) have emerged as foundational tools in this domain by emulating the structure and function of biological neural networks, employing sigmoid functions and differentiable activation functions to facilitate learning by modifying weights. Neurons within ANNs receive multiple inputs, summing them and processing the sum through sigmoid functions to generate output values. Structurally, ANNs consist of layers—input, hidden, and output—transferring data via synapses, albeit the “black box” nature of hidden layers complicates interpretation. Learning in ANNs involves updating variables, adjusting connection strength between neurons, and enhancing output accuracy through backpropagation, wherein weights between neurons are adjusted based on errors between predicted and correct outputs. The gradient descent method minimizes errors by finding the lowest point in a cost function, streamlining the learning process of ANNs and enabling iterative adjustments for improved predictive performance across various tasks.77 Notably, use of the MLP ANN with back-propagation, has yielded promising outcomes, boasting high accuracy (90.97%), sensitivity (89.36%), and specificity (92.33%) in radiomics tasks. Moreover, an independent validation test demonstrated the model’s ability to generalize learning without overfitting, highlighting the potential of ANNs in unravelling intricate radiomic signatures.78 ANNs have been employed to predict cardiac event risk79 and to identify metastasis of tongue cancer.80
Recurrent neural networks
Recurrent Neural Networks (RNNs) are pivotal in radiomics for their capacity in processes like working memory and decision-making, which are crucial in analysing medical imaging data.81 RNNs are specifically designed for processing sequential or time-series data, featuring feedback connections that enable the retention of information from previous steps.82 Variants like Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) have evolved to address challenges like vanishing gradients, making them particularly suited for radiomics tasks where long-term dependencies in medical imaging data need to be captured effectively. Bidirectional RNNs (BRNNs) combine forward and backward RNNs, aiding in capturing context from both past and future sequences, which can be advantageous in analysing medical imaging sequences. RNNs find extensive application in radiomics, ranging from image classification83 and segmentation to disease diagnosis and prognosis prediction, leveraging their sequential processing capabilities to derive meaningful insights from medical imaging data. RNNs with radiomics, have been used to predict the progression of prostate cancer84 and classify breast cancer.85
Transformers
Transformers, shaped by self-attention mechanisms and pre-training on large corpora followed by fine-tuning, have not been widely used in radiomics research but offer significant potential in radiomics applications. Self-attention allows capturing long-range dependencies in sequences, crucial for tasks like language translation, while pre-training facilitates meaningful representation learning.86 Transformers, with their encoder-decoder structure and bidirectional representations, have significantly impacted language modelling and are starting to get implemented in medicine for detecting anomalies,87 segmentation and visualization tasks, particularly in brain tumour (BT) diagnosis and classification.88 They excel in capturing long-range spatial dependencies and utilizing whole-brain anatomical data, offering advantages over traditional CNNs. Visual transformers have been used to combine multimodal data, including radiomics and demographics, with MR imaging to predict the O6-Methylguanine-DNA methyltransferase status in patients with diffuse glioma.89 However, addressing challenges such as interpretability, limited datasets and high memory consumption remains essential for wider adoption in clinical settings and radiomics workflow.90
Large language models in radiomics research
Large language models like GPT-3.5 and GPT-4 can handle complex textual data and perform natural language processing (NLP) tasks without extensive fine-tuning.91 These models are trained on vast amounts of text data, enabling them to recognize, interpret, and generate text with minimal or zero-shot learning properties. In healthcare, including radiomics, LLMs have shown promise in automating cognitive tasks traditionally performed by humans. Multimodal LLMs, such as Google’s Med-PaLM M and Microsoft’s LLaVA-Med, specifically designed for medicine, are being explored for clinical radiology applications,92,93 could potentially enhance radiomics analysis with their capabilities that integrate text, images, audio, and video data. Studies have demonstrated the efficacy of LLMs, including Bidirectional Encoder Representation from Transformers (BERT), BioBERT, and others, in extracting relevant information from radiology reports to predict clinical outcomes such as isocitrate dehydrogenase mutation in glioma patients, often outperforming or comparable to human readers.94 Examples of BERT models are: ViLBERT, LXMERT, and VisualBERT. Furthermore, the integration of NLP with radiomics facilitates the development of scalable pipelines for pain identification in cancer patients undergoing radiotherapy. By extracting pain scores from consultation notes and combining them with lesion-based radiomics features, these pipelines can enable efficient pain measurement, enhancing the overall diagnostic process in oncology.95 Nonetheless, drawbacks of LLMs such as the presence of hallucinations, necessitate caution when used for clinical applications including cases where radiomics are involved.93 Examples of studies with various model architectures can be found in Table 1.
Table 1.
Examples of studies utilizing various types of AI architectures for data acquisition/preprocessing and predictive model development.
| Topic | Author/year (reference) | Title | Aim of AI use | AI architecture(s) |
|---|---|---|---|---|
| AI for data acquisition and preprocessing | Ungan et al. (2022)28 | Metastatic melanoma treated by immunotherapy: discovering prognostic markers from radiomics analysis of pretreatment CT with feature selection and classification | Feature selection and classification | SVM, Boruta, RF, LR, k-NN |
| Zhao et al. (2021)18 | Lung segmentation and automatic detection of COVID-19 using radiomic features from chest CT images | Segmentation | V-Net | |
| Xu et al. (2024)14 | Synthesis of virtual monoenergetic images from kilovoltage peak images using wavelet loss enhanced CycleGAN for improving radiomics features reproducibility | Image generation | GAN | |
| Gitto et al. (2024)35 | X-rays radiomics-based ML classification of atypical cartilaginous tumour and high-grade chondrosarcoma of long bones | Class imbalance correction | ADASYN | |
| Rich et al. (2021)34 | Radiomics Predicts for Distant Metastasis in Locally Advanced Human Papillomavirus-Positive Oropharyngeal Squamous Cell Carcinoma | Class imbalance correction | synthetic minority over-sampling technique (SMOTE), ADASYN, and borderline SMOTE | |
| Ziegelmayer et al. (2020)40 | Deep Convolutional Neural Network-Assisted Feature Extraction for Diagnostic Discrimination and Feature Visualization in Pancreatic Ductal Adenocarcinoma (PDAC) versus Autoimmune Pancreatitis (AIP) | Deep radiomics extraction | CNN, RF | |
| AI for radiomics model development | Lee et al. (2022)59 | Radiomic ML for predicting prognostic biomarkers and molecular subtypes of breast cancer using tumour heterogeneity and angiogenesis properties on MRI | Classification | naïve Bayes, linear regression, ANN, Decision Tree, RF |
| Zhou et al. (2024)50 | CT-Based Radiomics Analysis of Different ML Models for Discriminating the Risk Stratification of Pheochromocytoma and Paraganglioma: A Multicenter Study | Risk stratification (classification) | MLPs, SVM, RFs, k-NN | |
| Currie et al. (2019)79 | Intelligent Imaging: Radiomics and ANN in Heart Failure | Risk stratification (classification) | ANN | |
| Klontzas et al. (2021)56 | Radiomics and ML Can Differentiate Transient Osteoporosis from Avascular Necrosis of the Hip | Classification | XGboost, CatBoost and SVM | |
| Sakai et al. (2020)66 | MRI Radiomic Features to Predict IDH1 Mutation Status in Gliomas: A Machine Learning Approach using Gradient Tree Boosting | Classification | XGBoost | |
| Chiari-Correia et al. (2023)78 | A 3D Radiomics-Based ANN Model for Benign Versus Malignant Vertebral Compression Fracture Classification in MRI | Classification | MLP neural network with a back-propagation algorithm | |
| Sushentsev et al. (2023)84 | Time series radiomics for the prediction of prostate cancer progression in patients on active surveillance | Classification using time series data | LSTM RNN | |
| Usuzaki et al. (2024)89 | Identifying key factors for predicting O6-Methylguanine-DNA methyltransferase status in adult patients with diffuse glioma: a multimodal analysis of demographics, radiomics, and MRI by variable Vision Transformer | Classification using multimodal data | Vision transformer (vViT) |
How can AI overcome bottlenecks in the clinical implementation of radiomics?
Establishing radiomics models of clinical value
A fundamental step towards the clinical acceptance of radiomics is establishing not merely its diagnostic performance but its tangible benefits on patient outcomes. This necessitates AI models that are designed not just for diagnostic accuracy but for practical clinical utility, such as enhancing decision-making processes in treatment planning or patient monitoring.96 Decision curve analysis, could be instrumental in demonstrating the clinical benefits of radiomics models.97 Importantly, the focus should be on conditions where the precision of the model can directly influence therapeutic decisions and outcomes. In this context, the role of radiologists becomes crucial; they not only interpret radiomic data but also integrate it with clinical insights to optimize patient care.
Increasing the reproducibility of radiomics
Reproducibility and generalizability have been major stumbling blocks in radiomics98 and ML models in general.99 Studies often showcase models developed and tested in highly controlled environments, which may not perform equally well across different settings, scanners, or patient populations.12 By leveraging data from diverse sources, Radiomics models can be trained and tested for performance consistency, ensuring that they maintain accuracy and reliability regardless of variations in imaging protocols or equipment.
Radiomics faces a significant challenge with temporal variability, which can introduce variations in data even when the equipment, settings, and patient remain unchanged, exhibiting fluctuations in the same patient over time, highlighting the necessity for more rigorous test-retest studies to establish repeatability.100 While phantom studies offer a method to assess this variability, they may fall short in accurately emulating the complexity of human tissues, thus limiting their efficacy in fully capturing the nuances of temporal changes in radiomic features. This lack of repeatability undermines the reliability of radiomics, particularly in the context of delta-radiomics, where changes in radiomic features before and after treatment are analysed to predict outcomes such as response and survival.101
To tackle reproducibility issues in radiomics, implementing comprehensive solutions at different stages of image handling is essential.102 The adoption of AI-algorithms during image acquisition to standardize the imaging protocols will ensure consistency across different scanners and settings, crucial for minimizing initial data variability.16 Post-acquisition, image standardization is further refined using AI-driven techniques such as image denoising and brightness or contrast adjustment, which help to maintain the quality and comparability of the imaging data. Additionally, feature preprocessing methods like the ComBat algorithm103 or standardization algorithms play a pivotal role in correcting batch effects, applied directly on the radiomic data. Collectively, these approaches establish a comprehensive methodology for improving the consistency and utility of radiomics models in diverse clinical environments.
Improving the translational potential of radiomics
AI offers targeted solutions to bridge the gap between controlled studies and real-world clinical applicability. First, to tackle the issue of data diversity and representation, AI can apply data augmentation and synthetic data generation, ensuring models are trained on datasets that more accurately mirror the patient population’s complexity,104 including those with comorbidities. Second, AI-driven bias mitigation algorithms can be employed to address the inherent selection bias and demographic imbalances, promoting a more balanced and equitable model performance. Last, the adoption of prospective, real-world datasets for model validation, facilitated by AI techniques such as transfer learning, ensures that models are not only theoretically sound but also practically viable in diverse clinical environments.105,106 By integrating these solutions, AI can effectively address the outlined problems, enhancing the reliability and real-world applicability of ML models in healthcare.
The comprehensibility of AI models is crucial for their endorsement by healthcare practitioners. This is particularly true for deep learning (DL) models, which are often perceived as more enigmatic and challenging to decipher by medical professionals. Radiologists and clinicians need to understand how models arrive at their conclusions to trust and effectively use them.107 Tools like Gradient-weighted Class Activation Mapping (Grad-CAM) and integrated gradients provide visual explanations of DL model decisions, highlighting regions of an image that influenced the model’s prediction.108,109 Shapley Additive exPlanations (SHAP) provide a sophisticated mechanism for highlighting the features within a radiomics dataset that are most critical for addressing a particular clinical issue. Additionally, SHAP quantifies feature influence on model outcome by assessing their weights within the algorithm.110 This transparency is essential for building trust and ensuring that AI-assisted decisions are aligned with clinical knowledge and experience.
Case-Based Reasoning could enhance radiomics interpretability by employing ΑΙ algorithms, for example, clustering to identify and retrieve similar previous patient cases. The idea focuses on demonstrating how the proposed model has analysed these cases and their outcomes, providing clinicians with a clear, relatable example of the AI decision-making process.111 This approach not only could elucidate the reasoning behind AI predictions but also anchor them in tangible clinical experiences.
Bottlenecks of radiomics analysis that can be overcome by radiomics are presented in Figure 2.
Figure 2.
Bottlenecks of radiomics analysis which can be overcome by the use of AI. Data availability, diversity/representation, explainability, and reproducibility are the main bottlenecks that AI can alleviate (created with biorender.com).
Achieving high reporting standards for AI-driven radiomics
In the quest to increase the quality of published radiomics research, the use of reporting checklists is of utmost importance. Such checklists enforce and ensure the inclusion of all important information that will allow the reproduction of research results by researchers, while assisting reviewers of radiomics manuscripts.112 Radiomics-specific checklists have been developed including the CheckList for EvaluAtion of Radiomics research (CLEAR) which has been endorsed by the European Society of Radiology and EuSoMII.113 CLEAR covers all types of radiomics including AI assisted deep radiomics and ensures the sufficient reporting of all AI methods used in radiomics manuscripts. CLEAR can be used together with the METRICS score3 which has also been endorsed by EuSoMII for the evaluation of the quality of radiomics research. METRICS covers AI-assisted automated segmentation, deep radiomics and end-to-end deep learning solutions rendering it suitable for the assessment of manuscripts where AI methods have been used for the development of radiomics signatures. RQS had also been widely used in the past for the evaluation of radiomics research, but it has recently been shown that it lacks reproducibility.2
Bias related to the use of AI in radiomics studies
Bias can be present at all steps of AI application, from study design to modelling and deployment.114 At the step of study design and data preprocessing, the use of AI in radiomics analysis can alleviate bias related to class imbalance, data preprocessing (eg, image normalisation) and feature selection, which can greatly affect the results of predictive models (discussed in detail in AI for model development using radiomics data section).115 Nonetheless, at the modelling step, the use of AI may be inherently linked to other types of bias which should be carefully avoided. These include data leakage and bias introduced by the selection of inappropriate evaluation metrics used to assess the performance of AI algorithms.116,117 To avoid such types of bias, feature selection, data normalisation and hyperparameter tuning should be performed at the training set and then the models should be tested at a completely unrelated hold-out internal test set and at an external test set.117,118 Finally, at the deployment step bias can be introduced by misuse of the algorithm for applications that it has not been trained on. Another important source of bias at this step is concept drift, where the relationship between the input and the output of the model changes for reasons such as changes in the reference standard (eg, the use of new guidelines) or changes in the equipment used to acquire images.114 Future research should focus on addressing all the aforementioned types of bias introduced by the use of AI in radiomics analysis.
Conclusion
AI has the potential to significantly advance the clinical implementation of radiomics by addressing key challenges related to validation, reproducibility, and generalizability. By ensuring that radiomics models are validated against real-world data, designed with interpretability in mind, and integrated into clinical workflows, these models can move from being a promising research tool to becoming a cornerstone of personalized medicine. This transition will require close collaboration between AI researchers, radiologists, and clinicians to ensure that radiomic models are not only technically sound but also clinically relevant and trusted by those who use them to make critical healthcare decisions.
Contributor Information
Konstantinos Vrettos, Artificial Intelligence and Translational Imaging (ATI) Lab, Department of Radiology, School of Medicine, University of Crete, Voutes Campus, Heraklion, 71003, Greece.
Matthaios Triantafyllou, Artificial Intelligence and Translational Imaging (ATI) Lab, Department of Radiology, School of Medicine, University of Crete, Voutes Campus, Heraklion, 71003, Greece; Department of Medical Imaging, University Hospital of Heraklion, Heraklion, Crete, 71110, Greece.
Kostas Marias, Computational BioMedicine Laboratory, Institute of Computer Science, Foundation for Research and Technology (FORTH), Heraklion, Crete, 70013, Greece; Department of Electrical and Computer Engineering, Hellenic Mediterranean University, Heraklion, Crete, 71410, Greece.
Apostolos H Karantanas, Artificial Intelligence and Translational Imaging (ATI) Lab, Department of Radiology, School of Medicine, University of Crete, Voutes Campus, Heraklion, 71003, Greece; Department of Medical Imaging, University Hospital of Heraklion, Heraklion, Crete, 71110, Greece; Computational BioMedicine Laboratory, Institute of Computer Science, Foundation for Research and Technology (FORTH), Heraklion, Crete, 70013, Greece.
Michail E Klontzas, Artificial Intelligence and Translational Imaging (ATI) Lab, Department of Radiology, School of Medicine, University of Crete, Voutes Campus, Heraklion, 71003, Greece; Department of Medical Imaging, University Hospital of Heraklion, Heraklion, Crete, 71110, Greece; Computational BioMedicine Laboratory, Institute of Computer Science, Foundation for Research and Technology (FORTH), Heraklion, Crete, 70013, Greece; Division of Radiology, Department for Clinical Science, Intervention and Technology (CLINTEC), Karolinska Institutet, Huddinge, 14152, Sweden.
Author contributions
K. Vrettos and M. Triantafyllou contributed equally to this work.
Funding
None declared.
Conflicts of interest
None declared.
References
- 1. Lambin P, Leijenaar RTH, Deist TM, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. 2017;14(12):749-762. 10.1038/nrclinonc.2017.141 [DOI] [PubMed] [Google Scholar]
- 2. Akinci D'Antonoli T, Cavallo AU, Vernuccio F, et al. Reproducibility of radiomics quality score: an intra- and inter-rater reliability study. Eur Radiol. 2023;34(4):2791-2804. 10.1007/S00330-023-10217-X/FIGURES/5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Kocak B, Akinci D'Antonoli T, Mercaldo N, et al. METhodological RadiomICs Score (METRICS): a quality scoring tool for radiomics research endorsed by EuSoMII. Insights Imaging. 2024;15(1):8. 10.1186/S13244-023-01572-W/FIGURES/9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Qian L, Wu T, Kong S, et al. Could the underlying biological basis of prognostic radiomics and deep learning signatures be explored in patients with lung cancer? A systematic review. Eur J Radiol. 2024;171:111314. 10.1016/j.ejrad.2024.111314 [DOI] [PubMed] [Google Scholar]
- 5. Horvat N, Papanikolaou N, Koh DM.. Radiomics beyond the hype: a critical evaluation toward oncologic clinical use. Radiol Artif Intell. 2024;6(4):e230437. 10.1148/ryai.230437 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Forghani R, Savadjiev P, Chatterjee A, Muthukrishnan N, Reinhold C, Forghani B.. Radiomics and artificial intelligence for biomarker and prediction model development in oncology. Comput Struct Biotechnol J. 2019;17:995-1008. 10.1016/j.csbj.2019.07.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Rodrigues A, Rodrigues N, Santinha J, et al. Value of handcrafted and deep radiomic features towards training robust machine learning classifiers for prediction of prostate cancer disease aggressiveness. Sci Rep. 2023;13(1):6206-6210. 10.1038/s41598-023-33339-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Sheng L, Yang C, Chen Y, Song B.. Machine learning combined with radiomics facilitating the personal treatment of malignant liver tumors. Biomedicines. 2023;12(1):58. 10.3390/biomedicines12010058 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Fusco R, Granata V, Simonetti I, et al. An informative review of radiomics studies on cancer imaging: the main findings, challenges and limitations of the methodologies. Curr Oncol. 2024;31(1):403-424. 10.3390/curroncol31010027 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Lohmann P, Galldiks N, Kocher M, et al. Radiomics in neuro-oncology: basics, workflow, and applications. Methods. 2021;188:112-121. 10.1016/j.ymeth.2020.06.003 [DOI] [PubMed] [Google Scholar]
- 11. Lin CY, Guo SM, Lien JJJ, et al. Combined model integrating deep learning, radiomics, and clinical data to classify lung nodules at chest CT. Radiol Med. 2023;129(1):56-69. 10.1007/s11547-023-01730-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Pfaehler E, Zhovannik I, Wei L, et al. A systematic review and quality of reporting checklist for repeatability and reproducibility of radiomic features. Phys Imaging Radiat Oncol. 2021;20:69-75. 10.1016/j.phro.2021.10.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Fu C, Zhang B, Guo T, Li J.. Imaging evaluation of peritoneal metastasis: current and promising techniques. Korean J Radiol. 2024;25(1):86-102. 10.3348/kjr.2023.0840 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Xu Z, Li M, Li B, Shu H.. Synthesis of virtual monoenergetic images from kilovoltage peak images using wavelet loss enhanced CycleGAN for improving radiomics features reproducibility. Quant Imaging Med Surg. 2024;14(3):2370-2390. 10.21037/qims-23-922 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Vrettos K, Koltsakis E, Zibis AH, Karantanas AH, Klontzas ME.. Generative adversarial networks for spine imaging: a critical review of current applications. Eur J Radiol. 2024;171:111313. 10.1016/J.EJRAD.2024.111313 [DOI] [PubMed] [Google Scholar]
- 16. Lee J, Jeon J, Hong Y, et al. Generative adversarial network with radiomic feature reproducibility analysis for computed tomography denoising. Comput Biol Med. 2023;159:106931. 10.1016/J.COMPBIOMED.2023.106931 [DOI] [PubMed] [Google Scholar]
- 17. Scalco E, Rizzo G, Mastropietro A.. The stability of oncologic MRI radiomic features and the potential role of deep learning: a review. Phys Med Biol. 2022;67(9):09TR03. 10.1088/1361-6560/ac60b9 [DOI] [PubMed] [Google Scholar]
- 18. Zhao C, Xu Y, He Z, et al. Lung segmentation and automatic detection of COVID-19 using radiomic features from chest CT images. Pattern Recognit. 2021;119:108071. 10.1016/j.patcog.2021.108071 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Xu L, Tetteh G, Lipkova J, et al. Automated whole-body bone lesion detection for multiple myeloma on 68 Ga-Pentixafor PET/CT imaging using deep learning methods. Contrast Media Mol Imaging. 2018;2018:2391925. 10.1155/2018/2391925 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Klontzas ME. Radiomics feature reproducibility: the elephant in the room. Eur J Radiol. 2024;175:111430. 10.1016/J.EJRAD.2024.111430 [DOI] [PubMed] [Google Scholar]
- 21. Wasserthal J, Breit HC, Meyer MT, et al. TotalSegmentator: robust segmentation of 104 anatomic structures in CT images. Radiol Artif Intell. 2023;5(5):e230024. 10.1148/RYAI.230024/ASSET/IMAGES/LARGE/RYAI.230024.FIG6.JPEG [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Shiyam Sundar LK, Yu J, Muzik O, et al. Fully automated, semantic segmentation of whole-body 18F-FDG PET/CT images based on data-centric artificial intelligence. J Nucl Med. 2022;63(12):1941-1948. 10.2967/JNUMED.122.264063 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Kapetanou E, Malamas S, Leventis D, Karantanas AH, Klontzas ME.. Developing a radiomics atlas dataset of normal abdominal and pelvic computed tomography (RADAPT). J Imaging Inform Med. 2024;1-9. 10.1007/S10278-024-01028-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Dai H, Lu M, Huang B, et al. Considerable effects of imaging sequences, feature extraction, feature selection, and classifiers on radiomics-based prediction of microvascular invasion in hepatocellular carcinoma using magnetic resonance imaging. Quant Imaging Med Surg. 2021;11(5):1836-1853. 10.21037/qims-20-218 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. Gillies RJ, Kinahan PE, Hricak H.. Radiomics: images are more than pictures, they are data. Radiology. 2016;278(2):563-577. 10.1148/radiol.2015151169 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Jia J, Liu Z, Wang F, Bai G.. Consensus clustering analysis based on enhanced-CT radiomic features: esophageal squamous cell carcinoma patients’ 3-year progression-free survival. Acad Radiol. 2024. 10.1016/j.acra.2023.12.025 [DOI] [PubMed] [Google Scholar]
- 27. Papanikolaou N, Matos C, Koh DM.. How to develop a meaningful radiomic signature for clinical use in oncologic patients. Cancer Imaging. 2020;20(1):33. 10.1186/s40644-020-00311-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Ungan G, Lavandier AF, Rouanet J, et al. Metastatic melanoma treated by immunotherapy: discovering prognostic markers from radiomics analysis of pretreatment CT with feature selection and classification. Int J Comput Assist Radiol Surg. 2022;17(10):1867-1877. 10.1007/s11548-022-02662-8 [DOI] [PubMed] [Google Scholar]
- 29. Degenhardt F, Seifert S, Szymczak S.. Evaluation of variable selection methods for random forests and omics data sets. Brief Bioinform. 2019;20(2):492-503. 10.1093/BIB/BBX124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Zhang B, Ni-Jia-Ti MY, Yan R, et al. CT-based radiomics for predicting the rapid progression of coronavirus disease 2019 (COVID-19) pneumonia lesions. Br J Radiol. 2021;94(1122):20201007. 10.1259/bjr.20201007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Demircioğlu A. Applying oversampling before cross-validation will lead to high bias in radiomics. Sci Rep. 2024;14(1):11563. 10.1038/s41598-024-62585-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP.. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;16:321-357. [Google Scholar]
- 33. Zhang Y, Oikonomou A, Wong A, Haider MA, Khalvati F.. Radiomics-based prognosis analysis for non-small cell lung cancer. Sci Rep. 2017;7:46349. 10.1038/srep46349 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Rich B, Huang J, Yang Y, et al. Radiomics predicts for distant metastasis in locally advanced human papillomavirus-positive oropharyngeal squamous cell carcinoma. Cancers (Basel). 2021;13(22):5689. 10.3390/cancers [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Gitto S, Annovazzi A, Nulle K, et al. X-rays radiomics-based machine learning classification of atypical cartilaginous tumour and high-grade chondrosarcoma of long bones. EBioMedicine. 2024;101:105018. 10.1016/j.ebiom.2024.105018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Klontzas ME, Koltsakis E, Kalarakis G, et al. A pilot radiometabolomics integration study for the characterization of renal oncocytic neoplasia. Sci Rep. 2023;13(1):12594. 10.1038/s41598-023-39809-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Sampath V, Maurtua I, Aguilar Martín JJ, Gutierrez A.. A survey on generative adversarial networks for imbalance problems in computer vision tasks. J Big Data. 2021;8(1):27. 10.1186/s40537-021-00414-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Xu L, Skoularidou M, Cuesta-Infante A, Veeramachaneni K.. Modeling tabular data using conditional GAN. ArXiv 1907.00503v2.http://arxiv.org/abs/1907.00503, 2019. [Google Scholar]
- 39. Rožanec JM, Fortuna B, Poštuvan T, Mladenić D.. Tackling Class Imbalance in Radiomics: The COVID-19 Use Case. SiKDD ’21. Vol. 2872. CEUR-WS; 2021:29–35.
- 40. Ziegelmayer S, Kaissis G, Harder F, et al. Deep convolutional neural network-assisted feature extraction for diagnostic discrimination and feature visualization in pancreatic ductal adenocarcinoma (Pdac) versus autoimmune pancreatitis (aip). J Clin Med. 2020;9(12):1-8. 10.3390/jcm9124013 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Hu Y, Xie C, Yang H, et al. Computed tomography-based deep-learning prediction of neoadjuvant chemoradiotherapy treatment response in esophageal squamous cell carcinoma. Radiother Oncol. 2021;154:6-13. 10.1016/j.radonc.2020.09.014 [DOI] [PubMed] [Google Scholar]
- 42. Ypsilantis PP, Siddique M, Sohn HM, et al. Predicting response to neoadjuvant chemotherapy with PET imaging using convolutional neural networks. PLoS One. 2015;10(9):e0137036. 10.1371/journal.pone.0137036 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Wei L, Owen D, Rosen B, et al. A deep survival interpretable radiomics model of hepatocellular carcinoma patients. Phys Med. 2021;82:295-305. 10.1016/j.ejmp.2021.02.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Klyuzhin IS, Xu Y, Ortiz A, Ferres JL, Hamarneh G, Rahmim A.. Testing the ability of convolutional neural networks to learn radiomic features. Comput Methods Programs Biomed. 2022;219:106750. 10.1016/j.cmpb.2022.106750 [DOI] [PubMed] [Google Scholar]
- 45. Avanzo M, Wei L, Stancanello J, et al. Machine and deep learning methods for radiomics. Med Phys. 2020;47(5):e185-e202. 10.1002/mp.13678 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Yamashita R, Nishio M, Do RKG, Togashi K.. Convolutional neural networks: an overview and application in radiology. Insights Imaging. 2018;9(4):611-629. 10.1007/s13244-018-0639-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Kobayashi K, Miyake M, Takahashi M, Hamamoto R.. Observing deep radiomics for the classification of glioma grades. Sci Rep. 2021;11(1):10942. 10.1038/s41598-021-90555-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Chun SH, Suh YJ, Han K, Kwon Y, Kim AY, Choi BW.. Deep learning-based reconstruction on cardiac CT yields distinct radiomic features compared to iterative and filtered back projection reconstructions. Sci Rep. 2022;12(1):15171. 10.1038/s41598-022-19546-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Wong PK, Chan IN, Yan HM, et al. Deep learning based radiomics for gastrointestinal cancer diagnosis and treatment: a minireview. World J Gastroenterol. 2022;28(45):6363-6379. 10.3748/wjg.v28.i45.6363 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. Zhou Y, Zhan Y, Zhao J, et al. CT-based radiomics analysis of different machine learning models for discriminating the risk stratification of pheochromocytoma and paraganglioma: a multicenter study. Acad Radiol. 2024. 10.1016/j.acra.2024.01.008 [DOI] [PubMed] [Google Scholar]
- 51. Cheng Q, Lin H, Zhao J, Lu X, Wang Q.. Application of machine learning-based multi-sequence MRI radiomics in diagnosing anterior cruciate ligament tears. J Orthop Surg Res. 2024;19(1):99. 10.1186/s13018-024-04602-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Kecman V. Support Vector Machines—An Introduction. Springer, Berlin, Heidelberg; 2005:1-47. 10.1007/10984697_1 [DOI]
- 53. Huang S, Cai N, Pacheco PP, Narrandes S, Wang Y, Xu W.. Applications of support vector machine (SVM) learning in cancer genomics. Cancer Genomics Proteomics. 2018;15(1):41-51. 10.21873/cgp.20063 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Cho H. h, Lee S. h, Kim J, Park H.. Classification of the glioma grading using radiomics analysis. PeerJ. 2018;6:e5982. 10.7717/peerj.5982 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Charalambous S, Klontzas ME, Kontopodis N, et al. Radiomics and machine learning to predict aggressive type 2 endoleaks after endovascular aneurysm repair: a proof of concept. Acta Radiol. 2021;63(9):1293-1299. [DOI] [PubMed] [Google Scholar]
- 56. Klontzas ME, Manikis GC, Nikiforaki K, et al. Radiomics and machine learning can differentiate transient osteoporosis from avascular necrosis of the hip. Diagnostics. 2021;11(9):1686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Breiman L. Random forests. Mach Learn. 2001;45(1):5-32. 10.1023/A:1010933404324 [DOI] [Google Scholar]
- 58. Rigatti SJ. Random forest. J Insur Med. 2017;47(1):31-39. 10.17849/insm-47-01-31-39.1 [DOI] [PubMed] [Google Scholar]
- 59. Lee JY, sig LK, Seo BK, et al. Radiomic machine learning for predicting prognostic biomarkers and molecular subtypes of breast cancer using tumor heterogeneity and angiogenesis properties on MRI. Eur Radiol. 2022;32(1):650-660. 10.1007/s00330-021-08146-8 [DOI] [PubMed] [Google Scholar]
- 60. Tang C, Garreau D, Von Luxburg U. When do random forests fail? Advances in Neural Information Processing Systems. Vol. 2018-December; Curran Associates Inc.; 2018.
- 61. Zhang Y, Haghani A.. A gradient boosting method to improve travel time prediction. Transp Res Part C Emerg Technol. 2015;58:308-324. 10.1016/j.trc.2015.02.019 [DOI] [Google Scholar]
- 62. Zhang Z, Zhao Y, Canes A, Steinberg D, Lyashevska O, written on behalf of AME Big-Data Clinical Trial Collaborative Group. Predictive analytics with gradient boosting in clinical medicine. Ann Transl Med. 2019;7(7):152-152. 10.21037/atm.2019.03.29 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Natekin A, Knoll A.. Gradient boosting machines, a tutorial. Front Neurorobot. 2013;7:21. 10.3389/fnbot.2013.00021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Moore A, Bell M.. XGBoost, a novel explainable ai technique, in the prediction of myocardial infarction: a UK biobank cohort study. Clin Med Insights Cardiol. 2022;16:11795468221133611. 10.1177/11795468221133611 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Nazari M, Shiri I, Zaidi H.. Radiomics-based machine learning model to predict risk of death within 5-years in clear cell renal cell carcinoma patients. Comput Biol Med. 2021;129:104135. 10.1016/j.compbiomed.2020.104135 [DOI] [PubMed] [Google Scholar]
- 66. Sakai Y, Yang C, Kihira S, et al. MRI radiomic features to predict IDH1 mutation status in gliomas: a machine learning approach using gradient tree boosting. Int J Mol Sci. 2020;21(21):8004. 10.3390/ijms21218004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Kotanidis CP, Xie C, Alexander D, et al. Constructing custom-made radiotranscriptomic signatures of vascular inflammation from routine CT angiograms: a prospective outcomes validation study in COVID-19. Lancet Digit Health. 2022;4(10):e705-e716. 10.1016/S2589-7500(22)00132-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68. Fu XY, Mao XL, Wu HW, et al. Development and validation of LightGBM algorithm for optimizing of Helicobacter pylori antibody during the minimum living guarantee crowd based gastric cancer screening program in Taizhou, China. Prev Med. 2023;174:107605. 10.1016/j.ypmed.2023.107605 [DOI] [PubMed] [Google Scholar]
- 69. Lam LHT, Chu NT, Tran TO, Do DT, Le NQK.. A radiomics-based machine learning model for prediction of tumor mutational burden in lower-grade gliomas. Cancers (Basel). 2022;14(14):3492. 10.3390/cancers14143492 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Chen Z, Zhan W, Wu Z, et al. The ultrasound-based radiomics-clinical machine learning model to predict papillary thyroid microcarcinoma in TI-RADS 3 nodules. Transl Cancer Res. 2024;13(1):278-289. 10.21037/tcr-23-1375 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Heo J, Sim Y, Kim BM, et al. Radiomics using non-contrast CT to predict hemorrhagic transformation risk in stroke patients undergoing revascularization. Eur Radiol. 2024. 10.1007/s00330-024-10618-6 [DOI] [PubMed] [Google Scholar]
- 72. Prokhorenkova L, Gusev G, Vorobev A, Dorogush AV, Gulin A. Catboost: unbiased boosting with categorical features. Advances in Neural Information Processing Systems. Vol. 2018-December. Curran Associates Inc.; 2018.
- 73. Hancock JT, Khoshgoftaar TM.. CatBoost for big data: an interdisciplinary review. J Big Data. 2020;7(1):94. 10.1186/s40537-020-00369-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74. Bentéjac C, Csörgő A, Martínez-Muñoz G.. A comparative analysis of gradient boosting algorithms. Artif Intell Rev. 2021;54(3):1937-1967. 10.1007/s10462-020-09896-5 [DOI] [Google Scholar]
- 75. Kong SH, Ahn D, Kim B(R), et al. A novel fracture prediction model using machine learning in a community‐based cohort. JBMR Plus. 2020;4(3):e10337. 10.1002/jbm4.10337 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Isaksson LJ, Repetto M, Summers PE, et al. High-performance prediction models for prostate cancer radiomics. Inform Med Unlocked. 2023;37:101161. 10.1016/j.imu.2023.101161 [DOI] [Google Scholar]
- 77. Han SH, Kim KW, Kim S, Youn YC.. Artificial neural network: understanding the basic concepts without mathematics. Dement Neurocogn Disord. 2018;17(3):83-89. 10.12779/dnd.2018.17.3.83 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Chiari-Correia NS, Nogueira-Barbosa MH, Chiari-Correia RD, Azevedo-Marques PM.. A 3D radiomics-based artificial neural network model for benign versus malignant vertebral compression fracture classification in MRI. J Digit Imaging. 2023;36(4):1565-1577. 10.1007/s10278-023-00847-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79. Currie G, Iqbal B, Kiat H.. Intelligent imaging: radiomics and artificial neural networks in heart failure. J Med Imaging Radiat Sci. 2019;50(4):571-574. 10.1016/j.jmir.2019.08.006 [DOI] [PubMed] [Google Scholar]
- 80. Zhong YW, Jiang Y, Dong S, et al. Tumor radiomics signature for artificial neural network-assisted detection of neck metastasis in patient with tongue cancer. J Neuroradiol. 2022;49(2):213-218. 10.1016/j.neurad.2021.07.006 [DOI] [PubMed] [Google Scholar]
- 81. Barak O. Recurrent neural networks as versatile tools of neuroscience research. Curr Opin Neurobiol. 2017;46:1-6. 10.1016/j.conb.2017.06.003 [DOI] [PubMed] [Google Scholar]
- 82. Das S, Tariq A, Santos T, Kantareddy SS, Banerjee I.. Recurrent Neural Networks (RNNs): Architectures, Training Tricks, and Introduction to Influential Research. Humana; 2023:117-138. 10.1007/978-1-0716-3195-9_4 [DOI] [PubMed]
- 83. Wang J, Yang Y, Mao J, Huang Z, Huang C, Xu W. CNN-RNN: a unified framework for multi-label image classification. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE; 2016.
- 84. Sushentsev N, Rundo L, Abrego L, et al. Time series radiomics for the prediction of prostate cancer progression in patients on active surveillance. Eur Radiol. 2023;33(6):3792-3800. 10.1007/s00330-023-09438-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85. Subasree S, Sakthivel NK, Tripathi K, Agarwal D, Tyagi AK.. Combining the advantages of radiomic features based feature extraction and hyper parameters tuned RERNN using LOA for breast cancer classification. Biomed Signal Process Control. 2022;72:103354. 10.1016/j.bspc.2021.103354 [DOI] [Google Scholar]
- 86. Khan S, Naseer M, Hayat M, Zamir SW, Khan FS, Shah M.. Transformers in vision: a survey. ACM Comput Surv. 2022;54(10s):1-41. 10.1145/3505244 [DOI] [Google Scholar]
- 87. Qiu J, Mitra J, Ghose S, et al. A multichannel CT and radiomics-guided CNN-ViT (RadCT-CNNViT) ensemble network for diagnosis of pulmonary sarcoidosis. Diagnostics. 2024;14(10):1049. 10.3390/diagnostics14101049 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88. Lan Y, Zou S, Qin B, Zhu X.. Potential roles of transformers in brain tumor diagnosis and treatment. Brain-X. 2023;1(2):e23. 10.1002/brx2.23 [DOI] [Google Scholar]
- 89. Usuzaki T, Takahashi K, Inamori R, et al. Identifying key factors for predicting O6-Methylguanine-DNA methyltransferase status in adult patients with diffuse glioma: a multimodal analysis of demographics, radiomics, and MRI by variable Vision Transformer. Neuroradiology. 2024;66(5):761-773. 10.1007/S00234-024-03329-8/TABLES/4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90. Fanizzi A, Fadda F, Comes MC, et al. Comparison between vision transformers and convolutional neural networks to predict non-small lung cancer recurrence. Sci Rep. 2023;13(1):20605. 10.1038/s41598-023-48004-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91. Thirunavukarasu AJ, Ting DSJ, Elangovan K, Gutierrez L, Tan TF, Ting DSW.. Large language models in medicine. Nat Med. 2023;29(8):1930-1940. 10.1038/s41591-023-02448-8 [DOI] [PubMed] [Google Scholar]
- 92. Bhayana R. Chatbots and large language models in radiology: a practical primer for clinical and research applications. Radiology. 2024;310(1):e232756. 10.1148/radiol.232756 [DOI] [PubMed] [Google Scholar]
- 93. Akinci D'Antonoli T, Stanzione A, Bluethgen C, et al. Large language models in radiology: fundamentals, applications, ethical considerations, risks, and future directions. Diagn Interv Radiol. 2024;30(2):80-90. 10.4274/DIR.2023.232417 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94. Kim M, Ong KI, Choi S, et al. Natural language processing to predict isocitrate dehydrogenase genotype in diffuse glioma using MR radiology reports. Eur Radiol. 2023;33(11):8017-8025. 10.1007/s00330-023-10061-z [DOI] [PubMed] [Google Scholar]
- 95. Naseri H, Skamene S, Tolba M, et al. A scalable radiomics- and natural language processing-based machine learning pipeline to distinguish between painful and painless thoracic spinal bone metastases: retrospective algorithm development and validation study. JMIR AI. 2023;2:e44779. 10.2196/44779 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96. Lambin P, Leijenaar RTH, Deist TM, et al. Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol. 2017;14(12):749-762. 10.1038/nrclinonc.2017.141 [DOI] [PubMed] [Google Scholar]
- 97. Vickers AJ, Woo S.. Decision curve analysis in the evaluation of radiology research. Eur Radiol. 2022;32(9):5787-5789. 10.1007/S00330-022-08685-8/FIGURES/4 [DOI] [PubMed] [Google Scholar]
- 98. Akinci D'Antonoli T, Cuocolo R, Baessler B, Pinto Dos Santos D.. Towards reproducible radiomics research: introduction of a database for radiomics studies. Eur Radiol. 2023;34(1):436-443. 10.1007/s00330-023-10095-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99. Traverso A, Wee L, Dekker A, Gillies R.. Repeatability and reproducibility of radiomic features: a systematic review. Int J Radiat Oncol Biol Phys. 2018;102(4):1143-1158. 10.1016/j.ijrobp.2018.05.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100. Jha AK, Mithun S, Jaiswar V, et al. Repeatability and reproducibility study of radiomic features on a phantom and human cohort. Sci Rep. 2021;11(1):2055. 10.1038/s41598-021-81526-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101. Cousin F, Louis T, Dheur S, et al. Radiomics and delta-radiomics signatures to predict response and survival in patients with non-small-cell lung cancer treated with immune checkpoint inhibitors. Cancers (Basel). 2023;15(7):1968. 10.3390/cancers15071968 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102. Alberich-Bayarri A, Sourbron S, Golay X, et al. ESR statement on the validation of imaging biomarkers. Insights Imaging. 2020;11(1):76. 10.1186/s13244-020-00872-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103. Orlhac F, Frouin F, Nioche C, Ayache N, Buvat I.. Validation of a method to compensate multicenter effects affecting CT radiomics. Radiology. 2019;291(1):53-59. 10.1148/RADIOL.2019182023 [DOI] [PubMed] [Google Scholar]
- 104. Park JE, Eun D, Kim HS, Lee DH, Jang RW, Kim N.. Generative adversarial network for glioblastoma ensures morphologic variations and improves diagnostic model for isocitrate dehydrogenase mutant type. Sci Rep. 2021;11(1):9912. 10.1038/s41598-021-89477-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105. Mei X, Liu Z, Robson PM, et al. RadImageNet: an open radiologic deep learning research dataset for effective transfer learning. Radiol Artif Intell. 2022;4(5):e210315. 10.1148/ryai.210315 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106. Kim HE, Cosa-Linan A, Santhanam N, Jannesari M, Maros ME, Ganslandt T.. Transfer learning for medical image classification: a literature review. BMC Med Imaging. 2022;22(1):69. 10.1186/s12880-022-00793-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107. Reyes M, Meier R, Pereira S, et al. On the interpretability of artificial intelligence in radiology: challenges and opportunities. Radiol Artif Intell. 2020;2(3):e190043. 10.1148/ryai.2020190043 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108. Zhang Y, Hong D, McClement D, Oladosu O, Pridham G, Slaney G.. Grad-CAM helps interpret the deep learning models trained to classify multiple sclerosis types using clinical brain magnetic resonance imaging. J Neurosci Methods. 2021;353:109098. 10.1016/j.jneumeth.2021.109098 [DOI] [PubMed] [Google Scholar]
- 109. Zhang J, Chao H, Dasegowda G, Wang G, Kalra MK, Yan P.. Revisiting the trustworthiness of saliency methods in radiology AI. Radiol Artif Intell. 2024;6(1):e220221. 10.1148/RYAI.220221/ASSET/IMAGES/LARGE/RYAI.220221.FIG4.JPEG [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110. Shi Y, Zou Y, Liu J, et al. Ultrasound-based radiomics XGBoost model to assess the risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: individual application of SHAP. Front Oncol. 2022;12:897596. 10.3389/fonc.2022.897596 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111. Wolf TN, Bongratz F, Rickmann AM, Pölsterl S, Wachinger C.. Keep the faith: faithful explanations in convolutional neural networks for case-based reasoning. Proc AAAI Conf Artif Intell. 2024;38(6):5921-5929. 10.1609/AAAI.V38I6.28406 [DOI] [Google Scholar]
- 112. Klontzas ME, Gatti AA, Tejani AS, Kahn CE.. AI reporting guidelines: how to select the best one for your research. Radiol Artif Intell. 2023;5(3):e230055. 10.1148/RYAI.230055/ASSET/IMAGES/LARGE/RYAI.230055.TBL2.JPEG [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113. Kocak B, Baessler B, Bakas S, et al. CheckList for EvaluAtion of Radiomics research (CLEAR): a step-by-step reporting guideline for authors and reviewers endorsed by ESR and EuSoMII. Insights Imaging. 2023;14(1):75. 10.1186/S13244-023-01415-8/TABLES/1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Koçak B, Ponsiglione A, Stanzione A, et al. Bias in artificial intelligence for medical imaging: fundamentals, detection, avoidance, mitigation, challenges, ethics, and prospects. Diagn Interv Radiol. 2024. 10.4274/dir.2024.242854 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115. Demircioğlu A. Measuring the bias of incorrect application of feature selection when using cross-validation in radiomics. Insights Imaging. 2021;12(1):172. 10.1186/s13244-021-01115-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116. Maier-Hein L, Reinke A, Godau P, et al. Metrics reloaded: recommendations for image analysis validation. Nat Methods. 2024;21(2):195-212. 10.1038/s41592-023-02151-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117. Gidwani M, Chang K, Patel JB, et al. Inconsistent partitioning and unproductive feature associations yield idealized radiomic models. Radiology. 2023;307(1):e220715. 10.1148/radiol.220715 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118. Tejani AS, Klontzas ME, Gatti AA, et al. Checklist for Artificial Intelligence in Medical Imaging (CLAIM): 2024 update. Radiology Artif Intell. 2024;6(4):e240300. 10.1148/ryai.240300 [DOI] [PMC free article] [PubMed] [Google Scholar]


