Abstract
Simple Summary
Breast cancer is considered as the most common malignancy among females, and its treatment takes many forms and types. Neoadjuvant chemotherapy (NACT), which is the treatment precedes the surgical intervention, became the preferred treatment approach for some subtypes of breast tumors. However, some patients exhibit good response to the neoadjuvant treatment, while others do not. Therefore, the proactive prediction of patients’ response to NACT is a necessity to reduce the exposure to unnecessary doses of treatment, treatment costs, and side effects. Many researchers proposed prediction models to predict patients’ response to NACT either at early stage of treatment or prior to the initiation of the first cycle. They used various radiomics, pathological, and clinical predictors and markers. This review discusses some of the researches conducted the last decade based on statistical, machine learning, or deep learning approaches.
Abstract
Breast cancer retains its position as the most prevalent form of malignancy among females on a global scale. The careful selection of appropriate treatment for each patient holds paramount importance in effectively managing breast cancer. Neoadjuvant chemotherapy (NACT) plays a pivotal role in the comprehensive treatment of this disease. Administering chemotherapy before surgery, NACT becomes a powerful tool in reducing tumor size, potentially enabling fewer invasive surgical procedures and even rendering initially inoperable tumors amenable to surgery. However, a significant challenge lies in the varying responses exhibited by different patients towards NACT. To address this challenge, researchers have focused on developing prediction models that can identify those who would benefit from NACT and those who would not. Such models have the potential to reduce treatment costs and contribute to a more efficient and accurate management of breast cancer. Therefore, this review has two objectives: first, to identify the most effective radiomic markers correlated with NACT response, and second, to explore whether integrating radiomic markers extracted from radiological images with pathological markers can enhance the predictive accuracy of NACT response. This review will delve into addressing these research questions and also shed light on the emerging research direction of leveraging artificial intelligence techniques for predicting NACT response, thereby shaping the future landscape of breast cancer treatment.
Keywords: breast cancer, computed tomography, mammography, magnetic resonance imaging, multi-modal imaging, neoadjuvant chemotherapy, pathological markers, predictive models, radiomic markers, treatment response
1. Introduction
Breast cancer stands as the most prevalent malignancy among females globally, comprising 11.7% of all cancer incidences worldwide, with 2.3 million new cases and 685,000 fatalities recorded in 2020. It ranks as the fifth leading cause of cancer-related deaths on a global scale [1,2]. In the United States, projections for 2023 indicate an estimated 300,590 new cases, resulting in 43,700 deaths across both genders [3]. Among females, breast cancer contributes to 31% of all cancer instances, foreseeing 297,790 new cases and 43,170 deaths in the US during 2023 [3]. The significance of early breast cancer detection cannot be overstated, holding the potential to substantially lower mortality and morbidity rates, enhance prospects of cure and survival, and facilitate accessible and cost-effective treatment options [4].
There are two primary treatment options for breast cancer: local therapy, which comprises surgery and radiation therapy; and systemic treatment, which includes chemotherapy, endocrine (hormone) therapy, and targeted therapy [5,6]. The selection of the appropriate treatment for each patient depends on various factors, such as age, menopausal status, tumor subtype, and stage, as well as the patient’s overall health and preferences [5,7]. Moreover, systemic therapy traditionally follows surgery in the adjuvant setting, while in the neoadjuvant setting, it precedes surgery [8,9]. Neoadjuvant chemotherapy (NACT) can, in some cases, reduce tumor size, downstage the disease, minimize the extent of local surgery, and increase the likelihood of breast-conserving surgery (lumpectomy) over mastectomy [10,11]. Currently, it is considered the gold standard treatment for locally advanced breast cancer (LABC), which refers to an aggressive stage of breast cancer where the tumor size exceeds 5cm and may involve the skin or chest wall [12,13,14].
Additionally, NACT became the preferred treatment approach for HER2-positive (HER2+) and triple-negative breast cancers (TNBC), even in very early breast cancer (breast mass more than 1.5 cm), according to the American Society of Clinical Oncology (ASCO) and National Comprehensive Cancer Network (NCCN) guidelines [15,16,17,18]. In the case of HER2+ tumors, the neoadjuvant regimen often comprises the HER2-targeted therapy drugs (such as trastuzumab and pertuzumab) in addition to chemotherapy (e.g., anthracycline- or taxane-based therapies). In contrast, immunotherapy drugs (e.g., pembrolizumab) may be included in the neoadjuvant treatment of TNBC tumors [18,19]. Incorporation of these new drugs yields a paradigm shift in the management of these molecular types of breast cancer and increases the pathological complete response after neoadjuvant therapy. Neoadjuvant therapy (NAT) also has a crucial role in the guidance of the adjuvant treatment by evaluating the sensitivity of the tumor to therapy [20,21]. Moreover, the tumor response to NAT can be considered a significant prognostic factor for the likelihood of recurrence [21,22]. Given the numerous advantages of NACT treatment, this review will primarily focus on exploring its aspects.
The response to NACT can be classified into four categories: pathological Complete Response (pCR), pathological Partial Response (pPR), Progressive Disease (PD), and Stable Disease with No Response (SDNR) [10,23]. Several grading systems are used to evaluate pathological response, and the definition of pCR varies depending on these systems. For instance, while some systems assess the breast only, others consider both the breast and the Lymph Nodes (LNs). These grading systems encompass the Sinn Score, Sataloff, Miller-Payne grading system, Residual Cancer Burden (RCB), and AJCC ypTNM (8th Edition), among other systems discussed in a review by See et al. [23]. The neoadjuvant treatment setting often utilizes the same chemotherapeutic drugs as the adjuvant treatment setting, potentially resulting in comparable long-term and short-term side effects. These effects may include leukemia, infertility, osteoporosis, cardiomyopathy/heart diseases, fatigue, risk of infection, neuropathy, vomiting, nausea, cognitive impairment, and hair loss [24,25]. Therefore, the early prediction of patient responses aids in determining suitable treatment regimens, thereby reducing unnecessary toxicities, costs, and side effects related to NACT treatment. As a result, significant interest exists in developing predictive methods that utilize clinical, radiomic, pathological, and molecular markers [26]. The motivation for this review article stems from the unmet need for these prediction models. It is worth noting that some researchers such as O’Donnell et al. [27], Pesapane et al. [28], and Liang et al. [29] conducted systematic reviews and meta-analyses to compare the accuracies of the predictive models based on MRI images.
Recently, Artificial Intelligence (AI) and its sub-fields, namely machine learning (ML) and Deep Learning (DL), have assumed an increasingly pivotal role in predicting patients’ responses to treatment, while also optimizing personalized or precision medicine—a paradigm that intricately considers the individuality of each patient. This trend is underpinned by promising outcomes in detecting, diagnosing, and prognosing various cancers. Figure 1 delineates the fundamental constituents of a typical AI-driven treatment response prediction system. This system leverages medical images procured through diverse imaging modalities, encompassing both anatomical and functional perspectives, as its primary model inputs, followed by an intricate radiomics feature extraction procedure. Moreover, the prediction model can be supplemented with clinical markers (e.g., CA 125 and CA15-3), pathological indicators (such as estrogen receptor (ER), progesterone receptor (PR), Ki67 index, and HER2 expression), and treatment alternatives. The ultimate culmination of these inputs is the prediction of the patient’s response to NACT treatment, encompassing potential outcomes like pCR, pPR, PD, or SDNR.
Figure 1.
The main components of a typical AI-driven treatment response prediction system.
This review discusses studies published from 2010 to 2023 that utilized ML models, DL models, or statistical methods to predict responses in breast cancer patients who underwent NACT. Some researchers predicted Recurrence-Free Survival (RFS), Event-Free Survival (EFS), Disease-Specific Survival (DSS), or Progression-Free Survival (PFS), in addition to predicting the response to NACT. It is important to note that this review exclusively focuses on the outcomes of predicting patients’ responses to NACT. Studies that constructed their models based on conventional mammography and Contrast-Enhanced Spectral Mammography (CESM) images are detailed in Section 2, with their results, findings, markers, and patient numbers summarized in Table 1. Ultrasound-based studies are outlined in Section 3 and summarized in Table 2. Research-based on Positron Emission Tomography/Computed Tomography (PET/CT), Dynamic Contrast-Enhanced Magnetic Resonance Imaging (DCE-MRI), and Multi-modal imaging are addressed in Section 4, Section 5 and Section 6, respectively, with summaries presented in Tables 3–5. Throughout this paper, abbreviations such as ACC, AUC, SEN, SPE, PPV, and NPV denote accuracy, area under the curve, sensitivity, specificity, positive predictive value, and negative predictive value, respectively, in all tables.
2. Mammography and Contrast-Enhanced Spectral Mammography
Mammography serves as the standard imaging modality for the screening and detection of breast tumors. This low-dose X-ray imaging technique is commonly employed to identify masses, asymmetries, micro-calcifications, and architectural distortions with an accuracy ranging between 85% and 90% [30,31,32]. The two primary mammogram views are the mediolateral oblique (MLO) and craniocaudal (CC) views, supplemented by additional views like actual lateral, magnification, and point compression views, which are used for further assessment of abnormalities [33,34]. Notably, a limited number of studies (as indicated in Table 1) have employed mammography or contrast-enhanced spectral mammography to predict patients’ responses to NACT.
Recent AI studies predict early response to NACT using baseline mammogram images, showcased by Shin et al. [35] and Skarping et al. [36]. Shin et al. [35] developed a multi-scale patch-net resizing CC and MLO images to three scales (1792 × 1792, 1356 × 1356, 896 × 896) for a three-level image pyramid. They extracted features using ResNet-34 with a 3 × 3 kernel, and the classifier included a sigmoid activation followed by a fully connected layer. Kernel size affected performance: Area Under the Curves (AUCs) were (kernel 3) and (kernel 7); multi-scale outperformed single-scale (AUC = vs. AUC < ). Moreover, Skarping et al. [36] introduced a tumor detection model that distinguishes between pCR and non-pCR. The network features dual pathways based on ResNet-18. One processes tumor image patches, while the other handles contralateral cancer-free patches. Extracted features are concatenated and fed through fully connected layers to a soft-max layer. The model achieved notable metrics: AUC of , sensitivity of , and specificity of . However, the study’s limitations include a heterogeneous cohort in terms of tumor subtype and NACT period.
Contrast-Enhanced Spectral Mammography (CESM) is a dual-energy technique where low-energy and high-energy images, obtained after iodinated contrast administration, are subtracted to create diagnostic subtraction images [37,38]. It serves as an MRI alternative, especially for patients with contraindications like severe claustrophobia or MRI-incompatible implants (pacemakers, defibrillators, neurostimulators, cochlear implants) [39,40]. CESM shows comparable diagnostic accuracy and sensitivity to MRI in various studies [40,41,42], finding applications in breast cancer screening, diagnosis, tumor staging, monitoring, and predicting NACT response [37,38,43].
In addition to employing AI methodologies for predicting NACT responses, Xing et al. [44] utilized statistical approaches to forecast patient responses to NACT. Specifically, they employed a t-test statistical method to assess the significance of the reduction percentage of the CESM gray value (ΔCGV) in CC and MLO views for early response prediction. Their findings indicated that gray values of pCR were notably lower compared to non-pCR instances, revealing a significant discrepancy in ΔCGV between the two response categories after the second treatment cycle. Within the pCR group, ΔCGV was observed to be higher than in the non-pCR group, with a p-value of less than 0.001 and t-values of 5.430 and 3.942 for CC and MLO views, respectively. Moreover, the CC view exhibited an AUC of 0.776, a sensitivity of 75%, and a specificity of 72.15% at a cut-off value exceeding 26.41. However, it is noteworthy that relying solely on a single factor, namely the gray value, could be considered a significant limitation of their study.
In a separate study, Wang et al. [45] developed a radiomics nomogram that incorporates the radiomics score along with three features: Background Parenchymal Enhancement (BPE), Human Epidermal Growth Factor Receptor-2 status (HER-2 status), and Ki-67 index. This nomogram aims to predict the effectiveness of NACT in cancer treatment, utilizing multivariate logistic regression analysis. The study yielded an impressive AUC of 0.81. Their findings clearly highlight the radiomics model’s heightened discriminative power when compared to the pathological model. Notably, the radiomics model achieved AUCs of and in the validation set, outperforming the pathological model in both instances. Additionally, Mao et al. [46] employed multiple ML algorithms to examine the efficacy of radiomics features derived from both the tumor and its neighboring areas, encompassing intratumoral and peritumoral regions. They derived five distinct radiomics signatures from these regions: intratumoral, 5 mm peritumoral, 10 mm peritumoral, intratumoral + 5 mm peritumoral, and intratumoral + 10 mm peritumoral. Their analysis culminated in the finding that the radiomics features extracted from the intratumoral + 5 mm peritumoral region exhibited superior predictive performance, as evaluated by the Least Absolute Shrinkage and Selection Operator (LASSO) regression, with an AUC of , sensitivity of , and specificity of .
It is worth noting that the studies conducted by Wang and Mao [45,46] exhibited certain limitations, given their nature as single-institutional retrospective investigations reliant on relatively modest sample sizes. Such constraints undermine the broad applicability of their findings. Furthermore, the utilization of manual (as seen in [45]) and semi-automated (as observed in [46]) ROI segmentation introduces the potential for inter- and intra-observer discrepancies. Table 1 provides a comprehensive overview of these studies, encompassing mammography and CESM, detailing the research objectives, patient count, markers or predictors, outcomes, and significant discoveries.
Table 1.
Conventional Mammography & Contrast Enhanced Spectral Mammography (CESM).
Reference | Study Aim | Number of Patients & Study Type | Markers | Results & Findings |
---|---|---|---|---|
Shin et al. [35] | To construct a multi-scale patch learning method to early predict pCR to NACT using pre-NACT mammogram images. | 288 patients
Training (n = 228) Test (n = 60) Study type: single-center study |
|
The prediction performance using:
kernel size of 3: AUC: 0.803, SEN: 0.733, SPE: 0.767 kernel size of 7: AUC: 0.661, SEN: 0.5, SPE: 0.833 They found that when using extracted patches, the model performance was affected by kernel size. In addition, using the whole CC & MLO mammogram images outperformed ROI-based approaches. |
Skarping et al. [36] | To propose a DL-based model to predict the pCR to NACT depending on baseline digital mammograms. | 453 patients Training (n = 400) Validation (n = 53) Study type: single-center study based on both retrospective & prospective cohorts |
|
Prediction accuracy of the AI model: AUC: 0.71, SEN: 0.46, SPE: 0.9 They concluded that AI has the potential to assist in clinical decision-making. However, further research is needed with refined approaches and larger data sets to explore the utility of AI in predicting patients’ responses to NACT. |
Xing et al. [44] | To investigate the effect of the reduction percentage of the CESM gray value (CGV) in the early prediction of patients’ response to NACT (whether pCR or non-pCR). | 111 patients Study type: single-center retrospective study |
|
Before NACT, the differences in gray values between the pCR and non-pCR were not statically significant (p-value > 0.05). CGV after two cycles in pCR patients was higher than the non-pCR (p-value < 0.001). The diagnostic value of CGV using: CC view: AUC: 0.776, cut-off > 26.41, SEN: 75%, SPE: 72.15% MLO view: AUC: 0.733, cut-off > 13.59, SEN: 81.25%, SPE: 51.9% They found that CGV can predict response to NACT after the second cycle. |
Wang et al. [45] | Developed a radiomics nomogram to predict NACT-insensitive cancers prior to treatment based on CESM. | 117 patients Training (n = 97)
Validation (n = 20) Study type: single-center retrospective study |
|
Prediction accuracy for:
|
Mao et al. [46] | To study the performance of intratumoral and peritumoral radiomics acquired from CESM to predict the effect of NACT preoperatively. | 118 patients Training (n = 81) Validation (n = 37) Study type: single-center retrospective study |
|
The prediction accuracy for: Pathological markers: no significant risk factors were found. Radiomics: the AUCs based on: tumoral region: 0.74 5 mm peritumoral: 0.75 10 mm peritumoral: 0.78 tumor + 5 mm peritumoral: 0.85 tumor + 10 mm peritumoral: 0.84 The prediction model based on intratumoral+5 mm peritumoral yielded AUC: 0.85, SEN: 0.577, & SPE: 0.909 They concluded that the features extracted from the intratumoral + 5 mm peritumoral regions exhibited the best performance using LASSO Regression. |
3. Ultrasound
Ultrasound modality relies on detecting the reflected echoes of transmitted high-frequency sound waves [47]. Several researchers have utilized ultrasound-acquired images and data to predict patients’ response to NACT at an early stage, leveraging its accessibility and affordability. Moreover, its suitability for repeated scans during treatment is attributed to its independence from contrast agents and long scanning times [48,49,50]. Table 2 provides a summary of studies utilizing B-mode images, quantitative ultrasound parameters, and other ultrasound-based modalities for the early prediction of patients’ response to NACT. It is evident from Table 2 that the most frequently employed imaging markers in the literature to predict the NACT response encompass the spectral slope (SS), spectral intercept or 0-MHz intercept (SI), mid-band fit (MBF), average scatterer diameter (ASD), average acoustic concentration (AAC), attenuation coefficient estimate (ACE), and spacing among scatterers (SAS) [51,52]. Below, we will emphasize the most relevant factors for predicting NACT by utilizing markers extracted from ultrasound images.
Tadayyon et al. [52] utilized quantitative ultrasound (QUS) parameters derived from baseline images, as well as images captured after one week, four weeks, and eight weeks of NACT, to distinguish between responders and non-responders. Employing the k-nearest neighbor (KNN) approach, they assessed the potential of US parameters in discerning these response groups. Their investigation revealed that the most favorable outcomes were achieved through a combination of MBF, SS, and SAS, yielding accuracies ranging from to across different post-treatment time points. Additionally, they observed enhanced predictive accuracies, ranging from to , when baseline QUS parameters were amalgamated. Notably, despite conducting a prospective study, limitations were evident due to a relatively small sample size and an imbalanced dataset, consisting of 42 responders and 16 non-responders. To counteract this imbalance, they implemented random sampling with replacement.
Sadeghi-Naini et al. [48], Sannachi et al. [53], DiCenzo et al. [54], and Dasgupta et al. [55] incorporated US parameters with textural features extracted from each parametric map as non-invasive predictors of NACT response. Sadeghi-Naini et al. [48] employed Linear Discriminant Analysis (LDA) to investigate the efficacy of MBF, spectral slope, and 0-MHz intercept in predicting treatment response. They discovered that the optimal separability between the response groups was achieved by combining textural and spectral features extracted from MBF and 0-MHz intercept parametric images acquired after one week of treatment, yielding sensitivity and specificity of and an AUC of 1.
Furthermore, Sannachi et al. [53] and DiCenzo et al. [54] conducted textural analyses of SS, SI, MBF, ASD, AAC, ACE, and SAS parametric maps, employing three classifiers: k-Nearest Neighbor (KNN), Support vector Machine (SVM), and LDA. Sannachi et al. [53] investigated the predictive capabilities of spectral and textural features in relation to treatment response at one week, four weeks, and eight weeks. Their findings demonstrated that SVM outperformed KNN and LDA, achieving AUCs of 0.71, 0.87, and 0.92 at weeks 1, 4, and 8, respectively.
In contrast, DiCenzo et al. [54] conducted a multicenter study utilizing pre-treatment spectral and textural features. Their results indicated superior prediction performance by KNN with an accuracy of , sensitivity of , specificity of , and an AUC of . The optimal features were AAC-homogeneity, SI-energy, and SAS-energy. Other classifiers exhibited inferior performance: SVM attained an accuracy of and an AUC of , while FLD demonstrated an accuracy of and an AUC of .
In a prospective study conducted by Dasgupta et al. [55], five quantitative US parametric maps, twenty texture maps, and eighty higher-order texture derivatives were generated to discriminate between responders and non-responders. The authors employed three ML algorithms, namely FLD, SVM, and KNN. Notably, KNN exhibited exceptional performance, surpassing the other algorithms with an accuracy of , sensitivity of , specificity of , and an AUC of . In contrast, the AUCs of FLD and SVM did not exceed , and their accuracies remained below . The authors concluded that utilizing US texture-derivative features resulted in superior predictive performance compared to using textural features alone.
On the other hand, certain studies have integrated radiomics features with pathological features, which can be considered invasive predictors. Tadayyon et al. [56], Sannachi et al. [57], and Tadayyon et al. [58] incorporated US parameters and textural features with specific pathological features, namely estrogen receptor (ER), progesterone receptor (PR), and human epidermal growth factor receptor 2 (HER2). In 2017, Tadayyon et al. [56] conducted a prospective study and employed FLD, KNN, and SVM classifiers to analyze the predictive performance of pathological features and US parameters acquired prior to treatment from both the tumor core and its margins (at thicknesses of 3, 5, and 10 mm). The KNN classifier exhibited the highest performance with an accuracy of , a sensitivity of , a specificity of , and an AUC of when utilizing radiomics features for the tumor core and 5mm margins. However, the incorporation of pathological features with radiomics led to a decline in performance, as the accuracy and AUC decreased to and , respectively.
Furthermore, Sannachi et al. [57] categorized patients’ responses into three groups: complete responders, partial responders, and non-responders. They employed SVM to distinguish between these groups based on pathological and radiomics features extracted at 1, 4, and 8 weeks after the initiation of NACT. Their conclusion highlighted that the combination of pathological features with mean US parameters and texture features yielded the most robust prediction results, achieving accuracies of at week 1, at week 4, and at week 8. Conversely, the accuracies achieved using either pathological features or radiomics features alone did not surpass across all time points. Their findings underscored the need for a more extensive dataset encompassing an ample number of patients for each tumor subtype to facilitate more precise cross-validation.
In 2019, Tadayyon et al. [58] compared the prediction performances of the KNN classifier and the artificial neural network (ANN) classifier during the pretreatment stage. This analysis was based on features extracted from both the tumor and its margins (within a 5 mm perimeter around the tumor). The study encompassed both 2-class classification (categorizing responders and non-responders) and 3-class classification (identifying complete, partial, and non-responders). The results unveiled the superior performance of the ANN classifier across all experiments. In the context of binary classification, the ANN achieved an average accuracy of and an AUC of , while the KNN demonstrated an accuracy of and an AUC of . Furthermore, the study highlighted that pathological features played a pivotal role in the 3-class classification, whereas such significance was absent in the binary classification scenario.
Some studies utilized ultrasound-based techniques, including compression or strain elastography, as well as shear-wave elastography (SWE), for the purpose of monitoring and predicting NACT response. Both of these modalities evaluate the mechanical characteristics of tissues, particularly their stiffness and elasticity. Compression elastography involves assessing tissue deformation or strain following the application of static compression through a manual maneuver using an ultrasound transducer. In contrast, shear-wave elastography (SWE) quantifies the velocity of shear waves within the tissue, which are induced by focused acoustic radiation force [14,59,60]. Fernandes and colleagues [14] employed compression elastography to calculate relative changes in strain ratio (SR) of tumors throughout the treatment course. The strain ratio (SR) exhibited a noteworthy distinction between the two response groups following 2 weeks of NACT (p-value < ). They utilized two classifiers, KNN and Naive Bayes, for response prediction. The accuracy rates achieved by KNN at 1, 4, 8 weeks, and the preoperative scans, were , , , and , respectively. Notably, the Naive Bayes classifier outperformed its counterpart, attaining accuracies of , , , and across the four time points.
Additionally, Ma et al. [60] and Gu et al. [61] investigated the interplay between shear-wave elastography parameters and pathological features in predicting response. Ma et al. [60] introduced a multivariable linear regression model and demonstrated that the combination of the Ki-67 index with relative changes in SWE parameters (tumor stiffness) after the second cycle of NACT yielded effective predictive capability, surpassing the individual parameters. The AUCs for predicting non-responders using the Ki-67 index, the relative change of stiffness after the second cycle, and their combination were , , and , respectively. Similarly, Gu et al. [61] concluded that integrating Ki-67 with shear-wave parameters enhanced prediction performance at the mid-treatment stage, yielding an AUC of . Moreover, they identified mass characteristic frequency () as a novel predictor capable of determining the NACT endpoint for responders.
Several studies have employed DL techniques to predict treatment response, including Byra et al. [49,62], Xie et al. [63], Jiang et al. [50], Liu et al. [64], and Gu et al. [65]. In 2020, Byra et al. [62] introduced transfer learning using Siamese CNNs for pairwise image comparison between pre-treatment and post-first/second cycle images. With model fine-tuning, the AUC was for differentiating malignant and benign masses in US images, while without fine-tuning, the AUC reached . This led to the conclusion that the discriminatory features might not effectively predict response. In 2022, Byra et al. [49] proposed an RNN capable of processing both US images and raw RF data, using pre-trained CNNs as feature extractors. AUC was for response prediction from pre-treatment data using CNN pre-trained on US images and using RGB image-pre-trained CNN for data after the fourth cycle. Similarly, Xie et al. [63] constructed a dual-branch CNN where baseline and post-first cycle images were fed into separate branches, yielding better results than individual information use. Their model achieved notable metrics: sensitivity (), specificity (), and AUC (). For enhanced model generalization, further validation with multi-center data is recommended.
On the other hand, some researchers have incorporated invasive predictors into their DL models. Liu et al. [64], Gu et al. [65], and Jiang et al. [50] integrated pathological markers such as ER, PR, HER2, and Ki-67 into their models. Liu et al. [64] captured the dynamic changes of tumors before and after the first or second cycle using a Siamese network. The authors concluded that relying solely on pathological features did not yield satisfactory performance (with AUC not exceeding ). However, combining these features with dynamic change features resulted in impressive AUCs of and for the two external validation cohorts.
Furthermore, Gu et al. [65] developed a stepwise DL model to predict responses after the second and fourth cycles of NACT. They assessed the prediction performance of pathological features, radiomics features extracted by the DL model, and the combination of radiomics and pathological features at two time points (after the 2nd and 4th cycles). The results, as presented in Table 2, led to the conclusion that the proposed model effectively assists in early stepwise prediction. Nomograms have also been utilized in predicting NACT responses. Yang et al. [66] and Jiang et al. [50] developed nomograms for early assessment and prediction of treatment response. Yang et al. [66] combined pathological features with manually crafted radiomics features extracted from baseline images, images taken after the second cycle, and delta radiomics (the difference between pre- and post-treatment features). Their findings revealed that the nomogram, incorporating the Ki-67 index along with radiomics features from images captured before and after the second cycle, demonstrated the highest predictive accuracy, achieving an AUC of 0.866. In contrast, Jiang et al. [50] fused pathological features with radiomics features extracted through a DenseNet201-based CNN, in addition to manually crafted features from pre-treatment and post-treatment images. The resultant nomogram achieved an AUC of , an accuracy of , a sensitivity of , and a specificity of . Their study highlighted the efficacy of DL-based nomograms in effectively predicting pCR and offering significant insights for individualized therapy.
Table 2.
Ultrasound.
Reference | Study Aim | Number of Patients & Study Type | Markers | Results & Findings |
---|---|---|---|---|
Tadayyon et al. [52] | To evaluate the potential Quantitative Ultrasound (QUS) parameters for early prediction of LABC patients’ clinical and pathological response to NACT. | 58 patients (leave-one-out cross-validation) Study Type: single-center prospective study |
|
The prediction ACCs using KNN based on the combination of MBF, SS, and SAS were 60%, 77%, and 75% using images acquired at weeks 1, 4, & 8, respectively. SENs: 61%, 79%, & -. SPEs: 59%, 76%, & -.
Combining the QUS parameters at each week with pre-treatment achieved ACCs of 70%, 80%, and 81%, respectively. SENs: 76%, 80%, & -. SPEs: 64%, 79%, & -. Consequently, they found that incorporating pre-NACT QUS parameters could improve the prediction performance. |
Sadeghi-Naini et al. [48] | To investigate the efficacy of textural analysis of quantitative ultrasound (QUS) spectral parametric maps for the early prediction of clinical and pathological response to NACT in LABC patients. | 20 LABC patients Study Type: single-center study |
|
The predictive performance using LDA when combining spectral and textural markers extracted from MBF and 0-MHz intercept (AUC: 1, SEN: 100%, SPE: 100%) Other combinations of features yielded AUCs from 0.59 to 0.99, SENs (40–100%), and SPEs (47–93%). They found that combining textural & spectral biomarkers showed the best separability between responders & non-responders at early stages of NACT. |
Sannachi et al. [53] | To early predict LABC patients’ clinical and pathological response to NACT by developing computational algorithms based on quantitative ultrasound (QUS) & textural analysis. | 100 LABC patients (leave-one-out cross-validation) in addition to an independent test set for SVM-RBF (n = 24) Study Type: single-center study |
|
The accuracy of the SVM-RBF model in independent validation cohort at weeks 1, 4, & 8, respectively: Validation(1): ACCs: 82%, 78%, & 88% SENs: 87%, 80%, & 87% SPEs: 50%, 67%, & 100% Validation(2): ACCs: 72%, 81%, & 93% SENs: 73%, 84%, & 93% SPEs: 50%, 67%, & 100% They conclude that SVM-RBF outperformed the other classifiers (LDA & KNN) in differentiating responders & non-responders at all time points. Also, the most relevant features in distinguishing the two groups at weeks 1 & 4 were the changes in texture features, while at week 8 the change in mean QUS parameters were more significant. |
DiCenzo et al. [54] | To construct a model for the early prediction of LABC patients’ clinical-pathological response to NACT using radiomics extracted from pre-NACT quantitative ultrasound (QUS) images. | 82 LABC patients (leave-one-out cross validation) Study type: multi-center prospective study |
|
Features showed statistically significant differences between responders & non-responders (p < 0.05) were: SS, MBF, ASD, AAC, ASD-contrast, AAC-contrast, AAC-energy, and AAC-homogeneity.
The best performing ML classifier was KNN as AUC: 0.73, ACC: 87%, SEN: 91%, SPE: 83% They found that the patients’ responses can be predicted based on pre-NACT QUS radiomics with acceptable accuracy. |
Dasgupta et al. [55] | To evaluate the baseline QUS higher-order texture derivatives in predicting LABC patients’ clinical-pathological responses to NACT. | 100 LABC patients (leave-one-out cross validation) Study type: single-center prospective study |
|
The AUCs yielded by 3 ML algorithms (FLD, KNN, & SVM) using (QUS-Tex-Tex) were 0.61, 0.86 & 0.79, respectively. The best prediction performance achieved by KNN using (QUS-Tex-Tex) as AUC: 0.86, ACC: 82%, SEN: 87%, SPE: 81% They concluded that QUS texture-derivative features (AAC-CON-ENE, MBF-COR-ENE, SI-COR- ENE) can predict tumor response before the initiation of NACT. |
Tadayyon et al. [56] | To priori predict LABC patients’ clinical and pathological response to NACT based on QUS parameters and texture features extracted from tumor core and margins using ML algorithms. | 56 LABC patients (leave-one-out cross-validation) Study type: single-center prospective study |
|
Using KNN, ER, PR, & HER2, respectively, achieved AUCs: 0.67, 0.48, & 0.37. ACCs: 61%, 71%, & 48%. SENs: 55%, 95%, & 60%. SPEs: 79%, 0%, & 14%. Three ML algorithms were used (FLD, SVM, & KNN) and KNN showed the best prediction performance depending on the tumor core and 5 mm margin, it yielded AUC: 0.81, ACC: 88%, SEN: 90%, SPE: 79%. Combining molecular markers decreased the model performance AUC: 0.71, ACC: 79%, SEN: 86%, SPE: 57%. They found that response to NACT can be predicted using non-invasive QUS features extracted from the tumor core and margins, and combining molecular markers with QUS did not improve the prediction power. |
Sannachi et al. [57] | To early predict tumor clinical and pathological response to NACT in LABC patients using molecular markers, quantitative ultrasound (QUS) parameters, and textural features extracted from baseline and after 1, 4, and 8 weeks of NACT. They can differentiate between 3 groups of responses (complete, partial, and no response). | 96 LABC patients (leave-one-out cross-validation) Study type: single-center study |
|
The accuracies of the SVM-RBF classifier to differentiate between the 3 response groups at weeks 1, 4, & 8 using the following markers were:
Molecular alone: 38%, 37%, & 50% (SEN: -, SPE: -). Mean QUS + texture: 54%, 60%, & 59% (SEN: -, SPE: -). Mean QUS + texture + molecular: 79%, 86%, & 83% (SEN: -, SPE: -). They found that combining molecular features with mean QUS values and texture features improved the discrimination power between the three response groups. |
Tadayyon et al. [58] | To construct an artificial neural network (ANN) model to predict patients’ clinical and pathological response & survival prior to the start of NACT based on quantitative ultrasound (QUS) imaging and molecular markers. | 100 patients (they can be classified either into 2 groups: responders & non-responders or into 3 groups: pCR, pPR, & no-response) Study type: prospective study |
|
The best performance was attained using ANN when differentiating patients who showed some response (pCR + pPR) from no response patients AUC: 0.96, ACC: 96%, SEN: 93%, SPE: 98%. Using the KNN classifier to differentiate (pCR + pPR) from no-response patients led to lower predicting performance AUC: 0.67, ACC: 65%, SEN: -, SPE: -. The authors found that ANN showed good predictive performance and can be used to evaluate the effectiveness of the treatment as a step toward personalized medicine. |
Fernandes et al. [14] | To evaluate the ability of ultrasound elastography to differentiate between responders (pCRs) and non-responders (non-pCR) to NACT by monitoring changes in tumor stiffness induced by treatment. | 92 LABC patients
(leave-one-out cross-validation) Study type: single-center study |
|
Using the Naive Bayes classifier, pCR was distinguished from non-pCR at weeks 1, 4, 8, and at preoperative scan, respectively, achieved: AUCs: 0.64, 0.75, 0.77, & 0.81. ACCs: 72%, 84%, 83%, & 84%. SENs: 80%, 85%, 87%, & 84%. SPEs: 64%, 83%, 80%, & 85%. Using KNN & same time points AUCs: 0.44, 0.72, 0.66, & 0.64. ACCs: 60%, 73%, 74%, & 72%. SENs: 84%, 81%, 95%, & 85%. SPEs: 36%, 65%, 54%, & 55%. Their findings include: 1. Changes in the strain ratio (SR) correlate with tumor response. 2. Strain elastography can be used to predict response after 2 weeks. 3. The best classification performance attained at the preoperative scan using the NB classifier. |
Ma et al. [60] | To investigate the potential utility of share wave elastography (SWE) and Ki67 index as response predictors to NACT in invasive breast cancer. | 66 patients (response was classified according to the RCB protocol to RCB 0, I, II, &III). Study type: single-center prospective study |
|
The accuracies of predicting (pCR & RCBI) versus (RCBII & RCBIII) using E2, Ki67, and their combination, respectively, yielded: AUCs: 0.76, 0.79, & 0.88. SENs: 66.7%, 66.7%, & 100%. SPEs: 88.9%, 96.3%, & 72.2&. However, the accuracies of predicting (RCB-III) versus other response groups using the same features achieved AUCs: 0.82, 0.84, & 0.93. SENs: 68.18%, 86.36%, & 95.45. SPEs: 79.55%, 72.73%, & 79.55%. They found that a multivariable linear regression model combining ki67 with SWE parameters after the 2nd cycle of NACT showed better diagnostic performance than using each of them alone. |
Gu et al. [61] | To evaluate the role of share wave elastography (SWE) in early predicting of invasive breast cancer patients’ response to NACT according to the RCB score. | 62 patients (leave-one-out cross-validation) Study type: single-center prospective study |
|
Using SWE parameters achieved AUC: 0.75, SEN: 0.77, & SPE: 0.75 at mid-course, while adding Ki67 achieved AUC: 0.80, SEN: 0.72, & SPE: 0.73. They concluded that combining Ki67 with some SWE parameters improves the prediction performance. Moreover, can be considered to be a new response predictor & can determine the NACT endpoint. |
Byra et al. [62] | To propose two transfer learning approaches to early predict patients’ response to NACT based on US images acquired before and after the first and second cycles of NACT. | 39 tumors from 30 patients Study type: single-center retrospective study |
|
|
Byra et al. [49] | To develop recurrent neural networks (RNN) that can process regular US images and raw radio-frequency (RF) data to predict patients’ response to NACT. | 51 breast cancers from 39 patients (5-fold cross-validation) Study type: single-center study |
|
Using the pre-NACT data, the AUCs of the models which pre-trained based on US images, RF data, & ImageNet were 0.81, 0.72, & 0.71, respectively. SENs: 0.83, 0.57, & 0.69. SPEs: 0.70, 0.89, & 0.70. Using data acquired after 4th cycle AUCs: 0.91, 0.85, & 0.93. SENs: 0.9, 0.81, & 0.9. SPEs: 0.83, 0.83, & 0.87. They revealed that: 1. Pre-trained networks used for breast mass segmentation can be good feature extractors for response prediction problems. 2. Models based on b-mode images might be sufficient for accurate response prediction as RF data acquisition is considered to be difficult. |
Xie et al. [63] | To early predict the LABC patients’ pathological response to NACT by developing a novel DL approach named the dual-branch convolution neural network (DBNN) based on ultrasound images acquired before and after the first cycle of NACT. | 114 LABC patients Training (n = 91) Test (n = 23) Study type: single-center retrospective study |
|
The prediction results of:
Combining the US image information from pre-NACT & after 1st cycle yielded AUC: 0.939, SEN: 90.67%, SPE: 85.67%
Using only pre-NACT images achieved AUC: 0.73, SEN: 76%, SPE: 68.38% Using images after 1stcycle only, AUC: 0.739, SEN: 53.3%, SPE: 86.38%. They found that: Combining data from pre-NACT & after 1st cycle outperformed the models using each of them separately. DBNN achieved outstanding results in the noninvasive prediction of response. |
Liu et al. [64] | To early predict pCR in HER2-positive breast cancer patients using a Siamese multi-task network (SMTN) which performs tumor segmentation of pre- and early-treatment longitudinal ultrasound images, followed by capturing the dynamic change information of the tumor. | 393 HER2-positive breast cancer patients
Training
(n = 215)
Validation (two cohorts
n = 95 & 83) Study type: multi-center retrospective study |
|
Mean dice coefficient (DICE) of tumor segmentation in validation cohorts > 0.764 The accuracy of predicting pCR in the two validation cohorts using different models:
|
Gu et al. [65] | To early predict patients’ pathological response to NACT based on US images acquired prior to NACT, and after the second and the fourth cycles using the proposed novel deep learning radiomics pipeline (DLRP) which consists of two deep learning models. | 168 patients Training (n = 126) validation (n = 42) Study type: single-center prospective study |
|
The prediction performance of: Pathological2 model: (based on ER, HER2) AUC: 0.717, SEN: 76.2%, SPE: 61.9% Pathological4 model: (PR, HER2, reduction of tumor volume) AUC: 0.825, SEN: 61.9%, SPE: 76.2% DLR2: AUC: 0.812, SEN: 90.5%, SPE: 47.6% DLR4: AUC: 0.937, SEN: 81%, SPE: 90.5%. They concluded that depending on pathological markers only is not reliable enough for response prediction, while DLRP can effectively aid in early stepwise prediction. Moreover, hybrid models (pathological + DLR) showed no improvements in the AUC. |
Yang et al. [66] | To combine pathological markers with radiomics extracted from pre-treatment and early-treatment ultrasound images for developing a nomogram used in the early prediction of patients’ radiological response to NACT. | 217 patients
Training (n = 152) Test (n = 65) Study type: single-center retrospective study |
|
Radiomics features (baseline images) yielded AUC: 0.725, ACC: 67.7%, SEN: 77.8%, SPE: 65.8%. Radiomics features (after 2nd cycle) yielded AUC: 0.793, ACC: 72.3%, SEN: 60.5%, SPE: 92.6%. The nomogram combining Ki67 and radiomics signature achieved AUC: 0.866, ACC: 78.5%, SEN: 85.2%, SPE: 79.8%. They found that a nomogram combining Ki67 and radiomics signature showed the best-predicting performance. |
Jiang et al. [50] | To construct and validate a DL radiomics nomogram (DLRN) to predict pCR to NACT based on ultrasound images acquired before and after treatment. | 592 patients
Training (n = 356)
external validation cohort (n = 236) Study type: retrospective study |
|
|
4. PET/CT
Positron Emission Tomography (PET) serves as a prevalent nuclear imaging method for evaluating the glycolytic metabolism of tumors [47,67]. This technique enables the characterization of primary tumors, determination of lymph node stages, and assessment of residual tumors following NACT [68,69]. Distinguishing malignant tumors from adjacent normal cells relies on the heightened glucose metabolism within abnormal tissues compared to normal ones [47,68]. Typically, PET scans are complemented by other imaging modalities like computed tomography (CT), magnetic resonance imaging (MRI), and mammography, subsequent to the introduction of a radioactive tracer such as 18F-fluorodeoxyglucose (18F-FDG), 18F-fluorothymidine (18F-FLT), 18F-Fluciclovine, and others [70,71]. 18F-fluorodeoxyglucose (18F-FDG) stands as the most commonly employed radioactive tracer in oncology [72,73]. Pertaining to PET parameters, the maximum standardized uptake value (SUVmax) signifies the highest concentration of the tracer (typically FDG) within the region of interest (ROI) or volume of interest (VOI) [74,75]. Due to potential noise-induced variability in SUVmax values, additional parameters like SUVpeak and SUVmean may be calculated. SUVmean reflects the average standardized uptake value of all voxels within the voxel of interest, while SUVpeak represents the parameter derived from averaging the standardized uptake values within a fixed-size, small volume of interest containing the region with the highest SUV [76,77,78].
Beyond these PET parameters, the metabolic tumor volume (MTV) denotes the total count of voxels in a VOI exhibiting uptake surpassing a specific SUV threshold, and the total lesion glycolysis (TLG) emerges as the product of MTV and SUVmean [76,79]. Despite the elevated costs and radiation exposure associated with PET scans, they hold promise in predicting treatment responses [47]. This section will delve into studies utilizing either FDG PET/CT or FLT PET/CT for the early prediction of NACT responses, and these studies will be summarized in Table 3. Furthermore, key factors will be underscored for predicting NACT outcomes by leveraging markers extracted from PET/CT images.
Buchbender et al. [80] and Andrade et al. [75] employed statistical methods, specifically the Mann–Whitney test, to predict patients’ responses. Their approach relied on assessing the relative change in the standardized uptake value (ΔSUV) between images acquired before treatment and those obtained after the second cycle. Additionally, they determined the optimal ΔSUV cut-off for distinguishing between pathological pCR and non-pCR cases, as well as responders and non-responders, utilizing receiver-operating curve (ROC) analysis. A statistically significant difference in ΔSUV between the two response groups was observed. Buchbender et al. [80] identified a ΔSUV of for pCR cases and for non-pCR cases, yielding a p-value of . Furthermore, the optimal ΔSUV threshold for discriminating pCR from non-pCR was determined to be , resulting in a sensitivity of , specificity of , and accuracy of . Similarly, differentiation of responders from non-responders was achieved using an optimal SUV threshold, yielding a sensitivity of , specificity of , and accuracy of , with an overall accuracy of .
In contrast, Andrade et al. [75] discovered a significantly higher SUV value for pCR cases in comparison to non-pCR cases , with a p-value of . They determined the optimal SUV threshold for distinguishing pCR from non-pCR to be , resulting in a sensitivity of , specificity of , and accuracy of . Moreover, their differentiation of responders from non-responders using an optimal SUV threshold yielded a sensitivity of , specificity of , and accuracy of with an overall accuracy of .
On the other hand, Koolen et al. [81] investigated changes in the maximum standardized uptake value (SUVmax) after 2–3 weeks and 6–8 weeks of treatment administration. They calculated SUVmax for the primary tumor, lymph nodes, a combination of both (using logistic regression), and the highest SUVmax (either primary tumor or lymph node). Their findings revealed that in HER2+ tumors, the AUC for predicting response using SUVmax after 3 weeks of NACT ranged from to (specifically, for the primary tumor, for axillary nodes, for the combined approach, and when using the region of the highest SUVmax). However, the AUCs derived from SUVmax after 8 weeks did not exceed across all regions. In contrast, the prediction of response in triple-negative breast cancer (TNBC) tumors proved to be more accurate. The AUCs achieved using SUVmax after 2 weeks of NACT ranged from to (specifically, for the primary tumor, for axillary nodes, for the combined approach, and when using the region of the highest SUVmax). Remarkably, the AUCs yielded from using SUVmax after 6 weeks exceeded .
Some researchers, such as Groheux et al. [82,83], Humbert et al. [84], and Luo et al. [85], have integrated pathological markers with PET/CT parameters to predict the response to NCAT. In 2013, Groheux et al. [82] utilized a t-test to examine the correlation between the response and SUVmax of the primary tumor and lymph nodes at baseline and after two cycles in HER2-positive patients. They found that neither the pathological markers nor the SUVmax at baseline significantly correlated with patients’ response; the associated p-values were greater than . However, non-pCR could be effectively predicted using SUVmax after the second cycle of NACT, achieving an accuracy of , a sensitivity of , and a specificity of with a cut-off value of less than . In 2014, Groheux et al. [83] investigated the predictive value of PET parameters and pathological markers in triple-negative tumor response prediction. They demonstrated that using a SUVmax threshold of in the primary tumor achieved an accuracy of , while a threshold of achieved an accuracy of , a sensitivity of , and a specificity of . Furthermore, they noted that pathological markers yielded lower prediction accuracy (with tumor grade achieving and T-stage achieving ).
On the other hand, Humbert et al. [84] and Luo et al. [85] combined pathological markers with radiomics and employed logistic regression to predict treatment response. Humbert et al. [84] explored the value of SUVmax after the first treatment cycle, in conjunction with various clinical and pathological markers (such as CA , ACE, CA-125 values, estrogen receptor, progesterone receptor, HER2 expression, Ki-67 index, androgen receptor, epidermal growth factor receptor (EGFR), Scarff-Bloom-Richardson (SBR) grade, and cytokeratin (CK) tumor expression). Their findings indicated that high SUVmax (p = ), elevated Ki-67 (p = ), and negative EGFR status (p = ) demonstrated significant associations with pCR. They achieved a predictive accuracy of for pCR cases using SUVmax, with a sensitivity of , specificity of , and an optimal SUVmax cut-off value of . Additionally, the authors identified that non-pCR cases could be predicted with an accuracy of by combining positive EGFR status and SUVmax less than .
Luo et al. [85], on the other hand, integrated Ki-67 (a proliferation marker) with SUVmax. They computed SUVmax values at baseline, after the second cycle, and after the fourth cycle of treatment. The AUC values for using Ki-67 alone, SUVmax after the second cycle, and SUVmax after the fourth cycle were , , and , respectively. Their analysis revealed that while Ki-67 alone did not significantly predict pCR, its combination with radiomics markers enhanced predictive power, achieving an AUC of , sensitivity of , and specificity of .
Cheng et al. [86], Antunovic et al. [87], and Li et al. [88] investigated the predictive potential of pathological markers and textural features for pCR prediction using PET/CT. Cheng and colleagues [86] explored changes in SUVmax, MTV, TLG, and textural features (outlined in Table 3) between pre- and post-second cycle images, alongside pathological markers. Significant differences in relative changes of SUVmax, entropy, and coarseness were observed between pCR and non-pCR cases in the HER2-negative group, yielding AUCs of 0.928, 0.808, and 0.8, respectively, (p-values < ). In the HER2-positive group, skewness and SUVmax moderately predicted pCR with AUCs of and , respectively, (p-values: and ). The authors concluded HER2 status strongly correlated with response, favoring pCR in HER2-positive tumors. Additionally, the textural analysis predicted pCR after two NACT cycles, while TLG and MTV lacked predictive efficiency.
Conversely, Antunovic et al. [87] and Li et al. [88] assessed pathological and radiomics markers’ predictive power for pCR before NACT initiation. Antunovic et al. [87] proposed four logistic regression models. Model (1) included age and tumor molecular subtype, achieving an AUC of . Discrimination improved slightly by adding SUVmax and TLG (AUC = ). Model (2) combined second-order (correlationGLCM) and higher-order (CoarsenessNGLDM, GLNUGLZLM) textural features with age and molecular subtype, yielding an AUC of . Models (3) and (4) employed different predictor thresholds ( and , respectively). Model (3) achieved an AUC of , featuring age, molecular subtype, correlation, and coarseness. Model (4) attained an AUC of , incorporating model (3) features along with ER, NACT type, Ki67, and GLNU. The authors concluded that augmenting with SUVmax, TLG, second-order, and higher-order radiomics features did not enhance response prediction performance.
Li et al. [88] employed a Random forest (RF) model for predicting treatment response by identifying PET/CT radiomics predictors. They extracted 2210 radiomic features including Run VarianceGLRLM, Zone varianceGLSZM, LGLREGLRLM, and Difference AverageGLCM. These radiomic features resulted in an AUC of , sensitivity of , specificity of , and accuracy of . Incorporating these features with age enhancements improved performance, yielding an AUC of , sensitivity of , specificity of , and accuracy of in the independent validation set. In contrast, the pathological model exhibited poor predictive performance, with an AUC of .
As previously mentioned, various tracers can be administered for the PET/CT scan. FLT has served as an indicator of proliferation [89], and a handful of studies have utilized FLT PET/CT scans to early predict responses to NACT. Crippa et al. [90] computed the SUVmax values in both the primary tumor and the predominant lymph node, along with their relative changes between baseline and post-first-cycle NACT images. They introduced a linear combination based on SUVmax in both the tumor and predominant lymph node, yielding an impressive AUC of and a p-value below . This combination effectively discriminated between patients with RCB indices of 0 & I (indicating near-complete response) and patients with RCB indices of II & III (representing moderate & extensive residual disease). The authors highlighted the potential utility of FLT PET/CT in predicting NACT response, stressing the importance of further validation in a larger population.
Table 3.
PET/CT.
Reference | Study Aim | Number of Patients & Study Type | Markers | Results & Findings |
---|---|---|---|---|
Buchbender et al. [80] | To test the ability of FDG-PET/CT to differentiate pCR lesions from non-pCR lesions early, after the second cycle of NACT. | 26 patients Study type: retrospective study |
|
The Mann–Whitney test was used to discriminate between response groups. After 2nd cycle, SUV were significantly higher for pCR (−89%) than non-pCR (−51%) and the p-value = 0.003.
The optimal threshold of SUV that discriminates:
|
Andrade et al. [75] | To investigate the correlation between the relative change in the standardized uptake value (SUV) and the pathological response to NACT using FDG-PET/CT. | 40 patients (with invasive ductal breast carcinomas) Study type: single-center prospective study |
|
The Mann–Whitney test was used to discriminate between response groups. After the 2nd cycle, SUV were significantly higher for pCR (−81.58%) than non-pCR (−40.18%) and the p-value = 0.001.
The optimal threshold of SUV that discriminates :
|
Koolen et al. [81] | To assess the value of FDG PET/CT scans of the primary tumor and lymph nodes in predicting pCR to NACT taking tumor subtype into consideration. | 107 patients Study type: prospective study |
|
The AUCs ranges of predicting response using SUVmax for the 4 regions of interest were:
In HER2+ tumors
They concluded that PET/CT could accurately predict pCR in triple-negative tumors, especially after 6 weeks of NACT. However HER2+ tumors showed a weaker association between response and the changes in SUVmax. |
Groheux et al. [82] | To examine the value of 18F-FDG-PET/CT in the early identification of non-pCR cases after the second cycle of NACT in HER2+ breast cancer patients. | 30 HER2+ locally advanced breast cancer patients Study type: single-center prospective study |
|
They used a t-test to find the correlation between the response and
tumor grade, ER, axillary status, SUVmax1, SUVmax2, and SUVmax, and
p-values were 0.5, 0.8, 0.3, 0.08, 0.0001, and 0.001, respectively. Predicting non-pCR using:
|
Groheux et al. [83] | To assess the value of FDG PET parameters and pathological markers in the early prediction of patients’ pathological response to NACT and event-free survival (EFS) in TNBC patients. | 50 TNBC patients Study type: prospective study |
|
The ACCs of predicting response using tumor grade and T-stage were 54% and 68%, respectively, (SEN: -, SPE: -). Predicting response using SUVmax in the primary tumor at a cut-off = −50%, achieved an ACC of 80% (SEN: -, SPE: -), while using a cut-off = −42% achieved ACC: 74%, SEN: 58%, SPE: 100%. The threshold of −42% was chosen because it achieved a better prediction of relapse. They revealed that pathological markers were less predictive of response compared to PET parameters which can predict response in TNBC patients. |
Humbert et al. [84] | To assess the value of tumor metabolic response (acquired by FDG-PET/CT), in addition to clinical and pathological markers in the early prediction of pCR to NACT. | 50 TNBC patients Study type: single-center prospective study |
|
High SUVmax (p = 0.002), high Ki-67 (p = 0.016), and negative EGFR (p = 0.042) showed significant association with pCR. Predicting pCR using SUVmax: cut-off = −50 %, ACC: 75%, SEN: 74%, SPE: 76%. Non-pCR could be predicted by combining +ve EGFR status and SUVmax < −50% with an ACC of 92%, SEN: -, SPE: -. They concluded that metabolic response combined with EGFR status can help in the early prediction of response. |
Luo et al. [85] | To assess the value of Ki-67 expression and FDG PET/CT in predicting pathological response to NACT in LABC patients. | 361 patients
Training (n = 301) Validation (n = 60) Study type: single-center prospective study |
|
The prediction accuracy of pCR:
|
Cheng et al. [86] | To determine whether textural features extracted from 18F-FDG PET/CT images acquired before and after the second cycle of treatment can predict pCR to NACT. | 61 patients with LABC Study type: single-center retrospective study |
|
|
Antunovic et al. [87] | To assess the role of radiomics (extracted from FDG PET/CT) combined with pathological markers in the prediction of pCR to NACT in patients with locally advanced breast cancer. | 79 patients
100 iterations of 10-fold cross-validation Study type: single-center retropective study |
|
They proposed 4 logistic regression models, and the selected features for each model were mentioned in the main paragraph. The AUCs of models 1, 2, 3, and 4 were 0.71, 0.72, 0.70, and 0.73, respectively, (SENs: -, SPEs: -). They concluded that: 1. SUVmax and TLG did not show a good prediction performance of pCR. 2. The discriminatory power of the model did not improve by adding second and higher-order radiomics features. 3. A larger cohort is still needed to better investigate/judge the potential predictive role of radiomics. |
Li et al. [88] | Construct an automated model to specify radiomics predictors of pCR and treatment response prior to NACT based on FDG PET/CT images. | 100 patients
Training (n = 70, 30 times 10-fold cross-validation) Independent validation (n = 30) Study type: single-center retrospective study |
|
Prediction accuracy for:
|
Crippa et al. [90] | To examine the ability of 18F-3-deoxy-3-fluorothymidine positron emission tomography (FLT PET) in predicting breast cancer patients’ pathological response after the first cycle of NACT. | 15 LABC patients Study type: prospective study |
|
The SUVTmax can predict (pCR+RCBI) yielding AUC: 0.91, p < 0.001, cut-off ≤ −52.9%, ACC: 93.3%, SEN: 83.3%, SPE: 100%,
while SUVNmax can differentiate (RCBIII) from other response groups yielding AUC: 0.77, p = 0.119, SEN: -, SPE: -. The linear predictive score achieved AUC: 0.94, p < 0.001, SEN: -, SPE: - to differentiate RCB (0 & I) from RCB (II & III). They preliminary found a potential utility of FLT PET in predicting & monitoring response to NACT. However, these results need to be validated on a large patient population. |
5. DCE-MRI
Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) typically necessitates the administration of a contrast agent (CA) for delivering functional (dynamic) insights into the tissues. It offers the ability to reveal details about tumor vascularity by non-invasively gauging fluctuations in contrast to agent uptake [91,92]. Additionally, DCE-MRI furnishes information about tumor morphology encompassing size, shape features, and textural heterogeneity. This imaging technique finds utility in screening high-risk women, diagnosing breast cancer, staging breast tumors, evaluating treatment efficacy, and predicting early response to NACT [91,93]. Table 4 presents a compilation of studies employing DCE-MRI data to forecast NACT response based on quantitative or semi-quantitative parameters.
NACT Prediction using DCE-MRI Radiomics: Several studies employed statistical methods, such as the t-test, Mann–Whitney test, or Kruskal–Wallis test, to assess the effectiveness of textural features in early response prediction based on pre-treatment DCE-MRI images. Ahmed et al. [94] and Teruel et al. [95] extracted 16 Gray-Level Co-occurrence Matrix (GLCM) texture features and examined their significance in distinguishing various response groups using statistical techniques. Ahmed et al. [94] observed that contrast and difference variance yielded p-values ranging from to , demonstrating a significant distinction between two response groups (partial responders & non-responders) using images captured at 1 and 2 minutes after contrast administration. Furthermore, they noted elevated contrast and difference variance values in the non-responders group, suggesting that tumors with heightened heterogeneity might exhibit reduced responses to chemotherapy.
In a similar vein, Teruel et al. [95] employed the same 16 GLCM features to differentiate between stable disease, partial response, and complete response. They identified that sum variance, entropy, and difference variance displayed statistical significance in distinguishing response groups at the 2-minute post-contrast mark, achieving p-values of , , and , respectively. Additionally, they compared textural features between stable disease and complete response groups, finding that 8 features achieved p-values below . The most noteworthy features were entropy, sum variance, and angular second moment. Using these features for predicting stable disease, ROC analysis produced AUCs of , , and , respectively.
Furthermore, sum variance, sum entropy, entropy, and difference variance exhibited significant differences between the pathological minimal residual disease group and the pathological non-responder group. Sum variance and sum entropy yielded AUCs of 0.689 and 0.686 for predicting non-responders. While Ahmed et al. [94] and Teruel et al. [95] employed the same 16 GLCM features, disparities in study design and response group definitions rendered their results incomparable. Nevertheless, they concurred that the optimal time point for response prediction was 2 min post-contrast, and these texture features could aid physicians in pre-treatment prediction of patient responses.
Machine learning algorithms have also been employed for predicting patients’ responses based on radiomics features, specifically GLCM or Gray-Level Run Length Matrix (GLRLM) features extracted from DCE-MRI images, as demonstrated by Giannini et al. [96], Fan et al. [97], and Cain et al. [98].
Giannini et al. [96] developed a Computer-Aided Diagnosis (CAD) system that automates tumor segmentation, extracts GLCM and GLRLM textural features from baseline images, and subsequently predicts pathological responses. They conducted predictions at both breast levels, including pCR and non-pCR cases, and breast and axillary levels encompassing pathologic complete response with or without residual metastatic lymph nodes (pCRN and non-pCRN, respectively). Logistic regression and Bayesian classifiers were employed for predicting pCR and pCRN. The logistic regression model achieved an AUC of 0.795 for pCR prediction (SEN: , SPE: ). In contrast, the same model achieved an AUC of for pCRN prediction (SEN: , SPE: ). Conversely, the Bayesian classifier yielded an accuracy, sensitivity, and specificity of , , and , respectively, for pCR prediction, and , , and , respectively, for pCRN prediction.
Fan et al. [97] employed an evolutionary algorithm to select optimal features from pre-treatment images, encompassing morphological, dynamic, textural (GLCM), first-order statistical, and background parenchymal (BPE) attributes. Their retrospective study encompassed two patient cohorts: the main group () and the reproducibility group (). Utilizing logistic regression, they predicted treatment response and conducted leave-one-out cross-validation (LOOCV) on the main cohort, yielding an AUC of , sensitivity of , and specificity of . Conversely, the reproducibility cohort exhibited an AUC of , sensitivity of , and specificity of . Moreover, validating the main cohort features on the reproducibility cohort produced an AUC of , while reproducibility cohort features on the main cohort resulted in an AUC of . BPE features were identified as performance-enhancing, with combined lesion and BPE features outperforming their individual counterparts.
Similarly, Cain et al. [98] introduced logistic regression and SVM models for predicting pCR to NAT. Encompassing 529 radiomics features, these models incorporated tumor and surrounding tissue attributes, with 12 features selected via stepwise multilinear regression. Significant features included the tumor-based attribute “change_in_variance_of_uptake” and volumetric Fibroglandular Tissue (FGT) enhancement. Models were trained on all NAT recipients and subsequently applied to sub-populations: NACT recipients and (TN/HER2+) NAT recipients. AUC values for (TN/HER2+) patients were and using logistic regression and SVM, respectively. Other sub-cohorts exhibited lower prediction performance, with AUCs below . The study underscored the potential of DCE-MRI images preceding NAT initiation in early pCR prediction, particularly for (triple negative/HER2+) patients undergoing NAT.
Eom et al. [99] employed statistical methods to assess the correlation between DCE-MRI and clinicopathological features with pCR. Their findings indicated that the tumor enhancement pattern (extracted from pre-NACT images) and the shrinkage pattern (extracted from post-NACT images) exhibited associations with pCR, with corresponding p-values of and . Furthermore, individuals achieving pCR were more inclined to display homogeneous enhancement and a concentric tumor shrinkage pattern compared to non-pCR patients, with odds ratios of and , respectively.
Further, Li et al. [100] investigated the correlation between pCR and four quantitative features mentioned in Table 4 through logistic regression analysis. Their analysis encompassed the entire patient cohort, revealing individual feature AUCs that did not surpass ; however, a fusion of these features resulted in an enhanced AUC of . Furthermore, the researchers categorized the data into four subgroups based on tumor subtype, classified by HR/HER2 status. The researchers concluded that the integration of the four features substantially enhanced the predictive capacity for pCR across all subtypes and the entire dataset.
Several studies have employed both DCE-MRI quantitative parameters (Ktrans, Kep, Ve, Vp, and i) and semi-quantitative parameters (such as wash-in, wash-out, peak enhancement, etc.) to predict treatment response. Li et al. [101], Tudorica et al. [102], and Drisis [103] utilized statistical methods for response prediction. In one study, Li et al. [101] assessed both quantitative and semi-quantitative DCE parameters to identify response predictors and distinguish between responders and non-responders. The semi-quantitative parameter, signal enhancement ratio washout volume, exhibited significant differences between the two response groups with an AUC of and a p-value of . Regarding quantitative parameters, Kep (estimated by three models: Tofts-Kety, extended Tofts-Kety model, and fast exchange regime model) achieved AUCs of , , and , respectively. The authors proposed Kep and signal enhancement ratio washout volume as potential response predictors, but emphasized the need for confirmation through further prospective studies.
In another study, Tudorica et al. [102] employed univariate logistic regression and C statistics to predict early pCR and non-pCR. They evaluated pharmacokinetic parameters using both the Tofts model (TM) and the Shutter speed model (SSM), along with the longest tumor diameter. The percentage change in Ktrans (TM), Ktrans (SSM), kep (TM), and i after one cycle of NACT yielded C values exceeding , indicating excellent predictive capability. Moreover, ve (TM) and ve (SSM), estimated after one cycle or at the midpoint, demonstrated C values between 0.8 and 0.9, indicating good predictive potential. However, the percentage changes in the longest diameter after one cycle and at the midpoint exhibited poor prediction with C values below .
Additionally, Drisis et al. [103] statistically assessed the predictive potential of maximum tumor diameter, Ktrans and Ve for early identification of responders and non-responders. The dataset was stratified into three subgroups based on tumor subtype. Using the pre-treatment scan, Ktrans achieved AUCs of () and () for the entire population and the triple-negative subgroup, respectively. Additionally, the baseline maximum diameter exhibited higher values in the non-response group and achieved an AUC of for the entire dataset. For the second scan after 2 or 3 NACT cycles, Ktrans achieved AUCs of , , and for the entire population, TN, and HER2+ groups, respectively.
On the other hand, Thibault et al. [104] extracted 1043 textural features from each of the 13 parametric maps. They employed a ridge regression model to predict RCB index values based on each map-feature pair (where RCB=0 indicates pCR, while any other RCB index indicates non-pCR). Their results revealed that distinguishing between pCR and non-pCR using the Ktrans (SM) map with RLM and the Gray-level non-uniformity feature achieved a sensitivity and specificity of (with an AUC of 1). Discrimination based on the Ve(SSM) map with Haralick and contrast features yielded a sensitivity of and specificity of . While validation on a larger cohort is necessary, these findings suggest that analyzing tumor heterogeneity through textural analysis of DCE parametric maps can aid in the early prediction of treatment response.
Furthermore, Lee’s group [105] assessed baseline perfusion parameters (Ktrans, Kep, and Ve) for both the tumor and contralateral breast’s background parenchyma (BPCL). They also performed 3D histogram analysis (including skewness, mean, kurtosis, 25th, 50th, and 75th percentiles) for each perfusion parameter using logistic regression. Although individual perfusion parameters exhibited limited predictive capacity for pCR, with AUCs ranging from 0.449 to 0.683, combining these parameters enhanced prediction performance. Notably, combining the skewness of tumor Ktrans and background parenchyma Ktrans yielded an AUC of 0.760 (p-value = ). The most robust prediction performance emerged from combining the skewness and 50th percentile of tumor Ve with Ve of BPCL, achieving an AUC of ().
Aside from employing statistical methods, researchers also employed ML algorithms for predicting treatment response based on images acquired either before treatment or after the first cycle of NACT. Ashraf et al. [106], Braman et al. [107], Caballo et al. [108], and Drukker et al. [109] conducted their studies using pre-treatment images, whereas Machireddy et al. [110] utilized images acquired after the initial cycle of NACT along with baseline images.
Ashraf et al. [106] employed logistic regression with leave-one-out cross-validation (LOOCV) to differentiate between pCR and non-pCR cases, based on pre-treatment images from 15 patients in the I-SPY-1 trial dataset. They incorporated morphological, kinetic features, and kinetic statistic features, comparing their outcomes with commonly used features. The proposed kinetic statistic features achieved an AUC of , while conventional features such as longest diameter, size-to-energy ratio (SER), maximum peak enhancement (MPE), and features derived from the characteristic kinetic curve achieved AUCs of , , , and , respectively. Their findings suggest that heterogeneity in kinetic statistics could enhance the predictive performance for treatment response using baseline images.
Furthermore, Machireddy and colleagues [110] employed a support vector machine (SVM) classifier to distinguish between pCR and non-pCR cases based on data acquired before and after the initial treatment cycle. They evaluated the predictive performance of GLCM, GLRLM, single-resolution, and multi-resolution fractal features from Ktrans, Kep, Ve, and i parametric maps. They observed that GLCM and GLRLM features tended to overfit the data, yielding markedly lower AUCs in the test set and higher AUCs in the training set. Moreover, they found that multi-resolution fractal features outperformed other features, achieving AUCs of , , , , and for Kep, i, Ve, Ktrans, and the concatenation of all parametric maps, respectively.
Conversely, Braman et al. [107] and Caballo et al. [108] utilized intratumoral and peritumoral features extracted from pre-treatment images to predict pCR and non-pCR cases. In their study, Braman et al. [107] included 117 patients (78 in the training set and 39 in the test set) and employed five different ML classifiers to distinguish pCR cases.
The optimal AUCs for SVM, LDA, Diagonal Linear Discriminant Analysis (DLDA), Naive Bayes (NB), and Quadratic Discriminant Analysis (QDA) were , , , , and , respectively. Furthermore, an additional experiment was conducted where patients were categorized into subgroups based on hormone receptor type: (HR+/HER2-) and (TN/HER2+). The AUCs of the classifiers are outlined in Table 4. In this second experiment, a 3-fold cross-validation approach was adopted due to the limited sample size in the two subgroups. The authors discovered that kinetic features displayed no significant disparity between the two response groups in all experiments. Intriguingly, peritumoral features, obtained from the region adjacent to the tumor, could be amalgamated with intratumoral features to enhance response prediction.
Conversely, Caballo et al. [108] devised logistic regression ML models to differentiate between pCR and non-pCR cases, utilizing baseline intra-tumoral and peri-tumoral features. The multivariate analysis achieved an AUC of when predicting pCR using the complete dataset. Additionally, upon stratifying the dataset according to tumor molecular subtype, the model demonstrated promising outcomes with AUCs of , , , and for luminal A, luminal B, HER2-enriched, and TN tumors, respectively. The study revealed that enhancement kinetics heterogeneity and time-dependent texture (4D texture) features held predictive value. Moreover, features extracted from both the tumor core and peritumoral region exhibited significance in predicting pCR, with predictive features from the tumor core contributing to and peritumoral features to of the prediction.
Furthermore, Drukker et al. [109] employed baseline radiomics features to predict pCR and lymph node status (LN status) following NACT in LN-positive patients. Linear discriminant analysis was employed to assess the classification performance of each feature individually, distinguishing between responders and non-responders. In contrast to most studies, they unveiled that features extracted from the primary tumor lacked meaningful predictive capability. Intriguingly, the features derived from LNs, especially statistics, demonstrated robust predictive performance, with an AUC of up to for pCR prediction and up to for LN status prediction following NACT.
In contrast to conducting a comprehensive investigation of the entire tumor for response prediction, certain researchers, such as Wu et al. [111] and El Adoui et al. [112], have adopted a strategy of partitioning the tumor into distinct subregions based on their respective response characteristics. Wu and colleagues [111] employed K-means clustering to segment the tumor into subregions using eigenmaps derived from principal component analysis (PCA). This process yielded three clusters or subregions within the tumor, identified by patterns of contrast agent washin and washout. Within each subregion, they extracted four Haralick texture features and leveraged the variations in these features between the baseline scan and the scan conducted after one treatment cycle to predict the tumor’s response. Their analysis encompassed textural features from enhancement maps, as well as eigenmaps both of the entire tumor and of individual subregions. Notably, their findings indicated that the most effective prediction performance was achieved by utilizing radiomics predictors derived from eigenmaps of distinct tumor subregions, resulting in an impressive AUC of , a sensitivity of , and a specificity of .
On a related note, El Adoui et al. [112] introduced a parametric response map (PRM) as a visual representation, employing a color map to delineate subregions of the tumor in accordance with their respective response types. This method involved a comparative analysis of tumor volumes and intensities before and after the initial NCAT cycle.
The classification of patients as positive responders under the PRM paradigm hinged on the percentage of positive responses exceeding that of negative responses, and vice versa for negative responders. Notably, their analysis demonstrated a correspondence between the proposed PRM approach and the ground truth, substantiated by p-values of for positive responses and for negative responses, as well as impressive AUC values of and , respectively, underscoring the effectiveness of their method.
El Adoui et al. [113] constructed a CNN model trained using baseline images and the images acquired after the initial NACT cycle, along with a fusion of these images to differentiate responders and non-responders to NACT. The experiments were replicated twice: once using images without tumor segmentation, and once using images with tumor segmentation by the U-Net DL architecture. When utilizing segmented tumor images, the AUC values were for baseline images, for images after the first cycle, and an improved for their combination. In contrast, the unsegmented tumor images demonstrated higher AUCs of , , and notably elevated performance of for baseline, post-first cycle, and combined images, respectively. They revealed that the utilization of unsegmented images outperformed segmented images, suggesting the potential utility of peritumoral areas in enhancing response prediction.
In a separate study, Khanna et al. [114] explored the role of transfer learning in response prediction. Their approach involved integrating pre-trained CNNs with ML techniques for prediction tasks. Feature extraction employed ResNet-18 and ResNet-50, while classification utilized ML methods like Decision Trees, Naïve Bayesian, KNN, and SVM. The dataset was divided using two strategies: 10-fold cross-validation and a hold-out validation, with a training to testing ratio. The authors highlighted the efficacy of integrating Fine KNN (with K = 1) and ResNet-18, employing Mann–Whitney U test for feature selection. This combination yielded outstanding predictive performance, achieving an AUC of 1, a sensitivity of , a specificity of , and an accuracy of when using the hold-out validation technique.
NACT Prediction using DCE-MRI Radiomics and Pathological Markers: Researchers also explored the combination of pathological markers and radiomics markers to predict response, either before neoadjuvant chemotherapy (pre-NACT) or at an early stage. Jimenez et al. [115] studied the role of integrating radiomics features extracted from pretreatment scans with a pathological marker which is tumor-infiltrating lymphocyte (TIL) in predicting pCR cases. They suggested that patients who achieved high levels of TIL (more than ) and a radiomics signature value < were considered to be pCR. Their model achieved an AUC of , an accuracy of , a sensitivity of , and a specificity of . They found that this combination of features outperformed using either radiomics features or pathological features alone.
Conversely, Golden et al. [116] performed textural analysis on kinetic maps extracted from pre and post-NACT scans. Also, morphological and pathological features were utilized. The logistic regression model’s AUC values for predicting pCR, residual lymph nodes (LNs), and residual tumor and lymph nodes, using pre-NACT radiomics features, were , , and , respectively. However, the use of pathological markers alone exhibited limited predictive power, with AUCs below for pCR and residual tumor and LNs. Conversely, it demonstrated moderate predictive power (AUC = ) for predicting residual LN metastases. Moreover, patterns of tumor response did not exhibit significant predictive power in any model (AUC < ).
Jahani’s group [117] extracted two distinct categories of features from the registered DCE scans, obtained before and after the initial cycle of NACT. The first category encompassed voxel-wise tumor deformation features, which elucidate alterations in tumor size, orientation, and shape (specifically, Jacobian (which is the ratio of the volume after the first cycle of NACT to the baseline volume), Anisotropic Deformation Index (ADI), and Slab-Rod Index (SRI)). The second category comprised voxel-wise changes in kinetic features (PE, WIS, WOS, and SER). These radiomics features achieved an AUC of 0.74 when employed in a logistic regression model to predict pCR. Additionally, an exploration into demographic and pathological markers’ potential for predicting pCR and RFS was conducted. The combination of age, race, hormone receptor status, and FTV resulted in an AUC of for predicting pCR and incorporating demographic, pathological, and voxel-wise features raised the AUC to . The authors proceeded to compare the predictive efficacy of voxel-wise features with that of aggregate features (representing mean or average value changes across the entire tumor). The aggregate features exhibited an AUC of 0.71, leading to the author’s conclusion that, for predicting pCR, voxel-wise features outperformed aggregate features.
Conversely, Sutton et al. [118] utilized a random forest classifier to evaluate and categorize pCR based on radiomics features extracted from both pre and post-NACT images, in conjunction with the tumor molecular subtype. Three models were constructed; the first model, utilizing solely radiomics features, yielded an AUC of , a sensitivity of , and a specificity of . Subsequent integration of the tumor molecular subtype with radiomics features marginally improved performance, resulting in an AUC of , a sensitivity of , and a specificity of . The third model, based on radiomics features excluding DCE-MRI intensity features, demonstrated similar performance to the first model (AUC: , SEN: , SPE: ), underscoring the value of textural and edge-based radiomics analysis in pCR classification.
In a separate study, Fan and colleagues [119] scrutinized tumor heterogeneity changes via morphological, first-order statistical, and textural analysis of images obtained before and after two cycles of NACT. A Support Vector Machine (SVM) classifier was employed to predict responders and non-responders. The prediction performance of baseline images and the Jacobian map proved subpar, with AUCs of and , respectively. Conversely, using images from the second cycle of NACT yielded an AUC of . Furthermore, a model based on feature changes between images taken during and prior to treatment resulted in an AUC of . Notably, combining features from all aforementioned images with the molecular subtype produced the most robust prediction performance, boasting an AUC of , a sensitivity of , and a specificity of .
Hussain et al. [120] integrated molecular subtyping and the Ki-67 index with radiomics features, subsequently comparing the predictive capabilities of five ML classifiers (SVM, Naive Bayes, KNN, Decision Tree, and Ensemble RUSBoosted Tree) in determining pCR patients, utilizing data from the I-SPY-1 TRIAL dataset. Employing the Ensemble RUSBoosted Tree model, when considering solely the molecular subtype, they achieved an AUC of 0.82 alongside an accuracy of . Conducting a textural analysis on MRI images acquired at baseline, after the first cycle, and at mid-treatment resulted in AUCs of , , and , and corresponding accuracies of , , and , respectively. Notably, merging the textural analysis results from the two time points (before and after the first cycle) yielded an impressive AUC of (with accuracy at ). However, the integration of the molecular subtype further enhanced prediction performance, elevating the AUC to (with accuracy reaching ).
Parametric response maps have been utilized to investigate regions of increased and decreased intensity within tumors at an early stage of treatment, aiding in response prediction. This approach has been adopted by Cho et al. [121] and Drisis et al. [122]. In the study by Cho et al. [121], a t-test was employed to compare the predictive performance of traditional pharmacokinetic parameters (specifically Ktrans, Kep, and Ve) with PRM analysis. The PRM analysis involved a voxel-wise comparison between baseline images and those acquired after the first cycle of NACT. Voxels with increased intensity (>10%) were labeled as PRMSI+, while those with decreased signal intensity were labeled as PRMSI-. Contrary to the conclusions drawn by many studies, the authors found no significant difference between pCR and non-pCR groups in terms of pharmacokinetic (PK) parameters and tumor volume change. Cho and colleagues demonstrated that voxels with increased signal intensity (PRMSI+) achieved an AUC of , a sensitivity of , and a specificity of at a cut-off of . Additionally, they examined the potential of pathological markers using the Fisher exact test and observed no significant distinction between pCR and non-pCR groups.
Drisis et al. [122], on the other hand, generated parametric response maps (PRMs) through an affine registration method, involving the subtraction of images acquired at baseline (reference image) and after the start of NACT (transformed image). Regions where voxel values showed an increase of more than were classified as non-responding regions (PRMdce+) while responding regions (PRMdce-) exhibited a decrease of more than in their voxel values post-therapy initiation. Voxels not falling within these subregions were categorized as (PRMstable). The study explored the potential of both pathological markers and PRMs in predicting non-pCR. The authors determined that pathological markers and PRMs achieved AUCs of and 0.88, respectively. They also identified PRMdce+ and grade (II) tumors as significant factors for non-pCR prediction, yielding an AUC of with a p-value below .
Table 4.
DCE-MRI.
Reference | Study Aim | Number of Patients & Study Type | Markers | Results & Findings |
---|---|---|---|---|
Ahmed et al. [94] | To evaluate the efficacy of textural features extracted from DCE-MRI in predicting response to chemotherapy. | 100 patients Study type: retrospective study |
|
Texture features that showed a significant difference between partial responders and non-responders were contrasting (p-value: 0.042) and showed differences in variance (p-value: 0.043). They concluded that texture features showed significant differences between the two response groups at 1–2 min post-contrast time points, while pre-contrast time points did not. |
Teruel et al. [95] | To examine the potential of texture analysis to predict the clinical and pathological response prior to NACT in LABC patients. | 58 LABC patients Study type: single-center retrospective study |
|
Eight features showed a significant difference between stable disease and complete response groups (p-values < 0.05). The most 3 significant features for predicting stable disease were: entropy (AUC: 0.77, SEN: 0.842, SPE: 0.684), sum variance (AUC: 0.742, SEN: 0.842, SPE: 0.684), and angular second moment (AUC: 0.742, SEN: 0.895, SPE: 0.632). They found that textural analysis can assist clinicians in the response prediction prior to therapy. |
Giannini et al. [96] | To construct a CAD system that performs fully automatic segmentation of the tumor and extracts its’ textural features, in addition to the prediction of pCR & pCRN to NACT prior to treatment. (pCRN: is pCR with the absence of residual metastatic lymph nodes) | 44 patients Study type: single-center retrospective study |
|
Mono-parametic model Seven individual features correlated with pCR: AUCs from 0.674 to 0.722, SENs from 46.7% to 93.3%, and SPEs from 51.7% to 93.1%. Four features were correlated with pCRN: AUCs from 0.685 to 0.747, SENs from 84.6% to 100%, SPEs from 41.9% to 61.3%. Logistic regression model for predicting pCR: AUC: 0.795, SEN: 80%, SPE: 69% for predicting pCRN: AUC: 0.764, SEN: 46%, SPE: 100% Bayesian model for predicting pCR : ACC: 70%, SEN: 67%, SPE: 72% for predicting pCRN: ACC: 64%, SEN: 69%, SPE: 61%. They found that paients’ responses can be predicted using a CAD system that automatically segments the tumor and extracts texture features, and their results need to be validated on a larger population. |
Fan et al. [97] | Using quantitative analysis of pretreatment DCE-MRI images to enhance the prediction of NACT response. | main cohort (n = 57), independent validation cohort (n = 46) Study type: single-center retrospective study |
|
The prediction performance using multivariate logistic regression model and LOOCV based on: Main cohort: AUC: 0.910, SEN: 87.2%, SPE: 90.0%. Independent cohort: AUC: 0.874, SEN: 78.4%, SPE: 88.9% The selected features (based on the main cohort) achieved an AUC of 0.713 when tested on the independent cohort (SEN: -, SPE: -). Their results suggest that BPE features can be used as response predictors combined with lesion features. |
Cain et al. [98] | To assess the utility of multivariate ML models in predicting pCR to NAT based on radiomics features extracted from DCE-MRI breast images. | 288 patients divided equally into training and test. Study type: single-center retrospective study |
|
Pediction AUCs using logistic regression model range from 0.589 to 0.707 (SEN: -, SPE: -), while using SVM model range from 0.593 to 0.705 (SEN: -, SPE: -). They found that pre-treatment breast MRI can be used in pCR prediction, especially in TN/HER2+ patients who had neoadjuvant therapy. |
Eom et al. [99] | To evaluate the association between tumor pathological response and DCE-MRI features in addition to clinicopathologic factors in TNBC patients. | 73 patients Study type: single-center retrospective study |
|
The p-values of factors associated with pCR according to univariate analysis were 0.025, 0.037, & 0.009 for tumor shape, homogeneous enhancement, & concentric shrinkage pattern, respectively. The p-values of factors associated with pCR according to multivariate analysis were 0.017 and 0.015 for the enhancement pattern and shrinkage pattern, respectively. They concluded that homogeneous enhancement & concentric tumor shrinkage patterns are associated with pCR. |
Li et al. [100] | To evaluate the performance of the predictive model combining multiple MRI features. | 384 patients with LABC who enrolled in the I-SPY-2 trial. Patients were stratified into 4 subgroups according to tumor subtype Study type: retrospective study |
|
Using all patients: the AUCs of FTV, BPE, SPH, & LD were 0.77, 0.69, 0.69, & 0.79, respectively, (SEN: -, SPE: -); however, combining all features yielded an AUC of 0.81 (SEN: -, SPE: -). Moreover, combining all features while dividing patients into subgroups achieved AUCs of 0.83, 0.88, 0.83, & 0.82 for (HR+/HER2-), (HR+/HER2+), (HR-/HER2+), & (HR-/HER2-), respectively, (SEN: -, SPE: -). They concluded that combining the four features outperformed using each feature alone. |
Li et al. [101] | To determine whether early changes in quantitative and semiquantitative DCE-MRI parameters acquired after the first cycle of NACT can differentiate between responders and non-responders, in addition to the possibility of using them as a prognostic indicator of patients’ responses to NACT. | 28 patients Study type: single-center prospective study |
|
The AUC of SER washout volume was 0.75 and p-value = 0.03 (SEN: -, SPE: -)
The AUCs of Kep estimated by three models (Tofts-Kety, extended Tofts-Kety model, and fast exchange regime model) were 0.78, 0.76, and 0.73, and the p-values were 0.01, 0.02, and 0.048, respectively, (SEN: -, SPE: -). They found that the SER washout volume and Kep can be used in the response prediction after one cycle of NACT. |
Tudorica et al. [102] | To compare the efficacy of quantitative DCE-MRI parameters and tumor size in the early prediction of patients’ response to NACT. | 28 patients Study type: N/A |
|
Univariate logistic regression C statistics values for:
Percentage changes in Ktrans, Kep, Ve, and i after one cycle range from 0.804 to 0.967 Changes in LD after 1st cycle and at midpoint were 0.609 and 0.673, respectively. Predicting pCR at SEN: 100% achieved SPEs: 92% & 17% using the change in Ktrans(TM) and LD, respectively. Their results suggested that LD changes were poor predictors, whereas PK parameters derived by either TM or SSM analyses were effective response predictors |
Drisis et al. [103] | To determine whether pharmacokinetic (PK) parameters obtained from DCE-MRI images acquired before and after the second or third cycle of chemotherapy can predict pCR for different breast cancer subgroups. | 84 LABC patients Study type: single-center retrospective study |
|
At baseline: Ktrans achieved AUC: 0.66, SEN: 68%, SPE: 66% using the whole population, while it achieved AUC: 0.78, SEN: 85%, SPE: 70% using TN subgroup. At the early stage of NACT: Ktrans achieved AUC: 0.79, SEN: 69%, SPE: 87% for the Whole population. For TN subgroup it achieved AUC: 0.9, SEN: 86%, SPE: 88%, while in HER2+ subgroup AUC: 0.81, SEN: 67%, SPE: 91%. Moreover, Ve achieved AUC: 0.74, SEN: 87%, SPE: 59% for the entire dataset, and AUC: 0.83, SEN: 86%, SPE: 69% for TN subgroup. They found that DCE-MRI parameters showed a significant prediction performance, especially in TN tumors. |
Thibault at al. [104] | To investigate the capability of texture features generated from parametric maps of quantitative and semi-quantitative pharmacokinetic metrics acquired from DCE-MRI images at baseline and after the first cycle to early predict patients’ pathological response to NACT. | 38 patients with LABC Study type: N/A |
|
The ridge regression model can differentiate between pCR & non-pCR using Ktrans(SM)+ GLRLM+ Gray-level nonuniformity and achieved AUC: 1, SEN: 100%, & SPE: 100%.
Using Ve(SSM)+ Haralick+ contrast feature achieved SEN: 100% and SPE: 96.7% The authors found that SSM parametric maps were more predictive than the SM parameters or the semi-quantitative metrics |
Lee et al. [105] | To evaluate the utility of imaging parameters extracted from pretreatment DCE-MRI in the prediction of pCR to NACT. | 74 patients Study type: single-center retrospective study |
|
Each perfusion parameter for both tumor and BPCL did not show high predictive ability for pCR (AUCs: 0.449 to 0.683, SENs: 15.4% to 100%, SPEs: 18% to 96.7%).
Combination of Ve (BPCL) with 50th percentile and skewness of Ve in tumor had the highest predictive value (AUC: 0.807, p = 0.002, SEN: -, SPE: -). They revealed that the combination of perfusion parameters of tumor and BPCL showed higher predictive ability for pCR than every individual parameter. |
Ashraf et al. [106] | To assess the role of heterogenity-based kinetic features derived from DCE-MRI in predicting treatment pathological response. | 15 patients from I-SPY-1 trial (leave-one-out cross-validation) Study type: single-center study |
|
Logistic regression model based on the proposed features (kinetic statistics) yielded an AUC of 0.84 (SEN: -, SPE: -).
Most individual kinetic statistics features obtained AUCs ranging from 0.73 to 0.81 (SEN: -, SPE: -). They concluded that: Morphological features showed poor prediction performance. Moreover, the heterogeneity-based kinetic statistics outperformed the individual conventional kinetic features & their combinations, and they can be used as NACT response predictors. |
Machireddy et al. [110] | To evaluate the capability of multiresolution fractal analysis of voxel-based DCE-MRI parametric maps extracted before and after the first cycle of NACT for early prediction of pCR. | 55 patients
Training (n = 40) Test (n = 15) Study type: single-center study |
|
The AUCs of multiresolution fractal features extracted from Ktrans, Kep, Ve, i, and all parametric maps were 0.80, 0.63, 0.74, 0.70, and 0.78, respectively.
At SEN: 60%, their SPEs: 89.3%, 70.7%, 80.7%, 68.7%, and 82.7%. At SEN: 80%, their SPEs: 68.7%, 49.3%, 62%, 62%, and 62%, respectively. They revealed that multiresolution fractal features generally have better predictive performances than those extracted with conventional methods. Moreover, concatenated features from all DCE-MRI parameters improve the prediction performance rather than individual parametric maps. |
Braman et al. [107] | To assess the ability of pretreatment intratumoral and peritumoral textural radiomics in predicting pCR to NACT. | 117 patients
Training
(n = 78), Test (n = 39) Study type: retrospective study |
|
The AUCs of LDA, DLDA, SVM, NB, and QDA
classifiers (LDA: linear discriminant analysis, DLDA: diagonal linear discriminant analysis, NB: Naive Bayes, QDA: quadratic discriminant analysis), respectively, using: All patients: 0.69, 0.74, 0.72, 0.72, and 0.74 (ACC: 0.59, 0.67, 0.72, 0.64, & 0.64). (HR+,HER2-)cohort: 0.8, 0.83, 0.82, 0.81, and 0.81 (ACC: 0.77, 0.79, 0.87, 0.78, and 0.88). (TN/HER2+) cohort: 0.87, 0.89, 0.89, 0.93, and 0.85 (ACC: 0.81, 0.83, 0.82, 0.84, and 0.81). They found that intratumoral & peritumoral features can robustly predict pCR across multiple classifiers. However, PK showed no difference between pCR and non-pCR tumors. |
Caballo et al. [108] | To use the spatio-temporal radiomic analysis of pretreatment DCE-MRI images in identifying patients who achieve pCR. | 251 patients (leave-one-out cross validation) Study type: single-center retrospective study |
|
Using univariate ML analysis: some individual
features can be used as pCR predictors
and AUCs range from 0.60 to 0.83 (SEN: -, SPE: -). Using multivariate ML logistic regression models: the AUCs of the models were 0.707, 0.824, 0.823, 0.844, & 0.803 using all patients, Luminal A, Luminal B, HER2 enriched & TN groups, respectively, (SEN: -, SPE: -). These results suggested that changes in enhancement kinetics heterogeneity and texture features over time (4D features) were significant predictors. |
Drukker et al. [109] | To predict the pCR to NACT using pretreatment MRI radiomics in patients with invasive lymph node (LN positive), in addition to predicting LN status after NACT. | 158 patients Study type: single-center retrospective study |
|
Pre-NACT lymph node features showed significance in predicting pCR and LN status after NACT with AUCs up to 0.82 and 0.72, respectively, using the LDA classifier (SEN: -, SPE: -). They concluded that tumor features did not show significance in predicting post-NACT pCR or LN status, on the other hand, features extracted from LN did. Note: Analysis was performed using individual features, not a combination of them. |
Wu et al. [111] | Using quantitative image features extracted from different tumor sub-regions to predict pathological response to NACT. | 35 patients
(leave-one-out cross validation) Study type: retrospictive study |
|
The response prediction accuracies based on texture features extracted from the three groups of predictors were:
enhancement maps: AUC: 0.67, SEN: 58%, SPE: 70% eigenmaps of the whole tumor: AUC: 0.65, SEN: 58%, SPE: 70% eigenmaps of the tumor’s subregions: AUC: 0.79, SEN: 75%, SPE: 78%. They found that eigenmaps of tumor subregions with elevated washout rate had a superior prediction performance. |
El Adoui et al. [112] | To provide an algorithm based on parametric response map (PRM) which depends on the intra-tumor changes between MRI images acquired before & after the first cycle of treatment to predict patients’ response after the first cycle. | 40 patients Study type: retrospective study |
|
The AUCs were 0.97 and 0.96 for the positive response and the negative response, and the p-values were 0.19 and 0.45, respectively. (SEN: -, SPE: -) These results indicate the absence of a significant difference between the suggested method and the ground truth. |
El Adoui et al. [113] | To conduct a CNN architecture used in predicting patients’ pathological response to NACT based on multiple DCE-MRI inputs (pre-NACT, after the first cycle of chemotherapy, and their combination). | 42 LABC patients in addition to 14 external independent validation cases. Study type: retrospective study |
|
The performance of the model based on the following scans 1. With tumor segmentation:
|
Khanna et al. [114] | To integrate pre-trained CNN with ML techniques to predict pCR to NACT using DCE-MRI images acquired prior to the initiation of treatment. | 64 patients Study type: retrospective study |
|
The best prediction performance was attained when integrating ResNet-18 (the feature extractor) with KNN (the classifier) Using hold-out validation (70:30) AUC: 1, ACC: 99.8%, SEN: 1, SPE: 99.3% Using 10-fold cross-validation AUC: 0.99, ACC: 99.1%, SEN: 99.4%, SPE: 99.1% They found that Fine KNN (K = 1) showed a superior prediction performance than other classifiers. |
Jimenez et al. [115] | To predict pCR in TNBC patients who underwent neoadjuvant systemic therapy based on baseline DCE-MRI scans and tumor-infiltrating lymphocyte (TIL) levels. | 80 TNBC patients (5-fold cross-validation for radiomics & radiomics+ pathological features models) Study type: single-center retrospective study |
|
Pathological model (TIL model):
AUC: 0.632, ACC: 70%, SEN: 57.6%, SPE: 78.7%, PPV: 65.5%, NPV: 72.6% Radiomics model: AUC: 0.712, ACC: 72.5%, SEN: 85.1%, SPE: 54.6%, PPV: 72.7%, NPV: 72% Radiomics + pathological features: AUC: 0.752, ACC: 83.3%, SEN: 55.6%, SPE: 97.2%, PPV: 90.9%, NPV: 81.4% They suggested that integrating radiomics features with a pathological marker (TIL) could be utilized in predicting pCR before the initiation of the therapy. |
Golden et al. [116] | To predict the patients’ response to NACT using quantitative measurements of spatial heterogeneity extracted from DCE-MRI kinetic maps. | 60 patients with triple-negative early-stage breast cancer. Study type: multi-center prospective study |
|
The AUCs of the logistic regression model using pre-NACT imaging features for predicting pCR, residual LN metastases, and residual tumor with LN metastases were 0.68, 0.84, and 0.83, respectively, (SEN: -, SPE: -). They found that pathological markers and patterns of tumor response were not significant for the prediction. Otherwise, pre-NACT radomics features yielded showed a good prediction performance. |
Jahani et al. [117] | To assess changes in the intratumoral heterogeneity measured by voxel-wise image registration to perform early prediction of pCR and RFS in LABC patients. | 132 LABC patients from the I-SPY-1 trial (five-fold cross-validation was performed). Study type: multi-center study |
|
The AUCs of predicting pCR using the following features were:
Baseline features: (age, race, HR status, and FTV): 0.71 Voxel-wise+baseline features: 0.78 Voxel-wise features only: 0.74 Aggregate measures: (PE, WIS, WOS, SER, FTV2/FTV1): 0.71 For all features: (SEN: -, SPE: -) They found that HR status showed a significant association with pCR. Moreover, voxel-wise features showed an association with pCR, whereas the aggregate measures did not improve the model performance. |
Sutton et al. [118] | To develop a classifier that assesses and classifies pCR using molecular subtypes and features extracted from pre-NACT and post-NACT scans. | 273 patients
278 cancers (n = 5 bilateral) Training (n = 222 cancers) Test (n = 56 cancers) Study type: single-center retrospective study |
|
The performance of the 3 RF classification models: Model (1): (radiomics only) AUC: 0.83, SEN: 0.77, SPE: 0.69 Model (2): (radiomics and molecular subtype) AUC: 0.78, SEN: 0.79, SPE: 0.69 Model (3): (radiomics without intensity metrics) AUC: 0.78, SEN: 0.79, SPE: 0.69 They suggested that radiomics features extracted before and after NACT could help in assessing pCR. |
Fan et al. [119] | To demonstrate how the heterogeneity changes in DCE-MRI images at baseline & after the second cycle of NACT could affect the prediction accuracy. | 114 patients
Training (n = 61) Test (n = 53) Study type: retrospective study |
|
The AUCs of the model based on:
Pre-NACT radiomics: 0.568
Early-NACT radiomics: 0.767 Jacobian maps: 0.630 deltaRAD: 0.726 Combination of features: 0.771 Fusing molecular subtype with the combination of features: 0.809 The SENs of the model using the same features: 91.3%, 56.5%, 60.9%, 91.3%, 52.2%, & 82.6%, respectively, while SPEs: 36.7%, 90%, 70%, 53.3%, 96.7%, & 80%. They found that the reduction in tumor heterogeneity (indicated by texture feature) is higher among responders than non-responders. |
Hussain et al. [120] | Using multiple ML classifiers to predict pCR to NACT based on molecular subtype and texture features extracted from MR images of tumor and peri-tumoral region at different treatment time points. | 166 patients from I-SPY-1 trial Study type: multi-center retrospective study |
|
The prediction performance using Ensemble Random Undersampling Boosting (RUSBoosted) classifier Tree based on: Molecular subtype AUC: 0.82, ACC: 84%, SEN: 86.48%, SPE: 76.92% Radiomics extracted from pre, early, & mid-NACT images AUCs: 0.88, 0.72, & 0.78 ACCs: 86%, 82%, & 76% SENs: 86.48%, 97.3%, & 92.85% SPEs: 84.62%, 38.46%, & 30% Combining pre and early-NACT images with molecular subtype AUC: 0.98, ACC: 94%, SEN: 94.59%, SPE: 92.31% They concluded that combining molecular subtype with radiomics extracted at pre- and early-NACT (with 3–5 pixels of the peritumoral region) had the best performance. |
Cho et al. [121] | To compare the performance of the parametric response map (PRM) acquired from DCE-MRI with the pharmacokinetic parameters (PK) in the early prediction of pathological response to NACT. | 48 patients Study type: prospective study |
|
The prediction performance of voxels with increased signal intensity (PRMSI+) in predicting pCR vs. non-pCR cutoff: 20.8%, AUC: 0.77, SEN: 100%, SPE: 71% They revealed that PRM analysis could enable the early prediction of response (after the first cycle), whereas tumor size, volume, and PK parameters do not. Moreover, pathological markers showed no differences between the pCR and non-pCR. |
Drisis et al. [122] | To determine whether the parametric response mapping (PRM) can be used in the prediction of early morphological response (EMR) & pCR within 72 h after the initiation of chemotherapy. | 39 patients Study type: single-center retrospective study |
|
Logistic regression analysis using demographic and pathological markers only obtained an AUC of 0.71, SEN: -, SPE: - PRM obtained an AUC of 0.88 for the prediction of non-pCR (SEN: -, SPE: -). Integrating the demographic and pathological markers with PRM achieved an AUC of 0.94 (SEN: -, SPE: -). They found that grade II tumors (pathological marker) and PRMdce+ (PRMdce+: voxels that showed an increment in their value of more than 10%(non-responding regions)) were significant for the prediction of non-pCR. |
Comes et al. [123] | To conduct a transfer learning approach based on pre-treatment and early-treatment DCE-MRI scans to predict patients’ pathological response to NACT. | 134 patients
Fine-tuning (n = 108) Test (n = 26) Study type: worked on a subset of public dataset (I-SPY-1 trial) |
|
Pathological features only: ACC: 69.2%, SEN: 42.9%, 78.9% Combining pathological & radiomics features: AUC: 0.90, ACC: 92.3%, SEN: 85.7%, SPE: 94.7% They concluded that low-level CNN features extracted from pre-and-early treatment images have a significant role in the early prediction of pCR. |
Peng et al. [124] | To use the pretreatment DCE-MRI in comparing the prediction performances of radiomics models with DL models. | 356 patients Study type: single-center retrospective study |
|
|
Li et al. [125] | To construct a nomogram to predict the probability of pCR in TNBC patients based on pretreatment DCE-MRI & clinicopathological features. | 108 TNBC patients
Training (n = 87) Validation (n = 21) Study type: single-center retrospective study |
|
The nomogram achieved an AUC of 0.79 in the validation cohort (SEN: -, SPE: -). They concluded that a nomogram incorporating 3 pretreatment factors (tumor volume, TTP, and AR status) had a good ability to predict pCR. Moreover, higher TTP is associated with a lower probability of achieving pCR. |
Deep Learning approaches have demonstrated promising potential in predicting responses based on imaging and pathological features. Comes et al. [123], a pre-trained Convolutional Neural Network (CNN) was employed to automatically extract low-level features from images obtained before and after the initial cycle of NACT, replacing the need for manual feature extraction. The study also investigated the predictive capability of pathological features. Subsequently, the most optimal features were selected and utilized to construct an SVM classifier. The model relying on pathological features achieved an accuracy of , a sensitivity of , and a specificity of . By incorporating pathological features with pre- and early-treatment radiomics features, an impressive AUC of was achieved, along with an accuracy of , a sensitivity of , and a specificity of on the independent test set.
In a study by Peng et al. [124], a comparison was made between DL and conventional ML approaches in predicting response based on baseline pathological, kinetic, and radiomics features. Conventional ML involved handcrafted radiomics feature extraction and LASSO regression for optimal feature selection, with LDA emerging as the most robust classifier among the five ML models tested. In contrast, the DL approach employed a ResNeXt50 CNN for radiomics feature extraction and a Multilayer Perceptron Neural Network (MLP) for constructing models based on pathological and kinetic features. AUC values for models utilizing radiomics, kinetics, or pathological features alone, whether through LDA or DL, did not exceed 0.752. A modest enhancement in prediction performance was observed upon combining these features. However, the DL approach, incorporating all features, notably outperformed other models with an AUC of 0.832 and an accuracy of 77.2%.
Another noteworthy contribution by Li et al. [125] involved the creation of a nomogram to predict pCR in TNBC patients. This nomogram integrated logistic regression to identify independent response predictors from a collection of pathological, clinical, and radiomics markers extracted from baseline images. Notably, the nomogram was built upon three key predictors: AR status, tumor volume, and time to peak (TTP). The predictive performance of the nomogram was impressive, achieving an AUC of 0.79 in the validation cohort. The study further highlighted that tumors exhibiting a TTP of 2 min, large volumes, and positive androgen receptor expression had a lower probability of achieving pCR.
6. NACT Prediction Using Multi-Modal Radiomics
Liang et al. [126] employed statistical methods, including the t-test and Mann–Whitney U test, to examine the efficacy of DCE-MRI parameters and ADC values acquired at baseline and after the second cycle for the early prediction of pCR. They observed that the pre-treatment parameters did not exhibit a significant difference between pCR and non-pCR, with the individual parameters’ AUCs not exceeding 0.6. Nevertheless, all parameters displayed robust predictive performance after the second NACT cycle, except Ve and washout. Furthermore, a combination of wash-in, TTP, and ADC yielded the highest AUC of 0.886, accompanied by a sensitivity of 87.5% and a specificity of 82.11%.
On another note, Li et al. [127] explored the potential of DCE and DWI parameters obtained before and after the initial NACT cycle, alongside the percentage change in values between the two scans, to predict the response. Through ROC curve analysis, they evaluated the predictive ability of each parameter and the derived parameter (Kep/ADC). The researchers uncovered that the (Kep/ADC) parameter, calculated after one NACT cycle, outperformed all other individual parameters, achieving an AUC of 0.88, a sensitivity of 92%, and a specificity of 78% at a threshold value of 3.32 mm−2 [127].
O’Flynn et al. [128] investigated the predictive ability of MRI parameters acquired before and after the second cycle of neoadjuvant chemotherapy (NACT), along with their percentage changes, to anticipate treatment response. The most significant predictors were determined using LDA. Their findings indicated that a decrease in the enhancement fraction and tumor volume signified a pCR. The enhancement fraction exhibited an AUC of 0.76, a sensitivity of 63.2%, and a specificity of 76.9% at a threshold of −7%. In comparison, the reduction in tumor volume (with a threshold of −71%) achieved an AUC, sensitivity, and specificity of 0.77, 71.4%, and 76.9%, respectively.
On the contrary, Zhao et al. [129] utilized multivariate logistic regression to explore independent predictors of pCR from radiomic features extracted from Diffusion-Weighted Imaging (DWI) and DCE-MRI scans conducted before and after the second NACT cycle. Significant pCR-independent predictors included the Apparent Diffusion Coefficient (ADC) values after 2 cycles and the percentage decrease in both the early enhanced ratio E90 and tumor size. The corresponding odds ratios (ORs) were calculated as 1.041 (p = 0.037), 0.927 (p = 0.004), and 0.948 (p = 0.011), respectively. Their prediction model yielded an impressive AUC of 0.944, a sensitivity of 100%, and a specificity of 86.7% in the validation cohort. A nomogram based on this model was subsequently established. The study also revealed that baseline MRI features, age, molecular subtype, and other pathological markers did not exhibit significant differences between the pCR and non-pCR groups.
Bian et al. [130] developed a nomogram to forecast both NACT sensitivity and pCR. They assessed the predictive capabilities of two logistic regression models based on radiomic features (as detailed in Table 5) extracted from a combination of DCE, T2WI, and DWI. Model (1) achieved AUC values of 0.56 and 0.64 for predicting NACT sensitivity and pCR, respectively. However, model (2) exhibited superior performance, achieving an AUC of 0.91, an accuracy of 81.8%, a sensitivity of 100%, and a specificity of 75% in predicting NACT sensitivity. Similarly, it achieved an AUC, accuracy, sensitivity, and specificity of 0.91, 88.9%, 88.2%, and 90.9%, respectively, in predicting pCR. The authors developed a nomogram that integrated radiomic features from both models, revealing comparable discriminative power between model (2) and the nomogram.
Some studies, such as Tahmassebi et al. [131] and Eun et al. [132], have assessed the effectiveness of radiomics features in predicting treatment response through the application of ML classifiers. In their study, Tahmassebi et al. [131] employed both qualitative and quantitative features extracted from multi-sequence MRI scans before and after the initial NACT cycle. The authors employed eight classifiers, namely LR, SVM, SGD (stochastic gradient descent), LDA, RF, DT, adaptive boosting (AdaBoost), and extreme gradient boosting (XGBoost), to predict DSS, RFS, and RCB—an indicator of whether patients achieved pCR. The results revealed XGBoost’s superior performance in terms of high accuracy and stability, with AUCs ranging from 0.8577 to 0.9430 for RCB prediction. Notably, response predictors included peritumoral edema (on T2WI), minimum ADC (on DWI), complete shrinkage pattern, changes in tumor size, and mean transit time (on DCE). Eun et al. [132] explored the potential of textural features extracted from pre- and mid-treatment images, as well as the differences between them, using contrast-enhanced T1-weighted, ADC mapping, DWI, and T2-weighted MRI. Employing seven ML classifiers, they determined that random forest surpassed other classifiers. Furthermore, mid-treatment scans demonstrated optimal prediction performance for T1 contrast-enhanced-weighted, DWI, and ADC mapping, yielding AUCs of 0.82, 0.73, and 0.69, respectively. In contrast, the difference between pre- and mid-treatment features excelled exclusively in the T2-weighted sequence, attaining an AUC of 0.67. Notably, pre-treatment feature AUCs across all sequences did not surpass 0.57.
Other researchers, including Liu et al. [133], Syed et al. [134], Chen et al. [135], Chen et al. [136], and Xiong et al. [137], explored the fusion of pathological markers with pretreatment multi-sequence MRI radiomics features to enhance the prediction of patient responses to NACT.
In a study conducted by Liu et al. [133], radiomics features were extracted from T2WI, DWI, T1+C sequences, and their combinations. Furthermore, the study assessed the predictive capacity of pathological markers. The proposed SVM models were validated across three independent cohorts (n = 99, 107, & 80). The radiomics features from individual MRI sequences yielded AUCs not exceeding 0.64 in any validation cohort. In contrast, the multi-sequence radiomics model achieved AUCs of 0.70, 0.68, and 0.79 across the respective validation cohorts. The integration of pathological markers (ER, PR, and HER2) with multi-parametric MRI radiomics enhanced performance, resulting in AUCs of 0.79, 0.71, and 0.80.
Similarly, Syed et al. [134] combined non-imaging features (pathological markers) with GLCM features to predict treatment response. However, radiomics features were extracted from DWI and DCE sequences at three distinct time points (pre-treatment, early treatment at 3 weeks, and mid-treatment at 12 weeks of NACT). Utilizing an XGBoost ML classifier with 5-fold cross-validation on publicly available data, the author’s predicted response. The mean AUCs of prediction models based on ADC, DWI, DCE, the combination of DWI and DCE, and all MRI features combined were 0.85, 0.871, 0.903, 0.916, and 0.933, respectively. Additionally, non-imaging features achieved an AUC of 0.919. The amalgamation of all MRI features with non-imaging attributes yielded optimal prediction performance, resulting in an AUC of 0.951, precision of 0.815, and recall of 0.926. The study concluded that XGBoost holds promise for early patient response prediction based on GLCM and pathological features.
Chen et al. [135] extracted a total of 984 radiomics features from pre-treatment DCE, T2WI, and DWI images. Among these, six features were selected—three from T2WI and three from DCE—to construct the radiomics signature using a random forest approach. The resulting radiomics signatures demonstrated notable performance metrics in the test set, including an AUC of 0.834, a sensitivity of 80%, and a specificity of 73.21%. The study also explored the relationship between pathological markers and pCR, revealing that ER and PR statuses could serve as independent predictors of pCR. Furthermore, a predictive nomogram was developed by incorporating ER, PR, and the radiomics signature, yielding an improved AUC of 0.879, a sensitivity of 83.57%, and a specificity of 82.19%. The combined use of DWI and T2WI, along with ER and PR status, exhibited strong predictive capabilities.
In a related work, Chen et al. [136] constructed a novel nomogram that integrated baseline radiomics features from DCE and ADC maps with pathological markers to distinguish between pCR (grade 5) and pPR (grades 1–4 on the Miller-Payne grading system). The authors employed logistic regression to establish prediction models based on ADC maps alone, DCE images alone, and a combination of ADC maps and DCE. The corresponding AUCs were measured at 0.639, 0.789, and 0.68, respectively. Additionally, an exploration of the predictive capacity of pathological markers resulted in an AUC of 0.793. Notably, the study concluded that the most effective predictive performance was achieved by combining ER and PR statuses with radiomics features from both ADC maps and DCE images. This combined approach outperformed the use of individual feature sets, achieving an AUC of 0.837, an accuracy of 0.893, a sensitivity of 0.714, and a specificity of 0.952.
Xiong et al. [137] assessed the role of T2WI, DCE, and DWI sequences in predicting the response of NACT-insensitive breast cancers prior to treatment (specifically, grade 1 & 2 cases in the Miller-Payne grading system). They constructed a prediction model that integrated four radiomics markers with two independent pathological factors (HER2 status and Ki-67 index), identified through multivariable logistic regression. This composite model achieved an AUC of 0.935 and an accuracy of 93.55%, surpassing both radiomics-only models (AUC: 0.83) and pathological marker-based models (AUC: 0.792) during validation. These findings suggest the efficacy of a nomogram based on radiomics extracted from pre-treatment multi-sequence MRI scans, along with HER2 status and Ki-67 index, in effectively predicting NACT-insensitive cases.
Joo et al. [138] developed a multimodal DL model for predicting pCR based on pre-treatment T2WI and DCE-T1WI subtraction images, along with clinico-pathological markers. They constructed two 3D ResNet-50-based CNNs to extract radiomics features from the entire bilateral MRI images. The AUCs for the models using T2WI images, DCE-T1WI subtraction images, and clinico-pathological markers were 0.663, 0.725, and 0.827, respectively. Combining T2W and DCE-T1W subtraction images produced an AUC of 0.745 and an accuracy of 73.6% (sensitivity: 48.6%, specificity: 85.3%). Furthermore, They found that the optimal prediction performance emerged from the integration of clinicopathological markers, T2W, and DCE-T1W subtraction images, yielding an AUC of 0.888, accuracy of 85%, sensitivity of 66.7%, and specificity of 93.2%.
Yoon et al. [139] utilized multiple logistic regression to assess the value of textural features extracted from baseline PET/CT and DWI scans in distinguishing responders from non-responders. The ranges of p-values for the selected texture features are presented in Table 5. They concluded that textural features hold predictive potential, offering insights into tumor heterogeneity.
On the other hand, Umutlu et al. [140] examined the potential of radiomics features to predict treatment response using baseline PET/MRI scans. Employing SVM with 5-fold cross-validation, they predicted pCR across the entire patient cohort and within two molecular subtype subgroups ((HR+/HER2-) and (TN/HER2+)). The combined application of all MRI sequences and PET yielded the most robust predictive performance for the entire cohort, resulting in an AUC of 0.8, an accuracy of 77.4%, a sensitivity of 81%, a specificity of 73.8%, a PPV of 75.6%, and NPV of 79.5%. Within the (HR+/HER2-) subgroup, the same imaging dataset achieved superior AUC and accuracy values (0.94 and 85.2%, respectively), accompanied by an 85.2% sensitivity and specificity. The authors’ conclusion emphasized the capacity of PET/MRI to offer an accurate radiomics analysis for early pCR prediction, particularly in HR+/HER2- patients.
A comparison was made by Choi and colleagues [141] between an Alexnet-based CNN model and conventional methods, to predict responders and non-responders to NACT based on PET/CT and DWI scans. The proposed CNN model demonstrated AUCs of 0.886, 0.98, 0.602, and 0.701, along with accuracies of 97%, 95%, 85%, and 88% for PET0, PET1, MRI0, and MRI1, respectively, (where 0 represents baseline scans and 1 represents scans after the first NACT cycle). Conversely, the AUCs of conventional parameters including SUV, TLG, MTV, and ADC remained below 0.7 in both baseline and follow-up scans, with accuracies falling short of 84%. Furthermore, improvements were observed in AUC values (0.805, 0.737, 0.758, and 0.752) for differences in these parameters between the two scans, denoted as SUVmax, MTV, TLG, and ADC.
Montemezzi et al. [142] integrated pathological and radiomics features obtained from pre-NACT PET/CT and DCE-MRI scans to predict pCR.They employed three classifiers (Random Forest, Support Vector Machine Regression (SVR), and Logistic Regression) in conjunction with leave-one-out and leave-two-out cross-validation techniques. The evaluation encompassed five feature groups (mentioned underneath Table 5). They found that the incorporation of both radiomics and pathological markers resulted in elevated AUC values of 0.96 and 0.98. The study demonstrated that the fusion of DCE-MRI and PET/CT radiomics with pathological markers led to enhanced predictive performance.
Table 5.
Multi-modal Imaging.
Reference | Study Aim | Number of Patients & Study Type | Markers | Results & Findings |
---|---|---|---|---|
Liang et al. [126] | To investigate the usefulness of combining DCE-MRI parameters with ADC values for the early prediction of pCR to NACT. | 119 patients Study type: single-center retrospective study |
|
The AUCs of ADC, TTP, Kep, Ktrans, IAUC, and washing
after the 2nd cycle
were 0.721, 0.725, 0.805, 0.825, 0.824, and 0.866, respectively, while it did not exceed 0.57 for Ve and washout.
The SENs were 87.5%, 100%, 62.5%, 83.33%, 83.33%, 83.33%, 45.83%, and 62.5%. The SPEs were 56.84%, 42.11%, 92.63%, 75.79%, 78.95%, 84.21%, 71.58%, and 58.95%. Combining ADC, TTP, & washing achieved AUC: 0.886, SEN: 87.5%, SPE: 82.11%. They found that baseline features did not show a significant difference between pCR & non-pCR; however, they showed good predictive performance after two cycles. |
Li et al. [127] | To evaluate the utility of multiparametric MRI parameters acquired from DCE-MRI and DWI acquired before and after the first cycle of NACT in predicting pCR in patients with breast cancer. | 42 patients
(data after the first cycle of NACT was available for only 36 patients) Study type: prospective study |
|
The AUCs (after the first cycle of NACT) for LD, Ve, Vp, Ktrans, Kep, ADC, and Kep/ADC were 0.57, 0.54, 0.61, 0.68, 0.76, 0.82, and 0.88, respectively. Moreover, the SENs were 0.83, 0.67, 0.5, 0.67, 0.83, 0.83, and 0.92. The SPEs were 0.42, 0.48, 0.78, 0.74, 0.65, 0.67, and 0.78, respectively. They revealed that combining DWI & DCE parameters (i.e., Kep/ADC) yielded a superior performance than using each of them alone. In addition, the mean parameters after one cycle of therapy outperformed the baseline parameters and the percentage change between the two scans. |
O’Flynn et al. [128] | To determine whether individual functional MRI parameters can predict pCR to NACT in breast cancer patients after two treatment cycles. | 32 patients Study type: single-center prospective study |
|
The AUCs of the percentage change in EF, tumor volume, IAUGC, Ktrans, Kep, Ve, ADC, & R2* were 0.76, 0.77, 0.64, 0.6, 0.68, 0.58, 0.69, & 0.41, respectively. SENs: 63.2%, 71.4%, 73.7%, 63.2%, 63.2%, 57.9%, 78.9%, & 63.2%. SPEs: 76.9%, 76.9%, 61.5%, 53.8%, 69.2%, 53.8%, 69.2%, & 30.8%. They found that the reduction in ER & tumor volume was significantly greater in patients achieving pCR, and they can be used as early response predictors. |
Zhao et al. [129] | To investigate the ability of DWI combined DCE-MRI in the prediction of pCR after the second cycle of NACT by developing a nomogram based on MRI features. | 87 patients
Training (n = 66) Validation (n = 21) Study type: single-center retrospective study |
|
Multivariate logistic regression showed that the following features were independent pCR predictors:
They found no significant difference in pathological markers, age, and baseline radiomics features between the pCR and non-pCR groups. Moreover, the nomogram & the predictive model showed strong predictive value. |
Bian et al. [130] | To evaluate the ability of radiomics signatures to predict the efficacy of NACT and the probability of pCR based on pretreatment T2WI, DWI, and DCE MRI scans. | 152 patients
Training
(n = 107) Validation (n = 45) Study type: single-center retrospective study |
|
|
Tahmassebi et al. [131] | To assess the utility of ML algorithms in the early prediction of survival outcomes and pCR to NACT using multi-parametric MRI scans acquired before and after two cycles of NACT. | 38 patients (4-fold cross-validation) Study type: single-center prospective study |
|
The best AUCs in predicting RCB using 8 ML classifiers (LR, SVM, SGD, LDA, RF, DT, AdaBoost, and XGBoost) were 0.868, 0.88, 0.83, 0.75, 0.89, 0.81, 0.85, and 0.94, respectively, (SEN: -, SPE: -). They concluded that the XGBoost outperformed other classifiers as it achieved higher accuracy and more stable performance. Moreover, peritumoral edema, min ADC, complete shrinkage pattern, changes in tumor size, and MTT can be used as RCB predictors. |
Eun et al. [132] | To determine whether texture features from different MRI sequences at pre- and mid-treatment are associated with pCR to NACT. | 136 patients (5-fold cross-validation) Study type: single-center retrospictive study |
|
Texture features at mid-treatment contrast-enhanced-T1WI showed the best performance compared to other MRI sequences, the prediction model based on random forest classifier achieved: AUC: 0.82, ACC: 83.1%, SEN: 62.5%, SPE: 91.7%. They revealed that the RF model had better performance showing the association between texture features and pCR compared with the other six ML classifiers. |
Liu et al. [133] | To investigate the efficacy of radiomics analysis of pretreatment multi-parametric MRI (T2WI, DWI, and T1 + C) scans in the prediction of pCR to NACT. | 414 patients
Training (n = 128) Three independent validation cohorts (n = 99, 107, & 80) Study type: multi-center retrospective study |
|
The AUCs of the three validation cohorts using:
|
Syed et al. [134] | To integrate radiomics features extracted from DWI and DCE MRI scans with non-imaging features to predict pCR to NACT using XGBoost ML classifier. Radiomics features were extracted from pre-, early-, & mid-treatment scans |
117 patients from the Breast Multi-parametric MRI for prediction of NAC Response-2 competition dataset (BMMR2) (a subset of I-SPY-2 TRIAL) competition dataset (5-fold cross-validation) Study type: retrospective study based on multi-center dataset |
|
Below, the mean AUCs, SENs, SPEs, & precisions of the XGBoost prediction models based on: ADC: 0.85, 0.827, -, & 0.752. DWI: 0.871, 0.926, -, & 0.779. DCE: 0.903, 0.939, -, & 0.856. DWI + DCE: 0.916, 0.915, -, & 0.779. all MRI sequences: 0.933, 0.889, -, & 0.824. pathological markers: 0.919, 0.914, -, & 0.762. combining pathological markers with all MRI sequences: 0.951, 0.926, -, & 0.815. They found that XGBoost can accurately predict response based on non-imaging and GLCM features. |
Chen et al. [135] | Predicting the efficacy of NACT by constructing a nomogram based on pathological factors and multi-sequence MRI (T2WI, DWI, and DCE). | 158 patients
Training (n = 110) Test (n = 48) Study type: single-center retrospective study |
|
The prediction performance of: Radiomics signature AUC: 0.834, SEN: 80%, SPE: 73.21% The nomogram (integrating radiomics signature with PR and ER status) AUC: 0.879, SEN: 83.57%, SPE: 82.19 % They revealed that ER and PR status showed significant differences between the two response groups while the other pathological markers did not. Moreover, DWI & T2WI could predict response effectively. |
Chen et al. [136] | To develop a radiomics nomogram combining pre-treatment DCE-MRI and ADC maps with pathological risk factors to predict pCR to NACT. | 91 patients
Training
(n = 63) Test (n = 28) Study type: single-center retrospective study |
|
Multivariate logistic regression model yielded the following AUCs, ACCs, SENs, & SPEs in the test set when using:
DCE features alone: 0.789, 78.6%, 71.4%, & 81% ADC features alone: 0.639, 53.6%, 100%, & 38.1% Combining DCE & ADC: 0.68, 78.6%, 71.4%, & 81% Pathological markers: 0.793, 75%, 57.1%, & 81% Combining pathological with radiomics (DCE+ADC): 0.837, 89.3%, 71.4%, & 95.2% They concluded that ER & PR showed a significant difference between pCR & pPR groups (p < 0.05). Moreover, combining radiomics from DCE-MRI & ADC maps with pathological data can be potential response predictors. |
Xiong et al. [137] | To assess the value of multi-parametric MRI (T2WI, DCE, and DWI) in the prediction of NACT-insensitive breast cancers based on pretreatment scans. | 125 patients
Training (n = 63) Validation (n = 62) Study type: single-center retrospective study |
|
The prediction model based on pathological markers achieved AUC: 0.792, ACC: 87.1%, SEN: -, SPE: - The model based on radiomics markers attained AUC: 0.83, ACC: -, SEN: -, SPE: - Combining radiomics with pathological markers achieved AUC: 0.935, ACC: 93.55%, SEN: -, SPE: - They suggested that a nomogram built based on HER2 status, Ki-67 index, and radiomics features extracted from pretreatment multi-parametric MRI can predict NACT-insensitive effectively. |
Joo et al. [138] | To conduct a multimodal DL model that combines clinicopathological information with MR images acquired before the initiation of NACT to help in the prediction of pCR. | 536 patients
Training
(n = 429)
Validation
(n = 107) Study type: single-center retrospective study |
|
The models based on T1W, T2W images, and clinicopathological markers individually achieved AUCs: 0.725, 0.663, and 0.827, respectively. ACCs: 71.8%, 70.9%, & 78.5%. SENs: 31.4%, 45.7%, & 84.8%. SPEs: 90.7%, 82.4%, & 75.7%. Integrating the aforementioned markers achieved AUC: 0.888, ACC: 85%, SEN: 66.7%, SPE: 93.2% They concluded that the best prediction performance was attained by combining baseline MRI images with clinical and pathological markers. |
Yoon et al. [139] | To evaluate the efficacy of textural features extracted from pretreatment F-18 FDG PET/CT and DWI scans to predict the LABC patients’ pathological response to NACT and progression-free survival (PFS). | 83 patients with locally advanced breast cancer (LABC). Study type: retrospective study |
|
-The range of p-values of the selected
PET features: CM: 0.008–0.02, VAM: 0.008–0.047 NIDM: 0.012–0.024, ISZM: 0.005–0.032 NGLCM: 0.045, TSM: 0.009 NGLDM: 0.009–0.019 (SEN: -,SPE: -). -The range of p-values of the selected ADC features: histogram analysis (entropy): 0.024 NGLCM: 0.033–0.025 (SEN: -,SPE: -). They found that tumor texture features are useful for the prediction of NACT response in as they indicate tumor heterogeneity. |
Umutlu et al. [140] | To evaluate the potential of radiomics analysis of multi-parametric 18F-FDG PET/MRI pre-treatment images to predict pCR to NACT. | 73 patients (5-fold cross-validation) Study type: retrospective study |
|
Combining the PET data with all MRI sequences achieved the following results using:
|
Choi et al. [141] | To assess the value of PET/CT and DWI parameters in predicting pathological response to NACT using CNNs and compare their performance with conventional imaging parameters. | 56 patients (3-fold cross-validation) Study type: single-center retrospective study |
|
|
Montemezzi et al. [142] | To study the effect of combining DCE-MRI radiomics with histological and radiological information (PET/CT) on the performance of the prediction model of pCR to NACT. | 60 patients
(leave one out & leave two out cross-validation 60-fold and 30-fold cross-validation). Study type: single-center retrospective study |
|
LR, SVR, and RF were used with different combinations of markers (5 groups of features (The five groups of features were: (1) tumor characteristics such as shape, type, grade, margin, internal enhancement, curve type, SUVmax, ADC, and patient age; (2) the selected radiomics features; (3) pathological features; (4) a combination of features from groups 1 and 2; and
(5) an amalgamation of features from groups 1, 2, and 3.))
, and the AUCs were as follows: (SENs: -, SPEs: -) Group 1: 0.7–0.75 Group 3 (pathological): 0.8–0.85 Group 4: 0.85–0.9 Group 5: 0.96–0.98 (using LR) They found that the introduction of DCE-MRI radiomics showed significant improvement in predictive power. The selected radiomics were dependence variance, sphericity, kurtosis, and LRHGLE (LRHGLE: long run high gray-level emphasis). |
7. Discussion
This comprehensive review article has delved into the intricate challenge of the variable responses observed in NACT. The central thrust of this discourse lies in the exploration and development of predictive models capable of stratifying patients based on their probable response to NACT. A nuanced emphasis has been placed on the evolution of the predictive landscape, specifically the integration of radiomic and pathological markers. The objectives sought to uncover resilient radiomic markers that exhibit correlations with NACT response, while concurrently probing the potential amplification of predictive precision through the amalgamation of radiomic markers derived from radiological images and classical pathological markers.
The endeavor to unravel the complexities of NACT response prediction has invoked diverse avenues of research. Extensive investigations have been conducted, traversing the terrain of the tumor core and its margins (peritumoral regions), as evident in [58,98,107]. A pivotal narrative echoed by [46,108,113,120] accentuates the merits of encompassing a broader spatial context within feature extraction strategies.
Our review has shown that the predictive landscape of NACT responses has undergone considerable evolution, especially with the incorporation of both radiomic and pathological markers. Emphasis on the evolution of this landscape becomes apparent when noting that 96% of articles published in the last 15 years employ radiomic markers derived from radiological images. Furthermore, 92% of these articles have documented a statistically significant correlation between at least one radiomic feature and NACT response.
Diverse imaging modalities have been harnessed to navigate the objective of NACT outcomes prediction. Mammography,traditionally seen as the cornerstone of breast tumor detection, has undergone a transformative phase with artificial intelligence integrations. Various features have been extracted from mammogram images to predict the outcome of NACT. Shape features, textural features, and the percentage of gray value change between scans have been utilized as predictive indicators. Shin et al. [35] and Skarping et al. [36], for instance, have effectively used AI to derive insights from mammogram images, with Skarping et al. distinguishing between pCR and non-pCR, recording a specificity of 90% and sensitivity of 46%. Moreover, CESM, as demonstrated by Xing et al. [44], has struck a balanced performance with a specificity of 72.15% in the CC view and 75% sensitivity. The research landscape, marked by studies like those from Wang et al. [45] and Mao et al. [46], is hinting toward the increasing importance of radiomics, with sensitivities and specificities touching the ranges of 57.7% and 90.9% respectively.
A parallel trajectory unfolds in studies utilizing ultrasound, where QUS parameters (SS, SI, MBF, ASD, AAC, ACE, and SAS) have been scrutinized extensively, often integrated with textural analyses. The exploration of elastography variants, namely strain elastography and shear-wave elastography, has also unveiled pivotal features such as strain ratio, tumor stiffness, and elasticity. Noteworthy is the DL radiomics nomogram by Jiang et al. [50], which registers a sensitivity of , and a specificity of .
The scope broadens further, encapsulating PET/CT investigations, where SUVmax represents the maximum standardized uptake value change between baseline and follow-up scans. Augmenting this are MTV, TLG, SUVpeak, SUVmean, and an array of textural features, contributing collectively to the predictive tapestry. Research spearheaded by Buchbender et al. [80], Andrade et al. [75], and Groheux et al. [82,83] have established SUVmax’s promise, with sensitivities ranging from 75% to 85.7%. Moreover, merging SUVmax with textural features, as demonstrated by Cheng et al. [86], holds promise in enhancing predictive accuracy through a multi-faceted approach.
However, amidst all imaging modalities, DCE-MRI emerges as the undeniable frontrunner. Its omnipresence in the literature and the stellar results linked with it accentuate its unparalleled value. The domain of DCE-MRI has proven equally intricate, unfurling a plethora of features. These encompass quantitative parameters like Ktrans, Kep, Ve, Vp, and i. Semi-quantitative parameters like wash-in, wash-out, and peak enhancement intertwine with morphological, statistical, and textural features such as GLCM and GLRLM. Moreover, the convergence of DCE-MRI with DWI holds promise, prominently featuring ADC maps and values (maximum, minimum, and mean) as pivotal predictors. The versatility of DCE-MRI allows for the integration of data from digital breast tomosynthesis. Further, its adeptness in synthesizing molecular subtypes, clinical markers, and radiomic features, has a capability that appears challenging for other modalities to rival. For instance, Comes et al. [123]’s combined strategy (incorporating pathological features with radiomics features) achieved an impressive AUC of was achieved, along with an accuracy of , a sensitivity of , and a specificity of . Additionally, Hussain et al. [120] reached an AUC of 0.98 and an accuracy of , emphasizing the potential of DCE-MRI’s radiomics when integrated with the molecular subtype. Nevertheless, it’s imperative to emphasize that while DCE-MRI seems superior, the best modality or marker may still oscillate depending on the clinical scenario, resource allocation, and individualized patient parameters.
Shining brightly on the horizon is the concept of multi-modal imaging that truly provides insights into the potential of integrating predictive markers from various imaging modalities. As this approach beckons exploration, it potentially paves the way for even more refined and accurate predictions in the near future. Across the evaluated imaging modalities, certain markers stood out as robust indicators of treatment response. Notably, in MRI studies, texture features from contrast-enhanced T1-weighted images at mid-treatment consistently exhibited strong predictive power. Noteworthy PET parameters, such as TLG and SUVmax, also showed substantial promise in prognosticating pCR. Variables such as hormone receptor status (ER/PR), HER2 expression, and Ki-67 proliferation index were frequently linked to treatment outcomes. Integrating these factors with multi-modal imaging data, such as radiomics extracted from PET/CT or MRI, markedly improved predictive accuracy. Joo et al. [138], a prime example, showcased a multimodal DL model that combined clinicopathological information with pre-treatment MRI images, attaining a high specificity of 93.2%, an accuracy of 95% and an AUC of 0.89 for post-first-cycle PET images. Similarly, Umutlu et al. [140] leveraged multiparametric PET/MRI to predict pCR with promising results (average AUC = 0.93, 86.4% accuracy, 86.7% sensitivity, and 86% specificity). The fusion of pathological markers with radiomics features has demonstrated varying impacts on prediction models. In most cases, integrating pathological markers has led to improved prediction capabilities. However, exceptions like [45,46,56,64,65,82,116] highlight instances where pathological factors did not significantly correlate with response or failed to enhance model performance.
These clinical implications of accurate NACT response prediction are profound, offering the potential to revolutionize personalized treatment strategies and improve patient outcomes. By identifying patients who would benefit from NACT and sparing others from potentially ineffective treatment, predictive models align with the broader goals of precision medicine.
8. Limitations and Future Perspectives
Despite these promises of NACT response predictive models, there are clear limitations and challenges to be addressed. Differences and variations in therapeutic regimens, tumor molecular subtypes, scan timings, imaging protocols, and sample sizes have led to inconsistencies in the results across studies.
Notably, when the same predictors were employed, different studies yielded conflicting outcomes. For instance, SUVmax showed a correlation with pathological response in TNBC patients as indicated by Groheux et al. [83]. However, contrasting findings were presented by Luo et al. [85] and Groheux et al. [82], showing no significant correlation for LABC and HER2+ patients, respectively. Additionally, Koolen et al. [81] highlighted that SUVmax might be a significant predictor after two weeks of NACT for TNBC patients. Yet, this predictor was not found significant in a study for HER2+ patients by Groheux et al. [82].
Regarding PK parameters extracted from DCE-MRI at baseline scans, disagreements were observed across studies. While Drisis et al. [103] found Ktrans could differentiate responders from non-responders, Braman et al. [107] argued that PK parameters did not show any significant difference between pCR and non-pCR. Lee et al. [105] also asserted that individual perfusion parameters lacked high predictive capability. Tudorica et al. [102] and Li et al. [101] presented percentage changes in PK parameters after one NACT cycle as effective predictors, a finding that was contradicted by Cho et al. [121]. Additionally, the predictive performance evaluation results of the models trained by Fan et al. [97] on two independent patient sample groups showed some differences.
Apart from the aforementioned inconsistencies, another pertinent limitation is the largely retrospective nature of many studies, often encompassing only a small cohort of patients. These studies that might need further validation include those conducted by Sadeghi-Naini et al. [48], Tadayyon et al. [52,56], Antunovic et al. [87], Giannini et al. [96], Li et al. [101], and Crippa et al. [90]. The absence of a standardized protocol further complicates matters, with each study employing unique methodologies, features, CNN layers, and classifiers. The use of multiple software applications for radiomics feature extraction only adds to the lack of uniformity. Moreover, the high cost and complexity associated with AI applications in routine imaging require specialized software and personnel expertise.
From a forward-looking perspective, future research should aim to bridge the remaining gaps in the field. While our understanding of tumor biology deepens and AI technologies advance, the promise of informed NACT strategies to transform patient care becomes increasingly tangible. The domain is broad and rich, extending from the exploration of diverse imaging modalities to the integration of clinical and molecular data. This should be built upon a strong foundation of prospective clinical trials and expansive, balanced multi-institutional datasets. As the field evolves, advanced models can also be refined to contend with dataset limitations, yielding robust, validated, and clinically impactful predictive models.
It is crucial to prioritize the development of standardized protocols, including imaging techniques, feature extraction methodologies, and analysis frameworks. Harmonizing these protocols can reduce variations and facilitate more comparable and reproducible research outcomes. These initiatives can bolster the validity of study findings and enhance the generalizability of predictive models. Moreover, fostering collaborations across research institutions can lead to a broader consensus on predictors and their efficacies, potentially ironing out the inconsistencies that currently plague the field.
In addition to addressing inconsistencies, it’s also imperative to focus on the integration of AI technologies with traditional radiomic analyses. While the potential of AI in enhancing NACT prediction is immense, its adoption is currently hindered by several factors, including the complexities of AI software and a lack of familiarity among radiologists. Efforts should be made to demystify AI technologies and promote their wider acceptance and integration into clinical practice. Workshops, training programs, and interdisciplinary collaborations can bridge the knowledge gap and ensure that the full potential of AI-driven models is realized in the realm of breast cancer treatment. While the trajectory of breast cancer treatment is poised for transformation, the path forward requires concerted efforts to address present challenges. By resolving inconsistencies, standardizing protocols, and embracing AI advancements, we can aspire for a future where NACT response predictions are both reliable and integral to personalized breast cancer management.
9. Conclusions
Breast cancer remains a global challenge necessitating innovative treatments. Neoadjuvant chemotherapy (NACT) holds promise in improving outcomes by diminishing tumor sizes and facilitating less invasive surgeries. However, the diverse patient responses underscore the need for accurate predictive models. This review extensively examines NACT response prediction by integrating radiomic and pathological markers from various imaging modalities. Valuable radiomic features are identified, particularly when considering markers from both tumor core and peritumoral margins. While each imaging modality has its merits, DCE-MRI stands out when combined with pathological markers and clinical data, a sentiment echoed across various studies. Despite challenges like limited data, collaborative efforts can build robust AI models that aid evidence-based clinical decisions. Accurate NACT response prediction has profound clinical significance, optimizing outcomes and reducing unnecessary interventions. With the advancement of AI technologies and collaboration paving the way for personalized care, the synergy between radiomic and pathological markers and the potential of informed NACT strategies can revolutionize breast cancer treatment.
Abbreviations
The following abbreviations are used in this manuscript:
CESM | Contrast Enhanced Spectral Mammography |
NACT | Neoadjuvant Chemotherapy |
NAT | Neoadjuvant Therapy |
pCR | pathological complete response |
pPR | pathological partial response |
ML | Machine Learning |
DL | Deep Learning |
AUC | Area Under Curve (ROC) |
SEN | Sensitivity |
SPE | Specifecity |
HER-2 | Human Epidermal Growth Factor Receptor-2 |
ER | Estrogen Receptor |
PR | Progesterone Receptor |
BPE | Background Parenchymal Enhancement |
DCE-MRI | dynamic contrast-enhanced magnetic resonance imaging |
FGT | Fibroglandular Tissue |
TN | Triple Negative |
NGTDM | Neighborhood Gray Tone Difference Matrix based features |
GLCM | Gray Level Co-occurrence Matrix |
GLRLM | Grey Level Run Length Matrix |
GLSZM | Grey Level Size Zone Matrix |
Author Contributions
Conceptualization, B.E., A.A., M.S., A.M., M.Z., R.A., K.A., M.A. and A.E.-B.; writing—original draft preparation, B.E., A.A., M.S. and A.E.-B.; writing—review and editing, B.E., A.A., M.S., A.M., M.Z., R.A., K.A., M.A., M.G., S.C., H.E.-D.M. and A.E.-B.; supervision, H.E.-D.M. and A.E.-B.; project administration, M.G., S.C., H.E.-D.M. and A.E.-B. All authors have read and agreed to the published version of the manuscript.
Conflicts of Interest
The authors declare no conflict of interest.
Funding Statement
This research received no external funding.
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
References
- 1.Sung H., Ferlay J., Siegel R.L., Laversanne M., Soerjomataram I., Jemal A., Bray F. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 2021;71:209–249. doi: 10.3322/caac.21660. [DOI] [PubMed] [Google Scholar]
- 2.Cancer—who.int. [(accessed on 1 July 2023)]. Available online: https://www.who.int/news-room/fact-sheets/detail/cancer.
- 3.Siegel R.L., Miller K.D., Wagle N.S., Jemal A. Cancer statistics, 2023. CA Cancer J. Clin. 2023;73:17–48. doi: 10.3322/caac.21763. [DOI] [PubMed] [Google Scholar]
- 4.Vargas H.I., Anderson B.O., Chopra R., Lehman C.D., Ibarra J.A., Masood S., Vass L. Diagnosis of breast cancer in countries with limited resources. Breast J. 2003;9:S60–S66. doi: 10.1046/j.1524-4741.9.s2.5.x. [DOI] [PubMed] [Google Scholar]
- 5.Nounou M.I., ElAmrawy F., Ahmed N., Abdelraouf K., Goda S., Syed-Sha-Qhattal H. Breast cancer: Conventional diagnosis and treatment modalities and recent patents and technologies. Breast Cancer Basic Clin. Res. 2015;9:17–34. doi: 10.4137/BCBCR.S29420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Shien T., Iwata H. Adjuvant and neoadjuvant therapy for breast cancer. Jpn. J. Clin. Oncol. 2020;50:225–229. doi: 10.1093/jjco/hyz213. [DOI] [PubMed] [Google Scholar]
- 7.Waks A.G., Winer E.P. Breast cancer treatment: A review. JAMA. 2019;321:288–300. doi: 10.1001/jama.2018.19323. [DOI] [PubMed] [Google Scholar]
- 8.Kerr A.J., Dodwell D., McGale P., Holt F., Duane F., Mannu G., Darby S.C., Taylor C.W. Adjuvant and neoadjuvant breast cancer treatments: A systematic review of their effects on mortality. Cancer Treat. Rev. 2022;105:102375. doi: 10.1016/j.ctrv.2022.102375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Korde L.A., Somerfield M.R., Carey L.A., Crews J.R., Denduluri N., Shelley Hwang E., Khan S.A., Loibl S., Morris E.A., Perez A., et al. Neoadjuvant Chemotherapy, Endocrine Therapy, and Targeted Therapy for Breast Cancer: ASCO Guideline. J. Clin. Oncol. 2021;39:1485–1505. doi: 10.1200/JCO.20.03399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Mani S., Chen Y., Li X., Arlinghaus L., Chakravarthy A.B., Abramson V., Bhave S.R., Levy M.A., Xu H., Yankeelov T.E. Research and applications: Machine learning for predicting the response of breast cancer to neoadjuvant chemotherapy. J. Am. Med. Inform. Assoc. JAMIA. 2013;20:688. doi: 10.1136/amiajnl-2012-001332. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Untch M., Konecny G.E., Paepke S., von Minckwitz G. Current and future role of neoadjuvant therapy for breast cancer. Breast. 2014;23:526–537. doi: 10.1016/j.breast.2014.06.004. [DOI] [PubMed] [Google Scholar]
- 12.Giordano S.H. Update on Locally Advanced Breast Cancer. Oncologist. 2003;8:521–530. doi: 10.1634/theoncologist.8-6-521. [DOI] [PubMed] [Google Scholar]
- 13.Taleghamar H., Jalalifar S.A., Czarnota G.J., Sadeghi-Naini A. Deep learning of quantitative ultrasound multi-parametric images at pre-treatment to predict breast cancer response to chemotherapy. Sci. Rep. 2022;12:2244. doi: 10.1038/s41598-022-06100-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Fernandes J., Sannachi L., Tran W.T., Koven A., Watkins E., Hadizad F., Gandhi S., Wright F., Curpen B., El Kaffas A., et al. Monitoring breast cancer response to neoadjuvant chemotherapy using ultrasound strain elastography. Transl. Oncol. 2019;12:1177–1184. doi: 10.1016/j.tranon.2019.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Liu X., Liu Z., Li C., Song X., Wang X., Li S., Yu Z. PR status is a more decisive factor in efficacy of adding pertuzumab into neoadjuvant therapy for HER2-positive and lymph node-positive breast cancer than ER status: A real-world retrospective study in China. World J. Surg. Oncol. 2023;21:296. doi: 10.1186/s12957-023-03178-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.General Principles of Neoadjuvant Management of Breast Cancer- Neoadjuvant Treatment Options. [(accessed on 4 October 2023)]. Available online: https://www.uptodate.com/contents/general-principles-of-neoadjuvant-management-of-breast-cancer#H179173117.
- 17.New Guidelines Move beyond Chemotherapy for Patients with Triple-Negative Breast Cancer—targetedonc.com. [(accessed on 5 October 2023)]. Available online: https://www.targetedonc.com/view/new-guidelines-move-beyond-chemotherapy-for-patients-with-triple-negative-breast-cancer.
- 18.Han H.S., Vikas P., Costa R.L., Jahan N., Taye A., Stringer-Reasor E.M. Early-Stage Triple-Negative Breast Cancer Journey: Beginning, End, and Everything in Between. Am. Soc. Clin. Oncol. Educ. Book. 2023;43:e390464. doi: 10.1200/EDBK_390464. [DOI] [PubMed] [Google Scholar]
- 19.Neoadjuvant Therapies for Breast Cancer | Susan G. Komen®—komen.org. [(accessed on 4 October 2023)]. Available online: https://www.komen.org/breast-cancer/treatment/type/neoadjuvant-therapy/#:~:text=Types%20of%20neoadjuvant%20chemotherapy%20regimens&text=Most%20are%20anthracycline%2Dbased%20or,)%20and%20pertuzumab%20(Perjeta)
- 20.Villacampa G., Matikas A., Oliveira M., Prat A., Pascual T., Papakonstantinou A. Landscape of neoadjuvant therapy in HER2-positive breast cancer: A systematic review and network meta-analysis. Eur. J. Cancer. 2023;190:112885. doi: 10.1016/j.ejca.2023.03.042. [DOI] [PubMed] [Google Scholar]
- 21.General Principles of Neoadjuvant Management of Breast Cancer- Goals. [(accessed on 27 September 2023)]. Available online: https://www.uptodate.com/contents/general-principles-of-neoadjuvant-management-of-breast-cancer#H4049625435.
- 22.Xin Y., Shen G., Zheng Y., Guan Y., Huo X., Li J., Ren D., Zhao F., Liu Z., Li Z., et al. Immune checkpoint inhibitors plus neoadjuvant chemotherapy in early triple-negative breast cancer: A systematic review and meta-analysis. BMC Cancer. 2021;21:1261. doi: 10.1186/s12885-021-08997-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.See S.H.C., Siziopikou K.P. Pathologic evaluation of specimens after neoadjuvant chemotherapy in breast cancer: Current recommendations and challenges. Pathol.-Res. Pract. 2022;230:153753. doi: 10.1016/j.prp.2021.153753. [DOI] [PubMed] [Google Scholar]
- 24.Yao L., Jia G., Lu L., Ma W. Breast cancer patients: Who would benefit from neoadjuvant chemotherapies? Curr. Oncol. 2022;29:4902–4913. doi: 10.3390/curroncol29070389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Nurgali K., Jagoe R.T., Abalo R. Adverse effects of cancer chemotherapy: Anything new to improve tolerance and reduce sequelae? Front. Pharmacol. 2018;9:245. doi: 10.3389/fphar.2018.00245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Dodington D.W., Lagree A., Tabbarah S., Mohebpour M., Sadeghi-Naini A., Tran W.T., Lu F.I. Analysis of tumor nuclear features using artificial intelligence to predict response to neoadjuvant chemotherapy in high-risk breast cancer patients. Breast Cancer Res. Treat. 2021;186:379–389. doi: 10.1007/s10549-020-06093-4. [DOI] [PubMed] [Google Scholar]
- 27.O’Donnell J.M., Gasior S., Davey M., O’Malley E., Lowery A., McGarry J., O’Connell A., Kerin M., McCarthy P. The accuracy of breast MRI radiomic methodologies in predicting pathological complete response to neoadjuvant chemotherapy: A systematic review and network meta-analysis. Eur. J. Radiol. 2022;157:110561. doi: 10.1016/j.ejrad.2022.110561. [DOI] [PubMed] [Google Scholar]
- 28.Pesapane F., Agazzi G.M., Rotili A., Ferrari F., Cardillo A., Penco S., Dominelli V., D’Ecclesiis O., Vignati S., Raimondi S., et al. Prediction of the pathological response to neoadjuvant chemotherapy in breast cancer patients with MRI-radiomics: A systematic review and meta-analysis. Curr. Probl. Cancer. 2022;2022:100883. doi: 10.1016/j.currproblcancer.2022.100883. [DOI] [PubMed] [Google Scholar]
- 29.Liang X., Yu X., Gao T. Machine learning with magnetic resonance imaging for prediction of response to neoadjuvant chemotherapy in breast cancer: A systematic review and meta-analysis. Eur. J. Radiol. 2022;150:110247. doi: 10.1016/j.ejrad.2022.110247. [DOI] [PubMed] [Google Scholar]
- 30.Na S.P., Houserkovaa D. The role of various modalities in breast imaging. Biomed Pap. Med. Fac. Univ. Palacky Olomouc Czech Repub. 2007;151:209–218. doi: 10.5507/bp.2007.036. [DOI] [PubMed] [Google Scholar]
- 31.Henry N.L., Shah P.D., Haider I., Freer P.E., Jagsi R., Sabel M.S. Abeloff’s Clinical Oncology. 6th ed. Elsevier; Philadelphia, PA, USA: 2020. Cancer of the breast; pp. 1560–1603. [Google Scholar]
- 32.Coleman C. Early Detection and Screening for Breast Cancer. Semin. Oncol. Nurs. 2017;33:141–155. doi: 10.1016/j.soncn.2017.02.009. [DOI] [PubMed] [Google Scholar]
- 33.Iranmakani S., Mortezazadeh T., Sajadian F., Ghaziani M.F., Ghafari A., Khezerloo D., Musa A.E. A review of various modalities in breast imaging: Technical aspects and clinical outcomes. Egypt. J. Radiol. Nucl. Med. 2020;51:57. doi: 10.1186/s43055-020-00175-5. [DOI] [Google Scholar]
- 34.Mammography Views. [(accessed on 1 December 2021)]. Available online: https://radiopaedia.org/articles/mammography-views.
- 35.Shin H.K., Kim W.H., Kim H.J., Kim C., Kim J. Prediction of Pathological Complete Response to Neoadjuvant Chemotherapy Using Multi-scale Patch Learning with Mammography; Proceedings of the International Workshop on Predictive Intelligence in Medicine; Strasbourg, France. 1 October 2021; Cham, Switzerland: Springer; 2021. pp. 192–200. [Google Scholar]
- 36.Skarping I., Larsson M., Förnvik D. Analysis of mammograms using artificial intelligence to predict response to neoadjuvant chemotherapy in breast cancer patients: Proof of concept. Eur. Radiol. 2021;32:3131–3141. doi: 10.1007/s00330-021-08306-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Bhimani C., Matta D., Roth R.G., Liao L., Tinney E., Brill K., Germaine P. Contrast-enhanced spectral mammography: Technique, indications, and clinical applications. Acad. Radiol. 2017;24:84–88. doi: 10.1016/j.acra.2016.08.019. [DOI] [PubMed] [Google Scholar]
- 38.Patel B.K., Lobbes M.B., Lewin J. Contrast Enhanced Spectral Mammography: A Review. Semin. Ultrasound CT MRI. 2018;39:70–79. doi: 10.1053/j.sult.2017.08.005. [DOI] [PubMed] [Google Scholar]
- 39.Mann R.M., Balleyguier C., Baltzer P.A., Bick U., Colin C., Cornford E., Evans A., Fallenberg E., Forrai G., Fuchsjäger M.H., et al. Breast MRI: EUSOBI recommendations for women’s information. Eur. Radiol. 2015;25:3669–3678. doi: 10.1007/s00330-015-3807-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Richter V., Hatterman V., Preibsch H., Bahrs S.D., Hahn M., Nikolaou K., Wiesinger B. Contrast-enhanced spectral mammography in patients with MRI contraindications. Acta Radiol. 2018;59:798–805. doi: 10.1177/0284185117735561. [DOI] [PubMed] [Google Scholar]
- 41.Wang Q., Li K., Wang L., Zhang J., Zhou Z., Feng Y. Preclinical study of diagnostic performances of contrast-enhanced spectral mammography versus MRI for breast diseases in China. Springerplus. 2016;5:763. doi: 10.1186/s40064-016-2385-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Li L., Roth R., Germaine P., Ren S., Lee M., Hunter K., Tinney E., Liao L. Contrast-enhanced spectral mammography (CESM) versus breast magnetic resonance imaging (MRI): A retrospective comparison in 66 breast lesions. Diagn. Interv. Imaging. 2017;98:113–123. doi: 10.1016/j.diii.2016.08.013. [DOI] [PubMed] [Google Scholar]
- 43.Sorin V., Sklair-Levy M. Dual-energy contrast-enhanced spectral mammography (CESM) for breast cancer screening. Quant. Imaging Med. Surg. 2019;9:1914. doi: 10.21037/qims.2019.10.13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Xing D., Mao N., Dong J., Ma H., Chen Q., Lv Y. Quantitative analysis of contrast enhanced spectral mammography grey value for early prediction of pathological response of breast cancer to neoadjuvant chemotherapy. Sci. Rep. 2021;11:5892. doi: 10.1038/s41598-021-85353-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Wang Z., Lin F., Ma H., Shi Y., Dong J., Yang P., Zhang K., Guo N., Zhang R., Cui J., et al. Contrast-Enhanced spectral mammography-based radiomics nomogram for the prediction of neoadjuvant chemotherapy-insensitive breast cancers. Front. Oncol. 2021;11:84. doi: 10.3389/fonc.2021.605230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Mao N., Shi Y., Lian C., Wang Z., Zhang K., Xie H., Zhang H., Chen Q., Cheng G., Xu C., et al. Intratumoral and peritumoral radiomics for preoperative prediction of neoadjuvant chemotherapy effect in breast cancer based on contrast-enhanced spectral mammography. Eur. Radiol. 2022;32:3207–3219. doi: 10.1007/s00330-021-08414-7. [DOI] [PubMed] [Google Scholar]
- 47.Sree S.V., Ng E.Y.K., Acharya R.U., Faust O. WJCO. World. 2011;2:171–178. doi: 10.5306/wjco.v2.i4.171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Sadeghi-Naini A., Sannachi L., Pritchard K., Trudeau M., Gandhi S., Wright F.C., Zubovits J., Yaffe M.J., Kolios M.C., Czarnota G.J. Early prediction of therapy responses and outcomes in breast cancer patients using quantitative ultrasound spectral texture. Oncotarget. 2014;5:3497. doi: 10.18632/oncotarget.1950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Byra M., Dobruch-Sobczak K., Piotrzkowska-Wroblewska H., Klimonda Z., Litniewski J. Prediction of response to neoadjuvant chemotherapy in breast cancer with recurrent neural networks and raw ultrasound signals. Phys. Med. Biol. 2022;67:185007. doi: 10.1088/1361-6560/ac8c82. [DOI] [PubMed] [Google Scholar]
- 50.Jiang M., Li C.L., Luo X.M., Chuan Z.R., Lv W.Z., Li X., Cui X.W., Dietrich C.F. Ultrasound-based deep learning radiomics in the assessment of pathological complete response to neoadjuvant chemotherapy in locally advanced breast cancer. Eur. J. Cancer. 2021;147:95–105. doi: 10.1016/j.ejca.2021.01.028. [DOI] [PubMed] [Google Scholar]
- 51.Tadayyon H., Sadeghi-Naini A., Czarnota G.J. Noninvasive characterization of locally advanced breast cancer using textural analysis of quantitative ultrasound parametric images. Transl. Oncol. 2014;7:759–767. doi: 10.1016/j.tranon.2014.10.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Tadayyon H., Sannachi L., Gangeh M., Sadeghi-Naini A., Tran W., Trudeau M.E., Pritchard K., Ghandi S., Verma S., Czarnota G.J. Quantitative ultrasound assessment of breast tumor response to chemotherapy using a multi-parameter approach. Oncotarget. 2016;7:45094. doi: 10.18632/oncotarget.8862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Sannachi L., Gangeh M., Tadayyon H., Gandhi S., Wright F.C., Slodkowska E., Curpen B., Sadeghi-Naini A., Tran W., Czarnota G.J. Breast cancer treatment response monitoring using quantitative ultrasound and texture analysis: Comparative analysis of analytical models. Transl. Oncol. 2019;12:1271–1281. doi: 10.1016/j.tranon.2019.06.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.DiCenzo D., Quiaoit K., Fatima K., Bhardwaj D., Sannachi L., Gangeh M., Sadeghi-Naini A., Dasgupta A., Kolios M.C., Trudeau M., et al. Quantitative ultrasound radiomics in predicting response to neoadjuvant chemotherapy in patients with locally advanced breast cancer: Results from multi-institutional study. Cancer Med. 2020;9:5798–5806. doi: 10.1002/cam4.3255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Dasgupta A., Brade S., Sannachi L., Quiaoit K., Fatima K., DiCenzo D., Osapoetra L.O., Saifuddin M., Trudeau M., Gandhi S., et al. Quantitative ultrasound radiomics using texture derivatives in prediction of treatment response to neo-adjuvant chemotherapy for locally advanced breast cancer. Oncotarget. 2020;11:3782. doi: 10.18632/oncotarget.27742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Tadayyon H., Sannachi L., Gangeh M.J., Kim C., Ghandi S., Trudeau M., Pritchard K., Tran W.T., Slodkowska E., Sadeghi-Naini A., et al. A priori prediction of neoadjuvant chemotherapy response and survival in breast cancer patients using quantitative ultrasound. Sci. Rep. 2017;7:45733. doi: 10.1038/srep45733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Sannachi L., Gangeh M., Tadayyon H., Sadeghi-Naini A., Gandhi S., Wright F.C., Slodkowska E., Curpen B., Tran W., Czarnota G.J. Response monitoring of breast cancer patients receiving neoadjuvant chemotherapy using quantitative ultrasound, texture, and molecular features. PLoS ONE. 2018;13:e0189634. doi: 10.1371/journal.pone.0189634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Tadayyon H., Gangeh M., Sannachi L., Trudeau M., Pritchard K., Ghandi S., Eisen A., Look-Hong N., Holloway C., Wright F., et al. A priori prediction of breast tumour response to chemotherapy using quantitative ultrasound imaging and artificial neural networks. Oncotarget. 2019;10:3910. doi: 10.18632/oncotarget.26996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Prado-Costa R., Rebelo J., Monteiro-Barroso J., Preto A.S. Ultrasound elastography: Compression elastography and shear-wave elastography in the assessment of tendon injury. Insights Imaging. 2018;9:791–814. doi: 10.1007/s13244-018-0642-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Ma Y., Zhang S., Zang L., Li J., Li J., Kang Y., Ren W. Combination of shear wave elastography and Ki-67 index as a novel predictive modality for the pathological response to neoadjuvant chemotherapy in patients with invasive breast cancer. Eur. J. Cancer. 2016;69:86–101. doi: 10.1016/j.ejca.2016.09.031. [DOI] [PubMed] [Google Scholar]
- 61.Gu J., Polley E.C., Denis M., Carter J.M., Pruthi S., Gregory A.V., Boughey J.C., Fazzio R.T., Fatemi M., Alizad A. Early assessment of shear wave elastography parameters foresees the response to neoadjuvant chemotherapy in patients with invasive breast cancer. Breast Cancer Res. 2021;23:1–13. doi: 10.1186/s13058-021-01429-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Byra M., Dobruch-Sobczak K., Klimonda Z., Piotrzkowska-Wroblewska H., Litniewski J. Early prediction of response to neoadjuvant chemotherapy in breast cancer sonography using Siamese convolutional neural networks. IEEE J. Biomed. Health Inform. 2020;25:797–805. doi: 10.1109/JBHI.2020.3008040. [DOI] [PubMed] [Google Scholar]
- 63.Xie J., Shi H., Du C., Song X., Wei J., Dong Q., Wan C. Dual-branch convolutional neural network based on ultrasound imaging in the early prediction of neoadjuvant chemotherapy response in patients with locally advanced breast cancer. Front. Oncol. 2022;12:812463. doi: 10.3389/fonc.2022.812463. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Liu Y., Wang Y., Wang Y., Xie Y., Cui Y., Feng S., Yao M., Qiu B., Shen W., Chen D., et al. Early prediction of treatment response to neoadjuvant chemotherapy based on longitudinal ultrasound images of HER2-positive breast cancer patients by Siamese multi-task network: A multicentre, retrospective cohort study. EClinicalMedicine. 2022;52:101562. doi: 10.1016/j.eclinm.2022.101562. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Gu J., Tong T., He C., Xu M., Yang X., Tian J., Jiang T., Wang K. Deep learning radiomics of ultrasonography can predict response to neoadjuvant chemotherapy in breast cancer at an early stage of treatment: A prospective study. Eur. Radiol. 2022;32:2099–2109. doi: 10.1007/s00330-021-08293-y. [DOI] [PubMed] [Google Scholar]
- 66.Yang M., Liu H., Dai Q., Yao L., Zhang S., Wang Z., Li J., Duan Q. Treatment Response Prediction Using Ultrasound-Based Pre-, Post-Early, and Delta Radiomics in Neoadjuvant Chemotherapy in Breast Cancer. Front. Oncol. 2022;12:748008. doi: 10.3389/fonc.2022.748008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Su T.P., Huang J.S., Chang P.H., Lui K.W., Hsieh J.C.H., Ng S.H., Chan S.C. Prospective comparison of early interim 18F-FDG-PET with 18F-FLT-PET for predicting treatment response and survival in metastatic breast cancer. BMC Cancer. 2021;21:908. doi: 10.1186/s12885-021-08649-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Lind P., Igerc I., Beyer T., Reinprecht P., Hausegger K. Advantages and limitations of FDG PET in the follow-up of breast cancer. Eur. J. Nucl. Med. Mol. Imaging. 2004;31:S125–S134. doi: 10.1007/s00259-004-1535-8. [DOI] [PubMed] [Google Scholar]
- 69.Romeo V., Accardo G., Perillo T., Basso L., Garbino N., Nicolai E., Maurea S., Salvatore M. Assessment and prediction of response to neoadjuvant chemotherapy in breast cancer: A comparison of imaging modalities and future perspectives. Cancers. 2021;13:3521. doi: 10.3390/cancers13143521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Ming Y., Wu N., Qian T., Li X., Wan D.Q., Li C., Li Y., Wu Z., Wang X., Liu J., et al. Progress and future trends in PET/CT and PET/MRI molecular imaging approaches for breast cancer. Front. Oncol. 2020;10:1301. doi: 10.3389/fonc.2020.01301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Bertagna F., Biasiotto G., Giubbini R. The role of F-18-fluorothymidine PET in oncology. Clin. Transl. Imaging. 2013;1:77–97. doi: 10.1007/s40336-013-0014-2. [DOI] [Google Scholar]
- 72.Hicks R.J. Beyond FDG: Novel PET tracers for cancer imaging. Cancer Imaging. 2004;4:22. doi: 10.1102/1470-7330.2003.0032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Treglia G., Sadeghi R., Del Sole A., Giovanella L. Diagnostic performance of PET/CT with tracers other than F-18-FDG in oncology: An evidence-based review. Clin. Transl. Oncol. 2014;16:770–775. doi: 10.1007/s12094-014-1168-8. [DOI] [PubMed] [Google Scholar]
- 74.Kato A., Nakamoto Y., Ishimori T., Saga T., Togashi K. Prognostic value of quantitative parameters of 18F-FDG PET/CT for patients with angiosarcoma. Am. J. Roentgenol. 2020;214:649–657. doi: 10.2214/AJR.19.21635. [DOI] [PubMed] [Google Scholar]
- 75.Andrade W., Lima E., Osório C., do Socorro Maciel M., Baiocchi G., Bitencourt A., Fanelli M., Damascena A., Soares F. Can FDG-PET/CT predict early response to neoadjuvant chemotherapy in breast cancer? Eur. J. Surg. Oncol. (EJSO) 2013;39:1358–1363. doi: 10.1016/j.ejso.2013.08.025. [DOI] [PubMed] [Google Scholar]
- 76.Kendi A.T.K., Magliocca K.R., Corey A., Galt J.R., Switchenko J., Wadsworth J.T., El-Deiry M.W., Schuster D.M., Saba N.F., Hudgins P.A. Is there a role for PET/CT parameters to characterize benign, malignant, and metastatic parotid tumors? AJR Am. J. Roentgenol. 2016;207:635. doi: 10.2214/AJR.15.15590. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Sher A., Lacoeuille F., Fosse P., Vervueren L., Cahouet-Vannier A., Dabli D., Bouchet F., Couturier O. For avid glucose tumors, the SUV peak is the most reliable parameter for [18F] FDG-PET/CT quantification, regardless of acquisition time. EJNMMI Res. 2016;6:21. doi: 10.1186/s13550-016-0177-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Vanderhoek M., Perlman S.B., Jeraj R. Impact of the definition of peak standardized uptake value on quantification of treatment response. J. Nucl. Med. 2012;53:4–11. doi: 10.2967/jnumed.111.093443. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Steiger S., Arvanitakis M., Sick B., Weder W., Hillinger S., Burger I.A. Analysis of prognostic values of various PET metrics in preoperative 18F-FDG PET for early-stage bronchial carcinoma for progression-free and overall survival: Significantly increased glycolysis is a predictive factor. J. Nucl. Med. 2017;58:1925–1930. doi: 10.2967/jnumed.117.189894. [DOI] [PubMed] [Google Scholar]
- 80.Buchbender C., Kuemmel S., Hoffmann O., Stahl A.R., Kimmig R., Otterbach F., Ladd S., Koeninger A., Forsting M., Bockisch A., et al. FDG-PET/CT for the early prediction of histopathological complete response to neoadjuvant chemotherapy in breast cancer patients: Initial results. Acta Radiol. 2012;53:628–636. doi: 10.1258/ar.2012.110699. [DOI] [PubMed] [Google Scholar]
- 81.Koolen B.B., Pengel K.E., Wesseling J., Vogel W.V., Vrancken Peeters M.J.T., Vincent A.D., Gilhuijs K.G., Rodenhuis S., Rutgers E.J.T., Valdés Olmos R.A. Sequential 18F-FDG PET/CT for early prediction of complete pathological response in breast and axilla during neoadjuvant chemotherapy. Eur. J. Nucl. Med. Mol. Imaging. 2014;41:32–40. doi: 10.1007/s00259-013-2515-7. [DOI] [PubMed] [Google Scholar]
- 82.Groheux D., Giacchetti S., Hatt M., Marty M., Vercellino L., De Roquancourt A., Cuvier C., Coussy F., Espié M., Hindié E. HER2-overexpressing breast cancer: FDG uptake after two cycles of chemotherapy predicts the outcome of neoadjuvant treatment. Br. J. Cancer. 2013;109:1157–1164. doi: 10.1038/bjc.2013.469. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Groheux D., Hindié E., Giacchetti S., Hamy A.S., Berger F., Merlet P., de Roquancourt A., de Cremoux P., Marty M., Hatt M., et al. Early assessment with 18F-fluorodeoxyglucose positron emission tomography/computed tomography can help predict the outcome of neoadjuvant chemotherapy in triple negative breast cancer. Eur. J. Cancer. 2014;50:1864–1871. doi: 10.1016/j.ejca.2014.04.020. [DOI] [PubMed] [Google Scholar]
- 84.Humbert O., Riedinger J.M., Charon-Barra C., Berriolo-Riedinger A., Desmoulins I., Lorgis V., Kanoun S., Coutant C., Fumoleau P., Cochet A., et al. Identification of Biomarkers Including 18FDG-PET/CT for Early Prediction of Response to Neoadjuvant Chemotherapy in Triple-Negative Breast CancerPredicting the Response in Triple-Negative Breast Cancer. Clin. Cancer Res. 2015;21:5460–5468. doi: 10.1158/1078-0432.CCR-15-0384. [DOI] [PubMed] [Google Scholar]
- 85.Luo J., Zhou Z., Yang Z., Chen X., Cheng J., Shao Z., Guo X., Tuan J., Fu X., Yu X. The value of 18F-FDG PET/CT imaging combined with pretherapeutic Ki67 for early prediction of pathologic response after neoadjuvant chemotherapy in locally advanced breast cancer. Medicine. 2016;95:e2914. doi: 10.1097/MD.0000000000002914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Cheng L., Zhang J., Wang Y., Xu X., Zhang Y., Zhang Y., Liu G., Cheng J. Textural features of 18F-FDG PET after two cycles of neoadjuvant chemotherapy can predict pCR in patients with locally advanced breast cancer. Ann. Nucl. Med. 2017;31:544–552. doi: 10.1007/s12149-017-1184-1. [DOI] [PubMed] [Google Scholar]
- 87.Antunovic L., De Sanctis R., Cozzi L., Kirienko M., Sagona A., Torrisi R., Tinterri C., Santoro A., Chiti A., Zelic R., et al. PET/CT radiomics in breast cancer: Promising tool for prediction of pathological response to neoadjuvant chemotherapy. Eur. J. Nucl. Med. Mol. Imaging. 2019;46:1468–1477. doi: 10.1007/s00259-019-04313-8. [DOI] [PubMed] [Google Scholar]
- 88.Li P., Wang X., Xu C., Liu C., Zheng C., Fulham M.J., Feng D., Wang L., Song S., Huang G. 18F-FDG PET/CT radiomic predictors of pathologic complete response (pCR) to neoadjuvant chemotherapy in breast cancer patients. Eur. J. Nucl. Med. Mol. Imaging. 2020;47:1116–1126. doi: 10.1007/s00259-020-04684-3. [DOI] [PubMed] [Google Scholar]
- 89.Fantini L., Belli M.L., Azzali I., Loi E., Bettinelli A., Feliciani G., Mezzenga E., Fedeli A., Asioli S., Paganelli G., et al. Exploratory Analysis of 18F-3’-Deoxy-3’-Fluorothymidine (18F-FLT) PET/CT-Based Radiomics for the Early Evaluation of Response to Neoadjuvant Chemotherapy in Patients With Locally Advanced Breast Cancer. Front. Oncol. 2021;11:2315. doi: 10.3389/fonc.2021.601053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Crippa F., Agresti R., Sandri M., Marfiani G., Padovano B., Alessi A., Bianchi G., Bombardieri E., Maugeri I., Rampa M., et al. 18F-FLT PET/CT as an imaging tool for early prediction of pathological response in patients with locally advanced breast cancer treated with neoadjuvant chemotherapy: A pilot study. Eur. J. Nucl. Med. Mol. Imaging. 2015;42:818–830. doi: 10.1007/s00259-015-2995-8. [DOI] [PubMed] [Google Scholar]
- 91.Fusco R., Sansone M., Filice S., Carone G., Amato D.M., Sansone C., Petrillo A. Pattern recognition approaches for breast cancer DCE-MRI classification: A systematic review. J. Med. Biol. Eng. 2016;36:449–459. doi: 10.1007/s40846-016-0163-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Chavhan G.B., Humphries P.D. Functional MRI: DWI and DCE-MRI. Imaging in Pediatric Oncology. Springer; Cham, Switzerland: 2019. pp. 91–106. [Google Scholar]
- 93.Banaie M., Soltanian-Zadeh H., Saligheh-Rad H.R., Gity M. Spatiotemporal features of DCE-MRI for breast cancer diagnosis. Comput. Methods Programs Biomed. 2018;155:153–164. doi: 10.1016/j.cmpb.2017.12.015. [DOI] [PubMed] [Google Scholar]
- 94.Ahmed A., Gibbs P., Pickles M., Turnbull L. Texture analysis in assessment and prediction of chemotherapy response in breast cancer. J. Magn. Reson. Imaging. 2013;38:89–101. doi: 10.1002/jmri.23971. [DOI] [PubMed] [Google Scholar]
- 95.Teruel J.R., Heldahl M.G., Goa P.E., Pickles M., Lundgren S., Bathen T.F., Gibbs P. Dynamic contrast-enhanced MRI texture analysis for pretreatment prediction of clinical and pathological response to neoadjuvant chemotherapy in patients with locally advanced breast cancer. NMR Biomed. 2014;27:887–896. doi: 10.1002/nbm.3132. [DOI] [PubMed] [Google Scholar]
- 96.Giannini V., Mazzetti S., Marmo A., Montemurro F., Regge D., Martincich L. A computer-aided diagnosis (CAD) scheme for pretreatment prediction of pathological response to neoadjuvant therapy using dynamic contrast-enhanced MRI texture features. Br. J. Radiol. 2017;90:20170269. doi: 10.1259/bjr.20170269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Fan M., Wu G., Cheng H., Zhang J., Shao G., Li L. Radiomic analysis of DCE-MRI for prediction of response to neoadjuvant chemotherapy in breast cancer patients. Eur. J. Radiol. 2017;94:140–147. doi: 10.1016/j.ejrad.2017.06.019. [DOI] [PubMed] [Google Scholar]
- 98.Cain E.H., Saha A., Harowicz M.R., Marks J.R., Marcom P.K., Mazurowski M.A. Multivariate machine learning models for prediction of pathologic response to neoadjuvant therapy in breast cancer using MRI features: A study using an independent validation set. Breast Cancer Res. Treat. 2019;173:455–463. doi: 10.1007/s10549-018-4990-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Eom H.J., Cha J.H., Choi W.J., Chae E.Y., Shin H.J., Kim H.H. Predictive clinicopathologic and dynamic contrast-enhanced MRI findings for tumor response to neoadjuvant chemotherapy in triple-negative breast cancer. Am. J. Roentgenol. 2017;208:W225–W230. doi: 10.2214/AJR.16.17125. [DOI] [PubMed] [Google Scholar]
- 100.Li W., Newitt D.C., Gibbs J., Wilmes L.J., Jones E.F., Arasu V.A., Strand F., Onishi N., Nguyen A.A.T., Kornak J., et al. Predicting breast cancer response to neoadjuvant treatment using multi-feature MRI: Results from the I-SPY 2 TRIAL. NPJ Breast Cancer. 2020;6:63. doi: 10.1038/s41523-020-00203-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.Li X., Arlinghaus L.R., Ayers G.D., Chakravarthy A.B., Abramson R.G., Abramson V.G., Atuegwu N., Farley J., Mayer I.A., Kelley M.C., et al. DCE-MRI analysis methods for predicting the response of breast cancer to neoadjuvant chemotherapy: Pilot study findings. Magn. Reson. Med. 2014;71:1592–1602. doi: 10.1002/mrm.24782. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 102.Tudorica A., Oh K.Y., Chui S.Y., Roy N., Troxell M.L., Naik A., Kemmer K.A., Chen Y., Holtorf M.L., Afzal A., et al. Early prediction and evaluation of breast cancer response to neoadjuvant chemotherapy using quantitative DCE-MRI. Transl. Oncol. 2016;9:8–17. doi: 10.1016/j.tranon.2015.11.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 103.Drisis S., Metens T., Ignatiadis M., Stathopoulos K., Chao S.L., Lemort M. Quantitative DCE-MRI for prediction of pathological complete response following neoadjuvant treatment for locally advanced breast cancer: The impact of breast cancer subtypes on the diagnostic accuracy. Eur. Radiol. 2016;26:1474–1484. doi: 10.1007/s00330-015-3948-0. [DOI] [PubMed] [Google Scholar]
- 104.Thibault G., Tudorica A., Afzal A., Chui S.Y., Naik A., Troxell M.L., Kemmer K.A., Oh K.Y., Roy N., Jafarian N., et al. DCE-MRI texture features for early prediction of breast cancer therapy response. Tomography. 2017;3:23–32. doi: 10.18383/j.tom.2016.00241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Lee J., Kim S.H., Kang B.J. Pretreatment prediction of pathologic complete response to neoadjuvant chemotherapy in breast cancer: Perfusion metrics of dynamic contrast enhanced MRI. Sci. Rep. 2018;8:9490. doi: 10.1038/s41598-018-27764-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Ashraf A., Gaonkar B., Mies C., DeMichele A., Rosen M., Davatzikos C., Kontos D. Breast DCE-MRI kinetic heterogeneity tumor markers: Preliminary associations with neoadjuvant chemotherapy response. Transl. Oncol. 2015;8:154–162. doi: 10.1016/j.tranon.2015.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 107.Braman N.M., Etesami M., Prasanna P., Dubchuk C., Gilmore H., Tiwari P., Plecha D., Madabhushi A. Intratumoral and peritumoral radiomics for the pretreatment prediction of pathological complete response to neoadjuvant chemotherapy based on breast DCE-MRI. Breast Cancer Res. 2017;19:57. doi: 10.1186/s13058-017-0846-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.Caballo M., Sanderink W.B., Han L., Gao Y., Athanasiou A., Mann R.M. Four-Dimensional Machine Learning Radiomics for the Pretreatment Assessment of Breast Cancer Pathologic Complete Response to Neoadjuvant Chemotherapy in Dynamic Contrast-Enhanced MRI. J. Magn. Reson. Imaging. 2023;57:97–110. doi: 10.1002/jmri.28273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 109.Drukker K., Edwards A.V., Doyle C., Papaioannou J., Kulkarni K., Giger M.L. Breast MRI radiomics for the pretreatment prediction of response to neoadjuvant chemotherapy in node-positive breast cancer patients. J. Med. Imaging. 2019;6:034502. doi: 10.1117/1.JMI.6.3.034502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Machireddy A., Thibault G., Tudorica A., Afzal A., Mishal M., Kemmer K., Naik A., Troxell M., Goranson E., Oh K., et al. Early prediction of breast cancer therapy response using multiresolution fractal analysis of DCE-MRI parametric maps. Tomography. 2019;5:90–98. doi: 10.18383/j.tom.2018.00046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Wu J., Gong G., Cui Y., Li R. Intratumor partitioning and texture analysis of dynamic contrast-enhanced (DCE)-MRI identifies relevant tumor subregions to predict pathological response of breast cancer to neoadjuvant chemotherapy. J. Magn. Reson. Imaging. 2016;44:1107–1115. doi: 10.1002/jmri.25279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.El Adoui M., Drisis S., Benjelloun M. A PRM approach for early prediction of breast cancer response to chemotherapy based on registered MR images. Int. J. Comput. Assist. Radiol. Surg. 2018;13:1233–1243. doi: 10.1007/s11548-018-1790-y. [DOI] [PubMed] [Google Scholar]
- 113.El Adoui M., Drisis S., Benjelloun M. Multi-input deep learning architecture for predicting breast tumor response to chemotherapy using quantitative MR images. Int. J. Comput. Assist. Radiol. Surg. 2020;15:1491–1500. doi: 10.1007/s11548-020-02209-9. [DOI] [PubMed] [Google Scholar]
- 114.Khanna P., Sahu M., Singh B.K., Bhateja V. Early prediction of pathological complete response to neoadjuvant chemotherapy in breast cancer MRI images using combined Pre-trained convolutional neural network and machine learning. Measurement. 2023;207:112269. doi: 10.1016/j.measurement.2022.112269. [DOI] [Google Scholar]
- 115.Jimenez J.E., Abdelhafez A., Mittendorf E.A., Elshafeey N., Yung J.P., Litton J.K., Adrada B.E., Candelaria R.P., White J., Thompson A.M., et al. A model combining pretreatment MRI radiomic features and tumor-infiltrating lymphocytes to predict response to neoadjuvant systemic therapy in triple-negative breast cancer. Eur. J. Radiol. 2022;149:110220. doi: 10.1016/j.ejrad.2022.110220. [DOI] [PubMed] [Google Scholar]
- 116.Golden D.I., Lipson J.A., Telli M.L., Ford J.M., Rubin D.L. Dynamic contrast-enhanced MRI-based biomarkers of therapeutic response in triple-negative breast cancer. J. Am. Med. Inform. Assoc. 2013;20:1059–1066. doi: 10.1136/amiajnl-2012-001460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Jahani N., Cohen E., Hsieh M.K., Weinstein S.P., Pantalone L., Hylton N., Newitt D., Davatzikos C., Kontos D. Prediction of treatment response to neoadjuvant chemotherapy for breast cancer via early changes in tumor heterogeneity captured by DCE-MRI registration. Sci. Rep. 2019;9:12144. doi: 10.1038/s41598-019-48465-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Sutton E.J., Onishi N., Fehr D.A., Dashevsky B.Z., Sadinski M., Pinker K., Martinez D.F., Brogi E., Braunstein L., Razavi P., et al. A machine learning model that classifies breast cancer pathologic complete response on MRI post-neoadjuvant chemotherapy. Breast Cancer Res. 2020;22:57. doi: 10.1186/s13058-020-01291-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Fan M., Chen H., You C., Liu L., Gu Y., Peng W., Gao X., Li L. Radiomics of tumor heterogeneity in longitudinal dynamic contrast-enhanced magnetic resonance imaging for predicting response to neoadjuvant chemotherapy in breast cancer. Front. Mol. Biosci. 2021;8:622219. doi: 10.3389/fmolb.2021.622219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Hussain L., Huang P., Nguyen T., Lone K.J., Ali A., Khan M.S., Li H., Suh D.Y., Duong T.Q. Machine learning classification of texture features of MRI breast tumor and peri-tumor of combined pre-and early treatment predicts pathologic complete response. BioMedical Eng. Online. 2021;20:63. doi: 10.1186/s12938-021-00899-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Cho N., Im S.A., Park I.A., Lee K.H., Li M., Han W., Noh D.Y., Moon W.K. Breast cancer: Early prediction of response to neoadjuvant chemotherapy using parametric response maps for MR imaging. Radiology. 2014;272:385–396. doi: 10.1148/radiol.14131332. [DOI] [PubMed] [Google Scholar]
- 122.Drisis S., El Adoui M., Flamen P., Benjelloun M., Dewind R., Paesmans M., Ignatiadis M., Bali M., Lemort M. Early prediction of neoadjuvant treatment outcome in locally advanced breast cancer using parametric response mapping and radial heterogeneity from breast MRI. J. Magn. Reson. Imaging. 2020;51:1403–1411. doi: 10.1002/jmri.26996. [DOI] [PubMed] [Google Scholar]
- 123.Comes M.C., Fanizzi A., Bove S., Didonna V., Diotaiuti S., La Forgia D., Latorre A., Martinelli E., Mencattini A., Nardone A., et al. Early prediction of neoadjuvant chemotherapy response by exploiting a transfer learning approach on breast DCE-MRIs. Sci. Rep. 2021;11:14123. doi: 10.1038/s41598-021-93592-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Peng Y., Cheng Z., Gong C., Zheng C., Zhang X., Wu Z., Yang Y., Yang X., Zheng J., Shen J. Pretreatment DCE-MRI-Based Deep Learning Outperforms Radiomics Analysis in Predicting Pathologic Complete Response to Neoadjuvant Chemotherapy in Breast Cancer. Front. Oncol. 2022;12:846775. doi: 10.3389/fonc.2022.846775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 125.Li Y., Chen Y., Zhao R., Ji Y., Li J., Zhang Y., Lu H. Development and validation of a nomogram based on pretreatment dynamic contrast-enhanced MRI for the prediction of pathologic response after neoadjuvant chemotherapy for triple-negative breast cancer. Eur. Radiol. 2022;32:1676–1687. doi: 10.1007/s00330-021-08291-0. [DOI] [PubMed] [Google Scholar]
- 126.Liang X., Chen X., Yang Z., Liao Y., Wang M., Li Y., Fan W., Dai Z., Zhang Y. Early prediction of pathological complete response to neoadjuvant chemotherapy combining DCE-MRI and apparent diffusion coefficient values in breast Cancer. BMC Cancer. 2022;22:1250. doi: 10.1186/s12885-022-10315-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Li X., Abramson R.G., Arlinghaus L.R., Kang H., Chakravarthy A.B., Abramson V.G., Farley J., Mayer I.A., Kelley M.C., Meszoely I.M., et al. Multiparametric magnetic resonance imaging for predicting pathological response after the first cycle of neoadjuvant chemotherapy in breast cancer. Investig. Radiol. 2015;50:195–204. doi: 10.1097/RLI.0000000000000100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.AM O’Flynn E., Collins D., D’Arcy J., Schmidt M., de Souza N.M. Multi-parametric MRI in the early prediction of response to neo-adjuvant chemotherapy in breast cancer: Value of non-modelled parameters. Eur. J. Radiol. 2016;85:837–842. doi: 10.1016/j.ejrad.2016.02.006. [DOI] [PubMed] [Google Scholar]
- 129.Zhao R., Lu H., Li Y.B., Shao Z.Z., Ma W.J., Liu P.F. Nomogram for early prediction of pathological complete response to neoadjuvant chemotherapy in breast cancer using dynamic contrast-enhanced and diffusion-weighted MRI. Acad. Radiol. 2022;29:S155–S163. doi: 10.1016/j.acra.2021.01.023. [DOI] [PubMed] [Google Scholar]
- 130.Bian T., Wu Z., Lin Q., Wang H., Ge Y., Duan S., Fu G., Cui C., Su X. Radiomic signatures derived from multiparametric MRI for the pretreatment prediction of response to neoadjuvant chemotherapy in breast cancer. Br. J. Radiol. 2020;93:20200287. doi: 10.1259/bjr.20200287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Tahmassebi A., Wengert G.J., Helbich T.H., Bago-Horvath Z., Alaei S., Bartsch R., Dubsky P., Baltzer P., Clauser P., Kapetas P., et al. Impact of machine learning with multiparametric magnetic resonance imaging of the breast for early prediction of response to neoadjuvant chemotherapy and survival outcomes in breast cancer patients. Investig. Radiol. 2019;54:110. doi: 10.1097/RLI.0000000000000518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Eun N.L., Kang D., Son E.J., Park J.S., Youk J.H., Kim J.A., Gweon H.M. Texture analysis with 3.0-T MRI for association of response to neoadjuvant chemotherapy in breast cancer. Radiology. 2020;294:31–41. doi: 10.1148/radiol.2019182718. [DOI] [PubMed] [Google Scholar]
- 133.Liu Z., Li Z., Qu J., Zhang R., Zhou X., Li L., Sun K., Tang Z., Jiang H., Li H., et al. Radiomics of multiparametric MRI for pretreatment prediction of pathologic complete response to neoadjuvant chemotherapy in breast cancer: A multicenter study. Clin. Cancer Res. 2019;25:3538–3547. doi: 10.1158/1078-0432.CCR-18-3190. [DOI] [PubMed] [Google Scholar]
- 134.Syed A., Adam R., Ren T., Lu J., Maldjian T., Duong T.Q. Machine learning with textural analysis of longitudinal multiparametric MRI and molecular subtypes accurately predicts Pathologic Complete Response in patients with invasive breast cancer. PLoS ONE. 2023;18:e0280320. doi: 10.1371/journal.pone.0280320. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Chen S., Shu Z., Li Y., Chen B., Tang L., Mo W., Shao G., Shao F. Machine learning-based radiomics nomogram using magnetic resonance images for prediction of neoadjuvant chemotherapy efficacy in breast cancer patients. Front. Oncol. 2020;10:1410. doi: 10.3389/fonc.2020.01410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Chen X., Chen X., Yang J., Li Y., Fan W., Yang Z. Combining dynamic contrast-enhanced magnetic resonance imaging and apparent diffusion coefficient maps for a radiomics nomogram to predict pathological complete response to neoadjuvant chemotherapy in breast cancer patients. J. Comput. Assist. Tomogr. 2020;44:275–283. doi: 10.1097/RCT.0000000000000978. [DOI] [PubMed] [Google Scholar]
- 137.Xiong Q., Zhou X., Liu Z., Lei C., Yang C., Yang M., Zhang L., Zhu T., Zhuang X., Liang C., et al. Multiparametric MRI-based radiomics analysis for prediction of breast cancers insensitive to neoadjuvant chemotherapy. Clin. Transl. Oncol. 2020;22:50–59. doi: 10.1007/s12094-019-02109-8. [DOI] [PubMed] [Google Scholar]
- 138.Joo S., Ko E.S., Kwon S., Jeon E., Jung H., Kim J.Y., Chung M.J., Im Y.H. Multimodal deep learning models for the prediction of pathologic response to neoadjuvant chemotherapy in breast cancer. Sci. Rep. 2021;11:18800. doi: 10.1038/s41598-021-98408-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Yoon H.J., Kim Y., Chung J., Kim B.S. Predicting neo-adjuvant chemotherapy response and progression-free survival of locally advanced breast cancer using textural features of intratumoral heterogeneity on F-18 FDG PET/CT and diffusion-weighted MR imaging. Breast J. 2019;25:373–380. doi: 10.1111/tbj.13032. [DOI] [PubMed] [Google Scholar]
- 140.Umutlu L., Kirchner J., Bruckmann N.M., Morawitz J., Antoch G., Ting S., Bittner A.K., Hoffmann O., Häberle L., Ruckhäberle E., et al. Multiparametric 18F-FDG PET/MRI-Based Radiomics for Prediction of Pathological Complete Response to Neoadjuvant Chemotherapy in Breast Cancer. Cancers. 2022;14:1727. doi: 10.3390/cancers14071727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 141.Choi J.H., Kim H.A., Kim W., Lim I., Lee I., Byun B.H., Noh W.C., Seong M.K., Lee S.S., Kim B.I., et al. Early prediction of neoadjuvant chemotherapy response for advanced breast cancer using PET/MRI image deep learning. Sci. Rep. 2020;10:21149. doi: 10.1038/s41598-020-77875-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 142.Montemezzi S., Benetti G., Bisighin M.V., Camera L., Zerbato C., Caumo F., Fiorio E., Zanelli S., Zuffante M., Cavedon C. 3T DCE-MRI radiomics improves predictive models of complete response to neoadjuvant chemotherapy in breast cancer. Front. Oncol. 2021;11:1289. doi: 10.3389/fonc.2021.630780. [DOI] [PMC free article] [PubMed] [Google Scholar]