Abstract
This study established a fully automated computer-aided diagnosis (CAD) system for the classification of malignant and benign masses via breast magnetic resonance imaging (BMRI). A breast segmentation method consisting of a preprocessing step to identify the air-breast interfacing boundary and curve fitting for chest wall line (CWL) segmentation was included in the proposed CAD system. The Chan-Vese (CV) model level set (LS) segmentation method was adopted to segment breast mass and demonstrated sufficiently good segmentation performance. The support vector machine (SVM) classifier with ReliefF feature selection was used to merge the extracted morphological and texture features into a classification score. The accuracy, sensitivity, and specificity measurements for the leave-half-case-out resampling method were 92.3%, 98.2%, and 76.2%, respectively. For the leave-one-case-out resampling method, the measurements were 90.0%, 98.7%, and 73.8%, respectively.
1. Introduction
Because early detection of breast cancer offers the best chance for a cure, regular screening has been identified as a key to improving breast cancer survival rates. Breast cancer is commonly based on X-ray mammography and ultrasound, which have a low sensitivity and are not effective in dense breast tissue. Dynamic contrast-enhanced (DCE) magnetic resonance imaging (MRI) has been shown to be the most sensitive screening methodology for the detection of invasive breast cancer and can detect breast cancer missed by mammography [1, 2]. Computer image analysis provides various techniques for analyzing medical images. Computerized methods have recently shown great potential for providing radiologists with a second opinion about the visual diagnosis of the malignancy of mammographic masses.
However, compared with mammography, relatively fewer computer-aided diagnosis (CAD) systems have been developed specifically for breast MRIs (BMRIs). Most CAD systems require radiologists to manually (or semiautomated) segment tumors from the imaging data [3–6]. As a result, there is an urgent need to develop a fully automated CAD system that allows radiologists to diagnose the data more efficiently. Three key components of such an automated CAD system commonly include an appropriate segmentation algorithm, an appropriate feature extraction algorithm, and an appropriate classification algorithm responsible for the differential diagnosis of malignant and benign masses.
DCE-MRI automated segmentation should include breast and mass segmentation. However, few studies have focused on breast segmentation. Several mass segmentation methods such as the threshold, region growing, clustering, and 3D level set (LS) methods have been proposed. Shi et al. [6] used the fuzzy c-mean (FCM) clustering algorithm followed by a 3D LS method to refine segmentation. Region growing methods [7] gather pixels or subregions from larger regions using predetermined similarity criteria, which suffer from sensitivity to the selection of initial seed points. Among the clustering techniques, the FCM [8] has received much attention, but it exhibits a low performance [9] owing to its oversensitivity to noise. Liney et al. [10] presented a user-interaction-threshold method to extract the region of interest (ROI), a method that requires manual intervention. A novel two-step approach that incorporates FCM clustering and a gradient vector flow (GVF) snake algorithm for mass contour segmentation in BMRIs was also designed and obtained encouraging results [3].
Features can be extracted from original ROIs, segmented masses, and patient information. Most CAD systems in the literature have explored morphological and texture features based on segmented masses. As adjunct diagnostic criteria, the morphological features of BMRIs have proven useful in improving specificity without significantly decreasing sensitivity [3, 11–13]. Texture analysis (i.e., homogeneity and regularity with diagnosis potential in MRIs) is significantly associated with breast tumor subtype and neoadjuvant therapy response and has been used extensively to quantify MRI characteristics [3, 14–16]. A preliminary study [3] proved the potential discriminatory power of the image features estimated from both morphological and texture features.
Feature selection, which refers to the choosing of a subset of attributes from a set of original attributes, is an important issue in building classification systems. However, few studies have investigated the feature selection performance of both morphological and texture features in discriminating pathologically verified breast masses.
Classifier selection is a crucial step for computerized classification of malignant and benign breast masses. Classical breast CAD classification algorithms include the support vector machine (SVM) [17], naive Bayes (NB) classifier technique [18], k-nearest neighbors (KNN) [18], logistic regression (LR) model [19], and linear discriminant method (LDA) [20]. SVMs have been shown to outperform many alternative pattern-recognition techniques for breast cancer detection from MRI [5, 21, 22]. The preliminary study [3] systematically investigated diagnostic performance by combining the merits of both morphological and texture features using the Fisher stepwise discriminant analysis model.
In this study, a fully automated DCR-MRI CAD system is developed, in which a fully automated breast segmentation algorithm based on curve fitting is proposed. The Chan-Vese (CV) model and LS method are used to evolve the segmentation of breast masses. Both the morphological and texture features of a BMRI mass are calculated based on the proposed computerized segmentation contour and radiologists' delineation, respectively. To remove redundancy and increase the diagnostic capabilities of the features, a ReliefF algorithm [23], which is one of the most successful feature selection algorithms, is adopted to select and optimize the features. SVMs have been used to diagnose breast cancer and achieved the highest classification accuracy among the available artificial intelligent methods. This study focuses on evaluating the SVM as a potential classifier in combination with ReliefF feature selection to classify benign and malignant masses. The computational results of both the segmentation and characterization of breast masses are compared via manual delineation and the pathological results given by experienced radiologists.
2. Materials and Methods
Figure 1 shows a flowchart of the main steps of the proposed DCE-MRI CAD system. All five steps were fully automated. The first step was to segment the breast as an organ from other parts in the BMRI via preprocessing to identify the air-breast interfacing boundary and curve fitting for the chest wall line (CWL) segmentation. Second, the segmented breast was processed further by applying the LS model to obtain the final mass segmentation. Morphological and texture features were extracted from the LS segmentation in the computerized characterization section. A ReliefF algorithm has been successfully used in many large subset feature selection tasks, and here it was guided to estimate the weight of the morphological and texture feature. It was fundamentally important to select the relevant and necessary features in the preprocessing step. Finally, a SVM classifier was used to evaluate the ability of the mass descriptors to discriminate the different ROIs to determine whether they represented malignant or benign masses.
2.1. Breast Mass Database
The dataset consisted of 120 female patients (42 benign and 78 malignant) who had been examined with a final histopathology confirmation (age range = 29–66 years, mean age = 47.5 years) from Sun Yat-sen University Cancer Center (Guangzhou, China). Patients with suspicious breast masses were recruited with written informed consent. The Ethics Committee of Sun Yat-sen University Cancer Center approved the study. Patients were scanned in the prone position using a 1.5 T superconductive magnetic system (GE, Signa, HDx) with a breast-specific 4-channel phased-array surface coil. The patients were injected with a contrast medium using a hand venipuncture technique and then scanned in the prone position with the bilateral breast naturally hanging into the two holes of the coil and their feet placed into the machine. The patients had not received treatment before nuclear magnetic detection. In this study, only mass-like masses that showed strong contrast enhancements were selected. The database of the images for each case included one sagittal postcontrast image slice that showed an obvious contrast enhancement and demonstrated the maximum dimension of a mass. All of the images in this dataset were 512 × 512 pixels in size and had an 8-bit gray-level resolution [3].
2.2. Breast Segmentation
Breast segmentation is performed to separate the breast as an organ from the chest wall, a critical and challenging first step in automated BMRI analysis. For this task, the segmentation algorithm requires the identification of both the air-breast interface and the CWL. Although breast segmentation in BMRIs is an important topic for cancer treatment and diagnosis, it is mostly performed using a manual or semiautomated delineation method [3, 24, 25]. Few automated methods have reportedly been used in an MRI CAD system [26–28]. These methods require a large number of training samples or complex calculation. The segmentation method used in this study consisted of a preprocessing step to identify both the air-breast interfacing boundary and curve fitting for the CWL segmentation. The air-breast interface is initially identified given a BMRI scan sagittal postcontrast image. The air-breast boundary is relatively easy to identify in a BMRI due to the highly intense contrast of the boundary. Preprocessing techniques including thresholding (image binarization), image morphological opening, morphological closing, hole filling, connected components extraction, and edge contour extraction were applied sequentially to each 2D slice (Figure 2). Once the air-breast interface was identified, there were three main steps to refining the breast segmentation: (1) using the curve fitting method to fine fit the outline of the breast; (2) drawing a straight line according to the outer contour line of the two vertices as the initial CWL; and (3) adopting the GVF snake iteration approach and curve fitting iteration to increase the accuracy of the CWL edge extraction. Figure 2(h) shows the result of the outer contour. Figure 2(i) shows the result of using the two vertices to draw a straight line as the initial CWL. The straight line based on the GVF snake iterative algorithm is shown in yellow in Figure 2(j). The green line represents the fitting of the breast contour line, with the polynomial fitting method used to obtain the CWL.
2.3. Mass Segmentation
Because breast masses on DCE-MRI scans may be more pronounced at the periphery than the internal region of the mass, the object segmented by the FCM clustering algorithm may contain holes. The preliminary study [3] implemented a novel two-step approach that incorporated FCM clustering and a GVF snake algorithm for mass contour segmentation on a BMRI. Although the snake model allows for fast evaluation, it makes handling topological changes difficult. The main drawbacks of the snake algorithm are its sensitivity to initial conditions and the difficulties associated with topological transformations. Moreover, the snake segmentation method of a contour is too smooth and has a disadvantageous influence on the subsequent characterization and differentiation of benign and malignant breast masses. In this study, an LS-based method was adopted to produce a refined ROI. An FCM-based method was not used to produce an initial segmentation of the ROI. FCM initial segmentation has been proven not to improve performance. A current implementation of the GVF snake was compared with a previous implementation. The LS segmentation method used is detailed as follows.
The LS method is a deformable model that can capture object's shape or surface by numerically solving a well-designed partial differential equation (PDE). The LS method has increasingly been applied to image segmentation in the past decade, as it allows for cusps, corners, and automatic topological changes such as object splitting and merging. It has several advantages over its predecessor, the explicit active contour model. The curve is represented implicitly via a Lipschitz function ϕ in level C = {(x, y)∣ϕ(x, y) = 0}, and the evolution of the curve is given by the zero-level curve at time t of the function ϕ(x, y, t). The isocontour ϕ(x, y) = 0, which encloses a region Ω, has specifically been referred to as the zero LS in the literature. Evolving the curve C in a normal direction at a speed F amounts to solving the differential equation. Consider
(1) |
In the LS formulation of the model, C ⊂ Ω is represented by the zero LS of a Lipschitz function ϕ : R → Ω, such that
(2) |
Because the classical LS models rely on the edge function (depending on the image gradient) to stop the curve evolution, they can detect only objects with edges defined by gradients. In practice, the discrete gradients are bounded. The stopping function is never zero on the edges, and the curve may pass through the boundary, especially for the models used in [29].
Chan and Vese [30] proposed a new model for active contours to detect objects in a given image based on curve evolution techniques, the Mumford-Shah functional for segmentation, and LSs. The Chan-Vese model can detect objects whose boundaries are not necessarily defined by gradients.
Define the evolving curve C in Ω as the boundary of an open subset ω of Ω (i.e., ω ⊂ Ω and C = ∂ω). In what follows, inside (C) denotes the region ω and outside (C) denotes the region Ω∖ϖ.
The Mumford-Shah functional for segmentation is
(3) |
where u 0 is a given image and μ and λ are positive parameters. The solution image μ obtained by minimizing this functional is formed by smooth regions denoted by R i and sharp boundaries denoted by C.
The energy functional of the Chan-Vese model is defined by
(4) |
where μ > 0, v > 0, λ 1 > 0, λ 2 > 0 are fixed parameters.
The Heaviside function H and one-dimensional Dirac measure δ 0 are, respectively, defined as
(5) |
For the LS formulation of the variational active contour model, the Chan-Vese model replaces the unknown variable C with the unknown variable ϕ. The energy F(c 1, c 2, ϕ) can then be written as
(6) |
(7) |
The Dirac delta function δ, which is the derivative of the Heaviside function H, is accordingly replaced by the derivative of H ε, which is calculated as
(8) |
Denote by F ε the associated regularized functional, defined as
(9) |
Keeping c 1 and c 2 fixed and minimized in terms of F ε, the associated Euler-Lagrange equation can be deduced for ϕ. Consider
(10) |
Formulations (7), (8), (9), and (10) are numerical approximations of the CV LS model.
The mass extracted by FCM-based segmentation, FCM-GVF, and LS was compared with the reference standard, that is, the radiologist's manual segmentation. Figure 3(a) shows an example ROI that contains a mass proven to be a malignant tumor via biopsy. The mass edge is blurred and partially overlapped by other soft tissues, so the traditional segmentation methods are prone to segmentation leak. Figure 3(b) shows the boundary resulting from the FCM clustering and morphological opening. Although the FCM segmentation covers most of the mass edges visually, it is still slightly undersegmented on the lower right corner of the mass. The boundary was then refined by the GVF snake segmentation as shown in Figure 3(c) and by the LS segmentation as shown in Figure 3(d), respectively. In both figures, although the FCM-GVF segmentation covers most of the mass edges visually, it is still slightly too smooth. The LS segmentation covers most of the mass edges visually, allowing for cusps, corners, and automatic topological changes.
2.4. Feature Extraction and Selection
2.4.1. Morphological and Texture Features
The morphological and texture features are the most commonly used features of a breast cancer CAD system. The CAD system in this study directly obtained the two features and required no other software. Texture is an intrinsic characteristic of an object and is important for medical image analysis [31]. Researchers have proposed various textural algorithms such as fractal-based description, texture spectrum, and the Markov random field model [32–34]. The gray-level cooccurrence matrix (GLCM) texture method has been investigated heavily since its introduction by Haralick et al. in 1973 and has demonstrated considerable promise in MRI texture analysis. Important texture information exists in the tissue surrounding a mass margin. In this study, 13 textural measures were calculated for the nearest pixels (distance: 1 pixel) in 4 limited directions: 0°, 45°, 90°, and 135°, respectively. Thirteen features including the angular second moment, contrast, correlation, inverse difference moment, sum average, sum variance, sum entropy, entropy, difference average, difference variance, difference entropy, information measure of correlation 1, and information measure of correlation 2 were calculated from the GLCM. Owing to the isotropic texture of the images investigated, the features evaluated in the current study were the averages over the four directions. These texture features contained some important information about homogeneity, contrast, and other organized image structures.
In addition to the texture feature, eight morphological features were selected and calculated to describe the morphological properties as defined in the breast imaging reporting and data system lexicon. These features included compactness, spiculation, extent, elongation, solidity, circularity, entropy of radial length distribution, and eccentricity. A detailed description of these features can be found in the preliminary study [3].
2.4.2. Feature Selection
Feature selection [17, 35–39] is used to identify and remove as much irrelevant and redundant information as possible. It can improve the accuracy of the resulting model and decrease the calculation time of the induction algorithm. In this study, texture and morphological feature subset selection was used to find the set of features that best distinguished malignant from benign masses.
Feature subset selection research has traditionally looked at relevant features. Feature selection algorithms usually fall into two categories [35]: the filter and wrapper methods. Although the wrapper has the advantage of better performance, its usage in the biomedical arena is limited due to its high computational cost [35]. A filter algorithm was used in this study to alleviate this problem. Relief [39] is a well-known filter algorithm that estimates the quality of attributes according to how well their values distinguish between close instances. However, Relief is ineffective at removing redundant features, as two predictive but highly correlated features are probably both highly weighted. ReliefF [23] extends relief, enabling the method to work with noisy and incomplete datasets and to deal with multiclass problems. ReliefF is a simple yet efficient procedure used to estimate the quality of attributes in problems with strong dependencies between attributes. In practice, ReliefF is usually applied independently of the chosen predictor in data preprocessing as a feature subset selection method. The key idea of ReliefF is to estimate the quality of attributes according to how well their values distinguish between close instances. Given a randomly selected instance R i from class L, ReliefF searches for k of its nearest neighbors from the same class, known as nearest hits H, and also k of its nearest neighbors from each of the different classes, known as nearest misses M. It updates the quality estimation W(F) for all attributes F depending on their values for R i, hits H j, and misses M j(C). The updated average of the contribution of all of the hits and misses can be calculated via the following equation:
(11) |
where Function diff(F; I1; I2) calculates the difference between the attribute F values for two instances I1 and I2. The contribution for each class of misses is weighted with the prior probability of that class P (C) (estimated from the training set). 1 − P (class(R i)) represents the sum of the probabilities for the miss classes.
In this study, ReliefF was applied to find a candidate feature subset from the available morphological and texture features. The parameters of the weight distribution histogram were obtained as shown in Figure 4. The weight coefficient range was [−1, 1], with values closer to 1 indicating a stronger classification ability. The model with the features selected from the ReliefF feature selection methods was tested on the SVM classifier. Eight higher weight features were used to classify benign and malignant breast masses according to the weight distribution of the characteristic parameters.
2.5. Classification
Once the features were extracted and selected from the segmented masses, the data with n selected features could be fed into an appropriate classification model. The literature has discussed many different approaches to diagnosing breast cancer, such as SVM, LDA, NB, KNN, and ANN. SVMs have been used to diagnose breast cancer and achieved the highest classification accuracy among the available artificial intelligent methods. Therefore, in this study, an SVM classifier was used to evaluate the diagnostic performance of carefully selected variables. The Fisher classifier was used for comparison, as the preliminary study [3] found that it was more generalizable for unknown cases than other more complex classifiers given a limited training sample size.
The SVM was developed by Vapnik [40] based on Vapnik-Chervonenkis (VC) theory and the structural risk minimization (SRM) principle and has been used for many machine learning tasks such as pattern recognition, object classification, and regression analysis. It seeks a tradeoff between minimizing the training set error and maximizing the margin to achieve a high level of generalization and remain resistant to overfitting. In addition, SVMs have a strict theory and mathematical foundation that presents no local optimization or dimensional problems. Chang and Lin [41] developed LIBSVM, which was implemented for the purposes of this study. There were two steps involved in the LIBSVM implementation: (1) the dataset was trained to obtain a model and (2) the model was used to predict the information for the testing dataset. The final output was the classification accuracy for breast cancer prognosis, which classified the patients as having a malignant or benign diagnosis with the optimum feature of the subset.
3. Results and Discussion
3.1. Segmentation Performance
Automated breast mass segmentation is an important step for the CAD system. The accurate delineation of masses in a BMRI is crucial for diagnosis and the associated image-guided biopsy. Table 1 summarizes the mean values and standard deviations of the areas from the mass contours, which were segmented by the FCM-based method, FCM-GVF method, LS method, and radiologists' manual delineation, respectively. The differences between the computerized method and radiologists' manual delineation were analyzed using Pearson's correlation coefficient (Pearson's r) and paired Student's t-test (Table 1). According to the original hypothesis, there is no significant difference between the two groups of mass areas segmented by different methods. Pearson's correlation coefficient was used to measure the correlation between the computer segmentation and reference standard. The paired Student's t-test was used to evaluate the significance of the differences between the segmentation.
Table 1.
Segmentation method | Area (mean ± SD pixels) |
Pearson's correlation |
t-test P value |
AOR1 (mean ± SD) |
AOR2 (mean ± SD) |
---|---|---|---|---|---|
FCM | 1,439.5 ± 1,300.7 | 0.9807 | 0.7173 | 0.84 ± 0.14 | 0.75 ± 0.15 |
GVF-snake-FCM | 1,474.7 ± 1,333.9 | 0.9828 | 0.8098 | 0.87 ± 0.09 | 0.78 ± 0.14 |
CV-level set | 1,526.4 ± 1,334.8 | 0.9868 | 0.9449 | 0.89 ± 0.10 | 0.79 ± 0.14 |
Radiologists' manual | 1,547.1 ± 1,380.5 | — | — | — | — |
Pearson's r between the mass areas segmented by the FCM-based method and the radiologists' manual delineation was 0.9807, and the paired t-test between the areas extracted by the two methods achieved a P value of 0.7173. This indicates that the areas worked out by the two methods were highly correlated without a significant difference in averages. After the r and P values were refined using the GVF and LS methods, they increased and continued to show a high correlation between the areas without a significant difference in average (P > 0.05). This indicates that the three computerized methods could help radiologists achieve accurate delineation. The LS method showed the best performance among the three methods.
Figure 5 shows the log-log scatter plot of the areas measured using the computerized method versus radiologists' manual segmentation. The mass area is calculated by the number of pixels in the mass region. The log-log scatter plot was drawn because the mass area had a wide range. Judging by the distribution of the data points in Figure 5, the computerized methods somewhat underestimated the mass area compared with the radiologists' reference area, as most of the data points are distributed below the reference diagonal line. The FCM-GVF method had a smaller underestimation than the FCM-based method. One drawback of the FCM-based method is that it depends simply on intensity information and does not include the spatial relationships of pixels. For a more complicated mass enhancement, it is difficult for the FCM-based method to locate the contour that approaches near to the realistic mass contour. The FCM-GVF method improves the initial segmentation when deforming to a balance of internal and external forces. However, one drawback of the FCM-GVF method is that it depends on image edge information. The CV LS method can detect objects whose boundaries are not necessarily defined by gradients. In this study, it showed a densely distributed scatter along the diagonal. As such, its segmentation results (i.e., better segmentation) approximated those of the radiologists' hand-painted results.
Figure 6 exhibits the histograms of the overlap measures for the FCM-based, FCM-GVF, and LS methods. All of the masses segmented using the three methods have values of AOR1 and AOR2, with the most concentrated distribution over 0.6. Figure 6 also shows that the bars of the LS method are denser from 0.7 to 1. These results indicate that the LS automated segmentation method performed better for the masses.
3.2. Feature Select and Classification Performance
The preliminary study [3] proved that morphological and texture features can be used to classify breast masses and that the features of the computerized segmentation method can provide a more efficient and objective diagnostic performance when discriminating between benign and malignant masses. This study sought to verify whether the features of LS computerized segmentation and the ReliefF feature selection method can work together to improve diagnostic performance when discriminating between benign and malignant masses. Two classification methods including the Fisher and SVM methods were experimented with and their results were subsequently compared. The Fisher classifier was chosen because the preliminary study [3] found it to be more generalizable to unknown cases than other more complex classifiers when the training sample size is limited. SVMs have been used to diagnose breast cancer and have achieved the highest classification accuracy among the available artificial intelligent methods according to the literature. In the current study, a two-loop leave-one-case-out resampling procedure was designed to train the Fisher and SVM classifiers and test performance using N available cases, where N = 120.
To verify the classification accuracy of the newly developed CAD system, a subset of eight features whose weights ranked at the top in the ReliefF algorithm was selected as an independent test set. In addition, as these eight features could form two hundred and fifty-five different cases, all of the cases were tested and the optimal classification result was chosen as the independent experimental result. The corresponding selected features are shown in Tables 2 and 3.
Table 2.
Segmentation method | Classification model | Accuracy (%) | Sensitivity (%) | Specificity (%) |
---|---|---|---|---|
FCM | Fisher | 79.5 | 87.7 | 57.1 |
SVM | 74.4 | 82.5 | 52.4 | |
| ||||
FCM-GVF-snake | Fisher | 80.8 | 86.0 | 66.7 |
SVM | 82.1 | 86.0 | 71.4 | |
| ||||
CV-level set | Fisher | 91.0 | 96.5 | 76.2 |
SVM | 92.3 | 98.2 | 76.2 |
Table 3.
Segmentation method | Classification model | Accuracy (%) | Sensitivity (%) | Specificity (%) |
---|---|---|---|---|
FCM | Fisher | 83.3 | 91.0 | 69.0 |
SVM | 80.8 | 93.6 | 57.1 | |
| ||||
FCM-GVF-snake | Fisher | 78.3 | 82.1 | 71.4 |
SVM | 82.5 | 93.6 | 61.9 | |
| ||||
CV-level set | Fisher | 90.8 | 94.9 | 83.3 |
SVM | 90.0 | 98.7 | 73.8 |
As shown in Tables 2 and 3, when all eight of the features, including entropy, entropy of sum, entropy of radius distribution, area, boundary of fractal dimension, entropy of difference, compactness, and speculation, were selected as a subset for the experiment, their accuracy, sensitivity, and specificity varied among the different segmentation and classification methods. The two tables make it clear that the combination of the CV LS segmentation method and SVM classifier achieved the best performance out of all of the methods. The leave-half-case-out test in Table 2 exhibits an accuracy level of 92.3%, a sensitivity level of 98.2%, and a specificity level of 76.2%. The leave-one-case-out test in Table 3 exhibits an accuracy level of 90.0%, a sensitivity level of 98.7%, and a specificity level of 73.8%.
3.3. Discussion
The results of our experiment demonstrate that our new DCE-MRI CAD system using CV LS/ReliefF/SVM hybrid model exhibited the best diagnostic performance. Nowadays, our CAD system has shown that the eight morphological and texture features in breast MRI as adjunct diagnostic criteria can improve the specificity without significantly reducing the sensitivity. The most widespread CAD applications in the breast mostly will take into account dynamic features. For example, Baltzer et al. [42] have shown that fast visual assessment of dynamic data using CAD calculated parametric images is feasible without a decrease in diagnostic accuracy. They also proved that the combination of multiple dynamic and morphological MRI criteria seems to have the potential for a differential diagnosis of inflammatory breast carcinomas and acute mastitis [43]. Our future work will evaluate whether a combination with dynamic features evaluation could further improve our CAD system diagnostic accuracy.
4. Conclusion
This study developed a fully automated BMRI prognostic system that implemented breast segmentation, tumor segmentation, feature extraction, feature selection, and classification between benign and malign tumors. Compared with the FCM and GVF snake segmentation methods, the segmentation performance indicated that the CV LS computerized segmentation method is a more accurate method for automatically determining a suspicious mass region and can help radiologists in their detection and delineation of BMRIs. The ReliefF algorithm was useful in selecting an optimal subset of breast tumor features. The subset could be used to decrease feature dimensions and weight minimum distance classifiers. In terms of computerized characterization, the Fisher and SVM methods were used separately to select morphological and texture features and make classifications with the adoption of a leave-one-case-out cross-validation method and a leave-half-case-out validation method. In conclusion, the ReliefF/SVM/CV LS hybrid model exhibited the best performance (accuracy = 90.0%, sensitivity = 98.7%, and specificity = 73.8% for the leave-one-case-out validation; accuracy = 92.3%, sensitivity = 98.2%, and specificity = 76.2% for the leave-half-case-out validation). The new DCE-MRI CAD system may assist radiologists in delineating and characterizing BMRI masses, for example, by quantifying morphological and texture features and characterizing DCE-MRI masses as malignant or benign. It also has the potential to assist radiologists in decreasing the biopsy rate without increasing false negatives.
Additional tests and experiments must be conducted to further verify the results obtained in this study. Future work could increase the sample size of the dataset by providing more medical samples to reflect the real population. A much greater effort will be required to design effective computer-vision methods that can fully exploit the image information in DCE-MRIs to improve segmentation and feature selection.
Acknowledgments
This work was supported by the National Natural Science Foundation of China (Grant no. 11274394), the Natural Science Foundation of Guangdong Province (Grant no. S2012010010542), the Fundamental Research Funds for the Central Universities (Grant no. 11lgjc12), and the Specialized Research Fund for the Doctoral Program of Higher Education (Grant no. 20110171110023).
Conflict of Interests
The authors declare that there is no conflict of interests regarding the publication of this paper.
Authors' Contribution
The first and second author contributed equally to this study.
References
- 1.Lehman C. D., Gatsonis C., Kuhl C. K., et al. MRI evaluation of the contralateral breast in women with recently diagnosed breast cancer. New England Journal of Medicine. 2007;356(13):1295–1303. doi: 10.1056/NEJMoa065447. [DOI] [PubMed] [Google Scholar]
- 2.Shute N. Beyond mammograms: research to improve breast cancer screening focuses on sound, light, breath and tissue elasticity. Scientific American. 2011;304:32–34. doi: 10.1038/scientificamerican0511-32. [DOI] [PubMed] [Google Scholar]
- 3.Pang Y., Li L., Hu W., Peng Y., Liu L., Shao Y. Computerized segmentation and characterization of breast lesions in dynamic contrast-enhanced MR images using fuzzy c-means clustering and snake algorithm. Computational and Mathematical Methods in Medicine. 2012;2012:10. doi: 10.1155/2012/634907.634907 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Cai H., Peng Y., Ou C., Chen M., Li L. Diagnosis of breast masses from dynamic contrast-enhanced and diffusion-weighted MR: a machine learning approach. PLOS ONE. 2014;9(1) doi: 10.1371/journal.pone.0087387.e87387 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Klifa C., Carballido-Gamio J., Wilmes L., et al. A vector machine formulation with application to the computer-aided diagnosis of breast cancer from DCE-MRI screening examinations. Journal of Digital Imaging. 2014;27(1):145–151. doi: 10.1007/s10278-013-9621-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Shi J., Sahiner B., Chan H.-P., et al. Treatment response assessment of breast masses on dynamic contrast-enhanced magnetic resonance scans using fuzzy c-means clustering and level set segmentation. Medical Physics. 2009;36(11):5052–5063. doi: 10.1118/1.3238101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Chen W., Giger M. L., Lan L., Bick U. Computerized interpretation of breast MRI: investigation of enhancement-variance dynamics. Medical Physics. 2004;31(5):1076–1082. doi: 10.1118/1.1695652. [DOI] [PubMed] [Google Scholar]
- 8.Chen W., Giger M. L., Bick U. A fuzzy c-means (FCM)-based approach for computerized segmentation of breast lesions in dynamic contrast-enhanced MR images. Academic Radiology. 2006;13(1):63–72. doi: 10.1016/j.acra.2005.08.035. [DOI] [PubMed] [Google Scholar]
- 9.Azmi R., Norozi N., Anbiaee R., Salehi L., Amirzadi A. IMPST: a new interactive self-training approach to segmentation suspicious masses in breast MRI. Journal of Medical Signals and Sensors. 2011;1(2):138–148. [PMC free article] [PubMed] [Google Scholar]
- 10.Liney G. P., Sreenivas M., Gibbs P., Garcia-Alvarez R., Turnbull L. W. Breast lesion analysis of shape technique: semiautomated vs. manual morphological description. Journal of Magnetic Resonance Imaging. 2006;23(4):493–498. doi: 10.1002/jmri.20541. [DOI] [PubMed] [Google Scholar]
- 11.Wedegärtner U., Bick U., Wörtler K., Rummeny E., Bongartz G. Differentiation between benign and malignant findings on MR-mammography: usefulness of morphological criteria. European Radiology. 2001;11(9):1645–1650. doi: 10.1007/s003300100885. [DOI] [PubMed] [Google Scholar]
- 12.Nie K., Chen J.-H., Yu H. J., Chu Y., Nalcioglu O., Su M.-Y. Quantitative analysis of mass morphology and texture features for diagnostic prediction in breast MRI. Academic Radiology. 2008;15(12):1513–1525. doi: 10.1016/j.acra.2008.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Malich A., Fischer D. R., Wurdinger S., et al. Potential MRI interpretation model: differentiation of benign from malignant breast masses. The American Journal of Roentgenology. 2005;185(4):964–970. doi: 10.2214/AJR.04.1073. [DOI] [PubMed] [Google Scholar]
- 14.Haralick R. M., Shanmugam K., Dinstein I. Textural features for image classification. IEEE Transactions on Systems, Man and Cybernetics. 1973;3(6):610–621. doi: 10.1109/TSMC.1973.4309314. [DOI] [Google Scholar]
- 15.Chan H.-P., Wei D., Helvie M. A., et al. Computer-aided classification of mammographic masses and normal tissue: linear discriminant analysis in texture feature space. Physics in Medicine and Biology. 1995;40(5):857–876. doi: 10.1088/0031-9155/40/5/010. [DOI] [PubMed] [Google Scholar]
- 16.Gibbs P., Turnbull L. W. Textural analysis of contrast-enhanced MR images of the breast. Magnetic Resonance in Medicine. 2003;50(1):92–98. doi: 10.1002/mrm.10496. [DOI] [PubMed] [Google Scholar]
- 17.Akay M. F. Support vector machines combined with feature selection for breast cancer diagnosis. Expert Systems with Applications. 2009;36(2):3240–3247. doi: 10.1016/j.eswa.2008.01.009. [DOI] [Google Scholar]
- 18.Aci M., Inan C., Avci M. A hybrid classification method of k nearest neighbor, Bayesian methods and genetic algorithm. Expert Systems with Applications. 2010;37(7):5061–5067. doi: 10.1016/j.eswa.2009.12.004. [DOI] [Google Scholar]
- 19.Langer D. L., van der Kwast T. H., Evans A. J., Trachtenberg J., Wilson B. C., Haider M. A. Prostate cancer detection with multi-parametric MRI: logistic regression analysis of quantitative T2, diffusion-weighted imaging, and dynamic contrast-enhanced MRI. Journal of Magnetic Resonance Imaging. 2009;30(2):327–334. doi: 10.1002/jmri.21824. [DOI] [PubMed] [Google Scholar]
- 20.Chan H.-P., Wu Y.-T., Sahiner B., et al. Characterization of masses in digital breast tomosynthesis: comparison of machine learning in projection views and reconstructed slices. Medical Physics. 2010;37(7):3576–3586. doi: 10.1118/1.3432570. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Nattkemper T. W., Arnrich B., Lichte O., et al. Evaluation of radiological features for breast tumour classification in clinical screening with machine learning methods. Artificial Intelligence in Medicine. 2005;34(2):129–139. doi: 10.1016/j.artmed.2004.09.001. [DOI] [PubMed] [Google Scholar]
- 22.Levman J., Leung T., Causer P., Plewes D., Martel A. L. Classification of dynamic contrast-enhanced magnetic resonance breast lesions by support vector machines. IEEE Transactions on Medical Imaging. 2008;27(5):688–696. doi: 10.1109/TMI.2008.916959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kononenko I. Estimating attributes: analysis and extensions of relief. In: de Raedt and L., Bergadano F., editors. Machine Learning: ECML-94. New York, NY, USA: Springer; 1994. pp. 171–182. [Google Scholar]
- 24.Klifa C., Carballido-Gamio J., Wilmes L., et al. Magnetic resonance imaging for secondary assessment of breast density in a high-risk cohort. Magnetic Resonance Imaging. 2010;28(1):8–15. doi: 10.1016/j.mri.2009.05.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Kontos D., Xing Y., Bakic P. R., Conant E. F., Maidment A. D. A. A comparative study of volumetric breast density estimation in digital mammography and magnetic resonance imaging: results from a high-risk population. Medical Imaging 2010: Computer-Aided Diagnosis; February 2010; San Diego, Calif, USA. [DOI] [Google Scholar]
- 26.Yao J., Chen J., Chow C. Breast tumor analysis in dynamic contrast enhanced MRI using texture features and wavelet transform. IEEE Journal on Selected Topics in Signal Processing. 2009;3(1):94–100. doi: 10.1109/JSTSP.2008.2011110. [DOI] [Google Scholar]
- 27.Wu S. D., Weinstein S. P., Conant E. F., Localio A. R., Schnall M. D., Kontos D. Fully automated chest wall line segmentation in breast MRI by using context information. Proceedings of SPIE; 2012; 831507 [Google Scholar]
- 28.Wu S., Weinstein S. P., Conant E. F., Schnall M. D., Kontos D. Automated chest wall line detection for whole-breast segmentation in sagittal breast MR images. Medical Physics. 2013;40(4) doi: 10.1118/1.4793255.042301 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Malladi R., Sethian J. A., Vemuri B. C. Shape modeling with front propagation: a level set approach. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1995;17(2):158–175. doi: 10.1109/34.368173. [DOI] [Google Scholar]
- 30.Chan T. F., Vese L. A. Active contours without edges. IEEE Transactions on Image Processing. 2001;10(2):266–277. doi: 10.1109/83.902291. [DOI] [PubMed] [Google Scholar]
- 31.Castellano G., Bonilha L., Li L. M., Cendes F. Texture analysis of medical images. Clinical Radiology. 2004;59(12):1061–1069. doi: 10.1016/j.crad.2004.07.008. [DOI] [PubMed] [Google Scholar]
- 32.Jiang C., Pitt R. E., Bertram J. E. A., Aneshansley D. J. Fractal-based image texture analysis of trabecular bone architecture. Medical & Biological Engineering & Computing. 1999;37(4):413–418. doi: 10.1007/BF02513322. [DOI] [PubMed] [Google Scholar]
- 33.Wang L., He D.-C. Texture classification using texture spectrum. Pattern Recognition. 1990;23(8):905–910. doi: 10.1016/0031-3203(90)90135-8. [DOI] [Google Scholar]
- 34.Cross G. R., Jain A. K. Markov random field texture models. IEEE Transactions on Pattern Analysis and Machine Intelligence. 1983;5(1):25–39. doi: 10.1109/tpami.1983.4767341. [DOI] [PubMed] [Google Scholar]
- 35.Guyon I., Weston J., Barnhill S., Vapnik V. Gene selection for cancer classification using support vector machines. Machine Learning. 2002;46(1–3):389–422. doi: 10.1023/A:1012487302797. [DOI] [Google Scholar]
- 36.Saeys Y., Inza I., Larrañaga P. A review of feature selection techniques in bioinformatics. Bioinformatics. 2007;23(19):2507–2517. doi: 10.1093/bioinformatics/btm344. [DOI] [PubMed] [Google Scholar]
- 37.Chen H.-L., Yang B., Liu J., Liu D.-Y. A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert Systems with Applications. 2011;38(7):9014–9022. doi: 10.1016/j.eswa.2011.01.120. [DOI] [Google Scholar]
- 38.Stokes M. E., Visweswaran S. Application of a spatially-weighted Relief algorithm for ranking genetic predictors of disease. BioData Mining. 2012;5, article 20 doi: 10.1186/1756-0381-5-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kira K., Rendell L. A. A practical approach to feature selection. In: Sleeman D., Edwards P., editors. Proceedings of the 9th International Conference on Machine Learning (ICML '92); 1992; Morgan Kaufmann; pp. 249–256. [Google Scholar]
- 40.Vapnik V. N. An overview of statistical learning theory. IEEE Transactions on Neural Networks. 1999;10(5):988–999. doi: 10.1109/72.788640. [DOI] [PubMed] [Google Scholar]
- 41.Chang C. C., Lin C. J. LIBSVM: a library for support vector machines. 2001, http://www.csie.ntu.edu.tw/~cjlin/libsvm/
- 42.Baltzer P. A. T., Renz D. M., Beger S., et al. Comparison of the diagnostic value of visual assessment of parametric images calculated by a computer assisted diagnosis system with region of interest (ROI) measurement in MR-mammography. Proceedings of the European Congress of Radiology, Scientific Exhibit/Poster Presentation; 2008; Vienna, Austria. [Google Scholar]
- 43.Renz D. M., Baltzer P. A. T., Böttcher J., et al. Magnetic resonance imaging of inflammatory breast carcinoma and acute mastitis. A comparative study. European Radiology. 2008;18(11):2370–2380. doi: 10.1007/s00330-008-1029-3. [DOI] [PubMed] [Google Scholar]