Skip to main content
Breast Cancer : Targets and Therapy logoLink to Breast Cancer : Targets and Therapy
. 2018 Nov 30;10:219–230. doi: 10.2147/BCTT.S175311

Artificial intelligence methods for the diagnosis of breast cancer by image processing: a review

Farahnaz Sadoughi 1, Zahra Kazemy 1, Farahnaz Hamedan 1, Leila Owji 1, Meysam Rahmanikatigari 2, Tahere Talebi Azadboni 1,
PMCID: PMC6278839  PMID: 30555254

Abstract

Breast cancer is the most common cancer among women around the world. Despite enormous medical progress, breast cancer has still remained the second leading cause of death worldwide; thus, its early diagnosis has a significant impact on reducing mortality. However, it is often difficult to diagnose breast abnormalities. Different tools such as mammography, ultrasound, and thermography have been developed to screen breast cancer. In this way, the computer helps radiologists identify chest abnormalities more efficiently using image processing and artificial intelligence (AI) tools. This article examined various methods of AI using image processing to diagnose breast cancer. It was a review study through library and Internet searches. By searching the databases such as Medical Literature Analysis and Retrieval System Online (MEDLINE) via PubMed, Springer, IEEE, ScienceDirect, and Gray Literature (including Google Scholar, articles published in conferences, government technical reports, and other materials not controlled by scientific publishers) and searching for breast cancer keywords, AI and medical image processing techniques were extracted. The results were provided in tables to demonstrate different techniques and their results over recent years. In this study, 18,651 articles were extracted from 2007 to 2017. Among them, those that used similar techniques and reported similar results were excluded and 40 articles were finally examined. Since each of the articles used image processing, a list of features related to the image used in each article was also provided. The results showed that support vector machines had the highest accuracy percentage for different types of images (ultrasound =95.85%, mammography =93.069%, thermography =100%). Computerized diagnosis of breast cancer has greatly contributed to the development of medicine, is constantly being used by radiologists, and is clear in ethical and medical fields with regard to its effects. Computer-assisted methods increase diagnosis accuracy by reducing false positives.

Keywords: breast cancer, breast cancer screening techniques, artificial intelligence techniques, medical image processing

Introduction

Breast cancer is the most common cancer and the second leading cause of death among women around the world.1 Breast cancer occurs when the cell tissues of the breast become abnormal and uncontrollably divided. These abnormal cells form a large lump of tissues, which consequently becomes a tumor.2 It was reported that 1.7 million cases of breast cancer were identified in the world in 2012. Breast cancer is the second cause of cancer death with the standardized mortality rate of 12.9 per 100,000, and its incidence has increased over the years.35

Breast cancer could be successfully treated if detected early. Thus, it is of importance to have appropriate methods for screening the earliest signs of breast cancer.6 There are various imaging methods available for the screening and diagnosis of breast cancer, the most important of which are mammography, ultrasound, and thermography.7 Mammography is one of the most important early diagnosis methods for breast cancer. Since mammography is not very successful for dense breasts, ultrasound or diagnostic sonography techniques are recommended.8 Taking into account that small masses may pass radiations of radiography, thermography can be more powerful in diagnosing smaller cancerous masses than the ultrasound technique.9

There is no doubt that evaluation of patient data and expert judgment is the most important factor in image-based diagnosis; however, there are many factors affecting this type of diagnosis. Among the factors affecting image-based diagnosis are the presence of noise in images, the radiologist’s visual perception ability, inadequate clarity, poor contrast, and the less experienced radiologist.10

Due to inherent problems associated with an image, including poor contrast, noise, and lack of recognition with the eye, tools have been developed to create and develop image processing. Currently, medical image processing is one of the fastest growing areas in the health care sector. The purpose of the image processing is to use techniques for making proper images of the human body, which are reliable for use in the diagnosis and treatment processes.6

In the early 1980s, there was an increment in the use of neural networks in the field of image and signal processing. Since diagnosis of breast cancer is very difficult, statistical methods and artificial intelligence (AI) techniques can be very important in this regard.11 The AI is said to be an artificial intelligent machine in various situations. In other words, these are systems that can respond to similar conditions such as an intelligent human, including understanding complex situations, simulating thinking processes and human reasoning methods, and demonstrating accurate responding, learning and ability to acquire knowledge, and reasoning for solving problems.12 For example, Dheeba and Selvi26 used a particle swarm-optimized wavelet neural network (PSOWNN) to identify breast cancer on the mammogram. This method was applied with real data and had a sensitivity and accuracy of 94% and 92%, respectively. Moreover, the result showed that the area under the receiver operating characteristic (ROC) curve of the proposed algorithm was 0.96, indicating the excellent performance of the system. In addition, new tools including image processing tools have been developed to facilitate the diagnosis of breast cancer masses. Image processing methods identify abnormal features in medical images more easily. By using image processing, pattern recognition, and AI, researchers have been able to provide techniques that accurately detect masses.13

This article examined various AI techniques that use medical images to detect breast cancer. The findings are presented in the form of tables with the aim of demonstrating different methods and results of using each method in recent years.

Materials and methods

The search strategy was performed by searching the databases such as Medical Literature Analysis and Retrieval System Online (MEDLINE) via PubMed, Springer, IEEE, ScienceDi-rect, and Gray Literature (including Google Scholar, articles published in conferences, government technical reports, and other materials not controlled by scientific publishers) for relevant publications from 2007 to 2017.

Keyword searches based on Mesh included “breast cancer”, “breast cancer screening techniques”, “artificial intelligence techniques”, and “medical image processing”. The symbol “*” was also used to allow retrieving all variations with suffixes of the source words. The above terms were combined using the logical connectives “AND”, “OR”, and “NOT”.

Among the articles searched, non-English articles as well as articles that used similar techniques and reported similar results were excluded. Moreover, articles whose full texts were not available were also excluded.

Two experts independently reviewed all potentially relevant studies. Disagreements were solved with discussion and by using the viewpoint of a third expert.

Finally, 18,651 articles were extracted; however, majority of the extracted articles were deleted due to being repetitive and not having access to their full texts, and, 40 articles remained for review (Figure 1 shows the process.). The selected articles were examined in terms of the name of the method used, type of the image, advantages and limitations of each method, features used, and application results of each method, and all the results were presented in the form of tables.

Figure 1.

Figure 1

Stages of systematic review.

Results

If we discard skin cancer, breast cancer is the largest cause of cancer among women, accounting for one-third of all the cancer types. Obtaining the best outcomes in breast cancer depends on early diagnosis. Therefore, imaging techniques have been developed to increase the likelihood of early diagnosis of breast cancer and reduce unnecessary biopsy.14 Table 1 shows a summary of advantages and disadvantages of each method. Currently, digital image processing techniques are often used in solving machine visual problems and have provided good results.15

Table 1.

Advantages and disadvantages of various imaging techniques in breast cancer

Imaging method Application Advantage Disadvantage
Mammography Golden standard imaging and diagnosis of breast cancer early stages • It uses low levels of X-rays for imaging
• This method is good for detecting DCIS and calcifications
• Mammography is the gold standard method to detect early-stage breast cancer before the lesions become clinically palpable
• Radiation risk and other risks
• Risk of false alarm
• It is difficult for the radiologist to interpret the results from mammograms as mammograms generally have low contrast
• Double reading of mammogram leads to increase in the cost of detection
• Mammography alone misses many cancers in dense-breasted women
Ultrasound Suitable for dense and soft tissues • Widely available and accessible
• Noninvasive
• Quick
• Highly sensitive
• Suitable for women with dense breasts
• Quality and interpretation of the image depends highly on the skill of the person doing the scan
Thermography Suitable for muscle tissue • Noninvasive • Physicians can have difficulty interpreting the images because of the low quality and low resolution of the images taken by the first generation of the medical infrared imaging cameras

Note: Data adapted from Kerlikowske et al,23 Heywang-Köbrunner et al,33 and Qi and Diakides.34

Abbreviation: DCIS, ductal carcinoma in situ.

The importance and necessity of processing digital images are examined in the following two directions: 1) to improve images for human interpretation and 2) to process images for automatic understanding and interpretation by the machine.15

In the field of diagnosis, first, medical images are collected and, then, preprocessing, segmentation, extraction of features, and eventually categorization are performed.9,16 Figure 2 shows the steps involved.

Figure 2.

Figure 2

Stages of cancer detection by image processing.

Note: Data from Pradeep et al16 and Lin et al.19

Image acquisition

The first step in processing images is to capture an image. In this step, data are collected in the form of digital images. The image format is usually a portable gray map, which is fixed format, and does not erase the image data when compressing images.9,16

Image preprocessing

The next step is to preprocess input images to improve their quality by eliminating noise. Preprocessing of images is done through the middle filter. Preprocessing, in addition to deleting or reducing noise, improves image quality through increased contrast.17 Some of the most important preprocessing techniques are presented in Table 2.

Table 2.

Preprocessing techniques

Preprocessing image techniques
FPN FPN is the result of differences in responsivity of the detectors to incoming irradiance. It is a common problem when working with FPA. FPN for a particular configuration can be recovered from a blackbody image for later subtraction from the thermogram sequence.
Badpixels A badpixel can be defined as an anomalous pixel behaving differently from the rest of the array. For instance, a dead pixel remains unlit (black), while a hot pixel is permanently lit (white). In any case, badpixels do not provide any useful information and only contribute to deteriorate the image contrast. A map of badpixels is generally known from the FPA manufacturer or they can be detected manually or automatically; the value at badpixel locations is then replaced by the average value of neighboring pixels.
Vignetting Vignetting is another source of noise on thermograms that cause a darkening of the image corners with respect to the image center due to limited exposure. It depends on both pixel location and temperature difference with respect to the ambient. A correction procedure has been proposed.3
Temperature calibration A transformation function is used to convert the grayscale values g provided by the infrared camera into a linearly incrementing physical quantity, eg, temperature. The procedure3 is used to position the IR camera in front of a reference temperature source (such as a blackbody and a thick copper plate) brought to various known temperatures. As the reference temperature source is varied, the IR images are recorded. Average of the central pixels in the field of view allows getting the calibration curve through a polynomial fit
Noise smoothing One of the most useful preprocessing (and postprocessing) techniques is noise smoothing. For instance, neighbor processing can be performed bypassing a mask or kernel through the image. More elaborate noise removal techniques are available.

Note: Data adapted from Ibarra-Castanedo et al.17

Abbreviations: FPA, focal plane array; FPN, fixed pattern noise; IR, infrared.

Image segmentation

Segmentation refers to cutting an image and dividing it into its constructive parts. In image analysis and processing, the most important step is to fragment an image. Segmentation is actually dividing an image into sub-sections that can be similar or different depending on its features. The output depends largely on the accuracy of the measurements of the features.18 Some of the most important segmentation techniques are presented in Table 3.

Table 3.

Image segmentation methods

Name of segmentation method Description of method Benefits Limitations
Edge detection method Depends on discontinuity detection; generally aims to situate points with less or more rapid gray-level changes • A human-like approach
• Appropriate methods to get the best picture disproportionate area
• Not good with images where edges are unclearly defined
• Not suitable for images whose edges are not completely clear
• Not suitable for images with a large number of edges
• It has the lowest noise compared to other methods
Thresholding method Requires images with diffident sharp edges, each of which fits a single area • No need for prior knowledge about the image
• Minimum complexity of computation
• Not good for image with no clear peaks
• Not suitable for images that do not have any clear edge
• Do not consider distances, so there is no guarantee for the division of continuous areas
Region-dependent method • The same pixel in nearby areas are similar
• Counting the growing area, separating, assembling or reversing them
• Does well if region homogeneity norm is painless to define
• Extra noise resistant as compared to edge detection method
• The amount of memory and calculation time is very costly
• Region growing relies on seed region selection and sequence by which regions, pixels are inspected
• Output segments by region splitting emerge too square because of splitting format
Fuzzy method Uses operators, mathematics, properties and rules of fuzzy inference • Fuzzy membership function can be used to represent the degree of low properties or linguistic phrases • Determining the fuzzy membership function is not easy
• The calculations used in the fuzzy method can be difficult
Neural network method Uses for clustering or categorization • Does not require writing tedious programs
• Could entirely exploit the parallel nature of neural net
• Need a lot of time to train
• Initialization might affect the outcome

Note: Data adapted from Pradeep et al.16

Feature extraction

The conversion of input data into a set of extracted features is called feature extraction. There are many ways to extract features. Some of the most commonly used methods are as follows: 1) spatial features, 2) transform feature, 3) edge and boundary features, 4) color features, 5) shape features, and 6) texture features.

Feature extraction techniques play a very important role in the diagnosis of disorders. Tissue features are very useful in differentiating the mass from the natural tissue of the breast. These categories of features are capable of separating natural and abnormal lesions of the masses or microcalci-fications.16,19 Table 4 refers to some of the most important features of the tissue.

Table 4.

The most important texture features16,19

Texture features
Mean
Skewness
Entropy
SD
Line likeness
Homogeneity
Smoothness
Correlation
Sum average
Sum entropy
Variance
Kurtosis
Energy
LBP
Regularity
Contrast
Coarseness
Kurtosis
Sum variance
Difference variance

Note: Data adapted from Pradeep et al16 and Lin et al.19

Abbreviation: LBP, local binary pattern.

Introduction of various AI techniques in image processing

Support vector machine (SVM)

One of the most widely used techniques for diagnosing breast cancer is the SVM. This technique is a brilliant among the learning algorithms inspired by the statistical learning theory and has been incorporated in the machine learning set in recent decades. In this way, the overfitting problem in the training data is reduced and it is possible to identify a large training set with small subsets of training points. Moreover, this technique can operate on optional features with no need to create independent hypotheses. In the fuzzy SVM, each sample of {xi, yi} is weighted in the training data set using the fuzzy membership function. This set becomes {xi, yi, si}, where si is a fuzzy membership function, which indicates that the membership of this sample is assured by one or more classes. In other words, each training example {xi, yi} is associated with a fuzzy membership function (0≤ si ≤1).2022

Cascade forward back-propagation network

The cascade forward back-propagation model uses the postpropagation algorithm to update weights such as back-propagation neural network; however, the main characteristic of this network is that each layer of neurons is linked to all previous neuron layers.24

Feed forward back-propagation network

This model includes input, output, and hidden layers. The back-propagation learning algorithm is used for learning in these networks. During the training, computation is performed from the input layer to the output layer and error values are released to the previous layer.24

k-nearest neighbor (k-NN)

The k-NN selects a group of K records from a training record set that has the closest records to the test record and decides about the class of the test record based on their rank or label superiority. Simply put, this method selects the rank with the highest number of records in the selected neighborhood.25

Genetic algorithm as optimizer

This algorithm can quickly scan a set of broad solutions and eliminate bad suggestions without negatively affecting the final result. The genetic algorithm works on its own rules and therefore can be used for problems that are defined as irregular.22,26

Naive Bayes classifier

Bayes classifier is estimated by the covariance matrix and is a simple probability classifier based on the theory of Bayes with powerful independent assumptions. The advantage is that this model requires a small amount of training data to estimate the required classification parameters.27

Deep learning technology

A deep learning network contains more image processing layers than the conventional image feature-based machine learning classifiers. Each layer is a typical neural network, such as a convolution neural network. Instead of using a set of manually or automatically selected image features computed from images, the deep learning network utilizes the image itself as a single input.28 Effective image features are automatically learned and extracted with lower layer networks. Accordingly, higher layer networks use the extracted feature patterns and classify images into different target categories.29

A number of AI techniques used to classify breast cancer into two malignant and benign forms using medical images are presented in Table 5. In this table, for each technique, the type of image, along with its strengths and weaknesses, is mentioned. Since each of the discussed articles used image processing, a list of image features was also provided. Investigations showed that the SVM classification method had a higher accuracy than the other methods. The results also indicated that the method had the highest accuracy for different types of images (98.58% ultrasound, 93.063% mammography, and 100% thermography).

Table 5.

Artificial intelligence techniques for classifying breast cancer

Methods Images Attributes Advantages Disadvantages Reported performance Reference
SVM Medical infrared thermal imaging • Energy, contrast, correlation, sum of squares: variance, inverse difference moment, sum variance, sum entropy, entropy, difference variance, difference entropy, information measures of correlation 1, information measures of correlation 2 • Use of curve let before feature extraction Accuracy: 90.91
Sensitivity: 81.82
Specificity: 100
35
SVM Ultrasound • Histogram features
• MRF
• Tamura features
• GLRLM features
• GLCM features • Morphological features (RS, aspect ratio, C, NRV, Pratio)
• Combined textural and morphological features
• Being independent of settings of US systems or different US machines in the diagnosis
Accuracy: 95.85
Sensitivity: 96
Specificity: 91.46
36
Fuzzy SVM Ultrasound • One pixel distance in the mean of the information measure of correlation (d=1)
• One pixel distance in the mean maximum correlation coefficient (d=1)
• One pixel distance in the range of different entropies (d=1)
• Two pixels distance in the variance ranges (d=2)
• Two pixels distance in the range of the information measure of correlation (d=2)
• Four pixels distance in the mean maximum correlation coefficient (d=4)
• Eight pixels distance in the mean of the information measure of correlation (d=8)
• The mean of inverse different moment with the distance of 16 pixels (d=16)
• Using of fractal dimensions
• Powerful in solving two-class classification problem
• The importance of the training sample can be measured by the fuzzy membership
• In practical classification problems, the effects of the samples in training dataset may be different Accuracy: 94.25
Sensitivity: 91.67
Specificity: 96.08
37
Cascade-forward back-propagation artificial neural network Mammography • Autocorrelation
• Correlation
• Maximum probability
• Dissimilarity energy
• Entropy
• Inverse difference normalized
• Features, extracted from the known mammogram images, are stored in the knowledge base Accuracy: 67.8 38
Feed-forward back-propagation artificial neural network Mammography • Autocorrelation
• Correlation
• Maximum probability
• Dissimilarity energy
• Entropy
• Inverse difference normalized
• Using image registration techniques Accuracy: 92.8% 39
k-NN Mammography • GLCM features (angular second moment, entropy, contrast, local homogeneity, correlation, shade, prominence, variance, sum average, sum entropy, difference entropy, sum variance, and difference variance)
• GLRLM features (SRE, LRE, GLNU, RLNU, and RPERC)
• LTEM features (mean, SD, range, skewness, and kurtosis)
• Texture analysis of tissue surrounding shows promising results in computer-aided diagnosis of breast cancer and may contribute to the reduction in unnecessary biopsies • Completion of the proposed method should include the investigation of additional classification schemes and textural features, as well as validation over a larger dataset Accuracy: 64.0–89.0
Sensitivity: 72.2–92.6
Specificity: 54.4–87.0
40
SVM Mammography • Wavelet features
• Statistical intensity-based features (mean value, SD, smoothness, entropy, skewness, kurtosis, uniformity, shape features, compactness, aspect ratio, area to background percentage, perimeter ratio)
• Intensity- and shape-driven features have the greatest role on the later classification stage
• Normal–abnormal classification stage before benign–malignant classification can be effective and reduce false positives
• Find out a specific hyperplane among many
• Do not require independent assumptions
• Obvious since increasing the sensitivity will yield in a system that identifies more regions as abnormal in order to achieve higher sensitivity
• The classification accuracy is slightly reduced after performing benign– malignant classification
Accuracy: 93.06
Sensitivity: 97.82
Specificity: 89.09
41
Genetic algorithm and back-propagation neural network Medical infrared thermal imaging • Age
• Mean
• Differences among the two breasts
• Variance
• Skewness
• Kurtosis
• Entropy
• Thermal pattern of breasts
• This method is beneficial to patients with and without symptoms • Still need to improve the methods Accuracy: 70
Sensitivity: 50
Specificity: 75
9
SVM Medical infrared thermal imaging • Kurtosis
• Skewness
• Contrast
• Inverse difference moment
• Correlation
• Generates input–output mapping functions which can be used for classification or for regression
• Use of four texture features, namely, moment 1, moment 3, run percentage, and gray-level nonuniformity for representing the thermograms
• Improved by extracting better texture features and by using a larger sample size Accuracy: 88.10
Sensitivity: 85.71
Specificity: 90.48
42
k-NN Medical infrared thermal imaging • Angular second moment (energy)
• Contrast
• Correlation
• Sum of squares: variance
• Inverse difference moment (homogeneity)
• Sum average
• Sum variance
• Sum entropy
• Entropy
• Difference variance
• Difference entropy
• Information measure of correlation 1
• Information measure of correlation 2
• Autocorrelation
• Dissimilarity
• Cluster shade
• Cluster prominence
• Maximum probability
• Inverse difference normalized
• Inverse difference moment normalized
• Detection using texture and features and minimum variance quantization is based on the data acquisition protocol parameter and the image statistics Accuracy: 92.5
Sensitivity: 100
Specificity: 78.6
43
Naive Bayes Medical infrared thermal imaging • Same above • Requires a small amount of training data to estimate the parameters necessary • SVM classification results are better than the naive Bayes classification results Accuracy: 80.0
Sensitivity: 76.9
Specificity: 85.7
44
Deep learning Mammogram • Density
• Margin
• Classifying between malignant and benign masses
• Enable to segment the diverse lesions
• Avoid the image feature errors introduced from the error or inconsistency of lesion segmentation.
• Can be directly applied to all testing cases
• Do not need to human intervention to design
• Heavily depends on the distribution and/or diversity of the ROIs (or cases) in the specific training and testing datasets ROC curve: 0.790±0.019 45

Abbreviations: C, circularity (the ratio of square of the perimeter to the tumor area); GLCM, gray-level co-occurrence matrix; GLNU, gray-level no uniformity; GLRLM, gray-level run length matrix; k-NN, k-nearest neighbor; LRE, long-run emphasis; LTEM, laws texture energy measures; MRF, Markov Random Fields; NRV, normalized residual value; RLNU, run length non-uniformity; ROC, receiver operating characteristic; ROI, region of interest; RPERC, run percentage; RS, overlap ratio; SRE, short-run emphasis; SVM, support vector machine; US, ultrasound.

The highest accuracy in the SVM method was observed in the results of a research, which used an appropriate segmentation method for obtaining the desired area in the image. The shape and intensity of the extracted features had the most effect in the classification. The combination of gray-level co-occurrence matrix (GLCM) and Pratio features along with morphological features resulted in the highest accuracy.

Discussion

Computerized diagnosis of breast cancer has a number of great advantages; thus, it is constantly being used by radiologists and its impact on the field of medicine is clear. Computer-assisted methods increase diagnosis accuracy by reducing false positives. Advantages of using developed systems for diagnosing breast cancer include 1) helping radiologists in the process of interpretation and screening as a second interpreter after the radiologist; 2) reducing the number of false positives, which will eliminate the need for unnecessary biopsy and lead to cost savings; and 3) reducing the time of the patient’s examination by reviewing and reporting the findings in a few seconds.

So far, creative methods have been developed to diagnose and classify breast cancer; however, none of the methods has been able to accurately classify all cancer cases. In recent years, application of the AI techniques along with image processing has yielded significant results. This article examined these techniques, which used image processing to diagnose breast cancer. The results of the study showed that the SVM had the highest accuracy in cancer diagnosis.

The SVM theory has some limitations. Typically, we know that each sample {xi, yi} belongs to one class in a training dataset. For example, the value of yi is set to only 1 or –1. In this way, in the learning process of the SVM, all instances in the training dataset will be grouped into a single class.

The SVM was invented by Vladimir Vapnik in 1963 in the field of statistical learning theory. This model has been able to succeed in various classifications and in predicting different problems. The model can be used in many regression estimation and pattern recognition problems and is also used to predict and build smart machines. This method is considered as a very efficient method when it comes to solving classification problems with noise data. Two of the main reasons behind the reliability of the SVM classification engine include 1) choosing an optimal subset of context properties for learning and 2) appropriately regulating page parameters using the v-fold cross-validation approach.

The main advantage of the SVM is the ability to use it in a data set that has many features while only a few of them may be involved in the training process. In addition, the SVM has several other notable advantages, including 1) having a large number of nonlinear split pages that give it a higher degree of differentiation; 2) having high generalization ability for the classification of hidden data; and 3) having the ability to determine the optimal network structure (such as its hidden layers and neurons) without tuning external parameters.

As the popularity of the SVM is on the rise, various applications have been published to facilitate a broader view of the model in practical and academic terms. However, many scholars have cited some of its limitations including the following: 1) difficulty in choosing the kernel function for a problem; 2) low performance of the machine in training and testing; 3) lower covariance in the test stage; 4) difficulty in choosing the appropriate kernel parameters; 5) requiring a lot of memory space to run the model; 6) requiring to select one of two parametric and nonparametric methods to run the model; and 7) dependence of the function of each learning machine on the problem. The reason for this is that the function is based on multifactors such as the test dataset, selected optimal subset of the model, and method by which the data samples are divided between the test sets and training sets.

Conclusion

Although the diagnosis of breast cancer can be highly accurate, it is not necessarily the same as the results obtained in other sets of images. Therefore, future research can be done to improve the system’s performance and validate it by conducting tests on a larger set of images. In addition, the following can be suggested as future work: 1) setting SVM parameters through using a genetic algorithm: traditionally, feature selection and parameter optimization are performed independently; independently performing these two problems may result in a loss of information related to the classification process; motivated by these views, the trend in recent years is to simultaneously select feature subsets and optimize parameters of SVMs30; genetic algorithms have the potential to generate both the optimal feature subset and SVM parameters simultaneously; and the most widely used genetic algorithm is proposed by Huang et al31 and 2) evaluating other texture approaches: texture extraction methods are classified into three main categories: structural, statistical, and spectral.

First, in structural approaches, “texture primitive”, as a basic element of texture, is used to construct more complex texture patterns with grammar rules that determine the generation of texture patterns. Second, in the statistical approaches, for the discrimination of various textures, items such as gray-level histogram moments and statistics based on the GLCM are calculated. Finally, in the spectral approaches, the textured image is converted into frequency domains. Then, the extraction of texture features can be done by analyzing the power spectrum. Markov random field models and texture spectrum are some other texture descriptors.32 Then, the extraction of the texture features can be done by analyzing the power spectrum by methods such as the Markov model and the texture spectrum.

Acknowledgments

We are grateful to all those who helped us in the implementation of the research project.

Footnotes

Disclosure

The authors report no conflicts of interest in this work.

References

  • 1.Ferlay J, Soerjomataram I, Dikshit R, et al. Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012. Int J Cancer. 2015;136(5):E359–E386. doi: 10.1002/ijc.29210. [DOI] [PubMed] [Google Scholar]
  • 2.Mehdy MM, Ng PY, Shair EF, Saleh NIM, Gomes C. Artificial neural networks in image processing for early detection of breast cancer. Comput Math Methods Med. 2017;2017:1–15. doi: 10.1155/2017/2610628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ghoncheh M, Pournamdar Z, Salehiniya H. Incidence and Mortality and Epidemiology of Breast Cancer in the World. Asian Pac J Cancer Prev. 2016;17(S3):43–46. doi: 10.7314/apjcp.2016.17.s3.43. [DOI] [PubMed] [Google Scholar]
  • 4.WHO Cancer. 2017. [Accessed September 20, 2017]. Available from: http://www.who.int/mediacentre/factsheets/fs297/en/
  • 5.Oliver A, Freixenet J, Martí R, et al. A novel breast tissue density classification methodology. IEEE Trans Inf Technol Biomed. 2008;12(1):55–65. doi: 10.1109/TITB.2007.903514. [DOI] [PubMed] [Google Scholar]
  • 6.Tarique M, Elzahra F, Hateem A, Mohammad M. Fourier Transform Based Early Detection of Breast Cancer by Mammogram Image Processing. J Biomed Eng Med Imaging. 2015;2(4):17. [Google Scholar]
  • 7.American Cancer Society [webpage on the Internet] How is Breast Cancer Diagnosed? 2014. [Accessed September 20, 2017]. Available from: http://www.cancer.org/cancer/breastcancer/detailedguide/breast-cancer-diagnosis.
  • 8.Hou MF, Chuang HY, Ou-Yang F, et al. Comparison of breast mammography, sonography and physical examination for screening women at high risk of breast cancer in Taiwan. Ultrasound Med Biol. 2002;28(4):415–420. doi: 10.1016/s0301-5629(02)00483-0. [DOI] [PubMed] [Google Scholar]
  • 9.Haddadnia J, Hashemian M, Hassanpour K. Diagnosis of breast cancer using a combination of genetic algorithm and artificial neural network in medical infrared thermal imaging. Iran J Med Phys. 2012;9(4):265–274. [Google Scholar]
  • 10.Giger ML. Computer-Aided Diagnosis in Radiology. Acad Radiol. 2002;9(1):1–3. doi: 10.1016/s1076-6332(03)80289-1. [DOI] [PubMed] [Google Scholar]
  • 11.Pendharkar P, Rodger J, Yaverbaum G, Herman N, Benner M. Association, statistical, mathematical and neural approaches for mining breast cancer patterns. Expert Syst Appl. 1999;17(3):223–232. [Google Scholar]
  • 12.Poole DL, Mackworth AK. Artificial Intelligence: Foundations of Computational Agents. New York: Cambridge University Press; 2010. [Google Scholar]
  • 13.Quintanilla-Dominguez J, Cortina-Januchs MG, Ojeda-magaña B, Jevtić A, Vega-Corona A, Andina D, editors. 2010 World Automation Congress. New Jersey: IEEE; 2010. Microcalcification detection applying artificial neural networks and mathematical morphology in digital mammograms. [Google Scholar]
  • 14.Jalalian A, Mashohor SB, Mahmud HR, Saripan MI, Ramli AR, Karasfi B. Computer-aided detection/diagnosis of breast cancer in mammography and ultrasound: a review. Clin Imaging. 2013;37(3):420–426. doi: 10.1016/j.clinimag.2012.09.024. [DOI] [PubMed] [Google Scholar]
  • 15.Kumar A, Shaik F. Image Processing in Diabetic Related Causes. Singapore: Springer; 2015. [Google Scholar]
  • 16.Pradeep N, Girisha H, Sreepathi B, Karibasappa K. Feature extraction of mammograms. Int J Bioinform Res. 2012;4(1):241–244. [Google Scholar]
  • 17.Ibarra-Castanedo C, Bendada A, Maldague X. Thermographic image processing for NDT; Paper presented at: IV Conferencia Panamericana de END; 2007; Buenos Aires, Argentina. [Google Scholar]
  • 18.Sonawane M, Dhawale C. IJCA Proceedings on National Conference on Digital Image and Signal Processing. New York: IJCA Journal; 2015. A brief survey on image segmentation methods. [Google Scholar]
  • 19.Lin HC, Chiu CY, Yang SN. Finding textures by textual descriptions, visual examples, and relevance feedbacks. Pattern Recognit Lett. 2003;24(14):2255–2267. [Google Scholar]
  • 20.Zafiropoulos E, Maglogiannis I, Anagnostopoulos I. A Support Vector Machine Approach to Breast Cancer Diagnosis and Prognosis. Boston, MA: Springer; 2006. [Google Scholar]
  • 21.Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Comput Struct Biotechnol J. 2015;13:8–17. doi: 10.1016/j.csbj.2014.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Russell S, Norvig P. Artificial intelligence: a modern approach. Artificial Intel Prent Hall Egnlewood Cliffs. 1995;25:27. [Google Scholar]
  • 23.Kerlikowske K, Grady D, Rubin SM, Sandrock C, Ernster VL. Efficacy of screening mammography. A meta-analysis. JAMA. 1995;273(2):149–154. [PubMed] [Google Scholar]
  • 24.Saini S, Vijay R. Proceedings; 2015 Fifth International Conference on Communication Systems and Network Technologies. New York: IEEE; 2015. Mammogram analysis using feed-forward back propagation and cascade-forward back propagation artificial neural network; pp. 1170–1180. [Google Scholar]
  • 25.Medjahed SA, Saadi TA, Benyettou A. Breast cancer diagnosis by using k-nearest neighbor with different distances and classification rules. Int J Computer Applications. 2013;62(1):1–5. [Google Scholar]
  • 26.Dheeba J, Selvi ST. A CAD System for Breast Cancer Diagnosis Using Modified Genetic Algorithm Optimized Artificial Neural Network. Berlin, Heidelberg: Springer; 2011. [Google Scholar]
  • 27.Karabatak M. A new classifier for breast cancer detection based on Naïve Bayesian. Measurement. 2015;72:32–36. [Google Scholar]
  • 28.Szegedy C, Liu W, Jia Y, et al. 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) New York: IEEE; 2015. Going deeper with convolutions; pp. 1–9. [Google Scholar]
  • 29.Abdel-Zaher AM, Eldeib AM. Breast cancer classification using deep belief networks. Expert Syst Appl. 2016;46:139–144. [Google Scholar]
  • 30.Danenas P, Garsva G. Selection of Support Vector Machines based classifiers for credit risk domain. Expert Syst Appl. 2015;42(6):3194–3204. [Google Scholar]
  • 31.Xu H, Chen T, Lv J, Guo J. A combined parallel genetic algorithm and support vector machine model for breast cancer detection. J Comput Method Sci Eng. 2016;16(4):773–785. [Google Scholar]
  • 32.Sheshadri HS, Kandaswamy A. Experimental investigation on breast tissue classification based on statistical feature extraction of mammograms. Comput Med Imaging Graph. 2007;31(1):46–48. doi: 10.1016/j.compmedimag.2006.09.015. [DOI] [PubMed] [Google Scholar]
  • 33.Heywang-Köbrunner SH, Hacker A, Sedlacek S. Advantages and Disadvantages of Mammography Screening. Breast Care. 2011;6(3):2–207. doi: 10.1159/000329005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Qi H, Diakides NA. Augmented Vision Perception in Infrared. London: Springer; 2009. Thermal infrared imaging in early breast cancer detection; pp. 139–152. [Google Scholar]
  • 35.Francis SV, Sasikala M, Saranya S. Detection of breast abnormality from thermograms using curvelet transform based feature extraction. J Med Syst. 2014;38(4):23. doi: 10.1007/s10916-014-0023-3. [DOI] [PubMed] [Google Scholar]
  • 36.Prabusankarlal KM, Thirumoorthy P, Manavalan R. Assessment of combined textural and morphological features for diagnosis of breast masses in ultrasound. Human Centr Comput Inform Sci. 2015;5(1):12. [Google Scholar]
  • 37.Shi X, Cheng HD, Hu L. Mass detection and classification in breast ultrasound images using fuzzy SVM; Proceedings of the 2006 Joint Conference on Information Sciences, JCIS 2006; Kaohsiung, Taiwan, ROC. October 8–11, 2006; Paris: Atlantis Press; 2006. pp. 1759–1777. [Google Scholar]
  • 38.Saini S, Vijay R. Mammogram analysis using feed-forward back propagation and cascade-forward back propagation artificial neural network; Proceedings; 2015 Fifth International Conference on Communication Systems and Network Technologies; New York. IEEE; 2015. pp. 1177–1180. [Google Scholar]
  • 39.Saini S, Vijay R. Optimization of Artificial Neural Network Breast Cancer Detection System Based on Image Registration Techniques. Optimization. 2014;105(14):26–29. [Google Scholar]
  • 40.Karahaliou A, Skiadopoulos S, Boniatis I, et al. Texture analysis of tissue surrounding microcalcifications on mammograms for breast cancer diagnosis. Br J Radiol. 2007;80(956):648–656. doi: 10.1259/bjr/30415751. [DOI] [PubMed] [Google Scholar]
  • 41.Basheer NM, Mohammed MH. Classification of breast masses in digital mammograms using support vector machines. Int J Adv Res Comput Sci Software Eng. 2013;2277:57–63. [Google Scholar]
  • 42.Acharya UR, Ng EY, Tan JH, Sree SV. Thermography based breast cancer detection using texture features and Support Vector Machine. J Med Syst. 2012;36(3):1503–1510. doi: 10.1007/s10916-010-9611-z. [DOI] [PubMed] [Google Scholar]
  • 43.Ali MA, Sayed GI, Gaber T, Hassanien AE, Snasel V, Silva LF. Proceedings of the 2015 Federated Conference on Computer Science and Information Systems. Vol. 2015. New York: IEEE; 2015. Detection of breast abnormalities of thermograms based on a new segmentation method; pp. 255–261. [Google Scholar]
  • 44.Milosevic M, Jankovic D, Peulic A. Thermography based breast cancer detection using texture features and minimum variance quantization. Excli J. 2014;13:1204. [PMC free article] [PubMed] [Google Scholar]
  • 45.Qiu Y, Yan S, Gundreddy RR, et al. A new approach to develop computer-aided diagnosis scheme of breast mass classification using deep learning technology. J Xray Sci Technol. 2017;25(5):751–763. doi: 10.3233/XST-16226. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Breast Cancer : Targets and Therapy are provided here courtesy of Dove Press

RESOURCES