A decision support system for mammography reports interpretation

Marzieh Esmaeili; Seyed Mohammad Ayyoubzadeh; Nasrin Ahmadinejad; Marjan Ghazisaeedi; Azin Nahvijou; Keivan Maghooli

doi:10.1007/s13755-020-00109-5

. 2020 Apr 1;8(1):17. doi: 10.1007/s13755-020-00109-5

A decision support system for mammography reports interpretation

Marzieh Esmaeili ^1,², Seyed Mohammad Ayyoubzadeh ^1,^2,^✉, Nasrin Ahmadinejad ^3,⁴, Marjan Ghazisaeedi ¹, Azin Nahvijou ⁵, Keivan Maghooli ⁶

PMCID: PMC7113352 PMID: 32257128

Abstract

Purpose

Mammography plays a key role in the diagnosis of breast cancer; however, decision-making based on mammography reports is still challenging. This paper aims to addresses the challenges regarding decision-making based on mammography reports and propose a Clinical Decision Support System (CDSS) using data mining methods to help clinicians to interpret mammography reports.

Methods

For this purpose, 2441 mammography reports were collected from Imam Khomeini Hospital from March 21, 2018, to March 20, 2019. In the first step, these mammography reports are analyzed and program code is developed to transform the reports into a dataset. Then, the weight of every feature of the dataset is calculated. Random Forest, Naïve Bayes, K-nearest neighbor (K-NN), Deep Learning classifiers are applied to the dataset to build a model capable of predicting the need for referral to biopsy. Afterward, the models are evaluated using cross-validation with measuring Area Under Curve (AUC), accuracy, sensitivity, specificity indices.

Results

The mammography type (diagnostic or screening), mass and calcification features mentioned in the reports are the most important features for decision-making. Results reveal that the K-NN model is the most accurate and specific classifier with the accuracy and specificity values of 84.06% and 84.72% respectively. The Random Forest classifier has the best sensitivity and AUC with the sensitivity and AUC values of 87.74% and 0.905 respectively.

Conclusions

Accordingly, data mining approaches are proved to be a helpful tool to make the final decision as to whether patients should be referred to biopsy or not based on mammography reports. The developed CDSS may also be helpful especially for less experienced radiologists.

Keywords: Breast cancer, Mammography report, BI-RADS, CDSS, Data mining

Introduction

Breast cancer is the most prevalent cancer among women in the majority of countries all over the world [1]. It is also highly rated as the leading cause of cancer death [2]. Therefore, early detection strategies are of the utmost importance to reduce the cancer burden. One of these strategies is non-invasive imaging techniques such as mammography [3].

Currently, mammography is the most effective method for the detection of early-stage breast tumors [4]. Radiologists provide a narrative report based on their observations from mammography breast images. This free-text report includes information about breast composition, calcification, mass, asymmetry, and other findings. American College of Radiology developed a widely accepted reporting system entitled Breast Imaging-Reporting and Data System (BI-RADS) [5] as a scoring tool to assess the risk of breast cancer. BI-RADS presents standard terminology, report structures, and classification abnormalities; thus, it enables health providers to effectively communicate with each other. Based on the mammography findings, BI-RADS suggests a category that ranges from 0–6, namely BI-RADS score. Therefore, mammography reports usually include one or two categories for the assessment of findings. According to the BI-RADS score, a decision about referring the case for further investigation or biopsy will be made.

Despite the simplicity of using BI-RADS, the number of patients referred to biopsy – which poses serious physical and psychological threats to the human body – is unreasonably high [4]. This is due to a great deal of uncertainty in the interpretation of mammography results [6] that might cause overdiagnosis [7], one of the most important problems in breast cancer screening. In [8] the authors reported that about 25% of detected breast cancer cases through mammography are overdiagnosed. In other words, if these women were not screened, they would live with no problem. Also, these interpretations could be various due to the radiologists’ experiences [9]. For reducing these problems, utilizing computer systems could be helpful.

Clinical Decision Support Systems (CDSS) is one of the computer systems that are beneficial for experts to make decisions more accurately [10]. These systems could employ decision rules [11] or data mining techniques to create predictive models based on previous data to diminish human errors [12]. Data Mining represents the process of extracting knowledge from data to discover hidden relationships [13]. One of the most applicable areas of data mining in the healthcare industry. In healthcare, these systems could be used for decision-making regarding diagnosis and treatment [14–16], management [17] and detecting fraud in insurance [18]. Some studies applied the CDSS for breast cancer detection [19, 20]. Some of these systems [21, 22] utilized data mining techniques to make models based on datasets to predict breast cancer. Most of these datasets are publicly available. It seems that there is a need to evaluate these techniques on other datasets rather than public ones.

In this study, a CDSS using data mining approaches is proposed to decrease the uncertainty in the interpretation of mammography reports. The dataset is extracted from obtained mammography free-text reports of a referral hospital in Iran. The CDSS can recommend whether women should be referred for breast biopsy for further examinations or not.

Methods

This study is conducted in the Imam Khomeini Hospital Complex, the largest educational-medical hospital in Tehran, Iran and a referral hospital for breast cancer in the country.

The overall steps of performing the current project are presented in Fig. 1.

Analyzing reports

In the first step, 2441 mammography reports belonging to the women who had mammography during a year from March 21, 2018, to March 20, 2019, at Imam Khomeini Hospital Complex, were considered. The reports were extracted from the internal reporting system website of the hospital. The reports were recorded in a free-text format. An example of these reports is as follows:

A 69 years old woman without family history and chief complaint

This is screening mammography and previous mammography is not available. Scattered areas of glandular density are noted in both breasts (breast composition b). There is no evidence of suspicious mass, microcalcification or any definite sign of malignancy in the breasts. Intramammary lymph node is seen at UOQ of left breast Accessory breast is noted at right axillary. Asymmetry is seen in left inner part. US exam is helpful in this kind of breast tissue.

BIRADS:2

Benign finding(s).

Annual screening mammogram is recommended.

Creating dataset

In the second step, based on the reports and reporting items for BI-RADS [23], a dataset is created with 14 features shown in Table 1.

Table 1.

Features of the mammography reports dataset

Feature name	Feature short name	Value description
Age	age	< 100
Mammography type	mtype	1 = screening 2 = diagnostic
Breast cancer history	hist	0 = no (default value) 1 = yes
Sign	sign	0 = without chief complaint 1 = mass palpation or mass sensation 2 = pain or itching 3 = mass palpation or mass sensation or stiffness 4 = bloody secretion or nipple discharge 5 = at least 2 items 6 = at least 3 items
Breast composition	comp	1 = a 2 = b 3 = c 4 = d
Shape (of mass)	shape	0 = none (default) 1 = round 2 = oval 3 = irregular
Margin (of mass)	margin	0 = none (default value) 1 = circumscribed or well-defined 2 = obscured or partially obscured 3 = micro lobulated 4 = Indistinct or ill-defined 5 = spiculate
Density (of mass)	dens	0 = none (default value) 1 = low or fat-containing 2 = equal 3 = high
Asymmetries	asym	0 = none (default value) 1 = asymmetry 2 = global asymmetry 3 = focal asymmetry 4 = developing asymmetry
Morphology (of calcifications)	morph	0 = none (default value) 1 = skin, vascular, coarse, large rod-like, round or punctate (< 1 mm), rim, dystrophic, milk of calcium, suture 2 = amorphous, coarse heterogeneous, fine pleomorphic, fine linear or fine-linear branching
Distribution (of calcifications)	dist	0 = none (default value) 1 = diffuse or scattered 2 = regional 3 = clustered or grouped 4 = segmental 5 = linear
Associated features	af	0 = none (default value) 1 = skin retraction, nipple retraction, skin thickening, trabecular thickening, axillary adenopathy, architectural distortion, calcifications
Special cases	special	0 = None (Default Value) 1 = intramammary lymph node, a wart on the skin
BI-RADS (referral label)	BI-RADS (REFERRAL)	1, 2, 3 = no referral for biopsy 4a, 4b, 4c, 5, 6 = referral to biopsy

Open in a new tab

The transformation of the reports to the dataset with the features (mentioned in Table 1) has been done by a program written in Visual Studio 2019 C# [24]. This program detects the features’ values by matching the different expressions. The matching rules are created based on expressions seen in the reports.

Weight of factors analysis

In the third step, the weight of each factor is calculated based on Information Gain [25]; then, they are normalized and presented using a bar chart. The information gain metric is expected to determine the more informative features. It is helpful to build predictive models depending on the weight of features to reduce entropy.

Building and evaluating models

In the fourth step, to build a predictive model, data preprocessing is required. Then the predictive models are built using three classification methods. Finally, the evaluation of these models with well-known metrics is performed. This step is taken using RapidMiner Studio 9.3 software [26].

Data preprocessing

The records missing age or BI-RADS scores are removed. These records indicate that the report file is incomplete. A target feature named REFERRAL as a label is generated based on the BI-RADS score, which is indicative of the need for referral for biopsy. The target feature is set to 0 if BI-RADS is less than 4; otherwise, if BI-RADS is greater than 3, the target value is set to 1. Also, records with a BI-RADS score of 0 indicating incomplete assessment are removed. Due to data imbalance between two classes, the Synthetic Minority Over-sampling Technique (SMOTE) [27] is applied to the training dataset. In real-world problems, mostly, the proportion of classes in a dataset is not the same, thus, it has a negative effect on classifier performance. SMOTE solves this problem by synthesizing new instances of the minority class.

Creating models

A common strategy in developing a predictive model is classification. In classification learning methods, a given dataset with a predefined label is utilized to create a classifier [28]. To support the decision in our problem, we employ Random forest [29], Naïve Bayes [30] and K-nearest neighbor (K-NN) [31] as classifiers. The K-NN and the Naïve Bayes classifications are usually selected in basic studies to achieve some induction about data relationships. Besides, they are relatively fast and simple. The random forest classification is more powerful due to combining multiple decision tree classifiers and voting among their results. The definition of these classifiers is presented as follows:

Random Forest: Random forest classification comprises various decision tree classifiers based on the assumption that combining multiple classifiers improves accuracy [32]. It creates several decision tree classifiers with different parameters. finally, to determine the label for each new instance it votes among all acquired results from each decision tree. The combination and voting system make it a powerful method [33].

Naïve Bayes

Naïve Bayes classification is a probabilistic approach based on Bayes theory. It assumes all features are independent [34]. While, in fact, many datasets especially medical datasets do not support it. This assumption makes it a simple method with acceptable speed and accuracy; however, the performance becomes lower in cases with dependency among features [28]. This classification can be used as a primary method in studies due to its fast induction [34].

K-NN

K-nearest neighbor is another simple and basic classification that is broadly applied in studies. The most important parameter in this classifier is a distance metric that is determined relevant to a problem [35]. This model classifies each new instance with calculating the most frequent class among its k-nearest neighbors in the dataset. This algorithm is also simple and fast but noisy data could reduce its performance [28].

Deep learning

Deep learning classifiers automatically extract features of raw data in large datasets with multiple computational layers that represent features in different abstract levels [36]. This classifier had shown great performance in various areas in processing multimedia and unstructured text data.

Evaluating models

The test dataset instances categorized into four categories shown in a matrix called the confusion matrix (Table 2). The performance of classification algorithms is evaluated with the well-known indices like accuracy (Eq. 1), sensitivity (Eq. 2), specificity (Eq. 3) that are calculated using the instances frequency in each category. Besides, The Area Under Curve (AUC) of a receiver operating characteristic (ROC) is evaluated to measure the performance of the classifiers.

A c c u r a c y = \frac{T P + T N}{T P + F P + F N + T N}

S e n s i t i v i t y = \frac{TP}{T P + F N}

S p e c i f i c i t y = \frac{TN}{F P + T N}

Table 2.

Confusion matrix of a binominal classifier

	Actual positive	Actual negative
Predictive positive	TP	FP
Predictive negative	FN	TN

Open in a new tab

Results

1204 records were extracted from the reports and included in the dataset. The dataset contained 1138 records, which were related to the women who had not needed to be referred to biopsy (REFFERRAL = 1), and 66 records, which were for those who had needed to be referred to biopsy for further tests (REFFERRAL = 0).

The weight of the features based on their importance is shown in Fig. 2.

The comparison between the models is presented in Table 3. The best value for each index is shown in bold text. The K-NN model is the most accurate and specific classifier with the values of 84.06% and 84.72% respectively. The Random Forest classifier has the best sensitivity and AUC with the values of 87.74%, 0.905 respectively.

Table 3.

Comparison between classifiers for detecting the need for referral to biopsy

Model	Accuracy (S.D%)	Sensitivity (S.D%)	Specificity (S.D%)	AUC (S.D)	Execution time(s)
Random Forest	82.06% (5.12)	87.74% (14.24)	81.72% (5.40)	0.905 (0.084)	14
Naïve Bayes	80.65% (3.62)	77.68% (19.10)	80.84% (3.01)	0.867 (0.097)	1
K-NN	84.06% (3.32)	72.98% (18.25)	84.72% (3.91)	0.729 (0.188)	1
Deep Learning	77.16% (6.92)	76.55% (17.99%)	77.26% (8.05)	0.859 (0.098)	23

Open in a new tab

Discussion

In this study, classifier models have been proposed for helping clinicians to decide about referring the woman to biopsy or not. Although the K-NN model has been shown to have the best accuracy (84.06%), it seems more sensitive models such as Random Forest—with the sensitivity of 87.74% in this case—might be recommended to avoid misdiagnosing breast cancer. The results revealed the mammography type (diagnostic or screening), mass and calcification features mentioned in the reports were the most important features for decision making about referring to biopsy. Other studies [37] have also mentioned these important factors. Also, Ferreira et al. proposed a Support Vector Machine (SVM) classifier with an accuracy of 85.6% to predict malignancy mainly based on the mass and density properties [38]. The deep learning classifier proposed in this paper showed not such a high performance in comparison with other classifiers. One justification for this issue could be explained as the labeled dataset for training accurate deep learning classifiers should include enormous samples [39]. Since our dataset is relatively small, the deep learning classifiers didn’t perform well. In addition, the deep learning classifier’s power is feature extraction from raw data.

Several studies reported a higher accuracy for predicting breast cancer, this could be due to the difference in chosen datasets. As mentioned earlier, they mostly used datasets that are publicly available. In this study, the dataset was obtained by converting free-text mammography reports to dataset format. Chaurasia and Pal [40] used Naïve Bayes, RBF Network, and Decision Tree techniques to predict breast cancer on Wisconsin dataset provided by the University of California Irvine machine learning repository [41]. They achieved the best accuracy of %97.36 related to the Naïve Bayes classifier. Asri et al. [42] used Decision Tree, Naïve Bayes, KNN, and SVM techniques to predict breast cancer risk, they achieved an accuracy of %97.13 related to SVM classifier. Another study [43] classified breast tissue based on mammography reports on public datasets have reached to the accuracy of > 99%.

Diz et al. [44] proposed a CAD for breast cancer diagnosis. the best accuracy reported for the proposed system was 89.3%. the researchers used naive Bayes, Random Forest, and SVM classifiers. Another study by Kaushik and Kaur [45] utilized data mining techniques for the prediction of breast tissue biopsy results. The researchers proposed a model with an accuracy of 83.5% and a ROC of 0.907.

Chaudhuri et al. [46] in their study predicted the recurrence of breast cancer with an accuracy of 68.5% by using discriminant analysis and decision tree techniques. In addition to modeling the prediction of breast cancer with datasets and texts, other data types such as gene expression data are used to classify breast cancer [47]. Also, there is a trend to use deep learning techniques for breast cancer prediction through histopathologic images [48] and mammography images [49].

The mammography reports in this study were formatted in a free-text format. Many Studies used natural language processing (NLP) and machine learning techniques to extract data from mammography [50–54] and ultrasound mammography reports [55]. A review is conducted on text mining approaches for cancer-related information [56]. This review showed that many studies use symbolic methods based on dictionary lookup and information extraction with the use of pattern matching as the current study.

Banerjee et al. [57] used NLP techniques for automatic inference of BIRADS from mammography reports. They achieved an average precision of 91% for their proposed method. Another study by Bulu et al. [58] proposed terminology of the radiology domain entitled New RadLex based on NLP techniques. The proposed model’s overall accuracy was reported to be 80%.

It is recommended to standardize a structure for reporting to eliminate the possibility of missing information in the free-text format and to make processing easier. For example, missing a feature in the report could be confusing since the absence might be interpreted either as a missing value or an unimportant value from the operator’s view.

This standard might be formed by defining a semi-structural format based on features that have the most effect on the decision-making process. These features could be obtained from the models presented in this and similar papers.

The results revealed that data mining techniques could be used for breast cancer mammography reports interpretation. Further studies might focus on automated CDSS with the input of mammography images and learn from biopsy results as feedback. Moreover, the dataset sample size might be extended to contain more samples especially positive samples to achieve more reliable models. In the case of adequate dataset size, the classifiers’ performance might exceed radiologists’ performance in cancer detection [59]. Besides, the use of ensemble methods might improve the classifier performance [60].

Limitations

The limitations of this study were as follows: The dataset of this project includes mammography reports of women under the supervision of just one clinician (N. A.). Therefore, the dataset size is limited and results might be biased. Also, some reports were written in Persian and, therefore, we excluded them from the analysis.

Conclusions

Overall, we have developed and evaluated a CDSS based on data mining methods for interpreting mammography reports. The input of this system is free-text mammography reports. Afterward, using presented predictive models, the system could suggest the need for biopsy referral as an output. The proposed system could be helpful especially for lower-experienced radiologists as an aid system. Thus, utilizing these CDSS systems could help clinicians to save time in the diagnosis of regular cases and spend more time for patients with more severe problems. Also, the mentioned systems could reduce the uncertainty of diagnosis and unnecessary further examinations—alleviating the negative effects such as overdiagnosis, cost burden, and patient worries.

Acknowledgements

This study was funded and supported by Tehran University of Medical Sciences (TUMS) is part of registered research with the ethical code of IR.TUMS.VCR.REC.1397.476 from Tehran University of Medical Sciences. We wish to thank Ms. Habibeh Amjadi Motlagh and Ms. Manijeh Rezazadeh Khalkhali and all the cancer radiology center staff of Imam Khomeini Hospital Complex.

Compliance with ethical standards

Conflict of interest

The authors report no conflict of interest.

Footnotes

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional ailiations.

References

1.Bray F, et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA. 2018;68(6):394–424. doi: 10.3322/caac.21492. [DOI] [PubMed] [Google Scholar]
2.Ferlay J, et al. Estimating the global cancer incidence and mortality in 2018: GLOBOCAN sources and methods. Int J Cancer. 2019;144(8):1941–1953. doi: 10.1002/ijc.31937. [DOI] [PubMed] [Google Scholar]
3.Mandelblatt JS, et al. Collaborative modeling of the benefits and harms associated with different US breast cancer screening strategies. Ann Intern Med. 2016;164(4):215–225. doi: 10.7326/M15-1536. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Harding C, et al. Long-term relationships between screening rates, breast cancer characteristics, and overdiagnosis in US counties, 1975–2009. J Cancer. 2019;144(3):476–488. doi: 10.1002/ijc.31904. [DOI] [PubMed] [Google Scholar]
5.Liberman L, Menell JH. Breast imaging reporting and data system (BI-RADS) Radiol Clin. 2002;40(3):409–430. doi: 10.1016/s0033-8389(01)00017-3. [DOI] [PubMed] [Google Scholar]
6.Beam CA, Layde PM, Sullivan DC. Variability in the interpretation of screening mammograms by US radiologists: findings from a national sample. Arch Intern Med. 1996;156(2):209–213. [PubMed] [Google Scholar]
7.Welch HG. Overdiagnosis and mammography screening. BMJ. 2009 doi: 10.1136/bmj.b1425. [DOI] [PubMed] [Google Scholar]
8.Welch HG, Black WC. Overdiagnosis in cancer. J Natl Cancer Inst. 2010;102(9):605–613. doi: 10.1093/jnci/djq099. [DOI] [PubMed] [Google Scholar]
9.Barlow WE, et al. Accuracy of screening mammography interpretation by characteristics of radiologists. J Natl Cancer Inst. 2004;96(24):1840–1850. doi: 10.1093/jnci/djh333. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Garg AX, et al. Effects of computerized clinical decision support systems on practitioner performance and patient outcomes: a systematic review. JAMA. 2005;293(10):1223–1238. doi: 10.1001/jama.293.10.1223. [DOI] [PubMed] [Google Scholar]
11.Andrzejewski D, et al. Supporting breast cancer decisions using formalized guidelines and experts decision patterns: initial prototype and evaluation. Health Inf Sci Syst. 2017;5(1):12. doi: 10.1007/s13755-017-0035-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Bellazzi R, Zupan B. Predictive data mining in clinical medicine: current issues and guidelines. Int J Med Inform. 2008;77(2):81–97. doi: 10.1016/j.ijmedinf.2006.11.006. [DOI] [PubMed] [Google Scholar]
13.Han J, Pei J, Kamber M. Data mining: concepts and techniques. Amsterdam: Elsevier; 2011. [Google Scholar]
14.Moitra D, Mandal RK. Automated AJCC (7th edition) staging of non-small cell lung cancer (NSCLC) using deep convolutional neural network (CNN) and recurrent neural network (RNN) Health Inf Sci Syst. 2019;7(1):14. doi: 10.1007/s13755-019-0077-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Xie J, Wang Y, Wu Z. Colon cancer data analysis by chameleon algorithm. Health Inf Sci Syst. 2019;7(1):23. doi: 10.1007/s13755-019-0085-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Shakeel PM, et al. Cloud based framework for diagnosis of diabetes mellitus using K-means clustering. Health Inf Sci Syst. 2018;6(1):16. doi: 10.1007/s13755-018-0054-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Ayyoubzadeh SM, et al. A study of factors related to patients’ length of stay using data mining techniques in a general hospital in southern Iran. Health Inf Sci Syst. 2020;8(1):9. doi: 10.1007/s13755-020-0099-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Koh HC, Tan G. Data mining applications in healthcare. J Health Inf Manag. 2011;19:65. [PubMed] [Google Scholar]
19.Jalalian A, et al. Foundation and methodologies in computer-aided diagnosis systems for breast cancer detection. EXCLI J. 2017;16:113–137. doi: 10.17179/excli2016-701. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Rangayyan RM, Ayres FJ, Leo Desautels JE. A review of computer-aided diagnosis of breast cancer: toward the detection of subtle signs. J Frankl Inst. 2007;344(3–4):312–348. [Google Scholar]
21.Bahmani E, Jamshidi M, Shaltooki A. Breast cancer prediction using a hybrid data mining model. JOIV: Int J Inform Vis. 2019;3(4):327–331. [Google Scholar]
22.Verma G, Verma H. Predicting breast cancer using linear kernel support vector machine. SSRN 3350254. 2019.
23.Zonderland H, Smithuis R. BI-RADS for mammography and ultrasound 2013. The Radiology Assistant Updated version. 2013.
24.Visual Studio 2019. https://visualstudio.microsoft.com/vs/. Accessed 2019.
25.Raileanu LE, Stoffel K. Theoretical comparison between the gini index and information gain criteria. Ann Math Artif Intell. 2004;41(1):77–93. [Google Scholar]
26.rapidminer. 2019. https://rapidminer.com/.
27.Chawla NV, et al. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;16:321–357. [Google Scholar]
28.Tomar D, Agarwal S. A survey on data mining approaches for healthcare. Int J Biosci Biotechnol. 2012;5(5):241–266. [Google Scholar]
29.Ho TK. Random decision forests. In: Proceedings of 3rd international conference on document analysis and recognition. IEEE; 1995.
30.Kononenko I. Inductive and Bayesian learning in medical diagnosis. Appl Artif Intell Int J. 1993;7(4):317–337. [Google Scholar]
31.Hastie T, et al. The elements of statistical learning: data mining, inference and prediction. J R Stat Soc. 2005;27(2):83–85. [Google Scholar]
32.Breiman LJ. Random forests. Mach Learn. 2001;45(1):5–32. [Google Scholar]
33.Kulkarni VY, Sinha PK. Pruning of random forest classifiers: a survey and future directions. In: 2012 international conference on data science & engineering (ICDSE). IEEE; 2012.
34.Zhang H. The optimality of naïve Bayes. AA. 2004;1(2):3. [Google Scholar]
35.Zhang M-L, Zhou Z-H. A k-nearest neighbor based algorithm for multi-label classification. 2005 IEEE Int Conf Gran Comput. 2005;5:718–721. [Google Scholar]
36.LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521(7553):436–444. doi: 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]
37.Burnside ES, et al. Knowledge discovery from structured mammography reports using inductive logic programming. In: AMIA annual symposium proceedings. American Medical Informatics Association; 2005. [PMC free article] [PubMed]
38.Ferreira P, et al. Predicting malignancy from mammography findings and image-guided core biopsies. Int J Data Min Bioinform. 2015;11(3):257. doi: 10.1504/ijdmb.2015.067319. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Arnold L, et al. An introduction to deep learning. In: European symposium on artificial neural networks (ESANN). Bruges; 2011.
40.Chaurasia V, Pal S, Tiwari B. Prediction of benign and malignant breast cancer using data mining techniques. J Algorithms Comput Technol. 2018;12(2):119–126. [Google Scholar]
41.Dua DAG, Casey. UCI machine learning repository. 2017. https://archive.ics.uci.edu/ml.
42.Asri H, et al. Using machine learning algorithms for breast cancer risk prediction and diagnosis. Procedia Comput Sci. 2016;83:1064–1069. [Google Scholar]
43.Percha B, et al. Automatic classification of mammography reports by BI-RADS breast tissue composition class. J Am Med Inform Accoc. 2012;19(5):913–916. doi: 10.1136/amiajnl-2011-000607. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Diz J, Marreiros G, Freitas A. Applying data mining techniques to improve breast cancer diagnosis. J Med Syst. 2016;40(9):203. doi: 10.1007/s10916-016-0561-y. [DOI] [PubMed] [Google Scholar]
45.Kaushik D, Kaur K. Application of data mining for high accuracy prediction of breast tissue biopsy results. In: 2016 third international conference on digital information processing, data mining, and wireless communications (DIPDMWC). IEEE; 2016.
46.Chaudhuri AK, Sinha D, Thyagaraj KS. Identification of the recurrence of breast cancer by discriminant analysis. In: Emerging technologies in data mining and information security; 2019. p. 519–532.
47.Mendes A. Identification of breast cancer subtypes using multiple gene expression microarray datasets. Heidelberg: Springer; 2011. [Google Scholar]
48.Deniz E, et al. Transfer learning based histopathologic image classification for breast cancer detection. Health Inf Sci Syst. 2018;6(1):18. doi: 10.1007/s13755-018-0057-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Danala G, et al. Classification of breast masses using a computer-aided diagnosis scheme of contrast enhanced digital mammograms. Ann Biomed Eng. 2018;46(9):1419–1431. doi: 10.1007/s10439-018-2044-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Castro SM, et al. Automated annotation and classification of BI-RADS assessment from radiology reports. J Biomed Inform. 2017;69:177–187. doi: 10.1016/j.jbi.2017.04.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Bozkurt S, et al. Automatic abstraction of imaging observations with their characteristics from mammography reports. Am Med Inform Assoc. 2014;22(e1):e81–e92. doi: 10.1136/amiajnl-2014-003009. [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Patel TA, et al. Correlating mammographic and pathologic findings in clinical decision support using natural language processing and data mining methods. Cancer. 2017;123(1):114–121. doi: 10.1002/cncr.30245. [DOI] [PubMed] [Google Scholar]
53.Nassif H, et al. Information extraction for clinical data mining: a mammography case study. In: 2009 IEEE international conference on data mining workshops. IEEE; 2009. [DOI] [PMC free article] [PubMed]
54.Peng Y, et al. NegBio: a high-performance tool for negation and uncertainty detection in radiology reports. AMIA Summits Transl Sci Proc. 2018;2018(2018):188. [PMC free article] [PubMed] [Google Scholar]
55.Miao S, et al. Extraction of BI-RADS findings from breast ultrasound reports in Chinese using deep learning approaches. Int J Med Inform. 2018;119:17–21. doi: 10.1016/j.ijmedinf.2018.08.009. [DOI] [PubMed] [Google Scholar]
56.Spasić I, et al. Text mining of cancer-related information: review of current status and future directions. Int J Med Inform. 2014;83(9):605–623. doi: 10.1016/j.ijmedinf.2014.06.009. [DOI] [PubMed] [Google Scholar]
57.Banerjee I, et al. Automatic inference of BI-RADS final assessment categories from narrative mammography report findings. J Biomed Inform. 2019;92:103137. doi: 10.1016/j.jbi.2019.103137. [DOI] [PMC free article] [PubMed] [Google Scholar]
58.Bulu H, et al. Proposing new radlex terms by analyzing free-text mammography reports. J Digit Imaging. 2018;31(5):596–603. doi: 10.1007/s10278-018-0064-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Burnside ES, et al. Probabilistic computer model developed from clinical data in national mammography database format to classify mammographic findings. Radiology. 2009;251(3):663–672. doi: 10.1148/radiol.2513081346. [DOI] [PMC free article] [PubMed] [Google Scholar]
60.Hosni M, et al. Reviewing ensemble classification methods in breast cancer. Computer methods and programs in biomedicine. Amsterdam: Elsevier; 2019. [DOI] [PubMed] [Google Scholar]

[CR1] 1.Bray F, et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA. 2018;68(6):394–424. doi: 10.3322/caac.21492. [DOI] [PubMed] [Google Scholar]

[CR2] 2.Ferlay J, et al. Estimating the global cancer incidence and mortality in 2018: GLOBOCAN sources and methods. Int J Cancer. 2019;144(8):1941–1953. doi: 10.1002/ijc.31937. [DOI] [PubMed] [Google Scholar]

[CR3] 3.Mandelblatt JS, et al. Collaborative modeling of the benefits and harms associated with different US breast cancer screening strategies. Ann Intern Med. 2016;164(4):215–225. doi: 10.7326/M15-1536. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR4] 4.Harding C, et al. Long-term relationships between screening rates, breast cancer characteristics, and overdiagnosis in US counties, 1975–2009. J Cancer. 2019;144(3):476–488. doi: 10.1002/ijc.31904. [DOI] [PubMed] [Google Scholar]

[CR5] 5.Liberman L, Menell JH. Breast imaging reporting and data system (BI-RADS) Radiol Clin. 2002;40(3):409–430. doi: 10.1016/s0033-8389(01)00017-3. [DOI] [PubMed] [Google Scholar]

[CR6] 6.Beam CA, Layde PM, Sullivan DC. Variability in the interpretation of screening mammograms by US radiologists: findings from a national sample. Arch Intern Med. 1996;156(2):209–213. [PubMed] [Google Scholar]

[CR7] 7.Welch HG. Overdiagnosis and mammography screening. BMJ. 2009 doi: 10.1136/bmj.b1425. [DOI] [PubMed] [Google Scholar]

[CR8] 8.Welch HG, Black WC. Overdiagnosis in cancer. J Natl Cancer Inst. 2010;102(9):605–613. doi: 10.1093/jnci/djq099. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Barlow WE, et al. Accuracy of screening mammography interpretation by characteristics of radiologists. J Natl Cancer Inst. 2004;96(24):1840–1850. doi: 10.1093/jnci/djh333. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR10] 10.Garg AX, et al. Effects of computerized clinical decision support systems on practitioner performance and patient outcomes: a systematic review. JAMA. 2005;293(10):1223–1238. doi: 10.1001/jama.293.10.1223. [DOI] [PubMed] [Google Scholar]

[CR11] 11.Andrzejewski D, et al. Supporting breast cancer decisions using formalized guidelines and experts decision patterns: initial prototype and evaluation. Health Inf Sci Syst. 2017;5(1):12. doi: 10.1007/s13755-017-0035-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Bellazzi R, Zupan B. Predictive data mining in clinical medicine: current issues and guidelines. Int J Med Inform. 2008;77(2):81–97. doi: 10.1016/j.ijmedinf.2006.11.006. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Han J, Pei J, Kamber M. Data mining: concepts and techniques. Amsterdam: Elsevier; 2011. [Google Scholar]

[CR14] 14.Moitra D, Mandal RK. Automated AJCC (7th edition) staging of non-small cell lung cancer (NSCLC) using deep convolutional neural network (CNN) and recurrent neural network (RNN) Health Inf Sci Syst. 2019;7(1):14. doi: 10.1007/s13755-019-0077-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Xie J, Wang Y, Wu Z. Colon cancer data analysis by chameleon algorithm. Health Inf Sci Syst. 2019;7(1):23. doi: 10.1007/s13755-019-0085-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Shakeel PM, et al. Cloud based framework for diagnosis of diabetes mellitus using K-means clustering. Health Inf Sci Syst. 2018;6(1):16. doi: 10.1007/s13755-018-0054-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Ayyoubzadeh SM, et al. A study of factors related to patients’ length of stay using data mining techniques in a general hospital in southern Iran. Health Inf Sci Syst. 2020;8(1):9. doi: 10.1007/s13755-020-0099-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Koh HC, Tan G. Data mining applications in healthcare. J Health Inf Manag. 2011;19:65. [PubMed] [Google Scholar]

[CR19] 19.Jalalian A, et al. Foundation and methodologies in computer-aided diagnosis systems for breast cancer detection. EXCLI J. 2017;16:113–137. doi: 10.17179/excli2016-701. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Rangayyan RM, Ayres FJ, Leo Desautels JE. A review of computer-aided diagnosis of breast cancer: toward the detection of subtle signs. J Frankl Inst. 2007;344(3–4):312–348. [Google Scholar]

[CR21] 21.Bahmani E, Jamshidi M, Shaltooki A. Breast cancer prediction using a hybrid data mining model. JOIV: Int J Inform Vis. 2019;3(4):327–331. [Google Scholar]

[CR22] 22.Verma G, Verma H. Predicting breast cancer using linear kernel support vector machine. SSRN 3350254. 2019.

[CR23] 23.Zonderland H, Smithuis R. BI-RADS for mammography and ultrasound 2013. The Radiology Assistant Updated version. 2013.

[CR24] 24.Visual Studio 2019. https://visualstudio.microsoft.com/vs/. Accessed 2019.

[CR25] 25.Raileanu LE, Stoffel K. Theoretical comparison between the gini index and information gain criteria. Ann Math Artif Intell. 2004;41(1):77–93. [Google Scholar]

[CR26] 26.rapidminer. 2019. https://rapidminer.com/.

[CR27] 27.Chawla NV, et al. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;16:321–357. [Google Scholar]

[CR28] 28.Tomar D, Agarwal S. A survey on data mining approaches for healthcare. Int J Biosci Biotechnol. 2012;5(5):241–266. [Google Scholar]

[CR29] 29.Ho TK. Random decision forests. In: Proceedings of 3rd international conference on document analysis and recognition. IEEE; 1995.

[CR30] 30.Kononenko I. Inductive and Bayesian learning in medical diagnosis. Appl Artif Intell Int J. 1993;7(4):317–337. [Google Scholar]

[CR31] 31.Hastie T, et al. The elements of statistical learning: data mining, inference and prediction. J R Stat Soc. 2005;27(2):83–85. [Google Scholar]

[CR32] 32.Breiman LJ. Random forests. Mach Learn. 2001;45(1):5–32. [Google Scholar]

[CR33] 33.Kulkarni VY, Sinha PK. Pruning of random forest classifiers: a survey and future directions. In: 2012 international conference on data science & engineering (ICDSE). IEEE; 2012.

[CR34] 34.Zhang H. The optimality of naïve Bayes. AA. 2004;1(2):3. [Google Scholar]

[CR35] 35.Zhang M-L, Zhou Z-H. A k-nearest neighbor based algorithm for multi-label classification. 2005 IEEE Int Conf Gran Comput. 2005;5:718–721. [Google Scholar]

[CR36] 36.LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521(7553):436–444. doi: 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]

[CR37] 37.Burnside ES, et al. Knowledge discovery from structured mammography reports using inductive logic programming. In: AMIA annual symposium proceedings. American Medical Informatics Association; 2005. [PMC free article] [PubMed]

[CR38] 38.Ferreira P, et al. Predicting malignancy from mammography findings and image-guided core biopsies. Int J Data Min Bioinform. 2015;11(3):257. doi: 10.1504/ijdmb.2015.067319. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR39] 39.Arnold L, et al. An introduction to deep learning. In: European symposium on artificial neural networks (ESANN). Bruges; 2011.

[CR40] 40.Chaurasia V, Pal S, Tiwari B. Prediction of benign and malignant breast cancer using data mining techniques. J Algorithms Comput Technol. 2018;12(2):119–126. [Google Scholar]

[CR41] 41.Dua DAG, Casey. UCI machine learning repository. 2017. https://archive.ics.uci.edu/ml.

[CR42] 42.Asri H, et al. Using machine learning algorithms for breast cancer risk prediction and diagnosis. Procedia Comput Sci. 2016;83:1064–1069. [Google Scholar]

[CR43] 43.Percha B, et al. Automatic classification of mammography reports by BI-RADS breast tissue composition class. J Am Med Inform Accoc. 2012;19(5):913–916. doi: 10.1136/amiajnl-2011-000607. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Diz J, Marreiros G, Freitas A. Applying data mining techniques to improve breast cancer diagnosis. J Med Syst. 2016;40(9):203. doi: 10.1007/s10916-016-0561-y. [DOI] [PubMed] [Google Scholar]

[CR45] 45.Kaushik D, Kaur K. Application of data mining for high accuracy prediction of breast tissue biopsy results. In: 2016 third international conference on digital information processing, data mining, and wireless communications (DIPDMWC). IEEE; 2016.

[CR46] 46.Chaudhuri AK, Sinha D, Thyagaraj KS. Identification of the recurrence of breast cancer by discriminant analysis. In: Emerging technologies in data mining and information security; 2019. p. 519–532.

[CR47] 47.Mendes A. Identification of breast cancer subtypes using multiple gene expression microarray datasets. Heidelberg: Springer; 2011. [Google Scholar]

[CR48] 48.Deniz E, et al. Transfer learning based histopathologic image classification for breast cancer detection. Health Inf Sci Syst. 2018;6(1):18. doi: 10.1007/s13755-018-0057-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR49] 49.Danala G, et al. Classification of breast masses using a computer-aided diagnosis scheme of contrast enhanced digital mammograms. Ann Biomed Eng. 2018;46(9):1419–1431. doi: 10.1007/s10439-018-2044-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR50] 50.Castro SM, et al. Automated annotation and classification of BI-RADS assessment from radiology reports. J Biomed Inform. 2017;69:177–187. doi: 10.1016/j.jbi.2017.04.011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR51] 51.Bozkurt S, et al. Automatic abstraction of imaging observations with their characteristics from mammography reports. Am Med Inform Assoc. 2014;22(e1):e81–e92. doi: 10.1136/amiajnl-2014-003009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR52] 52.Patel TA, et al. Correlating mammographic and pathologic findings in clinical decision support using natural language processing and data mining methods. Cancer. 2017;123(1):114–121. doi: 10.1002/cncr.30245. [DOI] [PubMed] [Google Scholar]

[CR53] 53.Nassif H, et al. Information extraction for clinical data mining: a mammography case study. In: 2009 IEEE international conference on data mining workshops. IEEE; 2009. [DOI] [PMC free article] [PubMed]

[CR54] 54.Peng Y, et al. NegBio: a high-performance tool for negation and uncertainty detection in radiology reports. AMIA Summits Transl Sci Proc. 2018;2018(2018):188. [PMC free article] [PubMed] [Google Scholar]

[CR55] 55.Miao S, et al. Extraction of BI-RADS findings from breast ultrasound reports in Chinese using deep learning approaches. Int J Med Inform. 2018;119:17–21. doi: 10.1016/j.ijmedinf.2018.08.009. [DOI] [PubMed] [Google Scholar]

[CR56] 56.Spasić I, et al. Text mining of cancer-related information: review of current status and future directions. Int J Med Inform. 2014;83(9):605–623. doi: 10.1016/j.ijmedinf.2014.06.009. [DOI] [PubMed] [Google Scholar]

[CR57] 57.Banerjee I, et al. Automatic inference of BI-RADS final assessment categories from narrative mammography report findings. J Biomed Inform. 2019;92:103137. doi: 10.1016/j.jbi.2019.103137. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR58] 58.Bulu H, et al. Proposing new radlex terms by analyzing free-text mammography reports. J Digit Imaging. 2018;31(5):596–603. doi: 10.1007/s10278-018-0064-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR59] 59.Burnside ES, et al. Probabilistic computer model developed from clinical data in national mammography database format to classify mammographic findings. Radiology. 2009;251(3):663–672. doi: 10.1148/radiol.2513081346. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR60] 60.Hosni M, et al. Reviewing ensemble classification methods in breast cancer. Computer methods and programs in biomedicine. Amsterdam: Elsevier; 2019. [DOI] [PubMed] [Google Scholar]

PERMALINK

A decision support system for mammography reports interpretation

Marzieh Esmaeili

Seyed Mohammad Ayyoubzadeh

Nasrin Ahmadinejad

Marjan Ghazisaeedi

Azin Nahvijou

Keivan Maghooli

Abstract

Purpose

Methods

Results

Conclusions

Introduction

Methods

Fig. 1.

Analyzing reports

Creating dataset

Table 1.

Weight of factors analysis

Building and evaluating models

Data preprocessing

Creating models

Naïve Bayes

K-NN

Deep learning

Evaluating models

Table 2.

Results

Fig. 2.

Table 3.

Discussion

Limitations

Conclusions

Acknowledgements

Compliance with ethical standards

Conflict of interest

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases