Abstract
Aortic dissection is one of the most clinical-challenging and life-threatening cardiovascular diseases associated with high morbidity and mortality. Aortic dissection requires fast diagnosis and timely therapy. Any delay or misdiagnosis can cause severe consequence to aortic dissection patients with even higher mortality. To better help physicians identify the potential dissection within the scope of all misdiagnosed patients, this paper describes a method which is developed with data mining methods for aortic dissection patient classification and prediction in the phase of early diagnosis. Various machine learning algorithms were used to build the models which were all trained and tested on the patient dataset with cross validation. Among them, Bayesian Network model achieved the best performance by predicting at a precision rate of 84.55% with Area Under the Curve (AUC) value of 0.857. On this basis, the Bayesian Network model can help physicians better with early diagnosis of aortic dissection in clinical practice. Beyond this study, more data from diverse regions and the internal pathology can be crucial to further build a universal model with broader predictive power.
Introduction
Cardiovascular diseases are among the major causes of death in most developed countries and many if not all developing countries1. Among all the categories of cardiovascular diseases, aortic dissection is fatal with high mortality. The aorta is a complex organ, of which the wall has a 3-layered anatomic configuration. When aortic dissection occurs, blood flows between the layers of the aortic walls then forces the layers apart and creates a new secondary channel2. Previous literature examined the aortic dissection patients and found some features of the population. For example, a typical aortic dissection patient is a gentleman in his 60 s or 70 s with hypertension, prior cardiac surgery, bicuspid aortic valve, and a history of Marfan syndrome3. However, these features are extracted by only examining the confirmed patients without comparing to the features of other easy-to confuse diseases. Some works from the previous literature show that patients with aortic dissection share pathologic similarities with other diseases4–6. which may not be so useful to distinguish the real ones from all misdiagnosed cases.
In clinical practice, aortic dissection patients usually arrive in an emergency with the common abrupt onset of severe chest pain, which makes it critical and challenging to successfully predict and identify aortic dissection as the actual cause out of all the possible ones7. Up to 30% of aortic dissection patients are initially misdiagnosed to have other conditions, such as acute coronary syndromes, non-dissecting aneurysms, pulmonary embolism, or aortic stenosis8–10. Until now, no unique symptoms have been found for aortic dissection and it always requires imaging detection like chest X-ray and ECG (Electrocardiograph) to further confirm. Acquiring such detection is time-consuming that is not ideal since speed is of utmost importance of aortic dissection diagnosis. However, such tests for misdiagnosed cases can be crucial for the coming therapy after confirmation which may differ in terms of other conditions11. Since there is a wide range of clinical symptoms for aortic dissection patients, misdiagnosed cases should go through quick risk stratification and management12. Beyond traditional detection schema of processing ECG and chest X-ray which is slow that may delay the timely therapy, a new method that can assist physicians to fast and reliably identify the aortic dissection patients from all misdiagnosed cases is badly in need.
In order to fast and reliably identify, or in other words, to classify patients into aortic dissection category if they are, it is necessary to conduct methods which have high effectiveness and efficiency in classifying. In view of this strong demand in clinical practice, machine learning classification can serve as one solution. It is a supervised machine learning technique that assigns items to target classes when class labels are known13. Classification, together with other machine learning techniques, has a potential to make a huge impact on healthcare and biomedical domain14. In several works, classification approach enhanced by attribute selection techniques has been successfully applied in medical informatics15–18. Nonetheless, few researchers focus on the early diagnosis and prediction of aortic dissection due to the imbalanced geographical distribution of patients which makes it difficult to collect high-volume and high-quality dataset to proceed such method.
In our study, we proposed a prediction model by applying classification analysis to classify aortic dissection positive patients from all misdiagnosed cases in the phase of early diagnosis. Such model can help physicians to fast spot aortic dissection positive patients out of all misdiagnosed cases, and to lose no time in understanding the situation and estimating the progression after admission. After training on the real-case data, this prediction model can set the priority of patients by classifying them into two group. The predicted aortic dissection positive patients need more eyes on whereas the predicted aortic dissection negative patients are more likely to be eventually diagnosed with other diseases. In the past, the decisions made in the early diagnosis phase are primarily based on physicians’ expertise and consultation, which can be a barrier to a successful diagnosis. The new model reduces this barrier since all the required data come from admission information plus several simple and routine tests. It saves both patients’ time and hospital resources by making it possible to provide the timely therapy for the most needed patients.
Results
Attributes selection
We examined both the confirmed and misdiagnosed aortic dissection cases in the past three years to build the initial dataset. The raw data may contain irrelevant or redundant information that can make knowledge discovery more difficult. Attributes in this dataset include personal information details, family status, career information, drug allergies, and medical test results. For the aim of quick classification in early diagnosis phrase, the data need to be easy to acquire and have the lowest dimensions while maintaining the accuracy.
The attribute selection process contains two steps: hand selection then algorithm evaluation.
In the first step, we eliminated attributes of highly personal details, general administrative records and specific characteristics which do not apply to all patients. The remaining 40 attributes after the first step include parts of EMRs, blood routine test results and lipid profile: SEX, AGE, OCCUPATION, MARITAL STATUS, IN_TYPE (patient admission type), WEATHER, TEMPRETURE (both highest and lowest), IN_DEPT (first admission department), CHOI, TG, Lp(a), HDL-C, LDL-C, APOA, APOB, APOE, WBC, #BASO, BASO%, #EO, EO%, HCT, HGB, #LYMPH, LYMPH%, MCH, MCHC, MCV, #MONO, MONO%, MPV, #NEUT, NEUT%, PCT, PDW, P-LCR, PLT, RBC, RDW-CV, RDW-SD.
Then in the second step, we applied CFS subset evaluation method with the greedy algorithm on these attributes. CFS method selects the following thirteen attributes that have the best predictive ability with lowest degree of redundancy: IN_TYPE (Admission Type), IN_DEPT (First Admission Department), CHOL (Total Cholesterol), TG (Triglyceride), Lp(a) (Lipoprotein a), HDL-C (High-density Lipoprotein), APOB (Apolipoprotein B), APOE (Apolipoprotein E), WBC (White Blood Cell Count), LYMPH% (Lymphocyte Percentage), #MONO (Monocyte Count), RBC (Red Blood Cell Count) and RDW-CV (Red Cell Distribution Width, Coefficient of Variation). These thirteen attributes would be used to build the machine learning classification model and for future clinical prediction during the early diagnosis phase. These thirteen attributes would be used to build the machine learning classification model and for future clinical prediction during the early diagnosis phase.
Basic Characteristics of the patients
492 cases including 372 male patients and 120 female patients were recruited in the study. They are either with aortic dissection or misdiagnosed as aortic dissection by physicians in early diagnosis phase. The descriptive statistical analysis was performed to get the perceptual image of our case dataset and to show the basic characteristics of the patients. The analysis of selected attributes after applying the attribute selection algorithm with some other descriptive attributes including age, gender, marital situation, occupation, and weather condition, were presented by figure or table based on their nominal or numeric nature. Nominal attributes of aortic dissection positive patients were shown in a stacked bar chart (Fig. 1). The colour set in one attributes column represents the possible nominal values in that specific attribute; the area size of each colour is relative to how many instances an attribute has for that specific nominal value. Same colour across different bars has no necessary connection or implying same value interval. The specific values of each attribute are given in Table 1. Through Fig. 1 and Table 1, it is difficult to tell the overall difference between aortic dissection patients and the misdiagnosed patients. A grouped box chart analysis of selected numeric attributes is shown in Fig. 2. There is no distinction between aortic dissection patients and misdiagnosed patients among all attributes except Lp(a) of misdiagnosed patients has a wide range of distribution. Descriptive analysis of numeric attributes was presented in (Table 2).
Table 1.
Attributes | No. Positive | % Positive | No. Negative | % Negative |
---|---|---|---|---|
Gender | ||||
Male | 255 | 77.3% | 116 | 71.6% |
Female | 75 | 22.7% | 46 | 28.4% |
Marital Status | ||||
Unmarried | 8 | 2.4% | 4 | 2.5% |
Married | 314 | 95.2% | 157 | 96.9% |
Widowed | 6 | 1.8% | 1 | 0.6% |
Divorced | 2 | 0.6% | 0 | 0% |
Admission Approach | ||||
Emergency | 168 | 50.9% | 40 | 24.7% |
Outpatient | 159 | 48.2% | 120 | 74.1% |
Transferred-in | 3 | 0.9% | 2 | 1.2% |
Table 2.
Variables | Minimum | Maximum | Mean | STD | |
---|---|---|---|---|---|
Clinical Data | |||||
Age, year | P | 20 | 87 | 54.0 | 13.1 |
N | 21 | 87 | 57.6 | 14.4 | |
Male, n (%) | P | N/A | N/A | 255 (77.04%) | N/A |
N | N/A | N/A | 116 (71.60%) | N/A | |
Low Air Temp, °C | P | −12 | 28 | 9.3 | 9.2 |
N | −5 | 27 | 12.3 | 8.8 | |
High Air Temp, °C | P | −2 | 37 | 18.4 | 9.9 |
N | 1 | 38 | 21.3 | 9.8 | |
Lipid Profile | |||||
CHOI, mmol/L | P | 1.95 | 7.34 | 3.46 | 0.83 |
N | 2.14 | 7.59 | 3.57 | 1.02 | |
TG, mmol/L | P | 0.25 | 7.40 | 1.28 | 0.77 |
N | 0.40 | 8.84 | 1.53 | 0.95 | |
Lp(a), mg/L | P | 20.00 | 887.30 | 166.67 | 127.01 |
N | 20.60 | 660.00 | 194.41 | 144.66 | |
HDL-C, mmol/L | P | 0.32 | 1.95 | 0.97 | 0.27 |
N | 0.43 | 1.92 | 0.90 | 0.23 | |
APOB, g/L | P | 0.28 | 1.55 | 0.67 | 0.18 |
N | 0.32 | 1.50 | 0.71 | 0.23 | |
APOE, mg/L | P | 11.00 | 115.50 | 33.39 | 12.56 |
N | 13.20 | 139.80 | 34.99 | 14.30 | |
Blood Routine Examination | |||||
WBC, 109/L | P | 2.05 | 26.16 | 10.12 | 4.14 |
N | 2.00 | 24.14 | 7.24 | 3.08 | |
LYMPH%, % | P | 2.00 | 57.10 | 14.89 | 9.38 |
N | 1.80 | 54.51 | 21.09 | 11.12 | |
MONO#, 109/L | P | 0.00 | 4.60 | 0.62 | 0.41 |
N | 1.83 | 6.22 | 4.05 | 0.83 | |
RBC, 1012/L | P | 2.38 | 6.08 | 4.17 | 0.71 |
N | 0.01 | 1.50 | 0.43 | 0.23 | |
RDW-CV, % | P | 11.40 | 21.80 | 13.37 | 1.11 |
N | 11.70 | 23.00 | 13.87 | 1.60 |
Classification models
We used Weka toolkit to employ four common classifiers on our dataset including Bayesian Network, Naïve Bayes, J48, and SMO. Grid search algorithm was used to find the optimal parameters for each model thus could achieve best possible performance19. The result of ZeroR classification and prediction rule was set as the baseline to which, we compared the performance of each model to the accuracy of physicians which is set as the benchmark r0.
1 |
The benchmark prediction precision is 67.07% with neutral AUC value 0.4951. Bayesian Network model achieved precision rate as 84.55% with AUC value 0.8571. Naïve Bayes model achieved precision rate as 79.88% with AUC value 0.78. J48 model achieved precision rats as 76.14% with AUC value 0.8091. SMO model achieved precision rate as 82.11% with AUC value 0.8056. The ROC curves are shown in Fig. 3. Bayes Net model has the best performance among four classifiers. The details of each model can be found as online supplementary materials.
Performance on emergent patients
Even though our dataset contains more nonurgent patients than emergent patients (284:208). Emergent patients usually have more critical situations which is more likely to cause patient death. For this reason, we need to pay extra attention to emergent patients. We reconduct the process and find our model can work even better on emergent patients. The speed of early diagnosis is even more crucial for emergent patients. The CFS attribute selection method select 4 of 40 attributes with the highest performance (IN_DEPT, APOB, WBC, #MONO). This can significantly reduce the dimension of required data and can help physicians handle the situation better. The performance of each selection method is summarized in Table 3.
Table 3.
Classifier | Precision | AUC |
---|---|---|
Benchmark | 80.77% | 0.4952 |
Bayesian Network | 86.54% | 0.8138 |
Naïve Bayes | 86.53% | 0.8057 |
SMO | 84.13% | 0.6256 |
J48 | 81.73% | 0.5905 |
Contributions to clinical diagnosis
One of our research purposes is to find the inherent connection between the multiple attributes of the aortic dissection patients and then to extract a model that can predict the disease better than previous physician judgement. The pathogenesis of aortic dissection is so complex with diverse patient symptoms. Physicians relying on previous experience often leads to misdiagnosis and therefore fails to provide timely treatment. Due to the low morbidity rate and high mortality rate of Aortic dissection, physicians in the past did not have enough cases to summarise the characteristics of such disease, let alone somehow develop an efficient and reliable model for prediction and early diagnosis. Our study involved 527 cases of both confirmed and misdiagnosed aortic dissection patients. We presented a model that can correctly predict the aortic dissection at the ratio over 85%. Compared to the 67% baseline, we improved the prediction precision which is critical to provide appropriate treatment including real-time monitoring for early diagnosis.
For the misdiagnosed cases, we find that many of them are simply hypertension patients (22 cases), coronary artery patients (15 cases), unstable angina pectoris patients (14 cases), chronic nephritis patients (11 cases). To better distinguish these diseases with aortic dissection can be our focus in future research.
Discussion
Machine learning classification, unlike clustering that is another branch of machine learning, is a supervised learning technique which is built on the previous knowledge of raw dataset and relevant class labels20. Since the dataset in this study is the final record version, each case has specific and clear label besides all the relevant attributes. We fully utilized this complete dataset for knowledge mining by machine learning classification approach. Along with classification, we proceeded attribute selection in pre-processing step to remove irrelevant and redundant attributes that can be noises when processing classification. Theoretically, the model we proposed can help classify aortic dissection positive patients apart from other misdiagnosed cases. The further work may include integrating this model and algorithm into the current system or developing an independent application with data imported from EMR system.
To help us better understand the nature of our dataset and the prediction model, we tried to analyse the characteristics of the selected attributes. The selected thirteen attributes contain six from lipid profile and five from blood routine examination while the other two are patient admission attributes. That 6 of total 8 lipid-profile test attributes were used to build our model indicates the lipid profile can be very crucial to the clinical prediction in early diagnosis phase. Previous research finds that many aortic dissection patients show hyperlipidemia, especially for male patients21,22. Lipids infiltrating the aortic wall can cause an immune response known as inflammatory aortic aneurysm which weakens the aortic wall, hastens the wall expansion and causes higher wall stress23. Our model and analysis show that aortic dissection has no distinct gender or occupation trend compared to.
In conclusion, our proposed risk-stratification model can help physicians with early classification and prediction of aortic dissection patients from all the misdiagnosed cases, which can save test and diagnosis time by providing timely therapy for the most urgent patients. Besides, the thirteen attributes after attribute selection process can give physicians new inspiration and reference for further understanding aortic dissection and deeper expending the relevant pathogenesis knowledge. However, the prediction model is based on the misdiagnosed cases that come from a regional medical centre in North-western China. The geographical distinction may lead to a varied model when applying to patients from other regions or countries. Also, more knowledge of inherent pathogenesis is always needed whether for a better understanding and therapy of the disease or for a more precise model with more representative attributes.
Methods
Study population
There were 526 cases of both confirmed and misdiagnosed aortic dissection involved in our study from May 2013 to November 2016. In 34 cases, patients died shortly after admission and lacked chemical test data like blood routine examination and lipid profile. These 34 cases were excluded from the study population thus we have a population size of 492. All data except weather condition and air temperature were collected by hospital’s EMR system. Weather and temperature data were collected from tianqi.com which is a public weather information service provider. All patients were admitted to the First Affiliated Hospital of Xi’an Jiaotong University either in emergency or outpatient approach. After diagnosed in the first admitted department if not cardiac, they were transferred to the cardiac department because of the diagnosis or suspicion of aortic dissection. All patients admitted if not died earlier had accepted blood routine examination and lipid profile test. These chemical test data and the data previously mentioned were used for data mining.
Attribute selection
By using attribute selection techniques, we select a subset of attributes which are most relevant for classification. CFS subset evaluation was employed in our study for evaluating the contribution of attribute subsets to find the relevant attributes24. It is the first of the methods that evaluate subsets of attributes rather than individual attributes25. For the search method, a forward greedy search was applied to search through the space of attribute subsets. CFS algorithm involves a heuristic (1) that considers the usefulness of each feature for predicting the class with the level of intercorrelation among them.
2 |
Merits: the heuristic “merit” of a feature subset S containing k features, : the average feature-class correlation, : the average feature-feature intercorrelation.
Cross validation
The previous study proved that using the whole dataset for training may not fully discover the nature of the dataset thus fail to build the reliable classification model for further prediction. To better train our model out of the dataset and build it more robust, we used a tenfold cross validation technique which divides the dataset equally into 10 folds and iterates with nine folds for training and the rest one for testing until each fold is used for testing. The previous study evidently showed and theoretically backed up that 10 is the right number of folds to get the best error estimate26.
Machine learning classification
The Weka machine learning toolkit was used for building the classification model. Before processing the dataset, we applied attribute selection techniques to select critically relevant attributes out of the overall attributes in EMR system. The selected attributes helped us to better discover the nature of the patient dataset as well as build a more efficient prediction model. When processing, we used multiple classifiers including SMO, J48, Naïve Bayes, Bayesian Network. The Naïve Bayes classifier assumes that all attributes are conditionally independent with given class label. By maximum likelihood estimation, the prior probability of each class label and the conditional likelihood of each attribute value with given class label are learned from the training dataset. After learning process, the trained classifier assigns a class label to the instance in the test set, which maximises the likelihood function. Unlike Naïve Bayes approach, Bayesian Network encodes the set of conditional independence assumption into itself. Besides, the condition is not only on the class label but on any other parents of the given attribute as well. Finally, we assign the instance to that class label which has the highest posterior probability. J48 is an open source Java implementation of the C4.5 decision tree algorithm which is an extension of the ID3 algorithm with some improvements. Two of these improvements is the ability to handle both continuous and discrete attributes and to handle training dataset with missing values. These two improvements are crucial in our study because the aortic dissection dataset has some mixed nominal and numeric attributes with some missing values. Like some other improved Support Vector Machine (SVM) algorithm, SMO is an iterative algorithm which breaks the problem into a series of sub-problems. The only difference is SMO breaks the problem to the smallest scale. A unique feature of SMO is that it solves the problem analytically thus avoids the complex iterations in traditional SVM algorithm which saves computing time and memory and excludes the errors accumulated from iterations.
Model evaluation
Model evaluation is the key to knowledge representation and to making a practical impact from model to the real world. For our study, which is a typical classification problem, it is natural to use the error rate or precision rate to measure a classifier’s performance. The selected classifier predicts the class of each instance. The precision rate is the proportion of successes made by a classifier over the whole instance set. ROC curves were also used to evaluate the model performance which can overcome the distribution bias of class labels. ROC curves plot the true positive (TP) rate on the vertical axis against the true negative (TN) rate on the horizontal axis.
3 |
4 |
5 |
TP: True Positive, FP: False Positive, TN: True Negative, FN: False Negative.
All sensitive data relative to personal information were filtered out from the study. All data collection and the study protocol were approved by the ethics committee of the First Affiliated Hospital, Xi’an Jiaotong University, in compliance with the Declaration of Helsinki. All the participants were informed of the nature and purpose of the study and signed written informed consent.
Acknowledgements
We acknowledge the cooperation from the patients who participated in our research. This study was supported by the National Natural Science Foundation of China (NSFC) [Grant No. 71390333] and Shaanxi Natural Science Foundation [Grant No. 2015JM7369].
Author Contributions
B.K. and D.H. contributed to the design of this study. B.K. and M.L. collected the data. D.H. and M.L. wrote the manuscript. D.H., B.K., M.L. and Z.Z. were involved in the analysis and interpretation of data. B.K. and Z.Z. reviewed the manuscript.
Data Availability
The masked data can be provided for reconduct and discussion upon request.
Competing Interests
The authors declare no competing interests.
Footnotes
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Sans S, Kesteloot H, Kromhout D. The burden of cardiovascular diseases mortality in Europe. Task Force of the European Society of Cardiology on Cardiovascular Mortality and Morbidity Statistics in Europe. European Heart Journal. 1997;18:1231–1248. doi: 10.1093/oxfordjournals.eurheartj.a015434. [DOI] [PubMed] [Google Scholar]
- 2.Criado FJ. Aortic dissection: a 250-year perspective. Texas Heart Institute Journal. 2011;38:694. [PMC free article] [PubMed] [Google Scholar]
- 3.Hagan PG, et al. The International Registry of Acute Aortic Dissection (IRAD): new insights into an old disease. Jama. 2000;283:897–903. doi: 10.1001/jama.283.7.897. [DOI] [PubMed] [Google Scholar]
- 4.Kim EK, et al. Presence of simple renal cysts is associated with increased risk of aortic dissection: a common manifestation of connective tissue degeneration? Heart. 2011;97:55. doi: 10.1136/hrt.2010.205328. [DOI] [PubMed] [Google Scholar]
- 5.Peng W, et al. Potential biomarkers for early diagnosis of acute aortic dissection. Heart & Lung. 2015;44:205–208. doi: 10.1016/j.hrtlng.2015.01.006. [DOI] [PubMed] [Google Scholar]
- 6.Parker FBNJ, Jr., Hanson EL, Mohiuddin S, Webb WR. Management of acute aortic dissection. Annals of Thoracic Surgery. 1975;19:436–442. doi: 10.1016/S0003-4975(10)64045-3. [DOI] [PubMed] [Google Scholar]
- 7.Slater EE, Desanctis RW. The clinical recognition of dissecting aortic aneurysm. American Journal of Medicine. 1976;60:625–633. doi: 10.1016/0002-9343(76)90496-4. [DOI] [PubMed] [Google Scholar]
- 8.Alfonso F, et al. Aortic dissection occurring during coronary angioplasty: Angiographic and transesophageal echocardiographic findings. Catheterization & Cardiovascular Diagnosis. 1997;42:412–415. doi: 10.1002/(SICI)1097-0304(199712)42:4<412::AID-CCD16>3.0.CO;2-L. [DOI] [PubMed] [Google Scholar]
- 9.Svensson LG, Crawford ES. Aortic dissection and aortic aneurysm surgery: clinical observations, experimental investigations, and statistical analyses. Part III. Current Problems in Surgery. 1992;29:913. doi: 10.1016/0011-3840(92)90003-l. [DOI] [PubMed] [Google Scholar]
- 10.Spittell PC, et al. Clinical features and differential diagnosis of aortic dissection: experience with 236 cases (1980 through 1990) Mayo Clinic Proceedings. 1993;68:642. doi: 10.1016/S0025-6196(12)60599-0. [DOI] [PubMed] [Google Scholar]
- 11.Kamp TJ, Goldschmidt-Clermont PJ, Brinker JA, Resar JR. Myocardial infarction, aortic dissection, and thrombolytic therapy. American Heart Journal. 1994;128:1234–1237. doi: 10.1016/0002-8703(94)90756-0. [DOI] [PubMed] [Google Scholar]
- 12.Hebballi R, Swanevelder J. Diagnosis and management of aortic dissection. Continuing Education in Anaesthesia Critical Care & Pain. 2009;9:14–18. doi: 10.1093/bjaceaccp/mkn044. [DOI] [Google Scholar]
- 13.Kesavaraj, G. & Sukumaran, S. In Fourth International Conference on Computing, Communications and NETWORKING Technologies 1–7.
- 14.Hermon, R. & Williams, P. Big Data in Healthcare: What is it used for? Proceedings of Australian e-Health Informatics and Security Conference, Perth, Australia, 40–49 (2014).
- 15.Bai X, Padman R, Ramsey J, Spirtes P. Tabu Search-Enhanced Graphical Models for Classification in High Dimensions. INFORMS Journal on Computing. 2008;20:423–437. doi: 10.1287/ijoc.1070.0255. [DOI] [Google Scholar]
- 16.Fan YJ, Chaovalitwongse WA. Optimizing feature selection to improve medical diagnosis. Annals of Operations Research. 2010;174:169–183. doi: 10.1007/s10479-008-0506-z. [DOI] [Google Scholar]
- 17.Miettinen K, Juhola M. Classification of Otoneurological Cases According to Bayesian Probabilistic Models. Journal of Medical Systems. 2010;34:119–130. doi: 10.1007/s10916-008-9223-z. [DOI] [PubMed] [Google Scholar]
- 18.Gartner D, Kolisch R, Neill DB, Padman R. Machine Learning Approaches for Early DRG Classification and Resource Allocation. INFORMS Journal on Computing. 2015;27:718–734. doi: 10.1287/ijoc.2015.0655. [DOI] [Google Scholar]
- 19.Witten, Ian, H., Frank & Eibe. Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series. (Morgan Kaufmann Publishers Inc., 2011).
- 20.Ramesh, D., Suraj, P. & Saini, L. In International Conference on Microelectronics, Computing and Communications 1–6.
- 21.Endlich M, et al. Long-Term Outcome and Quality of Life in Aortic Type A Dissection Survivors. Thorac Cardiovasc Surg. 2016;64:091–099. doi: 10.1055/s-0035-1548734. [DOI] [PubMed] [Google Scholar]
- 22.Buamina M, et al. Sex-Related Differences Between Patients With Symptomatic Acute Aortic Dissection. Medicine. 2016;95:e3100. doi: 10.1097/MD.0000000000003100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Leu HJ. Classification of vasculitides. A survey. Vasa Zeitschrift Für Gefässkrankheiten. 1995;24:319–324. [PubMed] [Google Scholar]
- 24.Matthews BW. Comparison of the predicted and observed secondary structure of T4 phage lysozyme. Biochim Biophys Acta. 1975;405:442–451. doi: 10.1016/0005-2795(75)90109-9. [DOI] [PubMed] [Google Scholar]
- 25.Hall, M. Correlation-based Feature Selection forMachine Learning. PhD Thesis, Waikato Univer-sity19 (1998).
- 26.Saeys Y, Inza I, Larrañaga P. A review of feature selection techniques in bioinformatics. Bioinformatics. 2007;23:2507–2517. doi: 10.1093/bioinformatics/btm344. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The masked data can be provided for reconduct and discussion upon request.