Abstract
Background & aims
Liver ultrasound scan (US) use in diagnosing Non-Alcoholic Fatty Liver Disease (NAFLD) causes costs and waiting lists overloads. We aimed to compare various Machine learning algorithms with a Meta learner approach to find the best of these as a predictor of NAFLD.
Methods
The study included 2970 subjects, 2920 constituting the training set and 50, randomly selected, used in the test phase, performing cross-validation. The best predictors were combined to create three models: 1) FLI plus GLUCOSE plus SEX plus AGE, 2) AVI plus GLUCOSE plus GGT plus SEX plus AGE, 3) BRI plus GLUCOSE plus GGT plus SEX plus AGE. Eight machine learning algorithms were trained with the predictors of each of the three models created. For these algorithms, the percent accuracy, variance and percent weight were compared.
Results
The SVM algorithm performed better with all models. Model 1 had 68% accuracy, with 1% variance and an algorithm weight of 27.35; Model 2 had 68% accuracy, with 1% variance and an algorithm weight of 33.62 and Model 3 had 77% accuracy, with 1% variance and an algorithm weight of 34.70. Model 2 was the most performing, composed of AVI plus GLUCOSE plus GGT plus SEX plus AGE, despite a lower percentage of accuracy.
Conclusion
A Machine Learning approach can support NAFLD diagnosis and reduce health costs. The SVM algorithm is easy to apply and the necessary parameters are easily retrieved in databases.
Introduction
Non-alcoholic fatty liver disease (NAFLD) is the leading cause of chronic liver disease in Western countries, as well as a condition raising the risk for cardiovascular diseases, type 2 diabetes mellitus and chronic renal disease, and increased mortality [1, 2].
Worldwide, NAFLD prevalence is currently estimated to be around 24% and is constantly increasing (from 15% in 2005 to 25% in 2010).
A meta-analysis published in 2016 reported an average prevalence of 23.71% in Europe [3]. Population-based studies conducted in our geographical area (district of Bari, Apulian Region, Italy), have estimated a NAFLD prevalence of around 30%, mainly among male subjects and elderly people [4].
NAFLD is defined as an accumulation of Triglycerides in hepatocytes (> at 5% of the liver volume) of a patient with reduced alcohol intake (<20 g / day in women or <30 g / day in men), after excluding viral infectious causes or other specific liver diseases [5].
NAFLD can manifest as pure fatty liver disease (hepato-steatosis) or as non-alcoholic steatohepatitis (NASH), an evolution of the former where the steatosis is associated with inflammation and hepatocellular injury, and with fibrogenic activation that can lead to cirrhosis and the onset of hepatocarcinoma [6].
According to recent EASL—EASD—EASO guidelines [7], at individual level the gold standard to identify steatosis in individual patients is magnetic resonance imaging (MRI), but ultrasound scanning (US) is preferred because it is more widely available and cheaper than MRI.
However, for large scale screening studies, serum bio markers and steatosis scores indexes have been preferred, as the availability and cost of US has a substantial impact on screening feasibility.7 One of the best validated INDEXES is the Fatty Liver Index (FLI) [8], although other scores or anthropometric measures work as well as the FLI in predicting the NAFLD risk [9].
The FLI is important and has widespread use in the diagnosis of NAFLD in epidemiological studies because it relies on few parameters that are easy to obtain in clinical practice.
In a previous study, we compared the performance of eight alternative hybrid methods to identify subjects with NAFLD in a large population [9]. In that setting, predictive formulas were used as a first approach strategy, to identify subjects considered to be at greater risk of steatosis, needing to be scheduled for later US. In clinical practice, it is necessary to develop support models for the diagnosis of NAFLD that include only clinical and laboratory routine parameters which are easily retrievable from health databases.
In recent years, and due to the increasing prevalence of NAFLD, there has been a research trend towards low-cost diagnostic methods, and Machine Learning has been identified as a valid tool. Machine Learning (ML) is a branch of artificial intelligence that aims to allow machines to work using intelligent “learning” software [10]. Data sets are provided, that the machine processes through algorithms which develop their own logic to perform the function, the activity or the task required. Machine Learning has already been used as a support tool for the diagnosis of some diseases and for risk quantification, such as Cardiovascular risk in patients with Diabetes Mellitus [11, 12], Ischemic Heart Disease [13] and Cancers [14].
The purpose of our study is to compare various Machine Learning algorithms, using easily available laboratory parameters, in order to find the best algorithm to support the identification of subjects at greater risk of NAFLD to be scheduled for US assessment. We also compared the performances of the trained models; to assess the true reliability and ease of use of each, and validated the results on a large population-based study.
Materials and methods
Population
Details about the study population have been published elsewhere [9]. Briefly, a prospective cohort study was conducted by the Laboratory of Epidemiology and Biostatistics of the National Institute of Gastroenterology, “Saverio de Bellis” Research Hospital (Castellana Grotte, Bari, Italy). The MICOL Study is a population-based prospective cohort study randomly drawn from the electoral list of Castellana Grotte (≥30 years old) in 1985 and followed up in 1992, 2005–2006 and 2013–2016. In 2005–2006 this cohort was added with a random sample of subjects (PANEL Study) aged 30–50 years, to compensate for the cohort aging. In this paper the baseline for the MICOL cohort was established in 2005–2006 to capture all ages.
Participants were interviewed to collect information about sociodemographic characteristics, health status, personal history and lifestyle factors. Weight was taken with the subject in underwear, on a SECA® electronic balance, and approximated to the nearest 0.1 kg. Height was measured with a wall-mounted stadiometer SECA®, approximated to 1 cm. Blood pressure (BP) measurements were performed following international guidelines [15]. The mean of 3 BP measurements was calculated.
A fasting venous blood sample was drawn, and the serum was separated into two different aliquots. One aliquot was immediately stored at −80°C. The second aliquot was used to test biochemical serum markers by standard laboratory techniques in our Central laboratory.
The study included a total of 2970 out of 3000 selected subjects; 56.5% were male. In 1985, 2472 subjects had been enrolled. The cohort was followed-up in 1992 and, in 2005–2006, 1697 from the original cohort were still present. A further sample of 1273 subjects were enrolled in 2005–2006 to compensate for the cohort aging [16]. All subjects gave informed written consent to participate.
All procedures were performed in accordance with the ethical standards of the institutional research committee (IRCCS Saverio de Bellis Research and with the 1964 Helsinki declaration, and had had Ethical Committee approval for the MICOL Study (DDG-CE-347/1984; DDG-CE-453/1991; DDG-CE-589/2004; DDG-CE 782/2013).
Model development
The decisive variables and the training sessions of the models
After the adoption of the Best Subset Selection [17], we compared three methods to choose the optimal model: Mallow’s Cp (Cp) [18], Bayesian Information Criteria (BIC) [19] and, R-squared Adjusted (R2Adj) [20].
The following 27 variables were considered when employing the three subset selection methods: Age, Sex, Height, Body Mass Index (BMI) [21], Waist Circumference (WC) [22], Systolic Blood Pressure (SBP), Diastolic Blood Pressure (DBP), A Body Shape Index (ABSI) [23], Abdominal Volume Index (AVI) [24], Body Adiposity Index (BAI) [25], Body Roundness Index (BRI) [26], Hepatic Steatosis Index (HIS) [27], Leukocyte Alkaline Phosphatase Score [28], Total Bilirubin, Alkaline Phosphatase, Glucose, Gamma-Glutamyl Transferase (GGT), Glutamic-Oxaloacetic Transaminase (GOT), Glutamate Pyruvate Transaminase (GPT), High-Density Lipoproteins Cholesterol (HDL-C), Low-Density Lipoproteins Cholesterol (LDL-C), Triglycerides, FLI, Waist-Hip Ratio (WHR) [22], Waist-to-Height Ratio (WHtR) [22], Waist/Height0.5 (WHt.5R) [29] and Diabetes. All biochemical markers were introduced in the models as continuous variables. We choose to introduce the biochemical markers as continuous variables to better reflect the natural scale of the variables. The assumption behind this choice is that the effect of categorization is the loss of information.
The following best subset results were available after running the code. Cp: Age, Sex, AVI, BRI, FLI, GGT, Glucose, GPT, HIS, PAD, WHR, WHtR, WHt.5R, WC; BIC: Age, Sex, AVI, BRI, FLI, GGT, Glucose and R2-Adj: Age, Sex, AVI, BRI, FLI, GGT, Glucose, GPT, HIS, PAD, PAS, Triglycerides, WHR, WHtR, WHt.5R and WC. The best model was then extrapolated from the BIC method, because the trade-off between number of variables and explained variation did not show substantial changes. The model was composed with one of the following indexes: FLI, AVI or BRI plus Age, Sex, Glucose and Gamma-glutamyl transpeptidase (GGT).
Table 1 shows the formulas employed in this study and the variables necessary for their construction.
Table 1. Indexes formula and their structure.
Reference | Name | Formula |
---|---|---|
Bedogni G [8] | Fatty Liver Index (FLI) | |
Guerrero-Romero F [24] | Abdominal Volume Index (AVI) | |
Thomas DM [26] | Body Roundness Index (BRI) |
Abbreviation TG = Triglycerides; BMI = Body Mass Index; GGT = Gamma-Glutamyl Transferase; WC = Waist Circumference; HC = Hip Circumference; Ht = Height (centimeters).
These formulas are simple and the variables (essential to calculate them) are easily available because they are collected and routinely assessed as anthropometric measures and biochemical markers. The selected database included all the variables essential for the construction of the predictive formulas, along with other values such as: Age, Sex, GGT and Glucose. However, GGT was not included in the model employing the FLI Index formula because it is already included in the formula.
Following the European Association for the Study of the Liver (EASL), European Association for the Study of Diabetes (EASD) and European Association for the Study of Obesity (EASO) recommendations [7], NAFLD diagnosis was performed using an ultrasound scanner Hitachi H21 Vision (Hitachi Medical Corporation, Tokyo, Japan). Examination of the visible liver parenchyma was performed with a 3.5 MHz transducer.
Model implementation
The algorithms considered were the following:
Boosting Tree Classifier (using AdaboostClassifier) [30, 31]
Decision Tree Classifier [32]
K-Nearest Neighbors Classifier [35]
Neural Network Classifier [36]
Random Forest Classifier [37]
Regularized Multinomial Classifier (use Logistic regression) [38, 39]
All eight algorithms considered were supervised learning models, except for the neural network algorithms, which consist of reinforcement learning. Afterwards, a code was developed and due to the interaction with Python, it recalled the functions of the Machine Learning contained in the Scikit-learn [42] library. This was used to compare all models in order to identify the one best suited to diagnose NAFLD, and also the model minimizing false-negative predictive values.
The actual training sessions
The models considered were the following:
2.1. FLI plus Glucose plus Age plus Sex
2.2. AVI plus Glucose plus GGT plus Age plus Sex
2.3. BRI plus Glucose plus GGT plus Age plus Sex
For each model above, all the algorithms were compared using the Meta-learner approach [43, 44]
Initially, it was important to pre-process data to eliminate missing values for features (x) or for target variables (y). Then, we obtained a dataset containing 2868 subject Subsequently, we set the algorithm to randomly extract 50 individuals from the initial database, which were used for the construction of a new dataset. This dataset was used for the prediction phase, making use of machine learning algorithms that had been adequately trained during the training phase. During the training sessions, all 2868 subject was used for the training and for the test by means of the 10-fold cross-validation approach [45]. To retrieve the best parameters for each algorithm, GridSearchCV, which is a method contained in a Scikit-learn in Python, was used. In this way, the algorithm characterized by the lowest variance was identified, to reduce the possibility of a false prediction. After selection of the best parameters for each algorithm, we carried out the prediction on a dataset of 50 individuals (generated by the prediction), thus obtaining the predicted target variable for each algorithm.
The analysis of model performance
We compared eight different types of Machine Learning algorithms, in terms of the percentage accuracy and variance for each model. The parameters used for comparison in this case were: values of accuracy, variance, calculated confidence limits (95%), the weight of each model (as a %) and the number of ultrasound examinations it could avoid.
The accuracy of a model is defined as:
[46]
More specifically, the accuracy of a model is calculated with the following formula:
[46]
Where TP = True Positive, TN = True Negative, FP = False Positive and FN = False Negative.
Another aspect to take into account was the value of variance for each algorithm, as a key indicator of the variability of a dataset.
A low value of variance means that the predictions have a percentage of accuracy with a low variability. Differently a high value of variance indicates a weak prediction also in case of high value of accuracy percentage.
Another fundamental parameter we considered was the weight of the algorithms, i.e. the value indicating the algorithm with the best performance in the test phase.
Of 100%, each algorithm had a weight percentage, and the one that contributed most to the sum of all the weights was taken as the best algorithm. In addition, for each algorithm under study, the optimal parameters for the future development of an algorithm with the absolute highest values of accuracy and weight were calculated.
Results
Subjects characteristics and the performance of each of the three indexes considered are shown in Table 2. NAFLD prevalence was 31.55% and was, as expected, more prevalent among men. Subjects with NAFLD were a little older, and had increased levels of Glucose and GGT.
Table 2. Subset subject characteristics by NAFLD condition.
MICOL Study, Castellana Grotte (BA), Italy, 2005.
Variables | NAFLD | |
---|---|---|
Absent | Present | |
N (%) | 2033 (68.45) | 937(31.55) |
Sex | ||
Female | 1007 (49.5) | 286 (30.5) |
Male | 1026 (50.5) | 651 (69.5) |
Age | 54.04 (15,61) | 55.42 (13,67) |
FLI | 31,09 (25,50) | 64,11 (24,47) |
AVI | 15,96 (4,27) | 21,15 (4,96) |
BRI | 4,48 (1,66) | 6,21 (1,89) |
GLUCOSE | 105,51 (24,03) | 117,62 (33,13) |
GGT | 14,60 (15,18) | 20,57 (19,75) |
Cells reporting subject characteristics contain mean (±SD) or n (%).
FLI plus glucose plus age plus sex predictive model
In the training session of the Model employing FLI plus Glucose plus Age plus Sex as predictive variables, the Boosting Tree was considered as the algorithm with greatest accuracy, characterized by 76% accuracy and by a weight factor of 19.02% while the algorithm with the lowest variance was the Support Vector Machine with 68% accuracy, lower than the Boosting Tree, but the weight of the model was 27.35%. Compared to the Boosting Tree, the Support Vector Machine made less mistakes during the testing phase.
In Fig 1 we show the trend of the error in the test phase and the trend of the error in the training phase for each model.
Fig 1. Machine learning applied to NAFLD diagnosis.
Training Error vs Test Error for the FLI plus Glucose plus Age plus Sex Predictive Model.
AVI plus glucose plus GGT plus age plus sex predictive model
In the training session of the Model employing AVI plus Glucose plus GGT plus Age plus Sex as the predictive variables, the Regularized Multinomial (RM) was considered as the algorithm with the greatest accuracy, characterized by 76% accuracy and by a weight factor of 28.32%. while the algorithm with the lowest variance was the Support Vector Machine, with 68% accuracy, lower than the RM but the weight of the algorithm was 32.62%.
Compared to the RM, the Support Vector Machine made less mistakes during the testing phase.
In Fig 2 we show the trend of the error in the test phase and the trend of the error in the training phase for each model.
Fig 2. Machine learning applied to NAFLD diagnosis.
Training Error vs Test Error for the AVI plus Glucose plus GGT plus Age plus Sex Predictive Model.
BRI plus glucose plus GGT plus age plus sex predictive model
In the training session of the Model employing BRI plus Glucose plus GGT plus Age plus Sex as the predictive variables, the Nearest Neighbor was considered as the algorithm with the greatest accuracy, characterized by 77% accuracy and by a weight factor of 22.21%, while the algorithm with the lowest variance was the Support Vector Machine, with 77% accuracy, equivalent to the Regularized Multinomial RM, but the model weight was 34.70%.
Despite having the same percentage of accuracy as the RM, the Support Vector Machine made less mistakes during the testing phase.
In Fig 3 we show the trend of the error in the test phase and the trend of the error in the training phase for each model.
Fig 3. Machine Learning Applied to NAFLD Diagnosis.
Training Error vs Test Error for the BRI plus Glucose plus GGT plus Age plus Sex Predictive Model.
Figs 4, 5 and 6 show the results, in terms of accuracy, of the test with 95% confidence intervals, the weight of the model and also the value of the variance.
Fig 4. Machine learning applied to NAFLD diagnosis.
Forest Plot for the FLI plus Glucose plus Age plus Sex Predictive Model.
Fig 5. Machine learning applied to NAFLD diagnosis.
Forest Plot for the AVI plus Glucose plus GGT plus Age plus Sex Predictive Model.
Fig 6. Machine learning applied to NAFLD diagnosis.
Forest Plot for the BRI plus Glucose plus GGT plus Age plus Sex Predictive Model.
From the above Figures, it can be seen that the SVM algorithm, even if it did not feature the highest accuracy percentage, was the best for predicting NAFLD.
From the output of the three algorithms, we have also calculated the value of sensibility, specificity, predictive value of a positive and negative test in the prediction phase and are shown in Table 3.
Table 3.
Sensibility | Specificity | VPP | VPN | |
---|---|---|---|---|
FLI plus Glucose plus Age plus Sex | 0.979 | 1.00 | 1.00 | 0.990 |
AVI plus Glucose plus GGT plus Age plus Sex | 0.985 | 1.00 | 1.00 | 0.993 |
BRI plus Glucose plus GGT plus Age plus Sex | 0.967 | 0.99 | 0.997 | 0.990 |
Discussion
In this study, the search for the best algorithm to support NAFLD diagnosis was conducted by comparing the different Machine Learning algorithms for each model.
Specifically, the model with a high level of accuracy and the model with the lowest level of variance were identified in this research. The Machine Learning model presenting the lowest variance was selected.
Today, the search for non-invasive methods is very important, considered as an alternative to the expensive NAFLD diagnostic tools (MRI, Ultrasound). The reorganization of the National Health System requires closer consideration of aspects linked to the performance together with the factors linked to cost-reduction and waiting times. The aim of our study was to use new, modern Machine Learning techniques to support medical decisions during the diagnostic phase using easier and cheaper tools, thus reducing both the costs and waiting times inherent to the use of instrumental methods.
In view of the results of this study, it is possible to state that the most appropriate Machine Learning Algorithm is the Support Vector Machine in Python. In particular, the Support Vector Machine employing AVI plus Glucose plus GGT plus Sex plus Age, despite having an almost identical percentage of accuracy and weight to the other models, produced fewer prediction errors in the test step. We obtained in the test phase for the models composed of FLI plus Glucose plus Age plus Sex and AVI plus Glucose plus GGT plus Age plus Sex a percentage error equal to 32% while for the model composed of BRI plus Glucose plus GGT plus Age plus Sex an error of 23%. However, in the prediction phase, the model that made fewer errors was the one composed of AVI plus Glucose plus GGT plus Age plus Sex with an error of 20% while FLI plus Glucose plus Age plus Sex 26% and BRI plus Glucose plus GGT plus Age plus Sex 28%.
Therefore, AVI plus Glucose plus GGT plus Sex plus Age was the model that contributed most to reducing unnecessary ultrasound examinations.
The good performance of ML Algorithms used to identify NAFLD, applying common anthropometric parameters and other variables, has shown them to be a valid alternative to classic Indexes [47, 48].
Moreover, the SVM was well able to identify subjects without NAFLD. From an ethical perspective, the model with the lowest variance is the best one, as it is characterized by a smaller number of false negatives, despite a lower percentage of accuracy.
This kind of study underlines the fact that this ML Algorithm can be used to find subjects at high risk of NAFLD, who need to undergo US. Furthermore, as low-risk subjects do not undergo US, 81.9% of unnecessary US examinations could be avoided (this value was calculated as the ratio of the total number of subjects in the test set divided by the total number of subjects in the test set plus the number of incorrect predictions.)
Some methodological issues need to be considered. A strength of this study is the population-based random sample from which the observations were drawn. The NAFLD prevalence in the sample is a good estimator of the population prevalence and its age-sex distribution. Limitations include both the limited number of observations and the method used to perform NAFLD diagnosis. It may be criticized the low sensibility of the NAFLD diagnostic methodology, as it fails to detect fatty liver content >25–90% [49]. However, this is a population-based study and subjects were chosen from the electoral register. They did not seek medical attention and participated on volunteer basis. Then, the diagnosis of NAFLD performed by US was the only diagnostic procedure we could propose to the participants. Ethical issues prevent us to propose biopsy or H-MRS. Moreover, and to lighten the waiting lists, our purpose was to find out a machine learning algorithm that permit us to avoid a number of USs which otherwise would have been prescribed. Then, this algorithm is useful to exclude NAFLD and as valid support diagnostic in the context of epidemiologic studies and not as replacement diagnostic tool.
In conclusion, this model, like others based on ML Algorithms, may be considered as a valid support for medical decision making as regards health policies, in epidemiological studies and screening.
Supporting information
(XLS)
Acknowledgments
Computations for this research were performed in the Laboratory of Epidemiology and Biostatistics, National Institute of Gastroenterology, “S de Bellis” Research Hospital, MICOL Working Group: Vittorio Pugliese (Laboratory of Epidemiology and Biostatistics), Mario Correale, Palma Iacovazzi, Anna Mastrosimini, Giampiero De Michele (Laboratory of Clinical Pathology), Osvaldo Burattini (Unit of Gastroenterology), Valeria Tutino, Benedetta D’Attoma (Laboratory of Nutritional Biochemistry), Maria R Noviello (Department of Radiology), National Institute of Gastroenterology “S de Bellis” Research Hospital, Castellana Grotte (BA), Italy.
Abbreviations
- ABSI
A Body Shape Index
- AVI
Abdominal Volume Index
- BAI
Body Adiposity Index
- BIC
Bayesian Information Criteria
- BMI
Body Mass Index
- BP
Blood pressure
- BRI
Body Roundness Index
- Cp
Mallow’s
- DBP
Diastolic Blood Pressure
- FLI
Fatty Liver Index
- FN
False Negative
- FP
False Positive
- GGT
Gamma-Glutamyl Transferase
- GOT Glucose
Glutamic-Oxaloacetic Transaminase
- GPT
Glutamate Pyruvate Transaminase
- HDL-C
High-Density Lipoproteins Cholesterol
- HIS
Hepatic Steatosis Index
- LDL-C
Low-Density Lipoproteins Cholesterol
- ML
machine learning
- MRI
magnetic resonance imaging
- NAFLD
Non-Alcoholic Fatty Liver Disease
- NASH
non-alcoholic steatohepatitis
- R2-Adj
R-squared Adjusted
- RM
Regularized Multinomial
- SBP
Systolic Blood Pressure
- SVM
Support Vector Machine
- TN
True Negative
- TP
True Positive
- US
ultrasound scan
- WC
waist circumference
- WHR
Waist-Hip Ratio
- WHt.5R
Waist/Height0.5
- WHtR
Waist-to-Height Ratio
Data Availability
All relevant data are within the manuscript and its Supporting Information files.
Funding Statement
This work was funded by a grant from the Ministry of Health, Italy (Progetto Finalizzato del Ministero della Salute, ICS 160.2/RF 2003), 2004/2006) and by Apulia Region-D.G.R. n. 1159, 28/6/2018 and 2019.
References
- 1.Fazel Y, Koenig AB, Sayiner M, Goodman ZD, Younossi ZM. Epidemiology and natural history of non-alcoholic fatty liver disease. Metabolism. 2016;65(8):1017–25. Epub 2016/03/22. 10.1016/j.metabol.2016.01.012 . [DOI] [PubMed] [Google Scholar]
- 2.Levene AP, Goldin RD. The epidemiology, pathogenesis and histopathology of fatty liver disease. Histopathology. 2012;61(2):141–52. Epub 2012/03/01. 10.1111/j.1365-2559.2011.04145.x . [DOI] [PubMed] [Google Scholar]
- 3.Younossi ZM, Koenig AB, Abdelatif D, Fazel Y, Henry L, Wymer M. Global epidemiology of nonalcoholic fatty liver disease-Meta-analytic assessment of prevalence, incidence, and outcomes. Hepatology. 2016;64(1):73–84. Epub 2015/12/29. 10.1002/hep.28431 . [DOI] [PubMed] [Google Scholar]
- 4.Cozzolongo R, Osella AR, Elba S, Petruzzi J, Buongiorno G, Giannuzzi V, et al. Epidemiology of HCV infection in the general population: a survey in a southern Italian town. The American journal of gastroenterology. 2009;104(11):2740–6. Epub 2009/07/30. . [DOI] [PubMed] [Google Scholar]
- 5.Neuschwander-Tetri BA, Caldwell SH. Nonalcoholic steatohepatitis: summary of an AASLD Single Topic Conference. Hepatology. 2003;37(5):1202–19. Epub 2003/04/30. 10.1053/jhep.2003.50193 . [DOI] [PubMed] [Google Scholar]
- 6.Ratziu V, Bellentani S, Cortez-Pinto H, Day C, Marchesini G. A position statement on NAFLD/NASH based on the EASL 2009 special conference. Journal of hepatology. 2010;53(2):372–84. Epub 2010/05/25. 10.1016/j.jhep.2010.04.008 . [DOI] [PubMed] [Google Scholar]
- 7.Mahana Douglas, T CM, Kurtz Zachary D., Bokulich Nicholas A., Battaglia Thomas, Chung Jennifer, Müller Christian L., et al. Antibiotic perturbation of the murine gut microbiome enhances the adiposity, insulin resistance, and liver disease associated with high-fat diet. Genome Medicine. 2016;8:48 10.1186/s13073-016-0297-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bedogni G, Bellentani S, Miglioli L, Masutti F, Passalacqua M, Castiglione A, et al. The Fatty Liver Index: a simple and accurate predictor of hepatic steatosis in the general population. BMC Gastroenterol. 2006;6:33 Epub 2006/11/04. 10.1186/1471-230X-6-33 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Procino F, Misciagna G, Veronese N, Caruso MG, Chiloiro M, Cisternino AM, et al. Reducing NAFLD-screening time: A comparative study of eight diagnostic methods offering an alternative to ultrasound scans. Liver International. 2019;39(1):187–96. 10.1111/liv.13970 [DOI] [PubMed] [Google Scholar]
- 10.Eihab Bashier Mohammed Bashier MM, and Muhammad Badruddin Khan. Machine Learning: Algorithms and Applications2016.
- 11.Napoli C, Benincasa G, Schiano C, Salvatore M. Differential Epigenetic Factors in the Prediction of Cardiovascular Risk in Diabetic Patients. Eur Heart J Cardiovasc Pharmacother. 2019. Epub 2019/10/31. 10.1093/ehjcvp/pvz062 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Dagliati A, Marini S, Sacchi L, Cogni G, Teliti M, Tibollo V, et al. Machine Learning Methods to Predict Diabetes Complications. Journal of Diabetes Science and Technology. 2018;12(2):295–302. 10.1177/1932296817706375 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Kukar M, Kononenko I, Groselj C, Kralj K, Fettich J. Analysing and improving the diagnosis of ischaemic heart disease with machine learning. Artif Intell Med. 1999;16(1):25–50. Epub 1999/05/04. 10.1016/s0933-3657(98)00063-3 . [DOI] [PubMed] [Google Scholar]
- 14.Kourou Konstantina, E TP, Exarchos Konstantinos P., Karamouzis Michalis V., and Fotiadis Dimitrios I. "Machine learning applications in cancer prognosis and prediction". Computational and Structural Biotechnology Journal, vol 13 2014. 10.1016/j.csbj.2014.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Sever P. New hypertension guidelines from the National Institute for Health and Clinical Excellence and the British Hypertension Society. J Renin Angiotensin Aldosterone Syst. 2006;7(2):61–3. Epub 2006/11/04. 10.3317/jraas.2006.011 . [DOI] [PubMed] [Google Scholar]
- 16.Osella AR, Misciagna G, Leone A, Di Leo A, Fiore G. Epidemiology of hepatitis C virus infection in an area of Southern Italy. Journal of hepatology. 1997;27(1):30–5. Epub 1997/07/01. 10.1016/s0168-8278(97)80276-0 . [DOI] [PubMed] [Google Scholar]
- 17.James G, and Witten D, and Hastie T, and Tibshirani R. Linear Model Selection and Regularization An Introduction to Statistical Learning: with Applications in R: Springer; New York; 2013. p. 203–64. [Google Scholar]
- 18.Stevens VL, Jacobs EJ, Patel AV, Sun J, McCullough ML, Campbell PT, et al. Weight cycling and cancer incidence in a large prospective US cohort. Am J Epidemiol. 2015;182(5):394–404. Epub 2015/07/26. 10.1093/aje/kwv073 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Vrieze SI. Model selection and psychological theory: A discussion of the differences between the Akaike information criterion (AIC) and the Bayesian information criterion (BIC). Psychological Methods. 2012;17(2):228–43. 10.1037/a0027127 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Miles J. R Squared, Adjusted R Squared. American Cancer Society; 2014. 10.1002/9781118445112.stat06627 [DOI] [Google Scholar]
- 21.Nuttall FQ. Body Mass Index: Obesity, BMI, and Health: A Critical Review. Nutrition Today. 2015;50(3). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Bacopoulou F, Efthymiou V, Landis G, Rentoumis A, Chrousos GP. Waist circumference, waist-to-hip ratio and waist-to-height ratio reference percentiles for abdominal obesity among Greek adolescents. BMC Pediatr. 2015;15:50 Epub 2015/05/04. 10.1186/s12887-015-0366-z . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Lee D-Y, Lee M-Y, Sung K-C. Prediction of Mortality with A Body Shape Index in Young Asians: Comparison with Body Mass Index and Waist Circumference. Obesity. 2018;26(6):1096–103. 10.1002/oby.22193 [DOI] [PubMed] [Google Scholar]
- 24.Guerrero-Romero F, Rodríguez-Morán M. Abdominal volume index. An anthropometry-based index for estimation of obesity is strongly related to impaired glucose tolerance and type 2 diabetes mellitus. Arch Med Res. 2003;34(5):428–32. Epub 2003/11/07. 10.1016/S0188-4409(03)00073-0 . [DOI] [PubMed] [Google Scholar]
- 25.Freedman DS, Thornton JC, Pi-Sunyer FX, Heymsfield SB, Wang J, Pierson RN Jr., et al. The body adiposity index (hip circumference ÷ height(1.5)) is not a more accurate measure of adiposity than is BMI, waist circumference, or hip circumference. Obesity (Silver Spring, Md). 2012;20(12):2438–44. Epub 2012/04/10. 10.1038/oby.2012.81 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Thomas DM, Bredlau C, Bosy-Westphal A, Mueller M, Shen W, Gallagher D, et al. Relationships between body roundness with body fat and visceral adipose tissue emerging from a new geometrical model. Obesity (Silver Spring, Md). 2013;21(11):2264–71. Epub 2013/03/23. 10.1002/oby.20408 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Lee JH, Kim D, Kim HJ, Lee CH, Yang JI, Kim W, et al. Hepatic steatosis index: a simple screening tool reflecting nonalcoholic fatty liver disease. Digestive and liver disease: official journal of the Italian Society of Gastroenterology and the Italian Association for the Study of the Liver. 2010;42(7):503–8. Epub 2009/09/22. 10.1016/j.dld.2009.08.002 . [DOI] [PubMed] [Google Scholar]
- 28.Lipshitz J, Limaye S, Patel D. Leukocyte Alkaline Phosphatase Score as a Marker of Severity and Progression of Myelodysplastic Syndrome. Blood. 2007;110(11):4625-. 10.1182/blood.V110.11.4625.4625 [DOI] [Google Scholar]
- 29.Nevill AM, Duncan MJ, Lahart IM, Sandercock GR. Scaling waist girth for differences in body size reveals a new improved index associated with cardiometabolic risk. Scand J Med Sci Sports. 2017;27(11):1470–6. Epub 2016/10/12. 10.1111/sms.12780 . [DOI] [PubMed] [Google Scholar]
- 30.Nevatia WBaR. “Cluster Boosted Tree Classifier for Multi-View, Multi-Pose Object Detection.”. 2007 IEEE 11th International Conference on Computer Vision. 2007: 1–8.
- 31.Freund Y, Schapire RE. A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting. J Comput Syst Sci. 1997;55(1):119–39. 10.1006/jcss.1997.1504 [DOI] [Google Scholar]
- 32.Wu C-C, Chen Y-L, Liu Y-H, Yang X-Y. Decision tree induction with a constrained number of leaf nodes. Applied Intelligence. 2016;45(3):673–85. 10.1007/s10489-016-0785-z [DOI] [Google Scholar]
- 33.Panda M, Patra M. Network intrusion detection using naive bayes. 2007; 7.
- 34.Larsen K. Generalized Naive Bayes Classifiers. SIGKDD Explor Newsl. 2005;7(1):76–81. 10.1145/1089815.1089826 [DOI] [Google Scholar]
- 35.Zuo W, Zhang D, Wang K. On kernel difference-weighted k-nearest neighbor classification. Pattern Analysis and Applications. 2008;11(3):247–57. 10.1007/s10044-007-0100-z [DOI] [Google Scholar]
- 36.SC. W. Artificial Neural Network. In: Interdisciplinary Computing in Java Programming. The Springer International Series in Engineering and Computer Science. 2003; 743. 10.1007/978-1-4615-0377-4_5. [DOI]
- 37.Pal M. Random forest classifier for remote sensing classification. International Journal of Remote Sensing. 2005;26(1):217–22. 10.1080/01431160412331269698 [DOI] [Google Scholar]
- 38.Li J, Qian Y, editors. Regularized Multinomial Regression Method for Hyperspectral Data Classification via Pathwise Coordinate Optimization. 2009 Digital Image Computing: Techniques and Applications; 2009 1–3 Dec. 2009.
- 39.Yu H-F, Huang F-L, Lin C-J. Dual coordinate descent methods for logistic regression and maximum entropy models. Mach Learn. 2011;85(1–2):41–75. 10.1007/s10994-010-5221-8 [DOI] [Google Scholar]
- 40.Karatzoglou A, Meyer D, Hornik K. Support Vector Machines in R. 2006. 2006;15(9):28 Epub 2005-12-05. 10.18637/jss.v015.i09 [DOI] [Google Scholar]
- 41.Cristianini N., S-T J. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods. Press CU, editor 2000. [Google Scholar]
- 42.Pedregosa F, V G. and Gramfort A. and Michel V., and Thirion B, G O. and Blondel M. and Prettenhofer P., et al. Scikit-learn: Machine Learning in Python. Journal of Machine Learning Research. 2011;12:2825–30. [Google Scholar]
- 43.MELANIE H Aa K. MODEL SELECTION VIA META-LEARNING: A COMPARATIVE STUDY. International Journal on Artificial Intelligence Tools. 2001;10:525–54. 10.1142/S0218213001000647. [DOI] [Google Scholar]
- 44.Giovanni C. A Super-Learning Machine for Predicting Economic Outcomes. MPRA Paper. 2020.
- 45.Bro R, Kjeldahl K, Smilde AK, Kiers HA. Cross-validation of component models: a critical look at current methods. Anal Bioanal Chem. 2008;390(5):1241–51. Epub 2008/01/25. 10.1007/s00216-007-1790-1 . [DOI] [PubMed] [Google Scholar]
- 46.Biswas AK, Noman N, Sikder AR. Machine learning approach to predict protein phosphorylation sites by incorporating evolutionary information. BMC Bioinformatics. 2010;11:273 Epub 2010/05/25. 10.1186/1471-2105-11-273 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Yip TC-F, Ma AJ, Wong VW-S, Tse Y-K, Chan HL-Y, Yuen P-C, et al. Laboratory parameter-based machine learning model for excluding non-alcoholic fatty liver disease (NAFLD) in the general population. Alimentary Pharmacology & Therapeutics. 2017;46(4):447–56. 10.1111/apt.14172 [DOI] [PubMed] [Google Scholar]
- 48.Canbay A, Kälsch J, Neumann U, Rau M, Hohenester S, Baba HA, et al. Non-invasive assessment of NAFLD as systemic disease-A machine learning perspective. PLoS One. 2019;14(3):e0214436 Epub 2019/03/27. 10.1371/journal.pone.0214436 declare. JPS and AC state that they received royalties for a scientific lecture, which was in part supported by TECOmedical group. This does not alter the authors’ adherence to PLOS ONE policies on sharing data and materials. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Saadeh S, Younossi ZM, Remer EM, Gramlich T, Ong JP, Hurley M, et al. The utility of radiological imaging in nonalcoholic fatty liver disease. Gastroenterology. 2002;123(3):745–50. Epub 2002/08/29. 10.1053/gast.2002.35354 . [DOI] [PubMed] [Google Scholar]