Abstract
Purpose: The number of patients with alcohol-related problems is steadily increasing. A large-scale survey of alcohol-related problems has been conducted. However, studies that predict hazardous drinkers and identify which factors contribute to the prediction are limited. Thus, the purpose of this study was to predict hazardous drinkers and the severity of alcohol-related problems of patients using a deep learning algorithm based on large-scale survey data.
Materials and Methods: Datasets of National Health and Nutrition Examination Survey of South Korea (K-NHANES), a nationally representative survey for the entire South Korean population, were used to train deep learning and conventional machine learning algorithms. Datasets from 69,187 and 45,672 participants were used to predict hazardous drinkers and the severity of alcohol-related problems, respectively. Based on the degree of contribution of each variable to deep learning, it was possible to determine which variable contributed significantly to the prediction of hazardous drinkers.
Results: Deep learning showed higher performance than conventional machine learning algorithms. It predicted hazardous drinkers with an AUC (area under the receiver operating characteristic curve) of 0.870 (Logistic regression: 0.858, Linear SVM: 0.849, Random forest classifier: 0.810, K-nearest neighbors: 0.740). Among 325 variables for predicting hazardous drinkers, energy intake was the factor showing the greatest contribution to the prediction, followed by carbohydrate intake. Participants were classified into Zone I, Zone II, Zone III, and Zone IV based on the degree of alcohol-related problems, showing AUCs of 0.881, 0.774, 0.853, and 0.879, respectively.
Conclusion: Hazardous drinking groups could be effectively predicted and individuals could be classified according to the degree of alcohol-related problems using a deep learning algorithm. This algorithm could be used to screen people who need treatment for alcohol-related problems among the general population or hospital visitors.
Keywords: machine learning, deep learning, hazardous drinkers, hazardous drinking, alcohol related problems, alcohol use disorder, alcohol dependence, K-NHANES
Introduction
Health problems associated with alcohol-related diseases are becoming prevalent worldwide (1). Common and mild alcohol related diseases tend to ease in adolescence. However, more serious diseases can become chronic and require long-term medical and psychological management (2). Alcohol-related disorders can cause significant disabilities. They are associated with many physical and mental illnesses (3–5). Alcohol-related disorders share similar clinical course and are associated with substantially increased morbidity and mortality (6, 7). Because of these risks, large-scale studies have been conducted in many countries for decades to identify and predict risk factors, prevalence, and prognosis of alcohol use disorders.
Although previous studies have warned of the dangers of alcohol, patients who are dependent on or addicted to alcohol rarely visit the hospital because they lack insight and underestimate the urgency of treatment (8). In addition, patients might be hesitant to seek treatment due to the stigma surrounding mental illness, including alcohol-related disorders. One study found that patients who perceived a higher stigma for alcohol-related disorders were less likely to seek treatment (9). Most patients with alcohol problems start to search for clinical help only when they have serious complications such as alcoholic hepatitis, cardiovascular disease, or gastrointestinal cancer. In the United States, only 13% of alcohol-dependent patients receive specialized treatment and only 24% of them seek some kind of help (10).
In addition, it is important to intervene at an early stage to minimize alcohol-related damage to patients with alcohol-related problems. To allow intervention before the harmful consequences of drinking occur, the concept of hazardous drinking as an early stage of alcoholism has been introduced (11). Hazardous drinking is defined as a pattern of alcohol consumption that places patients at risk of adverse health events (12). A recent longitudinal study reported that after 6 years of follow-up of individuals who were hazardous drinkers, 46.9% of them still had alcohol use disorder and 15.4% of them had become even more severe (13). Also, hazardous drinkers were more likely to have hypertension, diabetes, and COPD than abstainers after the follow-up period (13). However, if hazardous drinkers are identified early and receive intervention, these risks could be reduced with relatively little effort. In one study, hazardous drinkers were offered regular telephone counseling for 1 year (14). As a result, about 5% of hazardous drinkers stopped drinking alcohol, and about 30% were reclassified as low-risk drinkers at the endpoint (14). To screen hazardous drinkers and identify them early, studies have been conducted on which demographic, social, and biological predictors of alcohol use disorder can contribute to alcohol-related problems.
Recently, deep learning has been actively used to screen and predict psychiatric diseases based on these predictors (15). A recent study has effectively identified patients with depression based on data of a large-scale survey from the United States, the National Health and Nutrition Examination Survey (16). Another study has also identified anxious and depressive participants using socio-demographic, occupational, and health-related information (17). On the other hand, early and mid-stage alcohol dependent patients with habitual drinking are often unaware of serious consequences of their alcohol dependence. If these unrecognized patients who normally receive primary care, health check-ups, and outpatient care other than psychiatry can be screened by a deep learning algorithm using predictors, intervention and management could take place at an earlier time to slow down the progression of diseases. However, to the best of our knowledge, no research has demonstrated the use of deep learning for screening and predicting hazardous drinkers based on large-scale data until now.
Large-scale surveys for disease prevalence and risk factor analysis have been conducted in various countries. The Korea National Health and Nutrition Examination Survey (K-NHANES) has been conducted every year in Korea. In this survey, participants were asked detailed questions regarding their alcohol-related problems, which allowed them to be classified into one of four levels: Zone I, Zone II, Zone III, and Zone IV (12). If deep learning could predict the severity of these problems, it could be used as a reference for determining whether a patient needs hospitalization and the length of treatment.
In this study, we predicted hazardous drinkers in a large survey dataset through a deep learning algorithm and determined which factors contributed more to the deep learning process. In addition, performances of machine learning techniques other than deep learning such as support vector machine, logistic regression, and K-nearest neighbors were determined and compared to the performance of deep learning. We also determined whether deep learning could accurately predict the severity of alcohol-related problems in one of four severity levels.
Materials and Methods
Datasets
Datasets of the National Health and Nutrition Examination Survey of South Korea (K-NHANES), a nationally representative survey with a complex, multi-stage stratified sample design for the entire South Korean population, were utilized to train deep learning. The K-NHANES is a nationwide survey conducted by the Division of Chronic Disease Surveillance, Korea Centers for Disease Control and Prevention since 1998 (18). Data of the K-NHANES IV (surveyed from 2007 to 2009), V (surveyed from 2010 to 2012), VI (surveyed from 2013 to 2015), and VII (surveyed from 2016 to 2018) were used. A professional survey team of the Korea Centers for Disease Control and Prevention was formed to conduct annual surveys that could produce statistics every year without a seasonal bias. Among 303,180 geographically defined sampling units, 200 (K-NHANES IV) or 192 (K-NHANES V, VI, VII) primary sampling units (PSUs) were sampled considering administrative districts and housing types. A total of 23 (K-NHANES IV, VII) or 20 (K-NHANES V, VI) households were systematically selected with intra-stratification of age, gender, and residential area from each PSU, which contained 60 households on average. Nearly 10,000 individuals aged 1 year or more were targeted for K-NHANES. Subjects were then divided into three groups according to their stage of life: children (aged 1~11 years), adolescents (aged 12~18 years), and adults (aged 19 years and over). Appropriate survey categories were then applied. K-NHANES not only collected questionnaire data such as demographic characteristics and dietary habits, but also obtained medical examination data using various laboratory tests.
In the analysis of K-NHANES, we included all variables from datasets. Initially, to make a model for predicting hazardous drinkers, we had 97,622 individuals and 795 variables from K-NHANES IV, V, VI, and VII. Of these 97,622 individuals, only those aged 19 years or more who responded appropriately to questions of Alcohol Use Disorders Identification Test (AUDIT) were analyzed. Variables directly related to alcohol use (e.g., monthly drinking rate, experience of driving under the influence of alcohol in the past 1 year) were excluded. Also, values of “9,” “99,” “999,” and “9999” meaning “I don't know about that question” for continuous variables were regarded as missing values because these values could mislead the prediction. In addition, variables created by statistical need, such as weights of variable and estimation of variance, were deleted. Variables with more than 18% of missing values were also excluded. As a result, 325 of 795 variables were utilized to build the deep learning model. These variables are listed in Supplementary Table 1. Therefore, 69,187 participants and 325 variables were used in the analysis to predict the hazardous drinking group (Figure 1A).
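The missing-value handling described above can be sketched as follows. This is a minimal illustration, not the study's actual pipeline: the function name `clean_variables`, the toy array, the column names, and the per-column mapping of non-response codes are all assumptions made for the example. Only the idea of recoding non-response codes as missing and the 18% exclusion threshold come from the text.

```python
import numpy as np

def clean_variables(X, names, nonresponse_codes, max_missing=0.18):
    """Recode non-response codes as NaN, then drop sparsely observed variables.

    X                 : 2-D array-like, rows = participants, columns = variables
    names             : column names, in the same order as the columns of X
    nonresponse_codes : dict mapping a column name to its "don't know" code
                        (hypothetical mapping, e.g. 9, 99, 999, or 9999)
    max_missing       : variables with a larger missing fraction are dropped
    """
    X = np.array(X, dtype=float)  # copy, so the caller's data is left untouched
    for j, name in enumerate(names):
        code = nonresponse_codes.get(name)
        if code is not None:
            X[X[:, j] == code, j] = np.nan
    missing_fraction = np.isnan(X).mean(axis=0)
    keep = missing_fraction <= max_missing
    kept_names = [n for n, k in zip(names, keep) if k]
    return X[:, keep], kept_names, missing_fraction

# Toy data (invented for illustration): six participants, three variables.
X_toy = [
    [63.0, 9.0, 1.0],
    [71.5, 2.0, 0.0],
    [999.0, 9.0, 0.0],
    [58.2, 1.0, 1.0],
    [66.0, 3.0, 0.0],
    [80.1, 2.0, 1.0],
]
names = ["weight", "item_a", "item_b"]
codes = {"weight": 999, "item_a": 9}

X_clean, kept, frac = clean_variables(X_toy, names, codes)
print(kept)  # "item_a" is dropped: 2 of 6 responses are non-responses
```

Passing `max_missing=0.25` would give the threshold that the text reports for the severity model.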
To establish a model for predicting the severity of alcohol-related problems, only K-NHANES survey years in which all 10 questions of AUDIT were asked could be used. In some survey years, only four questions rather than all 10 questions were examined in a simplified form. Although these four questions could distinguish hazardous drinkers, they were insufficient for classifying severity. Therefore, we created a model to predict the severity using datasets of K-NHANES IV, V, and a part of VI (2013, 2015) that investigated all 10 questions. Initially, 65,803 participants were extracted from the above datasets. Of them, only those aged 19 years or more who responded correctly to AUDIT questions were used in the analysis. We removed alcohol-related variables and regarded non-response values as missing values. We then excluded variables with more than 25% missing values. Thus, a total of 392 variables were used for the final analysis (Supplementary Table 2). Finally, 45,672 individuals and 392 variables were analyzed to make the model for predicting the severity of alcohol-related problems (Figure 1B).
Evaluation of Alcohol Use
The Alcohol Use Disorders Identification Test (AUDIT) is one of the most reliable screening tools to test whether someone has alcohol-related problems (12). In 1989, the World Health Organization devised the AUDIT to screen patients with hazardous drinking, harmful drinking, and alcohol dependence. It consists of 10 questions to inquire about quantity and frequency of drinking (Questions 1–3, consumption score), symptoms of dependence (Questions 4–6, dependence score), and alcohol-related problems caused by harmful drinking (Questions 7–10, alcohol-related problem score). Recent studies have emphasized that AUDIT is more effective than the CAGE questionnaire (Cut down, Annoyed, Guilty, and Eye opener), a previously used test, for diagnosing hazardous drinking, alcohol abuse, and alcohol dependence (19).
To be used in various situations, there are many abbreviated versions of AUDIT, including AUDIT-QF (only contains Questions 1 and 2 of AUDIT), AUDIT-C (Questions 1, 2, 3), AUDIT-4 (Questions 1, 2, 3, 10) (20–27). Since the start of K-NHANES, not all AUDIT questions have always been included in K-NHANES. K-NHANES IV, V, and a part of VI (2013, 2015) included all AUDIT questions in the survey. However, K-NHANES VII and a part of K-NHANES VI (2014) used AUDIT-4 (Questions 1, 2, 3, 10), a simplified version of AUDIT. The abbreviated version of AUDIT had the advantage of being simple. It could classify hazardous drinkers. However, it could not classify the severity of alcohol-related problems. Only the full version of the AUDIT could divide the severity of alcohol problems into four stages: Zone I, Zone II, Zone III, and Zone IV. Zone IV is the most serious condition. Details such as the cut off value of each test are described in Supplementary Methods (28, 29).
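The structure of the AUDIT described above can be illustrated with a short scoring sketch. The question groupings (Questions 1–3, 4–6, 7–10, and the AUDIT-4 subset) follow the text. The zone cut-offs used here are the default ranges from the WHO AUDIT manual and are an assumption in this context, because the cut-offs actually applied in this study are given only in its Supplementary Methods and may differ. The function names and the example responses are hypothetical.

```python
def audit_scores(items):
    """Summarise ten AUDIT item scores (each 0-4) into subscale totals."""
    if len(items) != 10 or any(not 0 <= s <= 4 for s in items):
        raise ValueError("expected ten item scores between 0 and 4")
    return {
        "consumption": sum(items[0:3]),          # Questions 1-3
        "dependence": sum(items[3:6]),           # Questions 4-6
        "problems": sum(items[6:10]),            # Questions 7-10
        "total": sum(items),                     # full AUDIT
        "audit_4": sum(items[0:3]) + items[9],   # Questions 1, 2, 3, 10
    }

# Assumed cut-offs: inclusive upper bound of each zone on the total score
# (WHO manual defaults, not taken from this study's Supplementary Methods).
ZONE_UPPER_BOUNDS = (("Zone I", 7), ("Zone II", 15), ("Zone III", 19), ("Zone IV", 40))

def audit_zone(total, bounds=ZONE_UPPER_BOUNDS):
    """Map a total AUDIT score (0-40) to a severity zone."""
    if total < 0:
        raise ValueError("total score cannot be negative")
    for zone, upper in bounds:
        if total <= upper:
            return zone
    raise ValueError("total score exceeds the maximum of the scale")

# Hypothetical respondent.
example_items = [3, 2, 2, 1, 0, 1, 0, 0, 2, 2]
example_scores = audit_scores(example_items)
example_zone = audit_zone(example_scores["total"])
print(example_scores, example_zone)
```

Because the zone assignment needs the total over all ten items, a respondent who answered only the AUDIT-4 items cannot be placed in a zone, which is why the severity model is restricted to survey years with the full questionnaire.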
In summary, K-NHANES IV (2007–2009), V (2010–2012), VI (2013–2015), and VII (2016–2018) were used to predict hazardous drinkers. However, only K-NHANES IV, V, and a part of VI (2013, 2015) that employed full AUDIT questions were used to predict the severity of alcohol-related problems.
Model Development and Validation
Our deep neural network consisted of the following architectural designs. For each input variable as an encoded response to a survey question, a dense layer was applied and then L2-normalized to project the input onto an eight-dimensional unit hypersphere. All embeddings were then concatenated along the channel dimension to produce an aggregated representation, which was then passed to a multi-layer perceptron (MLP) of several layers to produce the final prediction.
We tried various combinations of hidden layers. We changed the number of hidden layers from 1 to 6. The number of nodes per layer was varied among 4,096, 2,048, 1,024, and 512 to try various combinations. As a result, in the model predicting hazardous drinkers, the maximal area under the receiver operating characteristic curve (AUC) was obtained when the hidden dimensions of the MLP were set to “2,048, 2,048, 2,048, and 512.” In addition, in the model predicting the severity of alcohol problems, optimal results were obtained when the MLP was set to “2,048, 2,048, 2,048, 2,048, 2,048, and 512.” At each layer, we also applied batch normalization (30), Swish activation (31), and dropout with a probability of 50% (32).
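The architecture described in the two paragraphs above can be sketched in PyTorch as follows. This is an illustrative reading of the text, not the authors' code. Several details are assumptions: each variable is taken to be a single numeric value with no missing entries, the per-variable dense layers are stored as two parameter tables, the order "linear, batch normalization, activation, dropout" inside each hidden layer is a guess, `nn.SiLU` is used as the Swish activation with its parameter fixed at 1, and the class name `SurveyNet` is hypothetical.

```python
import torch
from torch import nn
import torch.nn.functional as F

class SurveyNet(nn.Module):
    """Per-variable embeddings on a unit hypersphere, followed by an MLP."""

    def __init__(self, n_vars, hidden=(2048, 2048, 2048, 512),
                 embed_dim=8, n_out=1, p_drop=0.5):
        super().__init__()
        # One dense layer (scalar -> embed_dim) per input variable,
        # stored as a weight table and a bias table.
        self.weight = nn.Parameter(torch.randn(n_vars, embed_dim) * 0.1)
        self.bias = nn.Parameter(torch.randn(n_vars, embed_dim) * 0.1)

        layers, width = [], n_vars * embed_dim
        for h in hidden:
            layers += [nn.Linear(width, h), nn.BatchNorm1d(h),
                       nn.SiLU(), nn.Dropout(p_drop)]
            width = h
        layers.append(nn.Linear(width, n_out))  # final prediction (logits)
        self.mlp = nn.Sequential(*layers)

    def embed(self, x):
        # x: (batch, n_vars) -> (batch, n_vars, embed_dim), L2-normalized
        z = x.unsqueeze(-1) * self.weight + self.bias
        return F.normalize(z, p=2, dim=-1)

    def forward(self, x):
        # Concatenate all embeddings, then apply the MLP.
        return self.mlp(self.embed(x).flatten(start_dim=1))

# Small instance to keep the example light; the defaults above match the
# hidden sizes reported for the hazardous-drinker model.
torch.manual_seed(0)
model = SurveyNet(n_vars=5, hidden=(16, 8))
model.eval()  # disables dropout, as at test time
x = torch.randn(4, 5)
logits = model(x)
print(logits.shape)
```

For the four-zone severity model, `n_out=4` and the longer hidden tuple reported in the text would be the corresponding settings under the same assumptions.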
We used an SGD optimizer with an initial learning rate of 0.2, 0.1, or 0.05, a Nesterov momentum of 0.9, 0.99, or 0.999 (33), and a weight decay of 10⁻⁶. At each epoch, the learning rate was decayed by a factor of 0.999. The model was trained for up to 50 epochs with a batch size of 4,096. Hyperparameters were chosen based on a 10-fold cross validation on the entire dataset (34). We also reported our evaluation metrics on the 10-fold cross validation. At test time, dropout was turned off. Training and evaluation of the deep learning model were performed using PyTorch (35).
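The optimizer settings listed above map onto standard PyTorch components, as in the sketch below. It picks one value from each candidate set (learning rate 0.1, momentum 0.9) purely for illustration. The stand-in linear model, the synthetic batch, and the binary cross-entropy loss are assumptions, since the text does not state the loss function.

```python
import torch
from torch import nn

torch.manual_seed(0)
model = nn.Linear(10, 1)  # stand-in for the real network

optimizer = torch.optim.SGD(
    model.parameters(),
    lr=0.1,             # one of the candidate initial learning rates
    momentum=0.9,       # one of the candidate momentum values
    nesterov=True,
    weight_decay=1e-6,
)
# Multiply the learning rate by 0.999 after every epoch.
scheduler = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.999)

# Synthetic batch; the real training used mini-batches of 4,096 samples.
x = torch.randn(64, 10)
y = (x[:, 0] > 0).float().unsqueeze(1)
loss_fn = nn.BCEWithLogitsLoss()  # assumed loss, not stated in the text

for epoch in range(50):
    model.train()
    optimizer.zero_grad()
    loss = loss_fn(model(x), y)
    loss.backward()
    optimizer.step()
    scheduler.step()

final_lr = scheduler.get_last_lr()[0]
print(f"learning rate after 50 epochs: {final_lr:.4f}")
```

With a decay factor of 0.999 per epoch, the learning rate falls by only about 5% over 50 epochs, so the schedule is close to a constant learning rate.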
Various conventional machine learning algorithms were used for comparison with deep learning, including logistic regression, support vector machine, random forest classifier, and K-nearest neighbors. Logistic regression is a regression technique used for predictive analysis when the dependent variable is binary (36). Support vector machine (SVM) is a technique for finding a group classification rule for a given sample group (37). Random forest classifier is a type of ensemble learning algorithm used for classification and regression analysis. It operates by outputting a classification or average predicted value from decision trees constructed during a training process (38). K-nearest neighbors is a method that predicts new data using information from the k nearest neighbors among existing data (39). For some algorithms, training was repeated with different parameter settings, and the settings showing maximum performance were adopted. The parameters used for machine learning are described in Supplementary Table 3.
We verified all algorithms through 10-fold cross validation. In addition, performances of deep learning and conventional machine learnings were compared using 10 sub-datasets obtained through 10-fold cross validation. One-way analysis of variance (ANOVA) and post-hoc test were then performed to determine whether deep learning showed significantly better performance than each conventional machine learning algorithm (Supplementary Table 4).
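The comparison procedure above (per-fold AUCs from 10-fold cross validation, followed by a one-way ANOVA across algorithms) could look like the following sketch with scikit-learn and SciPy. The synthetic dataset, the two models, and their settings are assumptions chosen to keep the example small. The post-hoc test is omitted because the text does not say which one was used.

```python
from scipy.stats import f_oneway
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in for the survey data.
X, y = make_classification(n_samples=600, n_features=20,
                           n_informative=5, random_state=0)

# A fixed random_state gives every model the same ten folds.
cv = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)

models = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "k_nearest_neighbors": KNeighborsClassifier(n_neighbors=5),
}

# One AUC per fold and per model.
fold_aucs = {
    name: cross_val_score(model, X, y, cv=cv, scoring="roc_auc")
    for name, model in models.items()
}

# One-way ANOVA on the per-fold AUCs, one group per algorithm.
f_stat, p_value = f_oneway(*fold_aucs.values())

for name, aucs in fold_aucs.items():
    print(f"{name}: mean AUC = {aucs.mean():.3f}")
print(f"F = {f_stat:.1f}, p = {p_value:.3g}")
```

Adding further entries to `models` extends the comparison to more algorithms without changing the rest of the code.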
Contribution-Ranking Analysis of Variables
There are several ways to obtain the contribution of each variable. Among them, to measure the significance of each survey question used by the trained model, we studied the drop in performance when the response to a question was removed. A significant drop indicated that the question was serving as a strong cue for the model's prediction. Note that our model was in fact trained to deal with missing inputs due to dropout within the model. We summarized and sorted the observed gap of all input variables in Supplementary Table 1.
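The contribution analysis above amounts to an ablation loop: mask one variable at a time, score the model again, and sort by the resulting AUC. The sketch below shows that loop under stated assumptions. How a response was "removed" in the study is not specified, so overwriting the column with a constant `fill_value` is a guess. The `predict` function is a hand-written stand-in for a trained model, and the data are synthetic.

```python
import numpy as np

def auc(y_true, scores):
    """AUC as the probability that a positive outranks a negative (ties count 0.5)."""
    y_true = np.asarray(y_true, dtype=bool)
    scores = np.asarray(scores, dtype=float)
    pos, neg = scores[y_true], scores[~y_true]
    diff = pos[:, None] - neg[None, :]
    return ((diff > 0).sum() + 0.5 * (diff == 0).sum()) / (len(pos) * len(neg))

def ablation_ranking(predict, X, y, names, fill_value=0.0):
    """Return (name, AUC with that variable masked), lowest AUC first."""
    rows = []
    for j, name in enumerate(names):
        X_masked = X.copy()
        X_masked[:, j] = fill_value  # assumed way of "removing" variable j
        rows.append((name, auc(y, predict(X_masked))))
    # The lowest AUC after masking marks the largest contribution.
    return sorted(rows, key=lambda r: r[1])

# Synthetic example: variable "a" matters most, "c" is ignored by the model.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
y = 3 * X[:, 0] + X[:, 1] + rng.normal(size=500) > 0

def predict(X):
    # Stand-in for a trained model's score.
    return 3 * X[:, 0] + X[:, 1]

full_auc = auc(y, predict(X))
ranking = ablation_ranking(predict, X, y, ["a", "b", "c"])
for name, masked_auc in ranking:
    print(f"{name}: AUC without it = {masked_auc:.3f} (full model: {full_auc:.3f})")
```

Masking a variable that the model does not use leaves the AUC unchanged, which is the behaviour expected of low-ranked variables in this kind of analysis.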
Results
Characteristics of K-NHANES Dataset for Predicting Hazardous Drinkers
Figure 1 summarizes how participants were selected for the deep learning analysis. To classify hazardous drinkers, we extracted data from K-NHANES IV, V, VI, and VII, surveyed from 2007 to 2018. The total number of participants was initially 97,622. Among them, 22,194 were excluded because they were under 19 years old. Then 6,241 individuals were excluded because they did not respond to questions about alcohol behavior. Finally, 69,187 participants were used in the machine learning model, of which 21,057 (30.4%) participants were classified as hazardous drinkers (Figure 1A).
Characteristics of K-NHANES Dataset for Predicting the Severity of Alcohol-Related Problems
To establish a model for predicting the severity of alcohol-related problems, we extracted data from K-NHANES IV, V, and a part of VI (2013, 2015) investigated from 2007 to 2015 except for 2014. Total number of participants from the dataset was 65,803. Of them, 15,740 participants were excluded because they were under 19 years old. Then 4,391 individuals were excluded because they did not answer questions related to alcohol. The remaining 45,672 participants were used for machine learning modeling, including 33,135 (72.5%) in Zone I, 8,048 (17.6%) in Zone II, 2,261 (5.0%) in Zone III, and 2,228 (4.9%) in Zone IV (Figure 1B).
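The participant counts reported for both models can be checked with a few lines of arithmetic. The sketch below uses only numbers stated in the text. The dictionary keys are labels chosen for this example.

```python
# Exclusion steps for each model, as reported in the text.
flow = {
    "hazardous_drinking_model": {
        "initial": 97_622, "under_19": 22_194,
        "no_alcohol_response": 6_241, "final": 69_187,
    },
    "severity_model": {
        "initial": 65_803, "under_19": 15_740,
        "no_alcohol_response": 4_391, "final": 45_672,
    },
}

remaining = {
    name: f["initial"] - f["under_19"] - f["no_alcohol_response"]
    for name, f in flow.items()
}

# Share of hazardous drinkers in the first model's sample.
hazardous_share = round(100 * 21_057 / 69_187, 1)

# Zone sizes in the severity model's sample.
zones = {"Zone I": 33_135, "Zone II": 8_048, "Zone III": 2_261, "Zone IV": 2_228}
zone_total = sum(zones.values())
zone_shares = {z: round(100 * n / zone_total, 1) for z, n in zones.items()}

print(remaining)
print(hazardous_share, zone_shares)
```

The zone shares show a strong class imbalance: nearly three quarters of participants fall in Zone I, which is worth keeping in mind when reading accuracy figures for the four-zone model.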
Predicting Hazardous Drinkers in K-NHANES Dataset
Deep learning and other conventional machine learning algorithms were trained with 325 variables and 69,187 subjects to predict hazardous drinkers in the dataset representing the general population. The performance of each model was evaluated with the area under the receiver operating characteristic curve (AUC) resulting from a 10-fold cross validation. Results are summarized in Figure 2. Deep learning showed the highest performance, predicting hazardous drinkers with an AUC of 0.870, followed by logistic regression, linear support vector machine, random forest classifier, and K-nearest neighbors, with AUCs of 0.858, 0.849, 0.810, and 0.740, respectively. Detailed parameters of conventional machine learning algorithms are described in Supplementary Table 3.
There were significant differences in classification performance among algorithms based on AUC (one-way ANOVA, F = 1606.0, p < 0.001). Additionally, post-hoc analysis confirmed that deep learning was significantly superior to logistic regression (p < 0.001), linear support vector machine (p < 0.001), random forest classifier (p < 0.001), and K-nearest neighbors (p < 0.001, Supplementary Table 4).
We also calculated other measures of classifiers. Accuracy, precision, true positive rate, false positive rate, and F1-score of deep learning were 0.822, 0.756, 0.624, 0.090, and 0.684, respectively. Measures of other classifiers are described in Supplementary Table 5.
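For reference, the measures listed above follow from the four cells of a binary confusion matrix. The sketch below defines them and then checks that the reported F1-score agrees with the reported precision and true positive rate. The confusion-matrix counts in the example are invented for illustration and are not the study's counts.

```python
def classification_metrics(tp, fp, tn, fn):
    """Standard binary classification measures from confusion-matrix counts."""
    precision = tp / (tp + fp)
    tpr = tp / (tp + fn)  # true positive rate (recall, sensitivity)
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "precision": precision,
        "true_positive_rate": tpr,
        "false_positive_rate": fp / (fp + tn),
        "f1": 2 * precision * tpr / (precision + tpr),
    }

# Hypothetical counts, for illustration only.
toy_metrics = classification_metrics(tp=60, fp=20, tn=180, fn=40)
print(toy_metrics)

# Consistency check on the reported values: F1 is the harmonic mean of
# precision (0.756) and true positive rate (0.624).
f1_from_reported = 2 * 0.756 * 0.624 / (0.756 + 0.624)
print(round(f1_from_reported, 3))  # matches the reported 0.684
```

The gap between the true positive rate (0.624) and the low false positive rate (0.090) indicates that, at the operating point used, the classifier favours specificity over sensitivity.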
Contribution of Each Variable to the Prediction of Hazardous Drinkers
We calculated the contribution of each variable based on the decrease in AUC when that variable was excluded. The greater the decrease in AUC after excluding a variable, the greater the contribution of that variable to the deep learning prediction was considered to be. Table 1 summarizes the ranking of contributions of the top 20 variables. Supplementary Table 1 lists all variables by contribution. Among 325 variables, energy intake was found to be the factor with the greatest contribution to the prediction. Excluding this variable, the AUC of the trained model was only 0.618, which was the lowest AUC value. The second factor with the largest contribution was carbohydrate intake. After excluding this factor, the AUC of the model was 0.776. Among categorical variables, sex, lifetime smoking, and current smoking were ranked the 6th, the 13th, and the 16th, respectively, among all variables.
Table 1.
Contribution ranking | Variable code | Variable description | AUC obtained by excluding this variable |
---|---|---|---|
1 | N_EN | Energy intake (Kcal) | 0.6179801 |
2 | N_CHO | Carbohydrate intake (g) | 0.7757575 |
3 | N_FAT | Fat intake (g) | 0.8372522 |
4 | age | Age | 0.8387202 |
5 | HE_HDL_st2 | HDL-cholesterol | 0.8564337 |
6 | sex | Sex | 0.856489 |
7 | N_PROT | Protein intake (g) | 0.8607539 |
8 | HE_RBC | Red blood cells | 0.8614541 |
9 | HE_TG | Triglyceride | 0.8624155 |
10 | HE_ast | Aspartate aminotransferase | 0.8633011 |
11 | HE_alt | Alanine aminotransferase | 0.8653823 |
12 | HE_HB | Hemoglobin | 0.8653889 |
13 | BS1_1 | (Adult) Lifetime smoking | 0.8667786 |
14 | HE_wc | Waist circumference | 0.8674219 |
15 | HE_chol | Total cholesterol | 0.8681976 |
16 | BS3_1 | Current smoking status | 0.8683784 |
17 | N_INTK | Dietary intake (g) | 0.8683819 |
18 | HE_HCT | Hematocrit | 0.8689445 |
19 | HE_BMI | Body mass index | 0.8692696 |
20 | edu | Education level reclassification code | 0.8693084 |
Predicting Hazardous Drinkers by Top Ranked Variables or by Variables Related to Medical Records
In the process of predicting hazardous drinkers, we also trained the deep learning model using only specific variables (Figure 3). First, deep learning was trained with the top 20 variables based on findings about the contribution of variables. The performance reached an AUC of 0.856. When the model was trained with only the top 10 variables, the AUC was 0.836. These results show that predictive performance was largely preserved even when training was performed on only 10 variables. In addition, we trained the model using only variables that could be extracted from individual medical records to assess the clinical applicability of the model. Among 325 variables, 156 were selected as variables that could be extracted from personal medical records. These were laboratory findings such as AST and ALT, body measurements such as weight and height, and the presence or absence of various comorbid diseases. These variables related to medical records are listed in Supplementary Table 1. Training based on these 156 variables yielded an AUC of 0.839, indicating that the performance was relatively preserved.
Predicting the Severity of Alcohol-Related Problems From K-NHANES Dataset
To predict the degree of individuals' problematic drinking behavior, we used a deep learning model that best predicted hazardous drinkers among various machine learning algorithms. A total of 45,672 individuals and 392 variables were used to train this model. Results are summarized in Figure 4. The model classified Zone I, Zone II, Zone III, and Zone IV with AUCs of 0.881, 0.774, 0.853, and 0.879, respectively (Accuracy: 0.763; Precision: 0.461; Recall: 0.376; F1-score: 0.386). It could be seen that the accuracy of the model for predicting the severity of alcohol-related problems was relatively preserved.
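Per-zone AUCs such as those above are commonly obtained by treating each zone in turn as the positive class (one-vs-rest). Whether the study computed them exactly this way is an assumption. The sketch below uses scikit-learn on synthetic labels and probabilities. The class proportions loosely mirror the reported zone shares, and the predicted probabilities are simulated, not model output.

```python
import numpy as np
from sklearn.metrics import roc_auc_score

def per_zone_auc(y_true, proba, zone_names):
    """One-vs-rest AUC for each class, given an (n, n_classes) probability matrix."""
    y_true = np.asarray(y_true)
    return {
        name: roc_auc_score((y_true == k).astype(int), proba[:, k])
        for k, name in enumerate(zone_names)
    }

zone_names = ["Zone I", "Zone II", "Zone III", "Zone IV"]

# Synthetic labels with an imbalance similar to the reported zone shares.
rng = np.random.default_rng(0)
n = 400
y = rng.choice(4, size=n, p=[0.725, 0.176, 0.05, 0.049])

# Simulated model output: random logits with a boost for the true class,
# converted to probabilities with a softmax.
logits = rng.normal(size=(n, 4))
logits[np.arange(n), y] += 1.5
proba = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)

zone_aucs = per_zone_auc(y, proba, zone_names)
for name, value in zone_aucs.items():
    print(f"{name}: AUC = {value:.3f}")
```

Because each AUC compares one zone against all others, it is insensitive to class imbalance in a way that overall accuracy is not, which may explain why the per-zone AUCs are high while precision and recall are modest.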
Discussion
Through this study, we found that deep learning techniques could effectively predict hazardous drinking groups in K-NHANES, a large-scale general population dataset of South Korea. Deep learning showed significantly higher performance than conventional machine learning such as support vector machine and logistic regression. Furthermore, we were able to effectively classify individuals into Zone I, Zone II, Zone III, and Zone IV according to the severity of alcohol-related problems. These predictions were based on the definition of the WHO guideline for hazardous drinkers and severity of alcohol-related problems (Zones I–IV). To the best of our knowledge, this is the first study to develop deep learning models for predicting hazardous drinkers and severity of alcohol-related problems (Zones I–IV).
Previous studies have effectively predicted psychiatric diseases such as depression using large scale data. Similar to our study, one research team has predicted patients of depression in the general population with an AUC of 0.89 using a deep learning technique based on K-NHANES (16). In addition, another group has predicted the response of patients with major depressive disorder to selective serotonin reuptake inhibitors with an AUC of 0.82 by deep learning (40). One study has created a deep learning model for predicting the severity of major depressive disorder with a total of five steps with AUC ranging from 0.63 to 0.76 (41). These findings imply that deep learning is effective in predicting the diagnosis and severity of psychiatric disorders. Findings of our study were encouraging because our results were similar to or better than these previous studies.
In the process of predicting hazardous drinkers, the contribution of each variable was calculated based on the decrease in AUC after excluding that variable. As a result, energy intake, carbohydrate intake, fat intake, protein intake, and dietary intake were ranked high (ranked the 1st, 2nd, 3rd, 7th, and 17th, respectively, Table 1). The t-test also showed that the average intake of each of these variables was significantly higher in hazardous drinkers than in other participants (t = 53.1, p < 0.001; t = 12.0, p < 0.001; t = 36.3, p < 0.001; t = 42.3, p < 0.001; t = 47.6, p < 0.001, respectively). This is consistent with previous studies showing strong evidence that hazardous drinkers have a significantly higher overall energy intake (42, 43). This may be due in part to the high caloric content of alcohol itself. One study showed that an average of about 60% of a hazardous drinker's energy intake comes from alcohol (44). In addition, hazardous drinkers are known to show significantly higher carbohydrate, protein, and fat intake (44). The same trend was observed in the dataset used in our study.
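A group comparison of the kind reported above can be run with SciPy's two-sample t-test. The text does not say which variant was used, so Welch's unequal-variance version is an assumption here. The data below are synthetic and their means and spreads are arbitrary. They are not values from K-NHANES.

```python
import numpy as np
from scipy.stats import ttest_ind

rng = np.random.default_rng(0)

# Synthetic daily energy intake (kcal) for two groups; arbitrary parameters.
hazardous = rng.normal(loc=2300, scale=800, size=300)
others = rng.normal(loc=1900, scale=700, size=700)

# Welch's t-test: does not assume equal variances in the two groups.
t_stat, p_value = ttest_ind(hazardous, others, equal_var=False)

print(f"t = {t_stat:.1f}, p = {p_value:.3g}")
```

A positive t statistic indicates a higher mean in the first group passed to `ttest_ind`, matching the direction of the comparisons reported in the text.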
There are additional explanations for why energy and macronutrient intake have become major factors in predicting hazardous drinkers. Alcohol use disorder is often comorbid with other psychiatric and personality disorders, especially with eating disorders (45). Several studies have shown that the lifetime comorbidity of eating disorder and alcohol use disorder in women was 15–32%, which is significantly higher than in the general population (46–49). Specifically, bulimia nervosa and the bulimic subtype of anorexia nervosa were more common than restricting anorexia nervosa among patients with alcoholism (47). Another study also reported that 20% of patients with alcohol use disorder showed binge eating, and 12% of the patients had some form of inappropriate compensatory behaviors related to weight gain (50, 51). Studies of the psychopathology of this relationship reported that impulsivity had a significant impact on the correlation between problematic eating behavior and alcohol intake (52, 53). In addition, alcohol use disorders and eating disorders have similar neurobiological backgrounds. Both disorders are associated with dysfunctional opioid and dopaminergic pathways (54). Furthermore, they show reduced activity in the brain areas associated with self-control such as the orbito-frontal and pre-frontal cortex areas (54). Therefore, because of the strong association between indiscriminate alcohol consumption and inappropriate eating habits, energy and macronutrient intake may be major factors in predicting hazardous drinkers.
As another major predictor, HDL-cholesterol was ranked 5th in the contribution ranking. The HDL level of hazardous drinkers was significantly higher in the dataset (t-test: t = 16.1, p < 0.001). This is consistent with previous studies showing that alcohol consumption can increase transport rates of HDL apolipoproteins ApoA-I and ApoA-II (55, 56). Triglyceride was ranked 9th as a contributing factor in the present study. Its level was significantly higher in hazardous drinkers (t = 32.0, p < 0.001). Such higher levels in hazardous drinkers might be due to a decrease in the breakdown of chylomicrons and VLDL remnants caused by the inhibitory effect of alcohol (57). Waist circumference and BMI were ranked 14th and 19th as contributing factors in the present study. They were significantly higher in hazardous drinkers (t = 28.9, p < 0.001; t = 13.8, p < 0.001, respectively). Alcohol consumption is known to be closely related to weight gain. It especially increases waist circumference (58, 59). Thus, waist circumference and BMI seem to be strong predictors for hazardous drinkers.
In our results, the ranking of numerical variables was generally higher. Of a total of 324 variables used in the analysis, only 88 (27.2%) were numerical variables. Among the top 20 variables, only three were categorical variables whereas 17 were numerical variables. This might be because numeric variables provided more detailed clinical information about patients. On the other hand, if the purpose of this study was to find factors contributing to the prediction of Zone IV group, a more serious group than hazardous drinkers, results would be different. For example, despite the fact that AST and ALT ranked the 10th and the 11th as contributors for classifying hazardous drinkers, they might result in a greater contribution in predicting the Zone IV group considering liver enzymes are known to deteriorate markedly in a more severe group (60).
The AUC was 0.856 for predicting hazardous drinkers with top 20 variables and 0.836 for such prediction with top 10 variables, suggesting that hazardous drinkers could be effectively predicted with only a few variables. In addition, of a total of 325 variables, when 156 variables that could be automatically extracted from hospital medical records were used, a high AUC value of 0.839 was obtained. Therefore, the deep learning model can be applied to a hospital system so that patients who visit departments other than psychiatry can be automatically consulted if they are hazardous drinkers.
In clinical settings, it is practically difficult to screen all patients for alcohol use. One study used AUDIT-C for all patients visiting the emergency room for 1 year and found that only 65% of patients were screened (61). It was found that 25% of patients did not undergo such screening because their medical staff forgot to ask questions about alcohol and 8.8% of patients struggled with questions or refused to cooperate. However, if the deep learning model of this study could be embedded in a hospital system, it could be used to screen all patients based on their medical records, which would relieve the burden on medical staff and avoid the problem of patients' refusal. Variables extracted from hospital medical records were sufficient to adequately predict hazardous drinkers in this model. Therefore, even without additional information, such as nutritional status, it is possible to predict whether a patient is a hazardous drinker based on the data already collected in the hospital. In other words, automated screening through deep learning can overcome limitations of medical resources and factors associated with medical staff and patients that may appear in the clinical field.
In addition, the deep learning algorithm established in this study might be used to estimate the prevalence of hazardous drinkers in a specific group. Where no mental health survey is available for a region, the regional prevalence of hazardous drinkers could be estimated with the trained deep learning model. Even where health surveys cannot be conducted every year, the trained model could be used to infer how the prevalence of hazardous drinkers changes over time.
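One way such a prevalence estimate could be obtained is sketched below. It assumes that the model's binary flags for a group are available and corrects the share of flagged people for the classifier's imperfect sensitivity and specificity (the Rogan-Gladen correction). This correction was not part of the present study; the function name `adjusted_prevalence` and all numeric values are hypothetical.

```python
def adjusted_prevalence(flags, sensitivity, specificity):
    """Estimate true prevalence from binary model flags.

    Applies the Rogan-Gladen correction:
        prevalence = (apparent + specificity - 1) / (sensitivity + specificity - 1)
    where `apparent` is the share of people the model flags as positive.
    """
    if not flags:
        raise ValueError("flags must not be empty")
    youden = sensitivity + specificity - 1.0
    if youden <= 0:
        raise ValueError("classifier must perform better than chance")
    apparent = sum(flags) / len(flags)
    corrected = (apparent + specificity - 1.0) / youden
    return min(1.0, max(0.0, corrected))  # clip to a valid proportion


# Hypothetical region: the model flags 30 of 100 residents as hazardous drinkers.
region_flags = [1] * 30 + [0] * 70
# Hypothetical operating point; these are NOT values reported in the study.
estimate = adjusted_prevalence(region_flags, sensitivity=0.8, specificity=0.8)
print(f"estimated prevalence: {estimate:.3f}")
```

Simply counting flagged individuals would over- or under-state prevalence whenever false positives and false negatives do not cancel out, which is why the sketch applies a correction rather than reporting the raw flagged share.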
Our results showed that the severity of an individual's alcohol-related problems could be predicted successfully. In a clinical setting, there may not be enough time to administer the full AUDIT, or patients may refuse to report, or may under-report, their symptoms because of the stigma attached to alcohol-related disease. In such situations, the trained deep learning algorithm of this study would be helpful in assessing the severity of alcohol-related problems in patients. Patients with alcohol use disorders can receive outpatient medication, intensive behavioral programs, or inpatient treatment depending on severity (62). Using the deep learning algorithm, the severity of alcohol problems could be assessed more efficiently, which would facilitate planning the type and duration of treatment and predicting prognosis.
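For orientation, the severity zones that serve as prediction targets can be written as a simple mapping from the total AUDIT score. The sketch below assumes the cut-offs of the WHO AUDIT guidelines (12): 0 to 7 for Zone I, 8 to 15 for Zone II, 16 to 19 for Zone III, and 20 to 40 for Zone IV. The thresholds actually applied in this study are those given in its Methods, and the function name `audit_zone` is hypothetical.

```python
def audit_zone(total_score):
    """Map a total AUDIT score (0-40) to a risk zone.

    Cut-offs follow the WHO AUDIT guidelines and are an assumption here;
    a study may apply different thresholds.
    """
    if not 0 <= total_score <= 40:
        raise ValueError("AUDIT total score must be between 0 and 40")
    if total_score <= 7:
        return "Zone I"
    if total_score <= 15:
        return "Zone II"
    if total_score <= 19:
        return "Zone III"
    return "Zone IV"


for score in (5, 12, 17, 25):
    print(score, audit_zone(score))
```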
This study has several limitations. First, this study was conducted on the general population of Korea. Thus, different results might be obtained for populations from other countries or cultures. Although AUDIT was developed as an international instrument, it should be applied in consideration of each country's culture, society, and environment (12). Second, since our datasets were cross-sectional, it was difficult to judge the future prognosis of patients. Although we predicted hazardous drinkers and the extent of alcohol-related problems at the time of investigation, longitudinal data would be needed to determine their prognosis. Third, in calculating the contribution of each variable, the more strongly a variable was correlated with other variables, the more likely its contribution was underestimated. For example, if two variables were closely related, removing one of them might not significantly affect model performance because the other still carried similar information. Thus, the contribution of that variable might have been undervalued.
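The third limitation can be illustrated with a toy experiment. The sketch below assumes a drop-one-variable style of contribution estimate (performance with all variables minus performance without the variable) and uses a simple nearest-centroid classifier on synthetic data. Neither the classifier nor the data correspond to the study's model; all names (`make_data`, `drop_one_importance`, the variables `a`, `a_copy`, `b`) are hypothetical. The label depends equally on `a` and `b`, but `a` has an exact duplicate, so removing it costs the model little or nothing.

```python
import random


def make_data(n, rng):
    """Toy data: the label depends equally on a and b; a_copy duplicates a."""
    rows, labels = [], []
    for _ in range(n):
        a, b = rng.gauss(0, 1), rng.gauss(0, 1)
        rows.append({"a": a, "a_copy": a, "b": b})
        labels.append(1 if a + b + rng.gauss(0, 0.5) > 0 else 0)
    return rows, labels


def fit_centroids(rows, labels, features):
    """Nearest-centroid 'model': the mean feature vector of each class."""
    centroids = {}
    for cls in (0, 1):
        members = [r for r, y in zip(rows, labels) if y == cls]
        centroids[cls] = [sum(r[f] for r in members) / len(members)
                          for f in features]
    return centroids


def accuracy(centroids, rows, labels, features):
    """Share of rows assigned to the class with the nearest centroid."""
    correct = 0
    for r, y in zip(rows, labels):
        dist = {cls: sum((r[f] - c) ** 2 for f, c in zip(features, centre))
                for cls, centre in centroids.items()}
        correct += min(dist, key=dist.get) == y
    return correct / len(rows)


def drop_one_importance(train, test, features):
    """Contribution of each feature = accuracy(all) - accuracy(without it)."""
    (x_tr, y_tr), (x_te, y_te) = train, test

    def score(feats):
        return accuracy(fit_centroids(x_tr, y_tr, feats), x_te, y_te, feats)

    full = score(features)
    return {f: full - score([g for g in features if g != f]) for f in features}


rng = random.Random(0)  # fixed seed so the toy experiment is repeatable
train, test = make_data(2000, rng), make_data(2000, rng)
importance = drop_one_importance(train, test, ["a", "a_copy", "b"])
for name, value in importance.items():
    print(f"{name}: {value:+.3f}")
```

In this setup the estimated contribution of `a` comes out clearly smaller than that of `b`, even though both drive the label to the same degree, because `a_copy` compensates when `a` is removed. The same mechanism could lead to undervalued contributions for strongly correlated survey variables.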
In conclusion, the current study found that deep learning could successfully predict hazardous drinkers and classify individuals according to the severity of their alcohol-related problems. It was also possible to identify which factors contributed most to the prediction. These findings suggest that deep learning could substantially improve the efficiency of diagnosis and treatment of alcohol-related diseases. However, this study was based on cross-sectional data. Further studies using longitudinal data are needed to predict treatment response and long-term prognosis.
Data Availability Statement
Publicly available datasets were analyzed in this study. These data can be found here: https://knhanes.kdca.go.kr.
Ethics Statement
The studies involving human participants were reviewed and approved by the Institutional Review Board of the Ethics Committee of St. Vincent's Hospital at The Catholic University of Korea (VC21ZISI0042). The patients/participants provided their written informed consent to participate in this study.
Author Contributions
S-YK: conceptualization, investigation, data curation, formal analysis, and writing – original draft. TP: conceptualization, investigation, formal analysis, and writing – original draft. KK: investigation, formal analysis, and supervision. JO: formal analysis, writing – review and editing, and supervision. YP: investigation and formal analysis. D-JK: conceptualization, investigation, data curation, writing – review and editing, and project administration. All authors commented on and supervised the manuscript and approved its final version.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Footnotes
Funding. This study was supported by the Research Fund of Seoul St. Mary's Hospital, The Catholic University of Korea.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2021.684406/full#supplementary-material
References
- 1. Grant BF, Goldstein RB, Saha TD, Chou SP, Jung J, Zhang H, et al. Epidemiology of DSM-5 alcohol use disorder: results from the national epidemiologic survey on alcohol and related conditions III. JAMA Psychiatry. (2015) 72:757–66. doi: 10.1001/jamapsychiatry.2015.0584
- 2. Connor JP, Haber PS, Hall WD. Alcohol use disorders. Lancet. (2016) 387:988–98. doi: 10.1016/S0140-6736(15)00122-1
- 3. Hasin DS, Stinson FS, Ogburn E, Grant BF. Prevalence, correlates, disability, and comorbidity of DSM-IV alcohol abuse and dependence in the United States: results from the National Epidemiologic Survey on Alcohol and Related Conditions. Arch Gen Psychiatry. (2007) 64:830–42. doi: 10.1001/archpsyc.64.7.830
- 4. Cargiulo T. Understanding the health impact of alcohol dependence. Am J Health Syst Pharm. (2007) 64:S5–S11. doi: 10.2146/ajhp060647
- 5. Rehm J. The risks associated with alcohol use and alcoholism. Alcohol Res Health. (2011) 34:135–43.
- 6. Murray CJ, Vos T, Lozano R, Naghavi M, Flaxman AD, Michaud C, et al. Disability-adjusted life years (DALYs) for 291 diseases and injuries in 21 regions, 1990-2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet. (2012) 380:2197–223. doi: 10.1016/S0140-6736(12)61690-0
- 7. Lozano R, Naghavi M, Foreman K, Lim S, Shibuya K, Aboyans V, et al. Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010. Lancet. (2012) 380:2095–128. doi: 10.1016/S0140-6736(12)61728-0
- 8. Whelan R. Decisions, decisions: machine learning as a tool to identify alcohol-use disorder treatment seekers. EClinMed. (2019) 12:4–5. doi: 10.1016/j.eclinm.2019.06.012
- 9. Keyes KM, Hatzenbuehler ML, McLaughlin KA, Link B, Olfson M, Grant BF, et al. Stigma and treatment for alcohol disorders in the United States. Am J Epidemiol. (2010) 172:1364–72. doi: 10.1093/aje/kwq304
- 10. Miller PM, Book SW, Stewart SH. Medical treatment of alcohol dependence: a systematic review. Int J Psychiatry Med. (2011) 42:227–66. doi: 10.2190/PM.42.3.b
- 11. Bendtsen P, Karlsson N, Dalal K, Nilsen P. Hazardous drinking concepts, limits and methods: low levels of awareness, knowledge and use in the Swedish population. Alcohol Alcohol. (2011) 46:638–45. doi: 10.1093/alcalc/agr065
- 12. World Health Organization. AUDIT: The Alcohol Use Disorders Identification Test: Guidelines for Use in Primary Health Care. Geneva: World Health Organization (2001).
- 13. Nadkarni A, Weiss HA, Naik A, Bhat B, Patel V. The six-year outcome of alcohol use disorders in men: a population based study from India. Drug Alcohol Depend. (2016) 162:107–15. doi: 10.1016/j.drugalcdep.2016.02.039
- 14. Heinemans N, Toftgård M, Damström-Thakker K, Galanti MR. An evaluation of long-term changes in alcohol use and alcohol problems among clients of the Swedish National Alcohol Helpline. Subst Abuse Treat Prev Policy. (2014) 9:22. doi: 10.1186/1747-597X-9-22
- 15. Priya A, Garg S, Tigga NP. Predicting anxiety, depression and stress in modern life using machine learning algorithms. Procedia Comput Sci. (2020) 167:1258–67. doi: 10.1016/j.procs.2020.03.442
- 16. Oh J, Yun K, Maoz U, Kim T-S, Chae J-H. Identifying depression in the National Health and Nutrition Examination Survey data using a deep learning algorithm. J Affect Disord. (2019) 257:623–31. doi: 10.1016/j.jad.2019.06.034
- 17. Sau A, Bhakta I. Screening of anxiety and depression among seafarers using machine learning technology. Inform Med Unlocked. (2019) 16:100228. doi: 10.1016/j.imu.2019.100228
- 18. Park HA. The Korea national health and nutrition examination survey as a primary data source. Korean J Fam Med. (2013) 34:79. doi: 10.4082/kjfm.2013.34.2.79
- 19. Bush K, Bradley K, McDonell M, Malone T, Fihn S. Screening for problem drinking: comparison of CAGE and AUDIT. Ambulatory care quality improvement project. J Gen Intern Med. (1998) 13:379–88. doi: 10.1046/j.1525-1497.1998.00118.x
- 20. Bradley KA, Bush KR, Epler AJ, Dobie DJ, Davis TM, Sporleder JL, et al. Two brief alcohol-screening tests from the Alcohol Use Disorders Identification Test (AUDIT): validation in a female Veterans Affairs patient population. Arch Intern Med. (2003) 163:821–9. doi: 10.1001/archinte.163.7.821
- 21. Gual A, Segura L, Contel M, Heather N, Colom J. Audit-3 and audit-4: effectiveness of two short forms of the alcohol use disorders identification test. Alcohol Alcohol. (2002) 37:591–6. doi: 10.1093/alcalc/37.6.591
- 22. Wu SI, Huang HC, Liu SI, Huang CR, Sun FJ, Chang TY, et al. Validation and comparison of alcohol-screening instruments for identifying hazardous drinking in hospitalized patients in Taiwan. Alcohol Alcohol. (2008) 43:577–82. doi: 10.1093/alcalc/agn036
- 23. Bradley KA, DeBenedetti AF, Volk RJ, Williams EC, Frank D, Kivlahan DR. AUDIT-C as a brief screen for alcohol misuse in primary care. Alcohol Clin Exp Res. (2007) 31:1208–17. doi: 10.1111/j.1530-0277.2007.00403.x
- 24. Dawson DA, Grant BF, Stinson FS, Zhou Y. Effectiveness of the derived Alcohol Use Disorders Identification Test (AUDIT-C) in screening for alcohol use disorders and risk drinking in the US general population. Alcohol Clin Exp Res. (2005) 29:844–54. doi: 10.1097/01.ALC.0000164374.32229.A2
- 25. Burns E, Gray R, Smith LA. Brief screening questionnaires to identify problem drinking during pregnancy: a systematic review. Addiction. (2010) 105:601–14. doi: 10.1111/j.1360-0443.2009.02842.x
- 26. Bush K, Kivlahan DR, McDonell MB, Fihn SD, Bradley KA. The AUDIT alcohol consumption questions (AUDIT-C): an effective brief screening test for problem drinking. Ambulatory Care Quality Improvement Project (ACQUIP). Alcohol Use Disorders Identification Test. Arch Intern Med. (1998) 158:1789–95. doi: 10.1001/archinte.158.16.1789
- 27. Seong JH, Lee CH, Do HJ, Oh SW, Lym YL, Choi JK, et al. Performance of the AUDIT alcohol consumption questions (AUDIT-C) and AUDIT-K question 3 alone in screening for problem drinking. Korean J Fam Med. (2009) 30:695–702. doi: 10.4082/kjfm.2009.30.9.695
- 28. Lee JH, Kong KA, Lee DH, Choi YH, Jung KY. Validation and proposal for cut-off values of an abbreviated version of the Alcohol Use Disorder Identification Test using the Korean National Health and Nutrition Examination Survey. Clin Exp Emerg Med. (2018) 5:113–9. doi: 10.15441/ceem.17.228
- 29. Miller WR. Motivational Enhancement Therapy Manual: A Clinical Research Guide for Therapists Treating Individuals with Alcohol Abuse and Dependence. Rockville, MD: U.S. Department of Health and Human Services, Public Health Service, Alcohol, Drug Abuse, and Mental Health Administration, National Institute on Alcohol Abuse and Alcoholism (1992).
- 30. Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. In: Proceedings of the 32nd International Conference on Machine Learning. Lille (2015). p. 448–56.
- 31. Ramachandran P, Zoph B, Le QV. Searching for activation functions. arXiv preprint arXiv:1710.05941. (2017)
- 32. Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R. Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res. (2014) 15:1929–58.
- 33. Sutskever I, Martens J, Dahl G, Hinton G. On the importance of initialization and momentum in deep learning. In: Proceedings of the 30th International Conference on Machine Learning. Atlanta, GA (2013). p. 1139–47.
- 34. Stone M. Cross-validatory choice and assessment of statistical predictions. J R Stat Soc B. (1974) 36:111–33. doi: 10.1111/j.2517-6161.1974.tb00994.x
- 35. Paszke A, Gross S, Massa F, Lerer A, Bradbury J, Chanan G, et al. Pytorch: an imperative style, high-performance deep learning library. arXiv preprint arXiv:1912.01703. (2019)
- 36. Tolles J, Meurer WJ. Logistic regression: relating patient characteristics to outcomes. JAMA. (2016) 316:533–4. doi: 10.1001/jama.2016.7653
- 37. Cortes C, Vapnik V. Support-vector networks. Mach Learn. (1995) 20:273–97. doi: 10.1007/BF00994018
- 38. Ho TK. The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell. (1998) 20:832–44. doi: 10.1109/34.709601
- 39. Altman NS. An introduction to Kernel and nearest-neighbor nonparametric regression. Am Stat. (1992) 46:175–85. doi: 10.1080/00031305.1992.10475879
- 40. Lin E, Kuo P-H, Liu Y-L, Yu YWY, Yang AC, Tsai S-J. A deep learning approach for predicting antidepressant response in major depression using clinical and genetic biomarkers. Front Psychiatry. (2018) 9:290. doi: 10.3389/fpsyt.2018.00290
- 41. Kessler RC, van Loo HM, Wardenaar KJ, Bossarte RM, Brenner LA, Cai T, et al. Testing a machine-learning algorithm to predict the persistence and severity of major depressive disorder from baseline self-reports. Mol Psychiatry. (2016) 21:1366–71. doi: 10.1038/mp.2015.198
- 42. Tremblay A, St-Pierre S. The hyperphagic effect of a high-fat diet and alcohol intake persists after control for energy density. Am J Clin Nutr. (1996) 63:479–82. doi: 10.1093/ajcn/63.4.479
- 43. Tremblay A, Wouters E, Wenker M, St-Pierre S, Bouchard C, Després JP. Alcohol and a high-fat diet: a combination favoring overfeeding. Am J Clin Nutr. (1995) 62:639–44. doi: 10.1093/ajcn/62.3.639
- 44. Manari AP, Preedy VR, Peters TJ. Nutritional intake of hazardous drinkers and dependent alcoholics in the UK. Addict Biol. (2003) 8:201–10. doi: 10.1080/1355621031000117437
- 45. Grilo CM, Sinha R, O'Malley SS. Eating disorders and alcohol use disorders. Alcohol Res Health. (2002) 26:151.
- 46. Higuchi S, Suzuki K, Yamada K, Parrish K, Kono H. Alcoholics with eating disorders: prevalence and clinical course. A study from Japan. Br J Psychiatry. (1993) 162:403–6. doi: 10.1192/bjp.162.3.403
- 47. Beary MD, Lacey JH, Merry J. Alcoholism and eating disorders in women of fertile age. Br J Addict. (1986) 81:685–9. doi: 10.1111/j.1360-0443.1986.tb00389.x
- 48. Hudson JI, Weiss RD, Pope HG Jr, McElroy SK, Mirin SM. Eating disorders in hospitalized substance abusers. Am J Drug Alcohol Abuse. (1992) 18:75–85. doi: 10.3109/00952999209001613
- 49. Lilenfeld LR, Kaye WH. The link between alcoholism and eating disorders. Alcohol Health Res World. (1996) 20:94–9.
- 50. Peveler R, Fairburn C. Eating disorders in women who abuse alcohol. Br J Addict. (1990) 85:1633–8. doi: 10.1111/j.1360-0443.1990.tb01653.x
- 51. Sinha R, Robinson J, Merikangas K, Wilson GT, Rodin J, O'Malley S. Eating pathology among women with alcoholism and/or anxiety disorders. Alcohol Clin Exp Res. (1996) 20:1184–91. doi: 10.1111/j.1530-0277.1996.tb01109.x
- 52. Bruce K, Mansour S, Steiger H. Expectancies related to thinness, dietary restriction, eating, and alcohol consumption in women with bulimia nervosa. Int J Eating Disord. (2009) 42:253–8. doi: 10.1002/eat.20594
- 53. Ocampo Ortega R, Chapela B, Unikel Santoncini C. Disordered eating behaviors and binge drinking in female high-school students: the role of impulsivity. Salud mental. (2012) 35:83–9.
- 54. Smith DG, Robbins TW. The neurobiological underpinnings of obesity and binge eating: a rationale for adopting the food addiction model. Biol Psychiatry. (2013) 73:804–10. doi: 10.1016/j.biopsych.2012.08.026
- 55. De Oliveira e Silva ER, Foster D, McGee Harper M, Seidman CE, Smith JD, Breslow JL, et al. Alcohol consumption raises HDL cholesterol levels by increasing the transport rate of apolipoproteins A-I and A-II. Circulation. (2000) 102:2347–52. doi: 10.1161/01.CIR.102.19.2347
- 56. Lim JE, Kim JI, Lee SJ, Sull JW, Lee M, Jee SH. The associations between alcohol intake and HDL cholesterol subclasses in Korean population. J Lipid Atheroscler. (2012) 1:61–8. doi: 10.12997/jla.2012.1.2.61
- 57. Van de Wiel A. The effect of alcohol on postprandial and fasting triglycerides. Int J Vasc Med. (2012) 2012:862504. doi: 10.1155/2012/862504
- 58. Traversy G, Chaput J-P. Alcohol consumption and obesity: an update. Curr Obes Rep. (2015) 4:122–30. doi: 10.1007/s13679-014-0129-4
- 59. Butler L, Popkin BM, Poti JM. Associations of alcoholic beverage consumption with dietary intake, waist circumference, and body mass index in US adults: National Health and Nutrition Examination Survey 2003-2012. J Acad Nutr Diet. (2018) 118:409–20. doi: 10.1016/j.jand.2017.09.030
- 60. Abuse A, Alcoholism B. Biomarkers of heavy drinking. In: Allen JP, Wilson VB, editors. Assessing Alcohol Problems: A Guide for Clinicians and Researchers. Rockville, MD: U.S. Department of Health and Human Services, Public Health Service, National Institutes of Health, National Institute on Alcohol Abuse and Alcoholism (2003). p. 37.
- 61. van Loon M, Van der Mast RC, van der Linden MC, van Gaalen FA. Routine alcohol screening in the ED: unscreened patients have an increased risk for hazardous alcohol use. Emerg Med J. (2020) 37:206. doi: 10.1136/emermed-2019-208721
- 62. McHugh RK, Hearon BA, Otto MW. Cognitive behavioral therapy for substance use disorders. Psychiatr Clin. (2010) 33:511–25. doi: 10.1016/j.psc.2010.04.012