Skip to main content
eClinicalMedicine logoLink to eClinicalMedicine
. 2019 Jun 17;12:70–78. doi: 10.1016/j.eclinm.2019.05.008

Using Machine Learning to Classify Individuals With Alcohol Use Disorder Based on Treatment Seeking Status

Mary R Lee a,, Vignesh Sankar a, Aaron Hammer a, William G Kennedy b, Jennifer J Barb c, Philip G McQueen c, Lorenzo Leggio a,d
PMCID: PMC6677650  PMID: 31388665

Abstract

Objective

The authors used a decision tree classifier to reduce neuropsychological, behavioral and laboratory measures to a subset of measures that best predicted whether an individual with alcohol use disorder (AUD) seeks treatment.

Method

Clinical measures (N = 178) from 778 individuals with AUD were used to construct an alternating decision tree (ADT) with 10 measures that best classified individuals as treatment or not treatment-seeking for AUD. ADT's were validated by two methods: using cross-validation and an independent dataset (N = 236). For comparison, two other machine learning techniques were used as well as two linear models.

Results

The 10 measures in the ADT classifier were drinking behavior, depression and drinking-related psychological problems, as well as substance dependence. With cross-validation, the ADT classified 86% of individuals correctly. The ADT classified 78% of the independent dataset correctly. Only the simple logistic model was similar in accuracy; however, this model needed more than twice as many measures as ADT to classify at comparable accuracy.

Interpretation

While there has been emphasis on understanding differences between those with AUD and controls, it is also important to understand, within those with AUD, the features associated with clinically important outcomes. Since the majority of individuals with AUD do not receive treatment, it is important to understand the clinical features associated with treatment utilization; the ADT reported here correctly classified the majority of individuals with AUD with 10 clinically relevant measures, misclassifying < 7% of treatment seekers, while misclassifying 38% of non-treatment seekers. These individual clinically relevant measures can serve, potentially, as separate targets for treatment.

Funding

Funding for this work was provided by the Intramural Research Programs of the National Institute on Alcohol Abuse and Alcoholism (NIAAA), National Institute on Drug Abuse (NIDA) and the Center for Information Technology (CIT).

Research in Context

Evidence Before This Study: Less than 10% of persons who meet lifetime criteria for Alcohol Use Disorder (AUD) receive treatment. As the etiology of AUD represents a complex interaction between neurobiological, social, environmental and psychological factors, low treatment utilization likely stems from barriers on multiple levels. Given this issue, it is important from both a research and clinical standpoint to determine what characteristics are associated with treatment utilization in addition to merely asking individuals if they wish to enter treatment. At the level of clinical research, if there are phenotypic differences between treatment and nontreatment-seekers that directly influence outcomes of early-phase studies, these phenotypic differences are a potential confound in assessing the utility of an experimental treatment for AUD. At the level of clinical practice, distinguishing between treatment- and nontreatment-seekers may help facilitate a targeted treatment approach. Previous efforts to understand the differences between these populations of individuals with AUD leveraged the multidimensional data collected in clinical research settings for AUD that are not well suited to traditional regression methods.

Added Value of This Study: Alternating decision trees are well suited to deep-phenotyping data collected in clinical research settings as this approach handles nonparametric, skewed, and missing data whose relationships are nonlinear. This approach has proved to be superior in some cases to conventional clinical methods to solve diagnostic problems in medicine. We used a decision tree classifier to understand treatment- and non-treatment seeking group differences. The decision tree classifier approach chose a subset of factors arranged in an alternating decision tree that best predicts a given outcome. Assuming that the input measures are clinically relevant, the alternating decision tree that is generated has clinical value. Unlike other machine learning approaches, in addition to its predictive value, the nodes in the tree and their arrangement in a hierarchy have clinical utility. With the “if-then” logic of the tree, the clinician can learn what features become important and which recede in importance as the logic of the tree is followed. The decision tree classifier approach reduced 178 characterization measures (both categorical and continuous) in multiple domains to a decision tree comprised of 10 measures that together best classified subjects by treatment seeking status (yes/no).

Implications After All the Available Evidence: We leveraged a large data set comprised of 178 clinical measures and using the decision tree approach, we have reduced these to a subset of 10 measures that accurately classified individuals with alcohol dependence by treatment utilization. From this analysis, drinking behavior variables and depression measures are strong treatment seeking predictors. Having identified a cluster of factors that predicts treatment seeking, we can assess the influence of these factors directly on the clinical study outcome measures themselves. In clinical practice these factors can be separate targets for treatment. In clinical research, the group differences my directly influence research outcomes for treatment of AUD.

Keywords: Machine learning, Treatment utilization, Alcohol Use Disorder

1. Introduction

Alcohol Use Disorder (AUD), as defined in the Diagnostic and Statistical Manual of Mental disorders, fifth edition (DSM-5), is highly prevalent and is a leading cause of mortality and morbidity worldwide [1], [2]. Less than 10% of persons who meet lifetime criteria for AUD receive any type of treatment [3]. As the etiology of AUD represents a complex interaction between neurobiological, social, environmental and psychological factors, low treatment utilization likely stems from barriers on multiple levels. Large-scale epidemiological surveys identified multiple societal, demographic and personal factors influencing treatment-seeking status [4], [5]. However, the fact remains that the majority of people with AUD do not receive treatment [3]. Given this issue, it is important from both a research and clinical standpoint to determine what characteristics are associated with treatment utilization in addition to merely asking individuals if they wish to enter treatment.

In research settings, proof-of-concept human laboratory studies that include alcohol administration and/or self-administration are a cost-effective approach to study alcohol-related pharmacological and behavioral effects of alcohol and to screen new medications that have shown promise in animal studies [6]. In these human laboratory studies, alcohol may be administered with the study medication to determine the safety of the combination; in addition, early efficacy of the putative treatment on drinking or craving measures may be assessed. Given the ethical concern of administering alcohol to treatment-seeking patients, only nontreatment-seeking subjects are typically studied [7]. If there are phenotypic differences between treatment and nontreatment-seekers that directly influence outcomes of early-phase studies, these phenotypic differences are a potential confound in assessing the utility of an experimental treatment for AUD. Therefore, understanding the phenotypic differences between these populations provides the opportunity to take into account potential confounding variables in these early phase clinical studies.

Further, distinguishing between treatment- and nontreatment-seekers may help facilitate a targeted treatment approach [8]. This is consistent with recent attempts to use deep phenotyping to characterize neurofunctional domains in AUD [9]. Although recovery from AUD is possible without treatment [10], [11], [12], [13], the likelihood of this diminishes with age [10], [14]. Therefore, identifying those individuals reluctant to seek treatment for AUD early in the course of their illness is important since educational, behavioral and/or pharmacological interventions can be used in primary care or specialized settings to help patients recover. We found previously that there are personal attributes that differ between treatment compared to nontreatment-seeking individuals who were diagnosed with DSM-IV alcohol dependence (AD) [8]. These differences persisted even after controlling for drinking behavior. Perhaps these characteristics, such as impulsivity and depression, offer potential as independent targets for treatment. In this vein, the Project MATCH Research Group attempted to match 10 variables, (gender, AUD type, cognitive impairment, concept understanding, meaning seeking, motivation, psychiatric severity, sociopathy and support for drinking) defined a priori, to each of 3 treatment modalities for AUD. [15]. The results were disappointing as only one variable, psychiatric severity, significantly interacted with treatment assignment to reduce variance in drinking outcomes.

Previous efforts to understand the differences between these populations of individuals with AUD leveraged the multidimensional data collected in clinical research settings for AUD [8], [16]. However, these studies [8], [16] were descriptive and exploratory with no correction for multiple comparisons. This multidimensional data is comprised of measures that are relevant to the complex, heterogeneous, multifactorial attributes related to the etiology and clinical course of addiction [17]. In clinical research in the addiction field, there are a myriad of neuropsychosocial factors whose relationships are nonlinear. Traditional regression methods are not well suited to analyze such data given the high degree of collinearity between the variables in these datasets describing characteristics of individuals with a disorder that is heterogeneous in presentation. Decision trees, on the other hand, are well suited to this kind of clinical data because this approach handles nonparametric, skewed, missing data as well as dividing complex input data into subgroups that serve to simplify the relationship between input and outcome variables. This approach has been superior, in some cases, to conventional clinical methods to solve diagnostic problems in medicine [18].

Therefore, in the present study, we used a decision tree classifier to understand treatment- and non-treatment seeking group differences. The decision tree classifier approach chooses a subset of factors arranged in an alternating decision tree (ADT) that best predicts a given outcome. Assuming that the input measures are clinically relevant, the ADT that is generated also has clinical value. Unlike other machine learning approaches, in addition to its predictive value, the nodes in the tree and their arrangement in a hierarchy are understandable for the clinician. With the “if-then” logic of the tree, the clinician can learn what features become important and which recede in importance as the logic of the tree is followed. The decision tree classifier approach reduced 178 characterization measures (both categorical and continuous) in multiple domains (see Supplementary Material) to a decision tree comprised of 10 measures that together best classified subjects by treatment seeking status (yes/no). Given the gender imbalance in our sample, we also constructed separate decision trees for men and women to investigate possible biases associated with gender. Decision trees were validated by cross-validation on the training set of patients with AUD, and by validation on a second, independent dataset of individuals with AUD. For comparison to the ADT approach, two linear models (logistics and simple logistics) and two machine learning techniques (random forest and random tree) were also employed.

2. Methods

2.1. Recruitment

Data were combined from three screening and evaluation protocols conducted at the National Institute on Alcoholism and Alcohol Abuse (NIAAA) Intramural Research Program at the National Institutes of Health (NIH) Clinical Center in Bethesda, MD. Data were collected from 2008 to 2018, under three protocols which were approved by the appropriate NIH Institutional Review Board. All participants provided written informed consent. Under these protocols, participants were recruited via advertisement on university campuses, newspapers, internet and word of mouth. First, participants were assessed with a phone interview. Preference (yes/no) for treatment was first evaluated by a trained screener over this phone screening interview. The phone screen was also used to exclude individuals with the likelihood of a history of severe mental illness or substance dependence, other than alcohol or nicotine.

2.2. In-person Screen

Those individuals meeting these preliminary criteria were assessed during an out- or in-patient screening visit at the NIH Clinical Center, depending on their clinical needs. In the main dataset, participants (N = 778, N = 554 males) met DSM-IV criteria for AD (this corresponds to a DSM-5 diagnosis of moderate or severe Alcohol Use Disorder (AUD)). Preference (yes/no) for treatment was confirmed and determined by in-person clinician interview. For the independent validation dataset, treatment and nontreatment-seeking subjects (N = 236, 162 males) were recruited subsequent to those in the main dataset. They were evaluated with the same measures except they were assessed with the DSM-5 SCID. Table 1 includes demographics for the main and independent validation datasets.

Table 1.

Group differences in demographic characteristics between nontreatment-seeking and treatment-seeking research participants with alcohol dependence.

Primary data set

Validation data set

Nontreatment-seeking Treatment-seeking p-Value Nontreatment-seeking Treatment seeking p-Value
N 169 609 90 146
Age, years < 0.001a
Mean (SD) 42.5 (12.2) 43.1 (10.2) 38.2 (12.8) 47.8 (11.1)
Range 21.3–65.9 19.0–64.4 0.570a 21.1-61.0 23.1-68.9
Education, years 0.736a
Mean (SD) 13.5 (3.0) 13.8 (2.7) 14.3 (3.3) 13.7 (2.4)
Range 2–24 1–24 0.197 2–21 7–20
Intelligence quotient (IQ) 0.121a
Mean (SD) 91.6 (25.8) 97.4 (14.8) 102.1 (19.3) 97.8 (18.8)
Range 58–141 57–110 0.001 56–148 55–144
Gender 0.275b
 Female: n (%) 38 (23%) 186 (30%) 0.04b 32 (36%) 42 (29%)
 Male: n (%) 131 (77%) 423 (70%) 58 (64%) 104 (69%)
Race 0.001b
 African American: n (%) 134 (73%) 237 (39%) < 0.001b 58 (64%) 58 (40%)
 Caucasian: n (%) 34 (20%) 318 (52%) 23(26%) 69 (47%)
 Other: n (%) 11 (7%) 54 (9%) 9 (10%) 19(13%)
a

Independent samples t-test.

b

Chi-square.

Recent and lifetime alcohol consumption were assessed using the timeline follow-back (TLFB) [19] and lifetime drinking history [20], respectively. The Structured Clinical Interview for DSM Disorders (SCID) [21] was used to assess for psychiatric comorbidities and to confirm a DSM-IV diagnosis of Alcohol Dependence (AD) for the main dataset (N = 778). For the validation dataset, (N = 236), the SCID for DSM-5 was used. Only those subjects who met criteria for DSM-5 AUD, moderate or severe were included in the validation dataset. All subjects completed an assessment battery of measures in the following domains: cognitive (Wechsler Abbreviated Scale of Intelligence [22]), mood (Comprehensive Psychopathological Rating Scale (CPRS) [23]), impulsivity (Barratt Impulsivity Scale [24]/UPPS-P [25]), personality (NEO-5-PI-R [26]), aggression (Buss-Perry [27]), and early life stress and childhood trauma (ELSQ [28]/CTQ [29]).

Standard tests of clinical laboratory biomarkers related to AD were a hepatic panel: alanine aminotransferase (ALT); aspartate aminotransferase (AST); gamma-glutamyl transferase (GGT), alkaline phosphatase (Alk Phos), a mineral panel (calcium, magnesium, phosphorus, potassium and sodium), albumin, hemoglobin, mean corpuscular volume (MCV), hepatitis B antigen and hepatitis C antibody. Together, the in-person interview, the assessment battery and biomarkers provided a total of 178 measures in addition to their treatment seeking status.

2.3. Analysis

The alternating decision tree (ADT) method [30], [31], specifically, the Waikato Environment for Knowledge Analysis (WEKA) was used [32] to classify persons with respect to their treatment-seeking status for AUD. An ADT consists of splitter nodes, each of which is an assessment measure (either dichotomous or continuous); the splitter node gives rise to two predictor nodes which assign a numerical score for each branch from the splitter node (Figs. 1.a, 2.a, 3.a). An ADT is constructed using a boosting algorithm that starts with the predictive node of a weak hypothesis and adds more and more nodes to strengthen the classification [30]. Measurement of treatment seeking status is determined from a linear combination of all predictor nodes in the tree added to the root score. The result is a numerical score for a given subject, referred to here as the ADT score. The value of the numerical score indicates classification into treatment (< 0) or nontreatment (> 0) –seeking groups. The size of the absolute magnitude of the score is a measure of the confidence in the classification into one of the two groups. The further the score from zero, the more likely that the score correctly classifies an individual subject by treatment-seeking status. Because the sample was predominantly male, separate ADTs were constructed and assessed for men and women.

Fig. 1.

Fig. 1

Entire Dataset: a) Alternate Decision Tree (ADT) classifying N = 778 individuals with alcohol dependence (AD) as treatment or nontreatment-seekers. 10 characterization measures are noted in dark gray boxes. Classification: Treatment-seeking: total score < 0; Nontreatment-seeking: total score > 0. CPRS: Comprehensive Psychopathological Rating Scale; BMI: body mass index; IQ: Wechsler Abbreviated Scale of Intelligence. b) Confusion Matrix from Cross-validation with Training Subjects. c) Confusion Matrix from Test with Independent Data (N = 115). Tx = Treatment-seeking (bold arrow); Non-Tx = Not treatment-seeking.

Fig. 2.

Fig. 2

Males only: a) Alternate Decision Tree (ADT) classifying N = 554 males with alcohol dependence (AD) as treatment or nontreatment-seekers. 10 characterization measures are noted in dark gray boxes. Classification: Treatment-seeking: total score < 0; Nontreatment-seeking: total score > 0. CPRS: Comprehensive Psychopathological Rating Scale; IQ: Wechsler Abbreviated Scale of Intelligence. b) Confusion Matrix from Cross-validation with Training Subjects. c) Confusion Matrix from Test with Independent Data (N = 79, males only). Tx = Treatment-seeking (bold arrow); Non-Tx = Not treatment-seeking.

Fig. 3.

Fig. 3

Females only: a) Alternate Decision Tree (ADT) classifying N = 224 females with alcohol dependence (AD) as treatment or nontreatment-seekers. 10 characterization measures are noted in dark gray boxes. Classification: Treatment-seeking: total score < 0; Nontreatment-seeking: total score > 0. CPRS: Comprehensive Psychopathological Rating Scale; GGT: gamma-glutamyl transferase; Alk Phos: alkaline phosphatase. b) Confusion Matrix from Cross-validation with Training Subjects. c) Confusion Matrix from Test with Independent Data (N = 36, females only). Tx = Treatment-seeking (bold arrow); Non-Tx = Not treatment-seeking.

Two different approaches were used to assess the accuracy of the ADT to classify subjects. In the first approach, the main data set described above (N = 778) was used in a cross-validation procedure. (In this report, the main data set is the training set for all the comparison methods discussEd.) The 179 attributes were entered into the WEKA tool: one attribute was treatment seeking status and the other 178 attributes were used to train the ADT. The accuracy was found by constructing ADT's with data from 10 randomly-chosen datasets. Each dataset consisted of 90% of the subjects. The accuracy of the ADT in each of the 10 datasets was tested on the remaining 10% of subjects. The reported accuracy was the average over the ten cycles. For the second approach, data from an independent group (the “validation group”) of subjects (N = 236) who were recruited after those in the main group were recruited. This independent group had moderate or severe AUD (DSM-5) and this group was used for testing the classification accuracy of the ADT that was constructed from the main dataset. (In this report, this second independent dataset is the validation set for all the comparison methods discussEd.)

In order to assess how the ADT scores related to prediction status, a script was written in the JMP Statistical Discovery software version 11.1 (SAS headquarters, Cary, NC) using the ADT node values to calculate the ADT score for each individual patient in the independent validation dataset. Thus, this script can be used to predict whether an individual patient is likely to seek treatment or not. The script prompts the user to enter the 10 clinical variable parameters in the ADT and calculates an ADT score (according to the ADT used to generate the script, see Figs. 1.a, 2.a and 3.a). If one of the 10 clinical variable measures was missing for a given subject, then the weight for that variable was ignored in the final ADT score calculation.

To assess the power of the ADT method, four other classifier algorithms available in the WEKA tool were considered for comparison: [1] random forest, [2], random tree, [3] logistic regression, and [4] simple logistics model. The random forest method [33] started by constructing 100 decision trees using a subset of attributes chosen at random. The number of attributes in each tree was determined by: log2 (178) + 1, or 8 attributes. Classification of an individual subject was determined by the majority declaration of the trees. The random tree approach used a random search technique to construct a single large decision tree, in this case, with 576 nodes. The WEKA implementation of the logistic regression method included a “bridge estimator” to control convergence of the logistic probabilities [34]. The simple logistics model was similar to logistic regression, except that only those variables that have the most effect on the least square error were kept [35].

3. Results

For the main dataset of 778 subjects, ADT found 10 measures that classified persons by treatment seeking status. These measures reflected quantity and quality of drinking, depression and drinking-related psychological problems, IQ, race, BMI as well as substance dependence (Fig. 1.a). Specifically, current alcohol consumption, measured in number of drinks in past 30 days (corresponding to ~>8 drinks per day), any current substance dependence as assessed by DSM-IV, drinking despite psychological problems and alcohol abuse all classified individuals toward treatment seeking. Similarly, BMI < 25.5, an IQ > 87.5, depression score on the CPRS depression subscale, non-African American race classified toward treatment seeking. Elevated serum iron, GGT which are generally associated with excessive alcohol consumption were also associated with treatment seeking. Using the cross-validation method, 674/778 (86%) were classified correctly (Fig. 1.b). With the independent dataset (N = 236), 88/236 (78%) were correctly classified (Fig. 1.c). ADT scores generated for individual subjects in the independent validation dataset, (tabulated using a R script), ranged from − 5.2 to 2.8.

The ADT for the males of the main sample (N = 554) (Fig. 2.a), contained nodes that were almost identical to that the non-gender-specific tree (Fig. 1.a), with the addition of sensation seeking from the NEO-5-PI-R as a predictor node (low sensation seeking associated with treatment seeking) and absence of race and BMI as nodes. The correct classification rate using cross-validation for males was 83% (Fig. 2.b). With the independent validation dataset (N = 162 males), the correct classification rate was 77% (Fig. 2.c). For this ADT, the scores for the males in the independent validation dataset ranged from − 4.9 to 2.6.

The ADT for females of the main sample (N = 224) (Fig. 3.a) included nodes for personality measures of self-consciousness and altruism, both of which were positively associated with treatment seeking. A history of sexual abuse was also associated with treatment seeking, as was the number of dependence criteria. The female only ADT classified 90% of females correctly using cross-validation (Fig. 3.b). With the independent dataset, (N = 74 females), it classified almost 72% correctly (Fig. 3.c). For this ADT, the scores for the female patients in the independent dataset ranged from − 7.3 to 2.7

To test whether large ADT scores were associated with a higher confidence in the classification, the absolute values of the non-gender-specific ADT scores from the independent validation dataset (ADT_absscore) were grouped into 4 categories, as shown in Fig. 4. The distribution of ADT scores for the independent validation dataset is shown in Fig. 4.a. The proportion of subjects, in the independent validation data set, whose treatment seeking status was correctly assigned in each ADT score category for all subjects is shown in Fig. 4.b. Nearly 40% of patients with ADT_absscore < 1 were mis-classified. We did not find a clear pattern of amount of missing data vs mis-classification. Fig. S1 shows the number of missing clinical variable entries per subject for the independent validation data set for each category of ADT score, in both correctly and incorrectly assigned groups.

Fig. 4.

Fig. 4

Magnitude of ADT score for the independent validation dataset (N = 236) vs a) number of subjects correctly and incorrectly assigned to treatment seeking status in the independent validation dataset for all subjects; b) proportion of subjects correctly classified in each ADT score group.

Results of comparison analyses are summarized in Table 2. Each alternative method was inferior to the ADT method with respect to accuracy and kappa statistic as a classifier, except for the simple logistic model. In addition, the comparison decision tree approaches produced either a single large, complex decision tree with 576 nodes or a large number of small decision trees. Accordingly, neither random tree, random forest nor logistic regression was clinically useful. In contrast, the ADT approach did aim to reduce the large number of measures from the dataset into a streamlined battery composed of a subset of measures. The simple logistic model approached the ADT in accuracy and kappa statistic, but it required 24 variables (derived from 8 instruments requiring approximately 4–5 h testing time) to make a prediction. This is in contrast to 4 instruments in the ADT that can be completed in about 1½ hours.

Table 2.

Accuracy and kappa statistic for each Random Forest, Random Tree and a logistic model compared to the ADT approach (bold).

Cross validation
Independent validation
Accuracy Kappa Accuracy Kappa
Random forest 84.8% 0.43 73.3% 0.36
Random tree 77.9% 0.37 70.0% 0.33
Logistic model 78.0% 0.40 72.5% 0.42
Simple logistic model 85.9% 0.57 77.5% 0.50
ADT 86.1% 0.57 78.0% 0.51

4. Discussion

We used a WEKA data mining tool to reduce 178 clinical measures obtained in a clinical research setting to a subset of 10 that correctly classified over 85% of individuals with alcohol dependence (DSM-IV) by treatment seeking status (Fig. 1.b). These 10 measures became splitter nodes in an ADT that was used to predict treatment-seeking status in an independent dataset of subjects with an equivalent DSM-5 diagnosis: moderate or severe AUD (N = 236). Validation using an independent dataset is considered the gold standard for machine learning-generated classification systems [36]. With this independent dataset, subjects were classified with high accuracy, namely correct classification of 78% of individuals (Fig. 1.c). The four other comparison classification methods, two machine-learning and two logistic-type methods yielded either inferior accuracy and kappa statistic or required many more measures in order to classify.

The 10 nodes used by the ADT were not defined a priori, but were found by the hypothesis-free, data-driven ADT algorithm to best classify the group. The 10 splitter nodes in the ADT for the 778 individuals (Fig. 1.a), focused largely on drinking behavior, psychological symptoms and substance addiction: drinks per 30 days, drinking-related psychological problems, number of DSM-IV alcohol abuse criteria met, current substance dependence, as well as laboratory values of GGT and serum iron whose elevations are associated with excessive drinking. Of note, the 10 splitter nodes in the ADT that best classified treatment-seekers overlapped with measures that were significantly different between treatment- and nontreatment-seeking groups in the previous exploratory study [8], and the direction of the effect remained the same. Previous studies using data from community samples also found that treatment utilization was associated with comorbid mood, personality and drug use disorders [10] [37]. Notably, the factors from these community studies such as mood and substance use were also nodes in the ADT generated from the current study's research population despite the fact that in the clinical research setting, subjects with comorbid mental illness and/or substance use disorders are less prevalent.

Studies on racial disparities in treatment utilization for AUD are mixed with some indicating racial disparities and others indicating no racial differences in treatment utilization [38]. In the present study, race was one of the 10 nodes in the ADT and African American race was associated with not seeking treatment. However, following the logic of the ADT, the latter was true only in the context of a higher level of depression symptoms. It did not play a classifying role in the context of lower depression scores. Further, when race was omitted as one of the 178 measures, the classification accuracy, kappa value and nodes in the ADT were essentially unchanged indicating that race was not a strong classifier of treatment utilization, in contrast to depression or drinking behavior.

For datasets where the group distribution is skewed, as is the case in the present study, (treatment- vs nontreatment seekers in the main dataset was 78 vs 22%; in the validation dataset, 62 vs 38%, respectively) the kappa statistic is considered a better evaluation of the accuracy of the classifier compared to the percentage of cases classified accurately. The kappa statistic compares the observed accuracy (proportion of cases classified accurately) to random chance (expected accuracy). The kappa statistics for the main dataset confusion matrix (Fig. 1.b) and for the validation dataset matrix (Fig. 1.c) were in the ‘moderate’ agreement range [39] compared to the random forest and random tree methods which were in ‘fair’ agreement. Kappa statistic for the two logistic methods was in the moderate ranges. The ADT generated using the WEKA program tended to overclassify nontreatment seekers as treatment seekers. Importantly, it misclassified very few treatment seekers (7%). Further, the balanced accuracy, of the main and validation datasets was 78 and 73%, respectively which denotes a classification accuracy of the majority-approximately 3 out of 4 subjects-overall.

Due to the gender imbalance in the sample, we constructed separate ADT's for males and females. The ADT for women consisted of distinctly different measures (nodes) that predicted treatment seeking for females. Different personality factors from the NEO-5-PI-R appeared in each gender-specific ADT. For men, sensation seeking (subscale of the extraversion factor of the NEO-5-PI-R) was one of 10 nodes and was associated with treatment seeking. Otherwise the male ADT was minimally changed from the ADT for the entire sample. In contrast, the female-only ADT included NEO-5-PI-R subfactors, altruism and self-consciousness, both of which were positively associated with treatment seeking. In the female ADT, a history of sexual abuse was a node and associated with treatment seeking. IQ was not represented in the ADT for women whereas depression as measured by CPRS appeared in 2 successive nodes. While the ADT for women alone was comprised of different nodes, as described, the classification accuracy of the ADT for women patient in the independent dataset (N = 74) was inferior (Fig. 3.c) to that of the non-gender-specific ADT constructed from the main dataset.

In the previous descriptive paper [8] many differences between treatment and nontreatment seekers with AUD persisted even when controlling for drinking patterns and quantity of alcohol consumption. When we removed these drinking measures from the analysis, the tree had many of the same nodes, but the kappa statistic when evaluating the independent data set fell to chance. There was the similar result when CPRS depression was removed from the analysis. This indicates that, although some of the 10 variables in the ADT are moderately correlated with each other (Supplementary Material), for the ADT analysis, (and possibly in clinical practice applications), drinking behavior variables and depression measures persist as strong treatment seeking predictors.

Machine learning approaches have been applied to guide clinical care in the treatment of alcohol use disorder [40]and substance use disorder [41]. Specifically, Connor and colleagues, compared a decision tree approach to conventional linear techniques to predict relapse after treatment for AUD using demographic, addiction severity, drinking behavior, craving, health and psychological data. Their classification accuracy (77%)[41]with a decision tree approach and validation with an independent dataset was similar to that reported here (78%,Fig. 1.c)With conventional techniques, the prediction accuracy was less than chance which is also similar to what we found when applying logistic regression to the main dataset.

In our clinical research setting for AUD, beyond simply asking patients whether they wish treatment for AUD, we examined with a broad array of clinical measures in multiple domains. Then we used these measures to construct an ADT that identified a cluster of factors that predicted treatment seeking. We can then assess the influence of these factors directly on the clinical study outcome measures of interest in the AUD clinical research setting. For example, we have identified that a higher level of drinking, other substance dependence, depression and IQ is associated with seeking treatment. Therefore, nontreatment-seeking individuals who participate in human lab studies are, by inference, less depressed, drink less, have less comorbid substance dependence and lower IQ. These characteristics may affect the outcome of the experimental intervention such as alcohol consumption or cue induced craving; therefore, the conclusions regarding clinical effectiveness drawn from these studies may not be generalizable to other populations with AUD. The extent to which these characteristics can interact with the mechanisms underlying a given experimental intervention should be considered as a source of variance in this domain of clinical research.

Despite the machine learning approach, this analysis has several limitations. The subjects were predominantly male, therefore, conclusions regarding females should be validated with a larger dataset of females. The dataset was imbalanced with respect to treatment- and nontreatment-seekers; this undoubtedly contributed to the poor accuracy classifying the nontreatment compared to treatment seekers. It is important to follow up on this finding with a more balanced dataset to determine whether this approach performs well to classify each group. With a more numerically balanced dataset, data from the clinical research setting can be leveraged to inform treatment in the primary care setting as the clinical research setting is composed of individuals with AUD who seek or do not seek treatment. This is in contrast to community AUD treatment programs where there are minimal numbers of nontreatment seekers. In effect, the AUD patients in a clinical research setting may be more representative of AUD patients presenting to the primary care setting, notwithstanding the different motivations underlying attendance at a primary care vs clinical research clinic.

Although the nodes closest to the root might be construed as being more important for classification, one should really make inferences about the likelihood of treatment seeking status by considering the magnitude of the absolute values of the score, which is a linear combination of the value of many nodes. This is a novel approach to identify characteristics associated with treatment utilization for AUD. The ADT machine learning technique misclassified very few treatment seekers; therefore, the clinical characteristics identified herein are important features of this group that can be used perhaps to guide patient management and treatment. In addition, the structure of the ADT can indicate how groupings of measures could indicate sub-phenotypes of AUD along the treatment-seeking spectrum.

The following are the supplementary data related to this article.

Supplementary material

mmc1.docx (39.8KB, docx)
Fig. S1 legend

Magnitude of ADT score for the independent validation dataset (N = 236) versus number of missing entries per subject.

mmc2.pptx (225.9KB, pptx)

Author Contributions

MRL, VS, AH collected the data, researched literature and prepared figures. MRL, WGK, JJB, PGM interpreted and analyzed the data. MRL, JJB, PGM, LL prepared the manuscript. All authors read and approved the final version of the manuscript.

Declaration of Competing Interest

Dr. Leggio reports other from MCA, UK, other from Routledge, outside the submitted work. The other authors declare that they have nothing to disclose. The content of this manuscript is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Acknowledgments

Acknowledgments

The authors gratefully acknowledge the National Institute on Alcohol Abuse and Alcoholism (NIAAA) and Clinical Center clinical and research staff members involved in data collection support. The authors are also grateful to Dr. Melanie Schwandt for compiling the NIAAA assessment data set.

Funding

MRL, VS, AH, and LL are supported by National Institutes of Health (NIH) intramural funding ZIA-AA000218 (Section on Clinical Psychoneuroendocrinology and Neuropsychopharmacology; PI: Dr. Lorenzo Leggio), jointly supported by the Division of Intramural Clinical and Biological Research of the National Institute on Alcohol Abuse and Alcoholism (NIAAA) and the Intramural Research Program of the National Institute on Drug Abuse (NIDA). JJB and PGM are supported by the Intramural Research Program of the NIH, Center for Information Technology. The funders had no role in study design, data collection, data analysis, interpretation, or writing of the report.

References

  • 1.Grant B.F., Goldstein R.B., Saha T.D., Chou S.P., Jung J., Zhang H. Epidemiology of DSM-5 alcohol use disorder: results from the National Epidemiologic Survey on Alcohol and Related Conditions III. JAMA Psychiat. 2015;72(8):757–766. doi: 10.1001/jamapsychiatry.2015.0584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Laramee P., Leonard S., Buchanan-Hughes A., Warnakula S., Daeppen J.B., Rehm J. Risk of all-cause mortality in alcohol-dependent individuals: a systematic literature review and meta-analysis. EBioMedicine. 2015;2(10):1394–1404. doi: 10.1016/j.ebiom.2015.08.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Abuse S. Mental Health Services Administration (SAMHSA) US Department of Health and Human Services, SAMHSA; 2017. Key substance use and mental health indicators in the United States: results from the 2015 national survey on drug use and health. Technical report SMA 17-5044. [Google Scholar]
  • 4.Grant B.F. Barriers to alcoholism treatment: reasons for not seeking treatment in a general population sample. J Stud Alcohol. 1997;58(4):365–371. doi: 10.15288/jsa.1997.58.365. [DOI] [PubMed] [Google Scholar]
  • 5.Zemore S.E., Mulia N., Ye Y., Borges G., Greenfield T.K. Gender, acculturation, and other barriers to alcohol treatment utilization among Latinos in three National Alcohol Surveys. J Subst Abuse Treat. 2009;36(4):446–456. doi: 10.1016/j.jsat.2008.09.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Litten R.Z., Falk D.E., Ryan M.L., Fertig J., Leggio L. Advances in pharmacotherapy development: human clinical studies. Handb Exp Pharmacol. 2018:579–613. doi: 10.1007/164_2017_79. [DOI] [PubMed] [Google Scholar]
  • 7.Acion L., Zwick J., Rojas Saunero L.P., Arndt S. Distinctions between seeking-and non-seeking-treatment research participants: implications for clinical trials effectiveness. Am J Drug Alcohol Abuse. 2017;43(6):628–630. doi: 10.1080/00952990.2017.1339712. [DOI] [PubMed] [Google Scholar]
  • 8.Rohn M.C., Lee M.R., Kleuter S.B., Schwandt M.L., Falk D.E., Leggio L. Differences between treatment-seeking and nontreatment-seeking alcohol-dependent research participants: an exploratory analysis. Alcohol Clin Exp Res. 2017;41(2):414–420. doi: 10.1111/acer.13304. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Kwako L.E., Schwandt M.L., Ramchandani V.A., Diazgranados N., Koob G.F., Volkow N.D. Neurofunctional domains derived from deep behavioral phenotyping in alcohol use disorder. Am J Psychiatry. 2019 doi: 10.1176/appi.ajp.2018.18030357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Cohen E., Feinn R., Arias A., Kranzler H.R. Alcohol treatment utilization: findings from the National Epidemiologic Survey on Alcohol and Related Conditions. Drug Alcohol Depend. 2007;86(2):214–221. doi: 10.1016/j.drugalcdep.2006.06.008. [DOI] [PubMed] [Google Scholar]
  • 11.Dawson D.A., Grant B.F., Stinson F.S., Chou P.S., Huang B., Ruan W. Recovery from DSM-IV alcohol dependence: United States, 2001–2002. Addiction. 2005;100(3):281–292. doi: 10.1111/j.1360-0443.2004.00964.x. [DOI] [PubMed] [Google Scholar]
  • 12.Dawson D.A. Correlates of past-year status among treated and untreated persons with former alcohol dependence: United States, 1992. Alcohol Clin Exp Res. 1996;20(4):771–779. doi: 10.1111/j.1530-0277.1996.tb01685.x. [DOI] [PubMed] [Google Scholar]
  • 13.Bischof G., Rumpf H.J., Meyer C., Hapke U., John U. Influence of psychiatric comorbidity in alcohol-dependent subjects in a representative population survey on treatment utilization and natural recovery. Addiction. 2005;100(3):405–413. doi: 10.1111/j.1360-0443.2005.01008.x. [DOI] [PubMed] [Google Scholar]
  • 14.Hermos J.A., Locastro J.S., Glynn R.J., Bouchard G.R., De Labry L.O. Predictors of reduction and cessation of drinking in community-dwelling men: results from the normative aging study. J Stud Alcohol. 1988;49(4):363–368. doi: 10.15288/jsa.1988.49.363. [DOI] [PubMed] [Google Scholar]
  • 15.Allen J.P., Mattson M., Miller W., Tonigan J., Connors G., Rychtarik R. Matching alcoholism treatments to client heterogeneity. J Stud Alcohol. 1997;58(1):7–29. [PubMed] [Google Scholar]
  • 16.Ray L.A., Bujarski S., Yardley M.M., Roche D.J., Hartwell E.E. Differences between treatment-seeking and non-treatment-seeking participants in medication studies for alcoholism: do they matter? Am J Drug Alcohol Abuse. 2017:1–8. doi: 10.1080/00952990.2017.1312423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Kwako L.E., Momenan R., Litten R.Z., Koob G.F., Goldman D. Addictions neuroclinical assessment: a neuroscience-based framework for addictive disorders. Biol Psychiatry. 2016;80(3):179–189. doi: 10.1016/j.biopsych.2015.10.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Kononenko I. Machine learning for medical diagnosis: history, state of the art and perspective. Artif Intell Med. 2001;23(1):89–109. doi: 10.1016/s0933-3657(01)00077-x. [DOI] [PubMed] [Google Scholar]
  • 19.Sobell L.C., Sobell M.B., Leo G.I., Cancilla A. Reliability of a timeline method: assessing normal drinkers' reports of recent drinking and a comparative evaluation across several populations. Br J Addict. 1988;83(4):393–402. doi: 10.1111/j.1360-0443.1988.tb00485.x. [DOI] [PubMed] [Google Scholar]
  • 20.Skinner H.A., Sheu W.-J. Reliability of alcohol use indices. The lifetime drinking history and the MAST. J Stud Alcohol. 1982;43(11):1157–1170. doi: 10.15288/jsa.1982.43.1157. [DOI] [PubMed] [Google Scholar]
  • 21.First M., Spitzer R., Gibbon M., Williams J.B. Biometrics Research, New York State Psychiatric Institute; New York: 2002. Structured clinical interview for DSM-IV-TR axis I disorders, research version, non-patient edition. (SCID-I/NP) [November] [Google Scholar]
  • 22.Wechsler D. Psychological Corporation; 1999. Wechsler abbreviated scale of intelligence. [Google Scholar]
  • 23.Mattila-Evenden M., Svanborg P., Gustavsson P., Asberg M. Determinants of self-rating and expert rating concordance in psychiatric out-patients, using the affective subscales of the CPRS. Acta Psychiatr Scand. 1996;94(6):386–396. doi: 10.1111/j.1600-0447.1996.tb09879.x. [DOI] [PubMed] [Google Scholar]
  • 24.Patton J.H., Stanford M.S., Barratt E.S. Factor structure of the Barratt impulsiveness scale. J Clin Psychol. 1995;51(6):768–774. doi: 10.1002/1097-4679(199511)51:6<768::aid-jclp2270510607>3.0.co;2-1. [DOI] [PubMed] [Google Scholar]
  • 25.Lynam D.R., Smith G.T., Whiteside S.P., Cyders M.A. Purdue University; West Lafayette, IN: 2006. The UPPS-P: assessing five personality pathways to impulsive behavior. [Google Scholar]
  • 26.Costa P.T., McCrae R.R. Normal personality assessment in clinical practice: the NEO personality inventory. Psychol Assess. 1992;4(1):5. [Google Scholar]
  • 27.Buss A.H., Perry M. The aggression questionnaire. J Pers Soc Psychol. 1992;63(3):452–459. doi: 10.1037//0022-3514.63.3.452. [DOI] [PubMed] [Google Scholar]
  • 28.McFarlane A., Clark C.R., Bryant R.A., Williams L.M., Niaura R., Paul R.H. The impact of early life stress on psychophysiological, personality and behavioral measures in 740 non-clinical subjects. J Integr Neurosci. 2005;4(1):27–40. doi: 10.1142/s0219635205000689. [DOI] [PubMed] [Google Scholar]
  • 29.Bernstein D.P., Stein J.A., Newcomb M.D., Walker E., Pogge D., Ahluvalia T. Development and validation of a brief screening version of the Childhood Trauma Questionnaire. Child Abuse Negl. 2003;27(2):169–190. doi: 10.1016/s0145-2134(02)00541-0. [DOI] [PubMed] [Google Scholar]
  • 30.Freund Y., Mason L., editors. The alternating decision tree learning algorithm. icml. 1999. [Google Scholar]
  • 31.Pfahringer B., Holmes G., Kirkby R., editors. Pacific-Asia conference on knowledge discovery and data mining. Springer; 2001. Optimizing the induction of alternating decision trees. [Google Scholar]
  • 32.Hall M., Frank E., Holmes G., Pfahringer B., Reutemann P., Witten I.H. The WEKA data mining software: an update. SIGKDD Explor Newslett. 2009;11(1):10–18. [Google Scholar]
  • 33.Breiman L. Random forests. Mach Learn. 2001;45(1):5–32. [Google Scholar]
  • 34.Le Cessie S., Van Houwelingen J.C. Ridge estimators in logistic regression. Appl Stat. 1992:191–201. [Google Scholar]
  • 35.Landwehr N., Hall M., Frank E. Logistic model trees. Mach Learn. 2005;59(1–2):161–205. [Google Scholar]
  • 36.Kohavi R., editor. A study of cross-validation and bootstrap for accuracy estimation and model selectionIjcai. 1995 [Montreal, Canada] [Google Scholar]
  • 37.Ilgen M.A., Price A.M., Burnett-Zeigler I., Perron B., Islam K., Bohnert A.S. Longitudinal predictors of addictions treatment utilization in treatment-naïve adults with alcohol use disorders. Drug Alcohol Depend. 2011;113(2):215–221. doi: 10.1016/j.drugalcdep.2010.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Weisner C., Schmidt L.A. Rethinking access to alcohol treatment. Recent Dev Alcohol. 2001;15:107–136. doi: 10.1007/978-0-306-47193-3_7. [DOI] [PubMed] [Google Scholar]
  • 39.Landis J.R., Koch G.G. An application of hierarchical kappa-type statistics in the assessment of majority agreement among multiple observers. Biometrics. 1977:363–374. [PubMed] [Google Scholar]
  • 40.Connor J., Symons M., Feeney G., Young R.M., Wiles J. The application of machine learning techniques as an adjunct to clinical decision making in alcohol dependence treatment. Subst Use Misuse. 2007;42(14):2193–2206. doi: 10.1080/10826080701658125. [DOI] [PubMed] [Google Scholar]
  • 41.Acion L., Kelmansky D., van der Laan M., Sahker E., Jones D., Arndt S. Use of a machine learning framework to predict substance use disorder treatment success. PLoS One. 2017;12(4) doi: 10.1371/journal.pone.0175383. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

mmc1.docx (39.8KB, docx)
Fig. S1 legend

Magnitude of ADT score for the independent validation dataset (N = 236) versus number of missing entries per subject.

mmc2.pptx (225.9KB, pptx)

Articles from EClinicalMedicine are provided here courtesy of Elsevier

RESOURCES