Development and internal validation of a depression severity prediction model for tinnitus patients based on questionnaire responses and socio-demographics

Uli Niemann; Petra Brueggemann; Benjamin Boecking; Birgit Mazurek; Myra Spiliopoulou

doi:10.1038/s41598-020-61593-z

2020 Mar 13;10:4664. doi: 10.1038/s41598-020-61593-z

PMCID: PMC7069984 PMID: 32170136

Abstract

Tinnitus is a complex condition that is associated with major psychological and economic impairments – partly through various comorbidities such as depression. Understanding the interaction between tinnitus and depression may thus improve either symptom cluster’s prevention, diagnosis and treatment. In this study, we developed and validated a machine learning model to predict depression severity after outpatient therapy (T1) based on variables obtained before therapy (T0). 1,490 patients with chronic tinnitus (comorbid major depressive disorder: 52.2%) who completed a 7-day multimodal treatment encompassing tinnitus-specific components, cognitive behavioural therapy, physiotherapy and informational counselling were included. 185 variables were extracted from self-report questionnaires and socio-demographic data acquired at T0. We used 11 classification methods to train models that reliably separate between subclinical and clinical depression at T1 as measured by the general depression questionnaire. To ensure highly predictive and robust classifiers, we tuned algorithm hyperparameters in a 10-fold cross-validation scheme. To reduce model complexity and improve interpretability, we wrapped model training around an incremental feature selection mechanism that retained features that contributed to model prediction. We identified a LASSO model that included all 185 features to yield highest predictive performance (AUC = 0.87 ± 0.04). Through our feature selection wrapper, we identified a LASSO model with good trade-off between predictive performance and interpretability that used only 6 features (AUC = 0.85 ± 0.05). Thus, predictive machine learning models can lead to a better understanding of depression in tinnitus patients, and contribute to the selection of suitable therapeutic strategies and concise and valid questionnaire design for patients with chronic tinnitus with or without comorbid major depressive disorder.

Subject terms: Depression, Signs and symptoms

Introduction

Tinnitus denotes the audiological phantom perception of a sound in the absence of an external source¹. Tinnitus is a common, yet highly severe worldwide health problem that substantially affects quality of life for millions of people^2,3. European studies estimate a tinnitus prevalence between 12% and 30%⁴. Besides potential hearing loss⁵, chronic tinnitus is associated with psychological epiphenomena, including anxiety^4,6, other somatoform disorders^7,8, insomnia⁹ and, first and foremost, depression^10–12. Prevalence rates of depression in patients with chronic tinnitus differ considerably, ranging from 14%¹³, to 25.6%¹⁴ up to 59.3%¹⁵. In clinical practice, it is often difficult to identify whether a depression symptomatology leads to higher tinnitus distress, or whether a higher tinnitus distress causes a persistent depressive mood. The question of comorbid depression in chronic tinnitus is hence of vital interest – both regarding the conceptualisation and measurement of distress, as well as the identification of possible obstacles to tinnitus-treatment in the face of major depressive disorder. Therefore, it is important to identify the set of variables that should be assessed at baseline to predict clinically relevant depression in tinnitus patients.

At first visit in an outpatient clinic, patients with (chronic) tinnitus usually undergo comprehensive medical and psychological assessments concerning tinnitus distress, loudness and frequency as well as the presence and severity of psychological distress. However, completing multiple lengthy questionnaires can be tedious and cumbersome for patients – often at the expense of accuracy. Hence, it is of interest to identify the most relevant questions clinicians should focus on – thereby reducing the overall amount of questions within a questionnaire. Reducing the burden of questionnaire completion may improve the quality of answers and thus, the assessment’s accuracy.

The traditional approach to extracting the most important questionnaire items requires medical researchers to carefully formulate hypotheses on the relationship between one or more independent variables and the outcome, which are subsequently validated statistically. However, given the increasingly large volume of data assessed for each patient, this approach becomes impractical, since it is very likely to miss important observations. Hence, in this study we use machine learning to generate new hypotheses automatically: we build an accurate prediction model that captures the inherent relationships between its features (the independent variables) and a defined outcome (the dependent variable). The quality and interpretability of such models depend considerably on the selection of relevant features. Ideally, a model is highly accurate while using only a small number of features. Often, there is a trade-off between a complex, highly predictive model and a less complex, yet more interpretable and generalisable model that uses fewer features.

While machine learning methods have been extensively used to develop prediction models for depression, e.g. in diabetes patients¹⁶ and in the general population¹⁷, we particularly focus on patients with chronic tinnitus.

In this study, we use novel machine learning algorithms (a) to create an accurate model for depression severity after treatment using data extracted from questionnaire answers before treatment, and (b) to minimise the set of predictive features by incrementally removing features that do not contribute to predictive performance. By trading off high predictive accuracy against low model complexity, our results can help to identify the most important questions that patients may need to answer to accurately assess their depression status.

Methods

We extracted 185 features from 7 tinnitus-related questionnaires and socio-demographic data for a cohort of 1,490 patients during screening. For these patients, we computed the depression severity after treatment (which lasted 7 days). For prediction of depression severity after treatment, we used the workflow depicted in Fig. 1.

Features

We used a total of 185 features for data analysis, including single items, sub-scales and total scales from 7 questionnaires: (a) General Depression Scale - long form^18,19 (“Allgemeine Depressionsskala” - Langform; ADSL), (b) Perceived Stress Questionnaire²⁰ (PSQ), (c) Short Form 8 Health Survey²¹ (SF8), (d) German version of the Tinnitus Questionnaire²² (TQ), (e) Tinnitus Localisation and Quality²³ (TLQ), (f) visual analogue scales measuring tinnitus loudness, frequency and distress (TINSKAL), and (g) a sociodemographics questionnaire²⁴ (SOZK). Most questionnaire items comprised multiple-choice questions with answers on a Likert scale. The associated ordinal features were handled as numerical features in the analysis. Categorical features, e.g. sex, marital status and graduation, were binarised using one-hot encoding. A brief overview of all features is provided in Supplementary-A.
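The encoding described above can be sketched in a few lines. This is a minimal illustration, not the authors' preprocessing code: the function name `one_hot_encode`, the record layout, and the example values are assumptions made for the sketch. Ordinal Likert answers are passed through as numbers, while each categorical feature is expanded into 0/1 indicator columns.

```python
def one_hot_encode(records, categorical_keys):
    """Expand each categorical key into 0/1 indicator columns; copy other keys unchanged."""
    # Collect the observed levels of every categorical feature (sorted for stable column order).
    levels = {
        key: sorted({rec[key] for rec in records})
        for key in categorical_keys
    }
    encoded = []
    for rec in records:
        row = {}
        for key, value in rec.items():
            if key in levels:
                for level in levels[key]:
                    row[f"{key}_{level}"] = 1 if value == level else 0
            else:
                row[key] = value  # ordinal Likert answers stay numeric
        encoded.append(row)
    return encoded


# Hypothetical records: one Likert item (coded 0-3) and one categorical feature.
patients = [
    {"adsl06": 2, "marital_status": "married"},
    {"adsl06": 0, "marital_status": "single"},
    {"adsl06": 3, "marital_status": "divorced"},
]
encoded = one_hot_encode(patients, ["marital_status"])
print(encoded[0])
```

Treating ordinal answers as numbers keeps one column per item, whereas one-hot encoding adds one column per category level; this is one reason the feature count can exceed the number of questionnaire items.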

Dataset

We used data from a cohort of a total of 4,117 tinnitus patients who had been treated at Tinnitus Center, Charité Universitaetsmedizin Berlin, Germany, between January 2011 and October 2015. All included patients had been suffering from tinnitus for 3 months or longer, were 18 years of age or older and had sufficient knowledge of the German language. Treatment comprised an intensive, multimodal and tinnitus-specific 7-day programme that included informational counselling, detailed ENT as well as psychosomatic and psychological diagnostics, cognitive-behaviour therapy interventions, relaxation exercises, and physiotherapy. Ethical approval was granted by the Charité Universitaetsmedizin Ethics Committee (reference number EA1/115/15) and informed written consent was received from all patients. All methods were performed in accordance with the relevant guidelines and regulations. Prior to the analyses, all data had been anonymised. Patients who did not complete all 7 questionnaires both before and after outpatient therapy were excluded from data analysis. From the remaining 1,502 patients, 12 patients with any missing values were excluded leaving 1,490 datasets included in the analysis. Tinnitus distress was measured by the TQ total score²² with a distress-cutoff value of 46²² distinguishing between “compensated” (0–46) and “decompensated” (47–84) tinnitus. Table 1 depicts baseline characteristics of all 1,490 included patients before treatment with respect to their tinnitus distress status. The distribution of the defined outcome, the discrete additive depression score (ADSL_adsl_sum) for the patients prior to and after treatment, is shown in Fig. 2. The mean score upon commencing the therapy was 18.2 $\pm$ 11.7 which was significantly larger ( $p < 0.001$ ) than the mean score at the end of the therapy (13.2 $\pm$ 10.7), indicating a positive effect of the multimodal treatment. 
The target variable “depression status” was created by dichotomising the depression score using a cutoff of 16¹⁹ distinguishing between “subclinical” (0–15) and “clinical” (16–60) depression. The rate of clinical depression in 755 female patients was 58.6% and significantly larger than the rate of clinical depression in 735 male patients of 45.7% ( $p <$ 0.001, Chi-square test). The mean patient age was 49.8 years (SD 12.2 years).
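The two cutoffs used in this section can be written down as a short sketch. The function and constant names are illustrative assumptions; the thresholds and score ranges are those stated in the text (ADSL: 0–15 subclinical, 16–60 clinical; TQ: 0–46 compensated, 47–84 decompensated).

```python
ADSL_CUTOFF = 16  # ADSL sum scores of 16 or more are labelled "clinical"
TQ_CUTOFF = 46    # TQ total scores above 46 are labelled "decompensated"


def depression_status(adsl_sum):
    """Dichotomise the ADSL sum score (valid range 0-60)."""
    if not 0 <= adsl_sum <= 60:
        raise ValueError("ADSL sum score must lie between 0 and 60")
    return "clinical" if adsl_sum >= ADSL_CUTOFF else "subclinical"


def tinnitus_status(tq_total):
    """Dichotomise the TQ total score (valid range 0-84)."""
    if not 0 <= tq_total <= 84:
        raise ValueError("TQ total score must lie between 0 and 84")
    return "decompensated" if tq_total > TQ_CUTOFF else "compensated"


print(depression_status(15), depression_status(16))
print(tinnitus_status(46), tinnitus_status(47))
```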

Table 1.

Baseline characteristics of patients before treatment commencement (T0).

		Tinnitus status
	Total	compensated	decompensated	$p$ -value
Number of subjects, n (%)	1490 (100)	1005 (67)	485 (33)
Age in years	49.8 $\pm$ 12.2	49.3 $\pm$ 12.4	50.8 $\pm$ 11.6	0.023 (TT)
Male sex, n (%)	735 (49)	514 (51)	221 (46)	0.050 (Chi)
Tinnitus duration in years, mode (%)	5 (33)	5 (32)	5 (35)	0.008 (MW)
Number of days until start of an intensive treatment	9.5 $\pm$ 27.0	8.9 $\pm$ 25.1	10.8 $\pm$ 30.5	<0.001 (MW)
TQ total score	38.6 $\pm$ 17.2	29.0 $\pm$ 10.9	58.6 $\pm$ 8.4	<0.001 (TT)
PSQ total score	0.5 $\pm$ 0.2	0.4 $\pm$ 0.2	0.6 $\pm$ 0.2	<0.001 (TT)
SF8 general health score	41.6 $\pm$ 7.1	43.5 $\pm$ 6.4	37.6 $\pm$ 6.6	<0.001 (MW)
ADSL depression score	18.2 $\pm$ 11.7	13.7 $\pm$ 9.2	27.3 $\pm$ 10.9	<0.001 (MW)
Clinical depression, n (%)	777 (52)	362 (36)	415 (86)	<0.001 (Chi)

Baseline characteristics for the patients with compensated tinnitus and patients with decompensated tinnitus, respectively. Continuous variables are expressed as mean $\pm$ standard deviation. Categorical variables are expressed as absolute frequency (percentage). $p$ -values were calculated by unpaired two-tailed t-test (TT), Chi-square test (Chi) or two-tailed unpaired Mann-Whitney test (MW). TQ: German version of the Tinnitus Questionnaire²²; PSQ: Perceived Stress Questionnaire²⁰; SF8: Short Form 8 Health Survey²¹; ADSL: General Depression Scale Questionnaire - long form¹⁹.

Figure 2. Relationship between depression score after therapy and other features. Graphical representation of the relationship between the ADSL depression score at the end of therapy (y-axis) with other features (x-axis). Higher values on y-axis represent higher depression severity. Background color represents subclinical (blue) or clinical (red) depression status at the end of therapy. Slight jittering was applied to the points to mitigate overplotting. Marginal histograms depict univariate feature distributions.

Classification model development

We employed eleven machine learning algorithms for classifier training: LASSO²⁵ (lasso), RIDGE²⁶ (ridge), weighted k-nearest neighbour classifier²⁷ (wknn), Naïve Bayes classifier (nb), support vector machine²⁸ (svm), a feed-forward neural network with one single hidden layer²⁹ (nnet), generalised partial least squares³⁰ (gpls), CART decision tree³¹ (cart), C5.0 decision tree³² (c5.0), random forest³³ (rf) and gradient boosted trees³⁴ (gbt). 10-fold stratified cross-validation was used for classifier evaluation. In $k$-fold cross-validation, the data is split into $k$ partitions. Each partition serves once as test set for the model which is trained on the remainder of the partitions. Finally, the $k$ performance results are averaged. A grid search was employed for hyperparameter tuning using area under the ROC curve (AUC) as evaluation measure. A detailed description of all tuned parameter values can be obtained from Supplementary-B.
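The evaluation scheme described above (stratified $k$-fold cross-validation with AUC as the criterion for a grid search) can be sketched as follows. This is a simplified illustration using only the Python standard library, not the software used in the study; all function names, the toy data and the one-parameter toy "learner" are assumptions made for the sketch.

```python
import random
from collections import defaultdict
from statistics import mean


def stratified_folds(labels, k, seed=0):
    """Assign each sample index to one of k folds, keeping class ratios similar."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        for pos, idx in enumerate(indices):
            folds[pos % k].append(idx)  # deal indices round-robin across folds
    return folds


def auc(y_true, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def cross_validated_auc(X, y, fit, k=10, seed=0):
    """Train on k-1 folds, score the held-out fold, and average the k AUC values."""
    fold_aucs = []
    for test_idx in stratified_folds(y, k, seed):
        held_out = set(test_idx)
        train_idx = [i for i in range(len(y)) if i not in held_out]
        predict = fit([X[i] for i in train_idx], [y[i] for i in train_idx])
        scores = [predict(X[i]) for i in test_idx]
        # Assumes every fold contains both classes; otherwise the AUC is undefined.
        fold_aucs.append(auc([y[i] for i in test_idx], scores))
    return mean(fold_aucs)


def grid_search(X, y, make_fit, grid, k=10):
    """Return (best_parameter, best_auc) according to mean cross-validated AUC."""
    results = {param: cross_validated_auc(X, y, make_fit(param), k) for param in grid}
    best_param = max(results, key=results.get)
    return best_param, results[best_param]


# Hypothetical toy data: a single feature that separates the two classes perfectly.
X = [[float(i)] for i in range(40)]
y = [0] * 20 + [1] * 20


def make_fit(sign):
    """Toy 'learner' whose only hyperparameter is the sign applied to feature 0."""
    def fit(X_train, y_train):
        return lambda row: sign * row[0]
    return fit


best, best_auc = grid_search(X, y, make_fit, grid=[-1, 1])
print(best, best_auc)
```

In practice the `fit` callable would wrap a real classifier and the grid would span its actual hyperparameters; the loop structure stays the same.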

Feature selection

We created a novel incremental feature selection wrapper. In particular, we adapted the feature importance score for random forests³³ and its generalisation to any model type³⁵ which is referred to as “model reliance”. The model reliance estimates the difference in the model error after a feature’s values are randomly permuted in the dataset. An estimate of the model reliance for a feature $f \in F$ with respect to a model $ζ$, a target vector $y$, a dataset $X$ and a loss function $L(y, ζ(X))$ is calculated as follows. First, the model error on the original training data $X_{\mathrm{orig}} = X$ is calculated: $e_{\mathrm{orig}} = L(y, ζ(X_{\mathrm{orig}}))$. Secondly, the values of $f$ are randomly permuted and the model error on the perturbed dataset $X_{\mathrm{perm}}$ is calculated: $e_{\mathrm{perm}} = L(y, ζ(X_{\mathrm{perm}}))$. Finally, the model reliance $MR(f, ζ)$ is calculated as ratio of model error with the permuted feature and model error with the original data: $MR(f, ζ) = \frac{e_{\mathrm{perm}}}{e_{\mathrm{orig}}}$. A $MR$ value greater than 1 suggests that $f$ is important, since randomly permuting its values apparently breaks its relationship with the predicted target. Since feature perturbation involves a degree of uncertainty, $MR$ estimates can be improved by repeating the whole procedure $k$ times and averaging the $k$ $MR$ scores. In this study, $MR$ was calculated as average over 10 runs.
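The model reliance computation described above can be sketched as follows. This is an illustrative implementation under stated assumptions: the misclassification rate stands in for the loss function $L$ (the text does not fix a particular loss here), and the toy model, the toy data and all names are hypothetical.

```python
import random
from statistics import mean


def misclassification_rate(y_true, y_pred):
    """Fraction of wrong predictions; used here as an example loss L."""
    return sum(t != p for t, p in zip(y_true, y_pred)) / len(y_true)


def model_reliance(predict, X, y, feature, loss, runs=10, seed=0):
    """Average the ratio e_perm / e_orig over several random permutations of one column."""
    rng = random.Random(seed)
    e_orig = loss(y, [predict(row) for row in X])
    if e_orig == 0:
        raise ValueError("e_orig is zero, so the ratio e_perm / e_orig is undefined")
    ratios = []
    for _ in range(runs):
        column = [row[feature] for row in X]
        rng.shuffle(column)  # break the link between this feature and the target
        X_perm = [
            row[:feature] + [value] + row[feature + 1:]
            for row, value in zip(X, column)
        ]
        e_perm = loss(y, [predict(row) for row in X_perm])
        ratios.append(e_perm / e_orig)
    return mean(ratios)


# Hypothetical toy model: thresholds feature 0 and ignores feature 1.
def toy_predict(row):
    return 1 if row[0] > 0.5 else 0


X = [[i / 20, ((i * 7) % 20) / 20] for i in range(20)]
y = [toy_predict(row) for row in X]
y[0], y[19] = 1, 0  # two label flips so that e_orig is nonzero (0.1)

mr_used = model_reliance(toy_predict, X, y, feature=0, loss=misclassification_rate)
mr_unused = model_reliance(toy_predict, X, y, feature=1, loss=misclassification_rate)
print(mr_used, mr_unused)
```

Permuting a feature the model ignores leaves every prediction unchanged, so its ratio is exactly 1; permuting the feature the model depends on raises the error and pushes the ratio above 1.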

In iteration $i = 1$, our incremental feature selection wrapper begins by training an initial model $m_1$ on the full feature set $F_1 = F$. For each feature, the model reliance $MR(f, m_i)$ is calculated. Features with $MR(f, m_i) > 1$ are retained for iteration $i + 1$ while the remaining features are dropped. This procedure continues until either none of the $MR$ values exceed 1, i.e., $\forall f \in F_i : MR(f, m_i) \leq 1$, or the feature set in iteration $i$ is identical to the feature set in iteration $i - 1$, i.e., $F_i = F_{i-1}$.
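The iteration scheme above can be outlined in code. The sketch below is an assumption-laden illustration rather than the study's implementation: `train` and `reliance` are hypothetical callables standing in for classifier training and the $MR$ computation, and the toy lookup table of $MR$ values is invented purely to make the example deterministic.

```python
def incremental_feature_selection(features, train, reliance, threshold=1.0):
    """Repeatedly train on F_i and keep only features with MR > threshold.

    Returns the list of feature sets F_1, F_2, ... on which a model was trained.
    """
    history = []
    current = list(features)
    while current:
        model = train(current)
        history.append(list(current))
        kept = [f for f in current if reliance(model, f) > threshold]
        # Stop if no MR value exceeds the threshold, or if nothing was dropped
        # (the next feature set would be identical to the current one).
        if not kept or kept == current:
            break
        current = kept
    return history


# Hypothetical MR values per trained model, keyed by the feature set used for training.
TOY_RELIANCE = {
    ("a", "b", "c", "d"): {"a": 1.3, "b": 1.1, "c": 1.0, "d": 0.9},
    ("a", "b"): {"a": 1.2, "b": 0.98},
    ("a",): {"a": 1.4},
}


def toy_train(feature_set):
    """Stand-in for classifier training: the 'model' is just its MR lookup table."""
    return TOY_RELIANCE[tuple(feature_set)]


def toy_reliance(model, feature):
    """Stand-in for the permutation-based MR estimate."""
    return model[feature]


history = incremental_feature_selection(["a", "b", "c", "d"], toy_train, toy_reliance)
print(history)
```

Here the second stopping criterion is detected as soon as no feature is dropped, which avoids retraining on an identical feature set; exposing `threshold` as a parameter also makes it easy to experiment with cutoffs other than 1.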

Results

Distribution of responses

More than half (52.2%) of the 1,490 subjects suffered from clinical depression at the start of treatment (T0) (Fig. 2). The average decrease in ADSL score between T0 and T1 was 5.0 points (SD 8.2). Roughly one fifth of the patients (22.7%) showed symptoms of clinical depression at T0, but not at T1. Nearly half the subjects (44.4%) reported subclinical depression at both time points, whereas only a minor fraction of patients (3.4%) showed an increase in depression severity from subclinical at T0 to clinical at T1. We found a strong correlation between the ADSL sum score at both time points (Spearman $ρ = 0.71$). While we found no correlation between the ADSL score at T1 and patient age ($ρ = -0.01$), we identified a moderate correlation between the former and the initial values of TQ total score ($ρ = 0.53$), PSQ stress score ($ρ = 0.53$) and SF8 general health score ($ρ = -0.48$).

Predictive performance of classification models

The classification models predicted depression status after therapy based on questionnaire answers and social data acquired prior to therapy with high AUC. Table 2 depicts the performance of all classification methods across iterations. The lasso classifier constructed the best overall model (iteration $i = 1$, AUC: 0.87 $\pm$ 0.04; mean $\pm$ SD), followed by ridge ($i = 1$, AUC: 0.86 $\pm$ 0.04) and gbt ($i = 1$, AUC: 0.86 $\pm$ 0.04). The AUCs of each classifier’s best model were similar, ranging from 0.81 (c5.0) to 0.87 (lasso).

Table 2.

Classification performance.

Classification method
$i$	lasso	ridge	wknn	nb	svm	gpls	nnet	cart	c5.0	rf	gbt
1	0.867 (185)	0.864 (185)	0.853 (185)	0.852 (185)	0.851 (185)	0.838 (185)	0.822 (185)	0.795 (185)	0.795 (185)	0.864 (185)	0.862 (185)
2	0.856 (89)	0.847 (86)	0.845 (98)	0.849 (70)	0.530 (5)	0.836 (80)	0.807 (117)	0.799 (106)	0.803 (103)	0.864 (109)	0.855 (89)
3	0.857 (50)	0.854 (51)	0.845 (65)	0.829 (38)	0.537 (4)	0.836 (47)	0.809 (87)	0.794 (66)	0.803 (62)	0.866 (99)	0.859 (52)
4	0.856 (24)	0.853 (31)	0.837 (40)	0.832 (26)	0.542 (3)	0.838 (24)	0.801 (59)	0.799 (45)	0.790 (39)	0.865 (85)	0.858 (38)
5	0.853 (17)	0.853 (21)	0.842 (28)	0.838 (15)	0.562 (2)	0.838 (16)	0.793 (45)	0.811 (34)	0.806 (24)	0.865 (77)	0.855 (24)
6	0.854 (10)	0.851 (15)	0.847 (16)	0.841 (13)	—	0.837 (9)	0.810 (25)	0.817 (28)	0.803 (23)	0.863 (75)	0.856 (16)
7	0.850 (6)	0.854 (11)	0.833 (9)	—	—	0.838 (6)	0.812 (21)	0.822 (24)	0.804 (16)	0.864 (69)	0.854 (14)
8	—	0.854 (9)	0.829 (7)	—	—	—	0.852 (12)	0.822 (23)	0.802 (13)	0.865 (64)	0.853 (11)
9	—	0.854 (8)	0.830 (6)	—	—	—	0.857 (8)	0.822 (22)	0.802 (12)	0.865 (59)	—
10	—	0.853 (7)	—	—	—	—	0.842 (4)	—	0.809 (10)	0.866 (57)	—
11	—	—	—	—	—	—	—	—	—	0.865 (56)	—
12	—	—	—	—	—	—	—	—	—	0.864 (51)	—
13	—	—	—	—	—	—	—	—	—	0.864 (50)	—
14	—	—	—	—	—	—	—	—	—	0.863 (47)	—

Mean cross-validation AUC for each classifier with best parameter configuration and for each iteration ($i$). The number of features is given in parentheses. The best run per classifier is highlighted in boldface. All methods induce at least one model with AUC of 0.809 or higher. Empty cells indicate that the feature selection wrapper had already been terminated after a previous iteration.

Classification using the best model (lasso, $i = 1$) based on a probability threshold of 0.5 resulted in an accuracy of 0.79, a true positive rate (sensitivity) of 0.61, a true negative rate (specificity) of 0.88, a precision of 0.72 and a negative predictive value of 0.82. The final model retained 40 features with nonzero coefficients. Fig. 3 shows the median model coefficient of these features across 10 cross-validation folds. From the ADSL questionnaire, 16 single items were included in the final model. Thus, this questionnaire contributed most to the model prediction. Notably, 5 items from the tinnitus-tailored TQ questionnaire were also included in the model. Further, the model utilised 6 items from the socio-demographics questionnaire (SOZK): nationality (SOZK_nationality), which appeared to have the highest absolute model coefficient, graduation (SOZK_graduate), tinnitus duration (SOZK_tinnitusdur), employment (SOZK_job), marital status (SOZK_unmarried) and partnership status (SOZK_partnership). Table 3 provides a description for each of the 25 features with the largest absolute model coefficient for the lasso model ($i = 1$). The complete list of features included in the final model can be consulted in Supplementary-C.
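For readers less familiar with the reported measures, the following sketch shows how they are derived from predicted probabilities at a fixed threshold. The labels and probabilities are hypothetical and do not reproduce the study's results; treating a probability equal to the threshold as a positive prediction is an assumption of this sketch.

```python
def confusion_counts(y_true, probabilities, threshold=0.5):
    """Count TP, FP, TN and FN after thresholding predicted probabilities."""
    tp = fp = tn = fn = 0
    for truth, prob in zip(y_true, probabilities):
        predicted = 1 if prob >= threshold else 0  # assumption: ties count as positive
        if predicted == 1 and truth == 1:
            tp += 1
        elif predicted == 1 and truth == 0:
            fp += 1
        elif predicted == 0 and truth == 0:
            tn += 1
        else:
            fn += 1
    return tp, fp, tn, fn


def classification_metrics(tp, fp, tn, fn):
    """Derive the measures reported in the text from the four confusion-matrix counts."""
    return {
        "accuracy": (tp + tn) / (tp + fp + tn + fn),
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
        "precision": tp / (tp + fp),    # positive predictive value
        "npv": tn / (tn + fn),          # negative predictive value
    }


# Hypothetical example: 1 = clinical depression at T1, 0 = subclinical.
y_true = [1, 1, 1, 0, 0, 0, 0, 1, 0]
probabilities = [0.9, 0.7, 0.4, 0.2, 0.6, 0.1, 0.3, 0.8, 0.05]

counts = confusion_counts(y_true, probabilities)
metrics = classification_metrics(*counts)
print(counts, metrics)
```

Lowering the threshold below 0.5 would trade specificity for sensitivity, which may matter if the model is used for screening rather than for confirming a diagnosis.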

Figure 3. Coefficients and relative inclusion of features in cross-validation of lasso model. Median ($\pm$ median absolute deviation) coefficients (top) and absolute frequency of inclusion of features (bottom) over 10 cross-validation iterations for the best lasso model. From 185 features, the depicted 40 features exhibit a nonzero model coefficient. The average frequency of feature inclusion is represented as horizontal line in the bottom subplot. Line ranges depict MAD (right). TQ: German version of the Tinnitus Questionnaire²²; PSQ: Perceived Stress Questionnaire²⁰; SF8: Short Form 8 Health Survey²¹; ADSL: General Depression Scale Questionnaire - long form¹⁹; SOZK: sociodemographics questionnaire²⁴.

Table 3.

Top-25 features of lasso model.

Feature	Description	Coefficient
SOZK_nationality	German nationality	−0.370
ADSL_adsl06	“During the past week I felt depressed”.	0.309
ADSL_adsl19	“During the past week I felt that people disliked me”.	0.288
PSQ_stress21	“You enjoy yourself”.	−0.284
SOZK_graduate	Graduation: university	−0.210
ADSL_adsl11	“During the past week my sleep was restless”.	0.196
ADSL_adsl03	“During the past week I felt that I could not shake off the blues even with help from my family or friends”.	0.175
TQ_tin50	Because of the noises I am unable to enjoy the radio or television.	0.151
TQ_tin47	I am a victim of my noises.	0.137
ADSL_adsl02	“During the past week I did not feel like eating; my appetite was poor”.	0.132
ADSL_adsl05	“During the past week I had trouble keeping my mind on what I was doing”.	0.132
SF8_sf07	“During the past 4 weeks, how much have you been bothered by emotional problems (such as feeling anxious, depressed or irritable)?”	0.125
ADSL_adsl10	“During the past week I felt fearful”.	0.107
ADSL_adsl04	“During the past week I felt I was just as good as other people”.	−0.107
TQ_tin40	I am able to forget about the noises when I am doing something interesting.	−0.104
ADSL_adsl16	“During the past week I enjoyed life”.	−0.085
PSQ_stress15	“Your problems seem to be piling up”.	0.081
TQ_tin07	Most of the time the noises are fairly quiet.	−0.069
ADSL_adsl08	“During the past week I felt hopeful about the future”.	−0.064
SF8_sf02	“During the past 4 weeks, how much did physical health problems limit your physical activities (such as walking or climbing stairs)?”	0.059
SOZK_tinnitusdur	“How long have you been suffering from tinnitus (in years)?”	0.058
PSQ_stress28	“You feel loaded down with responsibility”.	0.055
ADSL_adsl18	“During the past week I felt sad”.	0.053
SOZK_job	Job status: currently employed	−0.050
ADSL_adsl13	“During the past week I talked less than usual”.	0.049

Features with highest absolute coefficient in lasso model (iteration $i = 1$ ). TQ: German version of the Tinnitus Questionnaire²²; PSQ: Perceived Stress Questionnaire²⁰; SF8: Short Form 8 Health Survey²¹; ADSL: General Depression Scale Questionnaire - long form¹⁹; SOZK: sociodemographics questionnaire²⁴.

Stability of classifiers on smaller feature sets

With the exception of svm, all classifiers showed high stability when trained on smaller feature subsets. For example, the difference between lasso on 185 features ( $i = 1$ ) and the same on 6 features ( $i = 7$ ) was only 0.017 (2% drop). Several classifiers even benefitted from feature selection with respect to predictive performance. For five classifiers (gpls, nnet, cart, c5.0 and rf), the AUC of the model at second or later iteration was larger than the AUC of the first iteration model that used all 185 features. The two decision tree variants cart and c5.0 profited the most from feature selection, since their best performance was reached on the smallest feature subset with a cardinality of 22 and 10, respectively.

Complexity-interpretability tradeoff

Our incremental feature selection wrapper reduces the number of features from 185 to 6 without substantial quality loss. The lasso model of iteration $i = 7$ provides a reasonable trade-off between a clinically useful predictive quality (AUC: 0.85 $\pm$ 0.05) and a low model complexity (6 features) in comparison with the best overall lasso model (AUC: 0.87 $\pm$ 0.04). Figure 4 depicts a graphical representation of the distribution of these 6 features with respect to depression_status. Patients with clinical depression report a significantly higher mean tinnitus distress score TQ_distress (49.8 $\pm$ 15.4) than patients with subclinical depression (33.15 $\pm$ 15.2) (t-test, $α = 0.05$). Analogously, the mean of the stress sum score PSQ_psq_sum (clinical dep.: 0.58 $\pm$ 0.16 vs. subclinical dep.: 0.40 $\pm$ 0.17) and the demand score PSQ_demand (clinical dep.: 0.56 $\pm$ 0.16 vs. subclinical dep.: 0.46 $\pm$ 0.17) were significantly higher for patients with clinical depression. Additionally, three single items were included in the model, all of which showed significant differences with respect to depression_status (Chi-square test, $α = 0.05$). For the seventh and tenth question of the ADSL questionnaire (ADSL_adsl07: “During the past week I felt that everything I did was an effort”; ADSL_adsl10: “During the past week I felt fearful”), the proportion of patients with clinical depression ticking the answers “occasionally” and “most” was higher than for “rarely” and “some”. Accordingly, patients with clinical depression tended to answer the fifth question of the SF8 questionnaire (SF8_sf05: “During the past 4 weeks, how much energy did you have?”) with “a little” or “none” rather than “very much”, “quite a lot” or “some”.

Figure 4. Predictive features. Distribution of features included in the lasso model of iteration $i = 7$ for the patients with subclinical and clinical depression. Green squares and labels represent mean of continuous features. ADSL: General Depression Scale Questionnaire - long form¹⁹; PSQ: Perceived Stress Questionnaire²⁰; SF8: Short Form 8 Health Survey²¹; TQ: German version of the Tinnitus Questionnaire²².

Discussion

Machine learning has been used to create prediction models for depression severity based on structured patient interviews^36,37. Despite their high predictive performance, we assume that our current models provide a good fit for our sample only, with other subpopulations being yet to be investigated. However, our models are promising and may serve as starting point for timely prediction of depression severity and treatment course with only a small number of questionnaire items.

In agreement with previous studies, the strong association between TQ_distress and depression status indicates a close link between tinnitus-related distress and depressive symptomatology as measured by ADSL³⁸. In addition, large model coefficients for PSQ overall score and demand score suggest subjective stress as a major contributing factor to depression in tinnitus patients¹². From a clinical point of view, the inclusion of features from different questionnaires indicates the importance of combining items from several questionnaire types in order to accurately predict depression status. Hence, emotional epiphenomena and other sequelae must be addressed to optimally meet patients’ needs.

A previous study³⁹ reported high sensitivity in depression recognition using a questionnaire with only two questions. One of the two questions was “During the past month, have you often been bothered by feeling down, depressed, or hopeless.”³⁹ which closely resembles the item ADSL_adsl06 (“During the past week I felt depressed.”) that exhibited the second-largest absolute coefficient in the best lasso model ( $i = 1$ ) in our study.

In general, caution has to be taken when interpreting model coefficients. For example, the lasso model ($i = 1$) identified a positive relationship between non-German citizenship and depression severity (coefficient of German nationality: $-0.370$; Table 3, Fig. 3). Although ethnic differences in depression were reported in some studies^40,41, this result rather suggests a higher perceived social stress of predominantly Turkish-born foreign patients, due to higher unemployment rate, larger families, inferior housing, etc. in this demographic group. Further, these results may also be an effect of overfitting, since only 5.0% of the cohort population were non-German citizens. Moreover, the feature had a model reliance score below 1.0 and consequently was dropped for iteration 2. Although the age feature is included in 8 of the 11 feature sets associated with the best model per classifier, its lack of correlation with the response leaves the effect of age on the predicted depression status unclear.

With respect to the stability of models on a small number of features, it is encouraging that much simpler models are only slightly inferior to the most predictive model. In fact, 5 out of 11 classification algorithms even benefitted from feature selection, i.e., the AUC at the second or a later iteration was larger than at the first iteration that uses all features. These include the two decision tree variants, each of which reached its highest performance on its smallest feature subset. It is promising that a model (lasso, $i = 7$) that used only 6 features from 4 questionnaires was only slightly inferior (AUC = 0.850) to the best overall model (AUC = 0.867). For example, neither features on tinnitus localisation and quality, nor sociodemographic features were included in this model. This result could be used to reduce the number of questions or whole questionnaires that the patients have to answer before and after treatment.

The presented study aims at being a first step in providing physicians with guidance for therapy decisions concerning clinical depression in patients with chronic tinnitus. The models could be used to devise a suitable treatment pathway. When applying the models in practice, it is important to note that they were learned on cross-sectional data, i.e., the model separates between subclinical and clinical depression based on questionnaire answers and socio-demographics before administration of a treatment. Also, the term “clinical depression” refers to how it was modelled in this study, i.e., the depression status after treatment. One also has to take into account that the median time difference between start and end of the treatment programme was 7 days.

The dataset used for model development might be subject to a selection bias since patients who did not complete all seven questionnaires both during admission and after treatment were excluded in the present data analyses. We do not see these data as “missing values” because this might lead to the problematic suggestion of using imputation methods. We cannot use imputation, because (i) a proportion of patients did not complete whole questionnaires (rather than just single items), and (ii) we do not know if data are missing at random. However, given that the number of patients is large, we consider our results as sufficiently robust. In future work, we will investigate potential systematic differences between included and excluded patients. Further, the patient population was obtained from only one German hospital. Hence, the model needs to be externally validated on data from different populations and hospitals.

As another limitation, the incremental feature selection mechanism may miss global optima due to its greedy procedure. At each iteration, only features that are identified as contributing to the predictive performance of the classifier are retained and the remaining features are dropped. Once a feature has been eliminated from the feature set, it is not considered at any later iteration. It is possible that the inclusion of a removed feature for classifier training at a later iteration leads to a better model. One possible solution to this problem would be to implement a mechanism which allows for backtracking or revisiting previous iterations. Furthermore, the $MR$ cutoff value for discarding features could serve as an additional tuning parameter. By testing alternative feature sets at a single iteration in this way, a model with higher predictive performance could be generated.

Motivated by this limitation, future work includes a comparison with other feature selection algorithms. Generally, feature selection algorithms can be roughly divided into embedded methods, filter methods and wrapper methods. Embedded (or intrinsic) methods are classification methods that internally handle feature selection during model training, e.g., tree- and rule-based classifiers and regularised methods like LASSO. Filter methods are classifier-independent and quantify the relevance of a feature before model training by a scoring function. Popular filter approaches are Relief-based methods^42,43, correlation-based feature selection⁴⁴ and simple statistical scores, e.g., p-value of $t$-test, chi-squared test or Wilcoxon signed-rank test. (Search-based) wrapper methods define a “space” of candidate feature sets. Each candidate feature set is evaluated by a search algorithm which is wrapped around the classifier. To prevent exhaustive search, the search algorithm usually utilises a heuristic to guide the search from the previous best feature set to the next best candidate set. Well-known wrapper methods include simple forward/backward selection, recursive feature elimination⁴⁵, simulated annealing^46,47 and genetic algorithms⁴⁸. The novel feature selection mechanism that is used in this study can be categorised as a wrapper method.

Another limitation of this study is the lack of an independent cohort. In future work, the model needs to be externally validated, i.e., tested on data from different centres. Since the use of cross-sectional data currently limits interpretation of the depression status prediction beyond the end of therapy, the model also needs to be validated with longitudinal data in the future.

Supplementary information

Supplementary Information (PDF, 614.3 KB)

Author contributions

U.N. and M.S. analysed the data. U.N. wrote the manuscript. P.B., B.B., B.M. and M.S. reviewed the manuscript. P.B., B.B. and B.M. provided medical expertise, guidance and the dataset.

Data availability

Per the Charité Universitaetsmedizin Berlin ethics committee, we cannot make the data public because we do not have the consent of patients to publish their data. Interested researchers can contact the directorate of the Tinnitus Center of Charité Universitaetsmedizin Berlin with data access requests at birgit.mazurek@charite.de.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Change history

6/15/2020

Due to a typesetting error, in the original version of this Article the character é was replaced by the character ï¿½l. This has now been fixed in the Article.

Supplementary information is available for this paper at 10.1038/s41598-020-61593-z.

References

1. Eggermont JJ, Roberts LE. The neuroscience of tinnitus. Trends in Neurosciences. 2004;27:676–682. doi: 10.1016/j.tins.2004.08.010.
2. Baguley D, McFerran D, Hall D. Tinnitus. The Lancet. 2013;382:1600–1607. doi: 10.1016/S0140-6736(13)60142-7.
3. Bauer CA, Berry J, Brozoski TJ. Clinical trials supported by the Tinnitus Research Consortium: Lessons learned, the Southern Illinois University experience. Hearing Research. 2016;334:65–71. doi: 10.1016/j.heares.2015.05.001.
4. McCormack A, et al. Investigating the association between tinnitus severity and symptoms of depression and anxiety, while controlling for neuroticism, in a large middle-aged UK population. International Journal of Audiology. 2015;54:599–604. doi: 10.3109/14992027.2015.1014577.
5. Martines F, Bentivegna D, Martines E, Sciacca V, Martinciglio G. Assessing audiological, pathophysiological and psychological variables in tinnitus patients with or without hearing loss. European Archives of Oto-Rhino-Laryngology. 2010;267:1685–1693. doi: 10.1007/s00405-010-1302-3.
6. Zöger S, Svedlund J, Holgers K-M. Psychiatric disorders in tinnitus patients without severe hearing impairment: 24 month follow-up of patients at an audiological clinic: Alteraciones psiquiátricas en pacientes con tinnitus sin hipoacusia severa: Seguimiento durante 24 meses en una clínica audiólogica. Audiology. 2001;40:133–140. doi: 10.3109/00206090109073108.
7. Andersson G. Psychological aspects of tinnitus and the application of cognitive-behavioral therapy. Clinical Psychology Review. 2002;22:977–990. doi: 10.1016/S0272-7358(01)00124-6.
8. Hiller W, Janca A, Burke KC. Association between tinnitus and somatoform disorders. Journal of Psychosomatic Research. 1997;43:613–624. doi: 10.1016/S0022-3999(97)00188-8.
9. Wallhäusser-Franke E, Schredl M, Delb W. Tinnitus and insomnia: is hyperarousal the common denominator? Sleep Medicine Reviews. 2013;17:65–74. doi: 10.1016/j.smrv.2012.04.003.
10. Zirke N, et al. Analysis of mental disorders in tinnitus patients performed with composite international diagnostic interview. Quality of Life Research. 2013;22:2095–2104. doi: 10.1007/s11136-012-0338-9.
11. Zöger S, Svedlund J, Holgers K-M. Relationship between tinnitus severity and psychiatric disorders. Psychosomatics. 2006;47:282–288. doi: 10.1176/appi.psy.47.4.282.
12. Trevis KJ, McLachlan NM, Wilson SJ. A systematic review and meta-analysis of psychological functioning in chronic tinnitus. Clinical Psychology Review. 2018;60:62–86. doi: 10.1016/j.cpr.2017.12.006.
13. Stobik C, Weber RK, Münte TF, Walter M, Frommer J. Evidence of psychosomatic influences in compensated and decompensated tinnitus. International Journal of Audiology. 2005;44:370–378. doi: 10.1080/14992020500147557.
14. Bhatt JM, Bhattacharyya N, Lin HW. Relationships between tinnitus and the prevalence of anxiety and depression. The Laryngoscope. 2017;127:466–469. doi: 10.1002/lary.26107.
15. Hu, J. et al. The correlation of the tinnitus handicap inventory with depression and anxiety in veterans with tinnitus. International Journal of Otolaryngology 2015 (2015).
16. Jin H, Wu S, Di PC. Development of a clinical forecasting model to predict comorbid depression among diabetes patients and an application in depression screening policy making. Preventing Chronic Disease. 2015;12:E142–E142. doi: 10.5888/pcd12.150047.
17. Wang J, et al. A prediction algorithm for first onset of major depression in the general population: development and validation. J Epidemiol Community Health. 2014;68:418–424. doi: 10.1136/jech-2013-202845.
18. Radloff LS. The CES-D scale: a self-report depression scale for research in the general population. Applied Psychological Measurement. 1977;1:385–401. doi: 10.1177/014662167700100306.
19. Hautzinger, M. & Bailer, M. ADS-Allgemeine Depressionsskala. In Diagnostische Verfahren in der Psychotherapie (Beltz, 2003).
20. Fliege H, et al. The Perceived Stress Questionnaire (PSQ) reconsidered: validation and reference values from different clinical and healthy adult samples. Psychosomatic Medicine. 2005;67:78–88. doi: 10.1097/01.psy.0000151491.80178.78.
21. Bullinger, M. & Morfeld, M. Der SF-36 Health Survey. In Gesundheitsökonomische Evaluationen, 387–402 (Springer, 2008).
22. Goebel, G. & Hiller, W. Tinnitus-Fragebogen (TF): ein Instrument zur Erfassung von Belastung und Schweregrad bei Tinnitus; Handanweisung (Hogrefe, Verlag für Psychologie, 1998).
23. Goebel G, Hiller W. Psychische Beschwerden bei chronischem Tinnitus: Erprobung und Evaluation des Tinnitus-Fragebogens (TF). Verhaltenstherapie. 1992;2:13–22. doi: 10.1159/000258202.
24. Brüggemann P, et al. Impact of multiple factors on the degree of tinnitus distress. Frontiers in Human Neuroscience. 2016;10:341. doi: 10.3389/fnhum.2016.00341.
25. Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological). 1996;58:267–288.
26. Hoerl AE, Kennard RW. Ridge regression: Biased estimation for nonorthogonal problems. Technometrics. 1970;12:55–67. doi: 10.1080/00401706.1970.10488634.
27. Hechenbichler, K. & Schliep, K. Weighted k-nearest-neighbor techniques and ordinal classification. In SFB 386, Ludwig-Maximilians University, Munich, vol. 399 of sfb386 (2004).
28. Boser, B. E., Guyon, I. M. & Vapnik, V. N. A training algorithm for optimal margin classifiers. In Proc. of Workshop on Computational Learning Theory, 144–152 (ACM, 1992).
29. Venables, W. N. & Ripley, B. D. Modern Applied Statistics with S (Springer, 2002), fourth edn.
30. Ding B, Gentleman R. Classification using generalized partial least squares. Journal of Computational and Graphical Statistics. 2005;14:280–298. doi: 10.1198/106186005X47697.
31. Breiman, L., Friedman, J., Olshen, R. & Stone, C. Classification and Regression Trees (Wadsworth and Brooks, 1984).
32. Quinlan, R. C4.5: Programs for Machine Learning (Morgan Kaufmann Publishers, San Mateo, CA, 1993).
33. Breiman L. Random forests. Machine Learning. 2001;45:5–32. doi: 10.1023/A:1010933404324.
34. Friedman, J. H. Greedy function approximation: a gradient boosting machine. Annals of Statistics 1189–1232 (2001).
35. Fisher, A., Rudin, C. & Dominici, F. All models are wrong but many are useful: Variable importance for black-box, proprietary, or misspecified prediction models, using model class reliance. arXiv preprint arXiv:1801.01489 (2018).
36. van Loo HM, et al. Major depressive disorder subtypes to predict long-term course. Depression and Anxiety. 2014;31:765–777. doi: 10.1002/da.22233.
37. Kessler, R. C. et al. Testing a machine-learning algorithm to predict the persistence and severity of major depressive disorder from baseline self-reports. Molecular Psychiatry 21, 1366–1371 (2016).
38. Langguth B, Landgrebe M, Kleinjung T, Sand GP, Hajak G. Tinnitus and depression. The World Journal of Biological Psychiatry. 2011;12:489–500. doi: 10.3109/15622975.2011.575178.
39. Whooley MA, Avins AL, Miranda J, Browner WS. Case-finding instruments for depression: Two questions are as good as many. Journal of General Internal Medicine. 1997;12:439–445. doi: 10.1046/j.1525-1497.1997.00076.x.
40. Riolo SA, Nguyen TA, Greden JF, King CA. Prevalence of depression by race/ethnicity: findings from the National Health and Nutrition Examination Survey III. American Journal of Public Health. 2005;95:998–1000. doi: 10.2105/AJPH.2004.047225.
41. Weinberger AH, et al. Trends in depression prevalence in the USA from 2005 to 2015: widening disparities in vulnerable groups. Psychological Medicine. 2018;48:1308–1315. doi: 10.1017/S0033291717002781.
42. Kira K, Rendell LA. The feature selection problem: Traditional methods and a new algorithm. In AAAI. 1992;2:129–134.
43. Urbanowicz, R. J., Meeker, M., LaCava, W., Olson, R. S. & Moore, J. H. Relief-based feature selection: introduction and review. Journal of Biomedical Informatics (2018).
44. Hall, M. A. Correlation-based feature selection for discrete and numeric class machine learning. In Proc. of International Conference on Machine Learning (ICML), 359–366 (2000).
45. Guyon I, Elisseeff A. An introduction to variable and feature selection. Journal of Machine Learning Research. 2003;3:1157–1182.
46. Kirkpatrick S, Gelatt CD, Vecchi MP. Optimization by simulated annealing. Science. 1983;220:671–680. doi: 10.1126/science.220.4598.671.
47. Van Laarhoven, P. J. & Aarts, E. H. Simulated annealing. In Simulated annealing: Theory and applications, 7–15 (Springer, 1987).
48. Mitchell, M. An introduction to genetic algorithms (MIT Press, 1998).
