Abstract
While research focus remains mainly on psychotic symptoms, it is questionable whether we are placing enough emphasis on improving the quality of life (QoL) of schizophrenia patients. To date, the predictive power of QoL remained limited. Therefore, this study aimed to accurately predict the QoL within schizophrenia using supervised learning methods. The authors report findings from participants of a large randomized, double-blind clinical trial for schizophrenia treatment. Potential predictors of QoL included all available and non-redundant variables from the dataset. By optimizing parameters, three linear LASSO regressions were calculated (N = 697, 692, and 786), including 44, 47, and 41 variables, with adjusted R-squares ranging from 0.31 to 0.36. Best predictors included social and emotion-related symptoms, neurocognition (processing speed), education, female gender, treatment attitudes, and mental, emotional, and physical health. These results demonstrate that machine learning is an excellent predictive tool to process clinical data. It appears that the patient’s perception of their treatment has an important impact on patients’ QoL and that interventions should consider this aspect.
Trial registration: ClinicalTrials.gov Identifier: NCT00014001.
Subject terms: Schizophrenia, Human behaviour
Introduction
Schizophrenia is a chronic and severe mental disorder that can be invalidating1. This disorder can greatly affect the quality of life (QoL)2,3, which is defined by the World Health Organization as an individuals’ perception of their position in life in the context of the culture and value systems in which they live and in relation to their goals, expectations, standards, and concerns4.
A shift has recently been observed in the objectives of schizophrenia treatment. While the goal was once to reduce symptoms only, this has changed to focus more on recovery through improving QoL and functioning5,6. Although complete recovery is often not possible for these patients, they can still recover in some way. This notably involves optimizing their well-being and functioning, which are key components of QoL. Over the past few years, factors that may promote better QoL have been identified in the literature, with mixed results. Some predictors that recur frequently are types of psychiatric or psychotic symptoms, but which type exactly predicts best QoL remains controversial2,7–10. These can be reduced by using medication; however, even though response and adherence to antipsychotics can improve QoL3, some medication side effects such as weight gain11 and sexual dysfunction12 have been associated with a worsened outcome. Other predictors of higher QoL were also identified, e.g., a better cognition and an older age of disorder onset13–15. On the other hand, stigma-related feelings and comorbid diagnoses predicted a poorer outcome regarding QoL14. In general, it seems that the highly heterogeneous factors presented in the current literature largely depend on the angle from which the authors choose to approach the question. Another issue is that the design is often cross-sectional, which does not allow for longitudinal predictions. Identifying the most important and essential factors could help identify which patients are better able to recover, and ultimately optimize every patient’s recovery.
Several researchers have used multivariate models to predict the quality of life. Mohamed et al.13 created a model excluding variables that may be redundant with QoL (e.g., functioning) using longitudinal data from the Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) schizophrenia study. In doing so, they were able to explain 22% of the variance in total QoL with positive and negative symptoms, neurocognitive, and sociodemographic variables (age, race, ethnicity, gender, and time). In such studies, the explained variance is generally low2,14, possibly because authors did not include some factors that deviate from their research question and that may play a major role in QoL (e.g., physical health, patients’ self-reported satisfaction, and medication adherence). With the emergence of supervised machine learning, it now becomes possible to reach an optimal model including the best predictors among fairly large datasets, and without human a priori in the way variables are combined16,17. This new approach could thereby provide a better understanding of the various factors that influence QoL in individuals with schizophrenia, just as it successfully predicted other outcomes such as relapses18,19.
The aim of the current study was to identify, using machine learning, factors that predict QoL among people with schizophrenia. To do so, we computed important variables from the CATIE study, a large naturalistic clinical trial conducted between 2000 and 2004 in the United States.
Results
Sample characteristics
Due to attrition and missing data, only 919 of the 952 participants with a longitudinal follow-up were included in a model (N = 697, 692, and 786 in models 1–3, respectively). From this number, 670 were males (73%) and the average age was of 41.1 years (SD = 11.0; range: 18–67). One-quarter did not complete high school (25%), a minority was employed full-time at the time of the study (6%), and only a few were married (11%). Most of the sample had no comorbid psychiatric condition (60%). Detailed baseline sample characteristics were presented in Table 1. At the baseline, 6-month and 12-month follow-up visits, the QoL total score was on average 2.8 (SD = 1.1), 2.9 (SD = 1.1, and 3.0 (SD = 1.1), respectively.
Table 1.
Baseline sample characteristics. N = 919.
| Baseline dichotomous characteristics | N/mean | %/SD | Minimum | Maximum |
|---|---|---|---|---|
| Male gender | 670 | 72.9 | – | – |
| Married | 103 | 11.2 | – | – |
| Veterans | 197 | 21.4 | – | – |
| Living with a significant other | 163 | 17.7 | – | – |
| Did not complete high school | 225 | 24.5 | – | – |
| Employed full-time | 58 | 6.3 | – | – |
| Ethnicity | ||||
| White | 576 | 62.7 | – | – |
| Black | 307 | 33.4 | – | – |
| American Indian or Alaska Native | 14 | 1.5 | – | – |
| Asian | 26 | 2.8 | – | – |
| Hispanic Latino or Spanish Origin | 101 | 11.0 | – | – |
| Hawaiian or Pacific Islander | 6 | 0.7 | – | – |
| Comorbid psychiatric diagnoses | ||||
| Obsessive–compulsive disorder | 40 | 4.4 | – | – |
| Other anxiety disorder | 78 | 8.5 | – | – |
| Major depression | 124 | 13.5 | – | – |
| Alcohol dependence | 77 | 8.4 | – | – |
| Alcohol abuse | 78 | 8.5 | – | – |
| Drug dependence | 63 | 6.9 | – | – |
| Drug abuse | 101 | 11.0 | – | – |
| Antisocial personality disorder | 5 | 0.5 | – | – |
| Other personality disorder | 9 | 1.0 | – | – |
| Other comorbid diagnosis | 37 | 4.0 | – | – |
| No comorbid condition | 552 | 60.1 | ||
| Age | 41.1 | 11.0 | 18 | 67 |
| Years of education | 11.6 | 3.4 | 1 | 21 |
| Years since first psychiatric treatment | 16.7 | 11.6 | 0 | 56 |
| Years since first prescribed antipsychotic medication | 14.4 | 11.1 | 0 | 56 |
| Number of previous hospitalizations | ||||
| Lifetime | 2.7 | 1.5 | 0 | 4 |
| Past year | 0.6 | 0.9 | 0 | 4 |
| QoL total score | 2.8 | 1.1 | 0.4 | 5.9 |
Linear regressions using machine learning
Three longitudinal models were calculated to predict QoL (1) 12 months after the baseline, (2) 6 months after the 6-month visit, and (3) 6 months after the baseline.
The first model attempting to predict the 12-month QoL with baseline variables attained an uncentered adjusted R-squared of 0.350 and comprised 45 predictors. All included variables and associated coefficients are presented in Table 2. The mean squared error (MSE) training result was 0.92 and the MSE testing result was 0.97.
Table 2.
Linear regression of QoL at the 12-month visit using baseline variables. N = 697.
| Categories | Baseline variables | Coeff. |
|---|---|---|
| Sociodemographics | • Parents highest education level | 0.3359 |
| • Veteran | −0.1633 | |
| • Male gender | −0.1597 | |
| • Hispanic, Latino, or Spanish origin | 0.1159 | |
| • Race: white | −0.0065 | |
| Psychiatric diagnoses | • Major depression | -0.1057 |
| • No comorbid psychiatric conditions | 0.0287 | |
| • Other diagnoses | −0.0238 | |
| • Alcohol abuse | −0.0201 | |
| Positive and negative symptoms scale (PANSS) | Negative symptoms: | |
| • Emotional withdrawal | −0.6451 | |
| • Passive apathetic social withdrawal | −0.5087 | |
| General symptoms: | ||
| • Poor attention | −0.0739 | |
| Calgary depression rating scale (CDRS) | • Hopelessness | −0.2367 |
| • Self-depreciation | 0.1284 | |
| Neurocognitive battery | • Processing speed score | 0.6454 |
| • Working memory score | 0.1433 | |
| • Verbal score | 0.0068 | |
| Clinical Global Impressions Scale (CGIS) | • Patient-reported mental/emotional health | 0.4272 |
| • Number of days smoking cigarettes in the past week | −0.2416 | |
| • Clinician global impression of severity | −0.2062 | |
| • Productive activities are [x] time more important than least important CGIS item | 0.2010 | |
| • Medication side effects | 0.1585 | |
| • Energy and interests | 0.1541 | |
| • Disturbing and unusual experiences | −0.1336 | |
| • CGIS Response | 0.1305 | |
| • Alcohol consumption | 0.1031 | |
| • Energy and interests are [x] time more important than least important CGIS item | 0.0996 | |
| • Medication side effects are [x] time more important than least important CGIS item | 0.0386 | |
| • Satisfaction of contact with mental health professionals | 0.0321 | |
| Insight and Treatment Attitudes Questionnaire (ITAQ) | • Do you have mental problems? | 0.2525 |
| • Will you take the medication? | 0.1867 | |
| • Have you had mental problems that were different from most other people’s? | 0.0613 | |
| Drug Attitude Inventory (DAI) | • Staying on meds prevent me from getting sick | 0.1515 |
| • I feel weird like a zombie on meds | −0.0333 | |
| • Meds make me feel tired and sluggish | 0.0176 | |
| Physician’s assessment of the severity of the adverse event | • Sleepiness | 0.1005 |
| Patient’s assessment of the severity of the adverse event | • Sleepiness | 0.1689 |
| • Sexual arousal | 0.0606 | |
| • Weight gain | 0.0163 | |
| Antipsychotic medication | • No antipsychotic medication | −0.0679 |
| • Risperidone | 0.0270 | |
| • Other antipsychotics | −0.0140 | |
| Laboratory values | • Mean corpuscular hemoglobin | 0.0936 |
| Other variables | • Medical history status | 0.0478 |
| • Day screened (vs baseline) | −0.0022 | |
As for the second model predicting the 12-month QoL using variables from the 6-month visit, the optimal regression (Table 3) comprised 47 predictors, and the uncentered adjusted R-squared was 0.365. The MSE training result was 0.86 and the MSE testing result was 0.98.
Table 3.
Linear regression of QoL at the 12-month visit using variables from the 6-month visit. N = 692.
| Categories | 6-months variables | Coeff. |
|---|---|---|
| Sociodemographics | • Parents highest education level | 0.2722 |
| • Veteran | −0.2634 | |
| • Patient’s highest education level | 0.2596 | |
| • Male sex | −0.0704 | |
| • Race: black of African American | 0.0076 | |
| Positive and negative symptoms scale (PANSS) | Positive symptoms: | |
| • Grandiosity | 0.1114 | |
| • Hallucinatory behavior | −0.0431 | |
| Negative symptoms: | ||
| • Passive apathetic social withdrawal | −0.7232 | |
| • Emotional withdrawal | −0.5901 | |
| • Poor rapport | −0.3056 | |
| General symptoms: | ||
| • Active social avoidance | −0.4386 | |
| • Guilt feelings | 0.1710 | |
| • Anxiety | 0.1508 | |
| Calgary depression rating scale (CDRS) | • Hopelessness | −0.3738 |
| Neurocognitive battery | • Processing speed standardized to baseline | 0.2795 |
| • Neurocognitive composite score standardized to baseline | 0.2301 | |
| • Vigilance score standardized to baseline | 0.1303 | |
| • Verbal score standardized to baseline | 0.0844 | |
| Clinical global impressions scale (CGIS) | • Clinician global impression of severity | −0.4592 |
| • Satisfaction of contact with mental health professionals | 0.2716 | |
| • Patient version, clinical global impression of severity | −0.2379 | |
| • Patient-reported mental/emotional health | 0.2259 | |
| • Energy and interests | 0.1678 | |
| • Productive activities | −0.1248 | |
| • Tobacco products use | −0.1020 | |
| • Energy and interests are [x] time more important than least important CGIS item | 0.0884 | |
| • CGIS response | 0.0799 | |
| • Disturbing and unusual experiences are [x] time more important than least important CGIS item | 0.0392 | |
| • Alcohol use | 0.0340 | |
| Insight and treatment attitudes questionnaire (ITAQ) | • Do you now need to take medication for mental problems? | 0.2900 |
| Drug attitude inventory (DAI) | • I feel weird like a zombie on meds | 0.0880 |
| • Medication is unnatural for my mind and body | −0.0473 | |
| • The good of meds outweighs the bad | 0.0006 | |
| Physician’s assessment of the severity of the adverse event | • Sialorrhea | 0.1961 |
| • Hypersomnia | −0.0667 | |
| • Akinesia | 0.0101 | |
| Impact of adverse event on patients’ adherence to medication | • Akinesia | 0.3743 |
| • Dry mouth | 0.2459 | |
| • Weight gain | 0.2242 | |
| • Sialorrhea | 0.0255 | |
| Antipsychotic medication | • Adherencea | 0.0630 |
| • Total # of days taking olanzapine (between baseline and the 6-month visit)a | −0.0616 | |
| • Has the patient taken quetiapine (between baseline and the 6-month visit)a | −0.0427 | |
| • Total # of days taking risperidone (between baseline and the 6-month visit)a | 0.0051 | |
| Laboratory values | • Total bilirubin level | 0.5947 |
| • HDL cholesterol level | 0.0862 | |
| Other variables | • Childhood antisocial behaviors | −0.0427 |
aVariables that have only been measured during follow-up visits (not during the baseline visit), and that therefore could only be a predictor in this model.
Finally, the QoL at 6 months was estimated using baseline variables in a third model. With 41 variables, an uncentered adjusted R-squared of 0.307 was obtained. The complete model and its parameters are presented in Table 4. The MSE training result was 0.93 and the MSE testing result was 0.96.
Table 4.
Linear regression of QoL at the 6-month visit using baseline variables. N = 786.
| Categories | Baseline variables | Coeff. |
|---|---|---|
| Sociodemographics | • Male gender | −0.3105 |
| • Parents highest education level | 0.2604 | |
| • Patient’s highest education level | 0.2491 | |
| • Veteran | −0.1065 | |
| • Hispanic, Latino, or Spanish origin | 0.0380 | |
| Psychiatric diagnoses | • No comorbid psychiatric conditions | 0.2136 |
| Positive and negative symptoms scale (PANSS) | Positive symptoms; | |
| • Hostility | −0.0038 | |
| Negative symptoms; | ||
| • Passive apathetic social withdrawal | −0.7467 | |
| • Stereotyped thinking | −0.2305 | |
| • Poor rapport | −0.1324 | |
| • Emotional withdrawal | −0.0737 | |
| General symptoms; | ||
| • Lack of judgment and insight | −0.0751 | |
| • Somatic concern | −0.0139 | |
| • Active social avoidance | −0.0131 | |
| Calgary depression rating scale (CDRS) | • Hopelessness | −0.0562 |
| Neurocognitive battery | • Processing speed score | 0.3155 |
| • Working memory score | 0.1009 | |
| • Neurocognitive composite score | 0.0934 | |
| Clinical global impressions scale | • CGIS response | 0.2470 |
| • Productive activities are [x] time more important than least important CGIS item | 0.2123 | |
| • Patient-reported mental/emotional health | 0.1501 | |
| • Energy and interests | 0.0477 | |
| • Alcohol use | 0.0266 | |
| • Disturbing and unusual experiences | −0.0219 | |
| Insight and treatment attitudes questionnaire (ITAQ) | • Do you at any time need treatment, hospitalization, or outpatient care? | 0.1624 |
| • Do you now need to take medication for mental problems? | 0.1114 | |
| • Have you at any time needed to take medication for mental problems? | 0.0689 | |
| • How much information did you recently receive from mental health service providers? | 0.0398 | |
| Drug attitude inventory (DAI) | • Staying on meds prevent me from getting sick | 0.0470 |
| • My thoughts are clearer on medication | 0.0401 | |
| • Good outweighs the bad | 0.0282 | |
| • Medication is unnatural for my mind and body | −0.0117 | |
| • I feel more normal on medication | −0.0104 | |
| • Meds make me feel tired and sluggish | 0.0089 | |
| Physician’s assessment of the severity of the adverse event | • Sexual orgasm | −0.0766 |
| Patient’s assessment of the severity of the adverse event | • Weight gain | 0.1433 |
| • Insomnia | −0.0514 | |
| Antipsychotic medication | • Olanzapine | 0.0606 |
| • No antipsychotic medication | −0.0466 | |
| Other variables | • Medication switch status | 0.0160 |
| • Day screened (vs. baseline) | 0.0032 | |
A summary of the results of the three prediction models is presented in Table 5. Among the strongest and most reliable predictors were having low/no passive apathetic social withdrawal, low/no emotional withdrawal, and having a high processing speed score. Many other variables were also present in all three models, including having educated parents, self-reporting high mental health, female gender, being treatment-responsive (CGIS), gaining weight as a side-effect, and having energy and interests. Being a veteran and being hopeless were negatively associated with QoL. Other predictors were strong but only present in one or two models; having a high level of total bilirubin, a higher education level, or believing that they had a mental problem was associated with a better QoL. Meanwhile, having a high clinical global impression of severity, social avoidance, poor rapport, stereotyped thinking, and dry mouth as a side-effect was associated with poorer outcomes.
Table 5.
Summary of variables favoring quality of life. Variables with a similar meaning (e.g., different scales for the same side effect) were merged. Predictors are presented in order of effect sizes.
| Predictors present in all models | Predictors present in two models | Predictors present in only one model |
|---|---|---|
| Low/no passive apathetic social withdrawal | Low clinical global impression of severity | High total bilirubin |
| Low/no emotional withdrawal | Having a higher education level | Believing that they have mental/nerve/worry problems |
| Neuro: high processing speed score | Low/no active social avoidance | Having dry mouth as an adverse event |
| Having more educated parents | Low/no poor rapport | Low/no stereotyped thinking |
| High patient-reported mental/emotional health | Subjective need to take medication for mental problems | Having akinesia as an adverse event |
| Not being hopeless | Low/no tobacco use | Saying that they will take the medication |
| Female gender | A high neurocognitive composite score | Having guilt feelings |
| Not being a veteran | Being satisfied with providers | Believing that, at any time, they needed treatment hospitalization or outpatient care |
| CGIS response | Neuro: high working memory score | Having anxiety |
| More severe weight gain | Having no comorbid conditiona | Having sleepiness as an adverse event |
| High/important energy and interests | Believing that staying on meds prevent them from getting sick | Neuro: high vigilance score |
| High/important productive activities | Being of Hispanic, Latino, or Spanish origin | Low/no self-depreciation |
| Consuming alcohol | Taking antipsychotics at baseline | Having grandiosity |
| Not having disturbing and unusual experiences or considering them as important | Neuro: high verbal score | Having sialorrhea as an adverse event |
| Not thinking that medication is unnatural for their mind and body | Not having a diagnosis of major depressiona | |
| Not feeling weird like a ‘zombie’ on medication | Having medication side effects | |
| Using risperidone as an antipsychotic | High mean corpuscular hemoglobin | |
| Thinking that good things about medication outweigh the bad | High HDL cholesterol | |
| Thinking that meds make them feel tired and sluggish | Low/no sexual orgasm-related adverse event | |
| Having a longer period between the screening and the baseline visit | Not lacking judgment and insight | |
| Low/no poor attention | ||
| Believing that, at any time, they needed to take medications for mental problems | ||
| Low/no hypersomnia as an adverse event | ||
| Good adherence to study medicationb | ||
| Believing that, at any time, they had mental problems that were different from most other people’s | ||
| Suffering from sexual arousal-related adverse event | ||
| Low/no insomnia as an adverse event | ||
| Past/inactive medical Hx status | ||
| Low/no hallucinatory behavior | ||
| Low/no childhood antisocial behaviors | ||
| Not using quetiapine | ||
| Thinking that their thoughts are clearer on medication | ||
| Having received a lot of information from mental health service providers about the illness | ||
| Olanzapine use | ||
| Not having another psychiatric diagnosis (apart from abuse/dependence, OCD, anxiety, major depression, and personality disorders)a | ||
| No alcohol abuse diagnosis | ||
| Medication switch statusa | ||
| Not taking another antipsychotic (apart from olanzapine, quetiapine, risperidone, ziprasidone, haloperidol, decanoate, and perphenazine)a | ||
| Low/no somatic concern | ||
| Feeling more normal on medication | ||
| Being black or African American | ||
| Not being white | ||
| Low/no hostility |
aVariables that have only been measured during the screening or baseline visit, and that therefore could only be a predictor in models 1 and 3.
bVariable that has only been measured during follow-up visits, and that therefore could only be a predictor in model 2.
Bold: coefficient over 0.3.
Italic: coefficient under 0.1.
Discussion
This study aimed to accurately predict further QoL by identifying the characteristics that make individuals more prone to recover. By using machine learning to create optimal models, good predictions have been reached, and this despite adjustments to avoid any redundancy or collinearity of the data. Three models were calculated: (1) prediction of 12-month QoL with baseline variables, (2) prediction of 12-month QoL with 6-month variables, and (3) prediction of 6-month QoL with baseline variables. R squares of 0.350, 0.365, and 0.307 were achieved for each of these models, respectively. Identified predictors included, among others, social and emotion-related symptoms, neurocognition (processing speed), education, female gender, veteran status, indicators of satisfaction with psychiatric treatment as well as elements of physical functioning. The performance of the model is consistent with the prediction score for human behavior modeling20.
Firstly, predictors of QoL include many symptoms related to social and emotional aspects of life (e.g., negative association with social and emotional withdrawal, social avoidance, poor rapports, and hopelessness), thereby highlighting the fact that socialization and social roles are central determinants of QoL. Notably, the patients’ and their parents’ education level, likely associated with social inclusion and socioeconomic status, were strong predictors, as previously demonstrated21. Similar results have previously been obtained with emotional discomfort22. It is indeed possible that the relationship between negative symptoms and the QoL observed in the literature is due to the patients’ ability to interact with others as well as their environment. These factors might be related to social support as well, which is a key component of QoL23. The lack of social support is indeed a major problem for individuals with schizophrenia23, and it is, therefore, a crucial determinant to consider. Female gender was also associated with higher QoL; this predictor is, however, controversial in the current literature24–28. The backgrounds and origins of patients also seem to have an impact, since parental education level and veteran status were among identified predictors. This finding could be linked to the fact that schizophrenia patients with a greater trauma history tend to have a poorer QoL29.
Secondly, as previously demonstrated with that database13, neurocognition had a significant impact on QoL. Considering each subscale separately, the processing speed was found to be the most predictive, even more than the total neurocognition score. This finding suggests that cognitive rehabilitation programs, which have already proven to be effective to improve cognitive performance, symptoms, and psychosocial functioning30, could be an important element to improving QoL as well31.
Many subjective factors were also classified as very strong predictors of QoL. For example, good mental health, evaluated by the physician or reported by the patient, was contributing to a favorable outcome. Satisfaction toward mental health providers was also an important predictor, which was previously shown to be associated with a better QoL32. This finding suggests that the patients’ subjective satisfaction is a very important factor when it comes to recovery. Additionally, having a good attitude toward the medication (e.g., thinking that medication is needed or that it prevents them from getting sick) also seemed important. These factors are likely to be associated with better medication adherence, as supported by other recent studies of people with schizophrenia15,33. Adherence was only found to be a weak predictor in one model; however, it should be noted that it was only a potential predictor in the second model as this was not measured at the baseline visit, since the patients were not yet taking the study medication. Antipsychotic medication is indeed considered important to improve the mental health of schizophrenia patients. However, while they contribute to the improvement of the symptomatology, they also cause a lot of side effects, thereby having contradictory effects on QoL. In the current study, side effects and treatment attitudes seemed more important than specific drugs, demonstrating that the ideal medication varies from patient to patient, and that adherence and observed changes are more important in predicting QoL. Nevertheless, response to treatment, measured using the CGIS questionnaire, was found to be a strong predictor in all three models. These results confirm those of Naber et al., who came to similar conclusions using the CATIE database34.
Finally, some physical health indicators were included in the models (e.g., bilirubin). Physical comorbidities being very frequent in that population could reflect the presence of metabolic disorders that greatly impact the QoL of some individuals. Tobacco use, which is well established to be associated with significant physical disorders, was also a predictor in two models. Similarly, predictors related to adverse events were also probably associated with physical health, which is unsurprisingly a great predictor of QoL in schizophrenia35. However, weight gain was found to be predictive of a better QoL in all models. This result is controversial since that side-effect is usually associated with poorer outcomes. However, compliant patients might be at higher risk of gaining weight from medication, which could explain that association36.
Although this study innovates by demonstrating that QoL can be predicted effectively in schizophrenia patients, a few limitations must be acknowledged. Despite that the prediction was great in that cohort, it is not necessarily representative of the overall schizophrenia population. Subjects were excluded if they had certain psychiatric comorbid diagnoses that are fairly frequent in that population (e.g., mental retardation and schizoaffective disorders), and they were all willing to participate as well as able to provide informed consent. However, this is an issue that is common to all randomized controlled trials, and the researchers minimized that issue by including a large number of sites representative of the United States population. Nevertheless, more such studies will be needed to confirm the predictors identified. This model could also be tested on another population to assess to what extent it is generalizable.
In conclusion, this study allowed an excellent prediction of the QoL of patients with schizophrenia using machine learning algorithms. Among the best and most reliable predictors of QoL were notably characteristics linked to social and emotional symptoms, good attitude toward medication, satisfaction toward healthcare providers and patients’ own mental health, neurocognition, female gender, and medication side-effects. Since good prediction levels can be achieved, the use of machine learning could have major implications for the future of prediction as it helps avoid human bias. Eventually, it will also become possible to create predictive algorithms that could be used on various clinical populations and guide clinicians in their decision-making. The study of the predictors identified by such algorithms also allows a bit more insight into how a disease such as schizophrenia manifests itself and into the mechanisms that could explain the outcome. Notably, in the present study, we were able to identify very precise symptoms and factors that could have a higher impact than expected on the QoL of people with schizophrenia (e.g., their subjective perception of their mental health). In doing so, it was notably observed that physical health variables, which are often omitted from mental health-related studies, seem to have an important impact on schizophrenia patients’ QoL. Consequently, interventions aiming to increase QoL should also consider these aspects. More studies will be needed to confirm the results and their applicability for clinicians.
Methods
Study sample
Data for this study were extracted from the CATIE schizophrenia study dataset. CATIE was a large, naturalistically designed clinical trial conducted by the National Institute of Mental Health (NIMH) between December 2000 and December 2004. 1460 patients with a DSM-IV diagnosis of schizophrenia, based upon the Structured Clinical Interview for DSM-IV37, were followed for 18 months. The trial was approved by the institutional review board at each site, and the patients or their legal guardians provided their written informed consent. The detailed study description and design can be found elsewhere38.
A subsample of 952 patients was selected based on the longitudinal monitoring of their QoL, i.e., they had completed at least 2 visits among the baseline visit and the 6, 12, and 18-month follow-up visits. According to the protocol, participants should have been followed for 18 months, with a follow-up visit occurring every 3 months or so. However, the attrition rate was very high, and therefore some variables were missing for some participants. Consequently, only data up to 12 months were used, and 697 subjects could be included in the first model, whereas the second and the third comprised 692 and 786 individuals, respectively.
Dataset
The QoL was measured every 6 months using a well-validated clinician-rated scale, the Heinrichs-Carpenter Quality of Life Scale (QOL)39. The objective was to use total QoL score at 6 and 12 months as a continuous outcome, i.e., the dependent variable, while all other variables from the CATIE trial were used as potential predictors in linear regressions. These included a large number of questionnaire items as well as the total scores and other variables (dichotomous or continuous) that were in the database, for a total of 253 potential baseline predictors and 233 potential 6-month predictors. Notably, psychotic symptoms were accessed during each visit using the positive and negative syndrome scale40. Depressive symptoms were measured every 3 months using the Calgary depression rating scale41. Neurocognition was measured using a neurocognitive battery accessing verbal learning, vigilance, speed, reasoning, and working memory. Other potential predictors were selected based on what was available within the database. These included many variables, both demographic and clinical, and both psychiatric and somatic (e.g., sociodemographic variables, metabolic biomarkers, complete blood count, side effects severity, antipsychotic medication, insight and attitudes toward treatment, adherence, violence, drug use, general status, vitals, etc.). However, items that were considered too conceptually related to the concept of QoL (i.e., redundant with items of the QoL questionnaire) were removed from the database. Included variables were all detailed in the Supplementary Table. In the models where computed not only the scales’ totals but also every single item included in each tool and questionnaire.
Statistical analysis
A Lasso supervised regularization algorithm was implemented to identify potential predictors for three models: (1) baseline predictors of 12-month QoL, (2) 6-month predictors of 12-month QoL, and (3) baseline predictors of 6-month QoL. This type of regularization regression was developed to enable feature (predictor) selection and regularize the dataset to optimize prediction accuracy. By conducting multiple analyses in parallel, it is possible to assume that the variables that recur consistently across models are probably stronger predictors since these remain important over time.
The Lasso algorithm, from the Sk learn library (version 1.0.1), was implemented in Python 3.9. The train the regularization algorithm, 70% of the dataset was used whereas 30% is used for testing, which performed well in similar studies with datasets of this size in the literature42,43. A pre-processing of the data took place prior to this division. Participants for whom 25% of data were missing were removed from the dataset. The remaining missing data was accounted for by using the mean value of the other participants which is a technique called mean imputation often used in order to stabilize the classification process (selection of predictors). This algorithm is consistent with other studies conducted in the field of psychiatry. Best performing hyperparameters were identified using the GridSearchCV algorithm provided by the Sk learn library. An alpha = 0.01, max_iter = 100,000 and default values for the remaining parameters were selected by the GridSearchCV.
The performance of the algorithm for the three models was analyzed as follows. The MSE for the training set and for the testing set were calculated and compared. An R2 score was calculated for both the training set and testing sets. The testing R2 score is representative of our predictive score where a score of 1 would indicate that the model explains all the variation of the dependent variable around its mean compared to a score of 0 which means that the model does not explain at all the observed variations. Collinearity between the different variables is accounted for in the Lasso algorithm by its regulative nature: it keeps all the features of the model but gradually reduces the coefficient up to 0 of variables that are not of interest in the model to predict the dependent variable.
To account for the validation of the regression algorithm over the three models, tenfold cross-validation was conducted. This validation method, which is repeated ten times, divides the dataset randomly into ten parts and nine of those parts are used for training whereas the remaining one is used for testing.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Supplementary Table. Variables included into database and computed using supervised machine learning.
Acknowledgements
We would like to thank the National Institutes of Mental Health (NIMH) for gratuitously sharing the database of the Clinical Antipsychotic Trials of Intervention Effectiveness Schizophrenia Trial. The authors did not receive any particular funding for this research project.
Author contributions
The study was designed by M.B., A.H., S.P., and A.D. Statistical analyses were performed by A.H., M.B., and C.E.G. The paper was written by M.B., A.H., and A.D. All the authors have made substantial contributions and have revised, edited, and approved the paper.
Data availability
The Clinical Antipsychotic Trials of Intervention Effectiveness Schizophrenia Trial is a limited access dataset available on request, under certain conditions, from the National Institutes of Mental Health clinical trials.
Code availability
Algorithms were all derived from the Scikit learn library version 1.0.1, a publicly available resource that includes simple and efficient tools for predictive data analysis. These were implemented using Python 3.9. Pseudocode with all relevant parameters was made available; see Supplementary Method. For intellectual property reasons, the complete code will not be made accessible.
Competing interests
The authors declare no direct competing interests. A.D. and S.P. received grants from Otsuka Pharmaceuticals and from HLS Therapeutics, but these were not related to the present research project. A.D. is funded by the Fonds de recherche du Québec—Santé (FRQS), and M.B. is funded by the Canadian Institutes of Health Research (CIHR).
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Mélissa Beaudoin, Email: Melissa.Beaudoin.1@UMontreal.ca.
Alexandre Dumais, Email: Alexandre.Dumais@UMontreal.ca.
Supplementary information
The online version contains supplementary material available at 10.1038/s41537-022-00236-w.
References
- 1.WHO. Schizophreniahttps://www.who.int/news-room/fact-sheets/detail/schizophrenia (2019).
- 2.Meesters PD, et al. Subjective quality of life and its determinants in a catchment area based population of elderly schizophrenia patients. Schizophr. Res. 2013;147:275–280. doi: 10.1016/j.schres.2013.04.030. [DOI] [PubMed] [Google Scholar]
- 3.Bobes J, Garcia-Portilla MP, Bascaran MT, Saiz PA, Bousoño M. Quality of life in schizophrenic patients. Dialog. Clin. Neurosci. 2007;9:215–226. doi: 10.31887/DCNS.2007.9.2/jbobes. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Development of the World Health Organization WHOQOL-BREF quality of life assessment. The WHOQOL Group. Psychol. Med. 1998;28:551–558. doi: 10.1017/S0033291798006667. [DOI] [PubMed] [Google Scholar]
- 5.Makara-Studzińska M, Wołyniak M, Partyka I. The quality of life in patients with schizophrenia in community mental health service—selected factors. J. Preclin. Clin. Res. 2011;5:31–34. [Google Scholar]
- 6.Boyer L, et al. Quality of life is predictive of relapse in schizophrenia. BMC Psychiatry. 2013;13:15. doi: 10.1186/1471-244X-13-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Norman RM, et al. The relationship of symptoms and level of functioning in schizophrenia to general wellbeing and the Quality of Life Scale. Acta Psychiatr. Scand. 2000;102:303–309. doi: 10.1034/j.1600-0447.2000.102004303.x. [DOI] [PubMed] [Google Scholar]
- 8.Eack SM, Newhill CE. Psychiatric symptoms and quality of life in schizophrenia: a meta-analysis. Schizophr. Bull. 2007;33:1225–1237. doi: 10.1093/schbul/sbl071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sim K, Mahendran R, Siris SG, Heckers S, Chong SA. Subjective quality of life in first episode schizophrenia spectrum disorders with comorbid depression. Psychiatry Res. 2004;129:141–147. doi: 10.1016/j.psychres.2004.07.007. [DOI] [PubMed] [Google Scholar]
- 10.Suttajit S, Pilakanta S. Predictors of quality of life among individuals with schizophrenia. Neuropsychiatr. Dis. Treat. 2015;11:1371–1379. doi: 10.2147/NDT.S81024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Allison DB, Mackell JA, McDonnell DD. The impact of weight gain on quality of life among persons with schizophrenia. Psychiatr. Serv. 2003;54:565–567. doi: 10.1176/appi.ps.54.4.565. [DOI] [PubMed] [Google Scholar]
- 12.Olfson M, Uttaro T, Carson WH, Tafesse E. Male sexual dysfunction and quality of life in schizophrenia. J. Clin. Psychiatry. 2005;66:331–338. doi: 10.4088/JCP.v66n0309. [DOI] [PubMed] [Google Scholar]
- 13.Mohamed S, et al. Relationship of cognition and psychopathology to functional impairment in schizophrenia. Am. J. Psychiatry. 2008;165:978–987. doi: 10.1176/appi.ajp.2008.07111713. [DOI] [PubMed] [Google Scholar]
- 14.Fontanil-Gómez Y, Alcedo Rodríguez MA, Gutiérrez López MI. Personal and macro-systemic factors as predictors of quality of life in chronic schizophrenia. Psicothema. 2017;29:160–165. doi: 10.7334/psicothema2016.179. [DOI] [PubMed] [Google Scholar]
- 15.Wang XQ, Petrini MA, Morisky DE. Predictors of quality of life among Chinese people with schizophrenia. Nurs. Health Sci. 2017;19:142–148. doi: 10.1111/nhs.12286. [DOI] [PubMed] [Google Scholar]
- 16.Dwyer DB, Falkai P, Koutsouleris N. Machine learning approaches for clinical psychology and psychiatry. Annu. Rev. Clin. Psychol. 2018;14:91–118. doi: 10.1146/annurev-clinpsy-032816-045037. [DOI] [PubMed] [Google Scholar]
- 17.Bzdok D, Meyer-Lindenberg A. Machine learning for precision psychiatry: opportunities and challenges. Biol. Psychiatry Cogn. Neurosci. Neuroimaging. 2018;3:223–230. doi: 10.1016/j.bpsc.2017.11.007. [DOI] [PubMed] [Google Scholar]
- 18.Koutsouleris N, et al. Prediction models of functional outcomes for individuals in the clinical high-risk state for psychosis or with recent-onset depression: a multimodal, multisite machine learning analysis. JAMA Psychiatry. 2018;75:1156–1172. doi: 10.1001/jamapsychiatry.2018.2165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Fond G, et al. Machine learning for predicting psychotic relapse at 2 years in schizophrenia in the national FACE-SZ cohort. Prog. Neuropsychopharmacol. Biol. Psychiatry. 2019;92:8–18. doi: 10.1016/j.pnpbp.2018.12.005. [DOI] [PubMed] [Google Scholar]
- 20.Godin G, Kok G. The theory of planned behavior: a review of its applications to health-related behaviors. Am. J. Health Promot. 1996;11:87–98. doi: 10.4278/0890-1171-11.2.87. [DOI] [PubMed] [Google Scholar]
- 21.Rayan A, Obiedate K. The correlates of Quality of Life among Jordanian patients with schizophrenia. J. Am. Psychiatr. Nurses Assoc. 2017;23:404–413. doi: 10.1177/1078390317710498. [DOI] [PubMed] [Google Scholar]
- 22.Yamauchi K, et al. Predictors of subjective and objective quality of life in outpatients with schizophrenia. Psychiatry Clin. Neurosci. 2008;62:404–411. doi: 10.1111/j.1440-1819.2008.01818.x. [DOI] [PubMed] [Google Scholar]
- 23.Munikanan T, et al. Association of social support and quality of life among people with schizophrenia receiving community psychiatric service: a cross-sectional study. Compr. Psychiatry. 2017;75:94–102. doi: 10.1016/j.comppsych.2017.02.009. [DOI] [PubMed] [Google Scholar]
- 24.Abel KM, Drake R, Goldstein JM. Sex differences in schizophrenia. Int. Rev. Psychiatry. 2010;22:417–428. doi: 10.3109/09540261.2010.515205. [DOI] [PubMed] [Google Scholar]
- 25.Angermeyer MC, Kühn L, Goldstein JM. Gender and the course of schizophrenia: differences in treated outcomes. Schizophr. Bull. 1990;16:293–307. doi: 10.1093/schbul/16.2.293. [DOI] [PubMed] [Google Scholar]
- 26.Häfner H, an der Heiden W. Epidemiology of schizophrenia. Can. J. Psychiatry. 1997;42:139–151. doi: 10.1177/070674379704200204. [DOI] [PubMed] [Google Scholar]
- 27.Lu L, et al. Quality of life in Chinese patients with schizophrenia: a meta-analysis. Psychiatry Res. 2018;268:392–399. doi: 10.1016/j.psychres.2018.07.003. [DOI] [PubMed] [Google Scholar]
- 28.Dong M, et al. Quality of Life in schizophrenia: a meta-analysis of comparative studies. Psychiatr. Q. 2019;90:519–532. doi: 10.1007/s11126-019-09633-4. [DOI] [PubMed] [Google Scholar]
- 29.Lysaker PH, LaRocco VA. Health-related quality of life and trauma history in adults with schizophrenia spectrum disorders. J. Nerv. Ment. Dis. 2009;197:311–315. doi: 10.1097/NMD.0b013e3181a2070e. [DOI] [PubMed] [Google Scholar]
- 30.McGurk SR, Twamley EW, Sitzer DI, McHugo GJ, Mueser KT. A meta-analysis of cognitive remediation in schizophrenia. Am. J. Psychiatry. 2007;164:1791–1802. doi: 10.1176/appi.ajp.2007.07060906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ojeda N, et al. An explanatory model of quality of life in schizophrenia: the role of processing speed and negative symptoms. Actas Esp. Psiquiatr. 2012;40:10–18. [PubMed] [Google Scholar]
- 32.Petkari E, Pietschnig J. Associations of Quality of Life with service satisfaction in psychotic patients: a meta-analysis. PLoS ONE. 2015;10:e0135267. doi: 10.1371/journal.pone.0135267. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Endriyani L, Chien CH, Huang XY, Chieh-Yu L. The influence of adherence to antipsychotics medication on the quality of life among patients with schizophrenia in Indonesia. Perspect. Psychiatr. Care. 2019;55:147–152. doi: 10.1111/ppc.12276. [DOI] [PubMed] [Google Scholar]
- 34.Naber D, Lambert M. The CATIE and CUtLASS studies in schizophrenia: results and implications for clinicians. CNS Drugs. 2009;23:649–659. doi: 10.2165/00023210-200923080-00002. [DOI] [PubMed] [Google Scholar]
- 35.Ang MS, Nurjono M, Lee J. The effects of clinical illness severity and physical activity on health-related quality of life in schizophrenia. Qual Life Res. 2019;28:1509–1520. doi: 10.1007/s11136-019-02126-8. [DOI] [PubMed] [Google Scholar]
- 36.Qureshi ZP, et al. Assessment of real-life outcomes in schizophrenia patients according to compliance. Psychiatry J. 2020;2020:5848601. doi: 10.1155/2020/5848601. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.First, M. B. Structured Clinical Interview for the DSM-IV Axis IDisorders: SCID-I/P, Version 2.0. New York: Biometrics Research Dept., New York State Psychiatric Institute (1997).
- 38.Stroup TS, et al. The National Institute of Mental Health Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) project: schizophrenia trial design and protocol development. Schizophr. Bull. 2003;29:15–31. doi: 10.1093/oxfordjournals.schbul.a006986. [DOI] [PubMed] [Google Scholar]
- 39.Heinrichs DW, Hanlon TE, Carpenter WT. The Quality of Life Scale: an instrument for rating the schizophrenic deficit syndrome. Schizophr. Bull. 1984;10:388–398. doi: 10.1093/schbul/10.3.388. [DOI] [PubMed] [Google Scholar]
- 40.Kay SR, Fiszbein A, Opler LA. The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr. Bull. 1987;13:261–276. doi: 10.1093/schbul/13.2.261. [DOI] [PubMed] [Google Scholar]
- 41.Addington D, Addington J, Schissel B. A depression rating scale for schizophrenics. Schizophr. Res. 1990;3:247–251. doi: 10.1016/0920-9964(90)90005-R. [DOI] [PubMed] [Google Scholar]
- 42.Kohavi, R. In Ijcai. 1137–1145 (Montreal, Canada).
- 43.Steyerberg EW. Validation in prediction research: the waste by data splitting. J. Clin. Epidemiol. 2018;103:131–133. doi: 10.1016/j.jclinepi.2018.07.010. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary Table. Variables included into database and computed using supervised machine learning.
Data Availability Statement
The Clinical Antipsychotic Trials of Intervention Effectiveness Schizophrenia Trial is a limited access dataset available on request, under certain conditions, from the National Institutes of Mental Health clinical trials.
Algorithms were all derived from the Scikit learn library version 1.0.1, a publicly available resource that includes simple and efficient tools for predictive data analysis. These were implemented using Python 3.9. Pseudocode with all relevant parameters was made available; see Supplementary Method. For intellectual property reasons, the complete code will not be made accessible.
