Table 3. Predictors Used in the Analysis.
Predictor | Distribution of potential predictors | Proportion and distribution of statistically significant predictors | Proportion and distribution of combined model predictors | |||||
---|---|---|---|---|---|---|---|---|
No. | Distribution, %a | No. | Proportion, %b | Distribution, %a | No. | Proportion, %b | Distribution, %a | |
Total | 10 181 | 100 | 2137 | 21.0 | 100 | 191 | 8.9 | 100 |
Psychopathological risk factors | ||||||||
Diagnoses | 3364 | 33.0 | 649 | 19.3 | 30.4 | 12 | 1.8 | 6.3 |
Treatments | 224 | 2.2 | 80 | 35.7 | 3.7 | 11 | 13.8 | 5.8 |
Suicidality | 46 | 0.5 | 38 | 82.6 | 1.8 | 4 | 10.5 | 2.1 |
Total | 3634 | 35.7 | 767 | 21.1 | 35.9 | 27 | 3.5 | 14.1 |
Physical disorders | ||||||||
Diagnoses | 3716 | 36.5 | 666 | 17.9 | 31.2 | 22 | 3.3 | 11.5 |
Treatments | 230 | 2.3 | 96 | 41.7 | 4.5 | 7 | 7.3 | 3.7 |
FDA medications increasing suicide risk | 28 | 0.3 | 21 | 75.0 | 1.0 | 3 | 14.3 | 1.6 |
Total | 3974 | 39.0 | 783 | 19.7 | 36.6 | 32 | 4.1 | 16.8 |
Facility-level quality indicators | ||||||||
Total | 6 | 0.1 | 6 | 100 | 0.3 | 4 | 66.7 | 2.1 |
SDOHc | ||||||||
Geospatial indicators | 90 | 0.9 | 53 | 58.9 | 2.5 | 33 | 62.3 | 17.3 |
LexisNexis public records | 442 | 4.3 | 29 | 6.6 | 1.4 | 29 | 100 | 15.2 |
Patient-level factors (ICD-9-CM/ICD-10-CM codes) | 174 | 1.7 | 40 | 23.0 | 1.9 | 6 | 15.0 | 3.1 |
Sociodemographic characteristics | 24 | 0.2 | 17 | 70.8 | 0.8 | 9 | 52.9 | 4.7 |
Total | 733 | 7.2 | 142 | 19.4 | 6.6 | 80 | 56.3 | 41.9 |
NLP term/topic frequency | ||||||||
Terms | 1687 | 16.6 | 344 | 20.4 | 16.1 | 24 | 7.0 | 12.6 |
Topics | 150 | 1.5 | 98 | 100 | 4.6 | 27 | 100 | 14.1 |
Total | 1837 | 18.0 | 442 | 24.1 | 20.7 | 51 | 11.5 | 26.7 |
Abbreviations: FDA, US Food and Drug Administration; ICD-9-CM, International Classification of Diseases, Ninth Revision, Clinical Modification; ICD-10-CM, International Classification of Diseases, Tenth Revision, Clinical Modification; NLP, natural language processing; SDOH, social determinants of health.
Entries in the distribution columns represent the contribution of predictors in the row heading to the total in the column. The percentage estimates in each distribution column sum to 100%.
Entries in the proportion column represent the proportion of variation in the prior column for the same row that continue to exist in the current column. For example, the 649 significant psychiatric diagnoses in the first row and second column represent 19.3% of all the 3364 psychiatric diagnoses included in the initial potential predictor set, and the 12 psychiatric diagnoses in the final predictor set represent 1.8% of those 649 significant predictors.
Three of the NLP variables in the predictor set for the Combined model are included in this total as well as in the NLP total. One is the term trauma. The other 2 are topics in which the prominent terms are either homeless/shelter/homelessness/lack of housing or divorce/stressors.