Skip to main content
. 2021 Aug 26;7(8):e26604. doi: 10.2196/26604

Table 2.

Multivariate coefficients and CIs for linear regression and random forest regression of the COVID-19 risk score.

Variable Linear coefficient P value Low (95% CI) High (95% CI) MSEa Node purityb VIFc,d
Median income –1.34 <.001 –1.53 –1.16 42 59,736 3.68
Median home value –0.13 .07 –0.27 0.01 39 33,163 2.21
At or below poverty (%) –3.24 <.001 –3.42 –3.07 61 78,890 3.04
Unemployment (%) 0.73 <.001 0.60 0.86 87 68,364 1.69
Nonemployed (%) 5.38 <.001 5.26 5.50 285 316,903 1.42
Less than high school (%) 2.12 <.001 1.90 2.33 71 63,048 4.71
No health insurance (%) 0.69 <.001 0.55 0.83 50 34,818 2.18
More than 1 occupant (%) –0.89 <.001 –1.04 –0.73 59 41,387 2.46
African American (%) 0.73 <.001 0.59 0.87 68 84,497 2.09
Hispanic (%) –2.30 <.001 –2.49 –2.10 78 63,847 4.12
Asian (%) –1.14 <.001 –1.25 –1.02 91 93,675 1.42
Other race (%) –0.51 <.001 –0.67 –0.36 69 45,301 2.45

aMSE: mean standard error.

bNode impurity: residual sum of squares for the random forest model.

cVIF: variance inflation factor.

dFor the linear regression model.