Supplement 3: Hierarchical Clustering Model

Source code for prediction of COVID-19 test results. This is supplemental material to publication

Wojtusiak J, Bagais W, Vang J, Guralnik E, Roess A, Alemi F, "The Role of Symptom Clusters in Triage of COVID-19 Patients," Quality Management in Health Care, 2022.

Source code by Wejdan Bagais and Jee Vang with contribution of other authors.

LASSO Model for the 30 splits

Identifying the best Cutoff point & inverse of regularization strength value (C)

Selected number of clusters and C value

Read all data

Dendrograms

Clusters for the positive cases

Clusters for the negative cases

Building the model using all data to identify the list of predictors

Creating clusters columns

k-fold cross-validation, k=24

Coefficients from k-fold cross-validation that is consistent 95% of the time

Visualize coefficients

Tabular output of coefficients with odds