Skip to main content
. Author manuscript; available in PMC: 2024 Mar 19.
Published in final edited form as: Sex Transm Dis. 2021 Jan;48(1):56–62. doi: 10.1097/OLQ.0000000000001264

TABLE 2.

Cross-Validated Area Under the Receiver Operating Curve (cv-AUC) for Incident Sexually Transmitted Infections (STIs)* With ≥1 Repeat STI Diagnoses or ≥2 Repeat STI Diagnoses Stratified by Duration of Follow-Up, Atrius Health, 2008 to 2015

730 d of Follow-Up 365 d of Follow-Up
Algorithm Incident STIs With ≥2 Repeat STI, cv-AUC Incident STIs With ≥1 Repeat STI, cv-AUC Incident STIs With ≥2 Repeat STI, cv-AUC Incident STIs With ≥1 Repeat STI, cv-AUC
SL 0.76 0.66 0.70 0.64
BART 0.75 0.67 0.72 0.65
LASSO 0.73 0.67 0.67 0.64
XGB2,50 0.73 0.64 0.69 0.61
XGB4,50 0.73 0.62 0.69 0.59
XGB2,25 0.72 0.65 0.70 0.63
NN2,screen 0.72 0.65 0.56 0.63
XGB2,10 0.71 0.64 0.65 0.62
NN1,screen 0.71 0.65 0.51 0.63
XGB4,25 0.70 0.62 0.70 0.59
GLMscreen 0.70 0.65 0.58 0.63
RIDGE 0.70 0.65 0.63 0.63
XGB4,10 0.67 0.61 0.62 0.58
SVM1.5,screen 0.55 0.48 0.58 0.49
SVM1,screen 0.52 0.48 0.57 0.50
GLM 0.51 0.57 0.52 0.49
*

Defined as positive laboratory result for chlamydia, gonorrhea, or a syphilis diagnosis.

BART indicates Bayesian Additive Regression Trees (200 trees, 2-way interactions); GLM, generalized linear model logistic regression; LASSO, least absolute shrinkage and selection operator using AUC loss; NN, neural net with 1 or 2 nodes in the hidden layer; RIDGE, ridge regression using AUC loss; “screen,” covariates were prescreened to include only those whose correlation with the outcome had magnitude at least 0.2, although a minimum of 20 covariates were retained; SL, super learner; SVM, support vector machine (radial kernel with cost, 1 or 1.5); XGB, eXtreme Gradient Boosting (1000 trees; shrinkage, 0.1; maxdepth, 2 or 4 [interaction depth], 10, 25, or 50 minimum number of observations in terminal node).