[Preprint]. 2024 Nov 26:arXiv:2411.15240v2. [Version 2]

Table 1. Evaluating models on the ability to predict benzodiazepine usage from actigraphy.

In this dataset, the input is actigraphy and the label indicates whether a participant is taking benzodiazepines. Each model is trained on dataset sizes “500”, “1,000”, “2,500”, and “5,769” (seen in the columns) and evaluated using AUC on a held-out test set of 2,000 participants. The “Avg AUC” represents the averaged AUC scores across each training dataset size. If the model name has “smoothing” after it, it denotes that the model was trained on smoothed data. Underlined text indicates the best baseline model. PAT-S/M/L denotes Small, Medium, Large. A bolded PAT model indicates that it performed better than the best baseline, and a bolded and underlined PAT indicates the model with the best performance. PATs significantly outperform the baseline models in every dataset size in this task.

MODEL	Avg AUC*	n=500	n=1000	n=2500	n=5769	Params
LSTM	0.493	0.501	0.487	0.474	0.512	15 K
LSTM (smoothing)	0.499	0.506	0.508	0.482	0.499	15 K
Wavelet Transform	0.620	0.674	0.625	0.598	0.583	10 K
CNN-1D	0.632	0.621	0.630	0.640	0.637	10 K
CNN-1D (smoothing)	0.639	0.633	0.634	0.644	0.646	10 K
Conv LSTM (smoothing)	0.667	0.666	0.680	0.653	0.671	1.75 M
Conv LSTM	0.668	0.663	0.681	0.650	0.677	1.75 M
CNN-3D	0.693	0.683	0.693	0.693	0.703	790 K
CNN-3D (smoothing)	0.697	0.677	0.695	0.696	0.719	790 K

PAT-S	0.701	0.706	0.718	0.677	0.703	285 K
PAT Conv-S	0.726	0.737	0.711	0.722	0.735	285 K
PAT-M	0.744	0.743	0.745	0.742	0.745	1.00 M
PAT Conv-M	0.761	0.753	0.756	0.760	0.773	1.00 M
PAT Conv-L	0.762	0.763	0.756	0.754	0.773	1.99 M
PAT-L	0.767	0.771	0.765	0.760	0.771	1.99 M