[Preprint]. 2024 Nov 26:arXiv:2411.15240v2. [Version 2]

Table 2. Model performance across different actigraphy tasks (predicting SSRI usage, Sleep Disorder, Sleep Abnormalities, and Depression).

The table summarizes the performance of our PAT models versus various baseline models (including LSTM, CNN, ConvLSTM, and 3D CNN) across four tasks: predicting SSRI usage, history of any sleep disorder, abnormal sleep patterns, and depression. Each model is trained on dataset sizes “500”, “1,000”, “2,500”, and all available data (5,769 for SSRI usage, 3,429 for Sleep Disorder and Sleep abnormalities, and 2,800 for Depression) and evaluated using AUC on a held-out test set of 2,000 participants. The score for each model here represents the averaged AUC scores across each training dataset size. If the model name has “smoothing” after it, it denotes that the model was trained on smoothed data. An underline indicates the best baseline model. PAT-S/M/L denotes Small, Medium, Large. A bolded PAT model indicates that it performed better than the best baseline, and a bolded and underlined PAT indicates the model with the best performance. The results suggest that PATs outperform baseline models in various actigraphy understanding tasks and at various dataset sizes.

MODEL	SSRI usage	Sleep Disorder	Sleep Abnormalities	Depression	Params
LSTM	0.527	0.494	0.513	0.489	15 K
LSTM (smoothing)	0.523	0.506	0.515	0.506	15 K
Wavelet Transform	0.674	0.529	0.525	0.523	10 K
CNN-1D	0.616	0.563	0.534	0.522	10 K
CNN-1D (smoothing)	0.611	0.558	0.519	0.517	10 K
Conv LSTM (smoothing)	0.655	0.609	0.579	0.547	1.75 M
Conv LSTM	0.606	0.606	0.585	0.550	1.75 M
CNN-3D	0.677	0.608	0.606	0.583	790 K
CNN-3D (smoothing)	0.680	0.605	0.615	0.586	790 K

PAT-S	0.641	0.587	0.555	0.560	285 K
PAT Conv-S	0.656	0.616	0.573	0.587	285 K
PAT-M	0.690	0.641	0.641	0.559	1.00 M
PAT Conv-M	0.668	0.616	0.627	0.594	1.00 M
PAT Conv-L	0.695	0.631	0.659	0.610	1.99 M
PAT-L	0.700	0.632	0.665	0.589	1.99 M