. Author manuscript; available in PMC: 2022 Nov 1.

Published in final edited form as: Addiction. 2021 May 3;116(11):3029–3043. doi: 10.1111/add.15506

Table 3.

Test-retest reliability of raw and transformed accuracy scores for each task across assessment

		ICC								r _test-retest
		Baseline vs. AL		Baseline vs. DL		AL vs. DL		Baseline vs. AL vs. DL		Baseline vs. AL		Baseline vs. DL		AL vs. DL
Grp.	Tasks	Raw	Transf	Raw	Transf	Raw	Transf	Raw	Transf	Raw	Transf	Raw	Transf	Raw	Transf
All	KT	.73	.74	.61	.62	.77	.77	.8	.8	.57	.58	.44	.45	.63	.63
	LM	.67	.71	.66	.66	.78	.78	.77	.79	.51	.55	.49	.5	.65	.64
	SNB	.81	.82	.78	.76	.87	.87	.87	.87	.69	.71	.64	.64	.78	.77
Cntrl	KT	.76	.78	.55	.59	.84	.83	.81	.82	.61	.65	.38	.41	.73	.71
	LM	.67	.67	.57	.59	.75	.77	.76	.78	.5	.51	.4	.42	.66	.66
	SNB	.83	.82	.72	.69	.91	.88	.89	.87	.72	.74	.58	.56	.82	.78

Note. Grp = Groups Included. Cntrl = Control Beverage Only. Transf = Transformed. KT = Keep Track task; LM = Letter Memory task; SNB = Spatial 2-Back task. Baseline represents assessment at the baseline session. AL stands for ascending limb (or corresponding timepoint) assessment in the experimental session. DL stands for descending limb for ascending limb (or corresponding timepoint) assessment in the experimental session. ICC stands for intraclass correlation coefficient. Specifically, ICC (3, k)⁷⁹ was computed, using k = 2 or 3 tests, as the k raters. Test-retest correlations (r_test-retest) were computed as Pearson’s r. For ease of comparison with the extant literature, correlations are presented for both raw and transformed (angularized and winsorized; see Analytic Strategy) task score. For Groups Included = All, the number of participants contributing complete data was: for KT, Baseline vs. AL n = 120, Baseline vs. DL n = 228, AL vs. DL n = 120, Baseline vs. AL vs. DL n = 120; for LM, Baseline vs. AL n = 119, Baseline vs. DL n = 227, AL vs. DL n = 119, Baseline vs. AL vs. DL n = 119; and for SNB, Baseline vs. AL n = 119, Baseline vs. DL n = 228, AL vs. DL n = 119, Baseline vs. AL vs. DL n = 119. For Groups Included = Control Only, the number of participants contributing complete data was: for KT, Baseline vs. AL n = 38, Baseline vs. DL n = 73, AL vs. DL n = 38, Baseline vs. AL vs. DL n = 38; for LM, Baseline vs. AL n = 36, Baseline vs. DL n = 74, AL vs. DL n = 38, Baseline vs. AL vs. DL n = 38; and for SNB, Baseline vs. AL n = 36, Baseline vs. DL n = 72, AL vs. DL n = 38, Baseline vs. AL vs. DL n = 38. All ICC and r_test-retest were significantly different from 0 at p < .001.