Table 3.
Test-retest reliability of raw and transformed accuracy scores for each task across assessment
| ICC | r test-retest | ||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Baseline vs. AL | Baseline vs. DL | AL vs. DL | Baseline vs. AL vs. DL | Baseline vs. AL | Baseline vs. DL | AL vs. DL | |||||||||
| Grp. | Tasks | Raw | Transf | Raw | Transf | Raw | Transf | Raw | Transf | Raw | Transf | Raw | Transf | Raw | Transf |
| All | KT | .73 | .74 | .61 | .62 | .77 | .77 | .8 | .8 | .57 | .58 | .44 | .45 | .63 | .63 |
| LM | .67 | .71 | .66 | .66 | .78 | .78 | .77 | .79 | .51 | .55 | .49 | .5 | .65 | .64 | |
| SNB | .81 | .82 | .78 | .76 | .87 | .87 | .87 | .87 | .69 | .71 | .64 | .64 | .78 | .77 | |
| Cntrl | KT | .76 | .78 | .55 | .59 | .84 | .83 | .81 | .82 | .61 | .65 | .38 | .41 | .73 | .71 |
| LM | .67 | .67 | .57 | .59 | .75 | .77 | .76 | .78 | .5 | .51 | .4 | .42 | .66 | .66 | |
| SNB | .83 | .82 | .72 | .69 | .91 | .88 | .89 | .87 | .72 | .74 | .58 | .56 | .82 | .78 | |
Note. Grp = Groups Included. Cntrl = Control Beverage Only. Transf = Transformed. KT = Keep Track task; LM = Letter Memory task; SNB = Spatial 2-Back task. Baseline represents assessment at the baseline session. AL stands for ascending limb (or corresponding timepoint) assessment in the experimental session. DL stands for descending limb for ascending limb (or corresponding timepoint) assessment in the experimental session. ICC stands for intraclass correlation coefficient. Specifically, ICC (3, k)79 was computed, using k = 2 or 3 tests, as the k raters. Test-retest correlations (rtest-retest) were computed as Pearson’s r. For ease of comparison with the extant literature, correlations are presented for both raw and transformed (angularized and winsorized; see Analytic Strategy) task score. For Groups Included = All, the number of participants contributing complete data was: for KT, Baseline vs. AL n = 120, Baseline vs. DL n = 228, AL vs. DL n = 120, Baseline vs. AL vs. DL n = 120; for LM, Baseline vs. AL n = 119, Baseline vs. DL n = 227, AL vs. DL n = 119, Baseline vs. AL vs. DL n = 119; and for SNB, Baseline vs. AL n = 119, Baseline vs. DL n = 228, AL vs. DL n = 119, Baseline vs. AL vs. DL n = 119. For Groups Included = Control Only, the number of participants contributing complete data was: for KT, Baseline vs. AL n = 38, Baseline vs. DL n = 73, AL vs. DL n = 38, Baseline vs. AL vs. DL n = 38; for LM, Baseline vs. AL n = 36, Baseline vs. DL n = 74, AL vs. DL n = 38, Baseline vs. AL vs. DL n = 38; and for SNB, Baseline vs. AL n = 36, Baseline vs. DL n = 72, AL vs. DL n = 38, Baseline vs. AL vs. DL n = 38. All ICC and rtest-retest were significantly different from 0 at p < .001.