Author manuscript; available in PMC: 2024 Sep 24.
Published in final edited form as: Proc Mach Learn Res. 2024 Jun;248:137–154.

Figure 5:

Attention and relative time encoding visualization. Attention weights for two days of the week are shown as examples, together with the difference in attention between them (the fourth image). Attention scores are averaged over all completely held-out test samples, and the relative time encoding is averaged over models from 10 training splits.