Skip to main content
[Preprint]. 2024 May 31:arXiv:2405.20594v1. [Version 1]

Figure 3:

Figure 3:

(a) Eigenvalues of BTB as a function of the expansion ratio (1/λ). The shaded region shows the standard deviation of the eigenvalues. (b) Backward-forward path alignment between W and (RB)T as a function of the expansion ratio (1/λ), where we randomly sampled W and set RT=BW (expected to hold after the effect of the weight initialization has fully decayed). This simplification is consistent with the path alignment after training observed in simulations.