Skip to main content
. 2022 Apr 14;24(4):551. doi: 10.3390/e24040551

Figure 12.

Figure 12

A BYOL architecture. BYOL reduces similarity loss between qθ(zθ ) and sg (zξ' ), where θ represents the trained weights, ξ represents an exponential moving average of θ, and sg means the stop-gradient. After training, everything but fθ is discarded. yθ represents the image representation.