2022 Apr 14;24(4):551. doi: 10.3390/e24040551

Figure 13.

The architecture of SimSiam. Two augmented views of an image are passed through the same encoder, which consists of a backbone (ResNet) and a projection MLP. A prediction MLP (h) is applied on one side, and a stop-gradient is applied on the other to avoid collapse. The model is trained to maximize the similarity between the two views. SimSiam uses neither negative pairs nor a momentum encoder.
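The objective described in the caption can be sketched as follows. This is a minimal illustration of the forward computation only, not the authors' implementation. It assumes that similarity is measured by cosine similarity and that the loss is symmetrized over the two views, neither of which is stated in the caption. The names `stop_gradient`, `neg_cosine`, and `simsiam_loss` are hypothetical. Here `z1` and `z2` stand for the encoder outputs of the two views, and `p1` and `p2` for the corresponding outputs of the prediction MLP h.

```python
import numpy as np


def stop_gradient(x):
    # Assumption: NumPy has no automatic differentiation, so this is the
    # identity. In an autodiff framework, this is where the target branch
    # would be cut off from backpropagation.
    return x


def neg_cosine(p, z):
    """Mean negative cosine similarity between matching rows of p and z."""
    z = stop_gradient(z)  # the target side receives no gradient
    p = p / np.linalg.norm(p, axis=1, keepdims=True)
    z = z / np.linalg.norm(z, axis=1, keepdims=True)
    return -np.mean(np.sum(p * z, axis=1))


def simsiam_loss(p1, z1, p2, z2):
    # Each view's prediction is compared with the other view's encoder
    # output; averaging the two directions is an assumed symmetrization.
    return 0.5 * neg_cosine(p1, z2) + 0.5 * neg_cosine(p2, z1)


# Toy usage with random stand-ins for the network outputs.
rng = np.random.default_rng(0)
z1, z2 = rng.normal(size=(4, 8)), rng.normal(size=(4, 8))
p1, p2 = rng.normal(size=(4, 8)), rng.normal(size=(4, 8))
loss = simsiam_loss(p1, z1, p2, z2)
```

Minimizing this loss maximizes the similarity between the two views. The loss is bounded between -1 and 1 and reaches -1 when each prediction points in the same direction as the other view's encoder output.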