Skip to main content
. 2021 Jun 18;21(12):4182. doi: 10.3390/s21124182

Table 2.

The architecture of the proposed method.

Layer Name Output Size Layer
STN 224 × 224 Localization network, grid generator, sampler
Conv1 112 × 112 7 × 7, 64, stride 2
Max pooling 56 × 56 3 × 3, stride 2
Stage 2 56 × 56 [1×1,643×3,641×1,256]×3, Mish
Stage 3 28 × 28 [1×1,1283×3,1281×1,512]×4, Mish
Stage 4 14 × 14 [1×1,2563×3,2561×1,1024]×6, Mish
Non-local attention module 14 × 14 Attention × 1
Stage 5 7 × 7 [1×1,5123×3,5121×1,2048]×3, Mish
Average pooling 1 × 1 7 × 7, stride 1
FC, softmax 1000-d