Table 2.
| Spatial operation | Block | Spatial | Channel | Resolution sensitive | Params | FLOPs |
|---|---|---|---|---|---|---|
| Spatial projection | MLP-Mixer [15] | specific | agnostic | true | | |
| | ResMLP [60] | specific | agnostic | true | | |
| | FeedForward [61] | specific | agnostic | true | | |
| | gMLP [67] | specific | agnostic | true | | |
| | Sparse-MLP(MoE) [88] | specific | agnostic | true | | |
| | CCS [91] | agnostic | group specific | true | | |
| | RepMLPNet [92] | specific | group specific | true | | |
| Axial projection | RaftMLP [74] | specific | agnostic | true | | |
| | ViP [71] | specific | specific | true | | |
| | Sparse MLP [73] | specific | specific | true | | |
| | DynaMixer [75] | specific | specific | true | | |
| | WaveMLP [76] | specific | specific | false | | |
| | MorphMLP [77] | specific | specific | false | | |
| Shifting & channel projection | S2MLP [78] | agnostic | specific | false | | |
| | S2MLPv2 [79] | agnostic | specific | false | | |
| | AS-MLP [80] | agnostic | specific | false | | |
| | CycleMLP [83] | agnostic | specific | false | | |
| | HireMLP [86] | agnostic | specific | false | | |
| | MS-MLP [87] | agnostic | specific | false | | |
| | ActiveMLP [85] | agnostic | specific | false | | |
H, W, and C denote the height, width, and number of channels of the feature map, respectively, and L is the local window size. “Spatial” indicates whether feature extraction is sensitive to the spatial location of objects: “specific” means it is, and “agnostic” means it is not. “Channel” indicates whether weights are shared between channels: “agnostic” shares weights across all channels, “group specific” shares weights within each group of channels, and “specific” shares no weights between channels. “Resolution sensitive” indicates whether the module depends on the input resolution (“true”) or can be applied to feature maps of any size (“false”).
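
The difference between a resolution-sensitive and a resolution-insensitive block can be illustrated with a minimal pure-Python sketch. It is not the implementation of any block listed in the table. It assumes a 1-D sequence of N = H·W tokens, a toy shift by one position in place of the 2-D shifts used by the shifting-based blocks, and the hypothetical function names `spatial_projection` and `shift_and_channel_projection`.

```python
def spatial_projection(tokens, weight):
    """Mix information across tokens: out[i][c] = sum_j weight[i][j] * tokens[j][c].

    `weight` is N x N, so it is tied to the token count (resolution sensitive)
    and the same matrix is applied to every channel (channel agnostic).
    """
    n = len(tokens)
    if len(weight) != n or any(len(row) != n for row in weight):
        raise ValueError("spatial weight is tied to a fixed token count")
    num_channels = len(tokens[0])
    return [
        [sum(weight[i][j] * tokens[j][c] for j in range(n))
         for c in range(num_channels)]
        for i in range(n)
    ]


def shift_and_channel_projection(tokens, weight):
    """Shift channel halves in opposite directions, then mix channels.

    `weight` is C x C_out, so it does not depend on the token count
    (resolution insensitive). Positions shifted in from outside are zero.
    """
    n = len(tokens)
    num_channels = len(tokens[0])
    half = num_channels // 2
    zero = [0.0] * num_channels
    shifted = []
    for i in range(n):
        prev_tok = tokens[i - 1] if i > 0 else zero
        next_tok = tokens[i + 1] if i < n - 1 else zero
        # First half of the channels comes from the previous token,
        # the remaining channels come from the next token.
        shifted.append(prev_tok[:half] + next_tok[half:])
    out_channels = len(weight[0])
    return [
        [sum(tok[c] * weight[c][d] for c in range(num_channels))
         for d in range(out_channels)]
        for tok in shifted
    ]


identity = [[1.0, 0.0], [0.0, 1.0]]
small = [[1.0, 2.0], [3.0, 4.0]]              # N = 2 tokens, C = 2 channels
large = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]  # N = 3 tokens, C = 2 channels

print(spatial_projection(small, identity))            # weight matches N = 2
print(shift_and_channel_projection(small, identity))  # works for N = 2
print(shift_and_channel_projection(large, identity))  # same weight, N = 3
```

In this sketch, calling `spatial_projection(large, identity)` raises a `ValueError`, because the 2 × 2 spatial weight cannot be applied to three tokens. The channel projection reuses the same weight at either size, which corresponds to the “false” entries in the “Resolution sensitive” column.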