Skip to main content
. 2026 Jan 20;16:2503. doi: 10.1038/s41598-025-32207-3

Table 5.

Detailed parameters and operations within the CCST feature extractor module.

Operation Input shape Parameters Output shape
Temporal convolution (B, 1, CT)

Conv2d: kernel_size = (1,25),

stride = (1,1)

(B, 40, CT)
Spatial convolution (B, 40, CT)

Conv2d: kernel_size = (C,1),

stride = (1,1)

(B, 40, 1, T)
Batch normalization (B, 40, 1, T) BatchNorm2d: num_features = 40 (B, 40, 1, T)
Activation (ELU) (B, 40, 1, T) ELU activation (B, 40, 1, T)
Average Pooling (B, 40, 1, T)

AvgPool2d: kernel_size = (1,75),

stride = (1,15)

Inline graphic
Dropout Inline graphic Dropout: p = 0.5 Inline graphic
Projection convolution Inline graphic

Conv2d: in_channels = 40,

out_channels = 40,

kernel_size = (1,1)

Inline graphic
Rearrangement Inline graphic

Rearrange: ’b e (ht) (w) Inline graphic

b (ht w) e’

Inline graphic
Embedding projection Inline graphic

Linear: in_features = 40,

out_features = 64

Inline graphic
Positional encoding Inline graphic

{learnable, sine, none},

shape = (1, 15, 64)

Inline graphic
Transformer Inline graphic

Multiple layers: num_layers = 3,

num_heads = 4, mlp_hidden = 128,

window_size = 4

(B, 64)