Skip to main content
. 2026 Jan 21;16:5925. doi: 10.1038/s41598-026-36095-z

Table 2.

Network structure parameter configuration.

Network layer Input dimension Output dimension Activation function Parameter count
Knowledge encoder 256 512 LeakyReLU 1.31 M
Cross-modal fusion 1536 (512 × 3) 768 GeLU 3.54 M
Transformer encoder 768 768 SiLU 7.86 M
Motion decoder 768 3 J×T Tanh 2.47 M
Local discriminator 3 J×K 1 Sigmoid 0.98 M
Global discriminator 3 J×T 1 Sigmoid 1.63 M