Skip to main content
. 2023 Apr 10:1–9. Online ahead of print. doi: 10.1007/s11760-023-02559-2

Table 1.

Configuration of ResNet-BiLSTM architectures

Layers ResNet-BiLSTM
Input layer mag spectrogram (N X 257)
ResNet Block(1) Conv2D-filters=16,strides=(1,1)BatchNormalization()Conv2D-filters=16,strides=(1,1)Conv2D-filters=16,strides=(1,3)
ResNet Block(2) Conv2D-filters=32,strides=(1,1)BatchNormalization()Conv2D-filters=32,strides=(1,1)Conv2D-filters=32,strides=(1,3)
ResNet Block(3) Conv2D-filters=64,strides=(1,1)BatchNormalization()Conv2D-filters=64,strides=(1,1)Conv2D-filters=64,strides=(1,3)
ResNet Block(4) Conv2D-filters=128,strides=(1,1)BatchNormalization()Conv2D-filters=128,strides=(1,1)Conv2D-filters=128,strides=(1,3)
Recurrent layer BiLSTM-128
Attention
FC layer FC-128
ReLU
dropout(0.3)
FC-1
Output layer Average pool

The number of frames is denoted by N