2021 Aug 20;25(11):4152–4162. doi: 10.1109/JBHI.2021.3106341

TABLE I. Details of 3D U-Net Architecture Used in Our Experiments.

feature size   Encoder (G / T)              Decoder (D)
1x56x160x192   input                        output  conv(1x1x1)-sigmoid
32x56x160x192  conv1  conv(1x3x3)-IN-LReLU  conv10  conv(1x3x3)-IN-LReLU
64x56x80x96    down1  strided conv(1,2,2)   up10    transposed conv(1,2,2) - conv1
64x56x80x96    conv2  conv(3x3x3)-IN-LReLU  conv9   conv(3x3x3)-IN-LReLU
128x28x40x48   down2  strided conv(2,2,2)   up9     transposed conv(2,2,2) - conv2
128x28x40x48   conv3  conv(3x3x3)-IN-LReLU  conv8   conv(3x3x3)-IN-LReLU
256x14x20x24   down3  strided conv(2,2,2)   up8     transposed conv(2,2,2) - conv3
256x14x20x24   conv4  conv(3x3x3)-IN-LReLU  conv7   conv(3x3x3)-IN-LReLU
320x7x10x12    down4  strided conv(2,2,2)   up7     transposed conv(2,2,2) - conv4
320x7x10x12    conv5  conv(3x3x3)-IN-LReLU  conv6   conv(3x3x3)-IN-LReLU
320x7x5x6      down5  strided conv(1,2,2)   up6     transposed conv(1,2,2) - conv5

Note that the general encoder (G) and the target encoder (T) share the same architecture, shown in the Encoder (G / T) column.
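
The feature sizes in the table can be checked with a short shape trace. The following is a minimal Python sketch, not the authors' implementation. It assumes that each conv block uses padding that preserves the spatial size and that each strided convolution divides every axis exactly by its stride. The names `ENCODER_STAGES` and `trace_encoder_shapes` are hypothetical and introduced only for illustration; the channel counts and strides are read from the table.

```python
from typing import List, Tuple

Shape = Tuple[int, int, int, int]  # (channels, depth, height, width)

# Encoder stages read from Table I: (name, output channels, stride per axis).
# A stride of (1, 1, 1) marks a conv block, which is ASSUMED to keep the
# spatial size (size-preserving padding is not stated in the table).
ENCODER_STAGES = [
    ("conv1", 32, (1, 1, 1)),
    ("down1", 64, (1, 2, 2)),
    ("conv2", 64, (1, 1, 1)),
    ("down2", 128, (2, 2, 2)),
    ("conv3", 128, (1, 1, 1)),
    ("down3", 256, (2, 2, 2)),
    ("conv4", 256, (1, 1, 1)),
    ("down4", 320, (2, 2, 2)),
    ("conv5", 320, (1, 1, 1)),
    ("down5", 320, (1, 2, 2)),
]


def trace_encoder_shapes(input_shape: Shape) -> List[Tuple[str, Shape]]:
    """Return (stage name, feature size) pairs for the encoder path."""
    _, depth, height, width = input_shape
    shapes = [("input", input_shape)]
    for name, channels, (sd, sh, sw) in ENCODER_STAGES:
        # ASSUMPTION: a strided conv divides each axis exactly by its stride.
        if depth % sd or height % sh or width % sw:
            raise ValueError(f"{name}: size not divisible by stride")
        depth, height, width = depth // sd, height // sh, width // sw
        shapes.append((name, (channels, depth, height, width)))
    return shapes


def format_shape(shape: Shape) -> str:
    """Format a shape the way the table does, e.g. 32x56x160x192."""
    return "x".join(str(v) for v in shape)
```

Calling `trace_encoder_shapes((1, 56, 160, 192))` and passing each result through `format_shape` yields the entries of the feature size column in order. Under the same assumptions, the decoder path would mirror these sizes in reverse.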