TABLE II. Details of 2D U-Net Architecture Used in Our Experiments.
Feature size | Encoder (G / T) | Operation | Decoder (D) | Operation |
---|---|---|---|---|
1x448x384 | input | | output | conv(1x1)-sigmoid |
32x448x384 | conv1 | conv(3x3)-IN-LReLU | conv12 | conv(3x3)-IN-LReLU |
64x224x192 | down1 | strided conv(2,2) | up12 | transposed conv(2,2) - conv1 |
64x224x192 | conv2 | conv(3x3)-IN-LReLU | conv11 | conv(3x3)-IN-LReLU |
128x112x96 | down2 | strided conv(2,2) | up11 | transposed conv(2,2) - conv2 |
128x112x96 | conv3 | conv(3x3)-IN-LReLU | conv10 | conv(3x3)-IN-LReLU |
256x56x48 | down3 | strided conv(2,2) | up10 | transposed conv(2,2) - conv3 |
256x56x48 | conv4 | conv(3x3)-IN-LReLU | conv9 | conv(3x3)-IN-LReLU |
480x28x24 | down4 | strided conv(2,2) | up9 | transposed conv(2,2) - conv4 |
480x28x24 | conv5 | conv(3x3)-IN-LReLU | conv8 | conv(3x3)-IN-LReLU |
480x14x12 | down5 | strided conv(2,2) | up8 | transposed conv(2,2) - conv5 |
480x14x12 | conv6 | conv(3x3)-IN-LReLU | conv7 | conv(3x3)-IN-LReLU |
480x7x6 | down6 | strided conv(2,2) | up7 | transposed conv(2,2) - conv6 |
Note that the general encoder (G) and the target encoder (T) share the same architecture, which is given in the encoder (left) columns.
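
As a quick consistency check on the feature sizes listed in Table II, the encoder path can be traced with a few lines of plain Python. This is only a sketch under stated assumptions, not the authors' implementation: the function name `encoder_shapes` and the constant `DOWN_CHANNELS` are hypothetical, each conv(3x3) is assumed to be padded so that height and width are preserved, and each strided conv(2,2) is assumed to halve both spatial dimensions. Both assumptions agree with the sizes in the table.

```python
# Channel widths after each down-sampling step, read off Table II (capped at 480).
DOWN_CHANNELS = [64, 128, 256, 480, 480, 480]


def encoder_shapes(in_shape=(1, 448, 384), base_channels=32):
    """Return a list of (layer name, (C, H, W)) pairs for the encoder path."""
    _, h, w = in_shape
    shapes = [("input", in_shape)]

    # conv1: assumed padded 3x3 convolution, so H and W are unchanged.
    c = base_channels
    shapes.append(("conv1", (c, h, w)))

    for i, c_out in enumerate(DOWN_CHANNELS, start=1):
        # down{i}: strided convolution, assumed to halve H and W.
        h, w = h // 2, w // 2
        c = c_out
        shapes.append((f"down{i}", (c, h, w)))
        # conv{i+1} keeps the shape; the table lists no encoder conv after down6.
        if i < len(DOWN_CHANNELS):
            shapes.append((f"conv{i + 1}", (c, h, w)))
    return shapes


if __name__ == "__main__":
    for name, (c, h, w) in encoder_shapes():
        print(f"{name}: {c}x{h}x{w}")
```

Running the script prints one line per encoder layer. Under the assumptions above, the final entry is down6 at 480x7x6, which matches the last row of the table.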