Skip to main content
. 2024 May 23;10(11):e31730. doi: 10.1016/j.heliyon.2024.e31730

Fig. 2.

Fig. 2

Backbone Structure. The input image (a) is processed through a series of convolutional layers (b) based on the initial layers of VGG16, followed by max-pooling and bilinear up sampling. This sequence produces a high-resolution feature map (c) that describes the image content.