Underwater image enhancement via multiscale disentanglement strategy

Jiaquan Yan; Hao Hu; Yijian Wang; Muhammad Wasim Nawaz; Naveed Ur Rehman Junejo; Ente Guo; Huibin Feng

doi:10.1038/s41598-025-89109-7

. 2025 Feb 19;15:6076. doi: 10.1038/s41598-025-89109-7

Underwater image enhancement via multiscale disentanglement strategy

Jiaquan Yan ^1,^#, Hao Hu ^1,^#, Yijian Wang ^2,^#, Muhammad Wasim Nawaz ³, Naveed Ur Rehman Junejo ^3,⁴, Ente Guo ^1,^✉, Huibin Feng ¹

PMCID: PMC11839921 PMID: 39971756

Abstract

Underwater images suffer from color casts, low illumination, and blurred details caused by light absorption and scattering in water. Existing data-driven methods often overlook the scene characteristics of underwater imaging, limiting their expressive power. To address the above issues, we propose a Multiscale Disentanglement Network (MD-Net) for Underwater Image Enhancement (UIE), which mainly consists of scene radiance disentanglement (SRD) and transmission map disentanglement (TMD) modules. Specifically, MD-Net first disentangles original images into three physical parameters which are scene radiance (clear image), transmission map, and global background light. The proposed network then reconstructs these physical parameters into underwater images. Furthermore, MD-Net introduces class adversarial learning between the original and reconstructed images to supervise the disentanglement accuracy of the network. Moreover, we design a multi-level fusion module (MFM) and dual-layer weight estimation unit (DWEU) for color cast adjustment and visibility enhancement. Finally, we conduct extensive qualitative and quantitative experiments on three benchmark datasets, which demonstrate that our approach outperforms other traditional and state-of-the-art methods. Our code and results are available at: https://github.com/WYJGR/MD-Net.

Keywords: Underwater image enhancement, Disentanglement strategy, Multiscale feature fusion, Underwater optical imaging

Subject terms: Electrical and electronic engineering, Computer science

Introduction

Due to the complex physical characteristics of underwater environments, including light absorption and scattering, underwater images are often plagued by color distortion and reduced visibility^1,2. Moreover, the degradation of underwater images significantly impairs various vision tasks, such as depth estimation³, scene understanding⁴, and object detection underwater⁵. To solve the above problems, some deep learning methods have been applied to UIE in recent years. For example, Li et al. proposed a weakly supervised color transfer method⁶ to correct color distortion based on CycleGAN⁷ network structure. Li et al. designed a Water-Net which utilizes a straightforward multi-scale convolutional network⁸. However, these underwater image enhancement models typically employ standard deep network structures designed for solving general purpose image enhancement tasks, overlooking the distinct characteristics specific to underwater imaging. Subsequently, methods combining underwater physical imaging and deep learning have been developed to address the limitations mentioned above. Fu et al. proposed an UnSupervised Underwater Image Restoration method (USUIR) to achieve UIE by estimating the physical parameters and the homology between the original underwater image and the re-degraded image⁹. While these techniques demonstrate significant enhancement improvements, the precise estimation of underwater imaging parameters remains a considerable challenge for current deep learning approaches grounded in physical models.

To make up for the shortcomings of the above methods, and at the same time absorb the idea of combining physical imaging, we develop a Multiscale Disentanglement Network (MD-Net). MD-Net mainly consists of Transmission Map Disentanglement (TMD) and Scene Radiance Disentanglement (SRD) block. TMD and SRD are used to obtain transmission map and scene radiance (clear images), respectively. These parameters (including global background light) are then coupled through the underwater imaging model to form the original image, with self-supervised constraints applied to ensure the precision of each parameter to approximate the ground truth. In SRD and TMD, we have developed a Multi-level Fusion Module (MFM) and Dual-layer Weight Estimation Unit (DWEU) based on pixel and channel weights. Experimental results reveal that our MD-Net produces more realistic underwater colors and clearer visibility and shows favorable results in both trainable parameters (Params) and running times (Runtimes). The main contributions of this paper can be summarized as follows:

We propose a Multiscale Disentanglement Network (MD-Net) for UIE, which utilizes both shallow spatial features and deep semantic feature scales to improve the disentanglement accuracy of underwater scenarios.
To obtain richer shallow spatial features and deep semantic feature scales, we design a Multi-level Fusion Module (MFM) and Dual-layer Weight Estimation Unit (DWEU) based on pixel and channel weights.
Extensive experiments have demonstrated the superiority of the proposed method. Specifically, our method achieves promising results in quantitative experiments on three real underwater image datasets.

Related works

Underwater imaging model

Based on the model of Jaffe¹⁰ and McGlamery¹¹, As shown in Fig. 1a, the imaging process is described as a combination of three components: the direct component (the light reflected from an object that has not been scattered), the backward scattering component (the light reflected from floating particles), and the forward scattering component (the light reflected from an object scattered at small angles). Nevertheless, the direct component is usually neglected, enabling the imaging model to be simplified as follows¹²:

where I(x) represents the degraded underwater image, J(x) denotes the scene radiance (clear image), t(x) signifies the transmission map of I(x), B stands for the global background light, x refers to the pixel coordinate and Inline graphic indicates a color channel. The forward scattering, , is primarily responsible for the blur and fog effects, whereas the backward scattering, , leads to contrast degradation and color distortion, .

Illustrations of (a) the underwater imaging model and (b) the varying degrees of light color attenuation in water.

In addition, as shown in Fig. 1b, different wavelengths of light are absorbed differently in water, resulting in a significant degradation of underwater image quality with increasing depth and distance from the shore. The energy of the light is absorbed as it penetrates the water. When the depth reaches 4-5 meters, red light, having the longest wavelength, disappears first. The green wavelength is completely absorbed at a depth of 30 meters. Blue wavelengths travel farther due to their higher frequencies, typically reaching more than 50 meters below the surface. This explains why underwater images often appear in blue-green hue.

Model-free UIE methods

In the early stages, researchers focused on adjusting pixel values to produce visually pleasing underwater images. Zhang et al. proposed an iterative thresholding approach based on dual histograms for Attenuated color channel Correction and Detail-preserving Contrast enhancement, respectively¹³. Zhang et al. proposed a locally adaptive color correction method built on the minimum color loss principle and the maximum attenuation map-guided fusion strategy, which reduces the color loss of the color-corrected image¹⁴. Kang et al. developed a general Structural Patch Decomposition and Fusion method, which merges two complementary preprocessed images in perceptually and conceptually independent image space¹⁵. Zhuang et al. proposed a Retinex variational model based on Hyper-Laplacian Reflectance Priors to enhance underwater images¹⁶. Although these model-free methods improve visual quality marginally, they overlook the underwater imaging mechanism, making it difficult to achieve better consistency between subjective and objective results, which may lead to over-enhancement or insufficient enhancement.

Model-based UIE methods

The methods based on physical models estimate potential model parameters using various priors, then restore a clear underwater image by underwater imaging model. Drews et al. proposed an Underwater Dark Channel Prior (UDCP) to estimate underwater transmission and achieved significant improvements using the blue and green channels, which is an adaptation of the DCP method¹⁷. Peng et al. proposed a depth estimation method for underwater scenes based on Image Blurriness and Light Absorption (IBLA), which can be used in the Image Formation Model (IFM) to restore and enhance underwater images¹⁸. Peng et al. developed a Generalized Dark channel Prior (GDCP), and utilized it to estimate underwater scene transmission based on depth-dependent color variation¹⁹. Chiang et al. integrated DCP with wavelength-dependent compensation and image dehazing techniques to mitigate the effects of haze and blur²⁰. These model-based methods tend to be either resource-intensive or highly sensitive to prior assumptions and information. Furthermore, the accurate estimation of complex parameters in underwater imaging poses significant challenges for current physics-based techniques.

Data-driven UIE methods

In recent years, data-driven methods have become a significant research direction in UIE. These approaches rely on vast amounts of data for model training, with the aim of better understanding and restoring underwater images within complex environments. Data-driven methods in UIE primarily fall into two major categories: Generative Adversarial Networks (GANs) and Convolutional Neural Networks (CNNs). Islam et al. presented a conditional generative adversarial network-based model for real-time UIE²¹. Espinosa et al. proposed an architecture built from a minimal encoder-decoder structure to address underwater image degradations while maintaining efficiency²². Li et al. developed a CNN structure that uses a connectionist approach to learn mapping coefficients for color correction and channel-level refinement²³. Yang et al. pioneered a conditional generative adversarial network featuring dual discriminators²⁴. Guo et al. proposed a novel MultiScale Dense Block without constructing the underwater degeneration model and image prior, which effectively combines residual learning, dense concatenation, and multiscale²⁵. Wang et al. proposed a UIE method utilizing both RGB and HSV color spaces, skillfully integrating the advantages of the two color spaces²⁶. Li et al. presented a multi-color space embedding network guided by medium transmission, which successfully improves the visual quality of underwater images by leveraging the advantages of multi-color space embedding²⁷. Zhang et al. proposed a Synergistic Multiscale Detail Refinement via Intrinsic Supervision for UIE, which addresses the limited scale-related features in current UIE methods²⁸. Typically, these underwater image enhancement models utilize conventional deep network architectures designed for general applications, overlooking the distinct features of underwater imaging.

Proposed method

The full architecture of the proposed Multiscale Disentanglement Network (MD-Net) is presented in Fig. 2. MD-Net consists of a Gaussian Blur (GB) block, a Transmission Map Disentanglement (TMD) block, a Scene Radiance Disentanglement (SRD) block and a Jaffe-McGlamery model. The GB, TMD and SRD blocks disentangle raw images into global background light, transmission maps and scene radiance (clear images), respectively. The Jaffe-McGlamery model reconstructs the original image using the three parameters described above. Subsequently, we introduce adversarial reconstruction loss ( Inline graphic ) that capitalizes on the feature contrast between the original and reconstructed images. This loss function enables a class adversarial learning process, which enhances the image’s disentanglement accuracy and structural consistency. Since the transmission map and scene radiance have the same dependence on pixel values and contrast of the original image, MD-Net adopts the same network structure for TMD and SRD. MD-Net jointly estimates the transmission map and scene radiance map by sharing a network architecture, thereby achieving superior image enhancement results while simultaneously reducing the number of parameters in the network model. This is illustrated in Fig. 3. The steps to disentangle raw images into transmission maps and scene radiance (clear images) are as follows: (1) Multi-level Fusion Module (MFM) employs varying receptive fields ( Inline graphic , , ) to extract shallow spatial features and multi-layer fusion features , (2) the are fed into both the Global Average Pooling layer (GAP) and the Dual-layer Weight Estimation Unit (DWEU) group to obtain the downsampling pooling feature and the dual-level weight feature , (3) the Inline graphic is obtained by multiplying and , (4) after the multi-scale features are obtained by adding and , a convolution layer is adapted to adjust the number of channels for the final output of the transmission maps and scene radiance.

The overall network architecture of MD-Net that follows a decompose-reconstruct process. The branches from top to bottom are Gaussian blur, TMD, and SRD, used for acquiring global background light, transmission maps, and scene radiance respectively. The intermediate sections of TMD and SRD consist primarily of MFM, DWEU, GAP, and convolutional layers. Besides, the red line is a skip connection that carries information from the shallow layer to a deep layer. The far right consists of the Jaffe-McGlamery model (the underwater imaging model), aiming to achieve image detail restoration capabilities beyond the reference image by reconstructing the acquired parameters into underwater images, supervising the accuracy of the three parameters’ acquisition..

Visual results of TMD and SRD with different structural versions, simple versions are composed of three layers of convolution and complex versions are composed of two MFMs and six DWEUs. Compared to the simple and complex versions of TMD and SRD (b,e,c,f), the current version’s TMD and SRD (**a,d**) have already achieved excellent parameter estimation..

In what follows, we detail the key components of MD-Net, including MFM, DWEU and loss function.

Multi-level fusion module

Different sizes of receptive fields have different perceptual ranges for images. For example, the small receptive field can capture local detailed features but may need to include broader contextual information. The large receptive field can capture global information but may overlook some detailed features. To combine their properties for UIE, we design an MFM shown in Fig. 4 to capture multi-layer fusion features Inline graphic that integrate global information and shallow spatial features for multiscale fusion.

MFM mainly consists of convolutional layers with varying kernel sizes and two convolutional blocks. The convolutional kernels of the three branches increase sequentially from top to bottom, with the red line at the bottom indicating skip connections. Convblock-1 comprises two convolutional layers a ReLU function, and a normalization operation. Convblock-2 comprises two convolutional layers, a ReLU function, and a sigmoid function..

Specifically, we first extract shallow spatial features through convolution layers with different receptive field sizes ( Inline graphic , , ), which capture diverse levels of spatial information features (, , ) from the input images. As shown in Fig. 5, emphasizes the local details of the fish, while and contain the fine global structure. However, as observed in the first row of Fig. 5, noise artifacts still persist in the feature maps after one layer of convolution. Therefore, we propose the Convblock-1 Inline graphic to further capture detailed information from shallow layers:

where Inline graphic refers to the collective term for shallow spatial features, each includes two convolutional layers of the original kernel size, a ReLU activation and a normalization layer. Notably, convolution blocks perform standard convolution operations within each smaller block, which effectively reduces the computational cost associated with convolution. Compared with Fig. 5b Inline graphic and (c) , Fig. 5e and (f) show increased texture details and the removal of artifacts.

Visualization of feature maps to and to . Higher brightness denotes greater weight. The brightness value is ranged in [0,1].

Subsequently, we perform tensor concatenation and feature fusion to acquire dynamic weights Inline graphic :

where cat denotes a channel-wise concatenation. Convblock-2 Inline graphic generates dynamical weight for feature fusion, which includes two convolution layers, a ReLU activation and a Sigmoid function. dynamically weights different input features according to the .

To allocate weights to each branch, we perform tensor splitting on Inline graphic to obtain , , :

As illustrated in Fig. 6, Inline graphic , , obtain better global structure and finer local details than , and in Fig. 5, demonstrating that the dynamic weighting of , , and achieves superior results compared to direct feature fusion. The multi-layer fusion feature obtained by fusing the above three fully integrates the information of image features at various scales of receptive fields.

Visualization of feature maps , to , to and . Higher brightness denotes greater weight. The brightness value is ranged in [0,1].

Dual-layer weight estimation unit

To learn complex relationships between illumination features and color information, we sequentially apply DWEU, which integrates channel and pixel weight estimation as shown in Fig. 7, to capture the dual-level weight feature Inline graphic in SRD and TMD blocks.

DWEU takes the output of the MFM as input, passes it through a convolutional layer, and subsequently directs the features separately to the pixel estimation unit positioned above and the channel estimation unit located below. Next, the weighted sum proceeds through an additional convolutional layer combined with the unprocessed input features. Eventually, the assigned weight features are input to the subsequent module..

Specifically, we first use a convolutional layer with a kernel size of Inline graphic to extract features from . Then, we input it into the channel estimation unit and pixel estimation unit respectively to obtain the pixel weights and the channel weights .

For obtaining Inline graphic :

where Inline graphic are pr-extracted features from convolution, conv.r.i includes a convolutional layer with the kernel size of for processing image features followed by a ReLU activation for nonlinear transformation and a normalization layer for standardizing feature maps, conv.s includes a convolutional layer with the kernel size of Inline graphic for further processing the feature maps followed by a Sigmoid function to compress the output values into the [0, 1] range.

For obtaining Inline graphic :

where gap represents the global average pooling layer used to extract global features, conv.r is used to extract and nonlinearly transform channel features.

Next, the estimated weights are used for pixel-wise weighted allocation and channel-wise weighted allocation of Inline graphic to obtain and :

where Inline graphic represents the pixel-weighted feature with a size of , and represents the channel-weighted features with a size of . We pass the sum of and through a convolutional layer to obtain the final dual-level weight features :

where conv is used to balance illumination features and color information, the size of Inline graphic is . Fig. 8 displays the gray visual comparison of , and . As shown in Fig. 8a, enhances the contrast between pixels, but the contour details of the image are not clear. In Fig. 8b, exhibits better smoothness. In contrast, Fig. 8c shows that integrates the advantages of pixel and channel weight estimation, making the image not only more prominent in details but also enhancing the overall smoothness.

Visualization of feature maps , and . Higher brightness denotes greater weight. The brightness value is ranged in [0,1].

By performing operations mentioned above, DWEU completes one iteration. Fig. 9 shows the visual comparison of DWEU after 4 iterations. As shown, compared to Fig. 6h, Fig. 9a exhibits better color reproduction and low light enhancement effects, confirming that the construction of a dual-level weight estimation unit can significantly improve visibility and color restoration. It is noteworthy that four iterations are sufficient to produce satisfactory underwater colors and image visibility.

Visual comparison of different DWEU iterations.

Loss function

The overall loss function of MD-Net consists of three components: adversarial reconstruction loss ( Inline graphic ), color correction loss () aimed at preventing oversaturated colors, and mean square error loss () to further improve the quality of target image. The fundamental concept of adversarial learning involves optimizing the model within a “game” framework. Building on this principle, we introduce a feature-based Inline graphic to enable a pseudo-adversarial process. This feature-level contrastive loss replaces the conventional generative adversarial game, allowing the network to focus on pixel-level discrepancies and the consistency of high-level image features. The is formulated as the following mean squared error:

where x represents the original underwater image, and Inline graphic represents the reconstructed image using the underwater imaging model presented in Eq. (1). To limit color correction oversaturation, we introduced color correction loss (). first sums and averages each color channel c in the color channel set , then calculates the square of the difference between this mean and 0.5 (assuming the scene’s average reflectance is gray, known as the gray world assumption) using the Inline graphic norm, to correct color deviations from the gray world assumption. Its formula is given as:

where Inline graphic represents the mean of the color channel c. is defined as:

where Inline graphic represents the scene radiance, is the ground truth, and n is the number of pixels. This function computes the average of the squared differences between the predicted values and ground truth.

The overall loss function is defined as follows:

Experiments

Experiment settings

We train MD-Net using PyTorch 2.1 framework, running on an Intel(R) i9-12900K CPU, 64GB of RAM, and an NVIDIA RTX 4090 GPU. We set the learning rate to Inline graphic and use the Adam optimizer for network optimization. The model is trained for 50 epochs with a batch size of 4, and all input images are resized to pixels. We compared the proposed method with seven UIE methods including four non-data-driven methods (DCP²⁹, UDCP¹⁷, ACDC¹³, UIEF³⁰) and data-driven methods (PUIE-Net³¹, P2CNet³², TCTL-Net³³). For a fair comparison, we employ the source code provided by the authors, retrain each method on our training set and produce the best enhancement results. Our performance evaluation utilizes real UIE benchmark datasets containing LSUI³⁴ and UIEB³⁵ datasets. Each dataset is divided into a training dataset and a testing dataset. For training, we use 3,794 images from the LSUI³⁴ dataset and 800 images from the UIEB³⁵ dataset. For testing, we use the rest 485 images from the LSUI³⁴ dataset (Test-485) and the rest 90 images from the UIEB³⁵ dataset (Test-90). It is noteworthy that the current underwater image datasets have certain limitations. As depicted in Fig. 10, the reference images in panels (a), (c), and (d) exhibit substantial color distortion, whereas the detailed textures in the reference images of panels (b), (e), and (f) appear less refined compared to the results produced by our method. We employ four metrics to measure the performance of different methods on Test-485, and Test-90. These metrics include full-reference and non-reference metrics which are Peak Signal-to-Noise Ratio (PSNR)³⁶, Structural Similarity Index Measure (SSIM)³⁷, Underwater Color Image Quality Evaluation (UCIQE)³⁸ and Underwater Image Quality Measure (UIQM)³⁹. Higher PSNR³⁶ and SSIM³⁷ scores indicate that the enhanced images more closely resemble the reference images in both content and structure. Higher UCIQE³⁸ and UIQM³⁹ values indicate a better balance among colorfulness, sharpness, and contrast.

Visual comparison of reference images and our proposed MD-Net for **test-485**³⁴ and Test-90³⁵. The second row is the enlarged pictures of the red boxes in the first row..

Qualitative evaluation

We conduct visual comparisons among different methods using Test-760, Test-485 and Test-90, as shown in Figs. 11, 12, 13, and 14. As shown in Figs. 11b and 12b, the overall illumination of the image is improved under the DCP²⁹ method, but some areas exhibit unnatural color balance. UDCP¹⁷ enhances the color vividness, but there are issues with color oversaturation in Figs. 12c and 13c. ACDC¹³ achieved good results in underwater image visibility, but it also increased color bias as shown in Figs. 12d and 13d. PUIE-Net³¹ effectively addresses UIE by breaking it down into distribution estimation and a consensus process, resulting in relatively satisfactory visualization but its performance to deblur images needs improvement, as evidenced in Figs. 13e and 14e. As depicted in Figs. 11f and 14f, P2CNet³² restores image details, yet it does not remove color casts and introduces severe artificial colors. UIEF³⁰ adjusts the color casts and enhances brightness realistically but introduces purple artifacts and unnatural sharpness in Figs. 11g and 14g. As shown in Figs. 11h and 13h, while TCTL-Net³³ achieves color correction, the problems of darkness and gray artifacts are common in the enhanced images. In contrast, the proposed MD-Net effectively corrects color casts, leveraging the designed SRD. It achieves favorable results by enhancing low contrast images under poor lighting conditions, owing to the fusion of the disentanglement strategy and the Jaffe-McGlamery model¹².

The visual comparison of various methods on bluish underwater images from **Test-485**³⁴ and **Test-90**³⁵. The compared methods are DCP²⁹, UDCP¹⁷, ACDC¹³, PUIE-Net³¹, P2CNet³², UIEF³⁰, TCTL-Net³³, and the proposed MD-Net, respectively..

The visual comparison of various methods on greenish underwater images from **Test-485**³⁴ and **Test-90**³⁵. The compared methods are DCP²⁹, UDCP¹⁷, ACDC¹³, PUIE-Net³¹, P2CNet³², UIEF³⁰, TCTL-Net³³, and the proposed MD-Net, respectively..

The visual comparison of various methods on yellowish images from **Test-485**³⁴ and **Test-90**³⁵. The compared methods are DCP²⁹, UDCP¹⁷, ACDC¹³, PUIE-Net³¹, P2CNet³², UIEF³⁰, TCTL-Net³³, and the proposed MD-Net, respectively..

The visual comparison of various methods on low visibility images from **Test-485**³⁴ and **Test-90**³⁵. The compared methods are DCP²⁹, UDCP¹⁷, ACDC¹³, PUIE-Net³¹, P2CNet³², UIEF³⁰, TCTL-Net³³, and the proposed MD-Net, respectively..

Quantitative assessment

We evaluate the performance of various UIE approaches using PSNR³⁶, SSIM³⁷, UCIQE³⁸ and UIQM³⁹ on Test-485 and Test-90, as summarized in Table 1. For PSNR³⁶ and SSIM³⁷, the proposed MD-Net yields four best scores, which indicates that our results closely resemble the reference images in terms of both content and structure. For UCIQE³⁸, in contrast to our competitors, MD-Net achieves a best score and a second-best score, demonstrating that the proposed approach mitigates non-uniform color bias, reduces blurriness and enhances contrast. Besides, MD-Net secures two second-best UIQM³⁹ score, showcasing exceptional performance in terms of colorfulness, sharpness, and contrast enhancement.

Table 1.

The average scores for PSNR³⁶, SSIM³⁷, UCIQE³⁸, UIQM³⁹ and Mean using different methods on TEST-485 and TEST-90. The best and second-best performances are highlighted in italics and bold, respectively.

Methods	LSUI³⁴				UIEB³⁵				Mean
	PSNR	SSIM	UCIQE	UIQM	PSNR	SSIM	UCIQE	UIQM
Raw	21.3238	0.7653	0.5394	2.6601	19.1723	0.7490	0.5342	2.6986	6.0553
DCP²⁹	18.8433	0.6808	0.5560	2.0333	16.4304	0.6717	0.5541	2.1547	5.2405
UDCP¹⁷	13.8345	0.5536	0.5715	2.1041	12.4918	0.5539	0.5886	1.9429	4.0801
ACDC¹³	18.5840	0.7246	0.5431	3.3593	20.2744	0.8174	0.5527	3.3564	6.0264
PUIE-Net³¹	20.3833	0.8310	0.5427	3.1344	20.8352	0.8322	0.5461	3.2187	6.2905
P2CNet³²	12.2433	0.4465	0.5341	1.7744	12.7950	0.4572	0.5250	1.9277	3.8379
UIEF³⁰	18.6483	0.8083	0.5629	3.2269	20.4065	0.8729	0.5743	3.3713	6.0589
TCTL-Net³³	15.1438	0.6586	0.5479	2.0717	15.5655	0.7213	0.5475	2.1192	4.6719
MD-Net	23.2144	0.8365	0.5767	3.2426	24.0474	0.8919	0.5869	3.3592	7.0945

Open in a new tab

Ablation study

We systematically analyze the impact of various components within MD-Net through comprehensive ablation studies on bluish, greenish, yellowish and low-visibility degeneration scenes, including 3 settings: (1) MD-Net without SRD and Disentangle strategy (Our-settingI), (2) MD-Net without Disentangle strategy (Our-settingII), (3) full method (Our). Figure 15 presents the visual results. Our-settingI underperforms in both mitigating color casts and improving visibility. Our-settingII improves color and increases visibility but fails to fully restore color casts. Our full method not only attains true-to-life colors but also enhances sharpness and visibility. Furthermore, we show a quantitative comparison in Table 2. As depicted, the PSNR³⁶ and SSIM³⁷ index results show a progressive improvement from Our-setting I to Our-setting II, as the SRD module is incorporated. Compared with Our-settingII, our comprehensive approach achieves the highest PSNR³⁶ and SSIM³⁷ scores, underscoring the superior performance of our MD-Net in restoring underwater content and structure.

Table 2.

The average PSNR³⁶ and SSIM³⁷ scores from the ablation study on TEST-90. The best and second-best performances are highlighted in italics and bold, respectively.

Methods	PSNR	SSIM
Our-settingI	12.4288	0.3290
Our-settingII	21.7992	0.7823
Our	22.2697	0.8161

Open in a new tab

Complexity and runtime

We compare the FLOPs (G) and trainable parameters (M) of diverse UIE methodologies to gauge their computational efficiency, as illustrated in Table 3. The implementations of DCP²⁹, UDCP¹⁷ and UIEF³⁰ are executed within the Matlab environment, and as such, they do not have associated FLOPs and parameters. While MD-Net did not perform well in FLOPs detection on image sizes 256 Inline graphic 256, 512 512 and 1024 1024 compared to other competitors, its architecture parameters has resulted in outstanding performance. Table 4 illustrates that MD-Net achieves respectable processing speeds across various image dimensions and notably achieves the second-fastest speed at a resolution of 256 Inline graphic 256 and 512 512, highlighting its ability to efficiently handle diverse image sizes.

Table 3.

Comparison of complexity across various methods. The best and second-best performances are highlighted in italics and bold, respectively.

Methods	FLOPs(G)			Params(M)
Methods	256256	512512	10241024	Params(M)
PUIE-Net³¹	120.3755	481.5019	1926.0080	1.4010
P2CNet³²	20.7184	82.8738	331.4952	2.0676
TCTL-Net³³	56.6184	226.4736	905.8944	99.7172
MD-Net	134.2607	537.0429	2148.1715	1.3778

Open in a new tab

Table 4.

The average runtime of various methods over one hundred trials on different image sizes. The best and second-best performances are highlighted in italics and bold, respectively.

Methods	256256	512512	10241024
DCP²⁹	1.1190	4,6559	19.0440
UDCP¹⁷	1.0511	4.3603	18.9691
ACDC¹³	0.7574	2.6709	11.4419
PUIE-Net³¹	0.0044	0.0124	0.0452
P2CNet³²	0.3129	0.3184	0.3715
UIEF³⁰	0.5818	1.4439	6.4266
TCTL-Net³³	0.3527	1.3108	5.6432
MD-Net	0.0157	0.0645	5.4969

Open in a new tab

Application tests

We show the effectiveness of our MD-Net through various underwater application tests in Test-90³⁵, including depth estimation, edge detection, keypoint detection, saliency detection, and image segmentation. For underwater depth estimation, we employ the non-local prior⁴⁰, utilize the Canny operator⁴¹ for underwater edge detection, employ the SIFT keypoint detection⁴² for underwater keypoint detection, adopt BASNet⁴³ for underwater saliency detection, and apply a superpixel-based clustering algorithm⁴⁴ for underwater image segmentation. All results are illustrated in Fig. 16. Although the generated depth maps and edge detection results did not rank first among competitors, they still demonstrated commendable performance. Additionally, MD-Net detected more key points, and captured sharper structures. Moreover, MD-Net’s segmentation results demonstrated higher consistency and accuracy, while its saliency maps contained more distinct objects and sharper boundaries. These findings suggest that MD-Net effectively can effectively enhance both underwater image segmentation and saliency detection.

Conclusion

This paper proposes MD-Net, a multi-scale disentanglement network for underwater image enhancement. MD-Net embeds a multi-scale feature fusion CNN and an underwater imaging model into the architecture of underwater image inverse-reconstruction through a disentanglement strategy. Specifically, the disentangled physical parameters (global background light, transmission map and scene radiance) are fed into the underwater imaging model to reconstruct underwater images. Furthermore, the designed multiscale feature fusion CNN integrates shallow spatial features and deep semantic features by focusing on multi-level receptive field features, and allocation of pixel and channel weights. We conducted extensive experiments on MD-Net, including qualitative and quantitative comparative experiments, as well as a series of computer vision application experiments. Experimental results demonstrate that the proposed method outperforms existing state-of-the-art methods in terms of color cast correction and visibility enhancement on different datasets, providing significant performance enhancement in multiple computer vision tasks.

Acknowledgements

This work is supported by Research Project of Fashu Foundation (MFK23006), 2024 University-level Special Project of Minjiang University (K-MJKJ24006), Guiding Project of Fujian Provincial Department of Science and Technology (2021H0054), and Minjiang University Scientific Research Promotion Fund (MJY22025).

Author contributions

Conceived and designed the experiments: J.Y., E.G, and H.F. Performed the experiments: J.Y., H.H., and Y.W. Analyzed the data: E.G. and J.Y. Wrote and reviewed the paper: J.Y., H.H., Y.W, M.W.N., N.U.R.J., E.G., and H.F.

Data availability

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

Declarations

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Jiaquan Yan, Hao Hu and Yijian Wang contributed equally to this work.

References

1.Wang, L. et al. Underwater image restoration based on dual information modulation network. Sci. Rep.14, 5416 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Rowghanian, V. Underwater image restoration with Haar wavelet transform and ensemble of triple correction algorithms using bootstrap aggregation and random forests. Sci. Rep.12, 8952 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Zhou, J., Yang, T., Chu, W. & Zhang, W. Underwater image restoration via backscatter pixel prior and color compensation. Eng. Appl. Artif. Intell.111 (2022).
4.Guo, C. et al. Underwater ranker: Learn which is better and how to be better. Proc. AAAI Conf. Artif. Intell. (AAAI)37, 702–709 (2023). [Google Scholar]
5.Sarkar, P., De, S., Gurung, S. & Dey, P. Uice-mirnet guided image enhancement for underwater object detection. Sci. Rep.14, 22448 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Li, C., Guo, J. & Guo, C. Emerging from water: Underwater image color correction based on weakly supervised color transfer. IEEE Signal Process. Lett.25, 323–327 (2018). [Google Scholar]
7.Zhu, J.-Y., Park, T., Isola, P. & Efros, A. A. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). 81–88 (2017).
8.Li, C., Guo, J. & Guo, C. An underwater image enhancement benchmark dataset and beyond. IEEE Trans. Image Process.29, 4376–4389 (2019). [DOI] [PubMed] [Google Scholar]
9.Fu, Z. et al. Unsupervised underwater image restoration: From a homology perspective. Proc. AAAI Conf. Artif. Intell. (AAAI)36, 643–651 (2022). [Google Scholar]
10.Jaffe, J. S. Computer modeling and the design of optimal underwater imaging systems. IEEE J. Ocean. Eng.15, 101–111 (1990). [Google Scholar]
11.McGlamery, B. A computer model for underwater camera systems. Ocean Opt. VI208, 221–231 (SPIE, 1980).
12.Carlevaris-Bianco, N., Mohan, A. & Eustice, R. M. Initial results in underwater single image dehazing. In Oceans 2010 MTS/IEEE Seattle. 1–8 (2010).
13.Zhang, W., Wang, Y. & Li, C. Underwater image enhancement by attenuated color channel correction and detail preserved contrast enhancement. IEEE J. Ocean. Eng.47, 718–735 (2022). [Google Scholar]
14.Zhang, W. et al. Underwater image enhancement via minimal color loss and locally adaptive contrast enhancement. IEEE Trans. Image Process.31, 3997–4010 (2022). [DOI] [PubMed] [Google Scholar]
15.Kang, Y. et al. A perception-aware decomposition and fusion framework for underwater image enhancement. IEEE Trans. Circuits Syst. Video Technol.33, 988–1002 (2023). [Google Scholar]
16.Zhuang, P., Wu, J., Porikli, F. & Li, C. Underwater image enhancement with hyper-Laplacian reflectance priors. IEEE Trans. Image Process.31, 5442–5455 (2022). [DOI] [PubMed] [Google Scholar]
17.Drews, P. J., do Nascimento, E., Moraes, F., Botelho, S. & Campos, M. Transmission estimation in underwater single images. In Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops. 825–830 (2013).
18.Peng, Y.-T. & Cosman, P. C. Underwater image restoration based on image blurriness and light absorption. IEEE Trans. Image Process.26, 1579–1594 (2017). [DOI] [PubMed] [Google Scholar]
19.Peng, Y.-T., Cao, K. & Cosman, P. C. Generalization of the dark channel prior for single image restoration. IEEE Trans. Image Process.27, 2856–2868 (2018). [DOI] [PubMed] [Google Scholar]
20.Chiang, J. Y. & Chen, Y.-C. Underwater image enhancement by wavelength compensation and dehazing. IEEE Trans. Image Process.21, 1756–1769 (2012). [DOI] [PubMed] [Google Scholar]
21.Islam, M. J., Xia, Y. & Sattar, J. Fast underwater image enhancement for improved visual perception. IEEE Robot. Autom. Lett.5, 3227–3234 (2020). [Google Scholar]
22.Espinosa, A. R., McIntosh, D. & Albu, A. B. An efficient approach for underwater image improvement: Deblurring, dehazing, and color correction. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW). 206–215 (2023).
23.Yang, X., Li, H. & Chen, R. Underwater image enhancement with image colorfulness measure. Signal Process.-Image Commun.95, 1382–1386 (2021). [Google Scholar]
24.Yang, M. et al. Underwater image enhancement based on conditional generative adversarial network. Signal Process.-Image Commun.81 (2020).
25.Guo, Y., Li, H. & Zhuang, P. Underwater image enhancement using a multiscale dense generative adversarial network. IEEE J. Ocean. Eng.45, 862–870 (2020). [Google Scholar]
26.Wang, Y., Guo, J., Gao, H. & Yue, H. Uiec 2-net: Cnn-based underwater image enhancement using two color space. Signal Process. Image Commun.96, 116250 (2021). [Google Scholar]
27.Li, C. et al. Underwater image enhancement via medium transmission-guided multi-color space embedding. IEEE Trans. Image Process.30, 4985–5000 (2021). [DOI] [PubMed] [Google Scholar]
28.Zhang, D., Zhou, J., Guo, C., Zhang, W. & Li, C. Synergistic multiscale detail refinement via intrinsic supervision for underwater image enhancement. Proc. AAAI Conf. Artif. Intell.38, 7033–7041 (2024). [Google Scholar]
29.He, K., Sun, J. & Tang, X. Single image haze removal using dark channel prior. IEEE Trans. Pattern Anal. Mach. Intell.33, 2341–2353 (2010). [DOI] [PubMed] [Google Scholar]
30.An, S., Xu, L., Deng, Z. & Zhang, H. Hfm: A hybrid fusion method for underwater image enhancement. Eng. Appl. Artif. Intell.127 (2024).
31.Fu, Z., Wang, W., Huang, Y., Ding, X. & Ma, K.-K. Uncertainty inspired underwater image enhancement. In European Conference on Computer Vision (2022).
32.Rao, Y. et al. Deep color compensation for generalized underwater image enhancement. IEEE Trans. Circuits Syst. Video Technol.34, 2577–2590 (2023). [Google Scholar]
33.Li, K. et al. Tctl-net: Template-free color transfer learning for self-attention driven underwater image enhancement. IEEE Trans. Circuits Syst. Video Technol.34, 4682–4697 (2024). [Google Scholar]
34.Peng, L., Zhu, C. & Bian, L. U-shape transformer for underwater image enhancement. IEEE Trans. Image Process.32, 3066–3079 (2023). [DOI] [PubMed] [Google Scholar]
35.Li, C. et al. An underwater image enhancement benchmark dataset and beyond. IEEE Trans. Image Process.29, 4376–4389 (2020). [DOI] [PubMed] [Google Scholar]
36.Korhonen, J. & You, J. Peak signal-to-noise ratio revisited: Is simple beautiful? In Fourth International Workshop on Quality of Multimedia Experience. 37–38 (2012).
37.Horé, A. & Ziou, D. Image quality metrics: PSNR vs. SSIM. In International Conference on Pattern Recognition. 2366–2369 (2010).
38.Yang, M. & Sowmya, A. An underwater color image quality evaluation metric. IEEE Trans. Image Process.24, 6062–6071 (2015). [DOI] [PubMed] [Google Scholar]
39.Panetta, K., Gao, C. & Agaian, S. Human-visual-system-inspired underwater image quality measures. IEEE J. Ocean. Eng.41, 541–551 (2016). [Google Scholar]
40.Berman, D., treibitz, T. & Avidan, S. Non-local image dehazing. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 1674–1682 (2016).
41.Canny, J. A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell.X8, 679–698 (1986). [PubMed] [Google Scholar]
42.Lowe, D. G. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis.60, 91–110 (2004). [Google Scholar]
43.Qin, X. et al. Basnet: Boundary-aware salient object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 7471–7481 (2019).
44.Lei, T. et al. Super pixel based fast fuzzy c-means clustering for color image segmentation. IEEE Trans. Fuzzy Syst.27, 1753–1766 (2019). [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

[CR1] 1.Wang, L. et al. Underwater image restoration based on dual information modulation network. Sci. Rep.14, 5416 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR2] 2.Rowghanian, V. Underwater image restoration with Haar wavelet transform and ensemble of triple correction algorithms using bootstrap aggregation and random forests. Sci. Rep.12, 8952 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Zhou, J., Yang, T., Chu, W. & Zhang, W. Underwater image restoration via backscatter pixel prior and color compensation. Eng. Appl. Artif. Intell.111 (2022).

[CR4] 4.Guo, C. et al. Underwater ranker: Learn which is better and how to be better. Proc. AAAI Conf. Artif. Intell. (AAAI)37, 702–709 (2023). [Google Scholar]

[CR5] 5.Sarkar, P., De, S., Gurung, S. & Dey, P. Uice-mirnet guided image enhancement for underwater object detection. Sci. Rep.14, 22448 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Li, C., Guo, J. & Guo, C. Emerging from water: Underwater image color correction based on weakly supervised color transfer. IEEE Signal Process. Lett.25, 323–327 (2018). [Google Scholar]

[CR7] 7.Zhu, J.-Y., Park, T., Isola, P. & Efros, A. A. Unpaired image-to-image translation using cycle-consistent adversarial networks. In Proceedings of the IEEE International Conference on Computer Vision (ICCV). 81–88 (2017).

[CR8] 8.Li, C., Guo, J. & Guo, C. An underwater image enhancement benchmark dataset and beyond. IEEE Trans. Image Process.29, 4376–4389 (2019). [DOI] [PubMed] [Google Scholar]

[CR9] 9.Fu, Z. et al. Unsupervised underwater image restoration: From a homology perspective. Proc. AAAI Conf. Artif. Intell. (AAAI)36, 643–651 (2022). [Google Scholar]

[CR10] 10.Jaffe, J. S. Computer modeling and the design of optimal underwater imaging systems. IEEE J. Ocean. Eng.15, 101–111 (1990). [Google Scholar]

[CR11] 11.McGlamery, B. A computer model for underwater camera systems. Ocean Opt. VI208, 221–231 (SPIE, 1980).

[CR12] 12.Carlevaris-Bianco, N., Mohan, A. & Eustice, R. M. Initial results in underwater single image dehazing. In Oceans 2010 MTS/IEEE Seattle. 1–8 (2010).

[CR13] 13.Zhang, W., Wang, Y. & Li, C. Underwater image enhancement by attenuated color channel correction and detail preserved contrast enhancement. IEEE J. Ocean. Eng.47, 718–735 (2022). [Google Scholar]

[CR14] 14.Zhang, W. et al. Underwater image enhancement via minimal color loss and locally adaptive contrast enhancement. IEEE Trans. Image Process.31, 3997–4010 (2022). [DOI] [PubMed] [Google Scholar]

[CR15] 15.Kang, Y. et al. A perception-aware decomposition and fusion framework for underwater image enhancement. IEEE Trans. Circuits Syst. Video Technol.33, 988–1002 (2023). [Google Scholar]

[CR16] 16.Zhuang, P., Wu, J., Porikli, F. & Li, C. Underwater image enhancement with hyper-Laplacian reflectance priors. IEEE Trans. Image Process.31, 5442–5455 (2022). [DOI] [PubMed] [Google Scholar]

[CR17] 17.Drews, P. J., do Nascimento, E., Moraes, F., Botelho, S. & Campos, M. Transmission estimation in underwater single images. In Proceedings of the IEEE International Conference on Computer Vision (ICCV) Workshops. 825–830 (2013).

[CR18] 18.Peng, Y.-T. & Cosman, P. C. Underwater image restoration based on image blurriness and light absorption. IEEE Trans. Image Process.26, 1579–1594 (2017). [DOI] [PubMed] [Google Scholar]

[CR19] 19.Peng, Y.-T., Cao, K. & Cosman, P. C. Generalization of the dark channel prior for single image restoration. IEEE Trans. Image Process.27, 2856–2868 (2018). [DOI] [PubMed] [Google Scholar]

[CR20] 20.Chiang, J. Y. & Chen, Y.-C. Underwater image enhancement by wavelength compensation and dehazing. IEEE Trans. Image Process.21, 1756–1769 (2012). [DOI] [PubMed] [Google Scholar]

[CR21] 21.Islam, M. J., Xia, Y. & Sattar, J. Fast underwater image enhancement for improved visual perception. IEEE Robot. Autom. Lett.5, 3227–3234 (2020). [Google Scholar]

[CR22] 22.Espinosa, A. R., McIntosh, D. & Albu, A. B. An efficient approach for underwater image improvement: Deblurring, dehazing, and color correction. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision Workshops (WACVW). 206–215 (2023).

[CR23] 23.Yang, X., Li, H. & Chen, R. Underwater image enhancement with image colorfulness measure. Signal Process.-Image Commun.95, 1382–1386 (2021). [Google Scholar]

[CR24] 24.Yang, M. et al. Underwater image enhancement based on conditional generative adversarial network. Signal Process.-Image Commun.81 (2020).

[CR25] 25.Guo, Y., Li, H. & Zhuang, P. Underwater image enhancement using a multiscale dense generative adversarial network. IEEE J. Ocean. Eng.45, 862–870 (2020). [Google Scholar]

[CR26] 26.Wang, Y., Guo, J., Gao, H. & Yue, H. Uiec 2-net: Cnn-based underwater image enhancement using two color space. Signal Process. Image Commun.96, 116250 (2021). [Google Scholar]

[CR27] 27.Li, C. et al. Underwater image enhancement via medium transmission-guided multi-color space embedding. IEEE Trans. Image Process.30, 4985–5000 (2021). [DOI] [PubMed] [Google Scholar]

[CR28] 28.Zhang, D., Zhou, J., Guo, C., Zhang, W. & Li, C. Synergistic multiscale detail refinement via intrinsic supervision for underwater image enhancement. Proc. AAAI Conf. Artif. Intell.38, 7033–7041 (2024). [Google Scholar]

[CR29] 29.He, K., Sun, J. & Tang, X. Single image haze removal using dark channel prior. IEEE Trans. Pattern Anal. Mach. Intell.33, 2341–2353 (2010). [DOI] [PubMed] [Google Scholar]

[CR30] 30.An, S., Xu, L., Deng, Z. & Zhang, H. Hfm: A hybrid fusion method for underwater image enhancement. Eng. Appl. Artif. Intell.127 (2024).

[CR31] 31.Fu, Z., Wang, W., Huang, Y., Ding, X. & Ma, K.-K. Uncertainty inspired underwater image enhancement. In European Conference on Computer Vision (2022).

[CR32] 32.Rao, Y. et al. Deep color compensation for generalized underwater image enhancement. IEEE Trans. Circuits Syst. Video Technol.34, 2577–2590 (2023). [Google Scholar]

[CR33] 33.Li, K. et al. Tctl-net: Template-free color transfer learning for self-attention driven underwater image enhancement. IEEE Trans. Circuits Syst. Video Technol.34, 4682–4697 (2024). [Google Scholar]

[CR34] 34.Peng, L., Zhu, C. & Bian, L. U-shape transformer for underwater image enhancement. IEEE Trans. Image Process.32, 3066–3079 (2023). [DOI] [PubMed] [Google Scholar]

[CR35] 35.Li, C. et al. An underwater image enhancement benchmark dataset and beyond. IEEE Trans. Image Process.29, 4376–4389 (2020). [DOI] [PubMed] [Google Scholar]

[CR36] 36.Korhonen, J. & You, J. Peak signal-to-noise ratio revisited: Is simple beautiful? In Fourth International Workshop on Quality of Multimedia Experience. 37–38 (2012).

[CR37] 37.Horé, A. & Ziou, D. Image quality metrics: PSNR vs. SSIM. In International Conference on Pattern Recognition. 2366–2369 (2010).

[CR38] 38.Yang, M. & Sowmya, A. An underwater color image quality evaluation metric. IEEE Trans. Image Process.24, 6062–6071 (2015). [DOI] [PubMed] [Google Scholar]

[CR39] 39.Panetta, K., Gao, C. & Agaian, S. Human-visual-system-inspired underwater image quality measures. IEEE J. Ocean. Eng.41, 541–551 (2016). [Google Scholar]

[CR40] 40.Berman, D., treibitz, T. & Avidan, S. Non-local image dehazing. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 1674–1682 (2016).

[CR41] 41.Canny, J. A computational approach to edge detection. IEEE Trans. Pattern Anal. Mach. Intell.X8, 679–698 (1986). [PubMed] [Google Scholar]

[CR42] 42.Lowe, D. G. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis.60, 91–110 (2004). [Google Scholar]

[CR43] 43.Qin, X. et al. Basnet: Boundary-aware salient object detection. In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). 7471–7481 (2019).

[CR44] 44.Lei, T. et al. Super pixel based fast fuzzy c-means clustering for color image segmentation. IEEE Trans. Fuzzy Syst.27, 1753–1766 (2019). [Google Scholar]

PERMALINK

Underwater image enhancement via multiscale disentanglement strategy

Jiaquan Yan

Hao Hu

Yijian Wang

Muhammad Wasim Nawaz

Naveed Ur Rehman Junejo

Ente Guo

Huibin Feng

Abstract

Introduction

Related works

Underwater imaging model

Figure 1.

Model-free UIE methods

Model-based UIE methods

Data-driven UIE methods

Proposed method

Figure 2.

Figure 3.

Multi-level fusion module

Figure 4.

Figure 5.

Figure 6.

Dual-layer weight estimation unit

Figure 7.

Figure 8.

Figure 9.

Loss function

Experiments

Experiment settings

Figure 10.

Qualitative evaluation

Figure 11.

Figure 12.

Figure 13.

Figure 14.

Quantitative assessment

Table 1.

Ablation study

Figure 15.

Table 2.

Complexity and runtime

Table 3.

Table 4.

Application tests

Figure 16.

Conclusion

Acknowledgements

Author contributions

Data availability

Declarations

Competing interests

Footnotes

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases