Scientific Reports. 2024 Dec 4;14:30246. doi: 10.1038/s41598-024-81594-6

Optimized dual-tree complex wavelet transform aided multimodal image fusion with adaptive weighted average fusion strategy

Jampani Ravi 1, R Narmadha 1
PMCID: PMC11618366  PMID: 39632891

Abstract

Image fusion is generally utilized for retrieving significant data from a set of input images to provide useful informative data. Image fusion enhances the applicability and quality of data. Hence, multimodal image fusion is an emerging research topic, in which multimodal images are combined into a single image in order to preserve exact details. On the other hand, the existing approaches face challenges in the precise interpretation of source images, and they capture only local information without considering a wide range of information. To address these weaknesses, a multimodal image fusion model is developed based on a multi-resolution transform along with an optimization strategy. At first, the images are effectively acquired from standard public datasets; further, the images are given to the Optimized Dual-Tree Complex Wavelet Transform (ODTCWT) to acquire low-frequency and high-frequency coefficients. Here, certain parameters in DTCWT are tuned with a hybridized heuristic strategy, the Probability of Fitness-based Honey Badger Squirrel Search Optimization (PF-HBSSO), to enhance the decomposition quality. Then, the fusion of high-frequency coefficients is performed using an adaptive weighted average fusion technique, where the weights are optimized using PF-HBSSO to achieve the optimal fused results. Similarly, the low-frequency coefficients are combined by average fusion. Finally, the fused images undergo image reconstruction using the inverse ODTCWT. The experimental evaluation of the designed multimodal image fusion illustrates superiority that distinguishes this work from others.

Subject terms: Health care, Medical research

Introduction

Image fusion is a recently emerging concept owing to the amplifying requirements of diverse image processing applications, especially medical-aided diagnosis, video surveillance, remote sensing, and so on1. Image fusion is rapidly growing with various imaging sensors and the accessibility of a huge range of imaging techniques like Computed Tomography (CT), Magnetic Resonance Imaging (MRI), etc. It has enabled the healthcare community to make efficient decisions and provide treatment to patients2. In addition, the major goal of image fusion also considers various factors: the fused image should be reliable and robust, inconsistencies or artifacts have to be eliminated, and salient information in any of the inputs must not be eradicated2. On the other hand, the major issues of image fusion research are the similarity across modalities, as data formation can be statistically uncorrelated and completely different, the efficient feature illustration of every modality, image noise, etc.3. In addition to these requirements, in real-time applications, image fusion-guided disease prognosis and diagnosis have been formulated to assist medical professionals in decision-making, as there is a restriction on human interpretation of clinical images owing to their subjectivity4. The major reason for multimodal clinical image fusion is to get information with superior quality by combining the complementary information from various source images5.

Image fusion can be performed in various ways, including multi-focus, multi-temporal, multi-modal, and multi-view fusion techniques6. Among these approaches, multi-modal image fusion is more essential, and it is carried out on images gathered by different sensors. Moreover, it is more helpful in getting precise results, especially in the medical field7. Generally, multi-modal image fusion combines both complementary and supplementary information of the source images8. Multimodal image fusion results in final fused images that are free from redundant and random information. It minimizes the storage space, since one final fused image is stored instead of two individual images9. The amalgamation of two different modalities results in precise localization or detection of abnormalities. The fundamental features of multi-modal image fusion are listed here. It reduces uncertainty, as the joint information from various sensors minimizes the vagueness related to the decision or sensing process10. The temporal and spatial coverage is extended for better performance. Fusion requires condensed representations to give complete information on images11. Multi-modal image fusion increases the system efficiency by reducing the redundancy in different measurements. It enhances reliability and reduces noise12.

Generally, image fusion is conducted via different approaches like transform domain as well as spatial domain methods. High pass filtering, Intensity and Hue Saturation (IHS), the Brovey method, etc., are performed in the spatial domain. Hence, transform domain approaches come into the picture. Some popular transforms used for the image fusion process include Contourlet (CT)13, Curvelet (CVT)14, Stationary Wavelet (SWT)12, Discrete Wavelet (DWT)11, DTCWT15, and Non-Subsampled Contourlet Transform (NSCT)16. Compared with spatial domain methods, transform domain techniques attain higher efficiency in terms of image fusion. On the other hand, a reliable, accurate, and appropriate image fusion approach is needed for several classes of images in diverse domains, and it must be simply interpretable for getting superior image fusion performance17. Some challenges are expensive computation time, uncontrollable acquisition conditions, and errors found in fusing the images. Thus, there is a need for an innovative multi-modal image fusion approach that integrates two medical images by adopting transform domain techniques.

The innovations suggested in this paper are listed here.

  • To recommend a new multimodal image fusion model with intelligent approaches like ODTCWT and an adaptive weighted average fusion strategy, along with a hybrid nature-inspired approach, for performing medical image fusion toward better localization of abnormalities and disease diagnosis from the gathered images.

  • To propose a novel ODTCWT and adaptive weighted average fusion strategy for efficient image fusion using a new hybrid nature-inspired algorithm termed PF-HBSSO, obtaining the salient information from the input images and formulating the objective as the maximization of fused mutual information.

  • To implement the PF-HBSSO algorithm by combining the Honey Badger Algorithm (HBA) and the Squirrel Search Algorithm (SSA), optimizing the filter coefficients of ODTCWT and the weights of the adaptive weighted average fusion model, thereby increasing the convergence rate and maximizing the fused image quality.

Following the introduction, the remaining sections are organized as follows. Part II discusses the existing works. Part III recommends an innovative model for multimodal image fusion. Part IV specifies the generation of the low- and high-frequency coefficients using ODTCWT. Part V derives the fusion of the high- and low-frequency coefficients by the heuristic algorithm. Part VI estimates the results, and Part VII concludes this paper.

Study on existing works

Literature review

Research work based on deep learning models

In 2021, Zuo et al.18 have presented a new automated multi-modal medical image fusion strategy using classifier-based feature synthesis with a deep multi-fusion scheme. They used a pre-trained autoencoder to analyze the fusion strategy with a multi-cascade fusion decoder and a feature classifier. Public datasets were used for analyzing the image fusion results. The Parameter-Adaptive Pulse Coupled Neural Network (PAPCNN) was used on the low- and high-frequency coefficients. This image fusion was especially used for classifying brain diseases via the final fused images.

In 2022, Sun et al.19 have suggested a new deep MFNet using LiDAR data and multimodal VHR aerial images for semantic segmentation of remote sensing data. Multimodal learning and an attention strategy were utilized for adaptively fusing the intramodal and intermodal features. A multilevel feature fusion module, pyramid dilation blocks, and a multimodal fusion strategy were implemented. The proposed network adopted the adaptive fusion of multimodal features, enhanced the effects of global-to-local contextual fusion, and improved the receptive field. Moreover, the network was optimized using a multiscale supervision training strategy. The ablation studies and simulation outcomes have shown the supreme performance of the recommended MFNet.

In 2021, Fu et al.20 have proposed a novel multimodal biomedical image fusion approach through a deep Convolutional Neural Network (CNN) and a rolling guidance filter. The VGG model was applied for enhancing the image details and edges, and the rolling guidance filter was intended for extracting the detail and base images. Here, the fusion was done on the perceptual, detail, and base images with three diverse fusion methods. They then chose the image decomposition constraints by simulation for getting suitable structure and texture images. In addition, a normalization operation was applied to the perceptual images to eradicate noise and feature variations. Finally, the approach has shown superior fusion outcomes and achieved better performance in terms of various objective measures.

In 2022, Goyal et al.21 have collected the images from standard sources, where NSCT was used for extracting the features. Next, a Siamese Convolutional Neural Network (sCNN) was applied for getting the significant features by weighted fusion. In order to eradicate noise, a Fractional-Order Total Generalized Variation (FOTGV) method was employed. Finally, the combination of the NSCT + sCNN + FOTGV strategies has helped in enhancing the image fusion and also exhibited higher performance in both quantitative and visual analysis.

In 2022, Venkatesan and Ragupathy22 have suggested a medical image fusion approach for fusing both MRI and CT images in a recommended healthcare model. To get both spectral and spatial domain features, a hybrid technique integrating a Deep Neural Network and DWT was suggested, achieving a higher accuracy rate than traditional approaches. Performance enhancement was noticed in terms of standard deviation and average entropy for the designed DWT-CNN fusion approach compared with other wavelet transform methods. Superior efficiency in image fusion was noticed, and a considerable fusion performance rate was achieved.

In 2018, Bernal et al.23 have suggested a supervised deep multimodal fusion model for automated human activity and egocentric action recognition to monitor and assist patients. This model collected video data using a body-mounted (egocentric) camera and motion data gathered with wearable sensors. The performance was estimated on a public multimodal dataset to analyze the efficiency. They used a CNN-LSTM architecture for performing the multimodal fusion to obtain results regarding automated human activity and egocentric action recognition.

Research work based on machine learning algorithms

In 2021, Duan et al.24 have recommended a new regional medical multimodal image fusion by adopting a Genetic Algorithm (GA)-derived optimization approach. A weighted averaging technique was recommended for averaging the source clinical images. Next, a fast Linear Spectral Clustering (LSC) superpixel technique was used for obtaining homogeneous regions and preserving the detailed information of the images; it segmented the average images and obtained the superpixel labels. The most significant regions were chosen to produce a decision map. The efficiency of the designed fusion approach was estimated via various experimental evaluations. Finally, the performance estimation of GA-based image fusion has shown the superiority of the final fused images over others.

Research work based on image processing techniques

In 2022, Kong et al.25 have implemented a new medical image fusion approach via Side Window Filtering (SWF) and Gradient Domain-Guided Filter Random Walk (GDGFRW) in the Framelet Transform (FT) domain. Initially, FT was used on standard multimodal images for getting the residual and approximate illustrations. Then, a new GDGFRW, built by integrating the strengths of the gradient domain-guided filter and the random walk, was used for interpreting the sub-bands, and the fusion was done by SWF. Next, inverse FT was performed to obtain the fused images. This approach addressed the fusion issues and outperformed recent representative methods regarding objective estimation and subjective visual efficiency.

Comparative analysis of the existing techniques and proposed model

In 2023, Zhang et al.26 have implemented a novel fusion approach using Infrared-To-Visible Object Mapping (IVOMFuse) for extracting the target region from the infrared image. Further, Expectation–Maximization (EM) was employed to tune the probabilities in the target region. The fused image was attained by combining PCA and an average fusion strategy, and the final validation was performed on the TNO, CVC14, and RoadScene datasets. In 2022, Zhou et al.27 have suggested a differential image registration model termed robust image fusion to assist thermal anomaly detection (Re2FAD), where the fusion strategy was effectively applied to enhance the accuracy. In 2023, Gu et al.28 have implemented an improved end-to-end image fusion approach (FSGAN) to enhance image fusion, where an auxiliary network was employed to enhance performance across diverse experiments.

Owing to the heterogeneous nature of the input data, multimodal image fusion is challenging due to misalignment and non-linear relationships between the inputs26. Also, decomposition-based methods are not highly preferred in fusion models27, and there is a complexity in discovering better multimodal images with fusion quality estimation in the suggested image fusion approaches28. To eradicate the drawbacks of the existing techniques, an effective optimization-aided fusion method is implemented in the multimodal image fusion model. By considering the decomposition model, the multimodal image fusion gets enhanced by analyzing the texture details and smooth layers. It has the ability to maximize image quality and measure the performance effectively. Diverse implementation outcomes show that the recommended framework yields better and more reliable results.

Problem specification

In multimodal image fusion, it is very challenging to perform the multi-scale analysis that intends to analyze the feature maps extracted using the shearlet domain. The analysis of the strengths and weaknesses of the existing models is given in Table 1. LSC and GA24 are very efficient in both the objective evaluation and visual effects in the segmentation of medical images. However, when the region count increases, the fusion efficiency may reduce and the running time of image fusion increases. PAPCNN18 provides accurate and detailed information in the fusion results. On the other hand, it does not completely utilize the fusion layer and decoding layer, which is observed through the quantification evaluations. Deep MFNet19 attains better performance regarding visualization and quantification in both quantitative and qualitative evaluations. Yet, it does not consider multi-scale decomposition for encoding and decoding to get performance enhancement. The VGG network20 ensures the final fused images through the combination of three informative images: the fused base image, detail image, and perceptual image. Still, it has provided results with increased color distortion and fusion noise without considering the fusion quality. sCNN21 is trained with concatenated features of huge significance, but it is time-consuming and cannot perform region-based medical image fusion. DWT-CNN22 is efficient at capturing the high-level association among the modalities and obtains the feature descriptors from the spatiotemporal regions; yet, it may fail to preserve shift-invariance. GDGFRW25 has the ability to understand the temporal patterns of behavior over data modalities, which have been hidden through the overriding of an individual modality. Still, it is unable to perform multi-focus image fusion. RNN23 has efficiently utilized location and hand presence as significant cues for automatically classifying the images. Yet, it does not support practical, feedback, and in-device inference in the fusion. Hence, it is important to develop an enhanced multimodal image fusion model with a superior optimization strategy.

Table 1.

Strengths and weaknesses of existing multimodal image fusion models.

| Author [citation] | Techniques | Strengths | Weaknesses |
| --- | --- | --- | --- |
| Duan et al.24 | LSC and GA | Very efficient in providing clear visual effects | When the region count increases, the fusion efficiency may reduce and the running time of image fusion increases |
| Zuo et al.18 | PAPCNN | Provides accurate and detailed information in the fusion results | Does not completely utilize the fusion layer and decoding layer, as observed through the quantification evaluations |
| Sun et al.19 | Deep MFNet | Attains better performance regarding visualization and quantification in both quantitative and qualitative evaluations | Does not consider multi-scale decomposition for encoding and decoding to get performance enhancement |
| Fu et al.20 | VGG network | Ensures the final fused images with the help of three informative images: fused base image, detail image, and perceptual image | Provides results with increased color distortion and fusion noise without considering the fusion quality |
| Goyal et al.21 | sCNN | Trained with concatenated features of huge significance | Time-consuming and cannot perform region-based medical image fusion |
| Venkatesan and Ragupathy22 | DWT-CNN | Efficient at getting the high-level association among the different modal images and obtains feature descriptors from the spatiotemporal regions | May fail to preserve shift-invariance |
| Kong et al.25 | GDGFRW | Able to understand the temporal patterns of behaviour over data modalities, which are hidden through the overriding of an individual modality | Unable to perform multi-focus image fusion |
| Bernal et al.23 | RNN | Efficiently utilizes location and hand presence as significant cues for automatically classifying the images | Does not support practical, feedback, and in-device inference in the fusion |

An intelligent model for multimodal image fusion

Collection of dataset

This multimodal image fusion approach gathers medical images for performing the fusion, which helps healthcare systems in better treatment planning and decision making. The data is available from https://www.med.harvard.edu/aanlib/home.htm. The proposed model considers MRI and Single Photon/Positron Emission Computed Tomography (SPECT/PET) images. The resolution of the images is taken as 256 × 256. In total, 66 images are considered for the evaluation. The gathered images are denoted as $C_a$, where $a = 1, 2, \ldots, A$ and the total number of obtained images is $A$. The sample images of MRI and SPECT/PET are visualized in Fig. 1.

Figure 1. Sample images in terms of MRI and SPECT/PET.

Proposed multimodal image fusion model

Imaging technology in healthcare applications needs a huge amount of information, which creates an additional requirement for medical image fusion. It is further split into single-modal and multimodal image fusion. Various researchers have focused on designing multi-modal image fusion owing to several complications in the information offered by single-modal fusion. Multimodal image fusion comprises both physiological and anatomical data, which makes disease detection easier. Various modalities in the medical field are SPECT, PET, MRI, CT, etc. They offer medical information regarding the human body's structural properties, such as soft tissue. Different imaging approaches preserve diverse characteristics of the same part. Hence, the reason for image fusion is to get a superior perceived experience, fusion quality, and contrast. Good image fusion results must follow certain constraints, like avoiding bad states such as noise and misregistration in the images. In classical approaches, the issues present in the fusion effects are enhanced but not addressed effectively, including feature information extraction and color distortion. Thus, there is a need to utilize innovative and intelligent approaches for medical image fusion, which remains a major complication in this research area. Finally, it is concluded that there is a huge requirement for medical image fusion with multimodal medical images for getting better functional and structural information about the same part; thus, the fused images will be high-quality, information-preserving images. Consequently, the multimodal image fusion model is designed with intelligent approaches in this paper, which promotes medical image fusion. The visual representation of the developed model is depicted in Fig. 2.

Figure 2. Architectural representation of multimodal image fusion with developed framework.

A new multimodal image fusion approach is recommended here, especially for the medical field, with the help of a multi-resolution transform and an optimization strategy. Firstly, the source medical images are collected from benchmark sources. The next process is to decompose both images using the ODTCWT to acquire the low- and high-frequency coefficients. Here, the filter coefficients of DTCWT are tuned using the recommended PF-HBSSO algorithm. The decomposition helps in distinguishing the frequencies of the images to get the texture as well as the smooth layer, and the individual processing of both frequency parts helps in better preservation of the images. Next, the fusion of the high-frequency coefficients is performed by the adaptive weighted average fusion technique, where the weights are optimized using the same PF-HBSSO algorithm to achieve the optimal fused results. Consequently, the low-frequency coefficients are fused using the standard average fusion technique. At last, the fused image is retained using inverse ODTCWT to maximize the fused mutual information and ensure the quality of the fused images.

Generating low and high frequency coefficients by optimized dual-tree complex wavelet transform

Optimization concept by PF-HBSSO

In this proposed multimodal image fusion approach, a new heuristic algorithm is recommended with the adoption of two recently familiar algorithms, SSA29 and HBA30. Here, the suggested model uses this new PF-HBSSO algorithm for maximizing the performance of image fusion regarding the maximization of fused mutual information. The PF-HBSSO algorithm optimizes the frequency coefficients in DT-CWT and also the weights used for the weighted average fusion of the high-frequency coefficients. This innovation increases the efficiency of the designed model compared with other approaches, as detailed in the result section. Here, HBA is chosen for performance enhancement owing to its vast range of features, like the skill to sustain the swap between the exploitation and exploration phases, good diversity, convergence speed, statistical significance in handling complex optimization problems, and utilization of dynamic search schemes. Conversely, it faces complications in handling local optima. Hence, SSA is adopted into this mechanism for better performance owing to its higher efficiency and capability to produce better solutions faster, even for critical and high-dimensional optimization problems.

A new PF-HBSSO algorithm is recommended in this paper by modifying the random parameter b used in the HBA technique, where b is assigned in the range of [0, 1] in conventional HBA. In PF-HBSSO, the same parameter b is instead computed from the probability of fitness-based solutions, as shown in Eq. (1).

$$b = \frac{\alpha}{P} \tag{1}$$

Here, P denotes the population size, and α specifies the capability of the individuals to reach food (α ≥ 1), as given in traditional HBA. Moreover, in the recommended PF-HBSSO, α is found by counting how many fitness values are less than the mean fitness, which yields a better reach toward optimal solutions at a higher convergence rate.

Consequently, this new parameter b is used for selecting the update rule from either HBA or SSA with the following conditions. If b < 0.5 holds, the solution update is carried out via the digging phase of HBA; otherwise, the solution update is performed using SSA. Here, the solution update in SSA is carried out by formulating case 1 with the condition of total flying squirrels. Hence, the scheme produces a higher convergence rate, stays free from local optima, and increases the efficiency of image fusion.
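As a concrete reading of this routing rule, the short sketch below (in Python, illustrative only; the paper's implementation is MATLAB) computes b as the fraction of individuals whose fitness falls below the population mean and then selects the update branch; the helper name compute_b is hypothetical.

```python
import numpy as np

def compute_b(fitness):
    # Eq. (1), as read here: alpha counts the individuals whose fitness is
    # below the population mean (alpha >= 1), P is the population size,
    # so b always falls in (0, 1] like the random parameter it replaces.
    P = len(fitness)
    alpha = max(1, int(np.sum(fitness < np.mean(fitness))))
    return alpha / P

# Branch selection described in the text:
fitness = np.array([0.21, 0.87, 0.44, 0.90])   # toy fitness values
use_digging = compute_b(fitness) < 0.5          # True -> HBA digging, False -> SSA case 1
```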

Initially, the population of search individuals is created based on HBA, as derived in Eq. (2).

$$z_j = l_j + b_1 \times \left( u_j - l_j \right) \tag{2}$$

The jth position of the honey badger is specified in Eq. (4). In Eq. (2), a random number within the limit [0, 1] is specified as $b_1$, the upper and lower bounds of the search range are termed $u_j$ and $l_j$, and the jth candidate solution is derived as $z_j$.

$$z = \begin{bmatrix} z_{11} & z_{12} & \cdots & z_{1S} \\ z_{21} & z_{22} & \cdots & z_{2S} \\ \vdots & \vdots & \ddots & \vdots \\ z_{P1} & z_{P2} & \cdots & z_{PS} \end{bmatrix} \tag{3}$$
$$z_j = \left[ z_{j1}, z_{j2}, \ldots, z_{jS} \right] \tag{4}$$

The size of the population is shown as P. In the next stage, the parameter b is computed by Eq. (1) and the condition b < 0.5 is verified. The solution update proceeds with HBA when b < 0.5 is satisfied.

HBA has two phases for updating its solutions, namely the digging phase and the honey phase. However, the PF-HBSSO algorithm formulates only the digging phase; the honey phase is replaced by SSA.

The digging phase is executed while the honey badger moves in a cardioid shape, as shown in Eq. (5). Here, the position is updated through several movement patterns that help locate better solutions during the optimization. The updated position is suited to global optimization, where the multivariate function is maximized or minimized to find the optimal solution, helping attain the desired and reliable outcomes. This position update mechanism also helps avoid premature convergence, maximizing the algorithm's efficiency and effectiveness.

$$z_{new} = z_{prey} + D \times \alpha \times I_t \times z_{prey} + D \times b_3 \times \delta \times h_j \times \cos(2\pi b_4) \times \left[ 1 - \cos(2\pi b_5) \right] \tag{5}$$
$$D = \begin{cases} 1 & \text{if } b_6 \le 0.5 \\ -1 & \text{otherwise} \end{cases} \tag{6}$$
$$I_t = b_2 \times \frac{CS}{4\pi h_j^2} \tag{7}$$
$$CS = \left( y_j - y_{j+1} \right)^2 \tag{8}$$
$$h_j = z_{prey} - z_j \tag{9}$$
$$\delta = G \times \exp\left( \frac{-i}{i_{max}} \right) \tag{10}$$

The variables $b_2$, $b_3$, $b_4$, $b_5$, and $b_6$ denote random numbers in the interval [0, 1]. Also, the global prey position is denoted as $z_{prey}$; the flag used for modifying the search direction is $D$; the term $I_t$ specifies the smell intensity; the distance between the individual and the prey is $h_j$; the concentration strength is $CS$; the density factor is $\delta$; and the constant term is $G$, where $i_{max}$ denotes the maximum number of iterations and $i$ is the iteration number.
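For illustration, the digging move of Eqs. (5)-(10) can be sketched per individual as follows; the elementwise treatment of the intensity term, the neighbour used for the concentration strength, and the default G = 6 follow common HBA practice and are assumptions rather than settings reported in the paper.

```python
import numpy as np

def digging_update(z_j, z_next, z_prey, it, it_max, alpha, G=6.0, rng=None):
    # One digging-phase move of Eqs. (5)-(10), applied elementwise to a
    # candidate solution z_j; z_next is the next individual in the
    # population (used by Eq. (8)) and z_prey is the best solution so far.
    rng = np.random.default_rng() if rng is None else rng
    b2, b3, b4, b5, b6 = rng.random(5)
    D = 1.0 if b6 <= 0.5 else -1.0                    # Eq. (6): search-direction flag
    h = z_prey - z_j                                  # Eq. (9): distance to the prey
    CS = (z_j - z_next) ** 2                          # Eq. (8): concentration strength
    I_t = b2 * CS / (4.0 * np.pi * h ** 2 + 1e-12)    # Eq. (7): smell intensity
    delta = G * np.exp(-it / it_max)                  # Eq. (10): density factor
    return (z_prey + D * alpha * I_t * z_prey         # Eq. (5): cardioid-shaped move
            + D * b3 * delta * h
              * np.cos(2 * np.pi * b4) * (1 - np.cos(2 * np.pi * b5)))
```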

Then, the proposed PF-HBSSO verifies the condition b < 0.5; if it is not satisfied, the solutions are updated by formulating case 1 of SSA with the condition of total flying squirrels.

The locations of the search individuals are updated as they move from the acorn trees to the hickory nut tree, as derived in Eq. (11).

$$z_{act}^{new} = \begin{cases} z_{act}^{old} + r_G \times KC \times \left( z_{hit}^{i} - z_{act}^{i} \right) & \text{if } v_1 \ge \rho_{dg} \\ \text{random position} & \text{otherwise} \end{cases} \tag{11}$$
$$r_G = \frac{y_G}{\tan\phi \times o} \tag{12}$$
$$\tan\phi = \frac{Q}{X} \tag{13}$$
$$Q = \frac{1}{2} \chi e^2 s_r Cr_{dd} \tag{14}$$
$$X = \frac{1}{2} \chi e^2 s_r Cr_{ll} \tag{15}$$

In the aforementioned equations, the constant terms are correspondingly derived as $\chi$, $o$, $y_G$, $e$, $s_r$, $Cr_{ll}$ and $Cr_{dd}$; the lift force is represented by $X$; $Q$ specifies the drag force; the gliding angle is known as $\tan\phi$; the random gliding distance is indicated by $r_G$; the random function $v_1$ is computed in the interval [0, 1]; the gliding constant is derived by $KC$; the position of the search individual that reached the hickory nut tree is $z_{hit}^{i}$; and the position of the search individual in the acorn tree is $z_{act}^{i}$. Moreover, the predator presence probability $\rho_{dg}$ plays an essential role in updating the positions of the individuals.
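A minimal sketch of this case-1 update is given below; the aerodynamic constants (air density χ, gliding speed e, surface area s_r, drag and lift coefficients, y_G = 8, o = 18) and the defaults KC = 1.9 and ρ_dg = 0.1 follow the usual SSA recommendations and are assumptions here, not values taken from the paper.

```python
import numpy as np

def squirrel_update(z_act, z_hit, rho_dg=0.1, KC=1.9, bounds=(0.0, 1.0), rng=None):
    # SSA case 1 (acorn tree -> hickory nut tree), Eqs. (11)-(15).
    rng = np.random.default_rng() if rng is None else rng
    lo, hi = bounds
    if rng.random() >= rho_dg:                        # Eq. (11): no predator, glide
        chi, e, s_r = 1.204, 5.25, 154e-4             # air density, speed, surface area
        Cr_dd, Cr_ll = 0.6, rng.uniform(0.675, 1.5)   # drag / lift coefficients
        Q = 0.5 * chi * e ** 2 * s_r * Cr_dd          # Eq. (14): drag force
        X = 0.5 * chi * e ** 2 * s_r * Cr_ll          # Eq. (15): lift force
        tan_phi = Q / X                               # Eq. (13): gliding angle
        r_G = 8.0 / (tan_phi * 18.0)                  # Eq. (12) with y_G = 8, o = 18
        return z_act + r_G * KC * (z_hit - z_act)
    return rng.uniform(lo, hi, size=z_act.shape)      # predator present: random relocation
```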

Finally, the search individuals are updated and the optimal solutions are attained for enhancing the efficiency of image fusion. Here, the new parameter update used for integrating the two familiar algorithms gives better efficiency, as exhibited in the results.

The pseudo-code of PF-HBSSO is shown in Algorithm 1.


Algorithm 1: PF-HBSSO
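In place of the pseudo-code figure, the following sketch ties the pieces together in the order described above: initialization by Eq. (2), the b-based routing of Eq. (1), and the two update branches. It reuses compute_b, digging_update and squirrel_update from the earlier sketches; the greedy replacement step and the negated-fitness (minimization) convention are assumptions.

```python
import numpy as np

def pf_hbsso(objective, dim, P=10, it_max=10, lo=0.0, hi=1.0, rng=None):
    # objective maps a solution vector to the fused mutual information
    # to be maximised; internally its negation is minimised.
    rng = np.random.default_rng() if rng is None else rng
    z = lo + rng.random((P, dim)) * (hi - lo)          # Eq. (2): initial population
    fit = np.array([-objective(z_j) for z_j in z])
    for it in range(1, it_max + 1):
        best = z[np.argmin(fit)].copy()                # prey / hickory nut tree
        b = compute_b(fit)                             # Eq. (1)
        alpha = max(1, int(np.sum(fit < fit.mean())))  # food-reaching ability
        for j in range(P):
            if b < 0.5:                                # HBA digging branch
                cand = digging_update(z[j], z[(j + 1) % P], best,
                                      it, it_max, alpha, rng=rng)
            else:                                      # SSA case-1 branch
                cand = squirrel_update(z[j], best, bounds=(lo, hi), rng=rng)
            cand = np.clip(cand, lo, hi)
            f = -objective(cand)
            if f < fit[j]:                             # greedy replacement (assumed)
                z[j], fit[j] = cand, f
    return z[np.argmin(fit)]
```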

Hybrid heuristic algorithms have exhibited strong significance in recent days, and thus this paper also recommends a hybrid algorithm to increase the efficiency of image fusion. The applications of the suggested PF-HBSSO model include solving unimodal, multimodal, and multi-dimensional optimization problems, system control, machine design, and engineering planning. The flowchart of the designed PF-HBSSO is visualized in Fig. 3.

Figure 3. Flowchart of the designed PF-HBSSO algorithm.

Optimized DT-CWT-based image decomposition

In the recommended model, the decomposition of both MRI and SPECT/PET images is done using ODT-CWT, where the newly recommended PF-HBSSO optimizes the filter coefficients of DTCWT to obtain the better fusion effects that promote the medical diagnosis approach. The gathered images $C_a$ are given to ODT-CWT, where the decomposition yields the low-frequency and high-frequency coefficients.

DT-CWT31 is the most eminent approach used in image fusion, where masks are used for extracting the information from the decomposed structure. It is an extended version of DWT, processed by executing two parallel trees, which is useful in eradicating aliasing effects and achieving shift invariance. It also helps reveal visual sensitivity, comprising real and complex coefficients. The gathered images $C_a$ are given to ODT-CWT to obtain the low-frequency $L$ and high-frequency $H$ coefficients, as in Eq. (16).

$$\left[ L, H \right] = \text{ODT-CWT}\left( C_a \right) \tag{16}$$
$$L^{FU} = \phi_L \left( L_1, L_2 \right) \tag{17}$$
$$H^{FU} = \phi_H \left( H_1, H_2 \right) \tag{18}$$

Here, the fusion rules for the high- and low-frequency coefficients are correspondingly known as $\phi_H$ and $\phi_L$, which are optimized by the PF-HBSSO algorithm.

Finally, ODT-CWT offers ideal reconstruction over the traditional wavelet transform, yielding a better multimodal image fusion approach for medical images, as explained in the upcoming sections.
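As a rough stand-in for the decomposition of Eq. (16) and the inverse of Eq. (25), the open-source Python dtcwt package can be used as sketched below. The package, its default biorthogonal/Q-shift filters, and the three decomposition levels are assumptions: the paper tunes the DTCWT filter coefficients with PF-HBSSO and implements everything in MATLAB.

```python
import numpy as np
import dtcwt  # reference dual-tree complex wavelet implementation (assumed installed)

def decompose(image, levels=3):
    # Eq. (16): split an image into low-frequency (lowpass) and
    # high-frequency (complex highpass) coefficients.
    transform = dtcwt.Transform2d()
    pyramid = transform.forward(image.astype(float), nlevels=levels)
    return pyramid.lowpass, pyramid.highpasses, transform

def reconstruct(transform, lowpass, highpasses):
    # Eq. (25): inverse transform back to the image domain.
    return transform.inverse(dtcwt.Pyramid(lowpass, highpasses))
```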

The framework of ODT-CWT for image decomposition using recommended PF-HBSSO algorithm is expressed in Fig. 4.

Figure 4. Illustration of ODT-CWT for image decomposition using recommended PF-HBSSO algorithm.

High frequency and low frequency image fusion by proposed heuristic algorithm

Developed objective model

The implemented multimodal image fusion approach aims to improve the performance rate with the help of PF-HBSSO algorithm. Here, the PF-HBSSO algorithm is used for optimizing the frequency coefficients in DT-CWT and also the weights used for the weighted average fusion method for fusing the high frequency coefficients. This model considers the major goal as the maximization of fused mutual information as equated in Eq. (19).

$$OF = \underset{\left\{ \phi_H, \phi_L, W_1, W_2 \right\}}{\arg\max} \left( FMI \right) \tag{19}$$

Here, the high- and low-frequency coefficients are optimized using the PF-HBSSO algorithm, and the weights used for the weighted average fusion method, represented as $W_1$ and $W_2$, are also optimized by PF-HBSSO. The ranges of $\phi_H$ and $\phi_L$ are assigned within [-20, 20], and $W_1$ and $W_2$ within [0, 1]. The optimal tuning of the frequency coefficients results in better image decomposition, whereas the weight optimization in the weighted average fusion increases the performance of the high-frequency fusion method. The term FMI represents the fused mutual information, which is determined between the fused image and the source images as derived below.

$$FMI = FMI_{IF} + FMI_{IS} \tag{20}$$
$$FMI_{IF} = \sum_{i_g=0}^{Y_s} \sum_{j_g=0}^{Q_s} gt_{F_i A_i}(i_g, j_g) \log_2 \frac{gt_{F_i A_i}(i_g, j_g)}{gt_{F_i}(i_g, j_g) \, gt_{A_i}(i_g, j_g)} \tag{21}$$
$$FMI_{IS} = \sum_{i_g=0}^{Y_s} \sum_{j_g=0}^{Q_s} gt_{G_i A_i}(i_g, j_g) \log_2 \frac{gt_{G_i A_i}(i_g, j_g)}{gt_{G_i}(i_g, j_g) \, gt_{A_i}(i_g, j_g)} \tag{22}$$

In the aforementioned equations, the joint histograms between the source and fused images are correspondingly specified as $gt_{G_i A_i}(i_g, j_g)$ and $gt_{F_i A_i}(i_g, j_g)$; the column and row sizes of the image are correspondingly known as $Q_s$ and $Y_s$; and the normalized histograms of source image 1 $F_i$, source image 2 $A_i$, and fused image $B_i$ are specified accordingly. A higher mutual information value represents superior quality of the fused images.
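The fused mutual information of Eqs. (20)-(22) can be estimated from joint histograms as sketched below, assuming grayscale inputs of equal size; the 256-bin choice assumes 8-bit images and the function names are illustrative.

```python
import numpy as np

def mutual_information(img_a, img_b, bins=256):
    # MI between two images from their joint histogram, per Eqs. (21)-(22).
    joint, _, _ = np.histogram2d(img_a.ravel(), img_b.ravel(), bins=bins)
    joint /= joint.sum()                       # joint histogram gt(., .)
    pa = joint.sum(axis=1, keepdims=True)      # marginal of img_a
    pb = joint.sum(axis=0, keepdims=True)      # marginal of img_b
    nz = joint > 0                             # avoid log2(0)
    return float(np.sum(joint[nz] * np.log2(joint[nz] / (pa @ pb)[nz])))

def fused_mutual_information(src1, src2, fused, bins=256):
    # Eq. (20): FMI is the sum of the MI of each source with the fused image.
    return (mutual_information(src1, fused, bins)
            + mutual_information(src2, fused, bins))
```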

High frequency optimization by adaptive weighted average fusion

The recommended model obtains the high-frequency coefficients of the two imaging modalities using ODT-CWT, and they are fused using the adaptive weighted average fusion strategy. This scheme is modeled in Eq. (23).

$$HB_i = W_1 \times F_i^H + W_2 \times B_i^H \tag{23}$$

Here, the fused high-frequency coefficients are known as $HB_i$; the weights used for fusing the high-frequency coefficients of the images are correspondingly specified as $W_1$ and $W_2$, which are optimized by the PF-HBSSO algorithm within the range [0, 1]; and the high-frequency coefficients of the two source images are given as $F_i^H$ and $B_i^H$. The high-frequency coefficients are fused via this recommended adaptive weighted average fusion scheme, carried out by optimizing the weights used in the fusion process. The high-frequency coefficients are fused to preserve the edge information and increase the image fusion quality; the superiority of the frequency information in the images needs to be maintained to increase the contrast of the final fused images.

Finally, the fused high-frequency coefficients of the two source images are attained as $HB_i$, which are further given to the reconstruction process to get the final fused images. A sample representation of the adaptive weighted average fusion for fusing the high-frequency images is given in Fig. 5.

Figure 5. Fusion process of high frequency by adaptive weighted fusion model.

Low frequency optimization by average fusion

In this suggested image fusion model, average fusion is performed on the low-frequency coefficients of the two source images, which stores the local information and thereby improves the image fusion. This process is derived in Eq. (24).

$$LB_i = \frac{F_i^L + B_i^L}{2} \tag{24}$$

The average fusion method takes the average between the two different modal source images. Averaging is the simplest technique to implement, where the average over the pixels of the input low-frequency coefficients of the medical images is taken as the intensity of the output pixel. The averaging operation is useful in reducing bad information and enhancing good information in the images by taking a mean image. Although this approach is not eminent in image fusion, it is helpful for fusing the low-frequency coefficients. The final fused low-frequency coefficients of the two source images are attained as $LB_i$. Fig. 6 represents the average-fusion-based low-frequency fusion model.
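Both fusion rules reduce to a few lines, as sketched below: Eq. (23) with externally supplied weights (to be tuned by PF-HBSSO) and the plain average of Eq. (24); the sketch applies them per coefficient array and also works for the complex high-frequency sub-bands.

```python
import numpy as np

def fuse_high(h1, h2, w1, w2):
    # Eq. (23): adaptive weighted average of high-frequency coefficients;
    # w1, w2 in [0, 1] are the PF-HBSSO-optimized weights.
    return w1 * np.asarray(h1) + w2 * np.asarray(h2)

def fuse_low(l1, l2):
    # Eq. (24): plain average of the low-frequency coefficients.
    return (np.asarray(l1) + np.asarray(l2)) / 2.0
```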

Figure 6. Fusion process of low frequency by average fusion model.

Image reconstruction by inverse ODT-CWT

In this designed multimodal image fusion approach, the final fused images are attained using inverse ODT-CWT from both MRI and SPECT/PET images, which is derived in Eq. (25).

$$Fu_a = \text{IODT-CWT}\left( HB_i, LB_i \right) \tag{25}$$

Here, the final fused images $Fu_a$ are attained using inverse ODT-CWT by combining the low-frequency fused images $LB_i$ and the high-frequency fused images $HB_i$. At last, the final fused images are attained via the image reconstruction stage, ensuring higher quality regarding the maximization of the fused mutual information.

Experimental analysis

Validation setting

The recommended multimodal image fusion framework was implemented in MATLAB 2020a and evaluated with different quantitative measures. Techniques like Dragon Algorithm (DA)32, Grey Wolf Optimizer (GWO)33, HBA30, SSA29, and traditional transform approaches such as PCA34, DWT35, IHS36, DCT37, CWT38, NSCT39 and DT-CWT28 were used for comparison. The experimentation used a population size of 10, a chromosome length of 82, and a maximum of 10 iterations. The recent methods Spatial-Frequency Information Integration Network (SFINet)40, Channel Attention dual adversarial Balancing network (CABnet)41, DUSMIF42 and Dense-ResNet43 were also compared with the developed model.

Validation metrics

  1. SSIM: The SSIM measure helps to evaluate the local patterns of different pixel intensities. It is formulated in Eq. (26).
    $$SSIM(RC_a, Fu_a) = \frac{\left( 2\mu_{RC_a}\mu_{Fu_a} + V_{n1} \right)\left( 2\sigma_{RC_a Fu_a} + V_{n2} \right)}{\left( \mu_{RC_a}^2 + \mu_{Fu_a}^2 + V_{n1} \right)\left( \sigma_{RC_a}^2 + \sigma_{Fu_a}^2 + V_{n2} \right)} \tag{26}$$

    Here, the constants are represented as $V_{n1}$ and $V_{n2}$; the measure is computed between the two images $RC_a$ and $Fu_a$; the averages of $Fu_a$ and $RC_a$ are termed $\mu_{Fu_a}$ and $\mu_{RC_a}$; the covariance of $RC_a$ and $Fu_a$ is termed $\sigma_{RC_a Fu_a}$; and the variances of $RC_a$ and $Fu_a$ are termed $\sigma_{RC_a}^2$ and $\sigma_{Fu_a}^2$, respectively.

  2. BRISQUE: A no-reference quality score computed directly on the fused image as score = brisque($Fu_a$).

  3. Entropy: It is used to measure the information content of a fused image. A high entropy value indicates that the fused image has rich information content; entropy is given in Eq. (27).
    $$Ent = -\sum_{i_g=0}^{N_g} hh_{Fu_a}(i_g) \log_2 hh_{Fu_a}(i_g) \tag{27}$$

    Here, the term $hh_{Fu_a}(i_g)$ is specified as the probability of gray level $i_g$ in the fused image.

  4. PSNR: The PSNR is expressed in Eq. (28).
    $$PSNR = 20 \log_{10} \left( \frac{N_g^2}{RMSE^2} \right) \tag{28}$$

    Here, the number of gray levels is denoted as Ng.

  5. RMSE: It is formulated in Eq. (29).
    $$RMSE = \sqrt{ \frac{1}{Y_s \times Q_s} \sum_{i_g=1}^{Y_s} \sum_{j_g=1}^{Q_s} \left[ RC_a(i_g, j_g) - Fu_a(i_g, j_g) \right]^2 } \tag{29}$$

    Here, $RC_a$ is the reference image, $Fu_a$ is the fused image, and the intensity values of the reference and fused images are accordingly represented as $RC_a(i_g, j_g)$ and $Fu_a(i_g, j_g)$.

  6. Standard Deviation: It is used for measuring the fusion performance, where the larger standard deviation results show better fusion results. The STD is described in Eq. (30).
    $$STD = \left[ \frac{1}{Y_s \times Q_s} \sum_{i_g=1}^{Y_s} \sum_{j_g=1}^{Q_s} \left( Fu_a(i_g, j_g) - \hat{\mu} \right)^2 \right]^{1/2} \tag{30}$$

    In Eq. (30), the mean value of the image is given as $\hat{\mu}$; a computational sketch of these measures follows this list.
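For reference, the measures of Eqs. (27)-(30) can be computed as in the sketch below; the 256-level histogram assumes 8-bit images, and the PSNR expression follows the form stated in Eq. (28) rather than a particular library's convention.

```python
import numpy as np

def entropy(fused, levels=256):
    # Eq. (27): Shannon entropy of the fused-image gray-level histogram.
    hist, _ = np.histogram(fused, bins=levels, range=(0, levels))
    p = hist / max(hist.sum(), 1)
    p = p[p > 0]
    return float(-np.sum(p * np.log2(p)))

def rmse(ref, fused):
    # Eq. (29): root-mean-square error against the reference image.
    diff = np.asarray(ref, float) - np.asarray(fused, float)
    return float(np.sqrt(np.mean(diff ** 2)))

def psnr(ref, fused, levels=256):
    # Eq. (28) as stated in the text, with N_g the number of gray levels.
    return 20.0 * np.log10(levels ** 2 / rmse(ref, fused) ** 2)

def std_dev(fused):
    # Eq. (30): standard deviation of the fused image around its mean.
    f = np.asarray(fused, float)
    return float(np.sqrt(np.mean((f - f.mean()) ** 2)))
```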

Experimental images

Some of the resultant images attained using the different techniques and the proposed model are shown in Fig. 7. Recent image fusion techniques like the Unified and Unsupervised end-to-end image fusion network (U2Fusion)37, the Information Gate Network for multimodal medical image fusion (IGNFusion)38 and the Fast, Lightweight Image Fusion Network (FLFuse-Net) are compared in the image fusion model. This resultant analysis helps the developed model demonstrate effective outcomes.

Figure 7. Resultant images attained using the existing method and proposed model.

Estimation over heuristic approaches

The effectiveness of the achieved fused images is analyzed over various techniques, as given in Fig. 8. From the evaluation, it is clearly shown that the designed model exhibits higher performance than traditional approaches. For example, while comparing with recent heuristic algorithms, the recommended ODT-CWT-based PF-HBSSO algorithm shows superior effectiveness over traditional methods.

Figure 8. Efficiency analysis on designed model over heuristic approaches for (a) Fused mutual information, (b) Standard deviation, (c) Entropy, (d) Brisque, (e) SSIM, (f) PSNR, (g) RMSE and (h) SNR.

Estimation over transform approaches

The effectiveness of the designed image fusion model is estimated over traditional transform domain approaches as listed in Fig. 9. The final fused images are compared over conventional approaches using standard statistical measures to illustrate the efficiency of the multi-modal image fusion approach.

Figure 9. Efficiency analysis on designed model over transform domain approaches for (a) Fused mutual information, (b) Standard deviation, (c) Entropy, (d) Brisque, (e) SSIM, (f) PSNR, (g) RMSE and (h) SNR.

Comparative estimation of image fusion over heuristic algorithms

The comparative estimation of the designed image fusion over various optimization algorithms is given in terms of various performance metrics in Tables 2, 3, 4, 5, 6, 7, 8 and 9. The investigation clearly exhibits superior performance over both positive measures and error measures. The positive measures show superior performance regarding image fusion quality, whereas the error measures indicate performance enhancement through lower error rates in image fusion. Hence, the designed model exhibits better performance and is thus more applicable for medical applications.

Table 2.

Efficiency estimation on multi-modal image fusion over heuristic algorithms for fused mutual information.

Description/images DA-DT-CWT32 GWO-DT-CWT33 HBA-DT-CWT30 SSA-DT-CWT29 PF-HBSSO-DT-CWT
1 0.8525 0.85151 0.85586 0.83259 0.87444
2 0.8682 0.86711 0.87459 0.83907 0.87969
3 0.91896 0.91857 0.90787 0.91284 0.93112
4 0.86645 0.86428 0.87401 0.84555 0.88374
5 0.88159 0.88059 0.88691 0.85839 0.89245
6 0.87339 0.87252 0.87054 0.85232 0.89079
7 0.86841 0.86764 0.8725 0.83856 0.88029
8 0.86672 0.86576 0.86961 0.84187 0.87899

Table 3.

Efficiency estimation on multi-modal image fusion over heuristic algorithms for standard deviation.

Sl. no. Description/images DA-DT-CWT32 GWO-DT-CWT33 HBA-DT-CWT30 SSA-DT-CWT29 PF-HBSSO-DT-CWT
1 Image 1 0.18441 0.19791 0.17944 0.15973 0.075319
2 Image 2 0.20864 0.22428 0.20117 0.18487 0.085404
3 Image 3 0.14315 0.15275 0.14858 0.11729 0.058243
4 Image 4 0.17726 0.19063 0.17099 0.16133 0.072862
5 Image 5 0.1729 0.18498 0.17227 0.14846 0.070638
6 Image 6 0.15999 0.17152 0.15698 0.14393 0.065629
7 Image 7 0.21188 0.22769 0.20464 0.18641 0.086649
8 Image 8 0.19778 0.21236 0.19183 0.17146 0.080775

Table 4.

Efficiency estimation on multi-modal image fusion over heuristic algorithms for entropy.

Sl. no. Description/images DA-DT-CWT32 GWO-DT-CWT33 HBA-DT-CWT30 SSA-DT-CWT29 PF-HBSSO-DT-CWT
1 Image 1 4.9364 5.1441 4.8991 4.8477 4.1346
2 Image 2 4.5038 4.6521 4.5709 4.3923 3.694
3 Image 3 4.4418 4.6527 4.5535 5.0215 3.7606
4 Image 4 5.1221 5.3214 4.971 5.0402 4.2493
5 Image 5 3.2539 3.3675 2.8838 3.4634 2.5914
6 Image 6 4.1076 4.2943 4.1479 4.4421 3.4474
7 Image 7 4.5866 4.7862 4.5091 4.6408 3.8333
8 Image 8 4.6094 4.8272 4.57 4.5284 3.8149

Table 5.

Efficiency estimation on multi-modal image fusion over heuristic algorithms for Brisque.

Sl. no. Description/images DA-DT-CWT32 GWO-DT-CWT33 HBA-DT-CWT30 SSA-DT-CWT29 PF-HBSSO-DT-CWT
1 Image 1 34.72 33.499 33.975 34.113 50.491
2 Image 2 42.318 41.635 40.762 42.538 50.694
3 Image 3 35.131 34.103 33.798 29.979 46.193
4 Image 4 32.027 31.129 31.873 31.719 50.234
5 Image 5 51.256 51.128 51.811 50.567 49.221
6 Image 6 43.749 43.235 43.971 42.765 47.404
7 Image 7 40.128 39.314 39.872 39.115 52.629
8 Image 8 40.612 40.059 39.504 40.413 51.674

Table 6.

Efficiency estimation on multi-modal image fusion over heuristic algorithms for SSIM.

Sl. no. Description/images DA-DT-CWT32 GWO-DT-CWT33 HBA-DT-CWT30 SSA-DT-CWT29 PF-HBSSO-DT-CWT
1 Image 1 0.54999 0.64369 0.58262 0.61801 0.71643
2 Image 2 0.64224 0.69363 0.65925 0.59674 0.79625
3 Image 3 0.50736 0.48648 0.37731 0.45969 0.55752
4 Image 4 0.56224 0.69228 0.61818 0.63233 0.75819
5 Image 5 0.76257 0.77901 0.70539 0.77529 0.86377
6 Image 6 0.6102 0.65061 0.55874 0.55778 0.71829
7 Image 7 0.61764 0.6774 0.61425 0.65039 0.78316
8 Image 8 0.63523 0.70821 0.64291 0.69575 0.81592

Table 7.

Efficiency estimation on multi-modal image fusion over heuristic algorithms for PSNR.

Sl. no. Description/images DA-DT-CWT32 GWO-DT-CWT33 HBA-DT-CWT30 SSA-DT-CWT29 PF-HBSSO-DT-CWT
1 Image 1 20.684 20.646 20.168 19.739 15.254
2 Image 2 19.031 18.993 18.557 18.208 14.26
3 Image 3 15.789 15.792 15.627 15.511 11.32
4 Image 4 20.909 20.891 20.286 19.888 15.536
5 Image 5 20.754 20.727 20.073 20.1 16.6
6 Image 6 17.599 17.592 17.317 17.195 14.507
7 Image 7 20.585 20.544 19.911 19.41 14.535
8 Image 8 22.005 21.949 21.287 20.575 15.22

Table 8.

Efficiency estimation on multi-modal image fusion over heuristic algorithms for RMSE.

Sl. no. Description/images DA-DT-CWT32 GWO-DT-CWT33 HBA-DT-CWT30 SSA-DT-CWT29 PF-HBSSO-DT-CWT
1 Image 1 0.092423 0.092835 0.098082 0.10305 0.1727
2 Image 2 0.1118 0.11229 0.11808 0.12292 0.19364
3 Image 3 0.16239 0.16234 0.16545 0.16767 0.19566
4 Image 4 0.09006 0.090251 0.096756 0.1013 0.16718
5 Image 5 0.091685 0.091975 0.099161 0.098854 0.14791
6 Image 6 0.13183 0.13195 0.13619 0.13811 0.18822
7 Image 7 0.093485 0.093934 0.10103 0.10703 0.1876
8 Image 8 0.079389 0.079897 0.086228 0.093593 0.17338

Table 9.

Efficiency estimation on multi-modal image fusion over heuristic algorithms for SNR.

Sl. no. Description/images DA-DT-CWT32 GWO-DT-CWT33 HBA-DT-CWT30 SSA-DT-CWT29 PF-HBSSO-DT-CWT
1 Image 1 27.276 26.959 26.083 31.602 34.994
2 Image 2 30.031 29.38 29.161 34.029 38.593
3 Image 3 10.784 10.188 10.746 12.821 18.597
4 Image 4 27.033 26.664 25.178 31.938 35.092
5 Image 5 25.217 24.837 23.238 30.157 33.764
6 Image 6 22.915 22.257 22.519 25.582 31.039
7 Image 7 30.158 29.725 28.518 34.634 38.259
8 Image 8 31.562 30.903 29.694 37.241 40.532

Comparative estimation on image fusion over transform algorithms

The comparative analysis of the designed image fusion approach over various transform approaches for various performance metrics is given in Tables 10, 11, 12, 13, 14, 15, 16 and 17. By analyzing the values, the designed fusion model exhibits better performance and clearly shows its superiority over the traditional methods.

Table 10.

Efficiency estimation on multi-modal image fusion over transform algorithms for fused mutual information.

Sl. no. Description/images PCA34 DWT35 IHS36 DCT37 CWT38 NSCT39 DT-CWT31 PF-HBSSO-DT-CWT
1 Image 1 0.86364 0.85586 0.85153 0.85153 0.84941 0.84766 0.85613 0.87444
2 Image 2 0.87701 0.87451 0.86703 0.86703 0.86762 0.86405 0.85621 0.87969
3 Image 3 0.9253 0.90785 0.91939 0.91939 0.90232 0.90073 0.90937 0.93112
4 Image 4 0.8598 0.87402 0.86413 0.86413 0.86384 0.8418 0.87987 0.88374
5 Image 5 0.8844 0.88694 0.87999 0.87999 0.88161 0.86363 0.86911 0.89245
6 Image 6 0.88664 0.87043 0.87261 0.87261 0.86174 0.8568 0.8538 0.89079
7 Image 7 0.87038 0.87259 0.86816 0.86816 0.86618 0.85799 0.86487 0.88029
8 Image 8 0.87029 0.86968 0.86552 0.86552 0.86486 0.85432 0.86476 0.87899

Table 11.

Efficiency estimation on multi-modal image fusion over transform algorithms for standard deviation.

Sl. no. Description/images PCA34 DWT35 IHS36 DCT37 CWT38 NSCT39 DT-CWT31 PF-HBSSO-DT-CWT
1 Image 1 0.1999 0.17952 0.23414 0.1883 0.22214 0.21621 0.16046 0.075319
2 Image 2 0.22644 0.20125 0.27255 0.21351 0.25925 0.21735 0.18211 0.085404
3 Image 3 0.25139 0.1487 0.26477 0.14561 0.20006 0.25318 0.11279 0.058243
4 Image 4 0.20299 0.17106 0.19827 0.18215 0.23181 0.20938 0.14284 0.072862
5 Image 5 0.18549 0.17237 0.21269 0.17659 0.22224 0.18368 0.14942 0.070638
6 Image 6 0.21435 0.15706 0.24439 0.16407 0.21501 0.23724 0.13394 0.065629
7 Image 7 0.22666 0.20472 0.24793 0.21662 0.25713 0.24881 0.18497 0.086649
8 Image 8 0.20907 0.19191 0.23295 0.20194 0.23704 0.20858 0.17424 0.080775

Table 12.

Efficiency estimation on multi-modal image fusion over transform algorithms for entropy.

Sl. no. Description/images PCA34 DWT35 IHS36 DCT37 CWT38 NSCT39 DT-CWT31 PF-HBSSO-DT-CWT
1 Image 1 5.0084 4.9011 4.7997 4.9731 5.2523 5.9934 6.0335 4.1346
2 Image 2 4.4475 4.5728 4.3917 4.3967 4.9338 5.7999 5.8847 3.694
3 Image 3 4.8464 4.5564 4.8666 4.5999 4.9917 6.5128 5.7598 3.7606
4 Image 4 5.1871 4.9719 4.6253 5.147 5.4067 5.7485 5.8177 4.2493
5 Image 5 3.0693 2.8837 2.9753 3.0683 3.8068 5.1957 4.8821 2.5914
6 Image 6 4.3546 4.1496 4.1066 4.2122 4.713 5.8315 5.285 3.4474
7 Image 7 4.5861 4.5105 4.5238 4.5818 4.9843 5.7495 5.9722 3.8333
8 Image 8 4.5989 4.5713 4.5101 4.5678 4.8022 5.9588 5.9966 3.8149

Table 13.

Efficiency estimation on multi-modal image fusion over transform algorithms for Brisque.

Sl. no. Description/images PCA34 DWT35 IHS36 DCT37 CWT38 NSCT39 DT-CWT31 PF-HBSSO-DT-CWT
1 Image 1 34.558 33.97 36.564 33.974 31.267 47.121 39.126 50.491
2 Image 2 42.448 40.744 42.271 42.608 40.115 41.949 44.597 50.694
3 Image 3 33.791 33.806 33.519 33.301 32.158 34.555 39.778 46.193
4 Image 4 31.38 31.875 36.7 31.255 33.521 38.943 37.771 50.234
5 Image 5 51.26 51.808 51.589 51.25 49.286 51.231 51.001 49.221
6 Image 6 42.898 43.972 43.309 42.255 40.613 46.189 44.823 47.404
7 Image 7 39.965 39.841 39.953 39.584 34.686 46.161 42.549 52.629
8 Image 8 40.524 39.5 39.53 40.864 37.968 51.274 43.344 51.674

Table 14.

Efficiency estimation on multi-modal image fusion over transform algorithms for SSIM.

Sl. no. Description/images PCA34 DWT35 IHS36 DCT37 CWT38 NSCT39 DT-CWT31 PF-HBSSO-DT-CWT
1 Image 1 0.64535 0.58229 0.63141 0.68435 0.45334 0.21178 0.27059 0.71643
2 Image 2 0.78573 0.65898 0.74121 0.64224 0.46706 0.23733 0.23503 0.79625
3 Image 3 0.53106 0.37661 0.52761 0.48648 0.22034 0.12426 0.20906 0.55752
4 Image 4 0.75748 0.6178 0.66031 0.56224 0.46006 0.25329 0.26437 0.75819
5 Image 5 0.86136 0.70487 0.8353 0.76257 0.54155 0.17554 0.32254 0.86377
6 Image 6 0.68648 0.55807 0.61084 0.6102 0.34708 0.14925 0.29471 0.71829
7 Image 7 0.78005 0.61389 0.73722 0.61764 0.49649 0.25615 0.24889 0.78316
8 Image 8 0.80643 0.64257 0.75402 0.63523 0.53907 0.22809 0.3035 0.81592

Table 15.

Efficiency estimation on multi-modal image fusion over transform algorithms for PSNR.

Sl. no. Description/images PCA34 DWT35 IHS36 DCT37 CWT38 NSCT39 DT-CWT31 PF-HBSSO-DT-CWT
1 Image 1 20.082 20.166 17.755 20.729 18.929 17.713 18.092 15.254
2 Image 2 18.63 18.554 15.841 19.068 17.167 15.705 16.467 14.26
3 Image 3 13.264 15.626 12.787 15.8 14.834 12.767 14.842 11.32
4 Image 4 20.245 20.283 17.405 20.958 18.446 17.044 17.661 15.536
5 Image 5 20.475 20.069 17.59 20.797 18.096 17.444 18.3 16.6
6 Image 6 15.963 17.315 14.508 17.615 16.15 14.507 16.149 14.507
7 Image 7 20.367 19.908 17.363 20.645 18.301 17.063 17.281 14.535
8 Image 8 21.749 21.284 18.582 22.068 19.652 18.47 18.379 15.22

Table 16.

Efficiency estimation on multi-modal image fusion over transform algorithms for RMSE.

Sl. no. Description/images PCA34 DWT35 IHS36 DCT37 CWT38 NSCT39 DT-CWT31 PF-HBSSO-DT-CWT
1 Image 1 0.099055 0.098105 0.1295 0.091946 0.11312 0.13013 0.12457 0.1727
2 Image 2 0.11709 0.11811 0.16143 0.11133 0.13856 0.16396 0.1502 0.19364
3 Image 3 0.21717 0.16547 0.22943 0.16219 0.18126 0.19566 0.18109 0.22996
4 Image 4 0.09722 0.096792 0.13482 0.089558 0.11959 0.14055 0.13091 0.16718
5 Image 5 0.094678 0.099213 0.13197 0.091237 0.1245 0.13421 0.12162 0.14791
6 Image 6 0.15916 0.13622 0.18818 0.1316 0.15577 0.17455 0.15579 0.18822
7 Image 7 0.095864 0.10107 0.13547 0.092846 0.1216 0.14023 0.13676 0.1876
8 Image 8 0.081761 0.086256 0.11774 0.078812 0.10408 0.11926 0.12052 0.17338

Table 17.

Efficiency estimation on multi-modal image fusion over transform algorithms for SNR.

Sl. no. Description/images PCA34 DWT35 IHS36 DCT37 CWT38 NSCT39 DT-CWT31 PF-HBSSO-DT-CWT
1 Image 1 24.406 26.071 20.891 27.026 23.795 20.081 20.907 34.994
2 Image 2 27.532 29.148 22.478 29.831 26.862 20.259 18.763 38.593
3 Image 3 5.0268 10.74 4.5191 10.619 8.9618 4.7744 10.753 18.597
4 Image 4 27.943 25.164 20.919 27.042 22.847 21.512 20.537 35.092
5 Image 5 23.971 23.222 18.46 25.326 20.223 15.572 16.862 33.764
6 Image 6 17.343 22.509 15.456 22.806 20.613 14.629 20.12 31.039
7 Image 7 29.738 28.505 24.158 30.279 25.941 20.235 19.091 38.259
8 Image 8 29.974 29.675 23.437 31.718 26.734 18.896 19.753 40.532

Comparative analysis of the developed model with recent methods

The comparative analysis of the developed model against recent methods in the multimodal image fusion approach is shown in Table 18. This analysis is performed on the entropy measure, which helps show the potential of the designed framework. The recent methods SFINet, CABnet, DUSMIF, and Dense-ResNet are compared over different images. In this experimental evaluation, the developed PF-HBSSO-DT-CWT model shows entropy values 29.2%, 29.13%, 26.25%, and 30.9% lower than SFINet, CABnet, DUSMIF, and Dense-ResNet, respectively, when validating image 6. The performance evaluation shows that the developed model attains lower entropy values. Thus, it helps reduce the dimensionality problem that occurs between the data points. Overall, the developed model evaluates better data-quality outcomes to make reliable predictions in the multimodal image fusion approach.

Table 18.

Performance analysis of multimodal image fusion model with recent methods.

Description/images SFINet40 CABnet41 DUSMIF42 Dense-ResNet43 PF-HBSSO-DT-CWT
1 4.8956 5.7375 4.7904 4.974 4.1346
2 4.8576 4.8947 4.1643 4.8947 3.694
3 4.8432 4.5736 4.7465 5.8624 3.7606
4 5.8946 5.6436 4.7895 5.0782 4.2493
5 3.8965 3.7541 2.6854 3.7398 2.5914
6 4.8756 4.8645 4.6748 4.9898 3.4474
7 4.6574 4.7728 4.8935 4.8936 3.8333
8 4.8365 4.8554 4.6732 4.6283 3.8149

Conclusion

A multimodal image fusion model was recommended in this paper via heuristic-derived transform approaches. Initially, the medical source images were acquired from standard public datasets, and decomposition was then done using the ODTCWT to acquire the low-frequency and high-frequency coefficients. Here, the frequency coefficients in DTCWT were tuned with PF-HBSSO to enhance the fusion quality. Then, the fusion of the high-frequency coefficients was performed with the adaptive weighted average fusion technique, where the weights were optimized using the same PF-HBSSO algorithm to achieve the optimal fused results. Similarly, the low-frequency coefficients were fused by average fusion. Finally, the fused images were given to inverse ODTCWT for image reconstruction. The experiments have demonstrated that the recommended multimodal image fusion method offers superior efficiency to the conventional image fusion approaches. For example, the SNR of the designed PF-HBSSO-DTCWT achieved a higher rate than traditional approaches for image 8, being 35.2, 36.5, 72.9, 27.7, 51.6, 53, and 51% higher than PCA, DWT, IHS, DCT, CWT, NSCT, and DT-CWT. However, the proposed model has to be improved in future in terms of statistical analysis to eradicate the performance overlap with the traditional heuristic algorithms.

Acknowledgements

I would like to express my very great appreciation to the co-authors of this manuscript for their valuable and constructive suggestions during the planning and development of this research work.

Author contributions

J.R. and R.N. designed the model, computational framework and carried out the implementation. Jampani Ravi performed the calculations and wrote the manuscript with all the inputs. J.R. and R.N. discussed the results and contributed to the final manuscript.

Funding

This research did not receive any specific funding.

Data availability

The data underlying this article are available in https://www.med.harvard.edu/aanlib/home.htm.

Declarations

Competing interests

The authors declare no competing interests.

Ethical approval

Not Applicable.

Informed consent

Not Applicable.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1. Gao, X. W. & Hui, R. A deep learning based approach to classification of CT brain images. SAI Computing Conference (SAI) (2016).
  • 2. Yang, H., Sun, J., Li, H., Wang, L. & Xu, Z. Deep fusion net for multi-atlas segmentation: Application to cardiac MR images. MICCAI 2016: Medical Image Computing and Computer-Assisted Intervention, pp. 521–528 (2016).
  • 3. Nie, D., Zhang, H., Adeli, E., Liu, L. & Shen, D. 3D deep learning for multi-modal imaging-guided survival time prediction of brain tumor patients. MICCAI 2016: Medical Image Computing and Computer-Assisted Intervention, pp. 521–528 (2016).
  • 4. James, A. P. & Dasarathy, B. V. Medical image fusion: A survey of the state of the art. Inform. Fusion 19, 4–19 (2014).
  • 5. Li, S., Kang, X., Fang, L., Hu, J. & Yin, H. Pixel-level image fusion: A survey of the state of the art. Inform. Fusion 33, 100–112 (2017).
  • 6. James, A. P. & Dasarathy, B. A review of feature and data fusion with medical images. Computer Vision and Pattern Recognition (2015).
  • 7. Mangai, U. G., Samanta, S., Das, S. & Chowdhury, P. R. A survey of decision fusion and feature fusion strategies for pattern classification. IETE Tech. Rev. 27(4), 293–307 (2010).
  • 8. Hermessi, H., Mourali, O. & Zagrouba, E. Convolutional neural network-based multimodal image fusion via similarity learning in the shearlet domain. Neural Comput. Appl. 30, 2029–2045 (2018).
  • 9. Yang, Y. et al. Multimodal medical image fusion based on fuzzy discrimination with structural patch decomposition. IEEE J. Biomed. Health Inform. 23(4), 1647–1660 (2019).
  • 10. Gómez-Chova, L., Tuia, D., Moser, G. & Camps-Valls, G. Multimodal classification of remote sensing images: A review and future directions. Proc. IEEE 103(9), 1560–1584 (2015).
  • 11. Ioannidou, S. & Karathanassi, V. Investigation of the dual-tree complex and shift-invariant discrete wavelet transforms on Quickbird image fusion. IEEE Geosci. Remote Sens. Lett. 4(1), 166–170 (2007).
  • 12. Jiang, Q., Jin, X., Lee, S. & Yao, S. A novel multi-focus image fusion method based on stationary wavelet transform and local features of fuzzy sets. IEEE Access 5, 20286–20302 (2017).
  • 13. Bhateja, V., Patel, H., Krishn, A., Sahu, A. & Lay-Ekuakille, A. Multimodal medical image sensor fusion framework using cascade of wavelet and contourlet transform domains. IEEE Sens. J. 15(12), 6783–6790 (2015).
  • 14. Madheswari, K., Venkateswaran, N. & Sowmiya, V. Visible and thermal image fusion using curvelet transform and brain storm optimization. 2016 IEEE Region 10 Conference (TENCON), pp. 2826–2829 (2016).
  • 15. Tao, J., Li, S. & Yang, B. Multimodal image fusion algorithm using dual-tree complex wavelet transform and particle swarm optimization. Communications in Computer and Information Science, vol. 93, pp. 296–303 (2010).
  • 16. Kumari, D. & Agwekar, A. Survey paper on image fusion using hybrid non-subsampled contourlet transform and neural network. International Conference on Intelligent Computing and Control Systems (ICICCS), pp. 1564–1568 (2021).
  • 17. Zheng, S., Shi, W.-Z., Liu, J., Zhu, G.-X. & Tian, J.-W. Multisource image fusion method using support value transform. IEEE Trans. Image Process. 16(7), 1831–1839 (2007).
  • 18. Zuo, Q., Zhang, J. & Yang, Y. DMC-Fusion: Deep multi-cascade fusion with classifier-based feature synthesis for medical multi-modal images. IEEE J. Biomed. Health Inform. 25(9), 3438–3449 (2021).
  • 19. Sun, Y., Fu, Z., Sun, C., Hu, Y. & Zhang, S. Deep multimodal fusion network for semantic segmentation using remote sensing image and LiDAR data. IEEE Trans. Geosci. Remote Sens. 60, 1–18 (2022).
  • 20. Fu, J., Li, W., Ouyang, A. & He, B. Multimodal biomedical image fusion method via rolling guidance filter and deep convolutional neural networks. Optik 237, 166726 (2021).
  • 21. Goyal, S., Singh, V., Rani, A. & Yadav, N. Multimodal image fusion and denoising in NSCT domain using CNN and FOTGV. Biomed. Signal Process. Control 71, 103214 (2022).
  • 22. Venkatesan, B. & Ragupathy, U. S. Integrated fusion framework using hybrid domain and deep neural network for multimodal medical images. Multidimens. Syst. Signal Process. 33(3), 819–834 (2022).
  • 23. Bernal, E. A. et al. Deep temporal multimodal fusion for medical procedure monitoring using wearable sensors. IEEE Trans. Multimed. 20(1), 107–118 (2018).
  • 24. Duan, J. et al. A novel GA-based optimized approach for regional multimodal medical image fusion with superpixel segmentation. IEEE Access 9, 96353–96366 (2021).
  • 25. Kong, W. et al. Multimodal medical image fusion using gradient domain guided filter random walk and side window filtering in framelet domain. Inform. Sci. 585, 418–440 (2022).
  • 26. Zhang, X., Liu, G., Huang, L., Ren, Q. & Bavirisetti, D. P. IVOMFuse: An image fusion method based on infrared-to-visible object mapping. Dig. Signal Process. 137, 104032 (2023).
  • 27. Zhou, X. et al. Re2FAD: A differential image registration and robust image fusion method framework for power thermal anomaly detection. Optik 259, 168817 (2022).
  • 28. Gu, X. et al. Infrared-visible synthetic data from game engine for image fusion improvement. IEEE Trans. Games 16, 291–302 (2023).
  • 29. Jain, M., Singh, V. & Rani, A. A novel nature-inspired algorithm for optimization: Squirrel search algorithm. Swarm Evol. Comput. 44, 148–175 (2019).
  • 30. Hashim, F. A., Houssein, E. H., Hussain, K., Mabrouk, M. S. & Al-Atabany, W. Honey Badger algorithm: New metaheuristic algorithm for solving optimization problems. Math. Comput. Simul. 192, 84–110 (2022).
  • 31. Aghamaleki, J. A. & Ghorbani, A. Image fusion using dual tree discrete wavelet transform and weights optimization. Vis. Comput. 39(3), 1181–1191 (2023).
  • 32. Jafari, M. & Chaleshtari, M. H. B. Using dragonfly algorithm for optimization of orthotropic infinite plates with a quasi-triangular cut-out. Eur. J. Mech. A/Solids 66, 1–14 (2017).
  • 33. Mirjalili, S., Mirjalili, S. M. & Lewis, A. Grey wolf optimizer. Adv. Eng. Softw. 69, 46–61 (2014).
  • 34. Qian, J. et al. Structured illumination microscopy based on principal component analysis. eLight 3(1), 4 (2023).
  • 35. Trivedi, G. & Sanghavi, R. Fusesharp: A multi-image focus fusion method using discrete wavelet transform and unsharp masking. J. Appl. Math. Inform. 41(5), 1115–1128 (2023).
  • 36. Hussein, Y. D., Makkey, Y. M. & Abdelrahman, A. S. Hybrid fusion approach for Alzheimer’s disease progression employing IHS and wavelet transform. Menoufia J. Electron. Eng. Res. 33(1), 17–23 (2024).
  • 37. Xu, H., Ma, J., Jiang, J., Guo, X. & Ling, H. U2Fusion: A unified unsupervised image fusion network. IEEE Trans. Pattern Anal. Mach. Intell. 44(1), 502–518 (2020).
  • 38. Wang, C., Nie, R., Cao, J., Wang, X. & Zhang, Y. IGNFusion: An unsupervised information gate network for multimodal medical image fusion. IEEE J. Select. Top. Signal Process. 16(4), 854–868 (2022).
  • 39. Zhou, M., Huang, J., Yan, K., Hong, D., Jia, X., Chanussot, J. & Li, C. A general spatial-frequency learning framework for multimodal image fusion. IEEE Trans. Pattern Anal. Mach. Intell. (2024).
  • 40. Sun, L., Tang, M. & Muhammad, G. CABnet: A channel attention dual adversarial balancing network for multimodal image fusion. Image Vis. Comput. 147, 105065 (2024).
  • 41. Lin, C., Chen, Y., Feng, S. & Huang, M. A multibranch and multiscale neural network based on semantic perception for multimodal medical image fusion. Sci. Rep. 14(1), 17609 (2024).
  • 42. Ghosh, T. & Jayanthi, N. An efficient Dense-Resnet for multimodal image fusion using medical image. Multimed. Tools Appl. 83, 68181–68208 (2024).
  • 43. Long, Y., Jia, H., Zhong, Y., Jiang, Y. & Jia, Y. RXDNFuse: A aggregated residual dense network for infrared and visible image fusion. Inform. Fusion 69, 128–141 (2021).
