Abstract
Simple Summary
Medical imaging plays a vital role in the early-stage diagnosis of lung tumors and in monitoring them during treatment. Imaging modalities such as computed tomography (CT), chest X-ray (CXR), molecular imaging, magnetic resonance imaging (MRI), and positron emission tomography (PET) are widely used for lung cancer detection. This article presents a new dung beetle optimization modified deep feature fusion model for lung cancer detection and classification (DBOMDFF-LCC) technique.
Abstract
Lung cancer is the leading cause of cancer deaths worldwide. A major reason for these deaths is late diagnosis and poor prognosis. With the rapid advancement of deep learning (DL), DL approaches are now applied effectively and widely in real-world healthcare applications such as medical image interpretation and disease diagnosis. Medical imaging plays a vital role in the early-stage diagnosis of lung tumors and in monitoring them during treatment. Imaging modalities such as computed tomography (CT), chest X-ray (CXR), molecular imaging, magnetic resonance imaging (MRI), and positron emission tomography (PET) are widely used for lung cancer detection. This article presents a new dung beetle optimization modified deep feature fusion model for lung cancer detection and classification (DBOMDFF-LCC) technique. The DBOMDFF-LCC technique relies mainly on feature fusion and hyperparameter tuning. To this end, it applies a feature fusion process that combines three DL models, namely residual network (ResNet), densely connected network (DenseNet), and Inception-ResNet-v2. Furthermore, the dung beetle optimization (DBO) approach is employed to select optimal hyperparameters for the three DL models. For lung cancer detection, the DBOMDFF-LCC system uses a long short-term memory (LSTM) classifier. The DBOMDFF-LCC technique is evaluated on a medical dataset using several evaluation metrics. The extensive comparative results highlight the improved performance of the DBOMDFF-LCC technique in lung cancer classification.
Keywords: lung cancer, deep learning, feature fusion model, dung beetle optimizer, computer-aided diagnosis
1. Introduction
Over the last few decades, lung cancer has been a major cause of mortality. One of the common symptoms of lung tumors is coughing, which requires special consideration because most patients who have a cough are smokers, the main group affected by chronic obstructive pulmonary disease, which itself causes coughing [1,2]. Thoracic computed tomography (CT) and chest X-ray (CXR) are two common techniques for diagnosing lung tumors. Positron emission tomography (PET) and magnetic resonance imaging (MRI) are sometimes used to stage the extent of cancer spread, while CT and CXR help to determine appropriate therapeutic management [3]. Biopsy and bronchoscopy are necessary to establish the histological type and to confirm the diagnosis of a lung tumor [4,5]. Earlier investigations found that as many as 40% of nodules proved to be benign after nodule discovery and a diagnostic operation, which highlights the importance of rigorous nodule screening before further invasive treatment in order to avoid unwanted complications or loss of pulmonary capacity and to limit surgical risk [6].
Specific characteristics must be measured and recognized to identify malignant nodules [7,8]. Cancer probability can be assessed from the recognized features and their fusion. However, this task is highly challenging even for medical experts, because nodule presence and a positive cancer diagnosis are not simply related [9]. Computer-aided diagnosis (CAD) approaches use previously analyzed features that are in some way associated with cancer suspicion, such as shape, sphericity, volume, subtlety, spiculation, and solidity. They use machine learning (ML) systems such as support vector machines (SVMs) to categorize nodules as benign or malignant [10,11]. Although several studies use similar ML algorithms, the problem with this method is that, for the system to perform well, various parameters must be set individually for each case, which makes proficient outcomes hard to reproduce [12]. It also makes the approach sensitive to variation in screening parameters and across CT scans. The benefit of using deep learning (DL) in CAD systems is that it can perform end-to-end recognition by learning the most important features during training [13,14]. This enables the network to work effectively under variation, as it captures nodule features in CT scans acquired with different parameters [15]. Once trained, the network is expected to generalize what it has learned and to identify malignant nodules in new cases.
This article presents a new dung beetle optimization modified deep feature fusion model for lung cancer detection and classification (DBOMDFF-LCC) technique. The DBOMDFF-LCC technique relies mainly on feature fusion and hyperparameter tuning. To this end, it applies a feature fusion process that combines three DL models, namely residual network (ResNet), densely connected network (DenseNet), and Inception-ResNet-v2. Additionally, the dung beetle optimization (DBO) approach is employed to select optimal hyperparameters for the three DL models. For lung cancer detection, the DBOMDFF-LCC system uses a long short-term memory (LSTM) classifier. The DBOMDFF-LCC technique is evaluated on a medical dataset using several evaluation metrics.
2. Related Works
Dhivya and Sharmila [16] proposed a multimodal architecture named Ensemble Deep Lung Disease Predictor (EDepLDP) as a reliable solution for the rapid recognition of different diseases from CXR and CT scans. First, the collected images are segmented using the U-Net architecture to obtain enhanced lung regions of interest (ROIs). Next, Xception and InceptionResNetV2 are used to hierarchically extract informative features from the segmented CXR scans. Yu et al. [17] developed a paediatric fine-grained diagnosis-assistant system to give precise and prompt diagnoses. This model has two phases: a test result structuralization stage and a disease identification stage. The first phase structuralizes the test results by extracting numeric values from medical records, and the disease identification phase provides a diagnosis based on the text-form medical records and the structured information obtained in the first phase. Agarwal et al. [18] developed a DL-based multilayer multimodal fusion system that emphasizes extracting features from various layers and combining them. The disease detection method considers discriminative information from all the layers.
Behrad and Abadeh [19] reviewed common multimodal approaches, including fusion strategies and DL models. The authors also explained learning strategies such as end-to-end learning, multitask learning, and transfer learning, and then summarized DL methods for multimodal medical data analysis. Ullah et al. [20] developed a robust DL model for anatomical structure segmentation in chest radiographs that exploits a dual encoder-decoder CNN. The output of the pretrained encoder is passed through a squeeze-and-excitation (SE) module to increase the representational power of the network. Wang et al. [21] developed a DL architecture (3D-ResNet) based on CT scans and evaluated its effectiveness in differentiating nontuberculous mycobacterial lung disease (NTM-LD) from Mycobacterium tuberculosis lung disease (MTB-LD).
Akbulut [22] introduced a robust mechanism based on a new customized DL algorithm (ACL) that trains LSTM and attention models together with a CNN model. The significant traces and stains in the CXR images are highlighted with the marker-controlled watershed (MCW) segmentation method. Moreover, the contribution of this strategy to classification accuracy was thoroughly assessed. Chouhan et al. [23] proposed a novel DL architecture for the diagnosis of pneumonia using transfer learning (TL). The authors then developed an ensemble module that integrates the outputs of the pretrained models and outperforms the individual models, achieving remarkable performance in pneumonia detection.
Özbey et al. [24] presented SynDiff, a new approach based on adversarial diffusion modeling, to improve the performance of medical image translation. To capture the image distribution directly, SynDiff uses a conditional diffusion process that progressively maps noise and source images onto the target image. Dalmaz et al. [25] proposed ResViT, a novel generative adversarial approach for medical image synthesis that combines the contextual sensitivity of vision transformers with the precision of convolutional operators and the realism of adversarial learning. The ResViT generator uses a central bottleneck containing new aggregated residual transformer (ART) blocks that synergistically integrate residual convolutional and transformer modules. Yurt et al. [26] examined a multi-stream system that aggregates information across several source images using a mixture of multiple one-to-one streams and a joint many-to-one stream. The complementary feature maps produced in the one-to-one streams and the shared feature maps produced in the many-to-one stream are combined in a fusion block.
3. The Proposed Model
An automated lung cancer detection tool named the DBOMDFF-LCC system is developed in this study. The proposed DBOMDFF-LCC system is built on feature fusion and hyperparameter tuning. It comprises three stages, namely the feature fusion process, DBO-based hyperparameter selection, and LSTM classification. Figure 1 shows the overall flow of the DBOMDFF-LCC system.
3.1. Feature Fusion Process
First, the DBOMDFF-LCC technique applies a feature fusion process that combines three DL models, namely ResNet, DenseNet, and Inception-ResNet-v2. Entropy-based feature fusion is a procedure that integrates several features from distinct sources or modalities into a single feature representation using an entropy model. The purpose is to capture the complementary information in the various features and to improve the overall discriminative power of the fused feature representation.
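The text does not spell out the entropy rule used for fusion, so the following is only a minimal sketch under stated assumptions: the three per-sample feature vectors are concatenated, the Shannon entropy of every fused column is estimated with an equal-width histogram, and the columns with the highest entropy are kept. The function names `column_entropy` and `entropy_fuse`, the bin count, and the top-`keep` selection rule are hypothetical choices made for illustration.

```python
import math
from typing import List, Sequence


def column_entropy(values: Sequence[float], bins: int = 8) -> float:
    """Shannon entropy (in bits) of one feature column, via an equal-width histogram."""
    lo, hi = min(values), max(values)
    if hi == lo:  # a constant feature carries no information
        return 0.0
    counts = [0] * bins
    for v in values:
        idx = min(int((v - lo) / (hi - lo) * bins), bins - 1)
        counts[idx] += 1
    n = len(values)
    return -sum(c / n * math.log2(c / n) for c in counts if c)


def entropy_fuse(feature_sets: Sequence[List[List[float]]], keep: int) -> List[List[float]]:
    """Concatenate per-sample vectors from several extractors, then keep the
    `keep` columns with the highest entropy (original column order is preserved)."""
    n_samples = len(feature_sets[0])
    fused = [sum((fs[i] for fs in feature_sets), []) for i in range(n_samples)]
    n_cols = len(fused[0])
    scores = [column_entropy([row[j] for row in fused]) for j in range(n_cols)]
    top = sorted(sorted(range(n_cols), key=lambda j: -scores[j])[:keep])
    return [[row[j] for j in top] for row in fused]
```

In this sketch, constant columns (zero entropy) are discarded first, which is one simple way to favor features that vary across samples; other entropy-based weighting schemes are equally plausible.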
3.1.1. ResNet
The ResNet18 model consists of five convolutional structures, an activation function (Softmax) layer, and a fully connected layer [27]. The initial Conv structure comprises an activation, Conv, and BN layers. The complete parameters of this layer are as follows: the activation function of the activation layer utilized is ReLU, the number of Conv kernels from the 1D Conv layer was , the dimensional of Conv kernel is , and the padding mode remains unchanged. The second to fifth Conv structures had a similar form: they included a feature map block and Conv block; however, the count of Conv kernels differed based on the block. The numbers of Conv kernels of the second to fifth Conv designs are , , , and , respectively.
There were eight layers in all the Conv blocks: the BN layer, the activation layer, the Conv layer, the 1D shortcut connection layer, the 1D Conv layer, the activation layer, and the feature fusion layer. The parameters of the block are as follows: ReLU is used as the activation function of the activation layer, the Conv kernel size of the 1D Conv layer is fixed at three, and the padding mode remains unchanged. The feature map block and the Conv block share the same structure, except that the 1D shortcut connection layer is replaced by a feature mapping layer.
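The shortcut connection at the core of such a block can be illustrated with a minimal sketch. It assumes a single-channel 1D signal, a kernel size of three with zero padding so that the length is preserved, and no BN layer; `conv1d_same` and `residual_block` are hypothetical helper names, not the authors' implementation.

```python
from typing import List, Sequence


def conv1d_same(x: Sequence[float], kernel: Sequence[float]) -> List[float]:
    """1D convolution (cross-correlation) with zero padding; output length equals input length.
    Assumes an odd kernel size."""
    pad = len(kernel) // 2
    padded = [0.0] * pad + list(x) + [0.0] * pad
    return [sum(k * padded[i + j] for j, k in enumerate(kernel)) for i in range(len(x))]


def relu(x: Sequence[float]) -> List[float]:
    return [max(0.0, v) for v in x]


def residual_block(x: Sequence[float], k1: Sequence[float], k2: Sequence[float]) -> List[float]:
    """y = ReLU(F(x) + x), where F is conv -> ReLU -> conv and the shortcut is the identity."""
    out = conv1d_same(relu(conv1d_same(x, k1)), k2)
    return relu([a + b for a, b in zip(out, x)])
```

With all-zero kernels the block reduces to `relu(x)`, which shows that the shortcut lets the input pass through even when the convolutional path contributes nothing.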
3.1.2. DenseNet
The DenseNet201 structure is pretrained on the ImageNet database and contains three transition layers, four dense blocks, pooling, and convolutional layers [28]. Each layer is directly connected to all subsequent layers in the network, so the feature maps of the preceding layers are concatenated and passed on to the later layers. This improves the information flow among the layers and allows the model to extract and capture the gait features effectively.
$$x_l = H_l\left([x_0, x_1, \ldots, x_{l-1}]\right) \tag{1}$$
In Equation (1), $l$ denotes the layer index and $[x_0, x_1, \ldots, x_{l-1}]$ represents the concatenation of the feature maps produced by layers $0$ to $l-1$. $H_l$ signifies the composite function, which contains the convolution function, BN, and ReLU activation. Dense blocks are included in the model to adjust the dimensionality of the feature maps. The objective of the bottleneck layer is to reduce the number of input features and hence the computational cost of the network. A transition layer is inserted after every dense block except the final one to halve the size of the feature maps. The transition layer applies a convolution layer followed by average pooling. The growth rate, which is kept small, determines how much new information each layer adds to the collective information of the network.
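The dense connectivity pattern, in which every layer receives the concatenation of all earlier feature maps, can be sketched as follows. This is an illustration only: the composite function (written here as `composite`) is replaced by a random linear map followed by ReLU, features are flat vectors rather than image tensors, and `dense_block` and `growth_rate` are names assumed for the example.

```python
import random
from typing import List


def composite(inputs: List[float], out_dim: int, rng: random.Random) -> List[float]:
    """Stand-in for the composite function: a random linear map followed by ReLU
    (batch normalization and convolution are omitted in this sketch)."""
    return [max(0.0, sum(rng.uniform(-1, 1) * v for v in inputs)) for _ in range(out_dim)]


def dense_block(x0: List[float], num_layers: int, growth_rate: int, seed: int = 0) -> List[float]:
    """Each layer sees the concatenation of all previous outputs and adds
    `growth_rate` new features to the running state."""
    rng = random.Random(seed)
    features = [x0]  # [x_0, x_1, ..., x_{l-1}]
    for _ in range(num_layers):
        concat = [v for f in features for v in f]  # feature concatenation
        features.append(composite(concat, growth_rate, rng))
    return [v for f in features for v in f]
```

The output width grows linearly: an input of 4 features passed through 3 layers with a growth rate of 2 yields 4 + 3 × 2 = 10 features, which is why a small growth rate keeps the block compact.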
3.1.3. Inception-ResNet-v2
In the Inception-ResNet-v2 model, the pretrained topmost layer is removed first, since it is highly specific to the task on which the model was originally trained [29]. This model combines the design choices of the Inception model with residual connections. No other preprocessing is conducted: the image is first resized to the input size of the DCNN, and its pixel values are then rescaled to [0, 1]. Resizing the images does not affect the shape of the cellular structure or the accuracy, and it reduces the computational cost. The new topmost layers consist of a global average pooling layer, an FC layer of 256 neurons (with ReLU activation) and, lastly, an output layer that performs the classification into the four classes (with Softmax activation). In the first stage, only the FC layers are trained. In the second stage, the DCNN is retrained together with the topmost layers, and the weights of the pretrained network layers are fine-tuned. It is common to freeze the weights of the bottom layers (to limit over-fitting) and to fine-tune only the high-level features. The most generic features (blobs and edges) are thus retained.
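The pixel rescaling and the replacement head described here (global average pooling, a 256-unit ReLU layer, and a Softmax output) can be sketched in plain Python. The sketch assumes 8-bit input images, randomly initialized weights, and tiny feature maps; `rescale`, `global_average_pool`, and `classify` are hypothetical names, and the number of classes is left as a parameter.

```python
import math
import random
from typing import List


def rescale(image: List[List[int]], max_value: int = 255) -> List[List[float]]:
    """Map pixel intensities to [0, 1]; assumes 8-bit images by default."""
    return [[p / max_value for p in row] for row in image]


def global_average_pool(maps: List[List[List[float]]]) -> List[float]:
    """Reduce each H x W feature map to its mean (one value per channel)."""
    return [sum(sum(row) for row in fm) / (len(fm) * len(fm[0])) for fm in maps]


def dense(x: List[float], weights: List[List[float]], bias: List[float]) -> List[float]:
    return [sum(w * v for w, v in zip(ws, x)) + b for ws, b in zip(weights, bias)]


def softmax(z: List[float]) -> List[float]:
    m = max(z)  # subtract the maximum for numerical stability
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


def classify(maps: List[List[List[float]]], n_classes: int, hidden: int = 256, seed: int = 0) -> List[float]:
    """GAP -> FC(hidden, ReLU) -> FC(n_classes, Softmax) with random, untrained weights."""
    rng = random.Random(seed)
    pooled = global_average_pool(maps)
    w1 = [[rng.gauss(0, 0.1) for _ in pooled] for _ in range(hidden)]
    h = [max(0.0, v) for v in dense(pooled, w1, [0.0] * hidden)]
    w2 = [[rng.gauss(0, 0.1) for _ in range(hidden)] for _ in range(n_classes)]
    return softmax(dense(h, w2, [0.0] * n_classes))
```

In a two-stage schedule of the kind described, only `w1` and `w2` would be updated at first, and the backbone that produces `maps` would be unfrozen afterwards.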
3.2. Hyperparameter Tuning Process
In this work, the DBO approach is employed to select optimal hyperparameters for the three DL models. DBO is a recent swarm intelligence (SI) method inspired by the behaviors of dung beetles (DBs), namely ball rolling, dancing, foraging, stealing, and breeding. Accordingly, the DBO method includes four update strategies: ball rolling, breeding, foraging, and stealing [30]. Ball rolling takes place in two modes, unobstructed and obstructed.
3.2.1. Obstacle-Free Mode
The DB uses the sun to navigate while rolling its dung ball when it moves forward without any obstacles. In the DBO algorithm, the position of the DB changes with the light intensity as follows:
$$x_i(t+1) = x_i(t) + \alpha \times k \times x_i(t-1) + b \times \Delta x, \qquad \Delta x = \left| x_i(t) - X^{w} \right| \tag{2}$$
In Equation (2), $t$ denotes the current iteration number, $k \in (0, 0.2]$ is a fixed parameter signifying the deflection coefficient, and $x_i(t)$ represents the position of the $i$-th DB in the population at the $t$-th iteration. $b$ denotes a constant within $(0, 1)$, and $\alpha$ is the natural coefficient, which takes the value $-1$ or $1$, with $-1$ representing a deviation from the original direction and $1$ signifying no deviation. $X^{w}$ denotes the worst position in the current population, and the change in light intensity is simulated by $\Delta x$.
3.2.2. Barrier Mode
When a DB meets an obstacle that prevents it from moving forward, it dances to find a new route. The authors of [30] use a tangent function to simulate the dancing behavior and obtain the new rolling direction, which is considered only within the range $[0, \pi]$; the beetle continues rolling the dung ball as soon as it finds a new direction. The position is updated as follows:
$$x_i(t+1) = x_i(t) + \tan(\theta) \left| x_i(t) - x_i(t-1) \right| \tag{3}$$
If $\theta = 0$, $\pi/2$, or $\pi$, the position of the DB is not updated.
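The two rolling modes can be sketched as small update functions. The formulas follow the form commonly given for DBO [30]: obstacle-free rolling adds a deflection term and a light-intensity term measured against the worst position, while dancing moves the beetle by the tangent of a deflection angle. The function names `roll` and `dance` and the coefficient values in the usage are illustrative assumptions, not settings reported in this study.

```python
import math
from typing import List, Sequence


def roll(x: Sequence[float], x_prev: Sequence[float], x_worst: Sequence[float],
         alpha: int, k: float, b: float) -> List[float]:
    """Obstacle-free rolling: x(t+1) = x(t) + alpha*k*x(t-1) + b*|x(t) - X_worst|.
    alpha is -1 or 1, k lies in (0, 0.2], and b lies in (0, 1)."""
    return [xi + alpha * k * xp + b * abs(xi - xw)
            for xi, xp, xw in zip(x, x_prev, x_worst)]


def dance(x: Sequence[float], x_prev: Sequence[float], theta: float) -> List[float]:
    """Barrier mode: x(t+1) = x(t) + tan(theta)*|x(t) - x(t-1)| for theta in [0, pi].
    The position is left unchanged when theta is 0, pi/2, or pi."""
    if any(math.isclose(theta, s, abs_tol=1e-12) for s in (0.0, math.pi / 2, math.pi)):
        return list(x)
    return [xi + math.tan(theta) * abs(xi - xp) for xi, xp in zip(x, x_prev)]
```

For example, `roll([1.0, 2.0], [0.5, 1.0], [3.0, 0.0], alpha=1, k=0.1, b=0.3)` moves the first coordinate by 0.1 × 0.5 + 0.3 × 2.0 = 0.65, while `dance` with `theta = math.pi / 2` returns the position unchanged because the tangent is undefined there.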
The female DB rolls the dung ball to a safe region for laying eggs and hides it to provide a suitable habitat for the progeny. A boundary selection strategy is used to model the spawning region of a female DB:
$$Lb^{*} = \max\left(X^{*} \times (1 - R),\ Lb\right), \qquad Ub^{*} = \min\left(X^{*} \times (1 + R),\ Ub\right) \tag{4}$$
In Equation (4), $Lb$ and $Ub$ are the lower and upper boundaries of the optimization problem, respectively. $R = 1 - t/T_{\max}$, where $T_{\max}$ is the maximum number of iterations. $X^{*}$ is the current local best position. The lower and upper boundaries of the spawning region are defined by $Lb^{*}$ and $Ub^{*}$, which implies that the spawning region is adjusted dynamically with the iteration count.
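After a female DB finds the spawning region, she lays her eggs in that region. Because the spawning region is adjusted dynamically with the iteration count, the position of the brood ball is also dynamic over the iterations:
$$B_i(t+1) = X^{*} + b_1 \times \left(B_i(t) - Lb^{*}\right) + b_2 \times \left(B_i(t) - Ub^{*}\right) \tag{5}$$
In Equation (5), $B_i(t)$ denotes the position of the $i$-th brood ball at the $t$-th iteration, $X^{*}$ is the current local best position, $Lb^{*}$ and $Ub^{*}$ are the lower and upper boundaries of the spawning region, and $D$ denotes the number of parameters (dimensions) of the optimization problem. $b_1$ and $b_2$ are two independent random vectors of size $1 \times D$, and the position of the brood ball must be limited to the spawning region.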
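A minimal sketch of the spawning step is given below, following the form usually stated for DBO [30]: the spawning region shrinks around the current local best position as the iteration counter grows, and each brood ball is moved relative to that region and then clipped back into it. The helper names `dynamic_bounds` and `update_brood` are hypothetical, and the sketch assumes positive coordinates so that the lower bound stays below the upper bound.

```python
import random
from typing import List, Sequence, Tuple


def dynamic_bounds(center: Sequence[float], lb: Sequence[float], ub: Sequence[float],
                   t: int, t_max: int) -> Tuple[List[float], List[float]]:
    """Region around `center` that narrows with R = 1 - t/t_max, limited to [lb, ub]."""
    r = 1.0 - t / t_max
    lo = [max(c * (1 - r), l) for c, l in zip(center, lb)]
    hi = [min(c * (1 + r), u) for c, u in zip(center, ub)]
    return lo, hi


def update_brood(ball: Sequence[float], x_star: Sequence[float], lo: Sequence[float],
                 hi: Sequence[float], rng: random.Random) -> List[float]:
    """B(t+1) = X* + b1*(B(t) - Lb*) + b2*(B(t) - Ub*), clipped to the spawning region."""
    new = []
    for b, xs, l, h in zip(ball, x_star, lo, hi):
        b1, b2 = rng.random(), rng.random()  # independent random coefficients per dimension
        v = xs + b1 * (b - l) + b2 * (b - h)
        new.append(min(max(v, l), h))  # keep the brood ball inside the spawning region
    return new
```

With a local best of `[2.0, 4.0]`, problem bounds `[0, 10]`, and `t = 50` of `t_max = 100`, the region is `[1.0, 3.0]` in the first dimension and `[2.0, 6.0]` in the second; it collapses onto the local best as `t` approaches `t_max`.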
The foraging behavior applies mostly to small DBs. Some mature DBs emerge from the ground looking for food, and the optimal foraging region for small DBs is updated dynamically:
$$Lb^{b} = \max\left(X^{b} \times (1 - R),\ Lb\right), \qquad Ub^{b} = \min\left(X^{b} \times (1 + R),\ Ub\right) \tag{6}$$
In Equation (6), $R = 1 - t/T_{\max}$ is as defined for Equation (4), $Lb$ and $Ub$ are the boundaries of the optimization problem, and $X^{b}$ signifies the global best position. $Lb^{b}$ and $Ub^{b}$ are the lower and upper bounds of the foraging area of the small DBs, respectively. The position is updated as follows:
$$x_i(t+1) = x_i(t) + C_1 \times \left(x_i(t) - Lb^{b}\right) + C_2 \times \left(x_i(t) - Ub^{b}\right) \tag{7}$$
In Equation (7), $C_1$ is a random number that follows the normal distribution, and $C_2$ is a random vector within $(0, 1)$.
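The foraging update can be sketched as follows, using the form usually given for DBO [30]: the beetle moves relative to the lower and upper bounds of the foraging area, scaled by a normally distributed scalar and a uniform random vector. The coefficients are passed in explicitly so the step is reproducible; `forage` and `sample_coefficients` are hypothetical names.

```python
import random
from typing import List, Sequence, Tuple


def forage(x: Sequence[float], lb_b: Sequence[float], ub_b: Sequence[float],
           c1: float, c2: Sequence[float]) -> List[float]:
    """x(t+1) = x(t) + C1*(x(t) - Lb_b) + C2*(x(t) - Ub_b), applied per dimension."""
    return [xi + c1 * (xi - lo) + c2j * (xi - hi)
            for xi, lo, hi, c2j in zip(x, lb_b, ub_b, c2)]


def sample_coefficients(dim: int, rng: random.Random) -> Tuple[float, List[float]]:
    """C1 ~ N(0, 1) as a scalar; C2 as a vector of uniform draws
    (random() returns values in [0, 1), used here as a stand-in for (0, 1))."""
    return rng.gauss(0.0, 1.0), [rng.random() for _ in range(dim)]
```

For a beetle at 2.0 inside a foraging area of [1.0, 3.0], `forage([2.0], [1.0], [3.0], 0.5, [0.25])` gives 2.0 + 0.5 × 1.0 + 0.25 × (−1.0) = 2.25.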
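Some DBs, known as thieves, steal dung balls from other individuals. The position of a thieving DB is updated as follows:
$$x_i(t+1) = X^{b} + S \times g \times \left( \left| x_i(t) - X^{*} \right| + \left| x_i(t) - X^{b} \right| \right) \tag{8}$$
In Equation (8), $X^{b}$ is the global best position, $X^{*}$ is the current local best position, $S$ indicates a constant value, and $g$ denotes a randomly selected vector of size $1 \times D$ that follows the normal distribution.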
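A sketch of the stealing update, again in the form usually given for DBO [30]: the thief is placed near the global best position, displaced by its distances to the local best and global best positions. The function name `steal` is hypothetical, and the value of the constant `s` used in the example is an assumption rather than a setting reported in this study.

```python
from typing import List, Sequence


def steal(x: Sequence[float], x_star: Sequence[float], x_best: Sequence[float],
          s: float, g: Sequence[float]) -> List[float]:
    """x(t+1) = X_b + S * g * (|x(t) - X*| + |x(t) - X_b|), applied per dimension.
    `g` is expected to hold standard-normal draws, one per dimension."""
    return [xb + s * gj * (abs(xi - xs) + abs(xi - xb))
            for xi, xs, xb, gj in zip(x, x_star, x_best, g)]
```

For instance, `steal([1.0], [2.0], [3.0], s=0.5, g=[2.0])` yields 3.0 + 0.5 × 2.0 × (1.0 + 2.0) = 6.0.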
The DBO approach uses a fitness function (FF) to achieve better classification results. The FF assigns a positive value that represents the quality of each candidate solution. In this study, the classifier error rate is taken as the FF to be minimized, as given in Equation (9).
$$fitness(x_i) = ClassifierErrorRate(x_i) = \frac{\text{No. of misclassified samples}}{\text{Total no. of samples}} \times 100 \tag{9}$$
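The fitness described here, the classifier error rate to be minimized, can be computed as the percentage of misclassified samples. The following is a minimal sketch; `classifier_error_rate` and `best_candidate` are hypothetical names, and the candidate hyperparameter settings in the usage are placeholders.

```python
from typing import Dict, Hashable, Sequence


def classifier_error_rate(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Percentage of misclassified samples; lower is better."""
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and of equal length")
    wrong = sum(t != p for t, p in zip(y_true, y_pred))
    return 100.0 * wrong / len(y_true)


def best_candidate(predictions: Dict[Hashable, Sequence[int]], y_true: Sequence[int]) -> Hashable:
    """Return the candidate (e.g., a hyperparameter setting) with the lowest error rate."""
    return min(predictions, key=lambda name: classifier_error_rate(y_true, predictions[name]))
```

Given validation labels `[0, 1, 2, 2]`, a candidate predicting `[0, 1, 1, 2]` has a fitness of 25.0, and an optimizer such as DBO would prefer any candidate with a lower value.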
3.3. Lung Cancer Detection Process
To detect and classify lung cancer, the fused feature vectors are passed to the LSTM model [31]. In a conventional RNN, the back-propagated gradient approaches zero as the time interval increases. This vanishing gradient is a weakness encountered when RNNs are applied to long-term sequence modeling. The LSTM memory cell has a node with a self-connected recurrent edge of fixed weight, which guarantees that the gradient can pass across many time steps without vanishing. The multiplicative gates allow the model to store information over long periods, thus mitigating the vanishing gradient problem commonly observed in traditional RNN models.
Assume input sequence data are represented as and output series data are represented as , where denotes the forecast horizon. The LSTM calculates the forecast outcome automatically in the next time step using the prior data, without predefining the lag observation to utilize:
(10) |
(11) |
(12) |
(13) |
(14) |
where $\sigma$ represents the standard logistic sigmoid function, and $W$ and $b$ denote the weight matrices and bias vectors, respectively, defined as:
(15) |
(16) |
(17) |
where the parameters indicate the cell activation vector, input gate, forget gate, and output gate, respectively. and denote the respective transformations of the sigmoid function. This certain feature makes LSTM an accurate and reliable method for lung cancer detection.
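One step of an LSTM cell can be sketched in plain Python to make the roles of the gates concrete. This uses the widely used formulation without peephole connections, which may differ in detail from Equations (10)–(17); the parameter names (`W_i`, `b_i`, and so on) and the helper `zero_params` are assumptions made for the example.

```python
import math
from typing import Dict, List, Sequence, Tuple


def sigmoid(a: float) -> float:
    return 1.0 / (1.0 + math.exp(-a))


def affine(w: List[List[float]], v: Sequence[float], b: Sequence[float]) -> List[float]:
    return [sum(wij * vj for wij, vj in zip(row, v)) + bi for row, bi in zip(w, b)]


def lstm_step(x: Sequence[float], h_prev: Sequence[float], c_prev: Sequence[float],
              p: Dict[str, list]) -> Tuple[List[float], List[float]]:
    """One time step: gates act on the concatenation [x_t, h_{t-1}]."""
    z = list(x) + list(h_prev)
    i = [sigmoid(a) for a in affine(p["W_i"], z, p["b_i"])]    # input gate
    f = [sigmoid(a) for a in affine(p["W_f"], z, p["b_f"])]    # forget gate
    o = [sigmoid(a) for a in affine(p["W_o"], z, p["b_o"])]    # output gate
    g = [math.tanh(a) for a in affine(p["W_c"], z, p["b_c"])]  # candidate cell state
    c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c_prev, i, g)]
    h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
    return h, c


def zero_params(n_in: int, n_hidden: int) -> Dict[str, list]:
    """All-zero weights and biases, convenient for checking the gate arithmetic by hand."""
    p: Dict[str, list] = {}
    for gate in "ifoc":
        p["W_" + gate] = [[0.0] * (n_in + n_hidden) for _ in range(n_hidden)]
        p["b_" + gate] = [0.0] * n_hidden
    return p
```

With all-zero parameters every gate equals 0.5 and the candidate state is 0, so a previous cell state of 2.0 becomes 1.0. The additive cell update `f * c_prev + i * g` is what lets gradients survive across many time steps.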
4. Experimental Validation
In this section, the results of the DBOMDFF-LCC approach are examined on the lungdb database [32], comprising 100 samples and 3 classes, as shown in Table 1. Figure 2 shows sample images. For experimental validation, the dataset is split into training and testing sets at ratios of 80:20 and 70:30.
Table 1.
Class | No. of Samples |
---|---|
Normal | 35 |
Benign | 32 |
Malignant | 33 |
Total Samples | 100 |
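The 80:20 and 70:30 partitions of the 100 samples can be produced with a simple shuffled split. The text does not state whether the split was random or stratified by class, so the sketch below assumes a plain random split with a fixed seed; `split_indices` is a hypothetical helper.

```python
import random
from typing import List, Tuple


def split_indices(n_samples: int, train_fraction: float, seed: int = 0) -> Tuple[List[int], List[int]]:
    """Shuffle sample indices and cut them into training and testing parts."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    cut = round(n_samples * train_fraction)
    return indices[:cut], indices[cut:]
```

For the dataset in Table 1, `split_indices(100, 0.8)` yields 80 training and 20 testing indices, and `split_indices(100, 0.7)` yields 70 and 30.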
The confusion matrices of the DBOMDFF-LCC approach for lung cancer recognition are shown in Figure 3. The results show that the DBOMDFF-LCC system recognizes the three classes proficiently.
Table 2 and Figure 4 report the overall lung cancer detection results of the DBOMDFF-LCC technique for the 80:20 split between the training phase (TRP) and the testing phase (TSP). The results show that the DBOMDFF-LCC system recognizes all three classes efficiently. On the 80% TRP, the DBOMDFF-LCC system attains average accuracy, precision, recall, specificity, and F-score of 99.17%, 98.81%, 98.72%, 99.37%, and 98.74%, respectively. On the 20% TSP, the DBOMDFF-LCC method reaches average accuracy, precision, recall, specificity, and F-score of 96.67%, 95.83%, 95.83%, 97.44%, and 95.56%, respectively.
Table 2.
Class | Accuracy (%) | Precision (%) | Recall (%) | Specificity (%) | F-Score (%) |
---|---|---|---|---|---|
Training Phase (80%) | |||||
Normal | 98.75 | 96.43 | 100.00 | 98.11 | 98.18 |
Benign | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
Malignant | 98.75 | 100.00 | 96.15 | 100.00 | 98.04 |
Average | 99.17 | 98.81 | 98.72 | 99.37 | 98.74 |
Testing Phase (20%) | |||||
Normal | 95.00 | 100.00 | 87.50 | 100.00 | 93.33 |
Benign | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
Malignant | 95.00 | 87.50 | 100.00 | 92.31 | 93.33 |
Average | 96.67 | 95.83 | 95.83 | 97.44 | 95.56 |
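The per-class values in Table 2 are consistent with one-vs-rest accuracy, precision, recall, specificity, and F-score, averaged over the three classes. The sketch below computes these measures from a confusion matrix. The matrix used in the example is not taken from Figure 3; it is an inferred matrix for the 20-sample testing phase (8 Normal, 5 Benign, 7 Malignant, with one Normal sample predicted as Malignant) that reproduces the reported testing-phase figures. The function names are hypothetical.

```python
from typing import Dict, List


def one_vs_rest_metrics(cm: List[List[int]]) -> List[Dict[str, float]]:
    """cm[i][j] = number of samples of true class i predicted as class j.
    Returns one dictionary of percentage metrics per class."""
    total = sum(map(sum, cm))
    rows = []
    for k in range(len(cm)):
        tp = cm[k][k]
        fn = sum(cm[k]) - tp
        fp = sum(row[k] for row in cm) - tp
        tn = total - tp - fn - fp
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        spec = tn / (tn + fp) if tn + fp else 0.0
        f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
        rows.append({
            "accuracy": 100.0 * (tp + tn) / total,
            "precision": 100.0 * prec,
            "recall": 100.0 * rec,
            "specificity": 100.0 * spec,
            "f_score": 100.0 * f1,
        })
    return rows


def macro_average(rows: List[Dict[str, float]]) -> Dict[str, float]:
    """Unweighted mean of each metric over the classes."""
    return {key: sum(r[key] for r in rows) / len(rows) for key in rows[0]}


# Inferred testing-phase matrix (rows: true Normal, Benign, Malignant).
cm_test = [[7, 0, 1],
           [0, 5, 0],
           [0, 0, 7]]
average = macro_average(one_vs_rest_metrics(cm_test))
print({key: round(value, 2) for key, value in average.items()})
```

Rounded to two decimals, the averages are 96.67, 95.83, 95.83, 97.44, and 95.56, matching the testing-phase row of Table 2. Note that the one-vs-rest accuracy counts true negatives, so it is higher than the plain fraction of correctly labeled samples (19 of 20).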
Table 3 and Figure 5 report the overall lung cancer detection results of the DBOMDFF-LCC system for the 70:30 split between the training phase (TRP) and the testing phase (TSP). The results show that the DBOMDFF-LCC system recognizes all three classes efficiently. On the 70% TRP, the DBOMDFF-LCC method reaches average accuracy, precision, recall, specificity, and F-score of 99.05%, 98.72%, 98.48%, 99.26%, and 98.57%, respectively. On the 30% TSP, the DBOMDFF-LCC approach attains average accuracy, precision, recall, specificity, and F-score of 95.56%, 92.96%, 94.87%, 96.90%, and 93.51%, respectively.
Table 3.
Class | Accuracy (%) | Precision (%) | Recall (%) | Specificity (%) | F-Score (%) |
---|---|---|---|---|---|
Training Phase (70%) | |||||
Normal | 98.57 | 100.00 | 95.45 | 100.00 | 97.67 |
Benign | 100.00 | 100.00 | 100.00 | 100.00 | 100.00 |
Malignant | 98.57 | 96.15 | 100.00 | 97.78 | 98.04 |
Average | 99.05 | 98.72 | 98.48 | 99.26 | 98.57 |
Testing Phase (30%) | |||||
Normal | 93.33 | 100.00 | 84.62 | 100.00 | 91.67 |
Benign | 96.67 | 90.00 | 100.00 | 95.24 | 94.74 |
Malignant | 96.67 | 88.89 | 100.00 | 95.45 | 94.12 |
Average | 95.56 | 92.96 | 94.87 | 96.90 | 93.51 |
Figure 6 shows the training behavior of the DBOMDFF-LCC method on the 80:20 and 70:30 splits. Figure 6a,c presents the accuracy analysis of the DBOMDFF-LCC model on the two splits. The results show that the DBOMDFF-LCC technique reaches its highest accuracy values at higher epochs. In addition, the validation accuracy being higher than the training accuracy illustrates that the DBOMDFF-LCC method learns capably on the test database. Finally, Figure 6b,d presents the loss analysis of the DBOMDFF-LCC approach on the two splits. The training and validation loss values are close to each other, which indicates that the DBOMDFF-LCC system learns effectively on the test database.
Figure 7 shows the precision-recall (PR) and receiver operating characteristic (ROC) analyses of the DBOMDFF-LCC algorithm on the 80:20 and 70:30 splits. Figure 7a,c presents the PR analysis of the DBOMDFF-LCC approach on the two splits. The results indicate that the DBOMDFF-LCC technique reaches high PR values in all classes. Lastly, Figure 7b,d illustrates the ROC analysis of the DBOMDFF-LCC model on the two splits. The results indicate that the DBOMDFF-LCC system also attains high ROC values in all classes.
Table 4 and Figure 8 compare the DBOMDFF-LCC method with existing systems [33]. The results show that the DBOMDFF-LCC approach achieves the best results. In terms of accuracy, the DBOMDFF-LCC technique obtains a higher value of 99.17%, while the ODNN, KNN, DNN, YOLO-DLN, DBN-LND, and AGFLCC-DGM models achieve lower values of 92.12%, 96.52%, 95.45%, 94.75%, 95%, and 98.91%, respectively. Meanwhile, in terms of precision, the DBOMDFF-LCC approach attains a superior value of 98.81%, while the ODNN, KNN, DNN, YOLO-DLN, DBN-LND, and AGFLCC-DGM approaches achieve lower values of 91.29%, 97.03%, 96.95%, 96.49%, 97.92%, and 96.88%, respectively. Furthermore, with respect to recall, the DBOMDFF-LCC technique obtains a higher value of 98.72%, while the ODNN, KNN, DNN, YOLO-DLN, DBN-LND, and AGFLCC-DGM systems achieve lower values of 88.56%, 86.45%, 92.85%, 94.70%, 93.50%, and 98.46%, respectively. These results show the superior lung cancer detection performance of the DBOMDFF-LCC technique. The enhanced performance of the proposed model is attributed to the feature fusion and hyperparameter tuning process.
Table 4.
Methods | Accuracy (%) | Precision (%) | Recall (%) | Specificity (%) |
---|---|---|---|---|
ODNN Model | 92.12 | 91.29 | 88.56 | 88.54 |
KNN Model | 96.52 | 97.03 | 86.45 | 92.10 |
DNN Model | 95.45 | 96.95 | 92.85 | 89.40 |
YOLO-DLN | 94.75 | 96.49 | 94.70 | 95.10 |
DBN-LND | 95.00 | 97.92 | 93.50 | 90.20 |
AGFLCC-DGM | 98.91 | 96.88 | 98.46 | 98.89 |
DBOMDFF-LCC | 99.17 | 98.81 | 98.72 | 99.37 |
5. Conclusions
An automated lung cancer detection tool named the DBOMDFF-LCC system is developed in this study. The proposed DBOMDFF-LCC algorithm is built on feature fusion and hyperparameter tuning. First, the DBOMDFF-LCC technique applies a feature fusion process that combines three DL models, namely ResNet, DenseNet, and Inception-ResNet-v2. Additionally, the DBO approach is employed to select optimal hyperparameters for the three DL models. For lung cancer detection, the DBOMDFF-LCC technique uses the LSTM approach. The DBOMDFF-LCC system is evaluated on a medical dataset using several evaluation metrics. The extensive comparative results highlight the improved performance of the DBOMDFF-LCC technique in lung cancer classification.
Author Contributions
Conceptualization, M.A. (Mohammad Alamgeer); Methodology, N.A.; Validation, A.M.; Formal analysis, M.A. (Mohammed Assiri); Data curation, H.M.A. All authors have read and agreed to the published version of the manuscript.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data presented in this study are available in this article.
Conflicts of Interest
The authors declare that they have no conflict of interest. The manuscript was written through the contributions of all authors. All authors have approved the final version of the manuscript.
Funding Statement
The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work through a large group Research Project under grant number (RGP2/134/44). Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R237), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. Research Supporting Project number (RSPD2023R608), King Saud University, Riyadh, Saudi Arabia. This study is supported via funding from Prince Sattam bin Abdulaziz University project number (PSAU/2023/R/1444). This study is partially funded by the Future University in Egypt (FUE).
Footnotes
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
References
- 1.Yang Y., Yang J., Shen L., Chen J., Xia L., Ni B., Ge L., Wang Y., Lu S. A multi-omics-based serial deep learning approach to predict clinical outcomes of single-agent anti-PD-1/PD-L1 immunotherapy in advanced stage non-small-cell lung cancer. Am. J. Transl. Res. 2021;13:743. [PMC free article] [PubMed] [Google Scholar]
- 2.Yin M., Liang X., Wang Z., Zhou Y., He Y., Xue Y., Gao J., Lin J., Yu C., Liu L., et al. Identification of Asymptomatic COVID-19 Patients on Chest CT Images Using Transformer-Based or Convolutional Neural Network–Based Deep Learning Models. J. Digit. Imaging. 2023;36:827–836. doi: 10.1007/s10278-022-00754-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wang Z., Yin Z., Argyris Y.A. Detecting medical misinformation on social media using multimodal deep learning. IEEE J. Biomed. Health Inform. 2020;25:2193–2203. doi: 10.1109/JBHI.2020.3037027. [DOI] [PubMed] [Google Scholar]
- 4.Karaddi S.H., Sharma L.D. Automated multi-class classification of lung diseases from CXR-images using pre-trained convolutional neural networks. Expert Syst. Appl. 2023;211:118650. doi: 10.1016/j.eswa.2022.118650. [DOI] [Google Scholar]
- 5.Sait U., KV G.L., Shivakumar S., Kumar T., Bhaumik R., Prajapati S., Bhalla K., Chakrapani A. A deep-learning based multimodal system for COVID-19 diagnosis using breathing sounds and chest X-ray images. Appl. Soft Comput. 2021;109:107522. doi: 10.1016/j.asoc.2021.107522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Khan M.A., Khan A., Alhaisoni M., Alqahtani A., Alsubai S., Alharbi M., Malik N.A., Damaševičius R. Multimodal brain tumor detection and classification using deep saliency map and improved dragonfly optimization algorithm. Int. J. Imaging Syst. Technol. 2023;33:572–587. doi: 10.1002/ima.22831. [DOI] [Google Scholar]
- 7.Xu C., Wang Y., Zhang D., Han L., Zhang Y., Chen J., Li S. BMAnet: Boundary mining with adversarial learning for semi-supervised 2D myocardial infarction segmentation. IEEE J. Biomed. Health Inform. 2022;27:87–96. doi: 10.1109/JBHI.2022.3215536. [DOI] [PubMed] [Google Scholar]
- 8.Zhang D., Xu C., Li S. Heuristic multi-modal integration framework for liver tumor detection from multi-modal non-enhanced MRIs. Expert Syst. Appl. 2023;221:119782. doi: 10.1016/j.eswa.2023.119782. [DOI] [Google Scholar]
- 9.Li S., Xie Y., Wang G., Zhang L., Zhou W. Adaptive multimodal fusion with attention guided deep supervision net for grading hepatocellular carcinoma. IEEE J. Biomed. Health Inform. 2022;26:4123–4131. doi: 10.1109/JBHI.2022.3161466. [DOI] [PubMed] [Google Scholar]
- 10.Barrett J., Viana T. EMM-LC Fusion: Enhanced Multimodal Fusion for Lung Cancer Classification. Ai. 2022;3:659–682. doi: 10.3390/ai3030038. [DOI] [Google Scholar]
- 11.Zhang X., Zhang Y., Zhang G., Qiu X., Tan W., Yin X., Liao L. Deep learning with radiomics for disease diagnosis and treatment: Challenges and potential. Front. Oncol. 2022;12:773840. doi: 10.3389/fonc.2022.773840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Chassagnon G., Vakalopoulou M., Régent A., Sahasrabudhe M., Marini R., Hoang-Thi T.N., Dinh-Xuan A.T., Dunogué B., Mouthon L., Paragios N., et al. Elastic registration–driven deep learning for longitudinal assessment of systemic sclerosis interstitial lung disease at CT. Radiology. 2021;298:189–198. doi: 10.1148/radiol.2020200319. [DOI] [PubMed] [Google Scholar]
- 13.Naz Z., Khan M.U.G., Saba T., Rehman A., Nobanee H., Bahaj S.A. An Explainable AI-Enabled Framework for Interpreting Pulmonary Diseases from Chest Radiographs. Cancers. 2023;15:314. doi: 10.3390/cancers15010314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Moujahid H., Cherradi B., Gannour O.E., Bahatti L., Terrada O., Hamida S. Convolutional neural network based classification of patients with pneumonia using X-ray lung images. Adv.Sci. Technol. Eng. Syst. J. 2020;5:167–175. doi: 10.25046/aj050522. [DOI] [Google Scholar]
- 15. Verma P., Dumka A., Singh R., Ashok A., Singh A., Aljahdali H.M., Kadry S., Rauf H.T. A deep learning based approach for patient pulmonary CT image screening to predict coronavirus (SARS-CoV-2) infection. Diagnostics. 2021;11:1735. doi: 10.3390/diagnostics11091735.
- 16. Dhivya N., Sharmila P. Multimodal Feature and Transfer Learning in Deep Ensemble Model for Lung Disease Prediction. J. Data Acquis. Process. 2023;38:271.
- 17. Yu G., Yu Z., Shi Y., Wang Y., Liu X., Li Z., Zhao Y., Sun F., Yu Y., Shu Q. Identification of pediatric respiratory diseases using a fine-grained diagnosis system. J. Biomed. Inform. 2021;117:103754. doi: 10.1016/j.jbi.2021.103754.
- 18. Agarwal S., Arya K.V., Meena Y.K. MultiFusionNet: Multilayer Multimodal Fusion of Deep Neural Networks for Chest X-ray Image Classification. 2023.
- 19. Behrad F., Abadeh M.S. An overview of deep learning methods for multimodal medical data mining. Expert Syst. Appl. 2022;200:117006. doi: 10.1016/j.eswa.2022.117006.
- 20. Ullah I., Ali F., Shah B., El-Sappagh S., Abuhmed T., Park S.H. A deep learning based dual encoder–decoder framework for anatomical structure segmentation in chest X-ray images. Sci. Rep. 2023;13:791. doi: 10.1038/s41598-023-27815-w.
- 21. Wang L., Ding W., Mo Y., Shi D., Zhang S., Zhong L., Wang K., Wang J., Huang C., Zhang S., et al. Distinguishing nontuberculous mycobacteria from Mycobacterium tuberculosis lung disease from CT images using a deep learning framework. Eur. J. Nucl. Med. Mol. Imaging. 2021;48:4293–4306. doi: 10.1007/s00259-021-05432-x.
- 22. Akbulut Y. Automated Pneumonia Based Lung Diseases Classification with Robust Technique Based on a Customized Deep Learning Approach. Diagnostics. 2023;13:260. doi: 10.3390/diagnostics13020260.
- 23. Chouhan V., Singh S.K., Khamparia A., Gupta D., Tiwari P., Moreira C., Damaševičius R., De Albuquerque V.H.C. A novel transfer learning based approach for pneumonia detection in chest X-ray images. Appl. Sci. 2020;10:559. doi: 10.3390/app10020559.
- 24. Özbey M., Dalmaz O., Dar S.U., Bedel H.A., Özturk Ş., Güngör A., Çukur T. Unsupervised medical image translation with adversarial diffusion models. IEEE Trans. Med. Imaging. 2023. doi: 10.1109/TMI.2023.3290149.
- 25. Dalmaz O., Yurt M., Çukur T. ResViT: Residual vision transformers for multimodal medical image synthesis. IEEE Trans. Med. Imaging. 2022;41:2598–2614. doi: 10.1109/TMI.2022.3167808.
- 26. Yurt M., Dar S.U., Erdem A., Erdem E., Oguz K.K., Çukur T. mustGAN: Multi-stream generative adversarial networks for MR image synthesis. Med. Image Anal. 2021;70:101944. doi: 10.1016/j.media.2020.101944.
- 27. Zhao Y., Zhang X., Feng W., Xu J. Deep Learning Classification by ResNet-18 Based on the Real Spectral Dataset from Multispectral Remote Sensing Images. Remote Sens. 2022;14:4883. doi: 10.3390/rs14194883.
- 28. Venu S.K. An ensemble-based approach by fine-tuning the deep transfer learning models to classify pneumonia from chest X-ray images. arXiv. 2020. arXiv:2011.05543.
- 29. Ferreira C.A., Melo T., Sousa P., Meyer M.I., Shakibapour E., Costa P., Campilho A. Image Analysis and Recognition, Proceedings of the 15th International Conference, ICIAR 2018, Póvoa de Varzim, Portugal, 27–29 June 2018. Springer International Publishing; Cham, Switzerland: 2018. Classification of breast cancer histology images through transfer learning using a pre-trained inception resnet v2; pp. 763–770.
- 30. Zhang R., Zhu Y. Predicting the Mechanical Properties of Heat-Treated Woods Using Optimization-Algorithm-Based BPNN. Forests. 2023;14:935. doi: 10.3390/f14050935.
- 31. Essien A., Giannetti C. A deep learning framework for univariate time series prediction using convolutional LSTM stacked autoencoders; Proceedings of the 2019 IEEE International Symposium on INnovations in Intelligent SysTems and Applications (INISTA); Sofia, Bulgaria. 3–5 July 2019; Piscataway, NJ, USA: IEEE; 2019. pp. 1–6.
- 32. Available online: http://www.via.cornell.edu/lungdb.html (accessed on 16 February 2023).
- 33. Lakshmanaprabu S.K., Mohanty S.N., Shankar K., Arunkumar N., Ramirez G. Optimal deep learning model for classification of lung cancer on CT images. Future Gener. Comput. Syst. 2019;92:374–382.
Data Availability Statement
The data presented in this study are available in this article.