Medical image fusion via discrete stationary wavelet transform and an enhanced radial basis function neural network
Introduction
Image fusion technology synthesizes multiple source images into a new image. This process involves not only the superimposition of all image data, but also the targeted processing of the target image through one or more algorithms [1], [2], [3]. In medical imaging, fusion refers to combining two (or more) medical images from different imaging devices by using an algorithm that exploits the advantages or complementarities of each image, in order to obtain a more informative image. The development of medical imaging technology has resulted in the widespread use of computed tomography (CT), magnetic resonance imaging (MRI), positron emission tomography (PET), single-photon emission computed tomography (SPECT), and other imaging techniques in clinical diagnosis and treatment. Different imaging modes provide doctors with different types of medical information. For instance, CT images have a high spatial resolution and clear bone imaging, and provide a good reference for localizing lesions; however, they cannot adequately display soft tissue or the details of invasive tumors. In contrast, MRI provides clear soft-tissue imaging and is conducive to determining the scope of a lesion, while PET and SPECT imaging provide clear functional information about metabolism within the body. However, the low spatial resolution of functional imaging can limit the diagnosis of pathological tumors. Because each imaging principle has inherent limitations in acquiring information, a single type of image is not optimal for precise visualization. Therefore, combining the advantages and complementary information of different imaging methods through medical image fusion technology can generate sufficiently accurate information to facilitate medical diagnosis and treatment planning [4], [5], [6], [7].
In recent decades, research on medical image fusion, especially multiscale transform (MST)-based methods, has gained mainstream popularity. Briefly, MST-based fusion methods involve three sequential steps: (1) decomposing the pending source images into several sub-bands, which represent different feature information of the images, by using multiscale operators; (2) applying pixel-level fusion rules to fuse the corresponding sub-bands; and (3) performing the inverse transformation to obtain the final fused image. Early conventional methods included discrete wavelet transform-based methods [1], [8] and Laplacian pyramid transform-based methods [9], [10]. These early methods obtained different image features through multiscale transformation but processed the different frequency components with the same, simple fusion rules when the sub-bands were fused. A limitation of this approach is that only one image characteristic is considered while others are ignored, which can seriously reduce the fusion effect [11]. To address this issue, several enhanced algorithms were proposed. Tian et al. [12] applied different fusion rules to diverse frequency bands based on pixel or regional information. Xu et al. [13] proposed the fractional wavelet transform method, which can better define fusion coefficients. Similarly, Liu et al. proposed an MST-based sparse representation theory [14]. Although the abovementioned methods adopt different rules in various frequency bands to obtain good results, they cannot optimally fuse detail and contour information due to the limitations of their established rules. Moreover, because of the down-sampling and up-sampling involved, artifact-related problems are sometimes unavoidable.
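The three steps above can be sketched with a single-level Haar decomposition and two common fusion rules (averaging for the low band, maximum-absolute selection for the high bands). This is a minimal illustrative toy, not the fusion scheme of any cited method:

```python
import numpy as np

def haar2d(img):
    """One-level 2-D Haar decomposition: returns (LL, LH, HL, HH)."""
    a = (img[0::2, :] + img[1::2, :]) / 2.0   # row averages
    d = (img[0::2, :] - img[1::2, :]) / 2.0   # row differences
    LL = (a[:, 0::2] + a[:, 1::2]) / 2.0
    LH = (a[:, 0::2] - a[:, 1::2]) / 2.0
    HL = (d[:, 0::2] + d[:, 1::2]) / 2.0
    HH = (d[:, 0::2] - d[:, 1::2]) / 2.0
    return LL, LH, HL, HH

def ihaar2d(LL, LH, HL, HH):
    """Inverse of haar2d (perfect reconstruction)."""
    a = np.zeros((LL.shape[0], LL.shape[1] * 2))
    d = np.zeros_like(a)
    a[:, 0::2], a[:, 1::2] = LL + LH, LL - LH
    d[:, 0::2], d[:, 1::2] = HL + HH, HL - HH
    img = np.zeros((a.shape[0] * 2, a.shape[1]))
    img[0::2, :], img[1::2, :] = a + d, a - d
    return img

def fuse(img1, img2):
    """MST fusion sketch: average the low band, max-abs select high bands."""
    c1, c2 = haar2d(img1), haar2d(img2)
    LL = (c1[0] + c2[0]) / 2.0                        # averaging rule
    highs = [np.where(np.abs(h1) >= np.abs(h2), h1, h2)
             for h1, h2 in zip(c1[1:], c2[1:])]       # max-abs rule
    return ihaar2d(LL, *highs)
```

Because the standard (decimated) Haar transform is used here, this sketch also exhibits the shift-variance that motivates the stationary-wavelet approach discussed below.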
Furthermore, image fusion methods based on stationary wavelet transform (SWT) were proposed to effectively avoid the Gibbs phenomenon [15], [16], although the imperfect formulation of rules hampered optimal image fusion. In addition, other shift-invariant methods, such as nonsubsampled contourlet transform (NSCT), have garnered interest. Based on the NSCT domain, Ganasala et al. [17] applied the entropy and Laplacian operators to the corresponding low- and high-frequency sub-bands, respectively. NSCT-based methods have implemented rules based on phase congruency, directive contrast, or Laplacian energy in order to merge the corresponding low- and high-frequency domains [18], [19]. The non-subsampled shearlet transform (NSST) method, which applies a shear-wave filter to decompose and reconstruct pending images, constitutes another mainstream shift-invariant method. Liu et al. [20] considered the gradient factor to optimize the coefficients based on the structure tensor and NSST. To improve the directional information, Singh et al. [21] adopted ripplet transform and NSST to design a cascaded model. The above-described methods have achieved accurate and detailed visualization of directional tissue features. However, due to the large differences in the parameter performance of different filters, the stability of the fusion effect is greatly affected by the selection of parameters. In addition, computational complexity affects fusion efficiency [22], [23].
Furthermore, neural network-based methods are applied to medical image fusion. The pulse-coupled neural network (PCNN) is a simplified neural network model that is based on the principles underlying cat vision. The application of the PCNN-based method – processed over the global domain – to the field of image fusion can facilitate the retention of more detailed information. However, effectively stimulating the neurons to maximize their effect without training remains a challenge. Huang et al. [24] added the Laplacian energy of the image block as the input stimulus for each neuron in the PCNN. To improve the performance of the PCNN and thereby the quality of the fused image, parameter-adaptive PCNN methods were previously applied to the high-frequency band of the NSST domain [25], [26]. Despite the availability of many PCNN-based fusion algorithms, the optimization of coefficients and the definition of the threshold are still being investigated. In recent years, convolutional neural network (CNN)-based methods have emerged as a popular approach for solving problems pertaining to medical image fusion [27], [28], [29]. However, the difficulty of training deep neural networks and the limited availability of training data have hampered the sustained performance of the neural network. Based on the multiscale domain, the sampling or convolution operations in a CNN can easily cause information loss during the fusion process. We previously proposed a medical image fusion method based on a fuzzy radial basis function neural network [30], but it only considered pixel features in the image domain to stimulate the input neurons, which may limit the network's cognitive ability and prevent optimal performance.
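As a concrete illustration of how PCNN neurons are stimulated, the following minimal sketch iterates the standard feeding/linking/dynamic-threshold equations on a normalized stimulus map. The parameters and linking kernel are illustrative placeholders, not those of the cited methods; the cumulative firing count is one activity measure commonly used to compare coefficients during fusion:

```python
import numpy as np

def neighbor_sum(Y):
    """8-neighbour weighted sum of the pulse map Y (zero padding)."""
    P = np.pad(Y, 1)
    W = np.array([[0.5, 1.0, 0.5], [1.0, 0.0, 1.0], [0.5, 1.0, 0.5]])
    out = np.zeros_like(Y)
    for i in range(3):
        for j in range(3):
            out += W[i, j] * P[i:i + Y.shape[0], j:j + Y.shape[1]]
    return out

def pcnn_fire_count(S, beta=0.2, alpha_theta=0.2, V_theta=20.0, iters=10):
    """Simplified PCNN: S is the normalised stimulus (feeding input).
    Returns per-pixel cumulative firing counts over `iters` iterations."""
    theta = np.ones_like(S)   # dynamic threshold
    Y = np.zeros_like(S)      # pulse output
    count = np.zeros_like(S)
    for _ in range(iters):
        L = neighbor_sum(Y)                       # linking input from neighbours
        U = S * (1.0 + beta * L)                  # internal activity
        Y = (U > theta).astype(float)             # pulse where activity beats threshold
        theta = np.exp(-alpha_theta) * theta + V_theta * Y  # decay, jump where fired
        count += Y
    return count
```

Stronger stimuli overcome the decaying threshold earlier and fire more often, which is why firing counts serve as a saliency measure without any training.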
At present, the main research direction of medical image fusion is to establish specific fusion rules on each frequency sub-band based on shift-invariant-based multiscale transforms. However, the identification of the most suitable fusion rules is problematic. To address this gap, in the present study, we proposed and tested a novel medical image fusion method based on the discrete stationary wavelet transform (DSWT) and radial basis function neural network (RBFNN).
Study design
Considering the shift invariance and the number of calculations, we selected DSWT as the multiscale transform operator. Next, we performed two-level wavelet decomposition to obtain 14 sub-bands representing different information features of the two source images to be fused. For each corresponding pair of sub-bands, fully considering the pixel characteristics and the interaction between pixels, we established a radial basis function neural network that includes 8, 40, and 1 neuron(s) in the input, hidden, and output layers, respectively.
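The 8–40–1 topology described above can be sketched as a forward pass through Gaussian radial basis units. The centers, widths, and output weights below are random placeholders for illustration (in the actual method they would be learned from teacher data), not values from the paper:

```python
import numpy as np

rng = np.random.default_rng(0)

class RBFNN:
    """Minimal RBF network: 8 inputs, 40 Gaussian hidden units, 1 output."""

    def __init__(self, n_in=8, n_hidden=40, n_out=1, sigma=1.0):
        # Placeholder parameters; a trained network would fit these.
        self.centers = rng.normal(size=(n_hidden, n_in))
        self.sigma = sigma
        self.weights = rng.normal(size=(n_hidden, n_out))

    def forward(self, x):
        # Gaussian activation on the squared distance to each center,
        # then a linear combination at the output neuron.
        dist2 = ((self.centers - x) ** 2).sum(axis=1)
        phi = np.exp(-dist2 / (2.0 * self.sigma ** 2))
        return phi @ self.weights
```

Each of the 8 inputs would correspond to a pixel-level feature extracted from a sub-band pair; the single output would drive the choice of fused coefficient.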
Results of visual observation
For each type of dataset, we fused two image pairs. The fusion performances of the six pairs of datasets based on seven different methods are shown in Fig. 5, Fig. 6, Fig. 7, Fig. 8, Fig. 9, Fig. 10. In these figures, (a) and (b) show the two source images to be fused; (c)–(i) represent the fused images obtained by the WT, NSCT, SR, APCNN, CNN, and FRBFNN-based methods as well as the proposed method, respectively. First, we observed the CT-MRI datasets (CT-MRI dataset 1 and CT-MRI dataset 2).
Discussion
Medical image fusion is used to complementarily combine medical images obtained through different imaging modalities. By organically combining anatomical information or functional information, the fused image displays comprehensive information based on multiple single-mode imaging sources without loss of information or energy from the single-mode source images. The WT- and NSCT-based methods obviously failed to achieve the desired fusion effects. MST-based methods are increasingly emerging as the mainstream approach.
Conclusion
The proposed hybrid model based on DSWT and enhanced RBFNN captures and processes the details of each sub-band and overcomes the limitations that afflict previously established medical image fusion methods. Additionally, we apply an adaptive choice approach based on other competitive algorithms to create teacher data, making the proposed structure learn the strengths from other methods. In comparative experiments using abundant data on several mainstream methods, the proposed method could not
CRediT authorship contribution statement
Zhen Chao: Conceptualization, Methodology, Software, Writing – original draft. Xingguang Duan: Investigation. Shuangfu Jia: Assessment. Xuejun Guo: Validation. Hao Liu: Validation, Supervision. Fucang Jia: Supervision, Writing – reviewing and editing.
Declaration of Competing Interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Acknowledgments
We are very grateful for the open-source code shared in [14], [26], [27]. In addition, for Refs. [1], [17], we reproduced the code according to the papers' contents. The code is available at https://github.com/med-img/dwt_based_img_fusion.git and https://github.com/med-img/NSCT_Padma_2014_img_fusion.git.
Funding
This work was supported by the National Natural Science Foundation of China (grant nos. 62172401, 82001905, and 12026602), and was partly supported by the National Key Research and Development Program of China.
References (44)
- et al., A wavelet-based image fusion tutorial, Pattern Recognit. (2004)
- et al., Infrared and visible image fusion methods and applications: A survey, Inf. Fusion (2019)
- et al., Medical image fusion: A survey of the state of the art, Inf. Fusion (2014)
- et al., Medical image fusion using discrete fractional wavelet transform, Biomed. Signal Process. Control (2016)
- et al., A general framework for image fusion based on multi-scale transform and sparse representation, Inf. Fusion (2015)
- et al., Structure tensor and nonsubsampled shearlet transform based algorithm for CT and MRI image fusion, Neurocomputing (2017)
- et al., Infrared and visible image fusion based on visual saliency map and weighted least square optimization, Infrared Phys. Technol. (2017)
- et al., Multi-focus image fusion using pulse coupled neural network, Pattern Recognit. Lett. (2007)
- et al., Multi-focus image fusion with a deep convolutional neural network, Inf. Fusion (2017)
- et al., Multi-modality image fusion based on enhanced fuzzy radial basis function neural networks, Phys. Med. (2018)
- R-peaks detection based on stationary wavelet transform, Comput. Methods Programs Biomed.
- Application of radial basis function artificial neural network to quantify interfacial energies related to membrane fouling in a membrane bioreactor, Bioresour. Technol.
- Prediction of interfacial interactions related with membrane fouling in a membrane bioreactor based on radial basis function artificial neural network (ANN), Bioresour. Technol.
- An entropy-based evaluation method for knowledge bases of medical information systems, Expert Syst. Appl.
- Image fusion with guided filtering, IEEE Trans. Image Process.
- An overview of multi-modal medical image fusion, Neurocomputing
- Fast curvelet transform through genetic algorithm for multimodal medical image fusion, Soft Comput.
- Current trends in medical image registration and fusion, Egypt. Inform. J.
- Medical image of PET/CT weighted fusion based on wavelet transform
- The Laplacian pyramid as a compact image code, IEEE Trans. Commun.
- Multi-scale fusion algorithm comparisons: Pyramid, DWT and iterative DWT
- Medical image fusion via an effective wavelet-based approach, EURASIP J. Adv. Signal Process.