Abstract
Hyperspectral image fusion frequently leverages panchromatic and multispectral images. Although remote sensing images exhibit multi-scale features, prior research has predominantly focused on local feature extraction using convolutional approaches, thereby neglecting long-range dependencies among image elements. To overcome this limitation, we introduce a network for multi-scale long-distance feature extraction that incorporates an encoder-decoder structure with skip connections and a multi-layer perceptron block with attention mechanisms. By capturing features from multiple scales and distant locations within the image, the proposed network improves the performance of image fusion. Our experimental findings demonstrate that the proposed network achieves state-of-the-art performance in image fusion tasks.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chavez Jr., P.S., Sides, S.C., Anderson, J.A.: Comparison of three different methods to merge multiresolution and multispectral data: Landsat TM and SPOT panchromatic. Photogramm. Eng. Remote Sens. 57(3), 295–303 (1991)
Shettigara, V.K.: A generalized component substitution technique for spatial enhancement of multispectral images using a higher resolution data set. Photogramm. Eng. Remote Sens. 58(5), 561–567 (1992)
Laben, C.A., Brower, B.V.: Process for enhancing the spatial resolution of multispectral imagery using pan-sharpening. U.S. Patent 6 011 875 (2000)
Mallat, S.: A theory for multiresolution signal decomposition: the wavelet representation. IEEE Trans. Pattern Anal. Mach. Intell. 11(7), 674–693 (1989)
Nason, G.P., Silverman, B.W.: The stationary wavelet transform and some statistical applications. In: Antoniadis, A., Oppenheim, G. (eds.) Wavelets and Statistics. LNS, vol. 103, pp. 281–299. Springer, New York (1995). https://doi.org/10.1007/978-1-4612-2544-7_17
Shensa, M.J.: The discrete wavelet transform: wedding the á trous and Mallat algorithm. IEEE Trans. Sig. Process. 40(10), 2464–2482 (1992)
Burt, P.J., Adelson, E.H.: The Laplacian pyramid as a compact image code. IEEE Trans. Commun. vol. COM-31(4), 532–540 (1983)
Masi, G., Cozzolino, D., Verdoliva, L., Scarpa, G.: Pan-sharpening by convolutional neural networks. Remote Sens. 8(7), 594 (2016)
Yang, J., Fu, X., Hu, Y., Huang, Y., Ding, X., Paisley, J.: PanNet: a deep network architecture for pan-sharpening. In: 2017 IEEE International Conference on Computer Vision (ICCV) (2017)
Dong, C., Loy, C.C., He, K., Tang, X.: Image super-resolution using deep convolutional networks. IEEE Trans. Pattern Anal. Mach. Intell. 38(2), 295–307 (2016)
Gillespie, A.R., Kahle, A.B., Walker, R.E.: Color enhancement of highly correlated images. ii. Channel ratio and chromaticity transformation techniques. Remote Sens. Environ. 22(3), 343–365 (1987)
Li, X., Zhang, Y., Gao, Y., Yue, S.: Using guided filtering to improve gram-Schmidt based pansharpening method for GeoEye-1 satellite images. In: Proceedings of the 4th International Conference on Information Systems and Computing Technology (2016)
Shen, K., Yang, X., Lolli, S., Vivone, G.: A continual learning-guided training framework for pansharpening. ISPRS J. Photogramm. Remote. Sens. 196, 45–57 (2023)
Wald, L.: Data Fusion: “Definitions and Architectures-Fusion of Images of Different Spatial Resolutions.” Les Presses de l’ École des Mines, Paris, France (2002)
Alparone, L., et al.: Multispectral and panchromatic data fusion assessment without reference. Photogramm. Eng. Remote Sens. 74(2), 193–200 (2008)
Yuhas, R.H., Goetz, A.F.H., Boardman, J.W.: Discrimination among semi-arid landscape endmembers using the Spectral Angle Mapper (SAM) algorithm. In: Proceedings of Summaries 3rd Annual JPL Airborne Geoscience Workshop, pp. 147–149 (1992)
Vallejos, R., Pérez, J., Ellison, A.M., Richardson, A.D.: A spatial concordance correlation coefficient with an application to image analysis. Spat. Stat. 40(100405), 100405 (2020)
Chaudhary, N., Mahajan, R.: Spectrum sensing using 16-QAM and 32-QAM modulation techniques at different signal-to-noise ratio: a performance analysis. IAES Int. J. Artif. Intell. (IJ-AI) 12(2), 966 (2023)
Acknowledgements
I would like to express my deepest gratitude to my thesis advisor Shuyue Luo, for their guidance, support, and patience throughout the research process. Their insightful feedback and expertise have been invaluable in shaping this thesis. I would like to thank my colleagues Yuhao Lian, for their helpful discussions, encouragement, and assistance in various stages of my research. Their insights and ideas have greatly contributed to the development of this thesis. My gratitude also goes to my family and friends for their unwavering support, encouragement, and love. Their emotional support has helped me to stay motivated and focused during the difficult times of my research. Lastly, I would like to thank the participants of my study, without whom this research would not have been possible. Their willingness to participate and share their experiences and insights have been crucial to the success of this study. Thank you all for your invaluable support and contributions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Wang, Y., Lian, Y. (2023). MLFEN: Multi-scale Long-Distance Feature Extraction Network. In: Rojas, I., Joya, G., Catala, A. (eds) Advances in Computational Intelligence. IWANN 2023. Lecture Notes in Computer Science, vol 14134. Springer, Cham. https://doi.org/10.1007/978-3-031-43085-5_15
Download citation
DOI: https://doi.org/10.1007/978-3-031-43085-5_15
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-43084-8
Online ISBN: 978-3-031-43085-5
eBook Packages: Computer ScienceComputer Science (R0)