skip to main content
10.1145/3595916.3626434acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

SFNet: Saliency fast Fourier convolutional Network for medical image segmentation

Published: 01 January 2024 Publication History

Abstract

Due to the limitation of local characteristics of convolution operations, the encoder of U-Net cannot effectively capture global context information; furthermore, the skip connections of U-Net fail to capture salient features in image segmentation tasks. Therefore, we put forward a Saliency fast Fourier convolutional Network (SFNet) for medical image segmentation. To begin with, we propose a SCAU attention module, which can highlight both spatial and channel attention for capturing not only global but local information, paying more attention to the whole target regions of an image, and creating strong information association among samples to extract the characteristics of the overall dataset. Subsequently, instead of employing the convolution to set the encoder and decoder, we introduce a Fourier convolution module, FFconv, which owns the non-local receptive field to fulfil cross-scale fusion in the convolution unit properly. Experimental results show that, on BUSI and Kvasir-SEG datasets, the mIOU and F1-score of our network reach 75.08% and 84.75%, respectively; our network greatly promote medical image segmentation performance.

References

[1]
Walid Al-Dhabyani, Mohammed Gomaa, Hussien Khaled, and Aly Fahmy. 2020. Dataset of breast ultrasound images. Data in brief 28 (2020), 104863.
[2]
Jeroen Bertels, David Robben, Robin Lemmens, and Dirk Vandermeulen. 2022. Convolutional neural networks for medical image segmentation. arXiv preprint arXiv:2211.09562 (2022).
[3]
Hu Cao, Yueyue Wang, Joy Chen, Dongsheng Jiang, Xiaopeng Zhang, Qi Tian, and Manning Wang. 2022. Swin-unet: Unet-like pure transformer for medical image segmentation. In European conference on computer vision. Springer, 205–218.
[4]
Shouvik Chakraborty and Kalyani Mali. 2023. An overview of biomedical image analysis from the deep learning perspective. Research Anthology on Improving Medical Imaging Techniques for Analysis and Intervention (2023), 43–59.
[5]
Jieneng Chen, Yongyi Lu, Qihang Yu, Xiangde Luo, Ehsan Adeli, Yan Wang, Le Lu, Alan L Yuille, and Yuyin Zhou. 2021. Transunet: Transformers make strong encoders for medical image segmentation. arXiv preprint arXiv:2102.04306 (2021).
[6]
Lu Chi, Borui Jiang, and Yadong Mu. 2020. Fast fourier convolution. Advances in Neural Information Processing Systems 33 (2020), 4479–4488.
[7]
Tribikram Dhar, Nilanjan Dey, Surekha Borra, and R Simon Sherratt. 2023. Challenges of Deep Learning in Medical Image Analysis—Improving Explainability and Trust. IEEE Transactions on Technology and Society 4, 1 (2023), 68–75.
[8]
Alexey Dosovitskiy, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, 2020. An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020).
[9]
Azade Farshad, Yousef Yeganeh, Peter Gehlbach, and Nassir Navab. 2022. Y-Net: A spatiospectral dual-encoder network for medical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 582–592.
[10]
Yunhe Gao, Mu Zhou, and Dimitris N Metaxas. 2021. UTNet: a hybrid transformer architecture for medical image segmentation. In Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part III 24. Springer, 61–71.
[11]
Qi Han, Mingyang Hou, Hongyi Wang, Chen Wu, Sheng Tian, Zicheng Qiu, and Baoping Zhou. 2023. EHDFL: Evolutionary hybrid domain feature learning based on windowed fast Fourier convolution pyramid for medical image classification. Computers in Biology and Medicine 152 (2023), 106353.
[12]
Emerald U Henry, Onyeka Emebob, and Conrad Asotie Omonhinmin. 2022. Vision transformers in medical imaging: A review. arXiv preprint arXiv:2211.10043 (2022).
[13]
Yonghao Huang, Chuan Zhou, Leiting Chen, Junjing Chen, and Shanlin Lan. 2021. Medical Frequency Domain Learning: Consider Inter-class and Intra-class Frequency for Medical Image Segmentation and Classification. In 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 897–904.
[14]
Debesh Jha, Pia H Smedsrud, Michael A Riegler, Pål Halvorsen, Thomas de Lange, Dag Johansen, and Håvard D Johansen. 2020. Kvasir-seg: A segmented polyp dataset. In MultiMedia Modeling: 26th International Conference, MMM 2020, Daejeon, South Korea, January 5–8, 2020, Proceedings, Part II 26. Springer, 451–462.
[15]
Ozan Oktay, Jo Schlemper, Loic Le Folgoc, Matthew Lee, Mattias Heinrich, Kazunari Misawa, Kensaku Mori, Steven McDonagh, Nils Y Hammerla, Bernhard Kainz, 2018. Attention u-net: Learning where to look for the pancreas. arXiv preprint arXiv:1804.03999 (2018).
[16]
Arshi Parvaiz, Muhammad Anwaar Khalid, Rukhsana Zafar, Huma Ameer, Muhammad Ali, and Muhammad Moazam Fraz. 2023. Vision Transformers in medical computer vision—A contemplative retrospection. Engineering Applications of Artificial Intelligence 122 (2023), 106126.
[17]
Oren Rippel, Jasper Snoek, and Ryan P Adams. 2015. Spectral representations for convolutional neural networks. Advances in neural information processing systems 28 (2015).
[18]
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. 2015. U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18. Springer, 234–241.
[19]
Jiacheng Ruan, Suncheng Xiang, Mingye Xie, Ting Liu, and Yuzhuo Fu. 2022. MALUNet: A multi-attention and light-weight unet for skin lesion segmentation. In 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). IEEE, 1150–1156.
[20]
Jiacheng Ruan, Mingye Xie, Suncheng Xiang, Ting Liu, and Yuzhuo Fu. 2022. MEW-UNet: Multi-axis representation learning in frequency domain for medical image segmentation. arXiv preprint arXiv:2210.14007 (2022).
[21]
Hao Shi, Guo Cao, Youqiang Zhang, Zixian Ge, Yanbo Liu, and Di Yang. 2023. F 3 Net: Fast Fourier Filter Network for Hyperspectral Image Classification. IEEE Transactions on Instrumentation and Measurement (2023).
[22]
Athanasios Tragakis, Chaitanya Kaul, Roderick Murray-Smith, and Dirk Husmeier. 2023. The fully convolutional transformer for medical image segmentation. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 3660–3669.
[23]
Jeya Maria Jose Valanarasu and Vishal M Patel. 2022. Unext: Mlp-based rapid medical image segmentation network. In International Conference on Medical Image Computing and Computer-Assisted Intervention. Springer, 23–33.
[24]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. Advances in neural information processing systems 30 (2017).
[25]
Wenxuan Wang, Jing Wang, Chen Chen, Jianbo Jiao, Lichao Sun, Yuanxiu Cai, Shanshan Song, and Jiangyun Li. 2023. FreMAE: Fourier Transform Meets Masked Autoencoders for Medical Image Segmentation. arXiv preprint arXiv:2304.10864 (2023).
[26]
Yundong Zhang, Huiye Liu, and Qiang Hu. 2021. Transfuse: Fusing transformers and cnns for medical image segmentation. In Medical Image Computing and Computer Assisted Intervention–MICCAI 2021: 24th International Conference, Strasbourg, France, September 27–October 1, 2021, Proceedings, Part I 24. Springer, 14–24.
[27]
Han Zhou, Wei Dong, Yangyi Liu, and Jun Chen. 2023. Breaking Through the Haze: An Advanced Non-Homogeneous Dehazing Method based on Fast Fourier Convolution and ConvNeXt. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 1894–1903.
[28]
Zongwei Zhou, Md Mahfuzur Rahman Siddiquee, Nima Tajbakhsh, and Jianming Liang. 2018. Unet++: A nested u-net architecture for medical image segmentation. In Deep Learning in Medical Image Analysis and Multimodal Learning for Clinical Decision Support: 4th International Workshop, DLMIA 2018, and 8th International Workshop, ML-CDS 2018, Held in Conjunction with MICCAI 2018, Granada, Spain, September 20, 2018, Proceedings 4. Springer, 3–11.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in Asia
December 2023
745 pages
ISBN:9798400702051
DOI:10.1145/3595916
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 January 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Fourier convolution module
  2. Medical image segmentation
  3. SCAU attention module
  4. global context information
  5. salient features

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • key scientific research project of higher school of Henan Province

Conference

MMAsia '23
Sponsor:
MMAsia '23: ACM Multimedia Asia
December 6 - 8, 2023
Tainan, Taiwan

Acceptance Rates

Overall Acceptance Rate 59 of 204 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 119
    Total Downloads
  • Downloads (Last 12 months)74
  • Downloads (Last 6 weeks)5
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media