skip to main content
10.1145/3665689.3665702acmotherconferencesArticle/Chapter ViewAbstractPublication PagesbicConference Proceedingsconference-collections
research-article

An Improved Semantic Segmentation Method for Retinal OCT Images Based on High-Resolution Network and Polarized Self-Attention Mechanism

Published: 28 June 2024 Publication History

Abstract

Optical Coherence Tomography (OCT) is an efficient, non-invasive imaging technique. It operates by scanning ocular tissues with a laser beam and reconstructing cross-sectional images of the eye tissues using reflected light signals. Traditional methods for segmenting retinal OCT image structures rely on experience and suffer from poor repeatability and low accuracy. In response to this need, we have developed a novel end-to-end method for segmenting eight layers of tissue structures in retinal OCT images, based on High-Resolution Networks (HRNet), demonstrating outstanding performance. We have enhanced the base model by incorporating a polarized self-attention mechanism and modifying the loss function of the model. This has resulted in significant improvements. Achieved a good score of 0.8355 on the Miou coefficient, and can accurately segment out eight layers of tissue structure in OCT images.

References

[1]
Abramoff, M.D, Garvin, Mona K, and Sonka, Milan. "Retinal Imaging and Image Analysis." IEEE Reviews in Biomedical Engineering 3. 2010. 169-208.
[2]
Green, J S, Bear, J C, and Johnson, G J. "The Burden of Genetically Determined Eye Disease." British Journal of Ophthalmology 70, no. 9. 1986. 696-99.
[3]
"Sensory Reception: Human Vision: Structure and function of the Human Eye" vol. 27, Encyclopedia Britannica, 1987.
[4]
Bussel, Igor I, Wollstein, Gadi, and Schuman, Joel S. "OCT for Glaucoma Diagnosis, Screening and Detection of Glaucoma Progression." British Journal of Ophthalmology 98, no. Suppl 2. 2014. Ii15-i19.
[5]
Wang, Qian, "Thickness of individual layers at the macula and associated factors: The Beijing Eye Study 2011." BMC ophthalmology 20.1. 2020. 1-11.
[6]
Ometto, Giovanni, "ReLayer: a free, online tool for extracting retinal thickness from cross-platform OCT images." Translational vision science & technology 8.3. 2019. 25-25.
[7]
Amini, Zahra, Hossein Rabbani, and Ivan Selesnick. "Sparse domain Gaussianization for multi-variate statistical modeling of retinal OCT images." IEEE Transactions on Image Processing 29. 2020. 6873-6884.
[8]
Chandrappa, S., "A novel approach to glaucoma screening using optic nerve head through image fusion and fractal geometry." J. Math. Comput. Sci. 11.4. 2021. 4284-4300.
[9]
Esmaeili, Mahad, "Three-dimensional curvelet-based dictionary learning for speckle noise removal of optical coherence tomography." Biomedical Optics Express 11.2. 2020. 586-608.
[10]
Maloca, Peter M., "Validation of automated artificial intelligence segmentation of optical coherence tomography images." PloS one 14.8. 2019. e0220063.
[11]
Devalla, Sripad Krishna, "Towards label-free 3D segmentation of optical coherence tomography images of the optic nerve head using deep learning." Biomedical optics express 11.11. 2020. 6356-6378.
[12]
Stankiewicz, Agnieszka, "Segmentation of preretinal space in optical coherence tomography images using deep neural networks." Sensors 21.22. 2021. 7521.
[13]
Berenguer-Vidal, Rafael, "Automatic segmentation of the retinal nerve fiber layer by means of mathematical morphology and deformable models in 2d optical coherence tomography imaging." Sensors 21.23. 2021. 8027.
[14]
Liu, Jian, "Automated retinal boundary segmentation of optical coherence tomography images using an improved Canny operator." Scientific Reports 12.1. 2022. 1412.
[15]
Lang, Andrew, "Segmentation of retinal OCT images using a random forest classifier." Medical Imaging 2013: Image Processing. Vol. 8669. SPIE, 2013.
[16]
Malkov, Yu A., and Dmitry A. Yashunin. "Efficient and robust approximate nearest neighbor search using hierarchical navigable small world graphs." IEEE transactions on pattern analysis and machine intelligence 42.4. 2018. 824-836.
[17]
Simonyan, Karen, and Andrew Zisserman. "Very deep convolutional networks for large-scale image recognition." arXiv preprint arXiv:1409.1556. 2014.
[18]
He, Kaiming, "Deep residual learning for image recognition." Proceedings of the IEEE conference on computer vision and pattern recognition. 2016.
[19]
Newell, Alejandro, Kaiyu Yang, and Jia Deng. "Stacked hourglass networks for human pose estimation." Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part VIII 14. Springer International Publishing, 2016.
[20]
Newell, Alejandro, Kaiyu Yang, and Jia Deng. "Stacked hourglass networks for human pose estimation." Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part VIII 14. Springer International Publishing, 2016.
[21]
Hu, Jie, Li Shen, and Gang Sun. "Squeeze-and-excitation networks." Proceedings of the IEEE conference on computer vision and pattern recognition. 2018.
[22]
Vaswani, Ashish, "Attention is all you need." Advances in neural information processing systems 30. 2017.
[23]
Woo, Sanghyun, "Cbam: Convolutional block attention module." Proceedings of the European conference on computer vision (ECCV). 2018.
[24]
Liu, Huajun, "Polarized self-attention: Towards high-quality pixel-wise regression." arXiv preprint arXiv:2107.00782. 2021.
[25]
Oord, Aaron van den, Yazhe Li, and Oriol Vinyals. "Representation learning with contrastive predictive coding." arXiv preprint arXiv:1807.03748. 2018.

Index Terms

  1. An Improved Semantic Segmentation Method for Retinal OCT Images Based on High-Resolution Network and Polarized Self-Attention Mechanism

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    BIC '24: Proceedings of the 2024 4th International Conference on Bioinformatics and Intelligent Computing
    January 2024
    504 pages
    ISBN:9798400716645
    DOI:10.1145/3665689
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 28 June 2024

    Permissions

    Request permissions for this article.

    Check for updates

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    BIC 2024

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 21
      Total Downloads
    • Downloads (Last 12 months)21
    • Downloads (Last 6 weeks)5
    Reflects downloads up to 05 Mar 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media