skip to main content
10.1145/3463944.3469100acmconferencesArticle/Chapter ViewAbstractPublication PagesicdarConference Proceedingsconference-collections
research-article

Pyramidal Segmentation of Medical Images using Adversarial Training

Published: 21 August 2021 Publication History

Abstract

Colorectal cancer is a severe health issue globally and a significant cause of cancer-related mortality, but it is treatable if found at an early stage. Early detection is usually done through a colonoscopy, where clinicians search for cancer precursors called polyps. Research has shown that clinicians miss between 14% and 30% of polyps during standard screenings of the gastrointestinal tract. Furthermore, once the polyps have been found, clinicians often overestimate the size of the polyps. In this respect, automatic analysis of medical images for detecting and locating polyps is a research area where machine learning has excelled in recent years. Still, current models have much room for improvement. In this paper, we propose a novel approach based on learning to segment within several grids, which we introduce to U-Net and Pix2Pix architectures. In short, we have experimented using several grid sizes, and using two open-source polyp segmentation datasets for cross-data training and testing. Our results suggest that segmentation at lower resolutions produces better results at the cost of less precision, which proved useful for the cases where higher precision segmentations gave limited results. Generally, compared to traditional U-Net and Pix2Pix, our grid-based approaches improve segmentation performance.

Supplementary Material

MP4 File (icdar-14.mp4)
Presentation Video - Pyramidal Segmentation of Medical Images using Adversarial Training

References

[1]
Freddie Bray, Jacques Ferlay, Isabelle Soerjomataram, Rebecca L. Siegel, Lindsey A. Torre, and Ahmedin Jemal, "Global cancer statistics 2018: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries," CA: A Cancer Journal for Clinicians, vol. 68, no. 6, pp. 394--424, 2018.
[2]
Robin Delaine-Smith, Nicola Wright, Chris Hanley, Rebecca Hanwell, Rahul Bhome, Marc Bullock, Cole Drifka, Kevin Eliceiri, Gareth Thomas, Martin Knight, Alex Mirnezami, and Nicholas Peake, "Transglutaminase-2 mediates the biomechanical properties of the colorectal cancer tissue microenvironment that contribute to disease progression," Cancers, vol. 11, pp. 701, 05 2019.
[3]
European Colorectal Cancer Screening Guidelines Working Group, "European guidelines for quality assurance in colorectal cancer screening and diagnosis: Overview and introduction to the full supplement publication," 6 2012, Fetched 06/12/2019.
[4]
Roisin Bevan and Matthew D Rutter, "Colorectal cancer screening-who, how, and when?," 9 2017, Fetched 06/12/2019.
[5]
Jeroen Rijn, Johannes Reitsma, Jaap Stoker, Patrick Bossuyt, Sander van Deventer, and Evelien Dekker, "Polyp miss rate determined by tandem colonoscopy: A systematic review," The American journal of gastroenterology, vol. 101, pp. 343--50, 02 2006.
[6]
Guobing Pan and Litong Wang, "Swallowable wireless capsule endoscopy: Progress and technical challenges," Gastroenterology research and practice, vol. 2012, pp. 841691, 01 2012.
[7]
Mohammad Ali Armin, "Automated visibility map from colonoscopy video to support clinical diagnosis and improve the quality of colonoscopy," p. 29, 2016.
[8]
Sang W. Lee, Howard M. Ross, David E. Rivadeneira, Scott R. Steele, and Daniel L. Feingold, Advanced Colonoscopy and Endoluminal Surgery, springer, 2017.
[9]
Jun-Yan Zhu, Taesung Park, Phillip Isola, and Alexei A. Efros, "Unpaired image-to-image translation using cycle-consistent adversarial networks," 2017.
[10]
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros, "Image-to-image translation with conditional adversarial networks," 2016.
[11]
Xavier Dray, Cynthia Li, Jean-Christophe Saurin, Franck Cholet, Gabriel Rahmi, JP Le Mouel, C. Leandri, Stéphane Lecleire, Xavier Amiot, Jean-Michel Delvaux, Clotilde Duburque, R Gérard, Romain Leenhardt, F Mesli, G Vanbervliet, I. Nion-Larmurier, Sylvie Sacher-Huvelin, Camille Chane Simon, Olivier Romain, and Aymeric Histace, "CAD-CAP: une base de données franc caise à vocation internationale, pour le développement et la validation d'outils de diagnostic assisté par ordinateur en vidéocapsule endoscopique du grêle," in Actes des JFHOD 2018, Thieme, Ed., Paris, France, Mar. 2018, vol. 50 of Endoscopy, p. 316.
[12]
Jorge Bernal, F. Javier Sánchez, Gloria Fernández-Esparrach, Debora Gil, Cristina Rodríguez, and Fernando Vilariño, "Wm-dova maps for accurate polyp highlighting in colonoscopy: Validation vs. saliency maps from physicians," Computerized Medical Imaging and Graphics, vol. 43, pp. 99 -- 111, 2015.
[13]
Debesh Jha, Pia H Smedsrud, Michael A Riegler, Pål Halvorsen, Thomas de Lange, Dag Johansen, and Håvard D Johansen, "Kvasir-seg: A segmented polyp dataset," in Proceedings of International Conference on Multimedia Modeling (MMM), 2020, pp. 451--462.
[14]
Toshiaki Hirasawa, Kazuharu Aoyama, Junko Fujisaki, and Tomohiro Tada, "113 application of artificial intelligence using convolutional neural network for detecting gastric cancer in endoscopic images," Gastrointestinal Endoscopy, vol. 87, no. 6, pp. AB51, 2018.
[15]
Ali Alammari, Abm Rezbaul Islam, JungHwan Oh, Wallapak Tavanapong, Johnny Wong, and Piet C De Groen, "Classification of ulcerative colitis severity in colonoscopy videos using cnn," in Proceedings of the 9th International Conference on Information Management and Engineering. ACM, 2017, pp. 139--144.
[16]
Vajira Thambawita, Debesh Jha, Michael Riegler, Pål Halvorsen, Hugo Lewi Hammer, Håvard D. Johansen, and Dag Johansen, "The medico-task 2018: Disease detection in the gastrointestinal tract using global features and deep learning," 2018.
[17]
M. Riegler, K. Pogorelov, P. Halvorsen, T. de Lange, C. Griwodz, P. T. Schmidt, S. L. Eskeland, and D. Johansen, "Eir - efficient computer aided diagnosis framework for gastrointestinal endoscopies," in Proceedings of the International Workshop on Content-Based Multimedia Indexing (CBMI), 2016, pp. 1--6.
[18]
K. Pogorelov, O. Ostroukhova, M. Jeppsson, H. Espeland, C. Griwodz, T. de Lange, D. Johansen, M. Riegler, and P. Halvorsen, "Deep learning and hand-crafted feature based approaches for polyp detection in medical videos," in 2018 IEEE 31st International Symposium on Computer-Based Medical Systems (CBMS), 2018, pp. 381--386.
[19]
Liansheng Wang, Cong Xie, and Yanxing Hu, "Iddf2018-abs-0260 deep learning for polyp segmentation," 2018.
[20]
Olaf Ronneberger, Philipp Fischer, and Thomas Brox, "U-net: Convolutional networks for biomedical image segmentation," 2015.
[21]
Yuichi Mori, Shin-ei Kudo, Masashi Misawa, Yutaka Saito, Hiroaki Ikematsu, Kinichi Hotta, Kazuo Ohtsuka, Fumihiko Urushibara, Shinichi Kataoka, Yushi Ogawa, et al., "Real-time use of artificial intelligence in identification of diminutive polyps during colonoscopy: a prospective study," Annals of internal medicine, 2018.
[22]
Indriani Astono, James Welsh, Stephan Chalup, and Peter Greer, "Optimisation of 2d u-net model components for automatic prostate segmentation on mri," Applied Sciences, vol. 10, pp. 2601, 04 2020.
[23]
D. Jha, P. H. Smedsrud, D. Johansen, T. d. Lange, H. Johansen, P. Halvorsen, and M. Riegler, "A comprehensive study on colorectal polyp segmentation with resunet+, conditional random field and test-time augmentation," IEEE Journal of Biomedical and Health Informatics, pp. 1--1, 2021.
[24]
D. Jha, M. A. Riegler, D. Johansen, P. Halvorsen, and H. D. Johansen, "Doubleu-net: A deep convolutional neural network for medical image segmentation," in Proceedings of the IEEE International Symposium on Computer-Based Medical Systems (CBMS), jul 2020, pp. 558--564.
[25]
Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov, "Dropout: A simple way to prevent neural networks from overfitting," J. Mach. Learn. Res., vol. 15, no. 1, pp. 1929--1958, Jan. 2014.
[26]
Debesh Jha, Sharib Ali, Nikhil Kumar Tomar, Havard D. Johansen, Dag Johansen, Jens Rittscher, Michael A. Riegler, and Pal Halvorsen, "Real-time polyp detection, localization and segmentation in colonoscopy using deep learning," IEEE Access, vol. 9, pp. 40496--40510, 2021.
[27]
Debesh Jha, Nikhil Kumar Tomar, Sharib Ali, Michael A. Riegler, Håvard D. Johansen, Dag Johansen, Thomas de Lange, and Pål Halvorsen, "Nanonet: Real-time polyp segmentation in video capsule endoscopy and colonoscopy," 2021.
[28]
Debesh Jha, Pia H Smedsrud, Michael A Riegler, Dag Johansen, Thomas De Lange, Pål Halvorsen, and Håvard D Johansen, "Resunet+: An advanced architecture for medical image segmentation," in Proceedings of IEEE International Symposium on Multimedia (ISM), 2019, pp. 225--2255.
[29]
Pia H Smedsrud, Henrik Gjestang, Oda Olsen Nedrejord, Espen Næss, Vajira Thambawita, Steven Hicks, Debesh Jha, Tor Jan Derek Berstad, Sigrun L Eskeland, Håvard Espeland, Andreas Petlund, Peter T Schmidt, Hugo L. Hammer, Thomas de Lange, Michael A Riegler, and Pål Halvorsen, "Kvasir-capsule, a video capsule endoscopy dataset," Scientific Data, vol. 8, 2021.

Cited By

View all
  • (2023)Ensemble Architectures and Efficient Fusion Techniques for Convolutional Neural Networks: An Analysis on Resource Optimization StrategiesIntelligent Systems10.1007/978-3-031-45389-2_8(107-121)Online publication date: 12-Oct-2023

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
ICDAR '21: Proceedings of the 2021 ACM Workshop on Intelligent Cross-Data Analysis and Retrieval
August 2021
72 pages
ISBN:9781450385299
DOI:10.1145/3463944
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 August 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. U-net
  2. colorectal cancer
  3. computer-assisted diagnosis systems
  4. cross-data experiments
  5. generative adversarial networks
  6. image augmentation
  7. image segmentation

Qualifiers

  • Research-article

Conference

ICMR '21
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)1
Reflects downloads up to 20 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2023)Ensemble Architectures and Efficient Fusion Techniques for Convolutional Neural Networks: An Analysis on Resource Optimization StrategiesIntelligent Systems10.1007/978-3-031-45389-2_8(107-121)Online publication date: 12-Oct-2023

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media