Kvasir-Instrument: Diagnostic and Therapeutic Tool Segmentation Dataset in Gastrointestinal Endoscopy

Jha, Debesh; Ali, Sharib; Emanuelsen, Krister; Hicks, Steven A.; Thambawita, Vajira; Garcia-Ceja, Enrique; Riegler, Michael A.; de Lange, Thomas; Schmidt, Peter T.; Johansen, Håvard D.; Johansen, Dag; Halvorsen, Pål

doi:10.1007/978-3-030-67835-7_19

Debesh Jha^15,16,
Sharib Ali²³,
Krister Emanuelsen¹⁷,
Steven A. Hicks^15,19,
Vajira Thambawita^15,19,
Enrique Garcia-Ceja²⁴,
Michael A. Riegler¹⁵,
Thomas de Lange^18,20,21,
Peter T. Schmidt²²,
Håvard D. Johansen¹⁶,
Dag Johansen¹⁶ &
…
Pål Halvorsen^15,19

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12573))

Included in the following conference series:

International Conference on Multimedia Modeling

2914 Accesses
53 Citations

Abstract

Gastrointestinal (GI) pathologies are periodically screened, biopsied, and resected using surgical tools. Usually, the procedures and the treated or resected areas are not specifically tracked or analysed during or after colonoscopies. Information regarding disease borders, development, amount, and size of the resected area get lost. This can lead to poor follow-up and bothersome reassessment difficulties post-treatment. To improve the current standard and also to foster more research on the topic, we have released the “Kvasir-Instrument” dataset, which consists of 590 annotated frames containing GI procedure tools such as snares, balloons, and biopsy forceps, etc. Besides the images, the dataset includes ground truth masks and bounding boxes and has been verified by two expert GI endoscopists. Additionally, we provide a baseline for the segmentation of the GI tools to promote research and algorithm development. We obtained a dice coefficient score of 0.9158 and a Jaccard index of 0.8578 using a classical U-Net architecture. A similar dice coefficient score was observed for DoubleUNet. The qualitative results showed that the model did not work for the images with specularity and the frames with multiple tools, while the best result for both methods was observed on all other types of images. Both qualitative and quantitative results show that the model performs reasonably good, but there is potential for further improvements. Benchmarking using the dataset provides an opportunity for researchers to contribute to the field of automatic endoscopic diagnostic and therapeutic tool segmentation for GI endoscopy.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

CEID: Benchmark Dataset for Designing Segmentation Algorithms of Instruments Used in Colorectal Endoscopy

Comprehensive Transformer Integration Network (CTIN): Advancing Endoscopic Disease Segmentation with Hybrid Transformer Architecture

Improved Artifact Detection in Endoscopy Imaging Through Profile Pruning

Notes

References

Abadi, M., et al.: TensorFlow: a system for large-scale machine learning. In: Proceedings of USENIX Symposium on Operating Systems Design and Implementation, pp. 265–283 (2016)
Google Scholar
Ali, S., et al.: An objective comparison of detection and segmentation algorithms for artefacts in clinical endoscopy. Sci. Rep. 10(1), 1–15 (2020)
Article Google Scholar
Allan, M., Azizian, M.: Robotic scene segmentation sub-challenge. arXiv preprint arXiv:1902.06426 (2019)
Allan, M., et al.: 2017 robotic instrument segmentation challenge. arXiv preprint arXiv:1902.06426 (2019)
Bernhardt, S., Nicolau, S.A., Soler, L., Doignon, C.: The status of augmented reality in laparoscopic surgery as of 2016. Med. Image Anal. 37, 66–90 (2017)
Article Google Scholar
Bodenstedt, S., et al.: Comparative evaluation of instrument segmentation and tracking methods in minimally invasive surgery. arXiv preprint arXiv:1805.02475 (2018)
Borgli, H., et al.: Hyperkvasir, a comprehensive multi-class image and video dataset for gastrointestinal endoscopy. Sci. Data 7(1), 1–14 (2020)
Article Google Scholar
Chollet, F., et al.: Keras (2015)
Google Scholar
Cleary, K., Peters, T.M.: Image-guided interventions: technology review and clinical applications. Annu. Rev. Biomed. Eng. 12, 119–142 (2010)
Article Google Scholar
Jha, D., Riegler, M., Johansen, D., Halvorsen, P., Håvard, J.: DoubleU-net: a deep convolutional neural network for medical image segmentation. In: Proceedings of 33rd International Symposium on Computer-Based Medical Systems, pp. 558–564 (2020)
Google Scholar
Pakhomov, D., Premachandran, V., Allan, M., Azizian, M., Navab, N.: Deep residual learning for instrument segmentation in robotic surgery. In: Suk, H.-I., Liu, M., Yan, P., Lian, C. (eds.) MLMI 2019. LNCS, vol. 11861, pp. 566–573. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-32692-0_65
Chapter Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Chapter Google Scholar
Ross, T., et al.: Robust medical instrument segmentation challenge 2019. arXiv preprint arXiv:2003.10299 (2020)
Shvets, A.A., Rakhlin, A., Kalinin, A.A., Iglovikov, V.I.: Automatic instrument segmentation in robot-assisted surgery using deep learning. In: Proceedings of International Conference on Machine Learning and Applications, pp. 624–628 (2018)
Google Scholar
Thambawita, V., et al.: The medico-task 2018: disease detection in the gastrointestinal tract using global features and deep learning. arXiv preprint arXiv:1810.13278 (2018)
Thambawita, V., et al.: An extensive study on cross-dataset bias and evaluation metrics interpretation for machine learning applied to gastrointestinal tract abnormality classification. arXiv preprint arXiv:2005.03912 (2020)

Download references

Acknowledgements

This work is funded in part by the Research Council of Norway, project number 263248 (Privaton) and project number 282315 (AutoCap). We performed all computations in this paper on equipment provided by the Experimental Infrastructure for Exploration of Exascale Computing ($eX^3$), which is financially supported by the Research Council of Norway under contract 270053.

Author information

Authors and Affiliations

SimulaMet, Oslo, Norway
Debesh Jha, Steven A. Hicks, Vajira Thambawita, Michael A. Riegler & Pål Halvorsen
UIT The Arctic University of Norway, Tromsø, Norway
Debesh Jha, Håvard D. Johansen & Dag Johansen
Simula Research Laboratory, Oslo, Norway
Krister Emanuelsen
Augere Medical AS, Oslo, Norway
Thomas de Lange
Oslo Metropolitan University, Oslo, Norway
Steven A. Hicks, Vajira Thambawita & Pål Halvorsen
Medical Department, Sahlgrenska University Hospital-Mölndal, Gothenburg, Sweden
Thomas de Lange
Department of Medical Research, Bærum Hospital, Gjettum, Norway
Thomas de Lange
Karolinska University Hospital, Solna, Sweden
Peter T. Schmidt
Department of Engineering Science, University of Oxford, Oxford, UK
Sharib Ali
Sintef Digital, Oslo, Norway
Enrique Garcia-Ceja

Authors

Debesh Jha
View author publications
You can also search for this author in PubMed Google Scholar
Sharib Ali
View author publications
You can also search for this author in PubMed Google Scholar
Krister Emanuelsen
View author publications
You can also search for this author in PubMed Google Scholar
Steven A. Hicks
View author publications
You can also search for this author in PubMed Google Scholar
Vajira Thambawita
View author publications
You can also search for this author in PubMed Google Scholar
Enrique Garcia-Ceja
View author publications
You can also search for this author in PubMed Google Scholar
Michael A. Riegler
View author publications
You can also search for this author in PubMed Google Scholar
Thomas de Lange
View author publications
You can also search for this author in PubMed Google Scholar
Peter T. Schmidt
View author publications
You can also search for this author in PubMed Google Scholar
Håvard D. Johansen
View author publications
You can also search for this author in PubMed Google Scholar
Dag Johansen
View author publications
You can also search for this author in PubMed Google Scholar
Pål Halvorsen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Debesh Jha .

Editor information

Editors and Affiliations

Charles University, Prague, Czech Republic
Jakub Lokoč
Charles University, Prague, Czech Republic
Tomáš Skopal
Klagenfurt University, Klagenfurt, Austria
Klaus Schoeffmann
CERTH-ITI, Thessaloniki, Greece
Vasileios Mezaris
Renmin University of China, Beijing, China
Xirong Li
CERTH-ITI, Thessaloniki, Greece
Stefanos Vrochidis
Queen Mary University of London, London, UK
Ioannis Patras

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Jha, D. et al. (2021). Kvasir-Instrument: Diagnostic and Therapeutic Tool Segmentation Dataset in Gastrointestinal Endoscopy. In: Lokoč, J., et al. MultiMedia Modeling. MMM 2021. Lecture Notes in Computer Science(), vol 12573. Springer, Cham. https://doi.org/10.1007/978-3-030-67835-7_19

Download citation

DOI: https://doi.org/10.1007/978-3-030-67835-7_19
Published: 21 January 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67834-0
Online ISBN: 978-3-030-67835-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics