skip to main content
10.1145/3532213.3532319acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccaiConference Proceedingsconference-collections
research-article

Multi-task Semi-supervised Learning Framework for Surgical Instrument Pose Estimation

Published: 13 July 2022 Publication History

Abstract

As an important part of medical technology, the process of instrumental surgery has various risks, which can be reduced by using computer assisted intervention system during surgery on operation and decision making, while surgical instrument pose estimation is an important part in such system. The constraint of instrument pose estimation has always been sparse data annotation in endoscopic scenario. We propose a semi-supervised and multi-task learning based pose estimation framework to enhance the model effect using unannotated data. Due to the severe class imbalance in instrument pose estimation, simple semi-supervised learning results in model degradation, where network tends to classify every pixel as background. In this paper, we combine multi-task and semi-supervised learning, using instrument segmentation as a spatial constraint and instrument number detection as a global constraint, thus providing additional information for pose estimation task. In addition, we give pixels near key point region a higher weight and design a filtering mechanism on the output results, we only back-propagate and update the model parameters using the results above the confidence level. Through experiments, we verify that our model is able to gain information from unlabeled data to improve the accuracy of key point detection and overcome the problem of insufficient data in surgical instrument pose estimation task.

References

[1]
Allan M, Shvets A, Kurmann T, 2017 robotic instrument segmentation challenge[J]. arXiv preprint arXiv:1902.06426, 2019.
[2]
Chu Y, Yang X, Li H, Multi-level feature aggregation network for instrument identification of endoscopic images[J]. Physics in Medicine & Biology, 2020,65(16):165004.
[3]
Zhang L, Ye M, Chan P L, Real-time surgical tool tracking and pose estimation using a hybrid cylindrical marker[J]. International journal of computer assisted radiology and surgery, 2017, 12(6):921-930.
[4]
Rieke N, Tan D J, Alsheakhali M, Surgical tool tracking and pose estimation in retinal microsurgery[C]. in: International Conference on Medical Image Computing and Computer-Assisted Intervention. 2015:266-273.
[5]
Du X, Kurmann T, Chang P L, Articulated multi-instrument 2-D pose estimation using fully convolutional networks[J]. IEEE transactions on medical imaging,2018, 37(5):1276-1287.
[6]
Colleoni E, Moccia S, Du X, Deep learning based robotic tool detection and articulation estimation with spatio-temporal layers[J]. IEEE Robotics and Automation Letters, 2019, 4(3):2714-272.
[7]
González C, Bravo-Sánchez L, Arbelaez P. Isinet: an instance-based approach for surgical instrument segmentation[C]. in: International Conference on Medical Image Computing and Computer-Assisted Intervention. 2020:595-60.
[8]
Cheplygina V, de Bruijne M, Pluim J P. Not-so-supervised: a survey of semi-supervised, multi-instance, and transfer learning in medical image analysis[J]. Medical image analysis, 2019, 54:280-29.
[9]
Wang Z, Li Y, Guo Y, Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object Detection[C]. in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021:4568-4577.
[10]
Lee D H, Pseudo-label: The simple and efficient semi-supervised learning method for deep neural networks[C]. in: Workshop on challenges in representation learning, ICML: vol. 3: 2. 2013:896.
[11]
Laine S, Aila T. Temporal ensembling for semi-supervised learning[J]. ArXiv preprint arXiv:1610.02242, 2016.
[12]
Xie R, Wang C, Zeng W, An Empirical Study of the Collapsing Prob-lem in Semi-Supervised 2D Human Pose Estimation[C]. in: Proceedings of the IEEE/CVF International Conference on Computer Vision. 2021:11240-1124.
[13]
Tarvainen A, Valpola H. Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results[J]. arXiv preprint arXiv:1703.01780, 2017.
[14]
Chen Z, Zhu L, Wan L, A multi-task mean teacher for semi-supervised shadow detection[C]. in: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020: 5611-5620.
[15]
Tan M, Pang R, Le Q V. Efficientdet: Scalable and efficient object detection[C]. in: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2020: 10781-10790.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICCAI '22: Proceedings of the 8th International Conference on Computing and Artificial Intelligence
March 2022
809 pages
ISBN:9781450396110
DOI:10.1145/3532213
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 July 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Multi-task Learning
  2. Semi-supervised Learning
  3. Surgical Instrument Pose Estimation

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICCAI '22

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 98
    Total Downloads
  • Downloads (Last 12 months)30
  • Downloads (Last 6 weeks)1
Reflects downloads up to 05 Mar 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media