Skip to main content

Temporally Consistent 3D Pose Estimation in the Interventional Room Using Discrete MRF Optimization over RGBD Sequences

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8498))

Abstract

Tracking and body pose estimation of clinical staff have several applications in the analysis of surgical workflow, such as radiation monitoring, surgical activity recognition and the study of ergonomics. The operating room is, however, a very complex environment for visual tracking due to frequent illumination changes, clutter, similar color of clinicians’ scrubs and limited sensor positioning. Furthermore, several applications, such as radiation monitoring, require consistent and accurate body part tracking over defined periods of time, which is a challenging task in the aforementioned conditions. In this paper, we tackle the problem of pose estimation in the interventional room. We also propose a method to consistently track upper body parts in short sequences by using RGBD data and discrete Markov Random Field (MRF) optimization over the complete set of frames. The proposed MRF energy formulation enforces both body kinematic and temporal constraints in order to cope with the natural ambiguities of tracking and with the frequent failure of the underlying depth-based body part detector in such conditions. We evaluate our approach quantitatively on seven manually-annotated sequences recorded in the interventional room and show that it can consistently track the upper-body of persons present in the room.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Schwarz, L.A., Bigdelou, A., Navab, N.: Learning gestures for customizable human-computer interaction in the operating room. In: Fichtinger, G., Martel, A., Peters, T. (eds.) MICCAI 2011, Part I. LNCS, vol. 6891, pp. 129–136. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  2. Noonan, D.P., Mylonas, G.P., Darzi, A., Yang, G.Z.: Gaze contingent articulated robot control for robot assisted min. invasive surgery. In: IROS (2008)

    Google Scholar 

  3. Padoy, N., Mateus, D., Weinland, D., Berger, M.O., Navab, N.: Workflow Monitoring based on 3D Motion Features. In: VOEC-ICCV, pp. 585–592 (2009)

    Google Scholar 

  4. Lea, C., Facker, J.C., Hager, G.D., Taylor, R.H., Saria, S.: 3d sensing algorithms towards building an intelligent intensive care unit. In: AMIA CRI (2013)

    Google Scholar 

  5. Gentric, J.C., Trelhu, B., Jannin, P., Riffaud, L., Ferré, J.C., Gauvrit, J.Y.: Development of workflow task analysis during cerebral diagnostic angiographies: Time-based comparison of junior and senior tasks. Journal of Neuroradiology (2013)

    Google Scholar 

  6. Ladikos, A., Cagniart, C., Ghotbi, R., Reiser, M., Navab, N.: Estimating radiation exposure in interventional environments. In: Jiang, T., Navab, N., Pluim, J.P.W., Viergever, M.A. (eds.) MICCAI 2010, Part III. LNCS, vol. 6363, pp. 237–244. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  7. Nara, A., Izumi, K., Iseki, H., Suzuki, T., Nambu, K., Sakurai, Y.: Surgical workflow monitoring based on trajectory data mining. In: Bekki, D. (ed.) JSAI-isAI 2010. LNCS, vol. 6797, pp. 283–291. Springer, Heidelberg (2011)

    Google Scholar 

  8. Carinou, E., Brodecki, M., Domienik, J., Donadille, L., Koukorava, C., Krim, S., Nikodemová, D., Ruiz-Lopez, N., Sans-Mercé, M., Struelens, L., Vanhavere, F.: Recommendations to reduce extremity and eye lens doses in interventional radiology and cardiology. Radiation Measurements (May 2011)

    Google Scholar 

  9. Holte, M., Tran, C., Trivedi, M., Moeslund, T.: Human pose estimation and activity recognition from multi-view videos. IEEE Journal of Selected Topics in Signal Processing 6(5), 538–552 (2012)

    Article  Google Scholar 

  10. Felzenszwalb, P.F., Girshick, R., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part based models. PAMI 32(9), 1627–1645 (2010)

    Article  Google Scholar 

  11. Izadinia, H., Saleemi, I., Li, W., Shah, M. (MP)2T: Multiple people multiple parts tracker. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 100–114. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  12. Choi, W., Pantofaru, C., Savarese, S.: A general framework for tracking multiple people from a moving camera. PAMI 35(7), 1577–1591 (2013)

    Article  Google Scholar 

  13. Baak, A., Rosenhahn, B., Mueller, M., Seidel, H.P.: Stabilizing motion tracking using retrieved motion priors. In: ICCV, pp. 1428–1435 (2009)

    Google Scholar 

  14. Shotton, J., Girshick, R., Fitzgibbon, A., Sharp, T., Cook, M., Finocchio, M., Moore, R., Kohli, P., Criminisi, A., Kipman, A., Blake, A.: Efficient human pose estimation from single depth images. PAMI 35(12), 2821–2840 (2012)

    Article  Google Scholar 

  15. Buys, K., Cagniart, C., Baksheev, A., Laet, T.D., Schutter, J.D., Pantofaru, C.: An adaptable system for rgb-d based human body detection and pose estimation. Journal of Visual Communication and Image Representation (2013)

    Google Scholar 

  16. Padoy, N., Hager, G.D.: 3d thread tracking for robotic assistance in tele-surgery. In: IROS, pp. 2102–2107. IEEE (2011)

    Google Scholar 

  17. Wang, C., Komodakis, N., Paragios, N.: Markov random field modeling, inference & learning in computer vision & image understanding: A survey. CVIU 117(11), 1610–1627 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Kadkhodamohammadi, A., Gangi, A., de Mathelin, M., Padoy, N. (2014). Temporally Consistent 3D Pose Estimation in the Interventional Room Using Discrete MRF Optimization over RGBD Sequences. In: Stoyanov, D., Collins, D.L., Sakuma, I., Abolmaesumi, P., Jannin, P. (eds) Information Processing in Computer-Assisted Interventions. IPCAI 2014. Lecture Notes in Computer Science, vol 8498. Springer, Cham. https://doi.org/10.1007/978-3-319-07521-1_18

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-07521-1_18

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-07520-4

  • Online ISBN: 978-3-319-07521-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics