Skip to main content

A Visual Model Approach for Parsing Colonoscopy Videos

  • Conference paper
Image and Video Retrieval (CIVR 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3115))

Included in the following conference series:

Abstract

Colonoscopy is an important screening procedure for colorectal cancer. During this procedure, the endoscopist visually inspects the colon. Currently, there is no content-based analysis and retrieval system that automatically analyzes videos captured from colonoscopic procedures and provides a user-friendly and efficient access to important content. Such a system will be valuable as an educational resource for endoscopic research, a platform to assess procedural skills for endoscopists, and a platform for mining for unknown abnormality patterns that may lead to colorectal cancer. The first necessary step for the analysis is parsing for semantic units. In this paper, we propose a new visual model approach that employs visual features extracted directly from compressed videos together with audio analysis to discover important semantic units called scenes. Our experimental results show average precision and recall of 93% and 85%, respectively.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Greenlee, R., Murry, T., Bolden, S., Wingo, P.A.: Cancer statistics. CA Cancer J Clin 50, 7–33 (2000)

    Article  Google Scholar 

  2. Cao, Y., Tavanapong, W., Kim, K.H., Wong, J., Oh, J.H., de Groen, P.C.: A framework for parsing colonoscopy videos for semantic units. In: Proc. of Int’l Conf. on Multimedia and Expo, Taipei, Taiwan (2004) (to appear)

    Google Scholar 

  3. Gargi, U., Kasturi, R., Strayer, S.H.: Performance characterization of video-shotchange detection methods. IEEE Transaction on Circuits and Systems for Video Technology 10, 1–13 (2000)

    Article  Google Scholar 

  4. Yusoff, Y., Kittler, J.: Video shot cut detection using adaptive thresholding. In: Proc. of the British Machine Vision Conference, Bristol, UK (2000)

    Google Scholar 

  5. Naphade, M.R., Mehrotra, R., Ferman, A.M., Warnick, J., Huang, T.S., Tekalp, A.M.: A high-performance shot boundary detection algorithm using multiple cues. In: Proc. of the IEEE Int’l Conf. on Image Processing, Chicago, Illinois, USA, pp. 884–887 (1998)

    Google Scholar 

  6. Zabih, R., Miller, J., K.: A feature-based algorithm for detecting and classification production effects. Multimedia Systems 7, 119–128 (1999)

    Google Scholar 

  7. Hanjalic, A., Zhang, H.J.: Optimal shot boundary detection based on robust statistical models. In: Proc. of the IEEE Int’l Conf. Multimedia Computing and Systems, Florence, Italy (1999)

    Google Scholar 

  8. Hampapur, A., Jain, R., Weymouth, T.: Production model based digital video segmentation. Multimedia Tools and Applications 1, 9–46 (1995)

    Article  Google Scholar 

  9. Lienhart, R.: Comparison of automatic shot boundary detection algorithms. In: Proc. of SPIE Storage and Retrieval for Still Image and Video Databases VII, vol. 3972, pp. 290–301 (1999)

    Google Scholar 

  10. Truong, B.T., Dorai, C., Venkatesh, S.: New enhancements to cut, fade, and dissolve detection processes in video segmentation. In: Proc. of ACM Multimedia, Los Angeles, CA, USA, pp. 219–227 (2000)

    Google Scholar 

  11. Yeo, B.L., Liu, B.: Rapid scene analysis on compressed video. IEEE Transactions on Circuits and Systems for Video Technology 5, 533–544 (1995)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Cao, Y., Tavanapong, W., Li, D., Oh, J., de Groen, P.C., Wong, J. (2004). A Visual Model Approach for Parsing Colonoscopy Videos. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_22

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-27814-6_22

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22539-3

  • Online ISBN: 978-3-540-27814-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics