Abstract
Colonoscopy is an important screening procedure for colorectal cancer, during which the endoscopist visually inspects the colon. Currently, no content-based analysis and retrieval system automatically analyzes videos captured during colonoscopic procedures and provides user-friendly, efficient access to their important content. Such a system would be valuable as an educational resource for endoscopic research, as a platform for assessing the procedural skills of endoscopists, and as a platform for mining unknown abnormality patterns that may lead to colorectal cancer. The first necessary step in this analysis is parsing the video into semantic units. In this paper, we propose a new visual model approach that combines visual features extracted directly from compressed videos with audio analysis to discover important semantic units called scenes. Our experimental results show average precision and recall of 93% and 85%, respectively.
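For readers unfamiliar with the two reported metrics, the following is a minimal sketch of how precision and recall are commonly computed for boundary detection. It is not the evaluation code used in the paper: the function name `boundary_precision_recall`, the frame-tolerance matching rule, and the frame numbers in the usage note are all assumptions made for illustration.

```python
def boundary_precision_recall(detected, ground_truth, tolerance=0):
    """Score detected scene boundaries against ground-truth boundaries.

    Boundaries are frame indices. A ground-truth boundary counts as found
    if an unused detection lies within `tolerance` frames of it (this
    matching rule is an assumption, not taken from the paper).
    """
    unmatched = sorted(detected)
    true_positives = 0
    for gt in sorted(ground_truth):
        # Detections close enough to this ground-truth boundary.
        candidates = [d for d in unmatched if abs(d - gt) <= tolerance]
        if candidates:
            # Use the nearest one, and use each detection at most once.
            best = min(candidates, key=lambda d: abs(d - gt))
            unmatched.remove(best)
            true_positives += 1
    # Precision: fraction of detections that are correct.
    precision = true_positives / len(detected) if detected else 0.0
    # Recall: fraction of true boundaries that were detected.
    recall = true_positives / len(ground_truth) if ground_truth else 0.0
    return precision, recall
```

As a usage illustration with made-up frame numbers, `boundary_precision_recall([100, 250, 400, 610], [98, 250, 405, 500, 600], tolerance=5)` matches three of the four detections to three of the five true boundaries, giving a precision of 0.75 and a recall of 0.6.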
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cao, Y., Tavanapong, W., Li, D., Oh, J., de Groen, P.C., Wong, J. (2004). A Visual Model Approach for Parsing Colonoscopy Videos. In: Enser, P., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A.F., Smeulders, A.W.M. (eds) Image and Video Retrieval. CIVR 2004. Lecture Notes in Computer Science, vol 3115. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-27814-6_22
DOI: https://doi.org/10.1007/978-3-540-27814-6_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22539-3
Online ISBN: 978-3-540-27814-6
eBook Packages: Springer Book Archive