Content Based Image and Video Retrieval Using Embedded Text

Misra, Chinmaya; Sural, Shamik

doi:10.1007/11612704_12

Chinmaya Misra¹⁹ &
Shamik Sural¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3852))

Included in the following conference series:

Asian Conference on Computer Vision

2141 Accesses
2 Citations

Abstract

Extraction of text from image and video is an important step in building efficient indexing and retrieval systems for multimedia databases. We adopt a hybrid approach for such text extraction by exploiting a number of characteristics of text blocks in color images and video frames. Our system detects both caption text as well as scene text of different font, size, color and intensity. We have developed an application for on-line extraction and recognition of texts from videos. Such texts are used for retrieval of video clips based on any given keyword. The application is available on the web for the readers to repeat our experiments and also to try text extraction and retrieval from their own videos.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agnihotri, L., Dimitrova, N.: Text Detection in Video Segments. In: Proc. of Workshop on Content Based Access to Image and Video Libraries, June 1999, pp. 109–113 (1999)
Google Scholar
Hasan, Y.M.Y., Karam, L.J.: Morphological Text Extraction from Images. IEEE Transactions on Image Processing 9 (2000)
Google Scholar
Jain, A.K., Yu, B.: Automatic Text Location in Images and Video Frames. Pattern Recognition 31(12), 2055–2076 (1998)
Article Google Scholar
Jung, K., Han, J.H.: Hybrid Approach to Efficient Text Extraction in Complex Color Images. Pattern Recognition Letters 25, 679–699 (2004)
Article Google Scholar
Kim, H.-K.: Efficient Automatic Text Location Method and Content-Based Indexing and Structuring of Video Database. Journal of Visual Communication and Image Representation 7(4), 336–344 (1996)
Article Google Scholar
Li, H., Doerman, D., Kia, O.: Automatic Text Detection and Tracking in Digital Video. IEEE Transactions on Image Processing 9, 147–156 (2000)
Article Google Scholar
Lienhart, R., Wernicke, A.: Localizing and Segmenting Text in Images and Videos. IEEE Transactions on Circuits and Systems for Video Technology 12(4), 256–268 (2002)
Article Google Scholar
Malobabic, J., O’Connor, N., Murphy, N., Marlow, S.: Automatic Detection and Extraction of Artificial Text in Video. In: Adaptive information cluster, center for digital video processing, Dublin city university (2002)
Google Scholar
Nurgroho, A.S., Kuroyanagi, S., Iwata, A.: An Algorithm for Locating Characters in Color Image using Stroke Analysis Neural Network. In: Proc. of the 9th International Conference on Neural Information Processing (ICONIP 2002), November 18-22 (2002)
Google Scholar
Sato, T., Kanade, T., Hughes, E., Smith, M.: Video OCR: Indexing Digital News Libraries by Recognition of Superimposed Captions. Multimedia Systems 7, 385–394 (1999)
Article Google Scholar
Shim, J.C., Dorai, C., Bolle, R.: Automatic Text Extraction from Video for Content-Based Annotation and Retrieval. In: Proc. of the 14th International Conference on Pattern Recognition, Brisbane, Australia, August 1998, vol. 1, pp. 618–620 (1998)
Google Scholar
Wong, E.K., Chen, M.: A New Robust Algorithm for Video Extraction. Pattern Recognition 36(6), 1397–1406 (2003)
Article MATH Google Scholar
Zhang, D., Tseng, B.L., Lin, C.Y., Chang, S.F.: Accurate Overlay Text Extraction For Digital Video Analysis. Columbia University Advent Group Technical Report (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information Technology, Indian Institute of Technology, Kharagpur, West Bengal, 721302, India
Chinmaya Misra & Shamik Sural

Authors

Chinmaya Misra
View author publications
You can also search for this author in PubMed Google Scholar
Shamik Sural
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Center for Visual Information Technology, International Institute of Information Technology, Hyderabad, India
P. J. Narayanan
Department of Computer Science, Columbia University, 500 West 120th Street, NY 10027, New York, USA
Shree K. Nayar
Microsoft Research Asia, P.O. Box, Beijing, P.R. China
Heung-Yeung Shum

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Misra, C., Sural, S. (2006). Content Based Image and Video Retrieval Using Embedded Text. In: Narayanan, P.J., Nayar, S.K., Shum, HY. (eds) Computer Vision – ACCV 2006. ACCV 2006. Lecture Notes in Computer Science, vol 3852. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11612704_12

Download citation

DOI: https://doi.org/10.1007/11612704_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-31244-4
Online ISBN: 978-3-540-32432-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics