Scene Text Detection and Tracking for a Camera-Equipped Wearable Reading Assistant for the Blind

Pégeot, Faustin; Goto, Hideaki

doi:10.1007/978-3-642-37484-5_37

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7729)

Included in the following conference series: Asian Conference on Computer Vision

Abstract

Visually impaired people face daily difficulty because they cannot read textual information. One of the most anticipated assistive devices for blind users is a wearable camera system that finds textual information in natural scenes and converts it to sound through a speech synthesizer. To avoid duplicate readings, the device should recognize text areas with the same content and group them to obtain a single result. Scene text detection and tracking methods attract considerable interest for these purposes. However, the field remains challenging, and such methods are yet to be perfected. This paper proposes a scene text tracking system that finds text regions and tracks them across video frames captured by a wearable camera. By combining a text detection method with a feature point tracker, we obtain a robust text tracker that produces far fewer false-positive text images and runs 2.9 times faster than the conventional method.
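
The abstract does not give the algorithm itself, so the following is only a minimal sketch of how a text detector and a feature point tracker might be combined to avoid duplicate readings. It is not the authors' method. It assumes that some detector supplies text boxes as (x, y, width, height) tuples and that some feature point tracker supplies matching point lists for the previous and current frame. The names shift_box, iou, update_tracks and the min_iou threshold are hypothetical and chosen for illustration.

```python
from statistics import median


def shift_box(box, prev_pts, next_pts):
    """Move a box by the median displacement of the feature points inside it."""
    x, y, w, h = box
    dx, dy = [], []
    for (px, py), (qx, qy) in zip(prev_pts, next_pts):
        if x <= px <= x + w and y <= py <= y + h:
            dx.append(qx - px)
            dy.append(qy - py)
    if not dx:
        return None  # no tracked point inside the box: treat the track as lost
    return (x + median(dx), y + median(dy), w, h)


def iou(a, b):
    """Intersection over union of two (x, y, w, h) boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    iw = min(ax + aw, bx + bw) - max(ax, bx)
    ih = min(ay + ah, by + bh) - max(ay, by)
    if iw <= 0 or ih <= 0:
        return 0.0
    inter = iw * ih
    return inter / (aw * ah + bw * bh - inter)


def update_tracks(tracks, detections, prev_pts, next_pts, min_iou=0.5):
    """Carry track ids from the previous frame over to the current detections.

    tracks maps a track id to its box in the previous frame. A detection that
    overlaps a motion-predicted box keeps that id, so the same text region is
    not reported twice. Any other detection gets a new id (assumed threshold).
    """
    predicted = {}
    for tid, box in tracks.items():
        moved = shift_box(box, prev_pts, next_pts)
        if moved is not None:
            predicted[tid] = moved

    updated = {}
    # Simplification: new ids start above the largest id currently tracked.
    next_id = max(tracks, default=-1) + 1
    for det in detections:
        best_id, best_score = None, min_iou
        for tid, box in predicted.items():
            if tid in updated:
                continue  # this track already claimed a detection
            score = iou(box, det)
            if score >= best_score:
                best_id, best_score = tid, score
        if best_id is None:
            best_id = next_id
            next_id += 1
        updated[best_id] = det

    # Tracks that were predicted but not re-detected keep their predicted box.
    for tid, box in predicted.items():
        updated.setdefault(tid, box)
    return updated


if __name__ == "__main__":
    tracks = {0: (10, 10, 20, 10)}
    prev_pts = [(12, 12), (20, 15), (100, 100)]
    next_pts = [(15, 12), (23, 15), (100, 100)]
    detections = [(14, 10, 20, 10), (60, 60, 30, 12)]
    print(update_tracks(tracks, detections, prev_pts, next_pts))
```

In this sketch the first detection overlaps the motion-predicted box and keeps track id 0, while the second is far from any known region and starts a new track. Only newly created tracks would then be passed on to recognition and speech output.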

Author information

Authors and Affiliations

Graduate School of Information Sciences, Tohoku University, Sendai, Japan
Faustin Pégeot
Cyberscience Center, Tohoku University, Sendai, Japan
Hideaki Goto

Editor information

Editors and Affiliations

Computer Science and Engineering, Hanyang University, 222 Wangshimni-ro, Seongdong-gu, 133-791, Seoul, South Korea
Jong-Il Park
Department of Electrical Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, 305-701, Daejeon, South Korea
Junmo Kim

Cite this paper

Pégeot, F., Goto, H. (2013). Scene Text Detection and Tracking for a Camera-Equipped Wearable Reading Assistant for the Blind. In: Park, JI., Kim, J. (eds) Computer Vision - ACCV 2012 Workshops. ACCV 2012. Lecture Notes in Computer Science, vol 7729. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37484-5_37

DOI: https://doi.org/10.1007/978-3-642-37484-5_37
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37483-8
Online ISBN: 978-3-642-37484-5
eBook Packages: Computer Science, Computer Science (R0)
