Abstract
Reading devices for the visually impaired have been available for many years, but they are often expensive and difficult to use. The image processing that the reading task requires is composed of several sub-tasks, including image capture, binarization, pyramidal representation, region segmentation, region grouping, separation of text sentences from images, and word recognition. In this paper we address some of these sub-tasks in an effort to prototype a machine (Tyflos-reader) that will read a document for a person with a visual impairment and respond to voice commands for control. We describe the methodology used and present illustrative results.
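Two of the sub-tasks named above, binarization and pyramidal representation, can be illustrated with a minimal sketch. This is not the Tyflos-reader implementation: the fixed global threshold, the 2x2 block averaging, and the names `binarize`, `downsample`, and `build_pyramid` are all assumptions chosen for illustration, and images are taken to be plain lists of grayscale rows with values from 0 to 255.

```python
from typing import List

Image = List[List[int]]  # grayscale rows, pixel values 0..255 (assumed format)


def binarize(image: Image, threshold: int = 128) -> Image:
    """Mark each pixel 1 (ink) if darker than the threshold, else 0 (background).

    A fixed global threshold is an assumption; the paper does not specify one.
    """
    return [[1 if px < threshold else 0 for px in row] for row in image]


def downsample(image: Image) -> Image:
    """Halve width and height by averaging each 2x2 block (odd edges are dropped)."""
    half_h, half_w = len(image) // 2, len(image[0]) // 2
    return [
        [
            (
                image[2 * r][2 * c]
                + image[2 * r][2 * c + 1]
                + image[2 * r + 1][2 * c]
                + image[2 * r + 1][2 * c + 1]
            )
            // 4
            for c in range(half_w)
        ]
        for r in range(half_h)
    ]


def build_pyramid(image: Image, levels: int) -> List[Image]:
    """Return up to `levels` images, each half the resolution of the previous one."""
    pyramid = [image]
    for _ in range(levels - 1):
        top = pyramid[-1]
        if len(top) < 2 or len(top[0]) < 2:
            break  # cannot halve a single row or column any further
        pyramid.append(downsample(top))
    return pyramid
```

In a sketch like this, coarse pyramid levels could be used to locate candidate regions cheaply before finer levels are examined, with `binarize` applied at whichever level is being inspected.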
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Keefer, R., Dakapoulos, D., Esposito, A., Bourbakis, N. (2009). An Interaction Based Approach to Document Segmentation for the Visually Impaired. In: Stephanidis, C. (eds) Universal Access in Human-Computer Interaction. Applications and Services. UAHCI 2009. Lecture Notes in Computer Science, vol 5616. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02713-0_57
DOI: https://doi.org/10.1007/978-3-642-02713-0_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02712-3
Online ISBN: 978-3-642-02713-0
eBook Packages: Computer Science, Computer Science (R0)