An Interaction Based Approach to Document Segmentation for the Visually Impaired

Keefer, Robert; Dakapoulos, Dimitris; Esposito, Anna; Bourbakis, Nikoloaos

doi:10.1007/978-3-642-02713-0_57

Robert Keefer¹⁷,
Dimitris Dakapoulos¹⁷,
Anna Esposito¹⁷ &
…
Nikoloaos Bourbakis¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5616))

Included in the following conference series:

International Conference on Universal Access in Human-Computer Interaction

2243 Accesses
3 Citations

Abstract

While reading devices for the visually impaired have been available for many years, they are often expensive and difficult to use. The image processing required to enable the reading task is a composition of several important sub-tasks, such as image capture, binarization, pyramidal representation, region segmentation, regions grouping, separation of text sentences from images, words recognition, etc. In this paper we deal with some of these sub-tasks in an effort to prototype a machine (Tyflos-reader) that will read a document for a person with a visual impairment and respond to voice commands for control. The methodology used and illustrative results are provided in this paper.

Download to read the full chapter text

Chapter PDF

Geometric Layout Analysis in a Wearable Reading Device for the Blind and Visually Impaired

Scene Text Detection and Tracking for a Camera-Equipped Wearable Reading Assistant for the Blind

Text Segmentation for Document Recognition

Keywords

References

O’Gorman, L., Kasturi, R.: Document Image Analysis. IEEE Computer Society Press, Los Alamitos (1995)
Google Scholar
Gudivada, V., Raghavan, V.: Content based image retrieval systems. In: IEEE Computer (1995)
Google Scholar
Nardelli, L., Orlandi, M., Falavigna, D.: A Multi-Modal Architecture for Cellular Phones. In: Proceedings of the 6th International Conference on Multimodal Interfaces, pp. 323–324 (2004)
Google Scholar
Krishna, R., Mahlke, S., Austin, T.: Architectural Optimizations for Low-Power, Real-Time Speech Recognition. In: Proceedings of the 2003 International Conference on Compilers, Architecture and Synthesis for Embedded Systems, pp. 220–231 (2003)
Google Scholar
Franco, H., et al.: DynaSpeak: SRI’s Scalable Speech Recognizer for Embedded and Mobile Systems. In: Proceedings of the Second International Conference on Human Language Technology Research, pp. 25–30 (2002)
Google Scholar
Dakopoulos, D., Bourbakis, N.: A 2D Vibration Array as an Assistive Device for Visually Impaired. In: IEEE Int. Conf. on BIBE 2007, pp. 930–937 (2007)
Google Scholar
Bourbakis, N., Klinger, A.: Hierarchical Picture Coding. PR Journal on Pattern Recognition (1989)
Google Scholar
Bradski, G., Kaehler, A.: Learning OpenCV: Computer Vision with the OpenCV Library. O’Reilly Media, Inc., Sebastopol (2008)
Google Scholar
Lu, Y., Tan, C.: A Nearest-Neighbor Based Approach to Skew Estimation in Document Images. Pattern Recognition Letters 24, 2315–2323 (2003)
Article Google Scholar
Young, S., et al.: The HTK Book. Cambridge University Engineering Department, Cambridge (2006)
Google Scholar
Bourbakis, N.: A document processing methodology: separating text from images. IFAC, IJEAAI 14, 35–42 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Assistive Technologies Research Center, College of Engineering, Wright State University, Dayton, OH 45435, USA
Robert Keefer, Dimitris Dakapoulos, Anna Esposito & Nikoloaos Bourbakis

Authors

Robert Keefer
View author publications
You can also search for this author in PubMed Google Scholar
Dimitris Dakapoulos
View author publications
You can also search for this author in PubMed Google Scholar
Anna Esposito
View author publications
You can also search for this author in PubMed Google Scholar
Nikoloaos Bourbakis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Foundation for Research and Technology - Hellas, Institute of Computer Science, N. Plastira 100, Vassilika Vouton, 70013, Heraklion, Crete, Greece - and University of Crete, Department of Computer Science, ,, Crete, Greece
Constantine Stephanidis

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Keefer, R., Dakapoulos, D., Esposito, A., Bourbakis, N. (2009). An Interaction Based Approach to Document Segmentation for the Visually Impaired. In: Stephanidis, C. (eds) Universal Access in Human-Computer Interaction. Applications and Services. UAHCI 2009. Lecture Notes in Computer Science, vol 5616. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02713-0_57

Download citation

DOI: https://doi.org/10.1007/978-3-642-02713-0_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02712-3
Online ISBN: 978-3-642-02713-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

An Interaction Based Approach to Document Segmentation for the Visually Impaired

Abstract

Chapter PDF

Similar content being viewed by others

Geometric Layout Analysis in a Wearable Reading Device for the Blind and Visually Impaired

Scene Text Detection and Tracking for a Camera-Equipped Wearable Reading Assistant for the Blind

Text Segmentation for Document Recognition

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

An Interaction Based Approach to Document Segmentation for the Visually Impaired

Abstract

Chapter PDF

Similar content being viewed by others

Geometric Layout Analysis in a Wearable Reading Device for the Blind and Visually Impaired

Scene Text Detection and Tracking for a Camera-Equipped Wearable Reading Assistant for the Blind

Text Segmentation for Document Recognition

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation