Skip to main content

Geometric Layout Analysis in a Wearable Reading Device for the Blind and Visually Impaired

  • Conference paper
Mobile Computing, Applications, and Services (MobiCASE 2013)

Abstract

Blind and visually impaired people can use a mobile device for accessing printed information, which is ubiquitous in everyday life. Thus, there is a need for a mobile easy-to-use reading device, capable of dealing with the complexity of the outdoor environment. In this paper a wearable camera based solution is presented, aiming at improving the performance of existing systems through the use of an integrated approach for the document processing. This particular publication covers the segmentation phase of the processing chain as well as geometric analysis of the layout. Using a highly efficient approach we were able to overcome the limitations of a mobile computing environment without compromising on the robustness of the result. In order to demonstrate the advantages of the presented algorithm for the specific field of application we compare its output to the results obtained by a state-of-the art commercial solution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Guilbourd, R., Yogev, N., Rojas, R.: Stereo camera based wearable reading device. In: Proceedings of the 3rd Augmented Human International Conference, vol. 1. ACM (2012)

    Google Scholar 

  2. Laine, A., Fan, J.: Texture Classification by Wavelet Packet Signatures. IEEE Trans. Pattern Anal. Mach. Intell. 15, 1186–1191 (1993)

    Article  Google Scholar 

  3. Etemad, K., Doermann, D.S., Chellappa, R.: Multiscale Segmentation of Unstructured Document Pages Using Soft Decision Integration. IEEE Trans. Pattern Anal. Mach. Intell. 19(1), 92–96 (1997)

    Article  Google Scholar 

  4. Li, J., Gray, R.M.: Context-based multiscale classification of document images using wavelet coefficient distributions. IEEE Trans. Image Process. 9, 1604–1616 (2000)

    Article  Google Scholar 

  5. Lee, S.-W., Ryu, D.-S.: Parameter-Free Geometric Document Layout Analysis. IEEE Trans. Pattern Anal. Mach. Intell. 23(11), 1240–1256 (2001)

    Article  Google Scholar 

  6. Cheng, H., Bouman, C.A.: Multiscale bayesian segmentation using a trainable context model. IEEE Trans. Pattern Anal. Mach. Intell. 10(4), 511–525 (2001)

    MATH  Google Scholar 

  7. Gupta, P., Vohra, N., Chaudhury, S., Joshi, S.D.: Wavelet Based Page Segmentation. In: Indian Conf. on Computer Vision, Graphics and Image Processing, pp. 20–22 (2002)

    Google Scholar 

  8. Rioul, O., Vetterli, M.: Wavelets and Signal Processing. Signal Processing Magazine 8(4), 14–38 (1991)

    Article  Google Scholar 

  9. Finkel, R., Bentley, J.L.: Quad Trees: A Data Structure for Retrieval on Composite Keys. Acta Informatica 4(1), 1 (1974)

    Article  MATH  Google Scholar 

  10. Block, M., Rojas, R.: Local Contrast Segmentation to Binarize Images. In: International Conference on the Digital Society, vol. 1(1) (2009)

    Google Scholar 

  11. OmniPage Capture SDK 16, Nuance Communications, Inc.

    Google Scholar 

  12. Choi, H., Baraniuk, R.G.: Multiscale image segmentation using wavelet-domain hidden Markov models. IEEE Trans. Image Process. 1309–1321 (2001)

    Google Scholar 

  13. Najman, L., Schmitt, M.: Geodesic saliency of watershed contours and hierarchical segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 18(12), 1163–1173 (1996)

    Article  Google Scholar 

  14. Sauvola, J., Kauniskangas, H.: MediaTeam Document Database II. CD-ROM collection of document images, University of Oulu, Finland, http://www.mediateam.oulu.fi/MTDB/index.htm

  15. Donoho, D.L., Johnstone, I.M.: Ideal Spatial adaptation via wavelet shrinkage. Biometrika 81, 425–455 (1994)

    Article  MathSciNet  MATH  Google Scholar 

  16. Suzuki, S., Abe, K.: Topological structural analysis of digitized binary images by border following. Computer Vision, Graphics, and Image Processing 30(1), 32–46 (1985)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 ICST Institute for Computer Science, Social Informatics and Telecommunications Engineering

About this paper

Cite this paper

Guilbourd, R., Rojas, R. (2014). Geometric Layout Analysis in a Wearable Reading Device for the Blind and Visually Impaired. In: Memmi, G., Blanke, U. (eds) Mobile Computing, Applications, and Services. MobiCASE 2013. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 130. Springer, Cham. https://doi.org/10.1007/978-3-319-05452-0_7

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05452-0_7

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05451-3

  • Online ISBN: 978-3-319-05452-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics