DOI: 10.1145/2037373.2037419
research-article

Script-agnostic reflow of text in document images

Published: 30 August 2011

ABSTRACT

Reading text in document images can be difficult on mobile devices because of their limited screen width. Solutions exist for reflowing Latin-script text on such devices, but they do not work well for images of other scripts or combinations of scripts, since they rely on script-specific characteristics or OCR. We present a technique that reflows text in document images in a manner that is agnostic to the script in which the text is written. Our technique achieved over 95% segmentation accuracy on a corpus of 139 images containing text in four genetically distant languages: English, Hindi, Kannada, and Arabic. A preliminary user study with a prototype implementation of the technique provided evidence of some of its usability benefits.
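
The abstract does not describe the algorithm itself, so the following is only a generic sketch of what a reflow step could look like once word-like segments have been cut out of the page image by some script-independent segmentation. It assumes each segment is known only by its bounding-box size and places the segments greedily on lines no wider than the screen. The names `Box`, `reflow`, `gap`, `leading`, and `rtl` are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class Box:
    """Bounding-box size (in pixels) of one segmented word image."""
    width: int
    height: int


def reflow(boxes, screen_width, gap=4, leading=2, rtl=False):
    """Greedily place boxes on lines no wider than screen_width.

    Returns one (x, y) top-left position per box, in reading order.
    With rtl=True, each line is filled from the right edge, as a
    right-to-left script such as Arabic would require.
    This is an illustrative assumption, not the paper's method.
    """
    positions = []
    x = 0            # distance already used on the current line
    y = 0            # top of the current line
    line_height = 0  # tallest box seen on the current line
    for box in boxes:
        # Wrap when the box does not fit, unless the line is still empty.
        if x > 0 and x + box.width > screen_width:
            y += line_height + leading
            x = 0
            line_height = 0
        left = screen_width - x - box.width if rtl else x
        positions.append((left, y))
        x += box.width + gap
        line_height = max(line_height, box.height)
    return positions


words = [Box(40, 10), Box(50, 12), Box(30, 10)]
layout_ltr = reflow(words, screen_width=100)
layout_rtl = reflow(words, screen_width=100, rtl=True)
```

Because the sketch works purely on box geometry, it needs no knowledge of the script; the only script-dependent input is the reading direction.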

Published in

MobileHCI '11: Proceedings of the 13th International Conference on Human Computer Interaction with Mobile Devices and Services
August 2011
781 pages
ISBN: 9781450305419
DOI: 10.1145/2037373

Copyright © 2011 ACM

Publisher

Association for Computing Machinery, New York, NY, United States


Acceptance Rates

Overall acceptance rate: 202 of 906 submissions, 22%
