ABSTRACT
Reading text from document images can be difficult on mobile devices due to the limited screen width available on them. While there exist solutions for reflowing Latin-script texts on such devices, these solutions do not work well for images of other scripts or combinations of scripts, since they rely on script-specific characteristics or OCR. We present a technique that reflows text in document images in a manner that is agnostic to the script used to compose them. Our technique achieved over 95% segmentation accuracy for a corpus of 139 images containing text in 4 genetically-distant languages-English, Hindi, Kannada and Arabic. A preliminary user study with a prototype implementation of the technique provided evidence of some of its usability benefits.
- Breuel, T. Reflowable document images for the Web. In Proc. WDA 2003, the 2nd International Workshop on Web Document Analysis, (2003).Google Scholar
- Dasigi, P., Jain, R., and Jawahar, C. V. Document Image Segmentation as a Spectral Partitioning Problem. In Proc. ICVGIP '08, 6th Indian Conference on Computer Vision, Graphics and Image Processing, (2008). Google ScholarDigital Library
- Digital Library of India. http://www.new.dli.ernet.in.Google Scholar
- Du, X., Pan, W., and Bui, T. Text line segmentation in handwritten documents using Mumford-Shah model. ICFHR '08, (2008), 253--258.Google Scholar
- Ittner, D. J. and Baird, H. Language-Free Layout Analysis. In Proc. ICDAR '93, (1993), 336--340.Google ScholarCross Ref
- Lee, Y., Pepineni, K., Roukos, S., Emam, O., and Hassan, H. Language Model Based Arabic Word Segmentation. ACL '03, (2003), 399--406. Google ScholarDigital Library
- Muter, P. Interface Design and Optimization of Reading of Continuous Text. Cognitive Aspects of Electronic Text Processing, H. van Oostendorp and S. de Mul (Eds.) (1996).Google Scholar
- Nagy, G., Seth, S., and Viswanathan, M. A Prototype Document Image Analysis System for Technical Journals. Computer 25, 1992, 10--22. Google ScholarDigital Library
- Öquist, G. and Lundin, K. Eye Movement Study of Reading Text on a Mobile Phone using Paging, Scrolling, Leading and RSVP. In Proc. MUM '07, 6th International Conference on Mobile and Ubiquitous Multimedia, (2007). Google ScholarDigital Library
- Repligo Reader. http://www.cerience.com/products/reader.Google Scholar
- The Million Book Project. http://www.ulib.org.Google Scholar
- The Visualiser Forum. http://www.visualiserforum.org/.Google Scholar
Index Terms
- Script-agnostic reflow of text in document images
Recommendations
Local features-based script recognition from printed bilingual document images
Classification and identification of language in a biscript document is one of the important steps in the design of an OCR system for successful analysis and recognition. This paper presents architecture for script recognition of bilingual document ...
Touching character segmentation of Devanagari script
ICCCNT '16: Proceedings of the 7th International Conference on Computing Communication and Networking TechnologiesSegmentation of characters is one of the major step in OCR system. Devanagari script is a two dimensional form of symbol. It is very inconvenient to treat each form of character as a separate symbol because such combinations are very large in number. ...
Indic script identification from handwritten document images
Script identification plays an important role in document image processing especially for multilingual environment. This paper hires two conventional textural methods for recognition of the scripts of the handwritten documents inscribed in different Indic ...
Comments