skip to main content
10.1145/958220.958258acmconferencesArticle/Chapter ViewAbstractPublication PagesdocengConference Proceedingsconference-collections
Article

User-directed analysis of scanned images

Published:20 November 2003Publication History

ABSTRACT

Digital capture (scanning in all its forms, and digital photography/video recording), in providing virtually free temporary memory of captured information, allows users to "over-gather" information during capture, and then to discard unwanted material later. For cameras and video recorders, such editing largely consists of discarding images or frames in their entirety. For scanners (and high-resolution camera/video), such editing benefits from a preview capability that provides quick and reliable user-interface tools for selecting, filtering and saving specific portions of the input. Appropriate preview user interface (UI) tools ease the accessing, editing and dispatch to desired destination (archive, application, webpage, etc.) of captured information (text, tables, drawings, photos, etc.). In this paper, we present several different means for the user-directed "rapid capture" of portions of a scanned image. Specifically, we review past, present and future preview-based UI tools that allow efficient and accurate means of capture to the user. The bases of these tools, as described herein, are user-directed zoning analysis, known as "click and select", which incorporates a bottom-up zoning analysis engine; and statistics-based region classification, which allows rapid reconfiguration of region identification and clustering. We conclude with our view of the future of UI-directed capture.

References

  1. SVG home page, http://www.w3.org/TR/SVG/.Google ScholarGoogle Scholar
  2. Simske, S. "The Use of XML and XML-Data to Provide Document Understanding at the Physical, Logical and Presentational Levels," In Proc. of the ICDAR99 Workshop on Document Layout Interpretation and its Applications, Sept. 1999.Google ScholarGoogle Scholar
  3. Cheyenne Mountain Zoo, http://www.cmzoo.org/.Google ScholarGoogle Scholar
  4. Revankar, S. V. and Fan, Z. "Image segmentation system", U.S. Patent 5,767,978, January 21, 1997.Google ScholarGoogle Scholar
  5. Wahl, F. M., Wong, K. Y. and Casey, R. G. "Block segmentation and text extraction in mixed/image documents," Computer Vision Graphics and Image Processing, Vol. 2, pp.375--390, 1982.Google ScholarGoogle ScholarCross RefCross Ref
  6. Shi, J. and Malik, J. "Normalized cuts and image segmentation," IEEE Trans Pattern Analysis Machine Intelligence, Vol 22, no. 8, pp. 888--905, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Zramdini, A. and Ingold, R. "Optical font recognition from projection profiles," Electronic Publishing, Vol 6, no. 3, pp. 249--260, Sept. 1993.Google ScholarGoogle Scholar
  8. Lee, J. P., Lopez, P. D., and Simske, S. J. "Click and select user interface for document scanning," U.S. Patent no. 6,151,426, Nov. 21, 2000.Google ScholarGoogle Scholar
  9. Kittler, J. and Illingworth, J. "Minimum error thresholding," Pattern Recognition, Vol 19, no. 1, pp. 41--47, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Otsu, N. "A threshold selection method from gray level histograms," Pattern Recognition, Vol 9, no. 1, pp. 62--66, 1979Google ScholarGoogle Scholar
  11. Kurita, T., Otsu, N. and Abdelmalek, N. "Maximum likelihood thresholding based on population mixture models," Pattern Recognition, Vol 25, no. 10, pp. 1231--1240, 1992.Google ScholarGoogle ScholarCross RefCross Ref
  12. Kittler, J., Illingworth, J. and Fõglein, J. "Threshold selection based on a simple image statistic," Comp. Vision Graph. Image Proc., Vol 30, pp. 125--147, 1985.Google ScholarGoogle ScholarCross RefCross Ref
  13. Wilkinson, M.H.F. "Optimizing edge detectors for robust automatic threshold selection: coping with edge curvature and noise," Graph. Models Image Proc., Vol 60, pp. 385--401, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Simske, S. J. and Lee, J. P. "System and method for manipulating regions in a scanned image," U.S. Patent 6,263,122, Jul. 17, 2001.Google ScholarGoogle Scholar
  15. IBM Healthcare web page, http://www-1.ibm.com/ industries/healthcare/.Google ScholarGoogle Scholar
  16. Simske, S. J. and Lesser, R. R. "User interface high-lighter function to provide directed input for image processing," U.S. Patent 6,385, 351, May 7, 2002.Google ScholarGoogle Scholar
  17. MIT Press Classics web page, http://mitpress.mit.edu/ main/feature/classics/MITPClassics_release.pdf.Google ScholarGoogle Scholar
  18. Sturgill, M. and Simske, S. J. "A proofing, templating and purposing engine in Java and C#/.NET," Hewlett-Packard Technical Report HPL-2002--272, 25 pp., 2002.Google ScholarGoogle Scholar

Index Terms

  1. User-directed analysis of scanned images

                                                      Recommendations

                                                      Comments

                                                      Login options

                                                      Check if you have access through your login credentials or your institution to get full access on this article.

                                                      Sign in
                                                      • Published in

                                                        cover image ACM Conferences
                                                        DocEng '03: Proceedings of the 2003 ACM symposium on Document engineering
                                                        November 2003
                                                        260 pages
                                                        ISBN:1581137249
                                                        DOI:10.1145/958220

                                                        Copyright © 2003 ACM

                                                        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                                                        Publisher

                                                        Association for Computing Machinery

                                                        New York, NY, United States

                                                        Publication History

                                                        • Published: 20 November 2003

                                                        Permissions

                                                        Request permissions about this article.

                                                        Request Permissions

                                                        Check for updates

                                                        Qualifiers

                                                        • Article

                                                        Acceptance Rates

                                                        Overall Acceptance Rate178of537submissions,33%

                                                      PDF Format

                                                      View or Download as a PDF file.

                                                      PDF

                                                      eReader

                                                      View online with eReader.

                                                      eReader