ABSTRACT
Digital capture (scanning in all its forms, and digital photography and video recording) provides virtually free temporary storage of captured information. This allows users to "over-gather" information during capture and discard unwanted material later. For cameras and video recorders, such editing largely consists of discarding images or frames in their entirety. For scanners (and high-resolution cameras and video recorders), such editing benefits from a preview capability that provides quick and reliable user-interface (UI) tools for selecting, filtering and saving specific portions of the input. Appropriate preview UI tools make it easier to access and edit captured information (text, tables, drawings, photos, etc.) and to dispatch it to the desired destination (archive, application, webpage, etc.). In this paper, we present several means for the user-directed "rapid capture" of portions of a scanned image. Specifically, we review past, present and future preview-based UI tools that give the user an efficient and accurate means of capture. These tools rest on two bases, as described herein: user-directed zoning analysis, known as "click and select", which incorporates a bottom-up zoning analysis engine; and statistics-based region classification, which allows rapid reconfiguration of region identification and clustering. We conclude with our view of the future of UI-directed capture.
Index Terms
- User-directed analysis of scanned images